WorldWideScience

Sample records for microbial genomes based

  1. Single gene-based distinction of individual microbial genomes from a mixed population of microbial cells

    Directory of Open Access Journals (Sweden)

    Manu Valtteri Tamminen

    2015-03-01

    Full Text Available Recent progress in environmental microbiology has revealed vast populations of microbes in any given habitat that cannot be detected by conventional culturing strategies. The use of sensitive genetic detection methods such as CARD-FISH and in situ PCR have been limited by the cell wall permeabilization requirement that cannot be performed similarly on all cell types without lysing some and leaving some unpermeabilized. Furthermore, the detection of low copy targets such as genes present in single copies in the microbial genomes, has remained problematic. We describe an emulsion-based procedure to trap individual microbial cells into picoliter-volume polyacrylamide droplets that provide a rigid support for genetic material and therefore allow complete degradation of cellular material to expose the individual genomes. The polyacrylamide droplets are subsequently converted into picoliter-scale reactors for genome amplification. The amplified genomes are labelled based on the presence of a target gene and differentiated from those that do not contain the gene by flow cytometry. Using the Escherichia coli strains XL1 and MC1061, which differ with respect to the presence (XL1 or absence (MC1061 of a single copy of a tetracycline resistance gene per genome, we demonstrate that XL1 genomes present at 0.1% of MC1061 genomes can be differentiated using this method. Using a spiked sediment microbial sample, we demonstrate that the method is applicable to highly complex environmental microbial communities as a target gene-based screen for individual microbes. The method provides a novel tool for enumerating functional cell populations in complex microbial communities. We envision that the method could be optimized for fluorescence-activated cell sorting to enrich genetic material of interest from complex environmental samples.

  2. Microbial genomic taxonomy.

    Science.gov (United States)

    Thompson, Cristiane C; Chimetto, Luciane; Edwards, Robert A; Swings, Jean; Stackebrandt, Erko; Thompson, Fabiano L

    2013-12-23

    A need for a genomic species definition is emerging from several independent studies worldwide. In this commentary paper, we discuss recent studies on the genomic taxonomy of diverse microbial groups and a unified species definition based on genomics. Accordingly, strains from the same microbial species share >95% Average Amino Acid Identity (AAI) and Average Nucleotide Identity (ANI), >95% identity based on multiple alignment genes, genomic signature, and > 70% in silico Genome-to-Genome Hybridization similarity (GGDH). Species of the same genus will form monophyletic groups on the basis of 16S rRNA gene sequences, Multilocus Sequence Analysis (MLSA) and supertree analysis. In addition to the established requirements for species descriptions, we propose that new taxa descriptions should also include at least a draft genome sequence of the type strain in order to obtain a clear outlook on the genomic landscape of the novel microbe. The application of the new genomic species definition put forward here will allow researchers to use genome sequences to define simultaneously coherent phenotypic and genomic groups.

  3. Visualization for genomics: the Microbial Genome Viewer.

    NARCIS (Netherlands)

    Kerkhoven, R.; Enckevort, F.H.J. van; Boekhorst, J.; Molenaar, D.; Siezen, R.J.

    2004-01-01

    SUMMARY: A Web-based visualization tool, the Microbial Genome Viewer, is presented that allows the user to combine complex genomic data in a highly interactive way. This Web tool enables the interactive generation of chromosome wheels and linear genome maps from genome annotation data stored in a My

  4. A Web-Based Comparative Genomics Tutorial for Investigating Microbial Genomes

    Directory of Open Access Journals (Sweden)

    Michael Strong

    2009-12-01

    Full Text Available As the number of completely sequenced microbial genomes continues to rise at an impressive rate, it is important to prepare students with the skills necessary to investigate microorganisms at the genomic level. As a part of the core curriculum for first-year graduate students in the biological sciences, we have implemented a web-based tutorial to introduce students to the fields of comparative and functional genomics. The tutorial focuses on recent computational methods for identifying functionally linked genes and proteins on a genome-wide scale and was used to introduce students to the Rosetta Stone, Phylogenetic Profile, conserved Gene Neighbor, and Operon computational methods. Students learned to use a number of publicly available web servers and databases to identify functionally linked genes in the Escherichia coli genome, with emphasis on genome organization and operon structure. The overall effectiveness of the tutorial was assessed based on student evaluations and homework assignments. The tutorial is available to other educators at http://www.doe-mbi.ucla.edu/~strong/m253.php.

  5. Genome-based microbial ecology of anammox granules in a full-scale wastewater treatment system.

    Science.gov (United States)

    Speth, Daan R; In 't Zandt, Michiel H; Guerrero-Cruz, Simon; Dutilh, Bas E; Jetten, Mike S M

    2016-03-31

    Partial-nitritation anammox (PNA) is a novel wastewater treatment procedure for energy-efficient ammonium removal. Here we use genome-resolved metagenomics to build a genome-based ecological model of the microbial community in a full-scale PNA reactor. Sludge from the bioreactor examined here is used to seed reactors in wastewater treatment plants around the world; however, the role of most of its microbial community in ammonium removal remains unknown. Our analysis yielded 23 near-complete draft genomes that together represent the majority of the microbial community. We assign these genomes to distinct anaerobic and aerobic microbial communities. In the aerobic community, nitrifying organisms and heterotrophs predominate. In the anaerobic community, widespread potential for partial denitrification suggests a nitrite loop increases treatment efficiency. Of our genomes, 19 have no previously cultivated or sequenced close relatives and six belong to bacterial phyla without any cultivated members, including the most complete Omnitrophica (formerly OP3) genome to date.

  6. Genome-based microbial ecology of anammox granules in a full-scale wastewater treatment system

    NARCIS (Netherlands)

    Speth, D.; Zandt, M.H. In 't; Guerrero-Cruz, S.; Dutilh, B.E.; Jetten, M.S.M

    2016-01-01

    Partial-nitritation anammox (PNA) is a novel wastewater treatment procedure for energy-efficient ammonium removal. Here we use genome-resolved metagenomics to build a genome-based ecological model of the microbial community in a full-scale PNA reactor. Sludge from the bioreactor examined here is use

  7. Genome-based microbial ecology of anammox granules in a full-scale wastewater treatment system

    NARCIS (Netherlands)

    Speth, D.; Zandt, M.H. In 't; Guerrero-Cruz, S.; Dutilh, B.E.; Jetten, M.S.M

    2016-01-01

    Partial-nitritation anammox (PNA) is a novel wastewater treatment procedure for energy-efficient ammonium removal. Here we use genome-resolved metagenomics to build a genome-based ecological model of the microbial community in a full-scale PNA reactor. Sludge from the bioreactor examined here is

  8. Genomics and marine microbial ecology.

    Science.gov (United States)

    Pedrós-Alió, Carlos

    2006-09-01

    Genomics has brought about a revolution in all fields of biology. Before the development of microbial ecology in the 1970s, microbes were not even considered in marine ecological studies. Today we know that half of the total primary production of the planet must be credited to microorganisms. This and other discoveries have changed dramatically the perspective and the focus of marine microbial ecology. The application of genomics-based approaches has provided new challenges and has allowed the discovery of novel functions, an appreciation of the great diversity of microorganisms, and the introduction of controversial ideas regarding the concepts of species, genome, and niche. Nevertheless, thorough knowledge of the traditional disciplines of biology is necessary to explore the possibilities arising from these new insights. This work reviews the different genomic techniques that can be applied to marine microbial ecology, including both sequencing of the complete genomes of microorganisms and metagenomics, which, in turn, can be complemented with the study of mRNAs (transcriptomics) and proteins (proteomics). The example of proteorhodopsin illustrates the type of information that can be gained from these approaches. A genomics perspective constitutes a map that will allow microbiologists to focus their research on potentially more productive aspects.

  9. Microbial metagenomics: beyond the genome.

    Science.gov (United States)

    Gilbert, Jack A; Dupont, Christopher L

    2011-01-01

    Metagenomics literally means "beyond the genome." Marine microbial metagenomic databases presently comprise approximately 400 billion base pairs of DNA, only approximately 3% of that found in 1 ml of seawater. Very soon a trillion-base-pair sequence run will be feasible, so it is time to reflect on what we have learned from metagenomics. We review the impact of metagenomics on our understanding of marine microbial communities. We consider the studies facilitated by data generated through the Global Ocean Sampling expedition, as well as the revolution wrought at the individual laboratory level through next generation sequencing technologies. We review recent studies and discoveries since 2008, provide a discussion of bioinformatic analyses, including conceptual pipelines and sequence annotation and predict the future of metagenomics, with suggestions of collaborative community studies tailored toward answering some of the fundamental questions in marine microbial ecology.

  10. Microbial Metagenomics: Beyond the Genome

    Science.gov (United States)

    Gilbert, Jack A.; Dupont, Christopher L.

    2011-01-01

    Metagenomics literally means “beyond the genome.” Marine microbial metagenomic databases presently comprise ˜400 billion base pairs of DNA, only ˜3% of that found in 1 ml of seawater. Very soon a trillion-base-pair sequence run will be feasible, so it is time to reflect on what we have learned from metagenomics. We review the impact of metagenomics on our understanding of marine microbial communities. We consider the studies facilitated by data generated through the Global Ocean Sampling expedition, as well as the revolution wrought at the individual laboratory level through next generation sequencing technologies. We review recent studies and discoveries since 2008, provide a discussion of bioinformatic analyses, including conceptual pipelines and sequence annotation and predict the future of metagenomics, with suggestions of collaborative community studies tailored toward answering some of the fundamental questions in marine microbial ecology.

  11. GENOME-BASED MODELING AND DESIGN OF METABOLIC INTERACTIONS IN MICROBIAL COMMUNITIES

    Directory of Open Access Journals (Sweden)

    Radhakrishnan Mahadevan

    2012-10-01

    With the advent of genome sequencing, omics technologies, bioinformatics and genome-scale modeling, researchers now have unprecedented capabilities to analyze and engineer the metabolism of microbial communities. The goal of this review is to summarize recent applications of genome-scale metabolic modeling to microbial communities. A brief introduction to lumped community models is used to motivate the need for genome-level descriptions of individual species and their metabolic interactions. The review of genome-scale models begins with static modeling approaches, which are appropriate for communities where the extracellular environment can be assumed to be time invariant or slowly varying. Dynamic extensions of the static modeling approach are described, and then applications of genome-scale models for design of synthetic microbial communities are reviewed. The review concludes with a summary of metagenomic tools for analyzing community metabolism and an outlook for future research.

  12. Integrative computational approach for genome-based study of microbial lipid-degrading enzymes.

    Science.gov (United States)

    Vorapreeda, Tayvich; Thammarongtham, Chinae; Laoteng, Kobkul

    2016-07-01

    Lipid-degrading or lipolytic enzymes have gained enormous attention in academic and industrial sectors. Several efforts are underway to discover new lipase enzymes from a variety of microorganisms with particular catalytic properties to be used for extensive applications. In addition, various tools and strategies have been implemented to unravel the functional relevance of the versatile lipid-degrading enzymes for special purposes. This review highlights the study of microbial lipid-degrading enzymes through an integrative computational approach. The identification of putative lipase genes from microbial genomes and metagenomic libraries using homology-based mining is discussed, with an emphasis on sequence analysis of conserved motifs and enzyme topology. Molecular modelling of three-dimensional structure on the basis of sequence similarity is shown to be a potential approach for exploring the structural and functional relationships of candidate lipase enzymes. The perspectives on a discriminative framework of cutting-edge tools and technologies, including bioinformatics, computational biology, functional genomics and functional proteomics, intended to facilitate rapid progress in understanding lipolysis mechanism and to discover novel lipid-degrading enzymes of microorganisms are discussed.

  13. Microbial Genomics Research in China

    Institute of Scientific and Technical Information of China (English)

    ZHAO Guo-ping

    2004-01-01

    @@ Microorganisms, including phage/virus, were initial targets and tools for developing DNA sequencing technology. Microbial genomic study was started as a model system for the Human Genome Project (HGP) and it did successfully supported the HGP, particularly with respect to BAC contig construction and large-scale shotgun sequencing and assembly. Microbial genomics study has become the fastest developed genomics discipline along with HGP, taking the advantage of the organisms' highly diversified physiology, extremely long history of evolution, close relationship with human/environment,as well as relatively small genome sizes and simple systems for functional analysis.

  14. Microbial Genomics Research in China

    Institute of Scientific and Technical Information of China (English)

    ZHAOGuo-ping

    2004-01-01

    Microorganisms, including phage/virus, were initial targets and tools for developing DNA sequencing technology. Microbial genomic study was started as a model system for the Human Genome Project (HGP) and it did successfully supported the HGP, particularly with respect to BAC contig construction and large-scale shotgun sequencing and assembly. Microbial genomics study has become the fastest developed genomics discipline along with HGP, taking the advantage of the organisms' highly diversified physiology, extremely long history of evolution, close relationship with human/environment,as well as relatively small genome sizes and simple systems for functional analysis.

  15. Comparative Microbial Genomics and Forensics.

    Science.gov (United States)

    Massey, Steven E

    2016-08-01

    Forensic science concerns the application of scientific techniques to questions of a legal nature and may also be used to address questions of historical importance. Forensic techniques are often used in legal cases that involve crimes against persons or property, and they increasingly may involve cases of bioterrorism, crimes against nature, medical negligence, or tracing the origin of food- and crop-borne disease. Given the rapid advance of genome sequencing and comparative genomics techniques, we ask how these might be used to address cases of a forensic nature, focusing on the use of microbial genome sequence analysis. Such analyses rely on the increasingly large numbers of microbial genomes present in public databases, the ability of individual investigators to rapidly sequence whole microbial genomes, and an increasing depth of understanding of their evolution and function. Suggestions are made as to how comparative microbial genomics might be applied forensically and may represent possibilities for the future development of forensic techniques. A particular emphasis is on the nascent field of genomic epidemiology, which utilizes rapid whole-genome sequencing to identify the source and spread of infectious outbreaks. Also discussed is the application of comparative microbial genomics to the study of historical epidemics and deaths and how the approaches developed may also be applicable to more recent and actionable cases.

  16. Genome-enabled Modeling of Microbial Biogeochemistry using a Trait-based Approach. Does Increasing Metabolic Complexity Increase Predictive Capabilities?

    Science.gov (United States)

    King, E.; Karaoz, U.; Molins, S.; Bouskill, N.; Anantharaman, K.; Beller, H. R.; Banfield, J. F.; Steefel, C. I.; Brodie, E.

    2015-12-01

    The biogeochemical functioning of ecosystems is shaped in part by genomic information stored in the subsurface microbiome. Cultivation-independent approaches allow us to extract this information through reconstruction of thousands of genomes from a microbial community. Analysis of these genomes, in turn, gives an indication of the organisms present and their functional roles. However, metagenomic analyses can currently deliver thousands of different genomes that range in abundance/importance, requiring the identification and assimilation of key physiologies and metabolisms to be represented as traits for successful simulation of subsurface processes. Here we focus on incorporating -omics information into BioCrunch, a genome-informed trait-based model that represents the diversity of microbial functional processes within a reactive transport framework. This approach models the rate of nutrient uptake and the thermodynamics of coupled electron donors and acceptors for a range of microbial metabolisms including heterotrophs and chemolithotrophs. Metabolism of exogenous substrates fuels catabolic and anabolic processes, with the proportion of energy used for cellular maintenance, respiration, biomass development, and enzyme production based upon dynamic intracellular and environmental conditions. This internal resource partitioning represents a trade-off against biomass formation and results in microbial community emergence across a fitness landscape. Biocrunch was used here in simulations that included organisms and metabolic pathways derived from a dataset of ~1200 non-redundant genomes reflecting a microbial community in a floodplain aquifer. Metagenomic data was directly used to parameterize trait values related to growth and to identify trait linkages associated with respiration, fermentation, and key enzymatic functions such as plant polymer degradation. Simulations spanned a range of metabolic complexities and highlight benefits originating from simulations

  17. Sequence-based Methods in Human Microbial Ecology: A The 2nd HumanGenome Comes of Age

    Energy Technology Data Exchange (ETDEWEB)

    Weng, Li; Rubin, Edward M.; Bristow, James

    2005-06-01

    Ecologists studying microbial life in the environment have recognized the enormous complexity of microbial diversity for more than a decade (Whitman et al. 1998). The development of a variety of culture-independent methods, many of them coupled with high-throughput DNA sequencing, has allowed this diversity to be explored in ever greater detail (Handelsman 2004; Harris et al. 2004; Hugenholtz et al. 1998; Moreira and Lopez-Garcia 2002; Rappe and Giovannoni 2003). Despite the widespread application of these new techniques to the characterization of uncultivated microbes and microbial communities in the environment, their application to human health and disease has lagged behind. Because these techniques now allow not only cataloging of microbial diversity, but also insight into microbial functions, it is time for clinical microbiologists to apply these tools to the microbial communities that abound on and within us, in what has been aptly called ''the second Human Genome Project'' (Relman and Falkow 2001). In this review we will discuss the sequence-based methods for microbial analysis that are currently available and their application to identify novel human pathogens, improve diagnosis and treatment of known infectious diseases, and finally to advance understanding of our relationship with microbial communities that normally reside in and on the human body.

  18. The microbial genomics of arsenic.

    Science.gov (United States)

    Andres, Jérémy; Bertin, Philippe N

    2016-03-01

    Arsenic, which is a major contaminant of many aquatic ecosystems worldwide, is responsible for serious public health issues. However, life has evolved various strategies for coping with this toxic element. In particular, prokaryotic organisms have developed processes enabling them to resist and metabolize this chemical. Studies based on genome sequencing and transcriptome, proteome and metabolome profiling have greatly improved our knowledge of prokaryotes' metabolic potential and functioning in contaminated environments. The increasing number of genomes available and the development of descriptive and comparative approaches have made it possible not only to identify several genetic determinants of the arsenic metabolism, but also to elucidate their phylogenetic distribution and their modes of regulation. In addition, studies using functional genomic tools have established the pleiotropic character of prokaryotes' responses to arsenic, which can be either common to several species or species-specific. These approaches also provide promising means of deciphering the functioning of microbial communities including uncultured organisms, the genetic transfers involved and the possible occurrence of metabolic interactions as well as the evolution of arsenic resistance and metabolism.

  19. Microbial genomes: Blueprints for life

    Energy Technology Data Exchange (ETDEWEB)

    Relman, David A.; Strauss, Evelyn

    2000-12-31

    Complete microbial genome sequences hold the promise of profound new insights into microbial pathogenesis, evolution, diagnostics, and therapeutics. From these insights will come a new foundation for understanding the evolution of single-celled life, as well as the evolution of more complex life forms. This report is an in-depth analysis of scientific issues that provides recommendations and will be widely disseminated to the scientific community, federal agencies, industry and the public.

  20. The integrated microbial genome resource of analysis.

    Science.gov (United States)

    Checcucci, Alice; Mengoni, Alessio

    2015-01-01

    Integrated Microbial Genomes and Metagenomes (IMG) is a biocomputational system that allows to provide information and support for annotation and comparative analysis of microbial genomes and metagenomes. IMG has been developed by the US Department of Energy (DOE)-Joint Genome Institute (JGI). IMG platform contains both draft and complete genomes, sequenced by Joint Genome Institute and other public and available genomes. Genomes of strains belonging to Archaea, Bacteria, and Eukarya domains are present as well as those of viruses and plasmids. Here, we provide some essential features of IMG system and case study for pangenome analysis.

  1. Uses of antimicrobial genes from microbial genome

    Science.gov (United States)

    Sorek, Rotem; Rubin, Edward M.

    2013-08-20

    We describe a method for mining microbial genomes to discover antimicrobial genes and proteins having broad spectrum of activity. Also described are antimicrobial genes and their expression products from various microbial genomes that were found using this method. The products of such genes can be used as antimicrobial agents or as tools for molecular biology.

  2. Improving microbial genome annotations in an integrated database context.

    Directory of Open Access Journals (Sweden)

    I-Min A Chen

    Full Text Available Effective comparative analysis of microbial genomes requires a consistent and complete view of biological data. Consistency regards the biological coherence of annotations, while completeness regards the extent and coverage of functional characterization for genomes. We have developed tools that allow scientists to assess and improve the consistency and completeness of microbial genome annotations in the context of the Integrated Microbial Genomes (IMG family of systems. All publicly available microbial genomes are characterized in IMG using different functional annotation and pathway resources, thus providing a comprehensive framework for identifying and resolving annotation discrepancies. A rule based system for predicting phenotypes in IMG provides a powerful mechanism for validating functional annotations, whereby the phenotypic traits of an organism are inferred based on the presence of certain metabolic reactions and pathways and compared to experimentally observed phenotypes. The IMG family of systems are available at http://img.jgi.doe.gov/.

  3. Promoter prediction and annotation of microbial genomes based on DNA sequence and structural responses to superhelical stress

    Directory of Open Access Journals (Sweden)

    Benham Craig J

    2006-05-01

    Full Text Available Abstract Background In our previous studies, we found that the sites in prokaryotic genomes which are most susceptible to duplex destabilization under the negative superhelical stresses that occur in vivo are statistically highly significantly associated with intergenic regions that are known or inferred to contain promoters. In this report we investigate how this structural property, either alone or together with other structural and sequence attributes, may be used to search prokaryotic genomes for promoters. Results We show that the propensity for stress-induced DNA duplex destabilization (SIDD is closely associated with specific promoter regions. The extent of destabilization in promoter-containing regions is found to be bimodally distributed. When compared with DNA curvature, deformability, thermostability or sequence motif scores within the -10 region, SIDD is found to be the most informative DNA property regarding promoter locations in the E. coli K12 genome. SIDD properties alone perform better at detecting promoter regions than other programs trained on this genome. Because this approach has a very low false positive rate, it can be used to predict with high confidence the subset of promoters that are strongly destabilized. When SIDD properties are combined with -10 motif scores in a linear classification function, they predict promoter regions with better than 80% accuracy. When these methods were tested with promoter and non-promoter sequences from Bacillus subtilis, they achieved similar or higher accuracies. We also present a strictly SIDD-based predictor for annotating promoter sequences in complete microbial genomes. Conclusion In this report we show that the propensity to undergo stress-induced duplex destabilization (SIDD is a distinctive structural attribute of many prokaryotic promoter sequences. We have developed methods to identify promoter sequences in prokaryotic genomes that use SIDD either as a sole predictor or in

  4. Development and Use of Integrated Microarray-Based Genomic Technologies for Assessing Microbial Community Composition and Dynamics

    Energy Technology Data Exchange (ETDEWEB)

    Zhou, J.; Wu, L.; Gentry, T.; Schadt, C.; He, Z.; Li, X.

    2006-04-05

    To effectively monitor microbial populations involved in various important processes, a 50-mer-based oligonucleotide microarray was developed based on known genes and pathways involved in: biodegradation, metal resistance and reduction, denitrification, nitrification, nitrogen fixation, methane oxidation, methanogenesis, carbon polymer decomposition, and sulfate reduction. This array contains approximately 2000 unique and group-specific probes with <85% similarity to their non-target sequences. Based on artificial probes, our results showed that at hybridization conditions of 50 C and 50% formamide, the 50-mer microarray hybridization can differentiate sequences having <88% similarity. Specificity tests with representative pure cultures indicated that the designed probes on the arrays appeared to be specific to their corresponding target genes. Detection limits were about 5-10ng genomic DNA in the absence of background DNA, and 50-100ng ({approx}1.3{sup o} 10{sup 7} cells) in the presence background DNA. Strong linear relationships between signal intensity and target DNA and RNA concentration were observed (r{sup 2} = 0.95-0.99). Application of this microarray to naphthalene-amended enrichments and soil microcosms demonstrated that composition of the microflora varied depending on incubation conditions. While the naphthalene-degrading genes from Rhodococcus-type microorganisms were dominant in enrichments, the genes involved in naphthalene degradation from Gram-negative microorganisms such as Ralstonia, Comamonas, and Burkholderia were most abundant in the soil microcosms (as well as those for polyaromatic hydrocarbon and nitrotoluene degradation). Although naphthalene degradation is widely known and studied in Pseudomonas, Pseudomonas genes were not detected in either system. Real-time PCR analysis of 4 representative genes was consistent with microarray-based quantification (r{sup 2} = 0.95). Currently, we are also applying this microarray to the study of several

  5. The microbial ocean from genomes to biomes.

    Science.gov (United States)

    DeLong, Edward F

    2009-05-14

    Numerically, microbial species dominate the oceans, yet their population dynamics, metabolic complexity and synergistic interactions remain largely uncharted. A full understanding of life in the ocean requires more than knowledge of marine microbial taxa and their genome sequences. The latest experimental techniques and analytical approaches can provide a fresh perspective on the biological interactions within marine ecosystems, aiding in the construction of predictive models that can interrelate microbial dynamics with the biogeochemical matter and energy fluxes that make up the ocean ecosystem.

  6. Sequencing intractable DNA to close microbial genomes.

    Science.gov (United States)

    Hurt, Richard A; Brown, Steven D; Podar, Mircea; Palumbo, Anthony V; Elias, Dwayne A

    2012-01-01

    Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled "intractable" resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.

  7. Sequencing intractable DNA to close microbial genomes.

    Directory of Open Access Journals (Sweden)

    Richard A Hurt

    Full Text Available Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled "intractable" resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps and the Desulfovibrio africanus genome (1 intractable gap. The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.

  8. Microbial genomics: from sequence to function.

    OpenAIRE

    Schwartz, I

    2000-01-01

    The era of genomics (the study of genes and their function) began a scant dozen years ago with a suggestion by James Watson that the complete DNA sequence of the human genome be determined. Since that time, the human genome project has attracted a great deal of attention in the scientific world and the general media; the scope of the sequencing effort, and the extraordinary value that it will provide, has served to mask the enormous progress in sequencing other genomes. Microbial genome seque...

  9. Millstone: software for multiplex microbial genome analysis and engineering.

    Science.gov (United States)

    Goodman, Daniel B; Kuznetsov, Gleb; Lajoie, Marc J; Ahern, Brian W; Napolitano, Michael G; Chen, Kevin Y; Chen, Changping; Church, George M

    2017-05-25

    Inexpensive DNA sequencing and advances in genome editing have made computational analysis a major rate-limiting step in adaptive laboratory evolution and microbial genome engineering. We describe Millstone, a web-based platform that automates genotype comparison and visualization for projects with up to hundreds of genomic samples. To enable iterative genome engineering, Millstone allows users to design oligonucleotide libraries and create successive versions of reference genomes. Millstone is open source and easily deployable to a cloud platform, local cluster, or desktop, making it a scalable solution for any lab.

  10. ReacKnock: identifying reaction deletion strategies for microbial strain optimization based on genome-scale metabolic network.

    Directory of Open Access Journals (Sweden)

    Zixiang Xu

    Full Text Available Gene knockout has been used as a common strategy to improve microbial strains for producing chemicals. Several algorithms are available to predict the target reactions to be deleted. Most of them apply mixed integer bi-level linear programming (MIBLP based on metabolic networks, and use duality theory to transform bi-level optimization problem of large-scale MIBLP to single-level programming. However, the validity of the transformation was not proved. Solution of MIBLP depends on the structure of inner problem. If the inner problem is continuous, Karush-Kuhn-Tucker (KKT method can be used to reformulate the MIBLP to a single-level one. We adopt KKT technique in our algorithm ReacKnock to attack the intractable problem of the solution of MIBLP, demonstrated with the genome-scale metabolic network model of E. coli for producing various chemicals such as succinate, ethanol, threonine and etc. Compared to the previous methods, our algorithm is fast, stable and reliable to find the optimal solutions for all the chemical products tested, and able to provide all the alternative deletion strategies which lead to the same industrial objective.

  11. Genomics, environmental genomics and the issue of microbial species.

    Science.gov (United States)

    Ward, D M; Cohan, F M; Bhaya, D; Heidelberg, J F; Kühl, M; Grossman, A

    2008-02-01

    A microbial species concept is crucial for interpreting the variation detected by genomics and environmental genomics among cultivated microorganisms and within natural microbial populations. Comparative genomic analyses of prokaryotic species as they are presently described and named have led to the provocative idea that prokaryotes may not form species as we think about them for plants and animals. There are good reasons to doubt whether presently recognized prokaryotic species are truly species. To achieve a better understanding of microbial species, we believe it is necessary to (i) re-evaluate traditional approaches in light of evolutionary and ecological theory, (ii) consider that different microbial species may have evolved in different ways and (iii) integrate genomic, metagenomic and genome-wide expression approaches with ecological and evolutionary theory. Here, we outline how we are using genomic methods to (i) identify ecologically distinct populations (ecotypes) predicted by theory to be species-like fundamental units of microbial communities, and (ii) test their species-like character through in situ distribution and gene expression studies. By comparing metagenomic sequences obtained from well-studied hot spring cyanobacterial mats with genomic sequences of two cultivated cyanobacterial ecotypes, closely related to predominant native populations, we can conduct in situ population genetics studies that identify putative ecotypes and functional genes that determine the ecotypes' ecological distinctness. If individuals within microbial communities are found to be grouped into ecologically distinct, species-like populations, knowing about such populations should guide us to a better understanding of how genomic variation is linked to community function.

  12. Specialized microbial databases for inductive exploration of microbial genome sequences

    Directory of Open Access Journals (Sweden)

    Cabau Cédric

    2005-02-01

    Full Text Available Abstract Background The enormous amount of genome sequence data asks for user-oriented databases to manage sequences and annotations. Queries must include search tools permitting function identification through exploration of related objects. Methods The GenoList package for collecting and mining microbial genome databases has been rewritten using MySQL as the database management system. Functions that were not available in MySQL, such as nested subquery, have been implemented. Results Inductive reasoning in the study of genomes starts from "islands of knowledge", centered around genes with some known background. With this concept of "neighborhood" in mind, a modified version of the GenoList structure has been used for organizing sequence data from prokaryotic genomes of particular interest in China. GenoChore http://bioinfo.hku.hk/genochore.html, a set of 17 specialized end-user-oriented microbial databases (including one instance of Microsporidia, Encephalitozoon cuniculi, a member of Eukarya has been made publicly available. These databases allow the user to browse genome sequence and annotation data using standard queries. In addition they provide a weekly update of searches against the world-wide protein sequences data libraries, allowing one to monitor annotation updates on genes of interest. Finally, they allow users to search for patterns in DNA or protein sequences, taking into account a clustering of genes into formal operons, as well as providing extra facilities to query sequences using predefined sequence patterns. Conclusion This growing set of specialized microbial databases organize data created by the first Chinese bacterial genome programs (ThermaList, Thermoanaerobacter tencongensis, LeptoList, with two different genomes of Leptospira interrogans and SepiList, Staphylococcus epidermidis associated to related organisms for comparison.

  13. Fourteenth-Sixteenth Microbial Genomics Conference-2006-2008

    Energy Technology Data Exchange (ETDEWEB)

    Miller, Jeffrey H

    2011-04-18

    The concept of an annual meeting on the E. coli genome was formulated at the Banbury Center Conference on the Genome of E. coli in October, 1991. The first meeting was held on September 10-14, 1992 at the University of Wisconsin, and this was followed by a yearly series of meetings, and by an expansion to include The fourteenth meeting took place September 24-28, 2006 at Lake Arrowhead, CA, the fifteenth September 16-20, 2007 at the University of Maryland, College Park, MD, and the sixteenth September 14-18, 2008 at Lake Arrowhead. The full program for the 16th meeting is attached. There have been rapid and exciting advances in microbial genomics that now make possible comparing large data sets of sequences from a wide variety of microbial genomes, and from whole microbial communities. Examining the “microbiomes”, the living microbial communities in different host organisms opens up many possibilities for understanding the landscape presented to pathogenic microorganisms. For quite some time there has been a shifting emphasis from pure sequence data to trying to understand how to use that information to solve biological problems. Towards this end new technologies are being developed and improved. Using genetics, functional genomics, and proteomics has been the recent focus of many different laboratories. A key element is the integration of different aspects of microbiology, sequencing technology, analysis techniques, and bioinformatics. The goal of these conference is to provide a regular forum for these interactions to occur. While there have been a number of genome conferences, what distinguishes the Microbial Genomics Conference is its emphasis on bringing together biology and genetics with sequencing and bioinformatics. Also, this conference is the longest continuing meeting, now established as a major regular annual meeting. In addition to its coverage of microbial genomes and biodiversity, the meetings also highlight microbial communities and the use of

  14. VirSorter: mining viral signal from microbial genomic data

    Directory of Open Access Journals (Sweden)

    Simon Roux

    2015-05-01

    Full Text Available Viruses of microbes impact all ecosystems where microbes drive key energy and substrate transformations including the oceans, humans and industrial fermenters. However, despite this recognized importance, our understanding of viral diversity and impacts remains limited by too few model systems and reference genomes. One way to fill these gaps in our knowledge of viral diversity is through the detection of viral signal in microbial genomic data. While multiple approaches have been developed and applied for the detection of prophages (viral genomes integrated in a microbial genome, new types of microbial genomic data are emerging that are more fragmented and larger scale, such as Single-cell Amplified Genomes (SAGs of uncultivated organisms or genomic fragments assembled from metagenomic sequencing. Here, we present VirSorter, a tool designed to detect viral signal in these different types of microbial sequence data in both a reference-dependent and reference-independent manner, leveraging probabilistic models and extensive virome data to maximize detection of novel viruses. Performance testing shows that VirSorter’s prophage prediction capability compares to that of available prophage predictors for complete genomes, but is superior in predicting viral sequences outside of a host genome (i.e., from extrachromosomal prophages, lytic infections, or partially assembled prophages. Furthermore, VirSorter outperforms existing tools for fragmented genomic and metagenomic datasets, and can identify viral signal in assembled sequence (contigs as short as 3kb, while providing near-perfect identification (>95% Recall and 100% Precision on contigs of at least 10kb. Because VirSorter scales to large datasets, it can also be used in “reverse” to more confidently identify viral sequence in viral metagenomes by sorting away cellular DNA whether derived from gene transfer agents, generalized transduction or contamination. Finally, VirSorter is made

  15. Microbial taxonomy in the post-genomic era: Rebuilding from scratch?

    Energy Technology Data Exchange (ETDEWEB)

    Thompson, Cristiane C. [Univ. of Rio de Janeiro (UFRJ) (Brazil); Amaral, Gilda R. [Univ. of Rio de Janeiro (UFRJ) (Brazil); Campeão, Mariana [Univ. of Rio de Janeiro (UFRJ) (Brazil); Edwards, Robert A. [Univ. of Rio de Janeiro (UFRJ) (Brazil); San Diego State Univ., CA (United States); Argonne National Lab. (ANL), Argonne, IL (United States); Polz, Martin F. [Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States); Dutilh, Bas E. [Univ. of Rio de Janeiro (UFRJ) (Brazil); Radbould Univ., Nijmegen (Netherlands); Ussery, David W. [Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States); Sawabe, Tomoo [Hokkaido Univ., Hakodate (Japan); Swings, Jean [Univ. of Rio de Janeiro (UFRJ) (Brazil); Ghent Univ. (Belgium); Thompson, Fabiano L. [Univ. of Rio de Janeiro (UFRJ) (Brazil); Advanced Systems Laboratory Production Management COPPE / UFRJ, Rio de Janeiro (Brazil)

    2014-12-23

    Microbial taxonomy should provide adequate descriptions of bacterial, archaeal, and eukaryotic microbial diversity in ecological, clinical, and industrial environments. We re-evaluated the prokaryote species twice. It is time to revisit polyphasic taxonomy, its principles, and its practice, including its underlying pragmatic species concept. We will be able to realize an old dream of our predecessor taxonomists and build a genomic-based microbial taxonomy, using standardized and automated curation of high-quality complete genome sequences as the new gold standard.

  16. Mining of Microbial Genomes for the Novel Sources of Nitrilases

    Directory of Open Access Journals (Sweden)

    Nikhil Sharma

    2017-01-01

    Full Text Available Next-generation DNA sequencing (NGS has made it feasible to sequence large number of microbial genomes and advancements in computational biology have opened enormous opportunities to mine genome sequence data for novel genes and enzymes or their sources. In the present communication in silico mining of microbial genomes has been carried out to find novel sources of nitrilases. The sequences selected were analyzed for homology and considered for designing motifs. The manually designed motifs based on amino acid sequences of nitrilases were used to screen 2000 microbial genomes (translated to proteomes. This resulted in identification of one hundred thirty-eight putative/hypothetical sequences which could potentially code for nitrilase activity. In vitro validation of nine predicted sources of nitrilases was done for nitrile/cyanide hydrolyzing activity. Out of nine predicted nitrilases, Gluconacetobacter diazotrophicus, Sphingopyxis alaskensis, Saccharomonospora viridis, and Shimwellia blattae were specific for aliphatic nitriles, whereas nitrilases from Geodermatophilus obscurus, Nocardiopsis dassonvillei, Runella slithyformis, and Streptomyces albus possessed activity for aromatic nitriles. Flavobacterium indicum was specific towards potassium cyanide (KCN which revealed the presence of nitrilase homolog, that is, cyanide dihydratase with no activity for either aliphatic, aromatic, or aryl nitriles. The present study reports the novel sources of nitrilases and cyanide dihydratase which were not reported hitherto by in silico or in vitro studies.

  17. Microbial species delineation using whole genome sequences

    Energy Technology Data Exchange (ETDEWEB)

    Kyrpides, Nikos; Mukherjee, Supratim; Ivanova, Natalia; Mavrommatics, Kostas; Pati, Amrita; Konstantinidis, Konstantinos

    2014-10-20

    Species assignments in prokaryotes use a manual, poly-phasic approach utilizing both phenotypic traits and sequence information of phylogenetic marker genes. With thousands of genomes being sequenced every year, an automated, uniform and scalable approach exploiting the rich genomic information in whole genome sequences is desired, at least for the initial assignment of species to an organism. We have evaluated pairwise genome-wide Average Nucleotide Identity (gANI) values and alignment fractions (AFs) for nearly 13,000 genomes using our fast implementation of the computation, identifying robust and widely applicable hard cut-offs for species assignments based on AF and gANI. Using these cutoffs, we generated stable species-level clusters of organisms, which enabled the identification of several species mis-assignments and facilitated the assignment of species for organisms without species definitions.

  18. Community genomics among stratified microbial assemblages in the ocean's interior

    DEFF Research Database (Denmark)

    DeLong, Edward F; Preston, Christina M; Mincer, Tracy

    2006-01-01

    Microbial life predominates in the ocean, yet little is known about its genomic variability, especially along the depth continuum. We report here genomic analyses of planktonic microbial communities in the North Pacific Subtropical Gyre, from the ocean's surface to near-sea floor depths. Sequence...

  19. Comparative Genome Analysis in the Integrated Microbial Genomes(IMG) System

    Energy Technology Data Exchange (ETDEWEB)

    Kyrpides, Nikos C.; Markowitz, Victor M.

    2006-03-01

    Comparative genome analysis is critical for the effectiveexploration of a rapidly growing number of complete and draft sequencesfor microbial genomes. The Integrated Microbial Genomes (IMG) system(img.jgi.doe.gov) has been developed as a community resource thatprovides support for comparative analysis of microbial genomes in anintegrated context. IMG allows users to navigate the multidimensionalmicrobial genome data space and focus their analysis on a subset ofgenes, genomes, and functions of interest. IMG provides graphicalviewers, summaries and occurrence profile tools for comparing genes,pathways and functions (terms) across specific genomes. Genes can befurther examined using gene neighborhoods and compared with sequencealignment tools.

  20. Finishing genomes with limited resources: lessons from an ensemble of microbial genomes

    Directory of Open Access Journals (Sweden)

    Bishop-Lilly Kimberly A

    2010-04-01

    Full Text Available Abstract While new sequencing technologies have ushered in an era where microbial genomes can be easily sequenced, the goal of routinely producing high-quality draft and finished genomes in a cost-effective fashion has still remained elusive. Due to shorter read lengths and limitations in library construction protocols, shotgun sequencing and assembly based on these technologies often results in fragmented assemblies. Correspondingly, while draft assemblies can be obtained in days, finishing can take many months and hence the time and effort can only be justified for high-priority genomes and in large sequencing centers. In this work, we revisit this issue in light of our own experience in producing finished and nearly-finished genomes for a range of microbial species in a small-lab setting. These genomes were finished with surprisingly little investments in terms of time, computational effort and lab work, suggesting that the increased access to sequencing might also eventually lead to a greater proportion of finished genomes from small labs and genomics cores.

  1. Genomic Prospecting for Microbial Biodiesel Production

    Energy Technology Data Exchange (ETDEWEB)

    Lykidis, Athanasios; Lykidis, Athanasios; Ivanova, Natalia

    2008-03-20

    Biodiesel is defined as fatty acid mono-alkylesters and is produced from triacylglycerols. In the current article we provide an overview of the structure, diversity and regulation of the metabolic pathways leading to intracellular fatty acid and triacylglycerol accumulation in three types of organisms (bacteria, algae and fungi) of potential biotechnological interest and discuss possible intervention points to increase the cellular lipid content. The key steps that regulate carbon allocation and distribution in lipids include the formation of malonyl-CoA, the synthesis of fatty acids and their attachment onto the glycerol backbone, and the formation of triacylglycerols. The lipid biosynthetic genes and pathways are largely known for select model organisms. Comparative genomics allows the examination of these pathways in organisms of biotechnological interest and reveals the evolution of divergent and yet uncharacterized regulatory mechanisms. Utilization of microbial systems for triacylglycerol and fatty acid production is in its infancy; however, genomic information and technologies combined with synthetic biology concepts provide the opportunity to further exploit microbes for the competitive production of biodiesel.

  2. Advanced Microbial Taxonomy Combined with Genome-Based-Approaches Reveals that Vibrio astriarenae sp. nov., an Agarolytic Marine Bacterium, Forms a New Clade in Vibrionaceae.

    Directory of Open Access Journals (Sweden)

    Nurhidayu Al-Saari

    Full Text Available Advances in genomic microbial taxonomy have opened the way to create a more universal and transparent concept of species but is still in a transitional stage towards becoming a defining robust criteria for describing new microbial species with minimum features obtained using both genome and classical polyphasic taxonomies. Here we performed advanced microbial taxonomies combined with both genome-based and classical approaches for new agarolytic vibrio isolates to describe not only a novel Vibrio species but also a member of a new Vibrio clade. Two novel vibrio strains (Vibrio astriarenae sp. nov. C7T and C20 showing agarolytic, halophilic and fermentative metabolic activity were isolated from a seawater sample collected in a coral reef in Okinawa. Intraspecific similarities of the isolates were identical in both sequences on the 16S rRNA and pyrH genes, but the closest relatives on the molecular phylogenetic trees on the basis of 16S rRNA and pyrH gene sequences were V. hangzhouensis JCM 15146T (97.8% similarity and V. agarivorans CECT 5085T (97.3% similarity, respectively. Further multilocus sequence analysis (MLSA on the basis of 8 protein coding genes (ftsZ, gapA, gyrB, mreB, pyrH, recA, rpoA, and topA obtained by the genome sequences clearly showed the V. astriarenae strain C7T and C20 formed a distinct new clade protruded next to V. agarivorans CECT 5085T. The singleton V. agarivorans has never been included in previous MLSA of Vibrionaceae due to the lack of some gene sequences. Now the gene sequences are completed and analysis of 100 taxa in total provided a clear picture describing the association of V. agarivorans into pre-existing concatenated network tree and concluded its relationship to our vibrio strains. Experimental DNA-DNA hybridization (DDH data showed that the strains C7T and C20 were conspecific but were separated from all of the other Vibrio species related on the basis of both 16S rRNA and pyrH gene phylogenies (e.g., V

  3. Multiple genome alignment for identifying the core structure among moderately related microbial genomes.

    Science.gov (United States)

    Uchiyama, Ikuo

    2008-10-31

    Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs) that maximally retains the conserved gene orders. The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes. The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes.

  4. Multiple genome alignment for identifying the core structure among moderately related microbial genomes

    Directory of Open Access Journals (Sweden)

    Uchiyama Ikuo

    2008-10-01

    Full Text Available Abstract Background Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs that maximally retains the conserved gene orders. Results The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes. Conclusion The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes.

  5. Microbial taxonomy in the post-genomic era: rebuilding from scratch?

    Science.gov (United States)

    Thompson, Cristiane C; Amaral, Gilda R; Campeão, Mariana; Edwards, Robert A; Polz, Martin F; Dutilh, Bas E; Ussery, David W; Sawabe, Tomoo; Swings, Jean; Thompson, Fabiano L

    2015-04-01

    Microbial taxonomy should provide adequate descriptions of bacterial, archaeal, and eukaryotic microbial diversity in ecological, clinical, and industrial environments. Its cornerstone, the prokaryote species has been re-evaluated twice. It is time to revisit polyphasic taxonomy, its principles, and its practice, including its underlying pragmatic species concept. Ultimately, we will be able to realize an old dream of our predecessor taxonomists and build a genomic-based microbial taxonomy, using standardized and automated curation of high-quality complete genome sequences as the new gold standard.

  6. Assembly-driven community genomics of a hypersaline microbial ecosystem.

    Directory of Open Access Journals (Sweden)

    Sheila Podell

    Full Text Available Microbial populations inhabiting a natural hypersaline lake ecosystem in Lake Tyrrell, Victoria, Australia, have been characterized using deep metagenomic sampling, iterative de novo assembly, and multidimensional phylogenetic binning. Composite genomes representing habitat-specific microbial populations were reconstructed for eleven different archaea and one bacterium, comprising between 0.6 and 14.1% of the planktonic community. Eight of the eleven archaeal genomes were from microbial species without previously cultured representatives. These new genomes provide habitat-specific reference sequences enabling detailed, lineage-specific compartmentalization of predicted functional capabilities and cellular properties associated with both dominant and less abundant community members, including organisms previously known only by their 16S rRNA sequences. Together, these data provide a comprehensive, culture-independent genomic blueprint for ecosystem-wide analysis of protein functions, population structure, and lifestyles of co-existing, co-evolving microbial groups within the same natural habitat. The "assembly-driven" community genomic approach demonstrated in this study advances our ability to push beyond single gene investigations, and promotes genome-scale reconstructions as a tangible goal in the quest to define the metabolic, ecological, and evolutionary dynamics that underpin environmental microbial diversity.

  7. Impact of genomics on microbial food safety

    NARCIS (Netherlands)

    Abee, T.; Schaik, van W.; Siezen, R.J.

    2004-01-01

    Genome sequences are now available for many of the microbes that cause food-borne diseases. The information contained in pathogen genome sequences, together with the development of themed and whole-genome DNA microarrays and improved proteomics techniques, might provide tools for the rapid detection

  8. Genomes of the T4-related bacteriophages as windows on microbial genome evolution

    Directory of Open Access Journals (Sweden)

    Miller Eric S

    2010-10-01

    Full Text Available Abstract The T4-related bacteriophages are a group of bacterial viruses that share morphological similarities and genetic homologies with the well-studied Escherichia coli phage T4, but that diverge from T4 and each other by a number of genetically determined characteristics including the bacterial hosts they infect, the sizes of their linear double-stranded (ds DNA genomes and the predicted compositions of their proteomes. The genomes of about 40 of these phages have been sequenced and annotated over the last several years and are compared here in the context of the factors that have determined their diversity and the diversity of other microbial genomes in evolution. The genomes of the T4 relatives analyzed so far range in size between ~160,000 and ~250,000 base pairs (bp and are mosaics of one another, consisting of clusters of homology between them that are interspersed with segments that vary considerably in genetic composition between the different phage lineages. Based on the known biological and biochemical properties of phage T4 and the proteins encoded by the T4 genome, the T4 relatives reviewed here are predicted to share a genetic core, or "Core Genome" that determines the structural design of their dsDNA chromosomes, their distinctive morphology and the process of their assembly into infectious agents (phage morphogenesis. The Core Genome appears to be the most ancient genetic component of this phage group and constitutes a mere 12-15% of the total protein encoding potential of the typical T4-related phage genome. The high degree of genetic heterogeneity that exists outside of this shared core suggests that horizontal DNA transfer involving many genetic sources has played a major role in diversification of the T4-related phages and their spread to a wide spectrum of bacterial species domains in evolution. We discuss some of the factors and pathways that might have shaped the evolution of these phages and point out several parallels

  9. Intercellular Genomics of Subsurface Microbial Colonies

    Energy Technology Data Exchange (ETDEWEB)

    Ortoleva, Peter; Tuncay, Kagan; Gannon, Dennis; Meile, Christof

    2007-02-14

    This report summarizes progress in the second year of this project. The objective is to develop methods and software to predict the spatial configuration, properties and temporal evolution of microbial colonies in the subsurface. To accomplish this, we integrate models of intracellular processes, cell-host medium exchange and reaction-transport dynamics on the colony scale. At the conclusion of the project, we aim to have the foundations of a predictive mathematical model and software that captures the three scales of these systems – the intracellular, pore, and colony wide spatial scales. In the second year of the project, we refined our transcriptional regulatory network discovery (TRND) approach that utilizes gene expression data along with phylogenic similarity and gene ontology analyses and applied it successfully to E.coli, human B cells, and Geobacter sulfurreducens. We have developed a new Web interface, GeoGen, which is tailored to the reconstruction of microbial TRNs and solely focuses on Geobacter as one of DOE’s high priority microbes. Our developments are designed such that the frameworks for the TRND and GeoGen can readily be used for other microbes of interest to the DOE. In the context of modeling a single bacterium, we are actively pursuing both steady-state and kinetic approaches. The steady-state approach is based on a flux balance that uses maximizing biomass growth rate as its objective, subjected to various biochemical constraints, for the optimal values of reaction rates and uptake/release of metabolites. For the kinetic approach, we use Karyote, a rigorous cell model developed by us for an earlier DOE grant and the DARPA BioSPICE Project. We are also investigating the interplay between bacterial colonies and environment at both pore and macroscopic scales. The pore scale models use detailed representations for realistic porous media accounting for the distribution of grain size whereas the macroscopic models employ the Darcy-type flow

  10. INDIGO - INtegrated data warehouse of microbial genomes with examples from the red sea extremophiles.

    KAUST Repository

    Alam, Intikhab

    2013-12-06

    The next generation sequencing technologies substantially increased the throughput of microbial genome sequencing. To functionally annotate newly sequenced microbial genomes, a variety of experimental and computational methods are used. Integration of information from different sources is a powerful approach to enhance such annotation. Functional analysis of microbial genomes, necessary for downstream experiments, crucially depends on this annotation but it is hampered by the current lack of suitable information integration and exploration systems for microbial genomes.

  11. Microbial ecology and genomics: A crossroads of opportunity

    Energy Technology Data Exchange (ETDEWEB)

    Stahl, David A. [University of Washington; Tiedje, James M. [Michigan State University

    2002-08-30

    Microbes have dominated life on Earth for most of its 4.5 billionyear history. They are the foundation of the biosphere, controlling the biogeochemical cycles and affecting geology, hydrology, and local and global climates. All life is completely dependent upon them. Humans cannot survive without the rich diversity of microbes, but most microbial species can survive without humans. Extraordinary advances in molecular technology have fostered an explosion of information in microbial biology. It is now known that microbial species in culture poorly represent their natural diversity—which dwarfs conventions established for the visible world. This was revealed over the last decade using newer molecular tools to explore environmental diversity and has sparked an explosive growth in microbial ecology and technologies that may profit from the bounty of natural biochemical diversity. Several colloquia and meetings have helped formulate policy recommendations to enable sustained research programs in these areas. One such colloquium organized by the American Academy of Microbiology (“The Microbial World: Foundation of the Biosphere,” 1997) made two key recommendations: (1) develop a more complete inventory of living organisms and the interagency cooperation needed to accomplish this goal, and (2) develop strategies to harvest this remarkable biological diversity for the benefit of science, technology, and society. Complete genome sequence information was identified as an essential part of strategy development, and the recommendation was made to sequence the genome of at least one species of each of the major divisions of microbial life.

  12. Strain-level microbial epidemiology and population genomics from shotgun metagenomics.

    Science.gov (United States)

    Scholz, Matthias; Ward, Doyle V; Pasolli, Edoardo; Tolio, Thomas; Zolfo, Moreno; Asnicar, Francesco; Truong, Duy Tin; Tett, Adrian; Morrow, Ardythe L; Segata, Nicola

    2016-05-01

    Identifying microbial strains and characterizing their functional potential is essential for pathogen discovery, epidemiology and population genomics. We present pangenome-based phylogenomic analysis (PanPhlAn; http://segatalab.cibio.unitn.it/tools/panphlan), a tool that uses metagenomic data to achieve strain-level microbial profiling resolution. PanPhlAn recognized outbreak strains, produced the largest strain-level population genomic study of human-associated bacteria and, in combination with metatranscriptomics, profiled the transcriptional activity of strains in complex communities.

  13. To bioethanol through genomics of microbial synergies

    Energy Technology Data Exchange (ETDEWEB)

    Epstein,

    2013-08-27

    The strategic goal of this project was to advance our understanding of activities and interactions of microorganisms through the advancement of microbial cultivation approaches. In this project we aimed to develop, advance, and use both culture-dependent techniques to address our main hypothesis: “uncultivable” microorganisms and their consortia represent a untapped source of novel species for efficient production of bioethanol. This project has two specific goals: 1. To develop and optimize a high throughput diffusion chamber cultivation approach to isolation of novel environmental bacteria relevant to DOE missions. 2. To use the optimized method to identify and cultivate novel microbial species and their consortia that synergistically hydrolyze various substrates and ferment the sugars to ethanol.

  14. Analysis of pan-genome content and its application in microbial identification

    DEFF Research Database (Denmark)

    Lukjancenko, Oksana

    of genomic data and use this to answer important biological questions. More specifically, comparison of prokaryotic proteomes is used to determine possible sets of functions, essential to sustain microbial life; to extract and interpret similarities and variance in genomic content within different taxonomic...... analyses for the characterization of two Listeria monocytogenes strains. Chapter 4 describes the use of profile HMMs for comparative analysis using for sequence-based homology searches. Paper III introduces PanFunPro a new, profile HMM-based method for pan-genome analysis. Paper IV illustrates...... the application of PanFunPro to a set of more than 2000 genomes; this paper aims to define set of protein families, which are conserved among all the genomes. Papers V demonstrates comparative genomics analysis of proteomes, belonging to Vibrio genus. In the last project, described in Chapter 5, both BLAST...

  15. A Novel Uncultured Bacterium of the Family Gallionellaceae: Description and Genome Reconstruction Based on the Metagenomic Analysis of Microbial Community in Acid Mine Drainage.

    Science.gov (United States)

    Kadnikov, V V; Ivasenko, D A; Beletsky, A V; Mardanov, A V; Danilova, E V; Pimenov, N V; Karnachuk, O V; Ravin, N V

    2016-07-01

    Drainage waters at the metal mining areas often have low pH and high content of dissolved metals due to oxidation of sulfide minerals. Extreme conditions limit microbial diversity in- such ecosystems. A drainage water microbial community (6.5'C, pH 2.65) in an open pit at the Sherlovaya Gora polymetallic open-cast mine (Transbaikal region, Eastern Siberia, Russia) was studied using metagenomic techniques. Metagenome sequencing provided information for taxonomic and functional characterization of the micro- bial community. The majority of microorganisms belonged to a single uncultured lineage representing a new Betaproteobacteria species of the genus Gallionella. While no.acidophiles are known among the cultured members of the family Gallionellaceae, similar 16S rRNA gene sequences were detected in acid mine drain- ages. Bacteria ofthe genera Thiobacillus, Acidobacterium, Acidisphaera, and Acidithiobacillus,-which are com- mon in acid mine drainage environments, were the minor components of the community. Metagenomic data were -used to determine the almost complete (-3.4 Mb) composite genome of the new bacterial. lineage desig- nated Candidatus Gallionella acididurans ShG14-8. Genome analysis revealed that Fe(II) oxidation probably involved the cytochromes localized on the outer membrane of the cell. The electron transport chain included NADH dehydrogenase, a cytochrome bc1 complex, an alternative complex III, and cytochrome oxidases of the bd, cbb3, and bo3 types. Oxidation of reduced sulfur compounds probably involved the Sox system, sul- fide-quinone oxidoreductase, adenyl sulfate reductase, and sulfate adenyltransferase. The genes required for autotrophic carbon assimilation via the Calvin cycle were present, while no pathway for nitrogen fixation was revealed. High numbers of RND metal transporters and P type ATPases were probably responsible for resis- tance to heavy metals. The new microorganism was an aerobic chemolithoautotroph of the group of

  16. Emerging trends in genomic approaches for microbial bioprospecting.

    Science.gov (United States)

    Akondi, K B; Lakshmi, V V

    2013-02-01

    Microorganisms constitute two out of the three domains of life on earth. They exhibit vast biodiversity and metabolic versatility. This enables the microorganisms to inhabit and thrive in even the most extreme environmental conditions, making them all pervading. The magnitude of biodiversity observed among microorganisms substantially supersedes that exhibited by the eukaryotes. These characteristics make the microbial world a very lucrative and inexhaustible resource for prospecting novel bioactive molecules. Despite their vast potential, over 99% of the microbial world still remains to be explored. The primary reason for this is that the culture-dependent methods used in the laboratories are grossly insufficient, as they support the growth of under 1% of the microorganisms found in nature. This limitation necessitated the development of techniques to circumvent culture dependency and gain access to the outstanding majority of the microorganisms. The development of culture-independent techniques has essentially reshaped the study of microbial diversity and community dynamics. Application of genomic and metagenomic approaches is contributing substantially towards characterization of the real microbial diversity. The amenability of these techniques to high throughput has opened the doors to explore the vast number of "uncultivable" microbial forms in substantially lesser time. The present article provides an update on the recent technological advances and emerging trends in exploring microbial community.

  17. The genomics of microbial domestication in the fermented food environment.

    Science.gov (United States)

    Gibbons, John G; Rinker, David C

    2015-12-01

    Shortly after the agricultural revolution, the domestication of bacteria, yeasts, and molds, played an essential role in enhancing the stability, quality, flavor, and texture of food products. These domestication events were probably the result of human food production practices that entailed the continual recycling of isolated microbial communities in the presence of abundant agricultural food sources. We suggest that within these novel agrarian food niches the metabolic requirements of those microbes became regular and predictable resulting in rapid genomic specialization through such mechanisms as pseudogenization, genome decay, interspecific hybridization, gene duplication, and horizontal gene transfer. The ultimate result was domesticated strains of microorganisms with enhanced fermentative capacities.

  18. Microbial NAD metabolism: lessons from comparative genomics.

    Science.gov (United States)

    Gazzaniga, Francesca; Stebbins, Rebecca; Chang, Sheila Z; McPeek, Mark A; Brenner, Charles

    2009-09-01

    NAD is a coenzyme for redox reactions and a substrate of NAD-consuming enzymes, including ADP-ribose transferases, Sir2-related protein lysine deacetylases, and bacterial DNA ligases. Microorganisms that synthesize NAD from as few as one to as many as five of the six identified biosynthetic precursors have been identified. De novo NAD synthesis from aspartate or tryptophan is neither universal nor strictly aerobic. Salvage NAD synthesis from nicotinamide, nicotinic acid, nicotinamide riboside, and nicotinic acid riboside occurs via modules of different genes. Nicotinamide salvage genes nadV and pncA, found in distinct bacteria, appear to have spread throughout the tree of life via horizontal gene transfer. Biochemical, genetic, and genomic analyses have advanced to the point at which the precursors and pathways utilized by a microorganism can be predicted. Challenges remain in dissecting regulation of pathways.

  19. The Fast Changing Landscape of Sequencing Technologies and Their Impact on Microbial Genome Assemblies and Annotation

    Energy Technology Data Exchange (ETDEWEB)

    Mavromatis, K [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Brettin, Thomas S [ORNL; Quest, Daniel J [ORNL; Copeland, A [U.S. Department of Energy, Joint Genome Institute; Clum, Alicia [U.S. Department of Energy, Joint Genome Institute; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Cottingham, Robert W [ORNL; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute

    2012-01-01

    Background: The emergence of next generation sequencing (NGS) has provided the means for rapid and high throughput sequencing and data generation at low cost, while concomitantly creating a new set of challenges. The number of available assembled microbial genomes continues to grow rapidly and their quality reflects the quality of the sequencing technology used, but also of the analysis software employed for assembly and annotation. Methodology/Principal Findings: In this work, we have explored the quality of the microbial draft genomes across various sequencing technologies. We have compared the draft and finished assemblies of 133 microbial genomes sequenced at the Department of Energy-Joint Genome Institute and finished at the Los Alamos National Laboratory using a variety of combinations of sequencing technologies, reflecting the transition of the institute from Sanger-based sequencing platforms to NGS platforms. The quality of the public assemblies and of the associated gene annotations was evaluated using various metrics. Results obtained with the different sequencing technologies, as well as their effects on downstream processes, were analyzed. Our results demonstrate that the Illumina HiSeq 2000 sequencing system, the primary sequencing technology currently used for de novo genome sequencing and assembly at JGI, has various advantages in terms of total sequence throughput and cost, but it also introduces challenges for the downstream analyses. In all cases assembly results although on average are of high quality, need to be viewed critically and consider sources of errors in them prior to analysis. Conclusion: These data follow the evolution of microbial sequencing and downstream processing at the JGI from draft genome sequences with large gaps corresponding to missing genes of significant biological role to assemblies with multiple small gaps (Illumina) and finally to assemblies that generate almost complete genomes (Illumina+PacBio).

  20. Bioinformatics for whole-genome shotgun sequencing of microbial communities.

    Directory of Open Access Journals (Sweden)

    Kevin Chen

    2005-07-01

    Full Text Available The application of whole-genome shotgun sequencing to microbial communities represents a major development in metagenomics, the study of uncultured microbes via the tools of modern genomic analysis. In the past year, whole-genome shotgun sequencing projects of prokaryotic communities from an acid mine biofilm, the Sargasso Sea, Minnesota farm soil, three deep-sea whale falls, and deep-sea sediments have been reported, adding to previously published work on viral communities from marine and fecal samples. The interpretation of this new kind of data poses a wide variety of exciting and difficult bioinformatics problems. The aim of this review is to introduce the bioinformatics community to this emerging field by surveying existing techniques and promising new approaches for several of the most interesting of these computational problems.

  1. CMG-biotools, a free workbench for basic comparative microbial genomics.

    Directory of Open Access Journals (Sweden)

    Tammi Vesth

    Full Text Available BACKGROUND: Today, there are more than a hundred times as many sequenced prokaryotic genomes than were present in the year 2000. The economical sequencing of genomic DNA has facilitated a whole new approach to microbial genomics. The real power of genomics is manifested through comparative genomics that can reveal strain specific characteristics, diversity within species and many other aspects. However, comparative genomics is a field not easily entered into by scientists with few computational skills. The CMG-biotools package is designed for microbiologists with limited knowledge of computational analysis and can be used to perform a number of analyses and comparisons of genomic data. RESULTS: The CMG-biotools system presents a stand-alone interface for comparative microbial genomics. The package is a customized operating system, based on Xubuntu 10.10, available through the open source Ubuntu project. The system can be installed on a virtual computer, allowing the user to run the system alongside any other operating system. Source codes for all programs are provided under GNU license, which makes it possible to transfer the programs to other systems if so desired. We here demonstrate the package by comparing and analyzing the diversity within the class Negativicutes, represented by 31 genomes including 10 genera. The analyses include 16S rRNA phylogeny, basic DNA and codon statistics, proteome comparisons using BLAST and graphical analyses of DNA structures. CONCLUSION: This paper shows the strength and diverse use of the CMG-biotools system. The system can be installed on a vide range of host operating systems and utilizes as much of the host computer as desired. It allows the user to compare multiple genomes, from various sources using standardized data formats and intuitive visualizations of results. The examples presented here clearly shows that users with limited computational experience can perform complicated analysis without much

  2. PromBase: a web resource for various genomic features and predicted promoters in prokaryotic genomes

    Directory of Open Access Journals (Sweden)

    Bansal Manju

    2011-07-01

    Full Text Available Abstract Background As more and more genomes are being sequenced, an overview of their genomic features and annotation of their functional elements, which control the expression of each gene or transcription unit of the genome, is a fundamental challenge in genomics and bioinformatics. Findings Relative stability of DNA sequence has been used to predict promoter regions in 913 microbial genomic sequences with GC-content ranging from 16.6% to 74.9%. Irrespective of the genome GC-content the relative stability based promoter prediction method has already been proven to be robust in terms of recall and precision. The predicted promoter regions for the 913 microbial genomes have been accumulated in a database called PromBase. Promoter search can be carried out in PromBase either by specifying the gene name or the genomic position. Each predicted promoter region has been assigned to a reliability class (low, medium, high, very high and highest based on the difference between its average free energy and the downstream region. The recall and precision values for each class are shown graphically in PromBase. In addition, PromBase provides detailed information about base composition, CDS and CG/TA skews for each genome and various DNA sequence dependent structural properties (average free energy, curvature and bendability in the vicinity of all annotated translation start sites (TLS. Conclusion PromBase is a database, which contains predicted promoter regions and detailed analysis of various genomic features for 913 microbial genomes. PromBase can serve as a valuable resource for comparative genomics study and help the experimentalist to rapidly access detailed information on various genomic features and putative promoter regions in any given genome. This database is freely accessible for academic and non- academic users via the worldwide web http://nucleix.mbu.iisc.ernet.in/prombase/.

  3. From cultured to uncultured genome sequences: metagenomics and modeling microbial ecosystems.

    Science.gov (United States)

    Garza, Daniel R; Dutilh, Bas E

    2015-11-01

    Microorganisms and the viruses that infect them are the most numerous biological entities on Earth and enclose its greatest biodiversity and genetic reservoir. With strength in their numbers, these microscopic organisms are major players in the cycles of energy and matter that sustain all life. Scientists have only scratched the surface of this vast microbial world through culture-dependent methods. Recent developments in generating metagenomes, large random samples of nucleic acid sequences isolated directly from the environment, are providing comprehensive portraits of the composition, structure, and functioning of microbial communities. Moreover, advances in metagenomic analysis have created the possibility of obtaining complete or nearly complete genome sequences from uncultured microorganisms, providing important means to study their biology, ecology, and evolution. Here we review some of the recent developments in the field of metagenomics, focusing on the discovery of genetic novelty and on methods for obtaining uncultured genome sequences, including through the recycling of previously published datasets. Moreover we discuss how metagenomics has become a core scientific tool to characterize eco-evolutionary patterns of microbial ecosystems, thus allowing us to simultaneously discover new microbes and study their natural communities. We conclude by discussing general guidelines and challenges for modeling the interactions between uncultured microorganisms and viruses based on the information contained in their genome sequences. These models will significantly advance our understanding of the functioning of microbial ecosystems and the roles of microbes in the environment.

  4. Low-pass sequencing for microbial comparative genomics

    Directory of Open Access Journals (Sweden)

    Kennedy Sean

    2004-01-01

    IS-element rich genome of H. sp. NRC-1. Identification of multiple TBP and TFB homologs in these four halophiles are consistent with the hypothesis that different types of complex transcriptional regulation may occur through multiple TBP-TFB combinations in response to rapidly changing environmental conditions. Low-pass shotgun sequence analyses of genomes permit extensive and diverse analyses, and should be generally useful for comparative microbial genomics.

  5. pico-PLAZA, a genome database of microbial photosynthetic eukaryotes.

    Science.gov (United States)

    Vandepoele, Klaas; Van Bel, Michiel; Richard, Guilhem; Van Landeghem, Sofie; Verhelst, Bram; Moreau, Hervé; Van de Peer, Yves; Grimsley, Nigel; Piganeau, Gwenael

    2013-08-01

    With the advent of next generation genome sequencing, the number of sequenced algal genomes and transcriptomes is rapidly growing. Although a few genome portals exist to browse individual genome sequences, exploring complete genome information from multiple species for the analysis of user-defined sequences or gene lists remains a major challenge. pico-PLAZA is a web-based resource (http://bioinformatics.psb.ugent.be/pico-plaza/) for algal genomics that combines different data types with intuitive tools to explore genomic diversity, perform integrative evolutionary sequence analysis and study gene functions. Apart from homologous gene families, multiple sequence alignments, phylogenetic trees, Gene Ontology, InterPro and text-mining functional annotations, different interactive viewers are available to study genome organization using gene collinearity and synteny information. Different search functions, documentation pages, export functions and an extensive glossary are available to guide non-expert scientists. To illustrate the versatility of the platform, different case studies are presented demonstrating how pico-PLAZA can be used to functionally characterize large-scale EST/RNA-Seq data sets and to perform environmental genomics. Functional enrichments analysis of 16 Phaeodactylum tricornutum transcriptome libraries offers a molecular view on diatom adaptation to different environments of ecological relevance. Furthermore, we show how complementary genomic data sources can easily be combined to identify marker genes to study the diversity and distribution of algal species, for example in metagenomes, or to quantify intraspecific diversity from environmental strains.

  6. Streamlining genomes: toward the generation of simplified and stabilized microbial systems

    NARCIS (Netherlands)

    Leprince, A.; Passel, van M.W.J.; Martins Dos Santos, V.A.P.

    2012-01-01

    At the junction between systems and synthetic biology, genome streamlining provides a solid foundation both for increased understanding of cellular circuitry, and for the tailoring of microbial chassis towards innovative biotechnological applications. Iterative genomic deletions (targeted and random

  7. Assembling networks of microbial genomes using linear programming

    Directory of Open Access Journals (Sweden)

    Holloway Catherine

    2010-11-01

    Full Text Available Abstract Background Microbial genomes exhibit complex sets of genetic affinities due to lateral genetic transfer. Assessing the relative contributions of parent-to-offspring inheritance and gene sharing is a vital step in understanding the evolutionary origins and modern-day function of an organism, but recovering and showing these relationships is a challenging problem. Results We have developed a new approach that uses linear programming to find between-genome relationships, by treating tables of genetic affinities (here, represented by transformed BLAST e-values as an optimization problem. Validation trials on simulated data demonstrate the effectiveness of the approach in recovering and representing vertical and lateral relationships among genomes. Application of the technique to a set comprising Aquifex aeolicus and 75 other thermophiles showed an important role for large genomes as 'hubs' in the gene sharing network, and suggested that genes are preferentially shared between organisms with similar optimal growth temperatures. We were also able to discover distinct and common genetic contributors to each sequenced representative of genus Pseudomonas. Conclusions The linear programming approach we have developed can serve as an effective inference tool in its own right, and can be an efficient first step in a more-intensive phylogenomic analysis.

  8. Dynamic probe selection for studying microbial transcriptome with high-density genomic tiling microarrays

    Directory of Open Access Journals (Sweden)

    Chen Tsute

    2010-02-01

    Full Text Available Abstract Background Current commercial high-density oligonucleotide microarrays can hold millions of probe spots on a single microscopic glass slide and are ideal for studying the transcriptome of microbial genomes using a tiling probe design. This paper describes a comprehensive computational pipeline implemented specifically for designing tiling probe sets to study microbial transcriptome profiles. Results The pipeline identifies every possible probe sequence from both forward and reverse-complement strands of all DNA sequences in the target genome including circular or linear chromosomes and plasmids. Final probe sequence lengths are adjusted based on the maximal oligonucleotide synthesis cycles and best isothermality allowed. Optimal probes are then selected in two stages - sequential and gap-filling. In the sequential stage, probes are selected from sequence windows tiled alongside the genome. In the gap-filling stage, additional probes are selected from the largest gaps between adjacent probes that have already been selected, until a predefined number of probes is reached. Selection of the highest quality probe within each window and gap is based on five criteria: sequence uniqueness, probe self-annealing, melting temperature, oligonucleotide length, and probe position. Conclusions The probe selection pipeline evaluates global and local probe sequence properties and selects a set of probes dynamically and evenly distributed along the target genome. Unique to other similar methods, an exact number of non-redundant probes can be designed to utilize all the available probe spots on any chosen microarray platform. The pipeline can be applied to microbial genomes when designing high-density tiling arrays for comparative genomics, ChIP chip, gene expression and comprehensive transcriptome studies.

  9. Genomic Sequencing of Single Microbial Cells from Environmental Samples

    Energy Technology Data Exchange (ETDEWEB)

    Ishoey, Thomas; Woyke, Tanja; Stepanauskas, Ramunas; Novotny, Mark; Lasken, Roger S.

    2008-02-01

    Recently developed techniques allow genomic DNA sequencing from single microbial cells [Lasken RS: Single-cell genomic sequencing using multiple displacement amplification, Curr Opin Microbiol 2007, 10:510-516]. Here, we focus on research strategies for putting these methods into practice in the laboratory setting. An immediate consequence of single-cell sequencing is that it provides an alternative to culturing organisms as a prerequisite for genomic sequencing. The microgram amounts of DNA required as template are amplified from a single bacterium by a method called multiple displacement amplification (MDA) avoiding the need to grow cells. The ability to sequence DNA from individual cells will likely have an immense impact on microbiology considering the vast numbers of novel organisms, which have been inaccessible unless culture-independent methods could be used. However, special approaches have been necessary to work with amplified DNA. MDA may not recover the entire genome from the single copy present in most bacteria. Also, some sequence rearrangements can occur during the DNA amplification reaction. Over the past two years many research groups have begun to use MDA, and some practical approaches to single-cell sequencing have been developed. We review the consensus that is emerging on optimum methods, reliability of amplified template, and the proper interpretation of 'composite' genomes which result from the necessity of combining data from several single-cell MDA reactions in order to complete the assembly. Preferred laboratory methods are considered on the basis of experience at several large sequencing centers where >70% of genomes are now often recovered from single cells. Methods are reviewed for preparation of bacterial fractions from environmental samples, single-cell isolation, DNA amplification by MDA, and DNA sequencing.

  10. PATtyFams: Protein families for the microbial genomes in the PATRIC database

    Directory of Open Access Journals (Sweden)

    James J Davis

    2016-02-01

    Full Text Available The ability to build accurate protein families is a fundamental operation in bioinformatics that influences comparative analyses, genome annotation and metabolic modeling. For several years we have been maintaining protein families for all microbial genomes in the PATRIC database (Pathosystems Resource Integration Center, patricbrc.org in order to drive many of the comparative analysis tools that are available through the PATRIC website. However, due to the burgeoning number of genomes, traditional approaches for generating protein families are becoming prohibitive. In this report, we describe a new approach for generating protein families, which we call PATtyFams. This method uses the k-mer-based function assignments available through RAST (Rapid Annotation using Subsystem Technology to rapidly guide family formation, and then differentiates the function-based groups into families using a Markov Cluster algorithm (MCL. This new approach for generating protein families is rapid, scalable and has properties that are consistent with alignment-based methods.

  11. AGeS: a software system for microbial genome sequence annotation.

    Directory of Open Access Journals (Sweden)

    Kamal Kumar

    Full Text Available BACKGROUND: The annotation of genomes from next-generation sequencing platforms needs to be rapid, high-throughput, and fully integrated and automated. Although a few Web-based annotation services have recently become available, they may not be the best solution for researchers that need to annotate a large number of genomes, possibly including proprietary data, and store them locally for further analysis. To address this need, we developed a standalone software application, the Annotation of microbial Genome Sequences (AGeS system, which incorporates publicly available and in-house-developed bioinformatics tools and databases, many of which are parallelized for high-throughput performance. METHODOLOGY: The AGeS system supports three main capabilities. The first is the storage of input contig sequences and the resulting annotation data in a central, customized database. The second is the annotation of microbial genomes using an integrated software pipeline, which first analyzes contigs from high-throughput sequencing by locating genomic regions that code for proteins, RNA, and other genomic elements through the Do-It-Yourself Annotation (DIYA framework. The identified protein-coding regions are then functionally annotated using the in-house-developed Pipeline for Protein Annotation (PIPA. The third capability is the visualization of annotated sequences using GBrowse. To date, we have implemented these capabilities for bacterial genomes. AGeS was evaluated by comparing its genome annotations with those provided by three other methods. Our results indicate that the software tools integrated into AGeS provide annotations that are in general agreement with those provided by the compared methods. This is demonstrated by a >94% overlap in the number of identified genes, a significant number of identical annotated features, and a >90% agreement in enzyme function predictions.

  12. Using Microbial Genome Annotation as a Foundation for Collaborative Student Research

    Science.gov (United States)

    Reed, Kelynne E.; Richardson, John M.

    2013-01-01

    We used the Integrated Microbial Genomes Annotation Collaboration Toolkit as a framework to incorporate microbial genomics research into a microbiology and biochemistry course in a way that promoted student learning of bioinformatics and research skills and emphasized teamwork and collaboration as evidenced through multiple assessment mechanisms.…

  13. The integrated microbial genomes (IMG) system in 2007: datacontent and analysis tool extensions

    Energy Technology Data Exchange (ETDEWEB)

    Markowitz, Victor M.; Szeto, Ernest; Palaniappan, Krishna; Grechkin, Yuri; Chu, Ken; Chen, I-Min A.; Dubchak, Inna; Anderson, Iain; Lykidis, Athanasios; Mavromatis, Konstantinos; Ivanova, Natalia N.; Kyrpides, Nikos C.

    2007-08-01

    The Integrated Microbial Genomes (IMG) system is a data management, analysis and annotation platform for all publicly available genomes. IMG contains both draft and complete JGI microbial genomes integrated with all other publicly available genomes from all three domains of life, together with a large number of plasmids and viruses. IMG provides tools and viewers for analyzing and annotating genomes, genes and functions, individually or in a comparative context. Since its first release in 2005, IMG's data content and analytical capabilities have been constantly expanded through quarterly releases. IMG is provided by the DOE-Joint Genome Institute (JGI) and is available from http://img.jgi.doe.gov.

  14. Whole-Genome Sequencing in Microbial Forensic Analysis of Gamma-Irradiated Microbial Materials.

    Science.gov (United States)

    Broomall, Stacey M; Ait Ichou, Mohamed; Krepps, Michael D; Johnsky, Lauren A; Karavis, Mark A; Hubbard, Kyle S; Insalaco, Joseph M; Betters, Janet L; Redmond, Brady W; Rivers, Bryan A; Liem, Alvin T; Hill, Jessica M; Fochler, Edward T; Roth, Pierce A; Rosenzweig, C Nicole; Skowronski, Evan W; Gibbons, Henry S

    2015-11-13

    Effective microbial forensic analysis of materials used in a potential biological attack requires robust methods of morphological and genetic characterization of the attack materials in order to enable the attribution of the materials to potential sources and to exclude other potential sources. The genetic homogeneity and potential intersample variability of many of the category A to C bioterrorism agents offer a particular challenge to the generation of attributive signatures, potentially requiring whole-genome or proteomic approaches to be utilized. Currently, irradiation of mail is standard practice at several government facilities judged to be at particularly high risk. Thus, initial forensic signatures would need to be recovered from inactivated (nonviable) material. In the study described in this report, we determined the effects of high-dose gamma irradiation on forensic markers of bacterial biothreat agent surrogate organisms with a particular emphasis on the suitability of genomic DNA (gDNA) recovered from such sources as a template for whole-genome analysis. While irradiation of spores and vegetative cells affected the retention of Gram and spore stains and sheared gDNA into small fragments, we found that irradiated material could be utilized to generate accurate whole-genome sequence data on the Illumina and Roche 454 sequencing platforms. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  15. Use of Modern Chemical Protein Synthesis and Advanced Fluorescent Assay Techniques to Experimentally Validate the Functional Annotation of Microbial Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Kent, Stephen [University of Chicago

    2012-07-20

    The objective of this research program was to prototype methods for the chemical synthesis of predicted protein molecules in annotated microbial genomes. High throughput chemical methods were to be used to make large numbers of predicted proteins and protein domains, based on microbial genome sequences. Microscale chemical synthesis methods for the parallel preparation of peptide-thioester building blocks were developed; these peptide segments are used for the parallel chemical synthesis of proteins and protein domains. Ultimately, it is envisaged that these synthetic molecules would be ‘printed’ in spatially addressable arrays. The unique ability of total synthesis to precision label protein molecules with dyes and with chemical or biochemical ‘tags’ can be used to facilitate novel assay technologies adapted from state-of-the art single molecule fluorescence detection techniques. In the future, in conjunction with modern laboratory automation this integrated set of techniques will enable high throughput experimental validation of the functional annotation of microbial genomes.

  16. Phylogeny-guided (meta)genome mining approach for the targeted discovery of new microbial natural products.

    Science.gov (United States)

    Kang, Hahk-Soo

    2017-02-01

    Genomics-based methods are now commonplace in natural products research. A phylogeny-guided mining approach provides a means to quickly screen a large number of microbial genomes or metagenomes in search of new biosynthetic gene clusters of interest. In this approach, biosynthetic genes serve as molecular markers, and phylogenetic trees built with known and unknown marker gene sequences are used to quickly prioritize biosynthetic gene clusters for their metabolites characterization. An increase in the use of this approach has been observed for the last couple of years along with the emergence of low cost sequencing technologies. The aim of this review is to discuss the basic concept of a phylogeny-guided mining approach, and also to provide examples in which this approach was successfully applied to discover new natural products from microbial genomes and metagenomes. I believe that the phylogeny-guided mining approach will continue to play an important role in genomics-based natural products research.

  17. Genomic Encyclopedia of Type Strains, Phase I: The one thousand microbial genomes (KMG-I) project.

    Science.gov (United States)

    Kyrpides, Nikos C; Woyke, Tanja; Eisen, Jonathan A; Garrity, George; Lilburn, Timothy G; Beck, Brian J; Whitman, William B; Hugenholtz, Phil; Klenk, Hans-Peter

    2014-06-15

    The Genomic Encyclopedia of Bacteria and Archaea (GEBA) project was launched by the JGI in 2007 as a pilot project with the objective of sequencing 250 bacterial and archaeal genomes. The two major goals of that project were (a) to test the hypothesis that there are many benefits to the use the phylogenetic diversity of organisms in the tree of life as a primary criterion for generating their genome sequence and (b) to develop the necessary framework, technology and organization for large-scale sequencing of microbial isolate genomes. While the GEBA pilot project has not yet been entirely completed, both of the original goals have already been successfully accomplished, leading the way for the next phase of the project. Here we propose taking the GEBA project to the next level, by generating high quality draft genomes for 1,000 bacterial and archaeal strains. This represents a combined 16-fold increase in both scale and speed as compared to the GEBA pilot project (250 isolate genomes in 4+ years). We will follow a similar approach for organism selection and sequencing prioritization as was done for the GEBA pilot project (i.e. phylogenetic novelty, availability and growth of cultures of type strains and DNA extraction capability), focusing on type strains as this ensures reproducibility of our results and provides the strongest linkage between genome sequences and other knowledge about each strain. In turn, this project will constitute a pilot phase of a larger effort that will target the genome sequences of all available type strains of the Bacteria and Archaea.

  18. The standard operating procedure of the DOE-JGI Microbial Genome Annotation Pipeline (MGAP v.4).

    Science.gov (United States)

    Huntemann, Marcel; Ivanova, Natalia N; Mavromatis, Konstantinos; Tripp, H James; Paez-Espino, David; Palaniappan, Krishnaveni; Szeto, Ernest; Pillay, Manoj; Chen, I-Min A; Pati, Amrita; Nielsen, Torben; Markowitz, Victor M; Kyrpides, Nikos C

    2015-01-01

    The DOE-JGI Microbial Genome Annotation Pipeline performs structural and functional annotation of microbial genomes that are further included into the Integrated Microbial Genome comparative analysis system. MGAP is applied to assembled nucleotide sequence datasets that are provided via the IMG submission site. Dataset submission for annotation first requires project and associated metadata description in GOLD. The MGAP sequence data processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNA features, as well as CRISPR elements. Structural annotation is followed by assignment of protein product names and functions.

  19. The GAAS metagenomic tool and its estimations of viral and microbial average genome size in four major biomes.

    Science.gov (United States)

    Angly, Florent E; Willner, Dana; Prieto-Davó, Alejandra; Edwards, Robert A; Schmieder, Robert; Vega-Thurber, Rebecca; Antonopoulos, Dionysios A; Barott, Katie; Cottrell, Matthew T; Desnues, Christelle; Dinsdale, Elizabeth A; Furlan, Mike; Haynes, Matthew; Henn, Matthew R; Hu, Yongfei; Kirchman, David L; McDole, Tracey; McPherson, John D; Meyer, Folker; Miller, R Michael; Mundt, Egbert; Naviaux, Robert K; Rodriguez-Mueller, Beltran; Stevens, Rick; Wegley, Linda; Zhang, Lixin; Zhu, Baoli; Rohwer, Forest

    2009-12-01

    Metagenomic studies characterize both the composition and diversity of uncultured viral and microbial communities. BLAST-based comparisons have typically been used for such analyses; however, sampling biases, high percentages of unknown sequences, and the use of arbitrary thresholds to find significant similarities can decrease the accuracy and validity of estimates. Here, we present Genome relative Abundance and Average Size (GAAS), a complete software package that provides improved estimates of community composition and average genome length for metagenomes in both textual and graphical formats. GAAS implements a novel methodology to control for sampling bias via length normalization, to adjust for multiple BLAST similarities by similarity weighting, and to select significant similarities using relative alignment lengths. In benchmark tests, the GAAS method was robust to both high percentages of unknown sequences and to variations in metagenomic sequence read lengths. Re-analysis of the Sargasso Sea virome using GAAS indicated that standard methodologies for metagenomic analysis may dramatically underestimate the abundance and importance of organisms with small genomes in environmental systems. Using GAAS, we conducted a meta-analysis of microbial and viral average genome lengths in over 150 metagenomes from four biomes to determine whether genome lengths vary consistently between and within biomes, and between microbial and viral communities from the same environment. Significant differences between biomes and within aquatic sub-biomes (oceans, hypersaline systems, freshwater, and microbialites) suggested that average genome length is a fundamental property of environments driven by factors at the sub-biome level. The behavior of paired viral and microbial metagenomes from the same environment indicated that microbial and viral average genome sizes are independent of each other, but indicative of community responses to stressors and environmental conditions.

  20. The GAAS metagenomic tool and its estimations of viral and microbial average genome size in four major biomes.

    Directory of Open Access Journals (Sweden)

    Florent E Angly

    2009-12-01

    Full Text Available Metagenomic studies characterize both the composition and diversity of uncultured viral and microbial communities. BLAST-based comparisons have typically been used for such analyses; however, sampling biases, high percentages of unknown sequences, and the use of arbitrary thresholds to find significant similarities can decrease the accuracy and validity of estimates. Here, we present Genome relative Abundance and Average Size (GAAS, a complete software package that provides improved estimates of community composition and average genome length for metagenomes in both textual and graphical formats. GAAS implements a novel methodology to control for sampling bias via length normalization, to adjust for multiple BLAST similarities by similarity weighting, and to select significant similarities using relative alignment lengths. In benchmark tests, the GAAS method was robust to both high percentages of unknown sequences and to variations in metagenomic sequence read lengths. Re-analysis of the Sargasso Sea virome using GAAS indicated that standard methodologies for metagenomic analysis may dramatically underestimate the abundance and importance of organisms with small genomes in environmental systems. Using GAAS, we conducted a meta-analysis of microbial and viral average genome lengths in over 150 metagenomes from four biomes to determine whether genome lengths vary consistently between and within biomes, and between microbial and viral communities from the same environment. Significant differences between biomes and within aquatic sub-biomes (oceans, hypersaline systems, freshwater, and microbialites suggested that average genome length is a fundamental property of environments driven by factors at the sub-biome level. The behavior of paired viral and microbial metagenomes from the same environment indicated that microbial and viral average genome sizes are independent of each other, but indicative of community responses to stressors and

  1. Reference Based Genome Compression

    CERN Document Server

    Chern, Bobbie; Manolakos, Alexandros; No, Albert; Venkat, Kartik; Weissman, Tsachy

    2012-01-01

    DNA sequencing technology has advanced to a point where storage is becoming the central bottleneck in the acquisition and mining of more data. Large amounts of data are vital for genomics research, and generic compression tools, while viable, cannot offer the same savings as approaches tuned to inherent biological properties. We propose an algorithm to compress a target genome given a known reference genome. The proposed algorithm first generates a mapping from the reference to the target genome, and then compresses this mapping with an entropy coder. As an illustration of the performance: applying our algorithm to James Watson's genome with hg18 as a reference, we are able to reduce the 2991 megabyte (MB) genome down to 6.99 MB, while Gzip compresses it to 834.8 MB.

  2. Reference Based Genome Compression

    OpenAIRE

    Chern, Bobbie; Ochoa, Idoia; Manolakos, Alexandros; No, Albert; Venkat, Kartik; Weissman, Tsachy

    2012-01-01

    DNA sequencing technology has advanced to a point where storage is becoming the central bottleneck in the acquisition and mining of more data. Large amounts of data are vital for genomics research, and generic compression tools, while viable, cannot offer the same savings as approaches tuned to inherent biological properties. We propose an algorithm to compress a target genome given a known reference genome. The proposed algorithm first generates a mapping from the reference to the target gen...

  3. IMG 4 version of the integrated microbial genomes comparative analysis system

    Energy Technology Data Exchange (ETDEWEB)

    Markowitz, Victor M. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Chen, I-Min A. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Palaniappan, Krishna [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Chu, Ken [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Szeto, Ernest [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Pillay, Manoj [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Ratner, Anna [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Huang, Jinghua [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Woyke, Tanja [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Huntemann, Marcel [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Anderson, Iain [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Billis, Konstantinos [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Varghese, Neha [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Mavromatis, Konstantinos [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Pati, Amrita [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Ivanova, Natalia N. [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Kyrpides, Nikos C. [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program

    2013-10-27

    The Integrated Microbial Genomes (IMG) data warehouse integrates genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context. IMG’s data content and analytical capabilities have increased continuously since its first version released in 2005. Since the last report published in the 2012 NAR Database Issue, IMG’s annotation and data integration pipelines have evolved while new tools have been added for recording and analyzing single cell genomes, RNA Seq and biosynthetic cluster data. Finally, different IMG datamarts provide support for the analysis of publicly available genomes (IMG/W: http://img.jgi.doe.gov/w), expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er) and teaching and training in the area of microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu).

  4. IMG 4 version of the integrated microbial genomes comparative analysis system.

    Science.gov (United States)

    Markowitz, Victor M; Chen, I-Min A; Palaniappan, Krishna; Chu, Ken; Szeto, Ernest; Pillay, Manoj; Ratner, Anna; Huang, Jinghua; Woyke, Tanja; Huntemann, Marcel; Anderson, Iain; Billis, Konstantinos; Varghese, Neha; Mavromatis, Konstantinos; Pati, Amrita; Ivanova, Natalia N; Kyrpides, Nikos C

    2014-01-01

    The Integrated Microbial Genomes (IMG) data warehouse integrates genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context. IMG's data content and analytical capabilities have increased continuously since its first version released in 2005. Since the last report published in the 2012 NAR Database Issue, IMG's annotation and data integration pipelines have evolved while new tools have been added for recording and analyzing single cell genomes, RNA Seq and biosynthetic cluster data. Different IMG datamarts provide support for the analysis of publicly available genomes (IMG/W: http://img.jgi.doe.gov/w), expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er) and teaching and training in the area of microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu).

  5. Tools for Accurate and Efficient Analysis of Complex Evolutionary Mechanisms in Microbial Genomes. Final Report

    Energy Technology Data Exchange (ETDEWEB)

    Nakhleh, Luay

    2014-03-12

    I proposed to develop computationally efficient tools for accurate detection and reconstruction of microbes' complex evolutionary mechanisms, thus enabling rapid and accurate annotation, analysis and understanding of their genomes. To achieve this goal, I proposed to address three aspects. (1) Mathematical modeling. A major challenge facing the accurate detection of HGT is that of distinguishing between these two events on the one hand and other events that have similar "effects." I proposed to develop a novel mathematical approach for distinguishing among these events. Further, I proposed to develop a set of novel optimization criteria for the evolutionary analysis of microbial genomes in the presence of these complex evolutionary events. (2) Algorithm design. In this aspect of the project, I proposed to develop an array of e cient and accurate algorithms for analyzing microbial genomes based on the formulated optimization criteria. Further, I proposed to test the viability of the criteria and the accuracy of the algorithms in an experimental setting using both synthetic as well as biological data. (3) Software development. I proposed the nal outcome to be a suite of software tools which implements the mathematical models as well as the algorithms developed.

  6. INDIGO - INtegrated data warehouse of microbial genomes with examples from the red sea extremophiles.

    Science.gov (United States)

    Alam, Intikhab; Antunes, André; Kamau, Allan Anthony; Ba Alawi, Wail; Kalkatawi, Manal; Stingl, Ulrich; Bajic, Vladimir B

    2013-01-01

    The next generation sequencing technologies substantially increased the throughput of microbial genome sequencing. To functionally annotate newly sequenced microbial genomes, a variety of experimental and computational methods are used. Integration of information from different sources is a powerful approach to enhance such annotation. Functional analysis of microbial genomes, necessary for downstream experiments, crucially depends on this annotation but it is hampered by the current lack of suitable information integration and exploration systems for microbial genomes. We developed a data warehouse system (INDIGO) that enables the integration of annotations for exploration and analysis of newly sequenced microbial genomes. INDIGO offers an opportunity to construct complex queries and combine annotations from multiple sources starting from genomic sequence to protein domain, gene ontology and pathway levels. This data warehouse is aimed at being populated with information from genomes of pure cultures and uncultured single cells of Red Sea bacteria and Archaea. Currently, INDIGO contains information from Salinisphaera shabanensis, Haloplasma contractile, and Halorhabdus tiamatea - extremophiles isolated from deep-sea anoxic brine lakes of the Red Sea. We provide examples of utilizing the system to gain new insights into specific aspects on the unique lifestyle and adaptations of these organisms to extreme environments. We developed a data warehouse system, INDIGO, which enables comprehensive integration of information from various resources to be used for annotation, exploration and analysis of microbial genomes. It will be regularly updated and extended with new genomes. It is aimed to serve as a resource dedicated to the Red Sea microbes. In addition, through INDIGO, we provide our Automatic Annotation of Microbial Genomes (AAMG) pipeline. The INDIGO web server is freely available at http://www.cbrc.kaust.edu.sa/indigo.

  7. INDIGO - INtegrated data warehouse of microbial genomes with examples from the red sea extremophiles.

    Directory of Open Access Journals (Sweden)

    Intikhab Alam

    Full Text Available BACKGROUND: The next generation sequencing technologies substantially increased the throughput of microbial genome sequencing. To functionally annotate newly sequenced microbial genomes, a variety of experimental and computational methods are used. Integration of information from different sources is a powerful approach to enhance such annotation. Functional analysis of microbial genomes, necessary for downstream experiments, crucially depends on this annotation but it is hampered by the current lack of suitable information integration and exploration systems for microbial genomes. RESULTS: We developed a data warehouse system (INDIGO that enables the integration of annotations for exploration and analysis of newly sequenced microbial genomes. INDIGO offers an opportunity to construct complex queries and combine annotations from multiple sources starting from genomic sequence to protein domain, gene ontology and pathway levels. This data warehouse is aimed at being populated with information from genomes of pure cultures and uncultured single cells of Red Sea bacteria and Archaea. Currently, INDIGO contains information from Salinisphaera shabanensis, Haloplasma contractile, and Halorhabdus tiamatea - extremophiles isolated from deep-sea anoxic brine lakes of the Red Sea. We provide examples of utilizing the system to gain new insights into specific aspects on the unique lifestyle and adaptations of these organisms to extreme environments. CONCLUSIONS: We developed a data warehouse system, INDIGO, which enables comprehensive integration of information from various resources to be used for annotation, exploration and analysis of microbial genomes. It will be regularly updated and extended with new genomes. It is aimed to serve as a resource dedicated to the Red Sea microbes. In addition, through INDIGO, we provide our Automatic Annotation of Microbial Genomes (AAMG pipeline. The INDIGO web server is freely available at http://www.cbrc.kaust.edu.sa/indigo.

  8. Systematic evaluation of bias in microbial community profiles induced by whole genome amplification

    NARCIS (Netherlands)

    Direito, S.O.L.; Zaura, E.; Little, M.; Ehrenfreund, P.; Röling, W.F.M.

    2014-01-01

    Whole genome amplification methods facilitate the detection and characterization of microbial communities in low biomass environments. We examined the extent to which the actual community structure is reliably revealed and factors contributing to bias. One widely used [multiple displacement amplific

  9. IMG ER: A System for Microbial Genome Annotation Expert Review and Curation

    Energy Technology Data Exchange (ETDEWEB)

    Markowitz, Victor M.; Mavromatis, Konstantinos; Ivanova, Natalia N.; Chen, I-Min A.; Chu, Ken; Kyrpides, Nikos C.

    2009-05-25

    A rapidly increasing number of microbial genomes are sequenced by organizations worldwide and are eventually included into various public genome data resources. The quality of the annotations depends largely on the original dataset providers, with erroneous or incomplete annotations often carried over into the public resources and difficult to correct. We have developed an Expert Review (ER) version of the Integrated Microbial Genomes (IMG) system, with the goal of supporting systematic and efficient revision of microbial genome annotations. IMG ER provides tools for the review and curation of annotations of both new and publicly available microbial genomes within IMG's rich integrated genome framework. New genome datasets are included into IMG ER prior to their public release either with their native annotations or with annotations generated by IMG ER's annotation pipeline. IMG ER tools allow addressing annotation problems detected with IMG's comparative analysis tools, such as genes missed by gene prediction pipelines or genes without an associated function. Over the past year, IMG ER was used for improving the annotations of about 150 microbial genomes.

  10. Persisting viral sequences shape microbial CRISPR-based immunity.

    Directory of Open Access Journals (Sweden)

    Ariel D Weinberger

    Full Text Available Well-studied innate immune systems exist throughout bacteria and archaea, but a more recently discovered genomic locus may offer prokaryotes surprising immunological adaptability. Mediated by a cassette-like genomic locus termed Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR, the microbial adaptive immune system differs from its eukaryotic immune analogues by incorporating new immunities unidirectionally. CRISPR thus stores genomically recoverable timelines of virus-host coevolution in natural organisms refractory to laboratory cultivation. Here we combined a population genetic mathematical model of CRISPR-virus coevolution with six years of metagenomic sequencing to link the recoverable genomic dynamics of CRISPR loci to the unknown population dynamics of virus and host in natural communities. Metagenomic reconstructions in an acid-mine drainage system document CRISPR loci conserving ancestral immune elements to the base-pair across thousands of microbial generations. This 'trailer-end conservation' occurs despite rapid viral mutation and despite rapid prokaryotic genomic deletion. The trailer-ends of many reconstructed CRISPR loci are also largely identical across a population. 'Trailer-end clonality' occurs despite predictions of host immunological diversity due to negative frequency dependent selection (kill the winner dynamics. Statistical clustering and model simulations explain this lack of diversity by capturing rapid selective sweeps by highly immune CRISPR lineages. Potentially explaining 'trailer-end conservation,' we record the first example of a viral bloom overwhelming a CRISPR system. The polyclonal viruses bloom even though they share sequences previously targeted by host CRISPR loci. Simulations show how increasing random genomic deletions in CRISPR loci purges immunological controls on long-lived viral sequences, allowing polyclonal viruses to bloom and depressing host fitness. Our results thus link

  11. SOP for pathway inference in Integrated Microbial Genomes (IMG).

    Science.gov (United States)

    Anderson, Iain; Chen, Amy; Markowitz, Victor; Kyrpides, Nikos; Ivanova, Natalia

    2011-12-31

    One of the most important aspects of genomic analysis is the prediction of which pathways, both metabolic and non-metabolic, are present in an organism. In IMG, this is carried out by the assignment of IMG terms, which are organized into IMG pathways. Based on manual and automatic assignment of IMG terms, the presence or absence of IMG pathways is automatically inferred. The three categories of pathway assertion are asserted (likely present), not asserted (likely absent), and unknown. In the unknown category, at least one term necessary for the pathway is missing, but an ortholog in another organism has the corresponding term assigned to it. Automatic pathway inference is an important initial step in genome analysis.

  12. The DOE-JGI Standard Operating Procedure for the Annotations of the Microbial Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Mavromatis, Konstantinos; Ivanova, Natalia; Chen, I-Min A.; Szeto, Ernest; Markowitz, Victor; Kyrpides, Nikos C.

    2009-05-20

    The DOE-JGI Microbial Annotation Pipeline (DOE-JGI MAP) supports gene prediction and/or functional annotation of microbial genomes towards comparative analysis with the Integrated Microbial Genome (IMG) system. DOE-JGI MAP annotation is applied on nucleotide sequence datasets included in the IMG-ER (Expert Review) version of IMG via the IMG ER submission site. Users can submit the sequence datasets consisting of one or more contigs in a multi-fasta file. DOE-JGI MAP annotation includes prediction of protein coding and RNA genes, as well as repeats and assignment of product names to these genes.

  13. Bioreactor microbial ecosystems for thiocyanate and cyanide degradation unravelled with genome-resolved metagenomics.

    Science.gov (United States)

    Kantor, Rose S; van Zyl, A Wynand; van Hille, Robert P; Thomas, Brian C; Harrison, Susan T L; Banfield, Jillian F

    2015-12-01

    Gold ore processing uses cyanide (CN(-) ), which often results in large volumes of thiocyanate- (SCN(-) ) contaminated wastewater requiring treatment. Microbial communities can degrade SCN(-) and CN(-) , but little is known about their membership and metabolic potential. Microbial-based remediation strategies will benefit from an ecological understanding of organisms involved in the breakdown of SCN(-) and CN(-) into sulfur, carbon and nitrogen compounds. We performed metagenomic analysis of samples from two laboratory-scale bioreactors used to study SCN(-) and CN(-) degradation. Community analysis revealed the dominance of Thiobacillus spp., whose genomes harbour a previously unreported operon for SCN(-) degradation. Genome-based metabolic predictions suggest that a large portion of each bioreactor community is autotrophic, relying not on molasses in reactor feed but using energy gained from oxidation of sulfur compounds produced during SCN(-) degradation. Heterotrophs, including a bacterium from a previously uncharacterized phylum, compose a smaller portion of the reactor community. Predation by phage and eukaryotes is predicted to affect community dynamics. Genes for ammonium oxidation and denitrification were detected, indicating the potential for nitrogen removal, as required for complete remediation of wastewater. These findings suggest optimization strategies for reactor design, such as improved aerobic/anaerobic partitioning and elimination of organic carbon from reactor feed.

  14. Colloquium and Report on Systems Microbiology: Beyond Microbial Genomics

    Energy Technology Data Exchange (ETDEWEB)

    Merry R. Buckley

    2004-12-13

    The American Academy of Microbiology convened a colloquium June 4-6, 2004 to confer about the scientific promise of systems microbiology. Participants discussed the power of applying a systems approach to the study of biology and to microbiology in particular, specifics about current research efforts, technical bottlenecks, requirements for data acquisition and maintenance, educational needs, and communication issues surrounding the field. A number of recommendations were made for removing barriers to progress in systems microbiology and for improving opportunities in education and collaboration. Systems biology, as a concept, is not new, but the recent explosion of genomic sequences and related data has revived interest in the field. Systems microbiology, a subset of systems biology, represents a different approach to investigating biological systems. It attempts to examine the emergent properties of microorganisms that arise from the interplay of genes, proteins, other macromolecules, small molecules, organelles, and the environment. It is these interactions, often nonlinear, that lead to the emergent properties of biological systems that are generally not tractable by traditional approaches. As a complement to the long-standing trend toward reductionism, systems microbiology seeks to treat the organism or community as a whole, integrating fundamental biological knowledge with genomics, metabolomics, and other data to create an integrated picture of how a microbial cell or community operates. Systems microbiology promises not only to shed light on the activities of microbes, but will also provide biology the tools and approaches necessary for achieving a better understanding of life and ecosystems. Microorganisms are ideal candidates for systems biology research because they are relatively easy to manipulate and because they play critical roles in health, environment, agriculture, and energy production. Potential applications of systems microbiology research

  15. Colloquium and Report on Systems Microbiology: Beyond Microbial Genomics

    Energy Technology Data Exchange (ETDEWEB)

    Merry R. Buckley

    2004-12-13

    The American Academy of Microbiology convened a colloquium June 4-6, 2004 to confer about the scientific promise of systems microbiology. Participants discussed the power of applying a systems approach to the study of biology and to microbiology in particular, specifics about current research efforts, technical bottlenecks, requirements for data acquisition and maintenance, educational needs, and communication issues surrounding the field. A number of recommendations were made for removing barriers to progress in systems microbiology and for improving opportunities in education and collaboration. Systems biology, as a concept, is not new, but the recent explosion of genomic sequences and related data has revived interest in the field. Systems microbiology, a subset of systems biology, represents a different approach to investigating biological systems. It attempts to examine the emergent properties of microorganisms that arise from the interplay of genes, proteins, other macromolecules, small molecules, organelles, and the environment. It is these interactions, often nonlinear, that lead to the emergent properties of biological systems that are generally not tractable by traditional approaches. As a complement to the long-standing trend toward reductionism, systems microbiology seeks to treat the organism or community as a whole, integrating fundamental biological knowledge with genomics, metabolomics, and other data to create an integrated picture of how a microbial cell or community operates. Systems microbiology promises not only to shed light on the activities of microbes, but will also provide biology the tools and approaches necessary for achieving a better understanding of life and ecosystems. Microorganisms are ideal candidates for systems biology research because they are relatively easy to manipulate and because they play critical roles in health, environment, agriculture, and energy production. Potential applications of systems microbiology research

  16. FDA Bioinformatics Tool for Microbial Genomics Research on Molecular Characterization of Bacterial Foodborne Pathogens Using Microarrays

    Science.gov (United States)

    Background: Advances in microbial genomics and bioinformatics are offering greater insights into the emergence and spread of foodborne pathogens in outbreak scenarios. The Food and Drug Administration (FDA) has developed the genomics tool ArrayTrackTM, which provides extensive functionalities to man...

  17. The Integrated Microbial Genomes (IMG) System: An Expanding Comparative Analysis Resource

    Energy Technology Data Exchange (ETDEWEB)

    Markowitz, Victor M.; Chen, I-Min A.; Palaniappan, Krishna; Chu, Ken; Szeto, Ernest; Grechkin, Yuri; Ratner, Anna; Anderson, Iain; Lykidis, Athanasios; Mavromatis, Konstantinos; Ivanova, Natalia N.; Kyrpides, Nikos C.

    2009-09-13

    The integrated microbial genomes (IMG) system serves as a community resource for comparative analysis of publicly available genomes in a comprehensive integrated context. IMG contains both draft and complete microbial genomes integrated with other publicly available genomes from all three domains of life, together with a large number of plasmids and viruses. IMG provides tools and viewers for analyzing and reviewing the annotations of genes and genomes in a comparative context. Since its first release in 2005, IMG's data content and analytical capabilities have been constantly expanded through regular releases. Several companion IMG systems have been set up in order to serve domain specific needs, such as expert review of genome annotations. IMG is available at .

  18. Optimizing read mapping to reference genomes to determine composition and species prevalence in microbial communities.

    Directory of Open Access Journals (Sweden)

    John Martin

    Full Text Available The Human Microbiome Project (HMP aims to characterize the microbial communities of 18 body sites from healthy individuals. To accomplish this, the HMP generated two types of shotgun data: reference shotgun sequences isolated from different anatomical sites on the human body and shotgun metagenomic sequences from the microbial communities of each site. The alignment strategy for characterizing these metagenomic communities using available reference sequence is important to the success of HMP data analysis. Six next-generation aligners were used to align a community of known composition against a database comprising reference organisms known to be present in that community. All aligners report nearly complete genome coverage (>97% for strains with over 6X depth of coverage, however they differ in speed, memory requirement and ease of use issues such as database size limitations and supported mapping strategies. The selected aligner was tested across a range of parameters to maximize sensitivity while maintaining a low false positive rate. We found that constraining alignment length had more impact on sensitivity than does constraining similarity in all cases tested. However, when reference species were replaced with phylogenetic neighbors, similarity begins to play a larger role in detection. We also show that choosing the top hit randomly when multiple, equally strong mappings are available increases overall sensitivity at the expense of taxonomic resolution. The results of this study identified a strategy that was used to map over 3 tera-bases of microbial sequence against a database of more than 5,000 reference genomes in just over a month.

  19. Ten years of maintaining and expanding a microbial genome and metagenome analysis system.

    Science.gov (United States)

    Markowitz, Victor M; Chen, I-Min A; Chu, Ken; Pati, Amrita; Ivanova, Natalia N; Kyrpides, Nikos C

    2015-11-01

    Launched in March 2005, the Integrated Microbial Genomes (IMG) system is a comprehensive data management system that supports multidimensional comparative analysis of genomic data. At the core of the IMG system is a data warehouse that contains genome and metagenome datasets sequenced at the Joint Genome Institute or provided by scientific users, as well as public genome datasets available at the National Center for Biotechnology Information Genbank sequence data archive. Genomes and metagenome datasets are processed using IMG's microbial genome and metagenome sequence data processing pipelines and are integrated into the data warehouse using IMG's data integration toolkits. Microbial genome and metagenome application specific data marts and user interfaces provide access to different subsets of IMG's data and analysis toolkits. This review article revisits IMG's original aims, highlights key milestones reached by the system during the past 10 years, and discusses the main challenges faced by a rapidly expanding system, in particular the complexity of maintaining such a system in an academic setting with limited budgets and computing and data management infrastructure.

  20. Salmonella source attribution based on microbial subtyping

    DEFF Research Database (Denmark)

    Barco, Lisa; Barrucci, Federica; Olsen, John Elmerdahl

    2013-01-01

    Source attribution of cases of food-borne disease represents a valuable tool for identifying and prioritizing effective food-safety interventions. Microbial subtyping is one of the most common methods to infer potential sources of human food-borne infections. So far, Salmonella microbial subtyping...... source attribution models have been implemented by using serotyping and phage-typing data. Molecular-based methods may prove to be similarly valuable in the future, as already demonstrated for other food-borne pathogens like Campylobacter. This review assesses the state of the art concerning Salmonella...... in the context of their potential applicability for Salmonella source attribution studies....

  1. Large-scale contamination of microbial isolate genomes by Illumina PhiX control.

    Science.gov (United States)

    Mukherjee, Supratim; Huntemann, Marcel; Ivanova, Natalia; Kyrpides, Nikos C; Pati, Amrita

    2015-01-01

    With the rapid growth and development of sequencing technologies, genomes have become the new go-to for exploring solutions to some of the world's biggest challenges such as searching for alternative energy sources and exploration of genomic dark matter. However, progress in sequencing has been accompanied by its share of errors that can occur during template or library preparation, sequencing, imaging or data analysis. In this study we screened over 18,000 publicly available microbial isolate genome sequences in the Integrated Microbial Genomes database and identified more than 1000 genomes that are contaminated with PhiX, a control frequently used during Illumina sequencing runs. Approximately 10% of these genomes have been published in literature and 129 contaminated genomes were sequenced under the Human Microbiome Project. Raw sequence reads are prone to contamination from various sources and are usually eliminated during downstream quality control steps. Detection of PhiX contaminated genomes indicates a lapse in either the application or effectiveness of proper quality control measures. The presence of PhiX contamination in several publicly available isolate genomes can result in additional errors when such data are used in comparative genomics analyses. Such contamination of public databases have far-reaching consequences in the form of erroneous data interpretation and analyses, and necessitates better measures to proofread raw sequences before releasing them to the broader scientific community.

  2. The evolution of microbial species - a view through the genomic lens

    Energy Technology Data Exchange (ETDEWEB)

    Varghese, Neha; Mukherjee, Supratim; ivanova, Natalia; Mavromatis, Konstantinos; Konstantinidis, Konstantinos; Kyrpides, Nikos; Pati, Amrita

    2014-03-17

    For a long time prokaryotic species definition has been under debate and a constant source of turmoil in microbiology. This has recently prompted the ASM to call for a scalable and reproducible technique, which uses meaningful commonalities to cluster microorganisms into groups corresponding to prokaryotic species. Whole-genome Average Nucleotide Identity (gANI) was previously suggested as a measure of genetic distance that generally agrees with prokaryotic species assignments based on the accepted best practices (DNA-DNA hybridization and 16S rDNA similarity). In this work, we prove that gANI is indeed the meaningful commonality based on which microorganisms can be grouped into the aforementioned clusters. By analyzing 1.76 million pairs of genomes we find that identification of the closest relatives of an organism via gANI is precise, scalable, reproducible, and reflects the evolutionary dynamics of microbes. We model the previously unexplored statistical properties of gANI using 6,000 microbial genomes and apply species-specific gANI cutoffs to reveal anomalies in the current taxonomic species definitions for almost 50percent of the species with multiple genome sequences. We also provide evidence of speciation events and genetic continuums in 17.8percent of those species. We consider disagreements between gANI-based groupings and named species and demonstrate that the former have all the desired features to serve as the much-needed natural groups for moving forward with taxonomy. Further, the groupings identified are presented in detail at http://ani.jgi-psf.org to facilitate comprehensive downstream analysis for researchers across different disciplines

  3. Overview: The Impact of Microbial Genomics on Food Safety

    Science.gov (United States)

    Milillo, Sara R.; Wiedmann, Martin; Hoelzer, Karin

    The first use of the term "genome" is attributed to Hans Winkler in his 1920 publication Verbeitung und Ursache der Parthenogenesis im Pflanzen und Tierreiche (Winkler, 1920). However, it was not until 1986 that the study of genomic concepts coalesced with the creation of a new journal by the same name (McKusick, 1997). The study of genomics was initially defined as the use or the application of "informatic tools" to study features of a sequenced genome (Strauss and Falkow, 1997). Today the field of genomics is typically considered to encompass efforts to determine the nucleic acid DNA sequence of an organism as well as the expression of genetic information using high-throughput, genome-wide methods, including transcriptomic, proteomic, and metabolomic analyses.

  4. G-InforBIO: integrated system for microbial genomics

    Directory of Open Access Journals (Sweden)

    Abe Takashi

    2006-08-01

    Full Text Available Abstract Background Genome databases contain diverse kinds of information, including gene annotations and nucleotide and amino acid sequences. It is not easy to integrate such information for genomic study. There are few tools for integrated analyses of genomic data, therefore, we developed software that enables users to handle, manipulate, and analyze genome data with a variety of sequence analysis programs. Results The G-InforBIO system is a novel tool for genome data management and sequence analysis. The system can import genome data encoded as eXtensible Markup Language documents as formatted text documents, including annotations and sequences, from DNA Data Bank of Japan and GenBank encoded as flat files. The genome database is constructed automatically after importing, and the database can be exported as documents formatted with eXtensible Markup Language or tab-deliminated text. Users can retrieve data from the database by keyword searches, edit annotation data of genes, and process data with G-InforBIO. In addition, information in the G-InforBIO database can be analyzed seamlessly with nine different software programs, including programs for clustering and homology analyses. Conclusion The G-InforBIO system simplifies genome analyses by integrating several available software programs to allow efficient handling and manipulation of genome data. G-InforBIO is freely available from the download site.

  5. Rapid Prototyping of Microbial Cell Factories via Genome-scale Engineering

    Science.gov (United States)

    Si, Tong; Xiao, Han; Zhao, Huimin

    2014-01-01

    Advances in reading, writing and editing genetic materials have greatly expanded our ability to reprogram biological systems at the resolution of a single nucleotide and on the scale of a whole genome. Such capacity has greatly accelerated the cycles of design, build and test to engineer microbes for efficient synthesis of fuels, chemicals and drugs. In this review, we summarize the emerging technologies that have been applied, or are potentially useful for genome-scale engineering in microbial systems. We will focus on the development of high-throughput methodologies, which may accelerate the prototyping of microbial cell factories. PMID:25450192

  6. On the total number of genes and their length distribution in complete microbial genomes

    DEFF Research Database (Denmark)

    Skovgaard, Marie; Jensen, L.J.; Brunak, Søren;

    2001-01-01

    In sequenced microbial genomes, some of the annotated genes are actually not protein-coding genes, but rather open reading frames that occur by chance. Therefore, the number of annotated genes is higher than the actual number of genes for most of these microbes. Comparison of the length distribut......In sequenced microbial genomes, some of the annotated genes are actually not protein-coding genes, but rather open reading frames that occur by chance. Therefore, the number of annotated genes is higher than the actual number of genes for most of these microbes. Comparison of the length...

  7. Rapid prototyping of microbial cell factories via genome-scale engineering.

    Science.gov (United States)

    Si, Tong; Xiao, Han; Zhao, Huimin

    2015-11-15

    Advances in reading, writing and editing genetic materials have greatly expanded our ability to reprogram biological systems at the resolution of a single nucleotide and on the scale of a whole genome. Such capacity has greatly accelerated the cycles of design, build and test to engineer microbes for efficient synthesis of fuels, chemicals and drugs. In this review, we summarize the emerging technologies that have been applied, or are potentially useful for genome-scale engineering in microbial systems. We will focus on the development of high-throughput methodologies, which may accelerate the prototyping of microbial cell factories.

  8. Genomic Reconstruction of Carbohydrate Utilization Capacities in Microbial-Mat Derived Consortia

    Directory of Open Access Journals (Sweden)

    Semen A. Leyn

    2017-07-01

    Full Text Available Two nearly identical unicyanobacterial consortia (UCC were previously isolated from benthic microbial mats that occur in a heliothermal saline lake in northern Washington State. Carbohydrates are a primary source of carbon and energy for most heterotrophic bacteria. Since CO2 is the only carbon source provided, the cyanobacterium must provide a source of carbon to the heterotrophs. Available genomic sequences for all members of the UCC provide opportunity to investigate the metabolic routes of carbon transfer between autotroph and heterotrophs. Here, we applied a subsystem-based comparative genomics approach to reconstruct carbohydrate utilization pathways and identify glycohydrolytic enzymes, carbohydrate transporters and pathway-specific transcriptional regulators in 17 heterotrophic members of the UCC. The reconstructed metabolic pathways include 800 genes, near a one-fourth of which encode enzymes, transporters and regulators with newly assigned metabolic functions resulting in discovery of novel functional variants of carbohydrate utilization pathways. The in silico analysis revealed the utilization capabilities for 40 carbohydrates and their derivatives. Two Halomonas species demonstrated the largest number of sugar catabolic pathways. Trehalose, sucrose, maltose, glucose, and beta-glucosides are the most commonly utilized saccharides in this community. Reconstructed regulons for global regulators HexR and CceR include central carbohydrate metabolism genes in the members of Gammaproteobacteria and Alphaproteobacteria, respectively. Genomics analyses were supplemented by experimental characterization of metabolic phenotypes in four isolates derived from the consortia. Measurements of isolate growth on the defined medium supplied with individual carbohydrates confirmed most of the predicted catabolic phenotypes. Not all consortia members use carbohydrates and only a few use complex polysaccharides suggesting a hierarchical carbon flow from

  9. Genomic Reconstruction of Carbohydrate Utilization Capacities in Microbial-Mat Derived Consortia

    Science.gov (United States)

    Leyn, Semen A.; Maezato, Yukari; Romine, Margaret F.; Rodionov, Dmitry A.

    2017-01-01

    Two nearly identical unicyanobacterial consortia (UCC) were previously isolated from benthic microbial mats that occur in a heliothermal saline lake in northern Washington State. Carbohydrates are a primary source of carbon and energy for most heterotrophic bacteria. Since CO2 is the only carbon source provided, the cyanobacterium must provide a source of carbon to the heterotrophs. Available genomic sequences for all members of the UCC provide opportunity to investigate the metabolic routes of carbon transfer between autotroph and heterotrophs. Here, we applied a subsystem-based comparative genomics approach to reconstruct carbohydrate utilization pathways and identify glycohydrolytic enzymes, carbohydrate transporters and pathway-specific transcriptional regulators in 17 heterotrophic members of the UCC. The reconstructed metabolic pathways include 800 genes, near a one-fourth of which encode enzymes, transporters and regulators with newly assigned metabolic functions resulting in discovery of novel functional variants of carbohydrate utilization pathways. The in silico analysis revealed the utilization capabilities for 40 carbohydrates and their derivatives. Two Halomonas species demonstrated the largest number of sugar catabolic pathways. Trehalose, sucrose, maltose, glucose, and beta-glucosides are the most commonly utilized saccharides in this community. Reconstructed regulons for global regulators HexR and CceR include central carbohydrate metabolism genes in the members of Gammaproteobacteria and Alphaproteobacteria, respectively. Genomics analyses were supplemented by experimental characterization of metabolic phenotypes in four isolates derived from the consortia. Measurements of isolate growth on the defined medium supplied with individual carbohydrates confirmed most of the predicted catabolic phenotypes. Not all consortia members use carbohydrates and only a few use complex polysaccharides suggesting a hierarchical carbon flow from cyanobacteria to

  10. Use of genome-scale microbial models for metabolic engineering

    DEFF Research Database (Denmark)

    Patil, Kiran Raosaheb; Åkesson, M.; Nielsen, Jens

    2004-01-01

    network structures. The major challenge for metabolic engineering in the post-genomic era is to broaden its design methodologies to incorporate genome-scale biological data. Genome-scale stoichiometric models of microorganisms represent a first step in this direction.......Metabolic engineering serves as an integrated approach to design new cell factories by providing rational design procedures and valuable mathematical and experimental tools. Mathematical models have an important role for phenotypic analysis, but can also be used for the design of optimal metabolic...

  11. Direct coupling of a genome-scale microbial in silico model and a groundwater reactive transport model

    Science.gov (United States)

    Fang, Yilin; Scheibe, Timothy D.; Mahadevan, Radhakrishnan; Garg, Srinath; Long, Philip E.; Lovley, Derek R.

    2011-03-01

    The activity of microorganisms often plays an important role in dynamic natural attenuation or engineered bioremediation of subsurface contaminants, such as chlorinated solvents, metals, and radionuclides. To evaluate and/or design bioremediated systems, quantitative reactive transport models are needed. State-of-the-art reactive transport models often ignore the microbial effects or simulate the microbial effects with static growth yield and constant reaction rate parameters over simulated conditions, while in reality microorganisms can dynamically modify their functionality (such as utilization of alternative respiratory pathways) in response to spatial and temporal variations in environmental conditions. Constraint-based genome-scale microbial in silico models, using genomic data and multiple-pathway reaction networks, have been shown to be able to simulate transient metabolism of some well studied microorganisms and identify growth rate, substrate uptake rates, and byproduct rates under different growth conditions. These rates can be identified and used to replace specific microbially-mediated reaction rates in a reactive transport model using local geochemical conditions as constraints. We previously demonstrated the potential utility of integrating a constraint-based microbial metabolism model with a reactive transport simulator as applied to bioremediation of uranium in groundwater. However, that work relied on an indirect coupling approach that was effective for initial demonstration but may not be extensible to more complex problems that are of significant interest (e.g., communities of microbial species and multiple constraining variables). Here, we extend that work by presenting and demonstrating a method of directly integrating a reactive transport model (FORTRAN code) with constraint-based in silico models solved with IBM ILOG CPLEX linear optimizer base system (C library). The models were integrated with BABEL, a language interoperability tool. The

  12. Microbial comparative pan-genomics using binomial mixture models

    DEFF Research Database (Denmark)

    Ussery, David; Snipen, L; Almøy, T

    2009-01-01

    The size of the core- and pan-genome of bacterial species is a topic of increasing interest due to the growing number of sequenced prokaryote genomes, many from the same species. Attempts to estimate these quantities have been made, using regression methods or mixture models. We extend the latter...... approach by using statistical ideas developed for capture-recapture problems in ecology and epidemiology. RESULTS: We estimate core- and pan-genome sizes for 16 different bacterial species. The results reveal a complex dependency structure for most species, manifested as heterogeneous detection...... probabilities. Estimated pan-genome sizes range from small (around 2600 gene families) in Buchnera aphidicola to large (around 43000 gene families) in Escherichia coli. Results for Echerichia coli show that as more data become available, a larger diversity is estimated, indicating an extensive pool of rarely...

  13. Microbial comparative pan-genomics using binomial mixture models

    Directory of Open Access Journals (Sweden)

    Ussery David W

    2009-08-01

    Full Text Available Abstract Background The size of the core- and pan-genome of bacterial species is a topic of increasing interest due to the growing number of sequenced prokaryote genomes, many from the same species. Attempts to estimate these quantities have been made, using regression methods or mixture models. We extend the latter approach by using statistical ideas developed for capture-recapture problems in ecology and epidemiology. Results We estimate core- and pan-genome sizes for 16 different bacterial species. The results reveal a complex dependency structure for most species, manifested as heterogeneous detection probabilities. Estimated pan-genome sizes range from small (around 2600 gene families in Buchnera aphidicola to large (around 43000 gene families in Escherichia coli. Results for Echerichia coli show that as more data become available, a larger diversity is estimated, indicating an extensive pool of rarely occurring genes in the population. Conclusion Analyzing pan-genomics data with binomial mixture models is a way to handle dependencies between genomes, which we find is always present. A bottleneck in the estimation procedure is the annotation of rarely occurring genes.

  14. Selective microbial genomic DNA isolation using restriction endonucleases.

    Science.gov (United States)

    Barnes, Helen E; Liu, Guohong; Weston, Christopher Q; King, Paula; Pham, Long K; Waltz, Shannon; Helzer, Kimberly T; Day, Laura; Sphar, Dan; Yamamoto, Robert T; Forsyth, R Allyn

    2014-01-01

    To improve the metagenomic analysis of complex microbiomes, we have repurposed restriction endonucleases as methyl specific DNA binding proteins. As an example, we use DpnI immobilized on magnetic beads. The ten minute extraction technique allows specific binding of genomes containing the DpnI Gm6ATC motif common in the genomic DNA of many bacteria including γ-proteobacteria. Using synthetic genome mixtures, we demonstrate 80% recovery of Escherichia coli genomic DNA even when only femtogram quantities are spiked into 10 µg of human DNA background. Binding is very specific with less than 0.5% of human DNA bound. Next Generation Sequencing of input and enriched synthetic mixtures results in over 100-fold enrichment of target genomes relative to human and plant DNA. We also show comparable enrichment when sequencing complex microbiomes such as those from creek water and human saliva. The technique can be broadened to other restriction enzymes allowing for the selective enrichment of trace and unculturable organisms from complex microbiomes and the stratification of organisms according to restriction enzyme enrichment.

  15. A New Approach to Predict Microbial Community Assembly and Function Using a Stochastic, Genome-Enabled Modeling Framework

    Science.gov (United States)

    King, E.; Brodie, E.; Anantharaman, K.; Karaoz, U.; Bouskill, N.; Banfield, J. F.; Steefel, C. I.; Molins, S.

    2016-12-01

    Characterizing and predicting the microbial and chemical compositions of subsurface aquatic systems necessitates an understanding of the metabolism and physiology of organisms that are often uncultured or studied under conditions not relevant for one's environment of interest. Cultivation-independent approaches are therefore important and have greatly enhanced our ability to characterize functional microbial diversity. The capability to reconstruct genomes representing thousands of populations from microbial communities using metagenomic techniques provides a foundation for development of predictive models for community structure and function. Here, we discuss a genome-informed stochastic trait-based model incorporated into a reactive transport framework to represent the activities of coupled guilds of hypothetical microorganisms. Metabolic pathways for each microbe within a functional guild are parameterized from metagenomic data with a unique combination of traits governing organism fitness under dynamic environmental conditions. We simulate the thermodynamics of coupled electron donor and acceptor reactions to predict the energy available for cellular maintenance, respiration, biomass development, and enzyme production. While `omics analyses can now characterize the metabolic potential of microbial communities, it is functionally redundant as well as computationally prohibitive to explicitly include the thousands of recovered organisms into biogeochemical models. However, one can derive potential metabolic pathways from genomes along with trait-linkages to build probability distributions of traits. These distributions are used to assemble groups of microbes that couple one or more of these pathways. From the initial ensemble of microbes, only a subset will persist based on the interaction of their physiological and metabolic traits with environmental conditions, competing organisms, etc. Here, we analyze the predicted niches of these hypothetical microbes and

  16. A genome probe survey of the microbial community in oil fields

    Energy Technology Data Exchange (ETDEWEB)

    Voordouw, G.; Telang, A. [Calgary Univ., AB (Canada). Dept. of Biology

    2000-07-01

    Reverse sample genome probing (RSGP) was conducted in water injected oil fields in Western Canada in order to analyze the microbial community in the fields to identify different bacteria responsible for microbially influenced corrosion (MIC) or souring. Oil fields of moderate temperature and salinity have an anaerobic microbial community consisting of sulfate-reducing bacteria (SRB), sulfide-oxidizing bacteria (SOB) and heterotrophic bacteria. The RSGP revealed that selected SRB are enhanced at metal surfaces and can therefore contribute to corrosion. Some of the SRB were found to be insensitive to biocides that are used in the field. RSGP also showed that injection of nitrate, instead of sulfate, leads to a marked increase of Campylobacter sp., which gets its energy from the oxidation of sulfide by nitrate. It was concluded that RSGP is a useful tool to monitor the effects of chemical stresses on the oil field microbial community. 13 refs., 2 figs.

  17. Score-based prediction of genomic islands in prokaryotic genomes using hidden Markov models

    Directory of Open Access Journals (Sweden)

    Surovcik Katharina

    2006-03-01

    Full Text Available Abstract Background Horizontal gene transfer (HGT is considered a strong evolutionary force shaping the content of microbial genomes in a substantial manner. It is the difference in speed enabling the rapid adaptation to changing environmental demands that distinguishes HGT from gene genesis, duplications or mutations. For a precise characterization, algorithms are needed that identify transfer events with high reliability. Frequently, the transferred pieces of DNA have a considerable length, comprise several genes and are called genomic islands (GIs or more specifically pathogenicity or symbiotic islands. Results We have implemented the program SIGI-HMM that predicts GIs and the putative donor of each individual alien gene. It is based on the analysis of codon usage (CU of each individual gene of a genome under study. CU of each gene is compared against a carefully selected set of CU tables representing microbial donors or highly expressed genes. Multiple tests are used to identify putatively alien genes, to predict putative donors and to mask putatively highly expressed genes. Thus, we determine the states and emission probabilities of an inhomogeneous hidden Markov model working on gene level. For the transition probabilities, we draw upon classical test theory with the intention of integrating a sensitivity controller in a consistent manner. SIGI-HMM was written in JAVA and is publicly available. It accepts as input any file created according to the EMBL-format. It generates output in the common GFF format readable for genome browsers. Benchmark tests showed that the output of SIGI-HMM is in agreement with known findings. Its predictions were both consistent with annotated GIs and with predictions generated by different methods. Conclusion SIGI-HMM is a sensitive tool for the identification of GIs in microbial genomes. It allows to interactively analyze genomes in detail and to generate or to test hypotheses about the origin of acquired

  18. Genome-Enabled Modeling of Biogeochemical Processes Predicts Metabolic Dependencies that Connect the Relative Fitness of Microbial Functional Guilds

    Science.gov (United States)

    Brodie, E.; King, E.; Molins, S.; Karaoz, U.; Steefel, C. I.; Banfield, J. F.; Beller, H. R.; Anantharaman, K.; Ligocki, T. J.; Trebotich, D.

    2015-12-01

    Pore-scale processes mediated by microorganisms underlie a range of critical ecosystem services, regulating carbon stability, nutrient flux, and the purification of water. Advances in cultivation-independent approaches now provide us with the ability to reconstruct thousands of genomes from microbial populations from which functional roles may be assigned. With this capability to reveal microbial metabolic potential, the next step is to put these microbes back where they belong to interact with their natural environment, i.e. the pore scale. At this scale, microorganisms communicate, cooperate and compete across their fitness landscapes with communities emerging that feedback on the physical and chemical properties of their environment, ultimately altering the fitness landscape and selecting for new microbial communities with new properties and so on. We have developed a trait-based model of microbial activity that simulates coupled functional guilds that are parameterized with unique combinations of traits that govern fitness under dynamic conditions. Using a reactive transport framework, we simulate the thermodynamics of coupled electron donor-acceptor reactions to predict energy available for cellular maintenance, respiration, biomass development, and enzyme production. From metagenomics, we directly estimate some trait values related to growth and identify the linkage of key traits associated with respiration and fermentation, macromolecule depolymerizing enzymes, and other key functions such as nitrogen fixation. Our simulations were carried out to explore abiotic controls on community emergence such as seasonally fluctuating water table regimes across floodplain organic matter hotspots. Simulations and metagenomic/metatranscriptomic observations highlighted the many dependencies connecting the relative fitness of functional guilds and the importance of chemolithoautotrophic lifestyles. Using an X-Ray microCT-derived soil microaggregate physical model combined

  19. Application of Sequence-based Methods in Human MicrobialEcology

    Energy Technology Data Exchange (ETDEWEB)

    Weng, Li; Rubin, Edward M.; Bristow, James

    2005-08-29

    Ecologists studying microbial life in the environment have recognized the enormous complexity of microbial diversity for many years, and the development of a variety of culture-independent methods, many of them coupled with high-throughput DNA sequencing, has allowed this diversity to be explored in ever greater detail. Despite the widespread application of these new techniques to the characterization of uncultivated microbes and microbial communities in the environment, their application to human health and disease has lagged behind. Because DNA based-techniques for defining uncultured microbes allow not only cataloging of microbial diversity, but also insight into microbial functions, investigators are beginning to apply these tools to the microbial communities that abound on and within us, in what has aptly been called the second Human Genome Project. In this review we discuss the sequence-based methods for microbial analysis that are currently available and their application to identify novel human pathogens, improve diagnosis of known infectious diseases, and to advance understanding of our relationship with microbial communities that normally reside in and on the human body.

  20. Evolution of a microbial nitrilase gene family: a comparative and environmental genomics study

    Directory of Open Access Journals (Sweden)

    Eads Jonathan R

    2005-08-01

    Full Text Available Abstract Background Completed genomes and environmental genomic sequences are bringing a significant contribution to understanding the evolution of gene families, microbial metabolism and community eco-physiology. Here, we used comparative genomics and phylogenetic analyses in conjunction with enzymatic data to probe the evolution and functions of a microbial nitrilase gene family. Nitrilases are relatively rare in bacterial genomes, their biological function being unclear. Results We examined the genetic neighborhood of the different subfamily genes and discovered conserved gene clusters or operons associated with specific nitrilase clades. The inferred evolutionary transitions that separate nitrilases which belong to different gene clusters correlated with changes in their enzymatic properties. We present evidence that Darwinian adaptation acted during one of those transitions and identified sites in the enzyme that may have been under positive selection. Conclusion Changes in the observed biochemical properties of the nitrilases associated with the different gene clusters are consistent with a hypothesis that those enzymes have been recruited to a novel metabolic pathway following gene duplication and neofunctionalization. These results demonstrate the benefits of combining environmental genomic sampling and completed genomes data with evolutionary and biochemical analyses in the study of gene families. They also open new directions for studying the functions of nitrilases and the genes they are associated with.

  1. Amino Acid Usage Is Asymmetrically Biased in AT- and GC-Rich Microbial Genomes

    DEFF Research Database (Denmark)

    Bohlin, Jon; Brynildsrud, Ola Brønstad; Vesth, Tammi Camilla

    2013-01-01

    Introduction: Genomic base composition ranges from less than 25% AT to more than 85% AT in prokaryotes. Since only a small fraction of prokaryotic genomes is not protein coding even a minor change in genomic base composition will induce profound protein changes. We examined how amino acid and codon...

  2. Challenging a bioinformatic tool’s ability to detect microbial contaminants using in silico whole genome sequencing data

    Directory of Open Access Journals (Sweden)

    Nathan D. Olson

    2017-09-01

    Full Text Available High sensitivity methods such as next generation sequencing and polymerase chain reaction (PCR are adversely impacted by organismal and DNA contaminants. Current methods for detecting contaminants in microbial materials (genomic DNA and cultures are not sensitive enough and require either a known or culturable contaminant. Whole genome sequencing (WGS is a promising approach for detecting contaminants due to its sensitivity and lack of need for a priori assumptions about the contaminant. Prior to applying WGS, we must first understand its limitations for detecting contaminants and potential for false positives. Herein we demonstrate and characterize a WGS-based approach to detect organismal contaminants using an existing metagenomic taxonomic classification algorithm. Simulated WGS datasets from ten genera as individuals and binary mixtures of eight organisms at varying ratios were analyzed to evaluate the role of contaminant concentration and taxonomy on detection. For the individual genomes the false positive contaminants reported depended on the genus, with Staphylococcus, Escherichia, and Shigella having the highest proportion of false positives. For nearly all binary mixtures the contaminant was detected in the in-silico datasets at the equivalent of 1 in 1,000 cells, though F. tularensis was not detected in any of the simulated contaminant mixtures and Y. pestis was only detected at the equivalent of one in 10 cells. Once a WGS method for detecting contaminants is characterized, it can be applied to evaluate microbial material purity, in efforts to ensure that contaminants are characterized in microbial materials used to validate pathogen detection assays, generate genome assemblies for database submission, and benchmark sequencing methods.

  3. Accurate genome relative abundance estimation based on shotgun metagenomic reads.

    Directory of Open Access Journals (Sweden)

    Li C Xia

    Full Text Available Accurate estimation of microbial community composition based on metagenomic sequencing data is fundamental for subsequent metagenomics analysis. Prevalent estimation methods are mainly based on directly summarizing alignment results or its variants; often result in biased and/or unstable estimates. We have developed a unified probabilistic framework (named GRAMMy by explicitly modeling read assignment ambiguities, genome size biases and read distributions along the genomes. Maximum likelihood method is employed to compute Genome Relative Abundance of microbial communities using the Mixture Model theory (GRAMMy. GRAMMy has been demonstrated to give estimates that are accurate and robust across both simulated and real read benchmark datasets. We applied GRAMMy to a collection of 34 metagenomic read sets from four metagenomics projects and identified 99 frequent species (minimally 0.5% abundant in at least 50% of the data-sets in the human gut samples. Our results show substantial improvements over previous studies, such as adjusting the over-estimated abundance for Bacteroides species for human gut samples, by providing a new reference-based strategy for metagenomic sample comparisons. GRAMMy can be used flexibly with many read assignment tools (mapping, alignment or composition-based even with low-sensitivity mapping results from huge short-read datasets. It will be increasingly useful as an accurate and robust tool for abundance estimation with the growing size of read sets and the expanding database of reference genomes.

  4. Whole-Genome Sequencing in Microbial Forensic Analysis of Gamma-Irradiated Microbial Materials

    OpenAIRE

    Broomall, Stacey M.; Ait Ichou, Mohamed; Krepps, Michael D.; Johnsky, Lauren A.; Mark A Karavis; Kyle S Hubbard; Insalaco, Joseph M.; Betters, Janet L.; Redmond, Brady W.; Rivers, Bryan A.; Liem, Alvin T.; Hill, Jessica M.; Fochler, Edward T.; Roth, Pierce A.; Rosenzweig, C. Nicole

    2016-01-01

    Effective microbial forensic analysis of materials used in a potential biological attack requires robust methods of morphological and genetic characterization of the attack materials in order to enable the attribution of the materials to potential sources and to exclude other potential sources. The genetic homogeneity and potential intersample variability of many of the category A to C bioterrorism agents offer a particular challenge to the generation of attributive signatures, potentially re...

  5. A Paleogenomic Algorithm for Reconstruction of Ancient Operons from Complete Microbial Genome Sequences

    Institute of Scientific and Technical Information of China (English)

    WANG Yu-hong; LI Wei; FANG Xue-xun; John P. Rose; WANG Bi-Cheng; LIN Da-wei

    2004-01-01

    Operons, or co-transcribed and co-regulated contiguous sets of genes, in microbial genomes are poorly conserved across different genomes due to gene fusion, deletion, duplication and other genome shuffling processes. The currently available genomes are the results of numerous reshuffling and acceptance iterations. We hypothesized that in ancient times, when life was more primitive, functionally related genes existed in close proximity and operated together as an operon to simplify regulation. As more sophisticated regulation mechanisms became available during evolution the genes forming an operon could be separated by the above mentioned processes. If gene shuffling is a random event, neighbor gene pairs are more likely to be preserved than distant gene pairs. Thus, if enough gene pairs can be identified, the original operon could be reconstructed by assembling the pairs. Here we propose a novel paleogenomic method to reconstruct present neighbor gene pairs into "ancient" operons that possibly existed at some point during evolution.

  6. Rhizome of life, catastrophes, sequence exchanges, gene creations, and giant viruses: how microbial genomics challenges Darwin.

    Science.gov (United States)

    Merhej, Vicky; Raoult, Didier

    2012-01-01

    Darwin's theory about the evolution of species has been the object of considerable dispute. In this review, we have described seven key principles in Darwin's book The Origin of Species and tried to present how genomics challenge each of these concepts and improve our knowledge about evolution. Darwin believed that species evolution consists on a positive directional selection ensuring the "survival of the fittest." The most developed state of the species is characterized by increasing complexity. Darwin proposed the theory of "descent with modification" according to which all species evolve from a single common ancestor through a gradual process of small modification of their vertical inheritance. Finally, the process of evolution can be depicted in the form of a tree. However, microbial genomics showed that evolution is better described as the "biological changes over time." The mode of change is not unidirectional and does not necessarily favors advantageous mutations to increase fitness it is rather subject to random selection as a result of catastrophic stochastic processes. Complexity is not necessarily the completion of development: several complex organisms have gone extinct and many microbes including bacteria with intracellular lifestyle have streamlined highly effective genomes. Genomes evolve through large events of gene deletions, duplications, insertions, and genomes rearrangements rather than a gradual adaptative process. Genomes are dynamic and chimeric entities with gene repertoires that result from vertical and horizontal acquisitions as well as de novo gene creation. The chimeric character of microbial genomes excludes the possibility of finding a single common ancestor for all the genes recorded currently. Genomes are collections of genes with different evolutionary histories that cannot be represented by a single tree of life (TOL). A forest, a network or a rhizome of life may be more accurate to represent evolutionary relationships among

  7. Rhizome of life, catastrophes, sequence exchanges, gene creations and giant viruses: How microbial genomics challenges Darwin

    Directory of Open Access Journals (Sweden)

    Vicky eMerhej

    2012-08-01

    Full Text Available Darwin’s theory about the evolution of species has been the object of considerable dispute. In this review, we have described seven key principles in Darwin’s book The Origin of Species and tried to present how genomics challenge each of these concepts and improve our knowledge about evolution. Darwin believed that species evolution consists on a positive directional selection ensuring the survival of the fittest. The most developed state of the species is characterized by increasing complexity. Darwin proposed the theory of descent with modification according to which all species evolve from a single common ancestor through a gradual process of small modification of their vertical inheritance. Finally, the process of evolution can be depicted in the form of a tree. However, microbial genomics showed that evolution is better described as the biological changes over time." The mode of change is not unidirectional and does not necessarily favors advantageous mutations to increase fitness it is rather subject to random selection as a result of catastrophic stochastic processes. Complexity is not necessarily the completion of development: several complex organisms have gone extinct and many microbes including bacteria with intracellular lifestyle have streamlined highly effective genomes. Genomes evolve through large events of gene deletions, duplications, insertions and genomes rearrangements rather than a gradual adaptative process. Genomes are dynamic and chimeric entities with gene repertoires that result from vertical and horizontal acquisitions as well as de novo gene creation. The chimeric character of microbial genomes excludes the possibility of finding a single common ancestor for all the genes recorded currently. Genomes are collections of genes with different evolutionary histories that cannot be represented by a single tree of life. A forest, a network or a rhizome of life may be more accurate to represent evolutionary relationships

  8. INDIGO – INtegrated Data Warehouse of MIcrobial GenOmes with Examples from the Red Sea Extremophiles

    Science.gov (United States)

    Alam, Intikhab; Antunes, André; Kamau, Allan Anthony; Ba alawi, Wail; Kalkatawi, Manal; Stingl, Ulrich; Bajic, Vladimir B.

    2013-01-01

    Background The next generation sequencing technologies substantially increased the throughput of microbial genome sequencing. To functionally annotate newly sequenced microbial genomes, a variety of experimental and computational methods are used. Integration of information from different sources is a powerful approach to enhance such annotation. Functional analysis of microbial genomes, necessary for downstream experiments, crucially depends on this annotation but it is hampered by the current lack of suitable information integration and exploration systems for microbial genomes. Results We developed a data warehouse system (INDIGO) that enables the integration of annotations for exploration and analysis of newly sequenced microbial genomes. INDIGO offers an opportunity to construct complex queries and combine annotations from multiple sources starting from genomic sequence to protein domain, gene ontology and pathway levels. This data warehouse is aimed at being populated with information from genomes of pure cultures and uncultured single cells of Red Sea bacteria and Archaea. Currently, INDIGO contains information from Salinisphaera shabanensis, Haloplasma contractile, and Halorhabdus tiamatea - extremophiles isolated from deep-sea anoxic brine lakes of the Red Sea. We provide examples of utilizing the system to gain new insights into specific aspects on the unique lifestyle and adaptations of these organisms to extreme environments. Conclusions We developed a data warehouse system, INDIGO, which enables comprehensive integration of information from various resources to be used for annotation, exploration and analysis of microbial genomes. It will be regularly updated and extended with new genomes. It is aimed to serve as a resource dedicated to the Red Sea microbes. In addition, through INDIGO, we provide our Automatic Annotation of Microbial Genomes (AAMG) pipeline. The INDIGO web server is freely available at http://www.cbrc.kaust.edu.sa/indigo. PMID

  9. Analyzing the genomic variation of microbial cell factories in the era of “New Biotechnology”

    DEFF Research Database (Denmark)

    Herrgard, Markus; Panagiotou, Gianni

    2012-01-01

    The application of genome-scale technologies, both experimental and in silico, to industrial biotechnology has allowed improving the conversion of biomass-derived feedstocks to chemicals, materials and fuels through microbial fermentation. In particular, due to rapidly decreasing costs and its...... suitability for identifying the genetic determinants of a phenotypic trait of interest, whole genome sequencing is expected to be one of the major driving forces in industrial biotechnology in the coming years. We present some of the recent studies that have successfully applied high-throughput sequencing...

  10. Microbial Growth Modeling and Simulation Based on Cellular Automata

    Directory of Open Access Journals (Sweden)

    Hong Men

    2013-07-01

    Full Text Available In order to simulate the micro-evolutionary process of the microbial growth, [Methods] in this study, we adopt two-dimensional cellular automata as its growth space. Based on evolutionary mechanism of microbial and cell-cell interactions, we adopt Moore neighborhood and make the transition rules. Finally, we construct the microbial growth model. [Results] It can describe the relationships among the cell growth, division and death. And also can effectively reflect spatial inhibition effect and substrate limitation effect. [Conclusions] The simulation results show that CA model is not only consistent with the classic microbial kinetic model, but also be able to simulate the microbial growth and evolution.

  11. Microbial Communities and Electrochemical Performance of Titanium-Based Anodic Electrodes in a Microbial Fuel Cell

    NARCIS (Netherlands)

    Michaelidou, U.; Heijne, ter A.; Euverink, G.J.W.; Hamelers, H.V.M.; Stams, A.J.M.; Geelhoed, J.S.

    2011-01-01

    Four types of titanium (Ti)-based electrodes were tested in the same microbial fuel cell (MFC) anodic compartment. Their electrochemical performances and the dominant microbial communities of the electrode biofilms were compared. The electrodes were identical in shape, macroscopic surface area, and

  12. The MiST2 database: a comprehensive genomics resource on microbial signal transduction

    OpenAIRE

    Ulrich, Luke E.; Igor B Zhulin

    2009-01-01

    The MiST2 database (http://mistdb.com) identifies and catalogs the repertoire of signal transduction proteins in microbial genomes. Signal transduction systems regulate the majority of cellular activities including the metabolism, development, host-recognition, biofilm production, virulence, and antibiotic resistance of human pathogens. Thus, knowledge of the proteins and interactions that comprise these communication networks is an essential component to furthering biomedical discovery. Thes...

  13. A catalogue of 136 microbial draft genomes from Red Sea metagenomes.

    Science.gov (United States)

    Haroon, Mohamed F; Thompson, Luke R; Parks, Donovan H; Hugenholtz, Philip; Stingl, Ulrich

    2016-07-05

    Earth is expected to continue warming and the Red Sea is a model environment for understanding the effects of global warming on ocean microbiomes due to its unusually high temperature, salinity and solar irradiance. However, most microbial diversity analyses of the Red Sea have been limited to cultured representatives and single marker gene analyses, hence neglecting the substantial uncultured majority. Here, we report 136 microbial genomes (completion minus contamination is ≥50%) assembled from 45 metagenomes from eight stations spanning the Red Sea and taken from multiple depths between 10 to 500 m. Phylogenomic analysis showed that most of the retrieved genomes belong to seven different phyla of known marine microbes, but more than half representing currently uncultured species. The open-access data presented here is the largest number of Red Sea representative microbial genomes reported in a single study and will help facilitate future studies in understanding the physiology of these microorganisms and how they have adapted to the relatively harsh conditions of the Red Sea.

  14. A catalogue of 136 microbial draft genomes from Red Sea metagenomes

    KAUST Repository

    Haroon, Mohamed

    2016-07-05

    Earth is expected to continue warming and the Red Sea is a model environment for understanding the effects of global warming on ocean microbiomes due to its unusually high temperature, salinity and solar irradiance. However, most microbial diversity analyses of the Red Sea have been limited to cultured representatives and single marker gene analyses, hence neglecting the substantial uncultured majority. Here, we report 136 microbial genomes (completion minus contamination is ≥50%) assembled from 45 metagenomes from eight stations spanning the Red Sea and taken from multiple depths between 10 to 500 m. Phylogenomic analysis showed that most of the retrieved genomes belong to seven different phyla of known marine microbes, but more than half representing currently uncultured species. The open-access data presented here is the largest number of Red Sea representative microbial genomes reported in a single study and will help facilitate future studies in understanding the physiology of these microorganisms and how they have adapted to the relatively harsh conditions of the Red Sea.

  15. A catalogue of 136 microbial draft genomes from Red Sea metagenomes

    Science.gov (United States)

    Haroon, Mohamed F.; Thompson, Luke R.; Parks, Donovan H.; Hugenholtz, Philip; Stingl, Ulrich

    2016-07-01

    Earth is expected to continue warming and the Red Sea is a model environment for understanding the effects of global warming on ocean microbiomes due to its unusually high temperature, salinity and solar irradiance. However, most microbial diversity analyses of the Red Sea have been limited to cultured representatives and single marker gene analyses, hence neglecting the substantial uncultured majority. Here, we report 136 microbial genomes (completion minus contamination is ≥50%) assembled from 45 metagenomes from eight stations spanning the Red Sea and taken from multiple depths between 10 to 500 m. Phylogenomic analysis showed that most of the retrieved genomes belong to seven different phyla of known marine microbes, but more than half representing currently uncultured species. The open-access data presented here is the largest number of Red Sea representative microbial genomes reported in a single study and will help facilitate future studies in understanding the physiology of these microorganisms and how they have adapted to the relatively harsh conditions of the Red Sea.

  16. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae : Implications for the microbial "pan-genome"

    NARCIS (Netherlands)

    Tettelin, H; Masignani, [No Value; Cieslewicz, MJ; Donati, C; Medini, D; Ward, NL; Angiuoli, SV; Crabtree, J; Jones, AL; Durkin, AS; DeBoy, RT; Davidsen, TM; Mora, M; Scarselli, M; Ros, IMY; Peterson, JD; Hauser, CR; Sundaram, JP; Nelson, WC; Madupu, R; Brinkac, LM; Dodson, RJ; Rosovitz, MJ; Sullivan, SA; Daugherty, SC; Haft, DH; Selengut, J; Gwinn, ML; Zhou, LW; Zafar, N; Khouri, H; Radune, D; Dimitrov, G; Watkins, K; O'Connor, KJB; Smith, S; Utterback, TR; White, O; Rubens, CE; Grandi, G; Madoff, LC; Kasper, DL; Telford, JL; Wessels, MR; Rappuoli, R; Fraser, CM

    2005-01-01

    The development of efficient and inexpensive genome sequencing methods has revolutionized the study of human bacterial pathogens and improved vaccine design. Unfortunately, the sequence of a single genome does not reflect how genetic variability drives pathogenesis within a bacterial species and als

  17. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae : Implications for the microbial "pan-genome"

    NARCIS (Netherlands)

    Tettelin, H; Masignani, [No Value; Cieslewicz, MJ; Donati, C; Medini, D; Ward, NL; Angiuoli, SV; Crabtree, J; Jones, AL; Durkin, AS; DeBoy, RT; Davidsen, TM; Mora, M; Scarselli, M; Ros, IMY; Peterson, JD; Hauser, CR; Sundaram, JP; Nelson, WC; Madupu, R; Brinkac, LM; Dodson, RJ; Rosovitz, MJ; Sullivan, SA; Daugherty, SC; Haft, DH; Selengut, J; Gwinn, ML; Zhou, LW; Zafar, N; Khouri, H; Radune, D; Dimitrov, G; Watkins, K; O'Connor, KJB; Smith, S; Utterback, TR; White, O; Rubens, CE; Grandi, G; Madoff, LC; Kasper, DL; Telford, JL; Wessels, MR; Rappuoli, R; Fraser, CM

    2005-01-01

    The development of efficient and inexpensive genome sequencing methods has revolutionized the study of human bacterial pathogens and improved vaccine design. Unfortunately, the sequence of a single genome does not reflect how genetic variability drives pathogenesis within a bacterial species and als

  18. Distribution of nitrogen fixation and nitrogenase-like sequences amongst microbial genomes

    Directory of Open Access Journals (Sweden)

    Dos Santos Patricia C

    2012-05-01

    Full Text Available Abstract Background The metabolic capacity for nitrogen fixation is known to be present in several prokaryotic species scattered across taxonomic groups. Experimental detection of nitrogen fixation in microbes requires species-specific conditions, making it difficult to obtain a comprehensive census of this trait. The recent and rapid increase in the availability of microbial genome sequences affords novel opportunities to re-examine the occurrence and distribution of nitrogen fixation genes. The current practice for computational prediction of nitrogen fixation is to use the presence of the nifH and/or nifD genes. Results Based on a careful comparison of the repertoire of nitrogen fixation genes in known diazotroph species we propose a new criterion for computational prediction of nitrogen fixation: the presence of a minimum set of six genes coding for structural and biosynthetic components, namely NifHDK and NifENB. Using this criterion, we conducted a comprehensive search in fully sequenced genomes and identified 149 diazotrophic species, including 82 known diazotrophs and 67 species not known to fix nitrogen. The taxonomic distribution of nitrogen fixation in Archaea was limited to the Euryarchaeota phylum; within the Bacteria domain we predict that nitrogen fixation occurs in 13 different phyla. Of these, seven phyla had not hitherto been known to contain species capable of nitrogen fixation. Our analyses also identified protein sequences that are similar to nitrogenase in organisms that do not meet the minimum-gene-set criteria. The existence of nitrogenase-like proteins lacking conserved co-factor ligands in both diazotrophs and non-diazotrophs suggests their potential for performing other, as yet unidentified, metabolic functions. Conclusions Our predictions expand the known phylogenetic diversity of nitrogen fixation, and suggest that this trait may be much more common in nature than it is currently thought. The diverse phylogenetic

  19. How agricultural management shapes soil microbial communities: patterns emerging from genetic and genomic studies

    Science.gov (United States)

    Daly, Amanda; Grandy, A. Stuart

    2016-04-01

    Agriculture is a predominant land use and thus a large influence on global carbon (C) and nitrogen (N) balances, climate, and human health. If we are to produce food, fiber, and fuel sustainably we must maximize agricultural yield while minimizing negative environmental consequences, goals towards which we have made great strides through agronomic advances. However, most agronomic strategies have been designed with a view of soil as a black box, largely ignoring the way management is mediated by soil biota. Because soil microbes play a central role in many of the processes that deliver nutrients to crops and support their health and productivity, agricultural management strategies targeted to exploit or support microbial activity should deliver additional benefits. To do this we must determine how microbial community structure and function are shaped by agricultural practices, but until recently our characterizations of soil microbial communities in agricultural soils have been largely limited to broad taxonomic classes due to methodological constraints. With advances in high-throughput genetic and genomic sequencing techniques, better taxonomic resolution now enables us to determine how agricultural management affects specific microbes and, in turn, nutrient cycling outcomes. Here we unite findings from published research that includes genetic or genomic data about microbial community structure (e.g. 454, Illumina, clone libraries, qPCR) in soils under agricultural management regimes that differ in type and extent of tillage, cropping selections and rotations, inclusion of cover crops, organic amendments, and/or synthetic fertilizer application. We delineate patterns linking agricultural management to microbial diversity, biomass, C- and N-content, and abundance of microbial taxa; furthermore, where available, we compare patterns in microbial communities to patterns in soil extracellular enzyme activities, catabolic profiles, inorganic nitrogen pools, and nitrogen

  20. Finishing Genomes with Limited Resources: Lessons from an Ensemble of Microbial Genomes

    Science.gov (United States)

    2010-01-01

    Vibrio cholerae 0395 Salmonella enterica SL476 4.68 5.4 4.1 4.99 TIGR 12/12/07 5.642 JGI 02/06/06 2,417 TIGR 05/08/07 4,521 JCVI 07/24/08...Sanger 10/01/08 2,033 Data was collected from NCBI’s Trace Archive and Genomes Database. The V. cholerae genome was sequenced using a 454/Sanger

  1. Rumen microbial (meta)genomics and its application to ruminant production.

    Science.gov (United States)

    Morgavi, D P; Kelly, W J; Janssen, P H; Attwood, G T

    2013-03-01

    Meat and milk produced by ruminants are important agricultural products and are major sources of protein for humans. Ruminant production is of considerable economic value and underpins food security in many regions of the world. However, the sector faces major challenges because of diminishing natural resources and ensuing increases in production costs, and also because of the increased awareness of the environmental impact of farming ruminants. The digestion of feed and the production of enteric methane are key functions that could be manipulated by having a thorough understanding of the rumen microbiome. Advances in DNA sequencing technologies and bioinformatics are transforming our understanding of complex microbial ecosystems, including the gastrointestinal tract of mammals. The application of these techniques to the rumen ecosystem has allowed the study of the microbial diversity under different dietary and production conditions. Furthermore, the sequencing of genomes from several cultured rumen bacterial and archaeal species is providing detailed information about their physiology. More recently, metagenomics, mainly aimed at understanding the enzymatic machinery involved in the degradation of plant structural polysaccharides, is starting to produce new insights by allowing access to the total community and sidestepping the limitations imposed by cultivation. These advances highlight the promise of these approaches for characterising the rumen microbial community structure and linking this with the functions of the rumen microbiota. Initial results using high-throughput culture-independent technologies have also shown that the rumen microbiome is far more complex and diverse than the human caecum. Therefore, cataloguing its genes will require a considerable sequencing and bioinformatic effort. Nevertheless, the construction of a rumen microbial gene catalogue through metagenomics and genomic sequencing of key populations is an attainable goal. A rumen

  2. Clostridium ljungdahlii represents a microbial production platform based on syngas.

    Science.gov (United States)

    Köpke, Michael; Held, Claudia; Hujer, Sandra; Liesegang, Heiko; Wiezer, Arnim; Wollherr, Antje; Ehrenreich, Armin; Liebl, Wolfgang; Gottschalk, Gerhard; Dürre, Peter

    2010-07-20

    Clostridium ljungdahlii is an anaerobic homoacetogen, able to ferment sugars, other organic compounds, or CO(2)/H(2) and synthesis gas (CO/H(2)). The latter feature makes it an interesting microbe for the biotech industry, as important bulk chemicals and proteins can be produced at the expense of CO(2), thus combining industrial needs with sustained reduction of CO and CO(2) in the atmosphere. Sequencing the complete genome of C. ljungdahlii revealed that it comprises 4,630,065 bp and is one of the largest clostridial genomes known to date. Experimental data and in silico comparisons revealed a third mode of anaerobic homoacetogenic metabolism. Unlike other organisms such as Moorella thermoacetica or Acetobacterium woodii, neither cytochromes nor sodium ions are involved in energy generation. Instead, an Rnf system is present, by which proton translocation can be performed. An electroporation procedure has been developed to transform the organism with plasmids bearing heterologous genes for butanol production. Successful expression of these genes could be demonstrated, leading to formation of the biofuel. Thus, C. ljungdahlii can be used as a unique microbial production platform based on synthesis gas and carbon dioxide/hydrogen mixtures.

  3. Pore-scale simulation of microbial growth using a genome-scale metabolic model: Implications for Darcy-scale reactive transport

    Science.gov (United States)

    Tartakovsky, G. D.; Tartakovsky, A. M.; Scheibe, T. D.; Fang, Y.; Mahadevan, R.; Lovley, D. R.

    2013-09-01

    Recent advances in microbiology have enabled the quantitative simulation of microbial metabolism and growth based on genome-scale characterization of metabolic pathways and fluxes. We have incorporated a genome-scale metabolic model of the iron-reducing bacteria Geobacter sulfurreducens into a pore-scale simulation of microbial growth based on coupling of iron reduction to oxidation of a soluble electron donor (acetate). In our model, fluid flow and solute transport is governed by a combination of the Navier-Stokes and advection-diffusion-reaction equations. Microbial growth occurs only on the surface of soil grains where solid-phase mineral iron oxides are available. Mass fluxes of chemical species associated with microbial growth are described by the genome-scale microbial model, implemented using a constraint-based metabolic model, and provide the Robin-type boundary condition for the advection-diffusion equation at soil grain surfaces. Conventional models of microbially-mediated subsurface reactions use a lumped reaction model that does not consider individual microbial reaction pathways, and describe reactions rates using empirically-derived rate formulations such as the Monod-type kinetics. We have used our pore-scale model to explore the relationship between genome-scale metabolic models and Monod-type formulations, and to assess the manifestation of pore-scale variability (microenvironments) in terms of apparent Darcy-scale microbial reaction rates. The genome-scale model predicted lower biomass yield, and different stoichiometry for iron consumption, in comparison to prior Monod formulations based on energetics considerations. We were able to fit an equivalent Monod model, by modifying the reaction stoichiometry and biomass yield coefficient, that could effectively match results of the genome-scale simulation of microbial behaviors under excess nutrient conditions, but predictions of the fitted Monod model deviated from those of the genome-scale model

  4. Pore-scale simulation of microbial growth using a genome-scale metabolic model: Implications for Darcy-scale reactive transport

    Energy Technology Data Exchange (ETDEWEB)

    Tartakovsky, Guzel D.; Tartakovsky, Alexandre M.; Scheibe, Timothy D.; Fang, Yilin; Mahadevan, Radhakrishnan; Lovley, Derek R.

    2013-09-07

    Recent advances in microbiology have enabled the quantitative simulation of microbial metabolism and growth based on genome-scale characterization of metabolic pathways and fluxes. We have incorporated a genome-scale metabolic model of the iron-reducing bacteria Geobacter sulfurreducens into a pore-scale simulation of microbial growth based on coupling of iron reduction to oxidation of a soluble electron donor (acetate). In our model, fluid flow and solute transport is governed by a combination of the Navier-Stokes and advection-diffusion-reaction equations. Microbial growth occurs only on the surface of soil grains where solid-phase mineral iron oxides are available. Mass fluxes of chemical species associated with microbial growth are described by the genome-scale microbial model, implemented using a constraint-based metabolic model, and provide the Robin-type boundary condition for the advection-diffusion equation at soil grain surfaces. Conventional models of microbially-mediated subsurface reactions use a lumped reaction model that does not consider individual microbial reaction pathways, and describe reactions rates using empirically-derived rate formulations such as the Monod-type kinetics. We have used our pore-scale model to explore the relationship between genome-scale metabolic models and Monod-type formulations, and to assess the manifestation of pore-scale variability (microenvironments) in terms of apparent Darcy-scale microbial reaction rates. The genome-scale model predicted lower biomass yield, and different stoichiometry for iron consumption, in comparisonto prior Monod formulations based on energetics considerations. We were able to fit an equivalent Monod model, by modifying the reaction stoichiometry and biomass yield coefficient, that could effectively match results of the genome-scale simulation of microbial behaviors under excess nutrient conditions, but predictions of the fitted Monod model deviated from those of the genome-scale model under

  5. BrucellaBase: Genome information resource.

    Science.gov (United States)

    Sankarasubramanian, Jagadesan; Vishnu, Udayakumar S; Khader, L K M Abdul; Sridhar, Jayavel; Gunasekaran, Paramasamy; Rajendhran, Jeyaprakash

    2016-09-01

    Brucella sp. causes a major zoonotic disease, brucellosis. Brucella belongs to the family Brucellaceae under the order Rhizobiales of Alphaproteobacteria. We present BrucellaBase, a web-based platform, providing features of a genome database together with unique analysis tools. We have developed a web version of the multilocus sequence typing (MLST) (Whatmore et al., 2007) and phylogenetic analysis of Brucella spp. BrucellaBase currently contains genome data of 510 Brucella strains along with the user interfaces for BLAST, VFDB, CARD, pairwise genome alignment and MLST typing. Availability of these tools will enable the researchers interested in Brucella to get meaningful information from Brucella genome sequences. BrucellaBase will regularly be updated with new genome sequences, new features along with improvements in genome annotations. BrucellaBase is available online at http://www.dbtbrucellosis.in/brucellabase.html or http://59.99.226.203/brucellabase/homepage.html.

  6. A hybrid clustering approach to recognition of protein families in 114 microbial genomes

    Directory of Open Access Journals (Sweden)

    Gogarten J Peter

    2004-04-01

    Full Text Available Abstract Background Grouping proteins into sequence-based clusters is a fundamental step in many bioinformatic analyses (e.g., homology-based prediction of structure or function. Standard clustering methods such as single-linkage clustering capture a history of cluster topologies as a function of threshold, but in practice their usefulness is limited because unrelated sequences join clusters before biologically meaningful families are fully constituted, e.g. as the result of matches to so-called promiscuous domains. Use of the Markov Cluster algorithm avoids this non-specificity, but does not preserve topological or threshold information about protein families. Results We describe a hybrid approach to sequence-based clustering of proteins that combines the advantages of standard and Markov clustering. We have implemented this hybrid approach over a relational database environment, and describe its application to clustering a large subset of PDB, and to 328577 proteins from 114 fully sequenced microbial genomes. To demonstrate utility with difficult problems, we show that hybrid clustering allows us to constitute the paralogous family of ATP synthase F1 rotary motor subunits into a single, biologically interpretable hierarchical grouping that was not accessible using either single-linkage or Markov clustering alone. We describe validation of this method by hybrid clustering of PDB and mapping SCOP families and domains onto the resulting clusters. Conclusion Hybrid (Markov followed by single-linkage clustering combines the advantages of the Markov Cluster algorithm (avoidance of non-specific clusters resulting from matches to promiscuous domains and single-linkage clustering (preservation of topological information as a function of threshold. Within the individual Markov clusters, single-linkage clustering is a more-precise instrument, discerning sub-clusters of biological relevance. Our hybrid approach thus provides a computationally efficient

  7. Inteins in Microbial Genomes: Distribution, Mechanism, and Function

    Directory of Open Access Journals (Sweden)

    Maurice W. Southworth

    2002-01-01

    Full Text Available Inteins are self-splicing protein elements (134 to 608 amino acids. Over 125 inteins have been cataloged in InBase, the on-line intein database (http://www.neb.com/neb/inteins.html, which includes the Intein Registry[1]. Inteins naturally present in pathogenic microbes represent novel, yet unexploited drug targets. Understanding the chemistry of the splicing reaction has allowed the manipulation of inteins, which are now used in many protein engineering applications[2].

  8. Using populations of human and microbial genomes for organism detection in metagenomes

    Science.gov (United States)

    Ames, Sasha K.; Gardner, Shea N.; Marti, Jose Manuel; Slezak, Tom R.; Gokhale, Maya B.; Allen, Jonathan E.

    2015-01-01

    Identifying causative disease agents in human patients from shotgun metagenomic sequencing (SMS) presents a powerful tool to apply when other targeted diagnostics fail. Numerous technical challenges remain, however, before SMS can move beyond the role of research tool. Accurately separating the known and unknown organism content remains difficult, particularly when SMS is applied as a last resort. The true amount of human DNA that remains in a sample after screening against the human reference genome and filtering nonbiological components left from library preparation has previously been underreported. In this study, we create the most comprehensive collection of microbial and reference-free human genetic variation available in a database optimized for efficient metagenomic search by extracting sequences from GenBank and the 1000 Genomes Project. The results reveal new human sequences found in individual Human Microbiome Project (HMP) samples. Individual samples contain up to 95% human sequence, and 4% of the individual HMP samples contain 10% or more human reads. Left unidentified, human reads can complicate and slow down further analysis and lead to inaccurately labeled microbial taxa and ultimately lead to privacy concerns as more human genome data is collected. PMID:25926546

  9. Which Microbial Communities Are Present? Sequence-Based Metagenomics

    Science.gov (United States)

    Caffrey, Sean M.

    The use of metagenomic methods that directly sequence environmental samples has revealed the extraordinary microbial diversity missed by traditional culture-based methodologies. Therefore, to develop a complete and representative model of an environment's microbial community and activities, metagenomic analysis is an essential tool.

  10. MicroScope in 2017: an expanding and evolving integrated resource for community expertise of microbial genomes.

    Science.gov (United States)

    Vallenet, David; Calteau, Alexandra; Cruveiller, Stéphane; Gachet, Mathieu; Lajus, Aurélie; Josso, Adrien; Mercier, Jonathan; Renaux, Alexandre; Rollin, Johan; Rouy, Zoe; Roche, David; Scarpelli, Claude; Médigue, Claudine

    2017-01-04

    The annotation of genomes from NGS platforms needs to be automated and fully integrated. However, maintaining consistency and accuracy in genome annotation is a challenging problem because millions of protein database entries are not assigned reliable functions. This shortcoming limits the knowledge that can be extracted from genomes and metabolic models. Launched in 2005, the MicroScope platform (http://www.genoscope.cns.fr/agc/microscope) is an integrative resource that supports systematic and efficient revision of microbial genome annotation, data management and comparative analysis. Effective comparative analysis requires a consistent and complete view of biological data, and therefore, support for reviewing the quality of functional annotation is critical. MicroScope allows users to analyze microbial (meta)genomes together with post-genomic experiment results if any (i.e. transcriptomics, re-sequencing of evolved strains, mutant collections, phenotype data). It combines tools and graphical interfaces to analyze genomes and to perform the expert curation of gene functions in a comparative context. Starting with a short overview of the MicroScope system, this paper focuses on some major improvements of the Web interface, mainly for the submission of genomic data and on original tools and pipelines that have been developed and integrated in the platform: computation of pan-genomes and prediction of biosynthetic gene clusters. Today the resource contains data for more than 6000 microbial genomes, and among the 2700 personal accounts (65% of which are now from foreign countries), 14% of the users are performing expert annotations, on at least a weekly basis, contributing to improve the quality of microbial genome annotations. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. MicroScope in 2017: an expanding and evolving integrated resource for community expertise of microbial genomes

    Science.gov (United States)

    Vallenet, David; Calteau, Alexandra; Cruveiller, Stéphane; Gachet, Mathieu; Lajus, Aurélie; Josso, Adrien; Mercier, Jonathan; Renaux, Alexandre; Rollin, Johan; Rouy, Zoe; Roche, David; Scarpelli, Claude; Médigue, Claudine

    2017-01-01

    The annotation of genomes from NGS platforms needs to be automated and fully integrated. However, maintaining consistency and accuracy in genome annotation is a challenging problem because millions of protein database entries are not assigned reliable functions. This shortcoming limits the knowledge that can be extracted from genomes and metabolic models. Launched in 2005, the MicroScope platform (http://www.genoscope.cns.fr/agc/microscope) is an integrative resource that supports systematic and efficient revision of microbial genome annotation, data management and comparative analysis. Effective comparative analysis requires a consistent and complete view of biological data, and therefore, support for reviewing the quality of functional annotation is critical. MicroScope allows users to analyze microbial (meta)genomes together with post-genomic experiment results if any (i.e. transcriptomics, re-sequencing of evolved strains, mutant collections, phenotype data). It combines tools and graphical interfaces to analyze genomes and to perform the expert curation of gene functions in a comparative context. Starting with a short overview of the MicroScope system, this paper focuses on some major improvements of the Web interface, mainly for the submission of genomic data and on original tools and pipelines that have been developed and integrated in the platform: computation of pan-genomes and prediction of biosynthetic gene clusters. Today the resource contains data for more than 6000 microbial genomes, and among the 2700 personal accounts (65% of which are now from foreign countries), 14% of the users are performing expert annotations, on at least a weekly basis, contributing to improve the quality of microbial genome annotations. PMID:27899624

  12. Reconstructing Rare Soil Microbial Genomes using in situ Enrichments and Metagenomics

    Directory of Open Access Journals (Sweden)

    Tom O Delmont

    2015-04-01

    Full Text Available Despite extensive direct sequencing efforts and advanced analytical tools, reconstructing microbial genomes from soil using metagenomics have been challenging due to the tremendous diversity and relatively uniform distribution of genomes found in this system. Here we used enrichment techniques in an attempt to decrease the complexity of a soil microbiome prior to sequencing by submitting it to a range of physical and chemical stresses in 23 separate microcosms for four months. The metagenomic analysis of these microcosms at the end of the treatment yielded 540Mb of assembly using standard de novo assembly techniques (a total of 559,555 genes and 29,176 functions, from which we could recover novel bacterial genomes, plasmids and phages. The recovered genomes belonged to Leifsonia (n=2, Rhodanobacter (n=5, Acidobacteria (n=2, Sporolactobacillus (n=2, novel nitrogen fixing taxon, Ktedonobacter (n=1, second representative of the family Ktedonobacteraceae, Streptomyces (n=3, novel polyketide synthase modules and Burkholderia (n=2, includes mega-plasmids conferring mercury resistance. Assembled genomes averaged to 5.9Mb, with relative abundances ranging from rare (0.01% in the original soil microbiome. Furthermore, we detected them in samples collected from geographically distant locations, particularly more in temperate soils compared to samples originating from high-latitude soils and deserts. To the best of our knowledge, this study is the first successful attempt to assemble multiple bacterial genomes directly from a soil sample. Our findings demonstrate that developing pertinent enrichment conditions can stimulate environmental genomic discoveries that would have been impossible to achieve with canonical approaches that focus solely upon post-sequencing data treatment.

  13. Genome-resolved metagenomics reveals that sulfur metabolism dominates the microbial ecology of rising hydrothermal plumes

    Science.gov (United States)

    Anantharaman, K.; Breier, J. A., Jr.; Jain, S.; Reed, D. C.; Dick, G.

    2015-12-01

    Deep-sea hydrothermal plumes occur when hot fluids from hydrothermal vents replete with chemically reduced elements and compounds like sulfide, methane, hydrogen, ammonia, iron and manganese mix with cold, oxic seawater. Chemosynthetic microbes use these reduced chemicals to power primary production and are pervasive throughout the deep sea, even at sites far removed from hydrothermal vents. Although neutrally-buoyant hydrothermal plumes have been well-studied, rising hydrothermal plumes have received little attention even though they represent an important interface in the deep-sea where microbial metabolism and particle formation processes control the transformation of important elements and impact global biogeochemical cycles. In this study, we used genome-resolved metagenomic analyses and thermodynamic-bioenergetic modeling to study the microbial ecology of rising hydrothermal plumes at five different hydrothermal vents spanning a range of geochemical gradients at the Eastern Lau Spreading Center (ELSC) in the Western Pacific Ocean. Our analyses show that differences in the geochemistry of hydrothermal vents do not manifest in microbial diversity and community composition, both of which display only minor variance across ELSC hydrothermal plumes. Microbial metabolism is dominated by oxidation of reduced sulfur species and supports a diversity of bacteria, archaea and viruses that provide intriguing insights into metabolic plasticity and virus-mediated horizontal gene transfer in the microbial community. The manifestation of sulfur oxidation genes in hydrogen and methane oxidizing organisms hints at metabolic opportunism in deep-sea microbes that would enable them to respond to varying redox conditions in hydrothermal plumes. Finally, we infer that the abundance, diversity and metabolic versatility of microbes associated with sulfur oxidation impart functional redundancy that could allow it to persist in the dynamic settings of hydrothermal plumes.

  14. Analyzing the genomic variation of microbial cell factories in the era of “New Biotechnology”

    Directory of Open Access Journals (Sweden)

    Markus Herrgård

    2012-10-01

    Full Text Available The application of genome-scale technologies, both experimental and in silico, to industrial biotechnology has allowed improving the conversion of biomass-derived feedstocks to chemicals, materials and fuels through microbial fermentation. In particular, due to rapidly decreasing costs and its suitability for identifying the genetic determinants of a phenotypic trait of interest, whole genome sequencing is expected to be one of the major driving forces in industrial biotechnology in the coming years. We present some of the recent studies that have successfully applied high-throughput sequencing technologies for finding the underlying molecular mechanisms for (a improved carbon source utilization, (b increased product formation, and (c stress tolerance. We also discuss the strengths and weaknesses of different strategies for mapping industrially relevant genotype-to-phenotype links including exploiting natural diversity in natural isolates or crosses between isolates, classical mutagenesis and evolutionary engineering.

  15. Targeted Access to the Genomes of Low Abundance Organisms in Complex Microbial Communities

    Energy Technology Data Exchange (ETDEWEB)

    Podar, Mircea [ORNL; Abulencia, Carl [Diversa Corporation; Walcher, Marion [Diversa Corporation; Hutchinson, Don [Diversa Corporation; Zengler, Karsten [Diversa Corporation; Garcia, Joseph [Diversa Corporation; Holland, Trevin [Diversa Corporation; Cotton, Dave [Diversa Corporation; Hauser, Loren John [ORNL; Keller, Martin [ORNL

    2007-01-01

    Current metagenomic approaches to the study of complex microbial consortia provide a glimpse into the community metabolism, and occasionally allow genomic assemblies for the most abundant organisms. However, little information is gained for the members of the community present at low frequency, especially those representing yet uncultured taxa-which includes the bulk of the diversity present in most environments. Here we used phylogenetically directed cell separation by fluorescence in situ hybridization and flow cytometry, followed by amplification and sequencing of a fraction of the genomic DNA of several bacterial cells that belong to the TM7 phylum. Partial genomic assembly allowed, for the first time, a look into the evolution and potential metabolism of a soil representative from this group of organisms for which there are no species in stable laboratory cultures. Genomic reconstruction from targeted cells of uncultured organisms directly isolated from the environment represents a powerful approach to access any specific members of a community and an alternative way to assess the community metabolic potential.

  16. Targeted access to the genomes of low-abundance organisms in complex microbial communities.

    Science.gov (United States)

    Podar, Mircea; Abulencia, Carl B; Walcher, Marion; Hutchison, Don; Zengler, Karsten; Garcia, Joseph A; Holland, Trevin; Cotton, David; Hauser, Loren; Keller, Martin

    2007-05-01

    Current metagenomic approaches to the study of complex microbial consortia provide a glimpse into the community metabolism and occasionally allow genomic assemblies for the most abundant organisms. However, little information is gained for the members of the community present at low frequencies, especially those representing yet-uncultured taxa, which include the bulk of the diversity present in most environments. Here we used phylogenetically directed cell separation by fluorescence in situ hybridization and flow cytometry, followed by amplification and sequencing of a fraction of the genomic DNA of several bacterial cells that belong to the TM7 phylum. Partial genomic assembly allowed, for the first time, a look into the evolution and potential metabolism of a soil representative from this group of organisms for which there are no species in stable laboratory cultures. Genomic reconstruction from targeted cells of uncultured organisms isolated directly from the environment represents a powerful approach to access any specific members of a community and an alternative way to assess the community's metabolic potential.

  17. Environmental Whole-Genome Amplification to Access Microbial Diversity in Contaminated Sediments

    Energy Technology Data Exchange (ETDEWEB)

    Abulencia, C.B.; Wyborski, D.L.; Garcia, J.; Podar, M.; Chen, W.; Chang, S.H.; Chang, H.W.; Watson, D.; Brodie,E.I.; Hazen, T.C.; Keller, M.

    2005-12-10

    Low-biomass samples from nitrate and heavy metal contaminated soils yield DNA amounts that have limited use for direct, native analysis and screening. Multiple displacement amplification (MDA) using ?29 DNA polymerase was used to amplify whole genomes from environmental, contaminated, subsurface sediments. By first amplifying the genomic DNA (gDNA), biodiversity analysis and gDNA library construction of microbes found in contaminated soils were made possible. The MDA method was validated by analyzing amplified genome coverage from approximately five Escherichia coli cells, resulting in 99.2 percent genome coverage. The method was further validated by confirming overall representative species coverage and also an amplification bias when amplifying from a mix of eight known bacterial strains. We extracted DNA from samples with extremely low cell densities from a U.S. Department of Energy contaminated site. After amplification, small subunit rRNA analysis revealed relatively even distribution of species across several major phyla. Clone libraries were constructed from the amplified gDNA, and a small subset of clones was used for shotgun sequencing. BLAST analysis of the library clone sequences showed that 64.9 percent of the sequences had significant similarities to known proteins, and ''clusters of orthologous groups'' (COG) analysis revealed that more than half of the sequences from each library contained sequence similarity to known proteins. The libraries can be readily screened for native genes or any target of interest. Whole-genome amplification of metagenomic DNA from very minute microbial sources, while introducing an amplification bias, will allow access to genomic information that was not previously accessible.

  18. Advances in industrial microbiome based on microbial consortium for biorefinery.

    Science.gov (United States)

    Jiang, Li-Li; Zhou, Jin-Jie; Quan, Chun-Shan; Xiu, Zhi-Long

    2017-01-01

    One of the important targets of industrial biotechnology is using cheap biomass resources. The traditional strategy is microbial fermentations with single strain. However, cheap biomass normally contains so complex compositions and impurities that it is very difficult for single microorganism to utilize availably. In order to completely utilize the substrates and produce multiple products in one process, industrial microbiome based on microbial consortium draws more and more attention. In this review, we first briefly described some examples of existing industrial bioprocesses involving microbial consortia. Comparison of 1,3-propanediol production by mixed and pure cultures were then introduced, and interaction relationships between cells in microbial consortium were summarized. Finally, the outlook on how to design and apply microbial consortium in the future was also proposed.

  19. Modeling of Sustainable Base Production by Microbial Electrolysis Cell.

    Science.gov (United States)

    Blatter, Maxime; Sugnaux, Marc; Comninellis, Christos; Nealson, Kenneth; Fischer, Fabian

    2016-07-07

    A predictive model for the microbial/electrochemical base formation from wastewater was established and compared to experimental conditions within a microbial electrolysis cell. A Na2 SO4 /K2 SO4 anolyte showed that model prediction matched experimental results. Using Shewanella oneidensis MR-1, a strong base (pH≈13) was generated using applied voltages between 0.3 and 1.1 V. Due to the use of bicarbonate, the pH value in the anolyte remained unchanged, which is required to maintain microbial activity. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  20. Probabilistic quantitative microbial risk assessment model of norovirus from wastewater irrigated vegetables in Ghana using genome copies and fecal indicator ratio conversion for estimating exposure dose

    DEFF Research Database (Denmark)

    Owusu-Ansah, Emmanuel de-Graft Johnson; Sampson, Angelina; Amponsah, Samuel K.

    2017-01-01

    The need to replace the commonly applied fecal indicator conversions ratio (an assumption of 1:10− 5 virus to fecal indicator organism) in Quantitative Microbial Risk Assessment (QMRA) with models based on quantitative data on the virus of interest has gained prominence due to the different...... physical and environmental factors that might influence the reliability of using indicator organisms in microbial risk assessment. The challenges facing analytical studies on virus enumeration (genome copies or particles) have contributed to the already existing lack of data in QMRA modelling. This study...... to estimate the norovirus count. In all scenarios of using different water sources, the application of the fecal indicator conversion ratio underestimated the norovirus disease burden, measured by the Disability Adjusted Life Years (DALYs), when compared to results using the genome copies norovirus data...

  1. Planning Meeting for Colloquium and Report on: Systems Microbiology: Beyond Microbial Genomics

    Energy Technology Data Exchange (ETDEWEB)

    Buckley, Merry R.

    2005-03-22

    The steering committee for the American Academy of Microbiology's colloquium, ''Systems Microbiology: Beyond Microbial Genomics'' met September 26, 2003, in Washington, DC, to plan the colloquium and discuss the report that would be produced following the colloquium. The steering committee developed the intellectual approach to the issues relating to systems microbiology, including drafting questions for the colloquium participants to work their way through. The committee then identified the scientists that should be invited in order to ensure a comprehensive and thorough analytical report. Dates and a venue were decided upon. The colloquium was held June 4-6, 2004 in Portland, Oregon. There were 35 scientists who spent the weekend discussing specific recommendations for how to capitalize scientifically on the advances in microbial genomics and progress towards a functional understanding of individual microorganisms and microbial communities. The issues discussed at the colloquium were timely and important, and we expect the report, which will be published in 2005, to be extremely well received. Once the report is available, a copy will be forwarded to you. The following items were discussed and will be included in our published report: The focus of this colloquium was on how to capitalize scientifically on the advances in microbial genomics and progress towards a functional understanding of individual microorganisms and microbial communities. Colloquium participants discussed where the field is heading and identify scientific opportunities, challenges, and benefits of this research. An important aspect was the identification of resource and technology gaps that must be addressed in order to advance the field. Making the Case for Systems Microbiology: (1) What can we learn about life processes through studying microbiological systems (sub-cellular, cellular, community)? (2) What important, new fundamental information and potential

  2. Comparative genome analysis of a thermotolerant Escherichia coli obtained by Genome Replication Engineering Assisted Continuous Evolution (GREACE) and its parent strain provides new understanding of microbial heat tolerance.

    Science.gov (United States)

    Luan, Guodong; Bao, Guanhui; Lin, Zhao; Li, Yang; Chen, Zugen; Li, Yin; Cai, Zhen

    2015-12-25

    Heat tolerance of microbes is of great importance for efficient biorefinery and bioconversion. However, engineering and understanding of microbial heat tolerance are difficult and insufficient because it is a complex physiological trait which probably correlates with all gene functions, genetic regulations, and cellular metabolisms and activities. In this work, a novel strain engineering approach named Genome Replication Engineering Assisted Continuous Evolution (GREACE) was employed to improve the heat tolerance of Escherichia coli. When the E. coli strain carrying a mutator was cultivated under gradually increasing temperature, genome-wide mutations were continuously generated during genome replication and the mutated strains with improved thermotolerance were autonomously selected. A thermotolerant strain HR50 capable of growing at 50°C on LB agar plate was obtained within two months, demonstrating the efficiency of GREACE in improving such a complex physiological trait. To understand the improved heat tolerance, genomes of HR50 and its wildtype strain DH5α were sequenced. Evenly distributed 361 mutations covering all mutation types were found in HR50. Closed material transportations, loose genome conformation, and possibly altered cell wall structure and transcription pattern were the main differences of HR50 compared with DH5α, which were speculated to be responsible for the improved heat tolerance. This work not only expanding our understanding of microbial heat tolerance, but also emphasizing that the in vivo continuous genome mutagenesis method, GREACE, is efficient in improving microbial complex physiological trait.

  3. Cell-based detection of microbial biomaterial contaminations.

    Science.gov (United States)

    Roch, Toralf; Ma, Nan; Kratz, Karl; Lendlein, Andreas

    2015-01-01

    A major challenge in biomaterial synthesis and functionalization is the prevention of microbial contaminations such as endotoxins (lipopolysaccharides (LPS)). In addition to LPS, which are exclusively expressed by Gram negative bacteria, also other microbial products derived from fungi or Gram positive bacteria can be found as contaminations in research laboratories. Typically, the Limulus amebocyte lysate (LAL)-test is used to determine the endotoxin levels of medical devices. However, this test fails to detect material-bound LPS and other microbial contaminations and, as demonstrated in this study, detects LPS from various bacterial species with different sensitivities.In this work, a cell-based assay using genetically engineered RAW macrophages, which detect not only soluble but also material-bound microbial contaminations is introduced.The sensitivity of this cell-line towards different LPS species and different heat-inactivated microbes was investigated. As proof of principle a soft hydrophobic poly(n-butyl acrylate) network (cPnBA), which may due to adhesive properties strongly bind microbes, was deliberately contaminated with heat-inactivated bacteria. While the LAL-test failed to detect the microbial contamination, the cell-based assay clearly detected material-bound microbial contaminations. Our data demonstrate that a cell-based detection system should routinely be used as supplement to the LAL-test to determine microbial contaminations of biomaterials.

  4. Emerging concepts on microbial processes in the bathypelagic ocean - ecology, biogeochemistry, and genomics

    Science.gov (United States)

    Nagata, Toshi; Tamburini, Christian; Arístegui, Javier; Baltar, Federico; Bochdansky, Alexander B.; Fonda-Umani, Serena; Fukuda, Hideki; Gogou, Alexandra; Hansell, Dennis A.; Hansman, Roberta L.; Herndl, Gerhard J.; Panagiotopoulos, Christos; Reinthaler, Thomas; Sohrin, Rumi; Verdugo, Pedro; Yamada, Namiha; Yamashita, Youhei; Yokokawa, Taichi; Bartlett, Douglas H.

    2010-08-01

    This paper synthesizes recent findings regarding microbial distributions and processes in the bathypelagic ocean (depth >1000 m). Abundance, production and respiration of prokaryotes reflect supplies of particulate and dissolved organic matter to the bathypelagic zone. Better resolution of carbon fluxes mediated by deep microbes requires further testing on the validity of conversion factors. Archaea, especially marine Crenarchaeota Group I, are abundant in deep waters where they can fix dissolved inorganic carbon. Viruses appear to be important in the microbial loop in deep waters, displaying remarkably high virus to prokaryote abundance ratios in some oceanic regions. Sequencing of 18S rRNA genes revealed a tremendous diversity of small-sized protists in bathypelagic waters. Abundances of heterotrophic nanoflagellates (HNF) and ciliates decrease with depth more steeply than prokaryotes; nonetheless, data indicated that HNF consumed half of prokaryote production in the bathypelagic zone. Aggregates are important habitats for deep-water microbes, which produce more extracellular enzymes (on a per-cell basis) than surface communities. The theory of marine gel formation provides a framework to unravel complex interactions between microbes and organic polymers. Recent data on the effects of hydrostatic pressure on microbial activities indicate that bathypelagic microbial activity is generally higher under in situ pressure conditions than at atmospheric pressures. High-throughput sequencing of 16S rRNA genes revealed a remarkable diversity of Bacteria in the bathypelagic ocean. Metagenomics and comparative genomics of piezophiles reveal not only the high diversity of deep sea microbes but also specific functional attributes of these piezophilic microbes, interpreted as an adaptation to the deep water environment. Taken together, the data compiled on bathypelagic microbes indicate that, despite high-pressure and low-temperature conditions, microbes in the bathypelagic

  5. γ-PGA Hydrolases of Phage Origin in Bacillus subtilis and Other Microbial Genomes.

    Directory of Open Access Journals (Sweden)

    Stefania Mamberti

    Full Text Available Poly-γ-glutamate (γ-PGA is an industrially interesting polymer secreted mainly by members of the class Bacilli which forms a shield able to protect bacteria from phagocytosis and phages. Few enzymes are known to degrade γ-PGA; among them is a phage-encoded γ-PGA hydrolase, PghP. The supposed role of PghP in phages is to ensure access to the surface of bacterial cells by dismantling the γ-PGA barrier. We identified four unannotated B. subtilis genes through similarity of their encoded products to PghP; in fact these genes reside in prophage elements of B. subtilis genome. The recombinant products of two of them demonstrate efficient polymer degradation, confirming that sequence similarity reflects functional homology. Genes encoding similar γ-PGA hydrolases were identified in phages specific for the order Bacillales and in numerous microbial genomes, not only belonging to that order. The distribution of the γ-PGA biosynthesis operon was also investigated with a bioinformatics approach; it was found that the list of organisms endowed with γ-PGA biosynthetic functions is larger than expected and includes several pathogenic species. Moreover in non-Bacillales bacteria the predicted γ-PGA hydrolase genes are preferentially found in species that do not have the genetic asset for polymer production. Our findings suggest that γ-PGA hydrolase genes might have spread across microbial genomes via horizontal exchanges rather than via phage infection. We hypothesize that, in natural habitats rich in γ-PGA supplied by producer organisms, the availability of hydrolases that release glutamate oligomers from γ-PGA might be a beneficial trait under positive selection.

  6. New Perspectives on Microbial Community Distortion after Whole-Genome Amplification

    Science.gov (United States)

    DeSantis, Todd Z.; Santo Domingo, Jorge W.; Ashbolt, Nicholas

    2015-01-01

    Whole-genome amplification (WGA) has become an important tool to explore the genomic information of microorganisms in an environmental sample with limited biomass, however potential selective biases during the amplification processes are poorly understood. Here, we describe the effects of WGA on 31 different microbial communities from five biotopes that also included low-biomass samples from drinking water and groundwater. Our findings provide evidence that microbiome segregation by biotope was possible despite WGA treatment. Nevertheless, samples from different biotopes revealed different levels of distortion, with genomic GC content significantly correlated with WGA perturbation. Certain phylogenetic clades revealed a homogenous trend across various sample types, for instance Alpha- and Betaproteobacteria showed a decrease in their abundance after WGA treatment. On the other hand, Enterobacteriaceae, an important biomarker group for fecal contamination in groundwater and drinking water, were strongly affected by WGA treatment without a predictable pattern. These novel results describe the impact of WGA on low-biomass samples and may highlight issues to be aware of when designing future metagenomic studies that necessitate preceding WGA treatment. PMID:26010362

  7. A hydrogen-based subsurface microbial community dominated by methanogens.

    Science.gov (United States)

    Chapelle, Francis H; O'Neill, Kathleen; Bradley, Paul M; Methé, Barbara A; Ciufo, Stacy A; Knobel, LeRoy L; Lovley, Derek R

    2002-01-17

    The search for extraterrestrial life may be facilitated if ecosystems can be found on Earth that exist under conditions analogous to those present on other planets or moons. It has been proposed, on the basis of geochemical and thermodynamic considerations, that geologically derived hydrogen might support subsurface microbial communities on Mars and Europa in which methanogens form the base of the ecosystem. Here we describe a unique subsurface microbial community in which hydrogen-consuming, methane-producing Archaea far outnumber the Bacteria. More than 90% of the 16S ribosomal DNA sequences recovered from hydrothermal waters circulating through deeply buried igneous rocks in Idaho are related to hydrogen-using methanogenic microorganisms. Geochemical characterization indicates that geothermal hydrogen, not organic carbon, is the primary energy source for this methanogen-dominated microbial community. These results demonstrate that hydrogen-based methanogenic communities do occur in Earth's subsurface, providing an analogue for possible subsurface microbial ecosystems on other planets.

  8. Integrated Genome-Based Studies of Shewanella Echophysiology

    Energy Technology Data Exchange (ETDEWEB)

    Margrethe H. Serres

    2012-06-29

    Shewanella oneidensis MR-1 is a motile, facultative {gamma}-Proteobacterium with remarkable respiratory versatility; it can utilize a range of organic and inorganic compounds as terminal electronacceptors for anaerobic metabolism. The ability to effectively reduce nitrate, S0, polyvalent metals andradionuclides has established MR-1 as an important model dissimilatory metal-reducing microorganism for genome-based investigations of biogeochemical transformation of metals and radionuclides that are of concern to the U.S. Department of Energy (DOE) sites nationwide. Metal-reducing bacteria such as Shewanella also have a highly developed capacity for extracellular transfer of respiratory electrons to solid phase Fe and Mn oxides as well as directly to anode surfaces in microbial fuel cells. More broadly, Shewanellae are recognized free-living microorganisms and members of microbial communities involved in the decomposition of organic matter and the cycling of elements in aquatic and sedimentary systems. To function and compete in environments that are subject to spatial and temporal environmental change, Shewanella must be able to sense and respond to such changes and therefore require relatively robust sensing and regulation systems. The overall goal of this project is to apply the tools of genomics, leveraging the availability of genome sequence for 18 additional strains of Shewanella, to better understand the ecophysiology and speciation of respiratory-versatile members of this important genus. To understand these systems we propose to use genome-based approaches to investigate Shewanella as a system of integrated networks; first describing key cellular subsystems - those involved in signal transduction, regulation, and metabolism - then building towards understanding the function of whole cells and, eventually, cells within populations. As a general approach, this project will employ complimentary "top-down" - bioinformatics-based genome functional predictions, high

  9. Post-Fragmentation Whole Genome Amplification-Based Method

    Science.gov (United States)

    Benardini, James; LaDuc, Myron T.; Langmore, John

    2011-01-01

    This innovation is derived from a proprietary amplification scheme that is based upon random fragmentation of the genome into a series of short, overlapping templates. The resulting shorter DNA strands (fragments with defined 3 and 5 termini. Specific primers to these termini are then used to isothermally amplify this library into potentially unlimited quantities that can be used immediately for multiple downstream applications including gel eletrophoresis, quantitative polymerase chain reaction (QPCR), comparative genomic hybridization microarray, SNP analysis, and sequencing. The standard reaction can be performed with minimal hands-on time, and can produce amplified DNA in as little as three hours. Post-fragmentation whole genome amplification-based technology provides a robust and accurate method of amplifying femtogram levels of starting material into microgram yields with no detectable allele bias. The amplified DNA also facilitates the preservation of samples (spacecraft samples) by amplifying scarce amounts of template DNA into microgram concentrations in just a few hours. Based on further optimization of this technology, this could be a feasible technology to use in sample preservation for potential future sample return missions. The research and technology development described here can be pivotal in dealing with backward/forward biological contamination from planetary missions. Such efforts rely heavily on an increasing understanding of the burden and diversity of microorganisms present on spacecraft surfaces throughout assembly and testing. The development and implementation of these technologies could significantly improve the comprehensiveness and resolving power of spacecraft-associated microbial population censuses, and are important to the continued evolution and advancement of planetary protection capabilities. Current molecular procedures for assaying spacecraft-associated microbial burden and diversity have inherent sample loss issues at

  10. Identification and annotation of promoter regions in microbial genome sequences on the basis of DNA stability

    Indian Academy of Sciences (India)

    Vetriselvi Rangannan; Manju Bansal

    2007-08-01

    Analysis of various predicted structural properties of promoter regions in prokaryotic as well as eukaryotic genomes had earlier indicated that they have several common features, such as lower stability, higher curvature and less bendability, when compared with their neighboring regions. Based on the difference in stability between neighboring upstream and downstream regions in the vicinity of experimentally determined transcription start sites, a promoter prediction algorithm has been developed to identify prokaryotic promoter sequences in whole genomes. The average free energy (E) over known promoter sequences and the difference (D) between E and the average free energy over the entire genome (G) are used to search for promoters in the genomic sequences. Using these cutoff values to predict promoter regions across entire Escherichia coli genome, we achieved a reliability of 70% when the predicted promoters were cross verified against the 960 transcription start sites (TSSs) listed in the Ecocyc database. Annotation of the whole E. coli genome for promoter region could be carried out with 49% accuracy. The method is quite general and it can be used to annotate the promoter regions of other prokaryotic genomes.

  11. Modeling Microbial Biogeochemistry from Terrestrial to Aquatic Ecosystems Using Trait-Based Approaches

    Science.gov (United States)

    King, E.; Molins, S.; Karaoz, U.; Johnson, J. N.; Bouskill, N.; Hug, L. A.; Thomas, B. C.; Castelle, C. J.; Beller, H. R.; Banfield, J. F.; Steefel, C. I.; Brodie, E.

    2014-12-01

    Currently, there is uncertainty in how climate or land-use-induced changes in hydrology and vegetation will affect subsurface carbon flux, the spatial and temporal distribution of flow and transport, biogeochemical cycling, and microbial metabolic activity. Here we focus on the initial development of a Genome-Enabled Watershed Simulation Capability (GEWaSC), which provides a predictive framework for understanding how genomic information stored in a subsurface microbiome affects biogeochemical watershed functioning, how watershed-scale processes affect microbial function, and how these interactions co-evolve. This multiscale framework builds on a hierarchical approach to multiscale modeling, which considers coupling between defined microscale and macroscale components of a system (e.g., a catchment being defined as macroscale and biogeofacies as microscale). Here, we report our progress in the development of a trait-based modeling approach within a reactive transport framework that simulates coupled guilds of microbes. Guild selection is driven by traits extracted from, and physiological properties inferred from, large-scale assembly of metagenome data. Meta-genomic, -transcriptomic and -proteomic information are also used to complement our existing biogeochemical reaction networks and contributes key reactions where biogeochemical analyses are unequivocal. Our approach models the rate of nutrient uptake and the thermodynamics of coupled electron donors and acceptors for a range of microbial metabolisms including heterotrophs and chemolitho(auto)trophs. Metabolism of exogenous substrates fuels catabolic and anabolic processes, with the proportion of energy used for each based upon dynamic intracellular and environmental conditions. In addition to biomass development, anabolism includes the production of key enzymes, such as nitrogenase for nitrogen fixation or exo-enzymes for the hydrolysis of extracellular polymers. This internal resource partitioning represents a

  12. MicroScope—an integrated microbial resource for the curation and comparative analysis of genomic and metabolic data

    Science.gov (United States)

    Vallenet, David; Belda, Eugeni; Calteau, Alexandra; Cruveiller, Stéphane; Engelen, Stefan; Lajus, Aurélie; Le Fèvre, François; Longin, Cyrille; Mornico, Damien; Roche, David; Rouy, Zoé; Salvignol, Gregory; Scarpelli, Claude; Thil Smith, Adam Alexander; Weiman, Marion; Médigue, Claudine

    2013-01-01

    MicroScope is an integrated platform dedicated to both the methodical updating of microbial genome annotation and to comparative analysis. The resource provides data from completed and ongoing genome projects (automatic and expert annotations), together with data sources from post-genomic experiments (i.e. transcriptomics, mutant collections) allowing users to perfect and improve the understanding of gene functions. MicroScope (http://www.genoscope.cns.fr/agc/microscope) combines tools and graphical interfaces to analyse genomes and to perform the manual curation of gene annotations in a comparative context. Since its first publication in January 2006, the system (previously named MaGe for Magnifying Genomes) has been continuously extended both in terms of data content and analysis tools. The last update of MicroScope was published in 2009 in the Database journal. Today, the resource contains data for >1600 microbial genomes, of which ∼300 are manually curated and maintained by biologists (1200 personal accounts today). Expert annotations are continuously gathered in the MicroScope database (∼50 000 a year), contributing to the improvement of the quality of microbial genomes annotations. Improved data browsing and searching tools have been added, original tools useful in the context of expert annotation have been developed and integrated and the website has been significantly redesigned to be more user-friendly. Furthermore, in the context of the European project Microme (Framework Program 7 Collaborative Project), MicroScope is becoming a resource providing for the curation and analysis of both genomic and metabolic data. An increasing number of projects are related to the study of environmental bacterial (meta)genomes that are able to metabolize a large variety of chemical compounds that may be of high industrial interest. PMID:23193269

  13. MicroScope--an integrated microbial resource for the curation and comparative analysis of genomic and metabolic data.

    Science.gov (United States)

    Vallenet, David; Belda, Eugeni; Calteau, Alexandra; Cruveiller, Stéphane; Engelen, Stefan; Lajus, Aurélie; Le Fèvre, François; Longin, Cyrille; Mornico, Damien; Roche, David; Rouy, Zoé; Salvignol, Gregory; Scarpelli, Claude; Thil Smith, Adam Alexander; Weiman, Marion; Médigue, Claudine

    2013-01-01

    MicroScope is an integrated platform dedicated to both the methodical updating of microbial genome annotation and to comparative analysis. The resource provides data from completed and ongoing genome projects (automatic and expert annotations), together with data sources from post-genomic experiments (i.e. transcriptomics, mutant collections) allowing users to perfect and improve the understanding of gene functions. MicroScope (http://www.genoscope.cns.fr/agc/microscope) combines tools and graphical interfaces to analyse genomes and to perform the manual curation of gene annotations in a comparative context. Since its first publication in January 2006, the system (previously named MaGe for Magnifying Genomes) has been continuously extended both in terms of data content and analysis tools. The last update of MicroScope was published in 2009 in the Database journal. Today, the resource contains data for >1600 microbial genomes, of which ∼300 are manually curated and maintained by biologists (1200 personal accounts today). Expert annotations are continuously gathered in the MicroScope database (∼50 000 a year), contributing to the improvement of the quality of microbial genomes annotations. Improved data browsing and searching tools have been added, original tools useful in the context of expert annotation have been developed and integrated and the website has been significantly redesigned to be more user-friendly. Furthermore, in the context of the European project Microme (Framework Program 7 Collaborative Project), MicroScope is becoming a resource providing for the curation and analysis of both genomic and metabolic data. An increasing number of projects are related to the study of environmental bacterial (meta)genomes that are able to metabolize a large variety of chemical compounds that may be of high industrial interest.

  14. BREX is a novel phage resistance system widespread in microbial genomes.

    Science.gov (United States)

    Goldfarb, Tamara; Sberro, Hila; Weinstock, Eyal; Cohen, Ofir; Doron, Shany; Charpak-Amikam, Yoav; Afik, Shaked; Ofir, Gal; Sorek, Rotem

    2015-01-13

    The perpetual arms race between bacteria and phage has resulted in the evolution of efficient resistance systems that protect bacteria from phage infection. Such systems, which include the CRISPR-Cas and restriction-modification systems, have proven to be invaluable in the biotechnology and dairy industries. Here, we report on a six-gene cassette in Bacillus cereus which, when integrated into the Bacillus subtilis genome, confers resistance to a broad range of phages, including both virulent and temperate ones. This cassette includes a putative Lon-like protease, an alkaline phosphatase domain protein, a putative RNA-binding protein, a DNA methylase, an ATPase-domain protein, and a protein of unknown function. We denote this novel defense system BREX (Bacteriophage Exclusion) and show that it allows phage adsorption but blocks phage DNA replication. Furthermore, our results suggest that methylation on non-palindromic TAGGAG motifs in the bacterial genome guides self/non-self discrimination and is essential for the defensive function of the BREX system. However, unlike restriction-modification systems, phage DNA does not appear to be cleaved or degraded by BREX, suggesting a novel mechanism of defense. Pan genomic analysis revealed that BREX and BREX-like systems, including the distantly related Pgl system described in Streptomyces coelicolor, are widely distributed in ~10% of all sequenced microbial genomes and can be divided into six coherent subtypes in which the gene composition and order is conserved. Finally, we detected a phage family that evades the BREX defense, implying that anti-BREX mechanisms may have evolved in some phages as part of their arms race with bacteria.

  15. Unraveling the microbial processes of black band disease in corals through integrated genomics

    Science.gov (United States)

    Sato, Yui; Ling, Edmund Y. S.; Turaev, Dmitrij; Laffy, Patrick; Weynberg, Karen D.; Rattei, Thomas; Willis, Bette L.; Bourne, David G.

    2017-01-01

    Coral disease outbreaks contribute to the ongoing degradation of reef ecosystems, however, microbial mechanisms underlying the onset and progression of most coral diseases are poorly understood. Black band disease (BBD) manifests as a cyanobacterial-dominated microbial mat that destroys coral tissues as it rapidly spreads over coral colonies. To elucidate BBD pathogenesis, we apply a comparative metagenomic and metatranscriptomic approach to identify taxonomic and functional changes within microbial lesions during in-situ development of BBD from a comparatively benign stage termed cyanobacterial patches. Results suggest that photosynthetic CO2-fixation in Cyanobacteria substantially enhances productivity of organic matter within the lesion during disease development. Photosynthates appear to subsequently promote sulfide-production by Deltaproteobacteria, facilitating the major virulence factor of BBD. Interestingly, our metagenome-enabled transcriptomic analysis reveals that BBD-associated cyanobacteria have a putative mechanism that enables them to adapt to higher levels of hydrogen sulfide within lesions, underpinning the pivotal roles of the dominant cyanobacterium within the polymicrobial lesions during the onset of BBD. The current study presents sequence-based evidence derived from whole microbial communities that unravel the mechanism of development and progression of BBD.

  16. IMGMD: A platform for the integration and standardisation of In silico Microbial Genome-scale Metabolic Models.

    Science.gov (United States)

    Ye, Chao; Xu, Nan; Dong, Chuan; Ye, Yuannong; Zou, Xuan; Chen, Xiulai; Guo, Fengbiao; Liu, Liming

    2017-04-07

    Genome-scale metabolic models (GSMMs) constitute a platform that combines genome sequences and detailed biochemical information to quantify microbial physiology at the system level. To improve the unity, integrity, correctness, and format of data in published GSMMs, a consensus IMGMD database was built in the LAMP (Linux + Apache + MySQL + PHP) system by integrating and standardizing 328 GSMMs constructed for 139 microorganisms. The IMGMD database can help microbial researchers download manually curated GSMMs, rapidly reconstruct standard GSMMs, design pathways, and identify metabolic targets for strategies on strain improvement. Moreover, the IMGMD database facilitates the integration of wet-lab and in silico data to gain an additional insight into microbial physiology. The IMGMD database is freely available, without any registration requirements, at http://imgmd.jiangnan.edu.cn/database.

  17. Employment of Near Full-Length Ribosome Gene TA-Cloning and Primer-Blast to Detect Multiple Species in a Natural Complex Microbial Community Using Species-Specific Primers Designed with Their Genome Sequences.

    Science.gov (United States)

    Zhang, Huimin; He, Hongkui; Yu, Xiujuan; Xu, Zhaohui; Zhang, Zhizhou

    2016-11-01

    It remains an unsolved problem to quantify a natural microbial community by rapidly and conveniently measuring multiple species with functional significance. Most widely used high throughput next-generation sequencing methods can only generate information mainly for genus-level taxonomic identification and quantification, and detection of multiple species in a complex microbial community is still heavily dependent on approaches based on near full-length ribosome RNA gene or genome sequence information. In this study, we used near full-length rRNA gene library sequencing plus Primer-Blast to design species-specific primers based on whole microbial genome sequences. The primers were intended to be specific at the species level within relevant microbial communities, i.e., a defined genomics background. The primers were tested with samples collected from the Daqu (also called fermentation starters) and pit mud of a traditional Chinese liquor production plant. Sixteen pairs of primers were found to be suitable for identification of individual species. Among them, seven pairs were chosen to measure the abundance of microbial species through quantitative PCR. The combination of near full-length ribosome RNA gene library sequencing and Primer-Blast may represent a broadly useful protocol to quantify multiple species in complex microbial population samples with species-specific primers.

  18. Thousands of microbial genomes shed light on interconnected biogeochemical processes in an aquifer system.

    Science.gov (United States)

    Anantharaman, Karthik; Brown, Christopher T; Hug, Laura A; Sharon, Itai; Castelle, Cindy J; Probst, Alexander J; Thomas, Brian C; Singh, Andrea; Wilkins, Michael J; Karaoz, Ulas; Brodie, Eoin L; Williams, Kenneth H; Hubbard, Susan S; Banfield, Jillian F

    2016-10-24

    The subterranean world hosts up to one-fifth of all biomass, including microbial communities that drive transformations central to Earth's biogeochemical cycles. However, little is known about how complex microbial communities in such environments are structured, and how inter-organism interactions shape ecosystem function. Here we apply terabase-scale cultivation-independent metagenomics to aquifer sediments and groundwater, and reconstruct 2,540 draft-quality, near-complete and complete strain-resolved genomes that represent the majority of known bacterial phyla as well as 47 newly discovered phylum-level lineages. Metabolic analyses spanning this vast phylogenetic diversity and representing up to 36% of organisms detected in the system are used to document the distribution of pathways in coexisting organisms. Consistent with prior findings indicating metabolic handoffs in simple consortia, we find that few organisms within the community can conduct multiple sequential redox transformations. As environmental conditions change, different assemblages of organisms are selected for, altering linkages among the major biogeochemical cycles.

  19. Genomic mining for novel FADH₂-dependent halogenases in marine sponge-associated microbial consortia.

    Science.gov (United States)

    Bayer, Kristina; Scheuermayer, Matthias; Fieseler, Lars; Hentschel, Ute

    2013-02-01

    Many marine sponges (Porifera) are known to contain large amounts of phylogenetically diverse microorganisms. Sponges are also known for their large arsenal of natural products, many of which are halogenated. In this study, 36 different FADH₂-dependent halogenase gene fragments were amplified from various Caribbean and Mediterranean sponges using newly designed degenerate PCR primers. Four unique halogenase-positive fosmid clones, all containing the highly conserved amino acid motif "GxGxxG", were identified in the microbial metagenome of Aplysina aerophoba. Sequence analysis of one halogenase-bearing fosmid revealed notably two open reading frames with high homologies to efflux and multidrug resistance proteins. Single cell genomic analysis allowed for a taxonomic assignment of the halogenase genes to specific symbiotic lineages. Specifically, the halogenase cluster S1 is predicted to be produced by a deltaproteobacterial symbiont and halogenase cluster S2 by a poribacterial sponge symbiont. An additional halogenase gene is possibly produced by an actinobacterial symbiont of marine sponges. The identification of three novel, phylogenetically, and possibly also functionally distinct halogenase gene clusters indicates that the microbial consortia of sponges are a valuable resource for novel enzymes involved in halogenation reactions.

  20. Single-cell genomics reveal metabolic strategies for microbial growth and survival in an oligotrophic aquifer

    Energy Technology Data Exchange (ETDEWEB)

    Wilkins, Michael J.; Kennedy, David W.; Castelle, Cindy; Field, Erin; Stepanauskas, Ramunas; Fredrickson, Jim K.; Konopka, Allan

    2014-02-09

    Bacteria from the genus Pedobacter are a major component of microbial assemblages at Hanford Site and have been shown to significantly change in abundance in response to the subsurface intrusion of Columbia River water. Here we employed single cell genomics techniques to shed light on the physiological niche of these microorganisms. Analysis of four Pedobacter single amplified genomes (SAGs) from Hanford Site sediments revealed a chemoheterotrophic lifestyle, with the potential to exist under both aerobic and microaerophilic conditions via expression of both aa3­-type and cbb3-type cytochrome c oxidases. These SAGs encoded a wide-range of both intra-and extra­-cellular carbohydrate-active enzymes, potentially enabling the degradation of recalcitrant substrates such as xylan and chitin, and the utilization of more labile sugars such as mannose and fucose. Coupled to these enzymes, a diversity of transporters and sugar-binding molecules were involved in the uptake of carbon from the extracellular local environment. The SAGs were enriched in TonB-dependent receptors (TBDRs), which play a key role in uptake of substrates resulting from degradation of recalcitrant carbon. CRISPR-Cas mechanisms for resisting viral infections were identified in all SAGs. These data demonstrate the potential mechanisms utilized for persistence by heterotrophic microorganisms in a carbon-limited aquifer, and hint at potential linkages between observed Pedobacter abundance shifts within the 300 Area subsurface and biogeochemical shifts associated with Columbia River water intrusion.

  1. High-throughput automated microfluidic sample preparation for accurate microbial genomics

    Science.gov (United States)

    Kim, Soohong; De Jonghe, Joachim; Kulesa, Anthony B.; Feldman, David; Vatanen, Tommi; Bhattacharyya, Roby P.; Berdy, Brittany; Gomez, James; Nolan, Jill; Epstein, Slava; Blainey, Paul C.

    2017-01-01

    Low-cost shotgun DNA sequencing is transforming the microbial sciences. Sequencing instruments are so effective that sample preparation is now the key limiting factor. Here, we introduce a microfluidic sample preparation platform that integrates the key steps in cells to sequence library sample preparation for up to 96 samples and reduces DNA input requirements 100-fold while maintaining or improving data quality. The general-purpose microarchitecture we demonstrate supports workflows with arbitrary numbers of reaction and clean-up or capture steps. By reducing the sample quantity requirements, we enabled low-input (∼10,000 cells) whole-genome shotgun (WGS) sequencing of Mycobacterium tuberculosis and soil micro-colonies with superior results. We also leveraged the enhanced throughput to sequence ∼400 clinical Pseudomonas aeruginosa libraries and demonstrate excellent single-nucleotide polymorphism detection performance that explained phenotypically observed antibiotic resistance. Fully-integrated lab-on-chip sample preparation overcomes technical barriers to enable broader deployment of genomics across many basic research and translational applications. PMID:28128213

  2. Fast comparison of genomic and meta-genomic reads with alignment-free measures based on quality values.

    Science.gov (United States)

    Comin, Matteo; Schimd, Michele

    2016-08-12

    Sequencing technologies are generating enormous amounts of read data, however assembly of genomes and metagenomes remain among the most challenging tasks. In this paper we study the comparison of genomes and metagenomes only based on read data, using word counts statistics called alignment-free thus not requiring reference genomes or assemblies. Quality scores produced by sequencing platforms are fundamental for various analyses, moreover future-generation sequencing platforms, will produce longer reads but with error rate around 15 %. In this context it will be fundamental to exploit quality values information within the framework of alignment-free measures. In this paper we present a family of alignment-free measures, called d (q) -type, that are based on k-mer counts and quality values. These statistics can be used to compare genomes and metagenomes based on their read sets. Results show that the evolutionary relationship of genomes can be reconstructed based on the direct comparison of theirs reads sets. The use of quality values on average improves the classification accuracy, and its contribution increases when the reads are more noisy. Also the comparison of metagenomic microbial communities can be performed efficiently. Similar metagenomes are quickly detected, just by processing their read data, without the need of costly alignments.

  3. A trait-based approach for examining microbial community assembly

    Science.gov (United States)

    Prest, T. L.; Nemergut, D.

    2015-12-01

    Microorganisms regulate all of Earth's major biogeochemical cycles and an understanding of how microbial communities assemble is a key part in evaluating controls over many types of ecosystem processes. Rapid advances in technology and bioinformatics have led to a better appreciation for the variation in microbial community structure in time and space. Yet, advances in theory are necessary to make sense of these data and allow us to generate unifying hypotheses about the causes and consequences of patterns in microbial biodiversity and what they mean for ecosystem function. Here, I will present a metaanalysis of microbial community assembly from a variety of successional and post-disturbance systems. Our analysis shows various distinct patterns in community assembly, and the potential importance of nutrients and dispersal in shaping microbial community beta diversity in these systems. We also used a trait-based approach to generate hypotheses about the mechanisms driving patterns of microbial community assembly and the implications for function. Our work reveals the importance of rRNA operon copy number as a community aggregated trait in helping to reconcile differences in community dynamics between distinct types of successional and disturbed systems. Specifically, our results demonstrate that decreases in average copy number can be a common feature of communities across various drivers of ecological succession, supporting a transition from an r-selected to a K-selected community. Importantly, our work supports the scaling of the copy number trait over multiple levels of biological organization, from cells to populations and communities, and has implications for both ecology and evolution. Trait-based approaches are an important next step to generate and test hypotheses about the forces structuring microbial communities and the subsequent consequences for ecosystem function.

  4. MetaMIS: a metagenomic microbial interaction simulator based on microbial community profiles.

    Science.gov (United States)

    Shaw, Grace Tzun-Wen; Pao, Yueh-Yang; Wang, Daryi

    2016-11-25

    The complexity and dynamics of microbial communities are major factors in the ecology of a system. With the NGS technique, metagenomics data provides a new way to explore microbial interactions. Lotka-Volterra models, which have been widely used to infer animal interactions in dynamic systems, have recently been applied to the analysis of metagenomic data. In this paper, we present the Lotka-Volterra model based tool, the Metagenomic Microbial Interacticon Simulator (MetaMIS), which is designed to analyze the time series data of microbial community profiles. MetaMIS first infers underlying microbial interactions from abundance tables for operational taxonomic units (OTUs) and then interprets interaction networks using the Lotka-Volterra model. We also embed a Bray-Curtis dissimilarity method in MetaMIS in order to evaluate the similarity to biological reality. MetaMIS is designed to tolerate a high level of missing data, and can estimate interaction information without the influence of rare microbes. For each interaction network, MetaMIS systematically examines interaction patterns (such as mutualism or competition) and refines the biotic role within microbes. As a case study, we collect a human male fecal microbiome and show that Micrococcaceae, a relatively low abundance OTU, is highly connected with 13 dominant OTUs and seems to play a critical role. MetaMIS is able to organize multiple interaction networks into a consensus network for comparative studies; thus we as a case study have also identified a consensus interaction network between female and male fecal microbiomes. MetaMIS provides an efficient and user-friendly platform that may reveal new insights into metagenomics data. MetaMIS is freely available at: https://sourceforge.net/projects/metamis/ .

  5. Genome-based Taxonomic Classification of Bacteroidetes

    Directory of Open Access Journals (Sweden)

    Richard L. Hahnke

    2016-12-01

    Full Text Available The bacterial phylum Bacteroidetes, characterized by a distinct gliding motility, occurs in a broad variety of ecosystems, habitats, life styles and physiologies. Accordingly, taxonomic classification of the phylum, based on a limited number of features, proved difficult and controversial in the past, for example, when decisions were based on unresolved phylogenetic trees of the 16S rRNA gene sequence. Here we use a large collection of type-strain genomes from Bacteroidetes and closely related phyla for assessing their taxonomy based on the principles of phylogenetic classification and trees inferred from genome-scale data. No significant conflict between 16S rRNA gene and whole-genome phylogenetic analysis is found, whereas many but not all of the involved taxa are supported as monophyletic groups, particularly in the genome-scale trees. Phenotypic and phylogenomic features support the separation of Balneolaceae as new phylum Balneolaeota from Rhodothermaeota and of Saprospiraceae as new class Saprospiria from Chitinophagia. Epilithonimonas is nested within the older genus Chryseobacterium and without significant phenotypic differences; thus merging the two genera is proposed. Similarly, Vitellibacter is proposed to be included in Aequorivita. Flexibacter is confirmed as being heterogeneous and dissected, yielding six distinct genera. Hallella seregens is a later heterotypic synonym of Prevotella dentalis. Compared to values directly calculated from genome sequences, the G+C content mentioned in many species descriptions is too imprecise; moreover, corrected G+C content values have a significantly better fit to the phylogeny. Corresponding emendations of species descriptions are provided where necessary. Whereas most observed conflict with the current classification of Bacteroidetes is already visible in 16S rRNA gene trees, as expected whole-genome phylogenies are much better resolved.

  6. Genome-Based Taxonomic Classification of Bacteroidetes.

    Science.gov (United States)

    Hahnke, Richard L; Meier-Kolthoff, Jan P; García-López, Marina; Mukherjee, Supratim; Huntemann, Marcel; Ivanova, Natalia N; Woyke, Tanja; Kyrpides, Nikos C; Klenk, Hans-Peter; Göker, Markus

    2016-01-01

    The bacterial phylum Bacteroidetes, characterized by a distinct gliding motility, occurs in a broad variety of ecosystems, habitats, life styles, and physiologies. Accordingly, taxonomic classification of the phylum, based on a limited number of features, proved difficult and controversial in the past, for example, when decisions were based on unresolved phylogenetic trees of the 16S rRNA gene sequence. Here we use a large collection of type-strain genomes from Bacteroidetes and closely related phyla for assessing their taxonomy based on the principles of phylogenetic classification and trees inferred from genome-scale data. No significant conflict between 16S rRNA gene and whole-genome phylogenetic analysis is found, whereas many but not all of the involved taxa are supported as monophyletic groups, particularly in the genome-scale trees. Phenotypic and phylogenomic features support the separation of Balneolaceae as new phylum Balneolaeota from Rhodothermaeota and of Saprospiraceae as new class Saprospiria from Chitinophagia. Epilithonimonas is nested within the older genus Chryseobacterium and without significant phenotypic differences; thus merging the two genera is proposed. Similarly, Vitellibacter is proposed to be included in Aequorivita. Flexibacter is confirmed as being heterogeneous and dissected, yielding six distinct genera. Hallella seregens is a later heterotypic synonym of Prevotella dentalis. Compared to values directly calculated from genome sequences, the G+C content mentioned in many species descriptions is too imprecise; moreover, corrected G+C content values have a significantly better fit to the phylogeny. Corresponding emendations of species descriptions are provided where necessary. Whereas most observed conflict with the current classification of Bacteroidetes is already visible in 16S rRNA gene trees, as expected whole-genome phylogenies are much better resolved.

  7. Graphene-Based Flexible Micrometer-Sized Microbial Fuel Cell

    KAUST Repository

    Mink, Justine E.

    2013-10-23

    Microbial fuel cells harvest electrical energy produced by bacteria during the natural decomposition of organic matter. We report a micrometer-sized microbial fuel cell that is able to generate nanowatt-scale power from microliters of liquids. The sustainable design is comprised of a graphene anode, an air cathode, and a polymer-based substrate platform for flexibility. The graphene layer was grown on a nickel thin film by using chemical vapor deposition at atmospheric pressure. Our demonstration provides a low-cost option to generate useful power for lab-on-chip applications and could be promising to rapidly screen and scale up microbial fuel cells for water purification without consuming excessive power (unlike other water treatment technologies).

  8. High definition for systems biology of microbial communities: metagenomics gets genome-centric and strain-resolved.

    Science.gov (United States)

    Turaev, Dmitrij; Rattei, Thomas

    2016-06-01

    The systems biology of microbial communities, organismal communities inhabiting all ecological niches on earth, has in recent years been strongly facilitated by the rapid development of experimental, sequencing and data analysis methods. Novel experimental approaches and binning methods in metagenomics render the semi-automatic reconstructions of near-complete genomes of uncultivable bacteria possible, while advances in high-resolution amplicon analysis allow for efficient and less biased taxonomic community characterization. This will also facilitate predictive modeling approaches, hitherto limited by the low resolution of metagenomic data. In this review, we pinpoint the most promising current developments in metagenomics. They facilitate microbial systems biology towards a systemic understanding of mechanisms in microbial communities with scopes of application in many areas of our daily life. Copyright © 2016 Elsevier Ltd. All rights reserved.

  9. Construction of a dairy microbial genome catalog opens new perspectives for the metagenomic analysis of dairy fermented products

    DEFF Research Database (Denmark)

    Almeida, Mathieu; Hebert, Agnes; Abraham, Anne-Laure

    2014-01-01

    Kluyvera, Luteococcus and Marinilactibacillus, still missing from public database. To demonstrate the potential of this catalog, we analysed the microbial composition of the surface of two smear cheeses and one blue-veined cheese, and showed that a significant part of the microbiota of these traditional......Background: Microbial communities of traditional cheeses are complex and insufficiently characterized. The origin, safety and functional role in cheese making of these microbial communities are still not well understood. Metagenomic analysis of these communities by high throughput shotgun...... cheeses was composed of microorganisms newly sequenced in our study. Conclusions: Our study provides data, which combined with publicly available genome references, represents the most expansive catalog to date of cheese-associated bacteria. Using this extended dairy catalog, we revealed the presence...

  10. Draft Genome Sequence of Methanobacterium sp. Maddingley, Reconstructed from Metagenomic Sequencing of a Methanogenic Microbial Consortium Enriched from Coal-Seam Gas Formation Water.

    Science.gov (United States)

    Rosewarne, Carly P; Greenfield, Paul; Li, Dongmei; Tran-Dinh, Nai; Midgley, David J; Hendry, Philip

    2013-01-01

    The draft genome of Methanobacterium sp. Maddingley was reconstructed from metagenomic sequencing of a methanogenic microbial consortium enriched from coal-seam gas formation water. It is a hydrogenotrophic methanogen predicted to grow using hydrogen and carbon dioxide.

  11. Assembly-driven metagenomics of a hypersaline microbial ecosystem (2013 DOE JGI Genomics of Energy and Environment 8th Annual User Meeting)

    Energy Technology Data Exchange (ETDEWEB)

    Allen, Eric [Scripps and UCSD

    2013-03-01

    Eric Allen of Scripps and UC San Diego on "Assembly-driven metagenomics of a hypersaline microbial ecosystem" at the 8th Annual Genomics of Energy & Environment Meeting on March 27, 2013 in Walnut Creek, Calif.

  12. GO4genome: A Prokaryotic Phylogeny Based on Genome Organization

    OpenAIRE

    Merkl, Rainer; Wiezer, Arnim

    2009-01-01

    Determining the phylogeny of closely related prokaryotes may fail in an analysis of rRNA or a small set of sequences. Whole-genome phylogeny utilizes the maximally available sample space. For a precise determination of genome similarity, two aspects have to be considered when developing an algorithm of whole-genome phylogeny: (1) gene order conservation is a more precise signal than gene content; and (2) when using sequence similarity, failures in identifying orthologues or the in situ replac...

  13. Germ warfare in a microbial mat community: CRISPRs provide insights into the co-evolution of host and viral genomes.

    Directory of Open Access Journals (Sweden)

    John F Heidelberg

    Full Text Available CRISPR arrays and associated cas genes are widespread in bacteria and archaea and confer acquired resistance to viruses. To examine viral immunity in the context of naturally evolving microbial populations we analyzed genomic data from two thermophilic Synechococcus isolates (Syn OS-A and Syn OS-B' as well as a prokaryotic metagenome and viral metagenome derived from microbial mats in hotsprings at Yellowstone National Park. Two distinct CRISPR types, distinguished by the repeat sequence, are found in both the Syn OS-A and Syn OS-B' genomes. The genome of Syn OS-A contains a third CRISPR type with a distinct repeat sequence, which is not found in Syn OS-B', but appears to be shared with other microorganisms that inhabit the mat. The CRISPR repeats identified in the microbial metagenome are highly conserved, while the spacer sequences (hereafter referred to as "viritopes" to emphasize their critical role in viral immunity were mostly unique and had no high identity matches when searched against GenBank. Searching the viritopes against the viral metagenome, however, yielded several matches with high similarity some of which were within a gene identified as a likely viral lysozyme/lysin protein. Analysis of viral metagenome sequences corresponding to this lysozyme/lysin protein revealed several mutations all of which translate into silent or conservative mutations which are unlikely to affect protein function, but may help the virus evade the host CRISPR resistance mechanism. These results demonstrate the varied challenges presented by a natural virus population, and support the notion that the CRISPR/viritope system must be able to adapt quickly to provide host immunity. The ability of metagenomics to track population-level variation in viritope sequences allows for a culture-independent method for evaluating the fast co-evolution of host and viral genomes and its consequence on the structuring of complex microbial communities.

  14. Rapid detection of microbial DNA by a novel isothermal genome exponential amplification reaction (GEAR) assay.

    Science.gov (United States)

    Prithiviraj, Jothikumar; Hill, Vincent; Jothikumar, Narayanan

    2012-04-20

    In this study we report the development of a simple target-specific isothermal nucleic acid amplification technique, termed genome exponential amplification reaction (GEAR). Escherichia coli was selected as the microbial target to demonstrate the GEAR technique as a proof of concept. The GEAR technique uses a set of four primers; in the present study these primers targeted 5 regions on the 16S rRNA gene of E. coli. The outer forward and reverse Tab primer sequences are complementary to each other at their 5' end, whereas their 3' end sequences are complementary to their respective target nucleic acid sequences. The GEAR assay was performed at a constant temperature 60 °C and monitored continuously in a real-time PCR instrument in the presence of an intercalating dye (SYTO 9). The GEAR assay enabled amplification of as few as one colony forming units of E. coli per reaction within 30 min. We also evaluated the GEAR assay for rapid identification of bacterial colonies cultured on agar media directly in the reaction without DNA extraction. Cells from E. coli colonies were picked and added directly to GEAR assay mastermix without prior DNA extraction. DNA in the cells could be amplified, yielding positive results within 15 min.

  15. Reticulate classification of mosaic microbial genomes using NeAT website.

    Science.gov (United States)

    Lima-Mendez, Gipsi

    2012-01-01

    The tree of life is the classical representation of the evolutionary relationships between existent species. A tree is appropriate to display the divergence of species through mutation, i.e., by vertical descent. However, lateral gene transfer (LGT) is excluded from such representations. When LGT contribution to genome evolution cannot be neglected (e.g., for prokaryotes and mobile genetic elements), the tree becomes misleading. Networks appear as an intuitive way to represent both vertical and horizontal relationships, while overlapping groups within such graphs are more suitable for their classification. Here, we describe a method to represent both vertical and horizontal relationships. We start with a set of genomes whose coded proteins have been grouped into families based on sequence similarity. Next, all pairs of genomes are compared, counting the number of proteins classified into the same family. From this comparison, we derive a weighted graph where genomes with a significant number of similar proteins are linked. Finally, we apply a two-step clustering of this graph to produce a classification where nodes can be assigned to multiple clusters. The procedure can be performed using the Network Analysis Tools (NeAT) website.

  16. Choosing a suitable method for the identification of replication origins in microbial genomes

    Directory of Open Access Journals (Sweden)

    Chengcheng eSong

    2015-09-01

    Full Text Available As the replication of genomic DNA is arguably the most important task performed by a cell and given that it is controlled at the initiation stage, the events that occur at the replication origin play a central role in the cell cycle. Making sense of DNA replication origins is important for improving our capacity to study cellular processes and functions in the regulation of gene expression, genome integrity in much finer detail. Thus, clearly comprehending the positions and sequences of replication origins which are fundamental to chromosome organization and duplication is the first priority of all. In view of such important roles of replication origins, tremendous work has been aimed at identifying and testing the specificity of replication origins. A number of computational tools based on various skew types have been developed to predict replication origins. Using various in silico approaches such as Ori-Finder 2, and databases such as DoriC and oriDB, researchers have predicted the locations of replication origins sites for thousands bacterial chromosomes and archaeal genomes. Based on the predicted results, we should choose an effective method for identifying and confirming the interactions at origins of replication. Here we describe the main existing experimental methods that aimed to determine the replication origin regions and list some of the many the practical applications of these methods.

  17. StellaBase: The Nematostella vectensis Genomics Database

    OpenAIRE

    James C Sullivan; Ryan, Joseph F; Watson, James A.; Webb, Jeramy; Mullikin, James C; Rokhsar, Daniel; Finnerty, John R

    2005-01-01

    StellaBase, the Nematostella vectensis Genomics Database, is a web-based resource that will facilitate desktop and bench-top studies of the starlet sea anemone. Nematostella is an emerging model organism that has already proven useful for addressing fundamental questions in developmental evolution and evolutionary genomics. StellaBase allows users to query the assembled Nematostella genome, a confirmed gene library, and a predicted genome using both keyword and homology based search functions...

  18. Quantitative metagenomic analyses based on average genome size normalization

    DEFF Research Database (Denmark)

    Frank, Jeremy Alexander; Sørensen, Søren Johannes

    2011-01-01

    Over the past quarter-century, microbiologists have used DNA sequence information to aid in the characterization of microbial communities. During the last decade, this has expanded from single genes to microbial community genomics, or metagenomics, in which the gene content of an environment can...... by estimating average genome sizes. This normalization can relieve comparative biases introduced by differences in community structure, number of sequencing reads, and sequencing read lengths between different metagenomes. We demonstrate the utility of this approach by comparing metagenomes from two different...... marine sources using both conventional small-subunit (SSU) rRNA gene analyses and our quantitative method to calculate the proportion of genomes in each sample that are capable of a particular metabolic trait. With both environments, to determine what proportion of each community they make up and how...

  19. Microbial Character Related Sulfur Cycle under Dynamic Environmental Factors Based on the Microbial Population Analysis in Sewerage System

    Science.gov (United States)

    Dong, Qian; Shi, Hanchang; Liu, Yanchen

    2017-01-01

    The undesired sulfur cycle derived by microbial population can ultimately causes the serious problems of sewerage systems. However, the microbial community characters under dynamic environment factors in actual sewerage system is still not enough. This current study aimed to character the distributions and compositions of microbial communities that participate in the sulfur cycle under the dynamic environmental conditions in a local sewerage system. To accomplish this, microbial community compositions were assessed using 454 high-throughput sequencing (16S rDNA) combined with dsrB gene-based denaturing gradient gel electrophoresis. The results indicated that a higher diversity of microbial species was present at locations in sewers with high concentrations of H2S. Actinobacteria and Proteobacteria were dominant in the sewerage system, while Actinobacteria alone were dominant in regions with high concentrations of H2S. Specifically, the unique operational taxonomic units could aid to characterize the distinct microbial communities within a sewerage manhole. The proportion of sulfate-reducing bacteria, each sulfur-oxidizing bacteria (SOB) were strongly correlated with the liquid parameters (DO, ORP, COD, Sulfide, NH3-N), while the Mycobacterium and Acidophilic SOB (M&A) was strongly correlated with gaseous factors within the sewer, such as H2S, CH4, and CO. Identifying the distributions and proportions of critical microbial communities within sewerage systems could provide insights into how the microbial sulfur cycle is affected by the dynamic environmental conditions that exist in sewers and might be useful for explaining the potential sewerage problems. PMID:28261160

  20. Microbial Character Related Sulfur Cycle under Dynamic Environmental Factors Based on the Microbial Population Analysis in Sewerage System.

    Science.gov (United States)

    Dong, Qian; Shi, Hanchang; Liu, Yanchen

    2017-01-01

    The undesired sulfur cycle derived by microbial population can ultimately causes the serious problems of sewerage systems. However, the microbial community characters under dynamic environment factors in actual sewerage system is still not enough. This current study aimed to character the distributions and compositions of microbial communities that participate in the sulfur cycle under the dynamic environmental conditions in a local sewerage system. To accomplish this, microbial community compositions were assessed using 454 high-throughput sequencing (16S rDNA) combined with dsrB gene-based denaturing gradient gel electrophoresis. The results indicated that a higher diversity of microbial species was present at locations in sewers with high concentrations of H2S. Actinobacteria and Proteobacteria were dominant in the sewerage system, while Actinobacteria alone were dominant in regions with high concentrations of H2S. Specifically, the unique operational taxonomic units could aid to characterize the distinct microbial communities within a sewerage manhole. The proportion of sulfate-reducing bacteria, each sulfur-oxidizing bacteria (SOB) were strongly correlated with the liquid parameters (DO, ORP, COD, Sulfide, NH3-N), while the Mycobacterium and Acidophilic SOB (M&A) was strongly correlated with gaseous factors within the sewer, such as H2S, CH4, and CO. Identifying the distributions and proportions of critical microbial communities within sewerage systems could provide insights into how the microbial sulfur cycle is affected by the dynamic environmental conditions that exist in sewers and might be useful for explaining the potential sewerage problems.

  1. Tool for rapid annotation of microbial SNPs (TRAMS): a simple program for rapid annotation of genomic variation in prokaryotes.

    Science.gov (United States)

    Reumerman, Richard A; Tucker, Nicholas P; Herron, Paul R; Hoskisson, Paul A; Sangal, Vartul

    2013-09-01

    Next generation sequencing (NGS) has been widely used to study genomic variation in a variety of prokaryotes. Single nucleotide polymorphisms (SNPs) resulting from genomic comparisons need to be annotated for their functional impact on the coding sequences. We have developed a program, TRAMS, for functional annotation of genomic SNPs which is available to download as a single file executable for WINDOWS users with limited computational experience and as a Python script for Mac OS and Linux users. TRAMS needs a tab delimited text file containing SNP locations, reference nucleotide and SNPs in variant strains along with a reference genome sequence in GenBank or EMBL format. SNPs are annotated as synonymous, nonsynonymous or nonsense. Nonsynonymous SNPs in start and stop codons are separated as non-start and non-stop SNPs, respectively. SNPs in multiple overlapping features are annotated separately for each feature and multiple nucleotide polymorphisms within a codon are combined before annotation. We have also developed a workflow for Galaxy, a highly used tool for analysing NGS data, to map short reads to a reference genome and extract and annotate the SNPs. TRAMS is a simple program for rapid and accurate annotation of SNPs that will be very useful for microbiologists in analysing genomic diversity in microbial populations.

  2. Genomics-based plant germplasm research (GPGR

    Directory of Open Access Journals (Sweden)

    Jizeng Jia

    2017-04-01

    Full Text Available Plant germplasm underpins much of crop genetic improvement. Millions of germplasm accessions have been collected and conserved ex situ and/or in situ, and the major challenge is now how to exploit and utilize this abundant resource. Genomics-based plant germplasm research (GPGR or “Genoplasmics” is a novel cross-disciplinary research field that seeks to apply the principles and techniques of genomics to germplasm research. We describe in this paper the concept, strategy, and approach behind GPGR, and summarize current progress in the areas of the definition and construction of core collections, enhancement of germplasm with core collections, and gene discovery from core collections. GPGR is opening a new era in germplasm research. The contribution, progress and achievements of GPGR in the future are predicted.

  3. Microbial lipid-based lignocellulosic biorefinery: feasibility and challenges.

    Science.gov (United States)

    Jin, Mingjie; Slininger, Patricia J; Dien, Bruce S; Waghmode, Suresh; Moser, Bryan R; Orjuela, Andrea; Sousa, Leonardo da Costa; Balan, Venkatesh

    2015-01-01

    Although single-cell oil (SCO) has been studied for decades, lipid production from lignocellulosic biomass has received substantial attention only in recent years as biofuel research moves toward producing drop-in fuels. This review gives an overview of the feasibility and challenges that exist in realizing microbial lipid production from lignocellulosic biomass in a biorefinery. The aspects covered here include biorefinery technologies, the microbial oil market, oleaginous microbes, lipid accumulation metabolism, strain development, process configurations, lignocellulosic lipid production, technical hurdles, lipid recovery, and technoeconomics. The lignocellulosic SCO-based biorefinery will be feasible only if a combination of low- and high-value lipids are coproduced, while lignin and protein are upgraded to high-value products. Copyright © 2014 Elsevier Ltd. All rights reserved.

  4. An evaluation of multiple annealing and looping based genome amplification using a synthetic bacterial community

    KAUST Repository

    Wang, Yong

    2016-02-23

    The low biomass in environmental samples is a major challenge for microbial metagenomic studies. The amplification of a genomic DNA was frequently applied to meeting the minimum requirement of the DNA for a high-throughput next-generation-sequencing technology. Using a synthetic bacterial community, the amplification efficiency of the Multiple Annealing and Looping Based Amplification Cycles (MALBAC) kit that is originally developed to amplify the single-cell genomic DNA of mammalian organisms is examined. The DNA template of 10 pg in each reaction of the MALBAC amplification may generate enough DNA for Illumina sequencing. Using 10 pg and 100 pg templates for each reaction set, the MALBAC kit shows a stable and homogeneous amplification as indicated by the highly consistent coverage of the reads from the two amplified samples on the contigs assembled by the original unamplified sample. Although GenomePlex whole genome amplification kit allows one to generate enough DNA using 100 pg of template in each reaction, the minority of the mixed bacterial species is not linearly amplified. For both of the kits, the GC-rich regions of the genomic DNA are not efficiently amplified as suggested by the low coverage of the contigs with the high GC content. The high efficiency of the MALBAC kit is supported for the amplification of environmental microbial DNA samples, and the concerns on its application are also raised to bacterial species with the high GC content.

  5. An evaluation of multiple annealing and looping based genome amplification using a synthetic bacterial community

    Institute of Scientific and Technical Information of China (English)

    WANG Yong; GAO Zhaoming; XU Ying; LI Guangyu; HE Lisheng; QIAN Peiyuan

    2016-01-01

    The low biomass in environmental samples is a major challenge for microbial metagenomic studies. The amplification of a genomic DNA was frequently applied to meeting the minimum requirement of the DNA for a high-throughput next-generation-sequencing technology. Using a synthetic bacterial community, the amplification efficiency of the Multiple Annealing and Looping Based Amplification Cycles (MALBAC) kit that is originally developed to amplify the single-cell genomic DNA of mammalian organisms is examined. The DNA template of 10 pg in each reaction of the MALBAC amplification may generate enough DNA for Illumina sequencing. Using 10 pg and 100 pg templates for each reaction set, the MALBAC kit shows a stable and homogeneous amplification as indicated by the highly consistent coverage of the reads from the two amplified samples on the contigs assembled by the original unamplified sample. Although GenomePlex whole genome amplification kit allows one to generate enough DNA using 100 pg of template in each reaction, the minority of the mixed bacterial species is not linearly amplified. For both of the kits, the GC-rich regions of the genomic DNA are not efficiently amplified as suggested by the low coverage of the contigs with the high GC content. The high efficiency of the MALBAC kit is supported for the amplification of environmental microbial DNA samples, and the concerns on its application are also raised to bacterial species with the high GC content.

  6. Genome-Centric Analysis of Microbial Populations Enriched by Hydraulic Fracture Fluid Additives in a Coal Bed Methane Production Well.

    Science.gov (United States)

    Robbins, Steven J; Evans, Paul N; Parks, Donovan H; Golding, Suzanne D; Tyson, Gene W

    2016-01-01

    Coal bed methane (CBM) is generated primarily through the microbial degradation of coal. Despite a limited understanding of the microorganisms responsible for this process, there is significant interest in developing methods to stimulate additional methane production from CBM wells. Physical techniques including hydraulic fracture stimulation are commonly applied to CBM wells, however the effects of specific additives contained in hydraulic fracture fluids on native CBM microbial communities are poorly understood. Here, metagenomic sequencing was applied to the formation waters of a hydraulically fractured and several non-fractured CBM production wells to determine the effect of this stimulation technique on the in-situ microbial community. The hydraulically fractured well was dominated by two microbial populations belonging to the class Phycisphaerae (within phylum Planctomycetes) and candidate phylum Aminicenantes. Populations from these phyla were absent or present at extremely low abundance in non-fractured CBM wells. Detailed metabolic reconstruction of near-complete genomes from these populations showed that their high relative abundance in the hydraulically fractured CBM well could be explained by the introduction of additional carbon sources, electron acceptors, and biocides contained in the hydraulic fracture fluid.

  7. Minimum probe length for unique identification of all open reading frames in a microbial genome

    Energy Technology Data Exchange (ETDEWEB)

    Sokhansanj, B A; Ng, J; Fitch, J P

    2000-03-05

    In this paper, we determine the minimum hybridization probe length to uniquely identify at least 95% of the open reading frame (ORF) in an organism. We analyze the whole genome sequences of 17 species, 11 bacteria, 4 archaea, and 2 eukaryotes. We also present a mathematical model for minimum probe length based on assuming that all ORFs are random, of constant length, and contain an equal distribution of bases. The model accurately predicts the minimum probe length for all species, but it incorrectly predicts that all ORFs may be uniquely identified. However, a probe length of just 9 bases is adequate to identify over 95% of the ORFs for all 15 prokaryotic species we studied. Using a minimum probe length, while accepting that some ORFs may not be identified and that data will be lost due to hybridization error, may result in significant savings in microarray and oligonucleotide probe design.

  8. GenColors-based comparative genome databases for small eukaryotic genomes.

    Science.gov (United States)

    Felder, Marius; Romualdi, Alessandro; Petzold, Andreas; Platzer, Matthias; Sühnel, Jürgen; Glöckner, Gernot

    2013-01-01

    Many sequence data repositories can give a quick and easily accessible overview on genomes and their annotations. Less widespread is the possibility to compare related genomes with each other in a common database environment. We have previously described the GenColors database system (http://gencolors.fli-leibniz.de) and its applications to a number of bacterial genomes such as Borrelia, Legionella, Leptospira and Treponema. This system has an emphasis on genome comparison. It combines data from related genomes and provides the user with an extensive set of visualization and analysis tools. Eukaryote genomes are normally larger than prokaryote genomes and thus pose additional challenges for such a system. We have, therefore, adapted GenColors to also handle larger datasets of small eukaryotic genomes and to display eukaryotic gene structures. Further recent developments include whole genome views, genome list options and, for bacterial genome browsers, the display of horizontal gene transfer predictions. Two new GenColors-based databases for two fungal species (http://fgb.fli-leibniz.de) and for four social amoebas (http://sacgb.fli-leibniz.de) were set up. Both new resources open up a single entry point for related genomes for the amoebozoa and fungal research communities and other interested users. Comparative genomics approaches are greatly facilitated by these resources.

  9. New perspectives on microbial community distortion after whole-genome amplification

    Science.gov (United States)

    Whole-genome amplification (WGA) has become an important tool to explore the genomic information of microorganisms in an environmental sample with limited biomass, however potential selective biases during the amplification processes are poorly understood. Here, we describe the e...

  10. A reference guide to microbial cell surface hydrophobicity based on contact angles

    NARCIS (Netherlands)

    van der Mei, HC; Busscher, HJ; Bos, R.R.M.

    1998-01-01

    Acid-base interactions form the origin of the hydrophobicity of microbial cell-surfaces and can be quantitated from contact angle measurements on microbial lawns with water, formamide, methyleneiodide and/or alpha-bromonaphthalene. This review provides a reference guide to microbial cell surface hyd

  11. Genome shuffling在菌种改良中的应用%Rapid Improvement of Microbial Phenotypes by Genome shuffling

    Institute of Scientific and Technical Information of China (English)

    朱欣杰; 程瑶

    2006-01-01

    Genome shuffling作为一种新型的选育方法,与传统方法比更加高效快捷,近年来不断被应用于制药、食品制造、环境工程等众多领域的菌种改良研究中.综述了Genome shuffling在菌种改良中的应用.

  12. Single cell genome amplification accelerates identification of the apratoxin biosynthetic pathway from a complex microbial assemblage.

    Directory of Open Access Journals (Sweden)

    Rashel V Grindberg

    Full Text Available Filamentous marine cyanobacteria are extraordinarily rich sources of structurally novel, biomedically relevant natural products. To understand their biosynthetic origins as well as produce increased supplies and analog molecules, access to the clustered biosynthetic genes that encode for the assembly enzymes is necessary. Complicating these efforts is the universal presence of heterotrophic bacteria in the cell wall and sheath material of cyanobacteria obtained from the environment and those grown in uni-cyanobacterial culture. Moreover, the high similarity in genetic elements across disparate secondary metabolite biosynthetic pathways renders imprecise current gene cluster targeting strategies and contributes sequence complexity resulting in partial genome coverage. Thus, it was necessary to use a dual-method approach of single-cell genomic sequencing based on multiple displacement amplification (MDA and metagenomic library screening. Here, we report the identification of the putative apratoxin. A biosynthetic gene cluster, a potent cancer cell cytotoxin with promise for medicinal applications. The roughly 58 kb biosynthetic gene cluster is composed of 12 open reading frames and has a type I modular mixed polyketide synthase/nonribosomal peptide synthetase (PKS/NRPS organization and features loading and off-loading domain architecture never previously described. Moreover, this work represents the first successful isolation of a complete biosynthetic gene cluster from Lyngbya bouillonii, a tropical marine cyanobacterium renowned for its production of diverse bioactive secondary metabolites.

  13. The United States Culture Collection Network (USCCN): Enhancing Microbial Genomics Research through Living Microbe Culture Collections

    Energy Technology Data Exchange (ETDEWEB)

    Boundy-Mills, K.; Hess, Matthias; Bennett, A. R.; Ryan, Matthew; Kang, Seogchan; Nobles, David; Eisen, Jonathan A.; Inderbitzin, Patrik; Sitepu, Irnayuli R.; Torok, Tamas; Brown, Daniel R; Cho, Juliana; Wertz, John E.; Mukherjee, Supratim; Cady, Sherry L.; McCluskey, Kevin

    2015-09-01

    The mission of the United States Culture Collection Network (USCCN; http://usccn.org) is "to facilitate the safe and responsible utilization of microbial resources for research, education, industry, medicine, and agriculture for the betterment of human kind." Microbial culture collections are a key component of life science research, biotechnology, and emerging global biobased economies. Representatives and users of several microbial culture collections from the United States and Europe gathered at the University of California, Davis, to discuss how collections of microorganisms can better serve users and stakeholders and to showcase existing resources available in public culture collections.

  14. Genome-Resolved Metagenomic Analysis Reveals Roles for Candidate Phyla and Other Microbial Community Members in Biogeochemical Transformations in Oil Reservoirs

    Directory of Open Access Journals (Sweden)

    Ping Hu

    2016-01-01

    Full Text Available Oil reservoirs are major sites of methane production and carbon turnover, processes with significant impacts on energy resources and global biogeochemical cycles. We applied a cultivation-independent genomic approach to define microbial community membership and predict roles for specific organisms in biogeochemical transformations in Alaska North Slope oil fields. Produced water samples were collected from six locations between 1,128 m (24 to 27°C and 2,743 m (80 to 83°C below the surface. Microbial community complexity decreased with increasing temperature, and the potential to degrade hydrocarbon compounds was most prevalent in the lower-temperature reservoirs. Sulfate availability, rather than sulfate reduction potential, seems to be the limiting factor for sulfide production in some of the reservoirs under investigation. Most microorganisms in the intermediate- and higher-temperature samples were related to previously studied methanogenic and nonmethanogenic archaea and thermophilic bacteria, but one candidate phylum bacterium, a member of the Acetothermia (OP1, was present in Kuparuk sample K3. The greatest numbers of candidate phyla were recovered from the mesothermic reservoir samples SB1 and SB2. We reconstructed a nearly complete genome for an organism from the candidate phylum Parcubacteria (OD1 that was abundant in sample SB1. Consistent with prior findings for members of this lineage, the OD1 genome is small, and metabolic predictions support an obligately anaerobic, fermentation-based lifestyle. At moderate abundance in samples SB1 and SB2 were members of bacteria from other candidate phyla, including Microgenomates (OP11, Atribacteria (OP9, candidate phyla TA06 and WS6, and Marinimicrobia (SAR406. The results presented here elucidate potential roles of organisms in oil reservoir biological processes.

  15. Comparative genomics analysis of completely sequenced microbial genomes reveals the ubiquity of N-linked glycosylation in prokaryotes.

    Science.gov (United States)

    Kumar, Manjeet; Balaji, Petety V

    2011-05-01

    Glycosylation of proteins in prokaryotes has been known for the last few decades. Glycan structures and/or the glycosylation pathways have been experimentally characterized in only a small number of prokaryotes. Even this has become possible only during the last decade or so, primarily due to technological and methodological developments. Glycosylated proteins are diverse in their function and localization. Glycosylation has been shown to be associated with a wide range of biological phenomena. Characterization of the various types of glycans and the glycosylation machinery is critical to understand such processes. Such studies can help in the identification of novel targets for designing drugs, diagnostics, and engineering of therapeutic proteins. In view of this, the experimentally characterized pgl system of Campylobacter jejuni, responsible for N-linked glycosylation, has been used in this study to identify glycosylation loci in 865 prokaryotes whose genomes have been completely sequenced. Results from the present study show that only a small number of organisms have homologs for all the pgl enzymes and a few others have homologs for none of the pgl enzymes. Most of the organisms have homologs for only a subset of the pgl enzymes. There is no specific pattern for the presence or absence of pgl homologs vis-à-vis the 16S rRNA sequence-based phylogenetic tree. This may be due to differences in the glycan structures, high sequence divergence, horizontal gene transfer or non-orthologous gene displacement. Overall, the presence of homologs for pgl enzymes in a large number of organisms irrespective of their habitat, pathogenicity, energy generation mechanism, etc., hints towards the ubiquity of N-linked glycosylation in prokaryotes.

  16. Genomics and public health: development of Web-based training tools for increasing genomic awareness.

    Science.gov (United States)

    Bodzin, Jennifer; Kardia, Sharon L R; Goldenberg, Aaron; Raup, Sarah F; Bach, Janice V; Citrin, Toby

    2005-04-01

    In 2001, the Centers for Disease Control and Prevention funded three Centers for Genomics and Public Health to develop training tools for increasing genomic awareness. Over the past three years, the centers, working together with the Centers for Disease Control and Prevention's Office of Genomics and Disease Prevention, have developed tools to increase awareness of the impact genomics will have on public health practice, to provide a foundation for understanding basic genomic advances, and to translate the relevance of that information to public health practitioners' own work. These training tools serve to communicate genomic advances and their potential for integration into public heath practice. This paper highlights two of these training tools: 1) Genomics for Public Health Practitioners: The Practical Application of Genomics in Public Health Practice, a Web-based introduction to genomics, and 2) Six Weeks to Genomic Awareness, an in-depth training module on public health genomics. This paper focuses on the processes and collaborative efforts by which these live presentations were developed and delivered as Web-based training sessions.

  17. Microbial fuel cell based on Klebsiella pneumoniae biofilm

    Energy Technology Data Exchange (ETDEWEB)

    Zhang, Lixia [School of Chemistry and Environment, South China Normal University, Guangzhou 510006 (China); Guangdong Institute of Eco-Environmental and Soil Sciences, Guangzhou 510650 (China); Zhou, Shungui; Zhuang, Li; Zhang, Jintao; Lu, Na; Deng, Lifang [Guangdong Institute of Eco-Environmental and Soil Sciences, Guangzhou 510650 (China); Li, Weishan [School of Chemistry and Environment, South China Normal University, Guangzhou 510006 (China); Key Laboratory of Electrochemical Technology on Energy Storage and Power Generation in Guangdong Universities, Guangzhou 510006 (China)

    2008-10-15

    In this paper we reported a novel microbial fuel cell (MFC) based on Klebsiella pneumoniae (K. pneumoniae) strain L17 biofilm, which can utilize directly starch and glucose to generate electricity. The electrochemical activity of K. pneumoniae and the performance of the MFC were evaluated by cyclic voltammetry, scanning electron microscope (SEM) and polarization curve measurement. The results indicated that an established K. pneumoniae biofilm cells were responsible for the direct electron transfer from fuels to electrode during electricity production. The SEM observation proved the ability of K. pneumoniae to colonize on the electrode surface. This MFC generated power from the direct electrocatalysis by the K. pneumoniae strain L17 biofilm. (author)

  18. Synthetic biology for microbial production of lipid-based biofuels

    Energy Technology Data Exchange (ETDEWEB)

    d’Espaux, Leo [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint BioEnergy Inst.; Mendez-Perez, Daniel [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint BioEnergy Inst.; Li, Rachel [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint BioEnergy Inst.; Univ. of California, Berkeley, CA (United States). Dept. of Plant and Microbial Biology; Keasling, Jay D. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint BioEnergy Inst.; Univ. of California, Berkeley, CA (United States). Dept. of Plant and Microbial Biology; Univ. of California, Berkeley, CA (United States). Dept. of Bioengineering, QB3 Inst.; Univ. of California, Berkeley, CA (United States). Dept. of Chemical and Biomolecular Engineering, QB3 Inst.

    2015-10-23

    The risks of maintaining current CO2 emission trends have led to interest in producing biofuels using engineered microbes. Microbial biofuels reduce emissions because CO2 produced by fuel combustion is offset by CO2 captured by growing biomass, which is later used as feedstock for biofuel fermentation. Hydrocarbons found in petroleum fuels share striking similarity with biological lipids. Here in this paper we review synthetic metabolic pathways based on fatty acid and isoprenoid metabolism to produce alkanes and other molecules suitable as biofuels. Lastly, we further discuss engineering strategies to optimize engineered biosynthetic routes, as well as the potential of synthetic biology for sustainable manufacturing.

  19. Synthetic biology for microbial production of lipid-based biofuels.

    Science.gov (United States)

    d'Espaux, Leo; Mendez-Perez, Daniel; Li, Rachel; Keasling, Jay D

    2015-12-01

    The risks of maintaining current CO2 emission trends have led to interest in producing biofuels using engineered microbes. Microbial biofuels reduce emissions because CO2 produced by fuel combustion is offset by CO2 captured by growing biomass, which is later used as feedstock for biofuel fermentation. Hydrocarbons found in petroleum fuels share striking similarity with biological lipids. Here we review synthetic metabolic pathways based on fatty acid and isoprenoid metabolism to produce alkanes and other molecules suitable as biofuels. We further discuss engineering strategies to optimize engineered biosynthetic routes, as well as the potential of synthetic biology for sustainable manufacturing.

  20. New Markov Model Approaches to Deciphering Microbial Genome Function and Evolution: Comparative Genomics of Laterally Transferred Genes

    Energy Technology Data Exchange (ETDEWEB)

    Borodovsky, M.

    2013-04-11

    Algorithmic methods for gene prediction have been developed and successfully applied to many different prokaryotic genome sequences. As the set of genes in a particular genome is not homogeneous with respect to DNA sequence composition features, the GeneMark.hmm program utilizes two Markov models representing distinct classes of protein coding genes denoted "typical" and "atypical". Atypical genes are those whose DNA features deviate significantly from those classified as typical and they represent approximately 10% of any given genome. In addition to the inherent interest of more accurately predicting genes, the atypical status of these genes may also reflect their separate evolutionary ancestry from other genes in that genome. We hypothesize that atypical genes are largely comprised of those genes that have been relatively recently acquired through lateral gene transfer (LGT). If so, what fraction of atypical genes are such bona fide LGTs? We have made atypical gene predictions for all fully completed prokaryotic genomes; we have been able to compare these results to other "surrogate" methods of LGT prediction.

  1. CyanoBase: the cyanobacteria genome database update 2010.

    Science.gov (United States)

    Nakao, Mitsuteru; Okamoto, Shinobu; Kohara, Mitsuyo; Fujishiro, Tsunakazu; Fujisawa, Takatomo; Sato, Shusei; Tabata, Satoshi; Kaneko, Takakazu; Nakamura, Yasukazu

    2010-01-01

    CyanoBase (http://genome.kazusa.or.jp/cyanobase) is the genome database for cyanobacteria, which are model organisms for photosynthesis. The database houses cyanobacteria species information, complete genome sequences, genome-scale experiment data, gene information, gene annotations and mutant information. In this version, we updated these datasets and improved the navigation and the visual display of the data views. In addition, a web service API now enables users to retrieve the data in various formats with other tools, seamlessly.

  2. The UCSC Archaeal Genome Browser: 2012 update

    OpenAIRE

    Chan, Patricia P.; Holmes, Andrew D.; Smith, Andrew M.; Tran, Danny; Lowe, Todd M.

    2011-01-01

    The UCSC Archaeal Genome Browser (http://archaea.ucsc.edu) offers a graphical web-based resource for exploration and discovery within archaeal and other selected microbial genomes. By bringing together existing gene annotations, gene expression data, multiple-genome alignments, pre-computed sequence comparisons and other specialized analysis tracks, the genome browser is a powerful aggregator of varied genomic information. The genome browser environment maintains the current look-and-feel of ...

  3. Towards a Genome-Enabled Sensor for In Situ Monitoring of Microbial Communities in Hydrothermal Vent Fields

    Science.gov (United States)

    Shi, X.; Wu, J.; Gao, W.; Chao, S.; Zhang, W.; Meldrum, D. R.

    2008-12-01

    We report the progress towards a genome-enabled instrument to monitor variations in microbial community in hydrothermal vent fields for long durations. Our long-term goal is to deploy an in situ microarray device embedded in a lab-on-a-chip device. The microarray detects both the 16S rRNA to identify prokaryotic species and cDNA (converted from mRNA) of selected functional genes to understand activities and dynamics of ocean microbial communities. Each automated, self-contained instrument contains a stack of disposable lab-on-a-chip devices. All measurements are performed on individual chips, starting with pumping seawater through on-chip filter to collect microbes, lysing cells to release nucleic acids, and then analyzing their genomic information. To aid the effort of building the first functional microarray, we participated in the TN-221 cruise funded by the National Science Foundation's Ocean Observatories Initiative to map the seafloor in areas of high scientific interest. During the cruise, multiple deep-sea water samples were collected. The microbes were filtered, frozen and shipped to our laboratory for molecular analysis. The DNA was isolated from these samples and a detailed metagenomic analysis is ongoing for samples of one site (80 km offshore of Oregon coast, 380 km away from the Axial Seamount vent field, and 5 meters above the 780- meter deep seafloor). From the isolated chromosomal DNA the 16S rRNA clone library was constructed and resultant clones were sequenced. Although the fluorescence microscopic analyses showed the density of biomass is relatively low, phylogenetic results suggested high diversity in these microbial communities. In addition, efforts were made to isolate mRNA directly from these deep-sea ocean samples. The information obtained from these analyses will be essential for development of oligonucleotide probes for the microarray device. First two authors contributed equally.

  4. Deeper insight into the structure of the anaerobic digestion microbial community; the biogas microbiome database is expanded with 157 new genomes

    DEFF Research Database (Denmark)

    Treu, Laura; Kougias, Panagiotis; Campanaro, Stefano

    2016-01-01

    This research aimed to better characterize the biogas microbiome by means of high throughput metagenomic sequencing and to elucidate the core microbial consortium existing in biogas reactors independently from the operational conditions. Assembly of shotgun reads followed by an established binning...... strategy resulted in the highest, up to now, extraction of microbial genomes involved in biogas producing systems. From the 236 extracted genome bins, it was remarkably found that the vast majority of them could only be characterized at high taxonomic levels. This result confirms that the biogas microbiome...

  5. Pyrosequencing Based Microbial Community Analysis of Stabilized Mine Soils

    Science.gov (United States)

    Park, J. E.; Lee, B. T.; Son, A.

    2015-12-01

    Heavy metals leached from exhausted mines have been causing severe environmental problems in nearby soils and groundwater. Environmental mitigation was performed based on the heavy metal stabilization using Calcite and steel slag in Korea. Since the soil stabilization only temporarily immobilizes the contaminants to soil matrix, the potential risk of re-leaching heavy metal still exists. Therefore the follow-up management of stabilized soils and the corresponding evaluation methods are required to avoid the consequent contamination from the stabilized soils. In this study, microbial community analysis using pyrosequencing was performed for assessing the potential leaching of the stabilized soils. As a result of rarefaction curve and Chao1 and Shannon indices, the stabilized soil has shown lower richness and diversity as compared to non-contaminated negative control. At the phyla level, as the degree of contamination increases, most of phyla decreased with only exception of increased proteobacteria. Among proteobacteria, gamma-proteobacteria increased against the heavy metal contamination. At the species level, Methylobacter tundripaludum of gamma-proteobacteria showed the highest relative portion of microbial community, indicating that methanotrophs may play an important role in either solubilization or immobilization of heavy metals in stabilized soils.

  6. Post-Genomics Approaches towards Monitoring Changes within the Microbial Ecology of the Gut

    Science.gov (United States)

    Tuohy, Kieran M.; Abecia, Leticia; Deaville, Eddie R.; Fava, Francesca; Klinder, Annett; Shen, Qing

    The human gut microbiota, comprising many hundreds of different microbial species, has closely co-evolved with its human host over the millennia. Diet has been a major driver of this co-evolution, in particular dietary non-digestible carbohydrates. This dietary fraction reaches the colon and becomes available for microbial fermentation, and it is in the colon that the great diversity of gut microorganisms resides. For the vast majority of our evolutionary history humans followed hunter-gatherer life-styles and consumed diets with many times more non-digestible carbohydrates, fiber and whole plant polyphenol rich foods than typical Western style diets today.

  7. Acid-base interactions in microbial adhesion to hexadecane and chloroform

    NARCIS (Netherlands)

    Bos, R; Busscher, HJ; Geertsema-Doornbusch, GI; Van Der Mei, HC; Mittal, KL

    2000-01-01

    Acid-base interactions play an important role in adhesion, including microbial adhesion to surfaces. Qualitatively acid-base interactions in microbial adhesion can be demonstrated by comparing adhesion to hexadecane (a negatively charged interface in aqueous solutions, unable to exert acid-base inte

  8. Coordinates and intervals in graph-based reference genomes.

    Science.gov (United States)

    Rand, Knut D; Grytten, Ivar; Nederbragt, Alexander J; Storvik, Geir O; Glad, Ingrid K; Sandve, Geir K

    2017-05-18

    It has been proposed that future reference genomes should be graph structures in order to better represent the sequence diversity present in a species. However, there is currently no standard method to represent genomic intervals, such as the positions of genes or transcription factor binding sites, on graph-based reference genomes. We formalize offset-based coordinate systems on graph-based reference genomes and introduce methods for representing intervals on these reference structures. We show the advantage of our methods by representing genes on a graph-based representation of the newest assembly of the human genome (GRCh38) and its alternative loci for regions that are highly variable. More complex reference genomes, containing alternative loci, require methods to represent genomic data on these structures. Our proposed notation for genomic intervals makes it possible to fully utilize the alternative loci of the GRCh38 assembly and potential future graph-based reference genomes. We have made a Python package for representing such intervals on offset-based coordinate systems, available at https://github.com/uio-cels/offsetbasedgraph . An interactive web-tool using this Python package to visualize genes on a graph created from GRCh38 is available at https://github.com/uio-cels/genomicgraphcoords .

  9. Ontology-Based Search of Genomic Metadata.

    Science.gov (United States)

    Fernandez, Javier D; Lenzerini, Maurizio; Masseroli, Marco; Venco, Francesco; Ceri, Stefano

    2016-01-01

    The Encyclopedia of DNA Elements (ENCODE) is a huge and still expanding public repository of more than 4,000 experiments and 25,000 data files, assembled by a large international consortium since 2007; unknown biological knowledge can be extracted from these huge and largely unexplored data, leading to data-driven genomic, transcriptomic, and epigenomic discoveries. Yet, search of relevant datasets for knowledge discovery is limitedly supported: metadata describing ENCODE datasets are quite simple and incomplete, and not described by a coherent underlying ontology. Here, we show how to overcome this limitation, by adopting an ENCODE metadata searching approach which uses high-quality ontological knowledge and state-of-the-art indexing technologies. Specifically, we developed S.O.S. GeM (http://www.bioinformatics.deib.polimi.it/SOSGeM/), a system supporting effective semantic search and retrieval of ENCODE datasets. First, we constructed a Semantic Knowledge Base by starting with concepts extracted from ENCODE metadata, matched to and expanded on biomedical ontologies integrated in the well-established Unified Medical Language System. We prove that this inference method is sound and complete. Then, we leveraged the Semantic Knowledge Base to semantically search ENCODE data from arbitrary biologists' queries. This allows correctly finding more datasets than those extracted by a purely syntactic search, as supported by the other available systems. We empirically show the relevance of found datasets to the biologists' queries.

  10. CRISPR/Cas9 based genome editing of Penicillium chrysogenum

    NARCIS (Netherlands)

    Pohl, Carsten; Kiel, Jan A K W; Driessen, Arnold J M; Bovenberg, Roel A L; Nygård, Yvonne

    2016-01-01

    CRISPR/Cas9 based systems have emerged as versatile platforms for precision genome editing in a wide range of organisms. Here we have developed powerful CRISPR/Cas9 tools for marker-based and marker-free genome modifications in Penicillium chrysogenum, a model filamentous fungus and industrially rel

  11. Hellbender genome sequences shed light on genomic expansion at the base of crown salamanders.

    Science.gov (United States)

    Sun, Cheng; Mueller, Rachel Lockridge

    2014-07-01

    Among animals, genome sizes range from 20 Mb to 130 Gb, with 380-fold variation across vertebrates. Most of the largest vertebrate genomes are found in salamanders, an amphibian clade of 660 species. Thus, salamanders are an important system for studying causes and consequences of genomic gigantism. Previously, we showed that plethodontid salamander genomes accumulate higher levels of long terminal repeat (LTR) retrotransposons than do other vertebrates, although the evolutionary origins of such sequences remained unexplored. We also showed that some salamanders in the family Plethodontidae have relatively slow rates of DNA loss through small insertions and deletions. Here, we present new data from Cryptobranchus alleganiensis, the hellbender. Cryptobranchus and Plethodontidae span the basal phylogenetic split within salamanders; thus, analyses incorporating these taxa can shed light on the genome of the ancestral crown salamander lineage, which underwent expansion. We show that high levels of LTR retrotransposons likely characterize all crown salamanders, suggesting that disproportionate expansion of this transposable element (TE) class contributed to genomic expansion. Phylogenetic and age distribution analyses of salamander LTR retrotransposons indicate that salamanders' high TE levels reflect persistence and diversification of ancestral TEs rather than horizontal transfer events. Finally, we show that relatively slow DNA loss rates through small indels likely characterize all crown salamanders, suggesting that a decreased DNA loss rate contributed to genomic expansion at the clade's base. Our identification of shared genomic features across phylogenetically distant salamanders is a first step toward identifying the evolutionary processes underlying accumulation and persistence of high levels of repetitive sequence in salamander genomes.

  12. Comparisons of Shewanella strains based on genome annotations, modeling and experiments

    Energy Technology Data Exchange (ETDEWEB)

    Ong, Wai Kit; Vu, Trang; Lovendahl, Klaus N.; Llull, Jenna; Serres, Margaret; Romine, Margaret F.; Reed, Jennifer L.

    2014-01-01

    Shewanella is a genus of facultatively anaerobic, Gram-negative bacteria that have highly adaptable metabolism which allows them to thrive in diverse environments. This quality makes them attractive target bacteria for research in bioremediation and microbial fuel cell applications. Constraint-based modeling is a useful tool for helping researchers gain insights into the metabolic capabilities of these bacteria. However, Shewanella oneidensis MR-1 is the only strain with a genome-scale metabolic model constructed out of the 22 sequenced Shewanella strains.

  13. StellaBase: the Nematostella vectensis Genomics Database.

    Science.gov (United States)

    Sullivan, James C; Ryan, Joseph F; Watson, James A; Webb, Jeramy; Mullikin, James C; Rokhsar, Daniel; Finnerty, John R

    2006-01-01

    StellaBase, the Nematostella vectensis Genomics Database, is a web-based resource that will facilitate desktop and bench-top studies of the starlet sea anemone. Nematostella is an emerging model organism that has already proven useful for addressing fundamental questions in developmental evolution and evolutionary genomics. StellaBase allows users to query the assembled Nematostella genome, a confirmed gene library, and a predicted genome using both keyword and homology based search functions. Data provided by these searches will elucidate gene family evolution in early animals. Unique research tools, including a Nematostella genetic stock library, a primer library, a literature repository and a gene expression library will provide support to the burgeoning Nematostella research community. The development of StellaBase accompanies significant upgrades to CnidBase, the Cnidarian Evolutionary Genomics Database. With the completion of the first sequenced cnidarian genome, genome comparison tools have been added to CnidBase. In addition, StellaBase provides a framework for the integration of additional species-specific databases into CnidBase. StellaBase is available at http://www.stellabase.org.

  14. Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology

    DEFF Research Database (Denmark)

    Cao, Hongzhi; Hastie, Alex R.; Cao, Dandan;

    2014-01-01

    than 1 kb. Excluding the 59 SVs (54 insertions/deletions, 5 inversions) that overlap with N-base gaps in the reference assembly hg19, 666 non-gap SVs remained, and 396 of them (60%) were verified by paired-end data from whole-genome sequencing-based re-sequencing or de novo assembly sequence from...... fosmid data. Of the remaining 270 SVs, 260 are insertions and 213 overlap known SVs in the Database of Genomic Variants. Overall, 609 out of 666 (90%) variants were supported by experimental orthogonal methods or historical evidence in public databases. At the same time, genome mapping also provides...

  15. Role of Genomic Typing in Taxonomy, Evolutionary Genetics, and Microbial Epidemiology

    Science.gov (United States)

    van Belkum, Alex; Struelens, Marc; de Visser, Arjan; Verbrugh, Henri; Tibayrenc, Michel

    2001-01-01

    Currently, genetic typing of microorganisms is widely used in several major fields of microbiological research. Taxonomy, research aimed at elucidation of evolutionary dynamics or phylogenetic relationships, population genetics of microorganisms, and microbial epidemiology all rely on genetic typing data for discrimination between genotypes. Apart from being an essential component of these fundamental sciences, microbial typing clearly affects several areas of applied microbiogical research. The epidemiological investigation of outbreaks of infectious diseases and the measurement of genetic diversity in relation to relevant biological properties such as pathogenicity, drug resistance, and biodegradation capacities are obvious examples. The diversity among nucleic acid molecules provides the basic information for all fields described above. However, researchers in various disciplines tend to use different vocabularies, a wide variety of different experimental methods to monitor genetic variation, and sometimes widely differing modes of data processing and interpretation. The aim of the present review is to summarize the technological and fundamental concepts used in microbial taxonomy, evolutionary genetics, and epidemiology. Information on the nomenclature used in the different fields of research is provided, descriptions of the diverse genetic typing procedures are presented, and examples of both conceptual and technological research developments for Escherichia coli are included. Recommendations for unification of the different fields through standardization of laboratory techniques are made. PMID:11432813

  16. Coexisting/Coexpressing Genomic Libraries (CoGeL) identify interactions among distantly located genetic loci for developing complex microbial phenotypes.

    Science.gov (United States)

    Nicolaou, Sergios A; Gaida, Stefan M; Papoutsakis, Eleftherios T

    2011-12-01

    In engineering novel microbial strains for biotechnological applications, beyond a priori identifiable pathways to be engineered, it is becoming increasingly important to develop complex, ill-defined cellular phenotypes. One approach is to screen genomic or metagenomic libraries to identify genes imparting desirable phenotypes, such as tolerance to stressors or novel catabolic programs. Such libraries are limited by their inability to identify interactions among distant genetic loci. To solve this problem, we constructed plasmid- and fosmid-based Escherichia coli Coexisting/Coexpressing Genomic Libraries (CoGeLs). As a proof of principle, four sets of two genes of the l-lysine biosynthesis pathway distantly located on the E. coli chromosome were knocked out. Upon transformation of these auxotrophs with CoGeLs, cells growing without supplementation were found to harbor library inserts containing the knocked-out genes demonstrating the interaction between the two libraries. CoGeLs were also screened to identify genetic loci that work synergistically to create the considerably more complex acid-tolerance phenotype. CoGeL screening identified combination of genes known to enhance acid tolerance (gadBC operon and adiC), but also identified the novel combination of arcZ and recA that greatly enhanced acid tolerance by 9000-fold. arcZ is a small RNA that we show increases pH tolerance alone and together with recA.

  17. Deeper insight into the structure of the anaerobic digestion microbial community; the biogas microbiome database is expanded with 157 new genomes.

    Science.gov (United States)

    Treu, Laura; Kougias, Panagiotis G; Campanaro, Stefano; Bassani, Ilaria; Angelidaki, Irini

    2016-09-01

    This research aimed to better characterize the biogas microbiome by means of high throughput metagenomic sequencing and to elucidate the core microbial consortium existing in biogas reactors independently from the operational conditions. Assembly of shotgun reads followed by an established binning strategy resulted in the highest, up to now, extraction of microbial genomes involved in biogas producing systems. From the 236 extracted genome bins, it was remarkably found that the vast majority of them could only be characterized at high taxonomic levels. This result confirms that the biogas microbiome is comprised by a consortium of unknown species. A comparative analysis between the genome bins of the current study and those extracted from a previous metagenomic assembly demonstrated a similar phylogenetic distribution of the main taxa. Finally, this analysis led to the identification of a subset of common microbes that could be considered as the core essential group in biogas production.

  18. Microarray-based whole-genome hybridization as a tool for determining procaryotic species relatedness

    Energy Technology Data Exchange (ETDEWEB)

    Wu, L.; Liu, X.; Fields, M.W.; Thompson, D.K.; Bagwell, C.E.; Tiedje, J. M.; Hazen, T.C.; Zhou, J.

    2008-01-15

    The definition and delineation of microbial species are of great importance and challenge due to the extent of evolution and diversity. Whole-genome DNA-DNA hybridization is the cornerstone for defining procaryotic species relatedness, but obtaining pairwise DNA-DNA reassociation values for a comprehensive phylogenetic analysis of procaryotes is tedious and time consuming. A previously described microarray format containing whole-genomic DNA (the community genome array or CGA) was rigorously evaluated as a high-throughput alternative to the traditional DNA-DNA reassociation approach for delineating procaryotic species relationships. DNA similarities for multiple bacterial strains obtained with the CGA-based hybridization were comparable to those obtained with various traditional whole-genome hybridization methods (r=0.87, P<0.01). Significant linear relationships were also observed between the CGA-based genome similarities and those derived from small subunit (SSU) rRNA gene sequences (r=0.79, P<0.0001), gyrB sequences (r=0.95, P<0.0001) or REP- and BOX-PCR fingerprinting profiles (r=0.82, P<0.0001). The CGA hybridization-revealed species relationships in several representative genera, including Pseudomonas, Azoarcus and Shewanella, were largely congruent with previous classifications based on various conventional whole-genome DNA-DNA reassociation, SSU rRNA and/or gyrB analyses. These results suggest that CGA-based DNA-DNA hybridization could serve as a powerful, high-throughput format for determining species relatedness among microorganisms.

  19. From DNA Sequences to Chemical Structures – Methods for Mining Microbial Genomic and Metagenomic Data Sets for New Natural Products

    Directory of Open Access Journals (Sweden)

    Jurica Zucko

    2010-01-01

    Full Text Available Rapid mining of large genomic and metagenomic data sets for modular polyketide synthases, non-ribosomal peptide synthetases and hybrid polyketide synthase/non-ribosomal peptide synthetase biosynthetic gene clusters has been achieved using the generic computer program packages ClustScan and CompGen. These program packages perform the annotation with the hierarchical structuring into polypeptides, modules and domains, as well as storage and graphical presentations of the data. This aims to achieve the most accurate predictions of the activities and specificities of catalytically active domains that can be made with present knowledge, leading to a prediction of the most likely chemical structures produced by these enzymes. The program packages also allow generation of novel clusters by homologous recombination of the annotated genes in silico. ClustScan and CompGen were used to construct a custom database of known compounds (CSDB and of predicted entirely novel recombinant products (r-CSDB that can be used for in silico screening with computer aided drug design technology. The use of these programs has been exemplified by analysing genomic sequences from terrestrial prokaryotes and eukaryotic microorganisms, a marine metagenomic data set and a newly discovered example of a 'shared metabolic pathway' in marine-microbial endosymbiosis.

  20. An expanded role for microbial physiology in metabolic engineering and functional genomics: moving towards systems biology

    DEFF Research Database (Denmark)

    Nielsen, Jens; Olsson, Lisbeth

    2002-01-01

    . With the progress in molecular biology it has become possible to optimize industrial fermentations through introduction of directed genetic modification - an approach referred to as metabolic engineering. Furthermore, as a consequence of large sequencing programs the complete genomic sequence has become available...

  1. Robust Inference of Genetic Exchange Communities from Microbial Genomes Using TF-IDF

    Science.gov (United States)

    Cong, Yingnan; Chan, Yao-ban; Phillips, Charles A.; Langston, Michael A.; Ragan, Mark A.

    2017-01-01

    Bacteria and archaea can exchange genetic material across lineages through processes of lateral genetic transfer (LGT). Collectively, these exchange relationships can be modeled as a network and analyzed using concepts from graph theory. In particular, densely connected regions within an LGT network have been defined as genetic exchange communities (GECs). However, it has been problematic to construct networks in which edges solely represent LGT. Here we apply term frequency-inverse document frequency (TF-IDF), an alignment-free method originating from document analysis, to infer regions of lateral origin in bacterial genomes. We examine four empirical datasets of different size (number of genomes) and phyletic breadth, varying a key parameter (word length k) within bounds established in previous work. We map the inferred lateral regions to genes in recipient genomes, and construct networks in which the nodes are groups of genomes, and the edges natively represent LGT. We then extract maximum and maximal cliques (i.e., GECs) from these graphs, and identify nodes that belong to GECs across a wide range of k. Most surviving lateral transfer has happened within these GECs. Using Gene Ontology enrichment tests we demonstrate that biological processes associated with metabolism, regulation and transport are often over-represented among the genes affected by LGT within these communities. These enrichments are largely robust to change of k. PMID:28154557

  2. INTEGRATED GENOME-BASED STUDIES OF SHEWANELLA ECOPHYSIOLOGY

    Energy Technology Data Exchange (ETDEWEB)

    NEALSON, KENNETH H.

    2013-10-15

    laboratories. Applications: 1. Corrosion: Electron flow is often part of the corrosive process, and several studies were done in concert with this proposal with regard to the ability of EET-capable bacteria to enhance, inhibit, or detect corrosion. These included using EET-capable bacteria to detect corrosion in its earliest stages [5], to use corrosion-causing bacteria for the study of the microbe/mineral interface during corrosion [1], and to study the groups of microbes involved with corrosion of natural systems [19]. 2. Bioenergy and microbial fuel cells: The production of electricity by Shewanella was shown early in this program (several years ago) to be dependent on the genes for extracellular electron transport (EET), and applied work involved the testing of various strains and conditions for the optimization of current production by the shewanellae [11,14,16]. 3. Identification of shewanellae strains: Based on similarities seen in genomic comparisons, a rapid method was employed for distinguishing between shewanellae strains [17]. Interactions with other laboratories: This grant was an extension of a grant involving the so-called ?Shewanella Federation?, and as such, a number of our publications were joint with other members of this group. The groups included: 1. Pacific Northwest Laboratories ? 2. Oak Ridge National Labs 3. Michigan State University 4. University of Oklahoma 5. Naval Research Laboratory, Washington DC 6. Burnham Medical Research Institute, San Diego 7. J. Craig Venter Institute, San Diego Education: Graduate Students: Michael Waters, Ph.D. ? at NIST, Washington D.C. Lewis Hsu, Ph.D. ? at NRL, San Diego Howard Harris, Ph.D. ? Postdoc at University, France Everett Salas, Ph.D. ? Scientist at Chevron McLean, Jeffrey, Ph.D. ? Scientist at J. Craig Venter Institute McCrow, John, Ph.D. ? Scientist at J. Craig Venter Institute Postdocs: Mohamed El-Naggar ? Professor of Physics, USC Jinjun Kan ? Senior Researcher at Undergraduatges: During this year, we had

  3. SISP: a Fast Species Identification System for Prokaryotes Based on Total Nucleotide Identity of Whole Genome Sequences

    Directory of Open Access Journals (Sweden)

    Jiapeng Chen

    2015-06-01

    Full Text Available In the genomic era, new techniques and criteria are proposed to improve the traditionally phenotypic and biochemical test–based approaches for prokaryotic species definition. Among them, average nucleotide identity (ANI mirrors DNA-DNA hybridization and is widely used by the microbial research community. However, our test shows that ANI possibly defines distinct taxa as the same species when they shared highly homologous sequences in a very short genomic region. In this study, we propose an improved algorithm named total nucleotide identity (TNI for use in bacterial taxonomy; this algorithm provided higher accuracy for species classification than ANI. Furthermore, we developed a species identification system for prokaryotes (SISP based on pairwise TNI of 3,073 genomes acquired from GenBank. For a submitted query genome, SISP can quickly find its most closely related genome from the established database based on the TNI calculation and infer the possible species of the query genome. Given a criterion of TNI > 70%, SISP has an accuracy that was above 90% for 3,596 prokaryotic genomes. SISP is open source and is available at https://github.com/chjp/SISProkaryotes.

  4. wFleaBase: the Daphnia genome database

    Directory of Open Access Journals (Sweden)

    Singan Vasanth R

    2005-03-01

    Full Text Available Abstract Background wFleaBase is a database with the necessary infrastructure to curate, archive and share genetic, molecular and functional genomic data and protocols for an emerging model organism, the microcrustacean Daphnia. Commonly known as the water-flea, Daphnia's ecological merit is unequaled among metazoans, largely because of its sentinel role within freshwater ecosystems and over 200 years of biological investigations. By consequence, the Daphnia Genomics Consortium (DGC has launched an interdisciplinary research program to create the resources needed to study genes that affect ecological and evolutionary success in natural environments. Discussion These tools include the genome database wFleaBase, which currently contains functions to search and extract information from expressed sequenced tags, genome survey sequences and full genome sequencing projects. This new database is built primarily from core components of the Generic Model Organism Database project, and related bioinformatics tools. Summary Over the coming year, preliminary genetic maps and the nearly complete genomic sequence of Daphnia pulex will be integrated into wFleaBase, including gene predictions and ortholog assignments based on sequence similarities with eukaryote genes of known function. wFleaBase aims to serve a large ecological and evolutionary research community. Our challenge is to rapidly expand its content and to ultimately integrate genetic and functional genomic information with population-level responses to environmental challenges. URL: http://wfleabase.org/.

  5. INTEGRATED GENOME-BASED STUDIES OF SHEWANELLA ECOPHYSIOLOGY

    Energy Technology Data Exchange (ETDEWEB)

    NEALSON, KENNETH H.

    2013-10-15

    laboratories. Applications: 1. Corrosion: Electron flow is often part of the corrosive process, and several studies were done in concert with this proposal with regard to the ability of EET-capable bacteria to enhance, inhibit, or detect corrosion. These included using EET-capable bacteria to detect corrosion in its earliest stages [5], to use corrosion-causing bacteria for the study of the microbe/mineral interface during corrosion [1], and to study the groups of microbes involved with corrosion of natural systems [19]. 2. Bioenergy and microbial fuel cells: The production of electricity by Shewanella was shown early in this program (several years ago) to be dependent on the genes for extracellular electron transport (EET), and applied work involved the testing of various strains and conditions for the optimization of current production by the shewanellae [11,14,16]. 3. Identification of shewanellae strains: Based on similarities seen in genomic comparisons, a rapid method was employed for distinguishing between shewanellae strains [17]. Interactions with other laboratories: This grant was an extension of a grant involving the so-called ?Shewanella Federation?, and as such, a number of our publications were joint with other members of this group. The groups included: 1. Pacific Northwest Laboratories ? 2. Oak Ridge National Labs 3. Michigan State University 4. University of Oklahoma 5. Naval Research Laboratory, Washington DC 6. Burnham Medical Research Institute, San Diego 7. J. Craig Venter Institute, San Diego Education: Graduate Students: Michael Waters, Ph.D. ? at NIST, Washington D.C. Lewis Hsu, Ph.D. ? at NRL, San Diego Howard Harris, Ph.D. ? Postdoc at University, France Everett Salas, Ph.D. ? Scientist at Chevron McLean, Jeffrey, Ph.D. ? Scientist at J. Craig Venter Institute McCrow, John, Ph.D. ? Scientist at J. Craig Venter Institute Postdocs: Mohamed El-Naggar ? Professor of Physics, USC Jinjun Kan ? Senior Researcher at Undergraduatges: During this year, we had

  6. Rhizome of life, catastrophes, sequence exchanges, gene creations and giant viruses: How microbial genomics challenges Darwin

    OpenAIRE

    Vicky eMerhej; Didier eRaoult

    2012-01-01

    Darwin’s theory about the evolution of species has been the object of considerable dispute. In this review, we have described seven key principles in Darwin’s book The Origin of Species and tried to present how genomics challenge each of these concepts and improve our knowledge about evolution. Darwin believed that species evolution consists on a positive directional selection ensuring the survival of the fittest. The most developed state of the species is characterized by increasing complexi...

  7. Rhizome of life, catastrophes, sequence exchanges, gene creations, and giant viruses: how microbial genomics challenges Darwin

    OpenAIRE

    Merhej, Vicky; Raoult, Didier

    2012-01-01

    Darwin's theory about the evolution of species has been the object of considerable dispute. In this review, we have described seven key principles in Darwin's book The Origin of Species and tried to present how genomics challenge each of these concepts and improve our knowledge about evolution. Darwin believed that species evolution consists on a positive directional selection ensuring the “survival of the fittest.” The most developed state of the species is characterized by increasing comple...

  8. CMG-Biotools, a Free Workbench for Basic Comparative Microbial Genomics

    DEFF Research Database (Denmark)

    Vesth, Tammi Camilla; Lagesen, Karin; Acar, Öncel;

    2013-01-01

    This paper shows the strength and diverse use of the CMG-biotools system. The system can be installed on a vide range of host operating systems and utilizes as much of the host computer as desired. It allows the user to compare multiple genomes, from various sources using standardized data format...... and intuitive visualizations of results. The examples presented here clearly shows that users with limited computational experience can perform complicated analysis without much training....

  9. Genomic and metagenomic surveys of hydrogenase distribution indicate H2 is a widely utilised energy source for microbial growth and survival.

    Science.gov (United States)

    Greening, Chris; Biswas, Ambarish; Carere, Carlo R; Jackson, Colin J; Taylor, Matthew C; Stott, Matthew B; Cook, Gregory M; Morales, Sergio E

    2016-03-01

    Recent physiological and ecological studies have challenged the long-held belief that microbial metabolism of molecular hydrogen (H2) is a niche process. To gain a broader insight into the importance of microbial H2 metabolism, we comprehensively surveyed the genomic and metagenomic distribution of hydrogenases, the reversible enzymes that catalyse the oxidation and evolution of H2. The protein sequences of 3286 non-redundant putative hydrogenases were curated from publicly available databases. These metalloenzymes were classified into multiple groups based on (1) amino acid sequence phylogeny, (2) metal-binding motifs, (3) predicted genetic organisation and (4) reported biochemical characteristics. Four groups (22 subgroups) of [NiFe]-hydrogenase, three groups (6 subtypes) of [FeFe]-hydrogenases and a small group of [Fe]-hydrogenases were identified. We predict that this hydrogenase diversity supports H2-based respiration, fermentation and carbon fixation processes in both oxic and anoxic environments, in addition to various H2-sensing, electron-bifurcation and energy-conversion mechanisms. Hydrogenase-encoding genes were identified in 51 bacterial and archaeal phyla, suggesting strong pressure for both vertical and lateral acquisition. Furthermore, hydrogenase genes could be recovered from diverse terrestrial, aquatic and host-associated metagenomes in varying proportions, indicating a broad ecological distribution and utilisation. Oxygen content (pO2) appears to be a central factor driving the phylum- and ecosystem-level distribution of these genes. In addition to compounding evidence that H2 was the first electron donor for life, our analysis suggests that the great diversification of hydrogenases has enabled H2 metabolism to sustain the growth or survival of microorganisms in a wide range of ecosystems to the present day. This work also provides a comprehensive expanded system for classifying hydrogenases and identifies new prospects for investigating H2

  10. Metabolic potential of microbial mats and microbialites: Autotrophic capabilities described by an in silico stoichiometric approach from shared genomic resources.

    Science.gov (United States)

    Cerqueda-García, Daniel; Falcón, Luisa I

    2016-08-01

    Microbialites and microbial mats are complex communities with high phylogenetic diversity. These communities are mostly composed of bacteria and archaea, which are the earliest living forms on Earth and relevant to biogeochemical evolution. In this study, we identified the shared metabolic pathways for uptake of inorganic C and N in microbial mats and microbialites based on metagenomic data sets. An in silico analysis for autotrophic pathways was used to trace the paths of C and N to the system, following an elementary flux modes (EFM) approach, resulting in a stoichiometric model. The fragility was analyzed by the minimal cut sets method. We found four relevant pathways for the incorporation of CO2 (Calvin cycle, reverse tricarboxylic acid cycle, reductive acetyl-CoA pathway, and dicarboxylate/4-hydroxybutyrate cycle), some of them present only in archaea, while nitrogen fixation was the most important source of N to the system. The metabolic potential to incorporate nitrate to biomass was also relevant. The fragility of the network was low, suggesting a high redundancy of the autotrophic pathways due to their broad metabolic diversity, and highlighting the relevance of reducing power source. This analysis suggests that microbial mats and microbialites are "metabolic pumps" for the incorporation of inorganic gases and formation of organic matter.

  11. Complete genome sequence of Enterobacter sp. IIT-BT 08: A potential microbial strain for high rate hydrogen production.

    Science.gov (United States)

    Khanna, Namita; Ghosh, Ananta Kumar; Huntemann, Marcel; Deshpande, Shweta; Han, James; Chen, Amy; Kyrpides, Nikos; Mavrommatis, Kostas; Szeto, Ernest; Markowitz, Victor; Ivanova, Natalia; Pagani, Ioanna; Pati, Amrita; Pitluck, Sam; Nolan, Matt; Woyke, Tanja; Teshima, Hazuki; Chertkov, Olga; Daligault, Hajnalka; Davenport, Karen; Gu, Wei; Munk, Christine; Zhang, Xiaojing; Bruce, David; Detter, Chris; Xu, Yan; Quintana, Beverly; Reitenga, Krista; Kunde, Yulia; Green, Lance; Erkkila, Tracy; Han, Cliff; Brambilla, Evelyne-Marie; Lang, Elke; Klenk, Hans-Peter; Goodwin, Lynne; Chain, Patrick; Das, Debabrata

    2013-12-20

    Enterobacter sp. IIT-BT 08 belongs to Phylum: Proteobacteria, Class: Gammaproteobacteria, Order: Enterobacteriales, Family: Enterobacteriaceae. The organism was isolated from the leaves of a local plant near the Kharagpur railway station, Kharagpur, West Bengal, India. It has been extensively studied for fermentative hydrogen production because of its high hydrogen yield. For further enhancement of hydrogen production by strain development, complete genome sequence analysis was carried out. Sequence analysis revealed that the genome was linear, 4.67 Mbp long and had a GC content of 56.01%. The genome properties encode 4,393 protein-coding and 179 RNA genes. Additionally, a putative pathway of hydrogen production was suggested based on the presence of formate hydrogen lyase complex and other related genes identified in the genome. Thus, in the present study we describe the specific properties of the organism and the generation, annotation and analysis of its genome sequence as well as discuss the putative pathway of hydrogen production by this organism.

  12. Effects of phosphate limitation on soluble microbial products and microbial community structure in semi-continuous Synechocystis-based photobioreactors.

    Science.gov (United States)

    Zevin, Alexander S; Nam, Taekgul; Rittmann, Bruce; Krajmalnik-Brown, Rosa

    2015-09-01

    All bacteria release organic compounds called soluble microbial products (SMP) as a part of their normal metabolism. In photobioreactor (PBR) settings, SMP produced by cyanobacteria represent a major pool of carbon and electrons available to heterotrophic bacteria. Thus, SMP in PBRs are a major driver for the growth of heterotrophic bacteria, and understanding the distribution of SMP in PBRs is an important step toward proper management of PBR microbial communities. Here, we analyzed the SMP and microbial communities in two Synechocystis sp. PCC6803-based PBRs. The first PBR (PBRP0) became phosphate limited after several days of operation, while the second PBR (PBRP+) did not have phosphate limitation. Heterotrophic bacteria were detected in both PBRs, but PBRP0 had a much higher proportion of heterotrophic bacteria than PBRP+. Furthermore, PBRP+ had greater biomass production and lower SMP production per unit biomass than PBRP0. Carbohydrates that were most likely derived from hydrolysis of extracellular polymeric substances (EPS) dominated the SMP in PBRP0, while products resulting from cell lysis or decay dominated the SMP in PBRP+. Together, our data support that maintaining phosphate availability in Synechocystis-based PBRs is important for managing SMP and, thus, the heterotrophic community.

  13. Genome-based discovery, structure prediction and functional analysis of cyclic lipopeptide antibiotics in Pseudomonas species.

    Science.gov (United States)

    de Bruijn, Irene; de Kock, Maarten J D; Yang, Meng; de Waard, Pieter; van Beek, Teris A; Raaijmakers, Jos M

    2007-01-01

    Analysis of microbial genome sequences have revealed numerous genes involved in antibiotic biosynthesis. In Pseudomonads, several gene clusters encoding non-ribosomal peptide synthetases (NRPSs) were predicted to be involved in the synthesis of cyclic lipopeptide (CLP) antibiotics. Most of these predictions, however, are untested and the association between genome sequence and biological function of the predicted metabolite is lacking. Here we report the genome-based identification of previously unknown CLP gene clusters in plant pathogenic Pseudomonas syringae strains B728a and DC3000 and in plant beneficial Pseudomonas fluorescens Pf0-1 and SBW25. For P. fluorescens SBW25, a model strain in studying bacterial evolution and adaptation, the structure of the CLP with a predicted 9-amino acid peptide moiety was confirmed by chemical analyses. Mutagenesis confirmed that the three identified NRPS genes are essential for CLP synthesis in strain SBW25. CLP production was shown to play a key role in motility, biofilm formation and in activity of SBW25 against zoospores of Phytophthora infestans. This is the first time that an antimicrobial metabolite is identified from strain SBW25. The results indicate that genome mining may enable the discovery of unknown gene clusters and traits that are highly relevant in the lifestyle of plant beneficial and plant pathogenic bacteria.

  14. AgBase: a functional genomics resource for agriculture

    Directory of Open Access Journals (Sweden)

    Hill David P

    2006-09-01

    Full Text Available Abstract Background Many agricultural species and their pathogens have sequenced genomes and more are in progress. Agricultural species provide food, fiber, xenotransplant tissues, biopharmaceuticals and biomedical models. Moreover, many agricultural microorganisms are human zoonoses. However, systems biology from functional genomics data is hindered in agricultural species because agricultural genome sequences have relatively poor structural and functional annotation and agricultural research communities are smaller with limited funding compared to many model organism communities. Description To facilitate systems biology in these traditionally agricultural species we have established "AgBase", a curated, web-accessible, public resource http://www.agbase.msstate.edu for structural and functional annotation of agricultural genomes. The AgBase database includes a suite of computational tools to use GO annotations. We use standardized nomenclature following the Human Genome Organization Gene Nomenclature guidelines and are currently functionally annotating chicken, cow and sheep gene products using the Gene Ontology (GO. The computational tools we have developed accept and batch process data derived from different public databases (with different accession codes, return all existing GO annotations, provide a list of products without GO annotation, identify potential orthologs, model functional genomics data using GO and assist proteomics analysis of ESTs and EST assemblies. Our journal database helps prevent redundant manual GO curation. We encourage and publicly acknowledge GO annotations from researchers and provide a service for researchers interested in GO and analysis of functional genomics data. Conclusion The AgBase database is the first database dedicated to functional genomics and systems biology analysis for agriculturally important species and their pathogens. We use experimental data to improve structural annotation of genomes and to

  15. Comparative genomics-based investigation of resequencing targets in Vibrio fischeri: Focus on point miscalls and artefactual expansions

    Directory of Open Access Journals (Sweden)

    Ruby Edward G

    2008-03-01

    Full Text Available Abstract Background Sequence closure often represents the end-point of a genome project, without a system in place for subsequent improvement and refinement. Building on the genome project of Vibrio fischeri ES114, we used a comparative approach to identify and investigate genes that had a high likelihood of sequence error. Results Comparison of the V. fischeri ES114 genome with that of conspecific strain MJ11 identified 82 target loci in ES114 as containing likely errors, and thus of high-priority for resequencing. Analysis of the targets identified 75 loci in which an error had occurred, resulting in the correction of 10,457 base pairs to generate the new ES114 genomic sequence. A majority of the inaccurate loci involved frameshift errors, correction of which fused adjacent ORFs. Although insertions/deletions are thought to be rare in microbial genome assemblies, fourteen of the loci contained extraneous sequence of over 300 bp, likely due to imperfect contig ends that were misassembled in tandem rather than as overlapping segments. Additionally we updated the entire genome annotation with 113 new features including previously uncalled protein-coding genes, regulatory RNA genes and operon leader peptides, and we analyzed the transcriptional apparatus encoded by ES114. Conclusion We demonstrate that errors in microbial genome sequences, thought to largely be confined to point mutations, may also consist of other prevalent large-scale rearrangements such as insertions. Ongoing genome quality control and annotation programs are necessary to accompany technological advancements in data generation. These updates further advance V. fischeri as an important model for understanding intercellular communication and colonization of animal tissue.

  16. Future Public Policy and Ethical Issues Facing the Agricultural and Microbial Genomics Sectors of the Biotechnology Industry: A Roundtable Discussion

    Energy Technology Data Exchange (ETDEWEB)

    Diane E. Hoffmann

    2003-09-12

    On September 12, 2003, the University of Maryland School of Law's Intellectual Property and Law & Health Care Programs jointly sponsored and convened a roundtable discussion on the future public policy and ethical issues that will likely face the agricultural and microbial genomics sectors of the biotechnology industry. As this industry has developed over the last two decades, societal concerns have moved from what were often local issues, e.g., the safety of laboratories where scientists conducted recombinant DNA research on transgenic microbes, animals and crops, to more global issues. These newer issues include intellectual property, international trade, risks of genetically engineered foods and microbes, bioterrorism, and marketing and labeling of new products sold worldwide. The fast paced nature of the biotechnology industry and its new developments often mean that legislators, regulators and society, in general, must play ''catch up'' in their efforts to understand the issues, the risks, and even the benefits, that may result from the industry's new ways of conducting research, new products, and novel methods of product marketing and distribution. The goal of the roundtable was to develop a short list of the most significant public policy and ethical issues that will emerge as a result of advances in these sectors of the biotechnology industry over the next five to six years. More concretely, by ''most significant'' the conveners meant the types of issues that would come to the attention of members of Congress or state legislators during this time frame and for which they would be better prepared if they had well researched and timely background information. A concomitant goal was to provide a set of focused issues for academic debate and scholarship so that policy makers, industry leaders and regulators would have the intellectual resources they need to better understand the issues and concerns at stake. The

  17. Prevention of microbial communities: novel approaches based natural products.

    Science.gov (United States)

    Mogosanu, George D; Grumezescu, Alexandru M; Huang, Keng-Shiang; Bejenaru, Ludovic E; Bejenaru, Cornelia

    2015-01-01

    Firmly attached to different living or non-living, solid or fluid surfaces rich in nutrients and moisture, microbial biofilm is a matter of great interest due to its major importance for the healthcare community. Depending on common strategies such as mutual protection and hibernation (quiescent bacteria), the resistance, survival and virulence of microbial communities have large implications for human pathology, clinical environment and biomedical devices. The microbial biofilm is continuously changing, stimulating inflammation, increasing vascular permeability and preventing the action of macrophages. About 80% of human infections affecting the gastrointestinal, genitourinary and respiratory systems, oral mucosa and teeth, eyes, middle ear and skin are caused by biofilm-associated microorganisms. Therefore, the search for modern strategies is even more important as microbial biofilms resistant to conventional antibiotics, antiseptics and disinfectants are involved in the frequent treatment failures of some chronic inflammatory diseases and wounds. Natural products containing secondary metabolites, such as aromatic compounds, sulphurated derivatives, terpenoids (essential oils) and alkaloids as quorum-sensing inhibitors and biofilm disruptors, are promising alternatives for the prophylaxis and treatment of chronic infections. Surface modification of medical devices with non-polar functionalized nanoparticles stabilizes the natural compounds antibiofilm activity and inhibits microbial adhesion and biofilm formation and growth for a longer period of time. In this regard, an interdisciplinary approach is needed due to the large number of natural derivatives alone or in combination with biocompatible and biodegradable micro-/ nano-engineered materials.

  18. Microbial-based inoculants impact nitrous oxide emissions from an incubated soil medium containing urea fertilizers.

    Science.gov (United States)

    Calvo, Pamela; Watts, Dexter B; Ames, Robert N; Kloepper, Joseph W; Torbert, H Allen

    2013-01-01

    There is currently much interest in developing crop management practices that will decrease NO emissions from agricultural soils. Many different approaches are being investigated, but to date, no studies have been published on how microbial inoculants affect NO emissions. This study was conducted to test the hypothesis that microbial-based inoculants known to promote root growth and nutrient uptake can reduce NO emissions in the presence of N fertilizers under controlled conditions. Carbon dioxide and CH fluxes were also measured to evaluate microbial respiration and determine the aerobic and anaerobic conditions of the incubated soil. The microbial-based treatments investigated were SoilBuilder (SB), a metabolite extract of SoilBuilder (SBF), and a mixture of four strains of plant growth-promoting spp. Experiments included two different N fertilizer treatments, urea and urea-NHNO 32% N (UAN), and an unfertilized control. Emissions of NO and CO were determined from soil incubations and analyzed with gas chromatography. After 29 d of incubation, cumulative NO emissions were reduced 80% by SB and 44% by SBF in soils fertilized with UAN. Treatment with spp. significantly reduced NO production on Days 1 and 2 of the incubation in soils fertilized with UAN. In the unfertilized treatment, cumulative emissions of NO were significantly reduced 92% by SBF. Microbial-based treatments did not reduce NO emissions associated with urea application. Microbial-based treatments increased CO emissions from soils fertilized with UAN, suggesting a possible increase in microbial activity. Overall, the results demonstrated that microbial-based inoculants can reduce NO emissions associated with N fertilizer application, and this response varies with the type of microbial-based inoculant and fertilizer.

  19. Secondary metabolomics: the impact of mass spectrometry-based approaches on the discovery and characterization of microbial natural products.

    Science.gov (United States)

    Krug, Daniel; Müller, Rolf

    2014-06-01

    Covering: up to the end of 2013 The in-depth analysis of secondary metabolomes of many microbes offers tremendous opportunities for the discovery of novel natural products which often exhibit promising biological activities. However, over the last years the increasing availability of whole-genome information has led to raised expectations, as bioinformatic analysis revealed that traditional strategies to discover novel secondary metabolites apparently have so far only scratched the surface of the real microbial "secondary metabolome landscape". Metabolomics-based approaches using modern mass spectrometry techniques can help to bridge the gap between genome-encoded potential for the production of secondary metabolites and the usually contradictory low numbers of compounds known from a specific producer. In this article recent studies are highlighted in which metabolomics-driven analysis played a crucial role for the discovery of novel secondary metabolites from microbial sources. We also exemplify how the implementation of metabolomics techniques facilitates the structural characterization of novel metabolites and contributes to the in-depth investigation of underlying biosynthetic pathways. Furthermore, the constantly increasing role of secondary metabolomics for the identification of novel natural products in a drug discovery context is discussed.

  20. Accessing the SEED genome databases via Web services API: tools for programmers

    National Research Council Canada - National Science Library

    Disz, Terry; Akhter, Sajia; Cuevas, Daniel; Olson, Robert; Overbeek, Ross; Vonstein, Veronika; Stevens, Rick; Edwards, Robert A

    2010-01-01

    .... The database contains accurate and up-to-date annotations based on the subsystems concept that leverages clustering between genomes and other clues to accurately and efficiently annotate microbial genomes...

  1. Genomic organisation, activity and distribution analysis of the microbial putrescine oxidase degradation pathway.

    Science.gov (United States)

    Foster, Alexander; Barnes, Nicole; Speight, Robert; Keane, Mark A

    2013-10-01

    The catalytic action of putrescine specific amine oxidases acting in tandem with 4-aminobutyraldehyde dehydrogenase is explored as a degradative pathway in Rhodococcus opacus. By limiting the nitrogen source, increased catalytic activity was induced leading to a coordinated response in the oxidative deamination of putrescine to 4-aminobutyraldehyde and subsequent dehydrogenation to 4-aminobutyrate. Isolating the dehydrogenase by ion exchange chromatography and gel filtration revealed that the enzyme acts principally on linear aliphatic aldehydes possessing an amino moiety. Michaelis-Menten kinetic analysis delivered a Michaelis constant (K(M)=0.014 mM) and maximum rate (Vmax=11.2 μmol/min/mg) for the conversion of 4-aminobutyraldehyde to 4-aminobutyrate. The dehydrogenase identified by MALDI-TOF mass spectrometric analysis (E value=0.031, 23% coverage) belongs to a functionally related genomic cluster that includes the amine oxidase, suggesting their association in a directed cell response. Key regulatory, stress and transport encoding genes have been identified, along with candidate dehydrogenases and transaminases for the further conversion of 4-aminobutyrate to succinate. Genomic analysis has revealed highly similar metabolic gene clustering among members of Actinobacteria, providing insight into putrescine degradation notably among Micrococcaceae, Rhodococci and Corynebacterium by a pathway that was previously uncharacterised in bacteria.

  2. effect of a microbial-based acaricidal product on spotted and ...

    African Journals Online (AJOL)

    Administrator

    EFFECT OF A MICROBIAL-BASED ACARICIDAL PRODUCT ON SPOTTED ... reproductive potential and short life cycle facilitate rapid resistance ... This product is currently used in the .... abamectin (5 or 7 ppm), we observed faster decline.

  3. Network Based Prediction Model for Genomics Data Analysis*

    OpenAIRE

    Huang, Ying; Wang, Pei

    2012-01-01

    Biological networks, such as genetic regulatory networks and protein interaction networks, provide important information for studying gene/protein activities. In this paper, we propose a new method, NetBoosting, for incorporating a priori biological network information in analyzing high dimensional genomics data. Specially, we are interested in constructing prediction models for disease phenotypes of interest based on genomics data, and at the same time identifying disease susceptible genes. ...

  4. Going from Microbial Ecology to Genome Data and Back: Studies on a Haloalkaliphilic Bacterium Isolated from Soap Lake, Washington State

    Directory of Open Access Journals (Sweden)

    Melanie R. Mormile

    2014-11-01

    Full Text Available Soap Lake is a meromictic, alkaline (~pH 9.8 and saline (~14 to 140 g liter-1 lake located in the semiarid area of eastern Washington State. Of note is the length of time it has been meromictic (at least 2000 years and the extremely high sulfide level (~140 mM in its monimolimnion. As expected, the microbial ecology of this lake is greatly influenced by these conditions. A bacterium, Halanaerobium hydrogeniformans, was isolated from the mixolimnion region of this lake. H. hydrogeniformans is a haloalkaliphilic bacterium capable of forming hydrogen from 5- and 6-carbon sugars derived from hemicellulose and cellulose. Due to its ability to produce hydrogen under saline and alkaline conditions, in amounts that rival genetically modified organisms, its genome was sequenced. This sequence data provides an opportunity to explore the unique metabolic capabilities of this organism, including the mechanisms for tolerating the extreme conditions of both high salinity and alkalinity of its environment.

  5. IMG-ABC: new features for bacterial secondary metabolism analysis and targeted biosynthetic gene cluster discovery in thousands of microbial genomes

    Science.gov (United States)

    Hadjithomas, Michalis; Chen, I-Min A.; Chu, Ken; Huang, Jinghua; Ratner, Anna; Palaniappan, Krishna; Andersen, Evan; Markowitz, Victor; Kyrpides, Nikos C.; Ivanova, Natalia N.

    2017-01-01

    Secondary metabolites produced by microbes have diverse biological functions, which makes them a great potential source of biotechnologically relevant compounds with antimicrobial, anti-cancer and other activities. The proteins needed to synthesize these natural products are often encoded by clusters of co-located genes called biosynthetic gene clusters (BCs). In order to advance the exploration of microbial secondary metabolism, we developed the largest publically available database of experimentally verified and predicted BCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc/). Here, we describe an update of IMG-ABC, which includes ClusterScout, a tool for targeted identification of custom biosynthetic gene clusters across 40 000 isolate microbial genomes, and a new search capability to query more than 700 000 BCs from isolate genomes for clusters with similar Pfam composition. Additional features enable fast exploration and analysis of BCs through two new interactive visualization features, a BC function heatmap and a BC similarity network graph. These new tools and features add to the value of IMG-ABC's vast body of BC data, facilitating their in-depth analysis and accelerating secondary metabolite discovery. PMID:27903896

  6. Array-based comparative genomic hybridization for genome-wide screening of DNA copy number in bladder tumors.

    NARCIS (Netherlands)

    Veltman, J.A.; Fridlyand, J.; Pejavar, S.; Olshen, A.B.; Korkola, J.E.; Vries, S. de; Carroll, P.; Kuo, W.L.; Pinkel, D.; Albertson, D.; Cordon-Cardo, C.; Jain, A.N.; Waldman, F.M.

    2003-01-01

    Genome-wide copy number profiles were characterized in 41 primary bladder tumors using array-based comparative genomic hybridization (array CGH). In addition to previously identified alterations in large chromosomal regions, alterations were identified in many small genomic regions, some with high-l

  7. Genomic comparisons of Brucella spp. and closely related bacteria using base compositional and proteome based methods

    DEFF Research Database (Denmark)

    Bohlin, Jon; Snipen, Lars; Cloeckaert, Axel

    2010-01-01

    , genomic codon and amino acid frequencies based comparisons) and proteomes (all-against-all BLAST protein comparisons and pan-genomic analyses). RESULTS: We found that the oligonucleotide based methods gave different results compared to that of the proteome based methods. Differences were also found...... than proteome comparisons between species in genus Brucella and genus Ochrobactrum. Pan-genomic analyses indicated that uptake of DNA from outside genus Brucella appears to be limited. CONCLUSIONS: While both the proteome based methods and the Markov chain based genomic signatures were able to reflect...

  8. Bacterial Recombineering: Genome Engineering via Phage-Based Homologous Recombination.

    Science.gov (United States)

    Pines, Gur; Freed, Emily F; Winkler, James D; Gill, Ryan T

    2015-11-20

    The ability to specifically modify bacterial genomes in a precise and efficient manner is highly desired in various fields, ranging from molecular genetics to metabolic engineering and synthetic biology. Much has changed from the initial realization that phage-derived genes may be employed for such tasks to today, where recombineering enables complex genetic edits within a genome or a population. Here, we review the major developments leading to recombineering becoming the method of choice for in situ bacterial genome editing while highlighting the various applications of recombineering in pushing the boundaries of synthetic biology. We also present the current understanding of the mechanism of recombineering. Finally, we discuss in detail issues surrounding recombineering efficiency and future directions for recombineering-based genome editing.

  9. Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology

    DEFF Research Database (Denmark)

    Cao, Hongzhi; Hastie, Alex R.; Cao, Dandan

    2014-01-01

    mutations; however, none of the current detection methods are comprehensive, and currently available methodologies are incapable of providing sufficient resolution and unambiguous information across complex regions in the human genome. To address these challenges, we applied a high-throughput, cost......BACKGROUND: Structural variants (SVs) are less common than single nucleotide polymorphisms and indels in the population, but collectively account for a significant fraction of genetic polymorphism and diseases. Base pair differences arising from SVs are on a much higher order (>100 fold) than point...... mapping technology as a comprehensive and cost-effective method for detecting structural variation and studying complex regions in the human genome, as well as deciphering viral integration into the host genome....

  10. NEMiD: a web-based curated microbial diversity database with geo-based plotting.

    Science.gov (United States)

    Bhattacharjee, Kaushik; Joshi, Santa Ram

    2014-01-01

    The majority of the Earth's microbes remain unknown, and that their potential utility cannot be exploited until they are discovered and characterized. They provide wide scope for the development of new strains as well as biotechnological uses. The documentation and bioprospection of microorganisms carry enormous significance considering their relevance to human welfare. This calls for an urgent need to develop a database with emphasis on the microbial diversity of the largest untapped reservoirs in the biosphere. The data annotated in the North-East India Microbial database (NEMiD) were obtained by the isolation and characterization of microbes from different parts of the Eastern Himalayan region. The database was constructed as a relational database management system (RDBMS) for data storage in MySQL in the back-end on a Linux server and implemented in an Apache/PHP environment. This database provides a base for understanding the soil microbial diversity pattern in this megabiodiversity hotspot and indicates the distribution patterns of various organisms along with identification. The NEMiD database is freely available at www.mblabnehu.info/nemid/.

  11. NEMiD: a web-based curated microbial diversity database with geo-based plotting.

    Directory of Open Access Journals (Sweden)

    Kaushik Bhattacharjee

    Full Text Available The majority of the Earth's microbes remain unknown, and that their potential utility cannot be exploited until they are discovered and characterized. They provide wide scope for the development of new strains as well as biotechnological uses. The documentation and bioprospection of microorganisms carry enormous significance considering their relevance to human welfare. This calls for an urgent need to develop a database with emphasis on the microbial diversity of the largest untapped reservoirs in the biosphere. The data annotated in the North-East India Microbial database (NEMiD were obtained by the isolation and characterization of microbes from different parts of the Eastern Himalayan region. The database was constructed as a relational database management system (RDBMS for data storage in MySQL in the back-end on a Linux server and implemented in an Apache/PHP environment. This database provides a base for understanding the soil microbial diversity pattern in this megabiodiversity hotspot and indicates the distribution patterns of various organisms along with identification. The NEMiD database is freely available at www.mblabnehu.info/nemid/.

  12. Global genomic diversity of human papillomavirus 6 based on 724 isolates and 190 complete genome sequences.

    Science.gov (United States)

    Jelen, Mateja M; Chen, Zigui; Kocjan, Boštjan J; Burt, Felicity J; Chan, Paul K S; Chouhy, Diego; Combrinck, Catharina E; Coutlée, François; Estrade, Christine; Ferenczy, Alex; Fiander, Alison; Franco, Eduardo L; Garland, Suzanne M; Giri, Adriana A; González, Joaquín Víctor; Gröning, Arndt; Heidrich, Kerstin; Hibbitts, Sam; Hošnjak, Lea; Luk, Tommy N M; Marinic, Karina; Matsukura, Toshihiko; Neumann, Anna; Oštrbenk, Anja; Picconi, Maria Alejandra; Richardson, Harriet; Sagadin, Martin; Sahli, Roland; Seedat, Riaz Y; Seme, Katja; Severini, Alberto; Sinchi, Jessica L; Smahelova, Jana; Tabrizi, Sepehr N; Tachezy, Ruth; Tohme, Sarah; Uloza, Virgilijus; Vitkauskiene, Astra; Wong, Yong Wee; Zidovec Lepej, Snježana; Burk, Robert D; Poljak, Mario

    2014-07-01

    Human papillomavirus type 6 (HPV6) is the major etiological agent of anogenital warts and laryngeal papillomas and has been included in both the quadrivalent and nonavalent prophylactic HPV vaccines. This study investigated the global genomic diversity of HPV6, using 724 isolates and 190 complete genomes from six continents, and the association of HPV6 genomic variants with geographical location, anatomical site of infection/disease, and gender. Initially, a 2,800-bp E5a-E5b-L1-LCR fragment was sequenced from 492/530 (92.8%) HPV6-positive samples collected for this study. Among them, 130 exhibited at least one single nucleotide polymorphism (SNP), indel, or amino acid change in the E5a-E5b-L1-LCR fragment and were sequenced in full. A global alignment and maximum likelihood tree of 190 complete HPV6 genomes (130 fully sequenced in this study and 60 obtained from sequence repositories) revealed two variant lineages, A and B, and five B sublineages: B1, B2, B3, B4, and B5. HPV6 (sub)lineage-specific SNPs and a 960-bp representative region for whole-genome-based phylogenetic clustering within the L2 open reading frame were identified. Multivariate logistic regression analysis revealed that lineage B predominated globally. Sublineage B3 was more common in Africa and North and South America, and lineage A was more common in Asia. Sublineages B1 and B3 were associated with anogenital infections, indicating a potential lesion-specific predilection of some HPV6 sublineages. Females had higher odds for infection with sublineage B3 than males. In conclusion, a global HPV6 phylogenetic analysis revealed the existence of two variant lineages and five sublineages, showing some degree of ethnogeographic, gender, and/or disease predilection in their distribution. This study established the largest database of globally circulating HPV6 genomic variants and contributed a total of 130 new, complete HPV6 genome sequences to available sequence repositories. Two HPV6 variant lineages

  13. Genomic profiling of oral squamous cell carcinoma by array-based comparative genomic hybridization.

    Directory of Open Access Journals (Sweden)

    Shunichi Yoshioka

    Full Text Available We designed a study to investigate genetic relationships between primary tumors of oral squamous cell carcinoma (OSCC and their lymph node metastases, and to identify genomic copy number aberrations (CNAs related to lymph node metastasis. For this purpose, we collected a total of 42 tumor samples from 25 patients and analyzed their genomic profiles by array-based comparative genomic hybridization. We then compared the genetic profiles of metastatic primary tumors (MPTs with their paired lymph node metastases (LNMs, and also those of LNMs with non-metastatic primary tumors (NMPTs. Firstly, we found that although there were some distinctive differences in the patterns of genomic profiles between MPTs and their paired LNMs, the paired samples shared similar genomic aberration patterns in each case. Unsupervised hierarchical clustering analysis grouped together 12 of the 15 MPT-LNM pairs. Furthermore, similarity scores between paired samples were significantly higher than those between non-paired samples. These results suggested that MPTs and their paired LNMs are composed predominantly of genetically clonal tumor cells, while minor populations with different CNAs may also exist in metastatic OSCCs. Secondly, to identify CNAs related to lymph node metastasis, we compared CNAs between grouped samples of MPTs and LNMs, but were unable to find any CNAs that were more common in LNMs. Finally, we hypothesized that subpopulations carrying metastasis-related CNAs might be present in both the MPT and LNM. Accordingly, we compared CNAs between NMPTs and LNMs, and found that gains of 7p, 8q and 17q were more common in the latter than in the former, suggesting that these CNAs may be involved in lymph node metastasis of OSCC. In conclusion, our data suggest that in OSCCs showing metastasis, the primary and metastatic tumors share similar genomic profiles, and that cells in the primary tumor may tend to metastasize after acquiring metastasis-associated CNAs.

  14. Microbial iron management mechanisms in extremely acidic environments: comparative genomics evidence for diversity and versatility

    Directory of Open Access Journals (Sweden)

    Nieto Pamela A

    2008-11-01

    uptake systems could reflect their obligatory occupation of extremely low pH environments where high concentrations of soluble iron may always be available and were oxidized sulfur species might not compromise iron speciation dynamics. Presence of bacterioferritin in the Acidithiobacilli, polyphosphate accumulation functions and variants of FieF-like diffusion facilitators in both Acidithiobacilli and Leptospirilla, indicate that they may remove or store iron under conditions of variable availability. In addition, the Fe(II-oxidizing capacity of both A. ferrooxidans and Leptospirilla could itself be a way to evade iron stress imposed by readily available Fe(II ions at low pH. Fur regulatory sites have been predicted for a number of gene clusters including iron related and non-iron related functions in both the Acidithiobacilli and Leptospirilla, laying the foundation for the future discovery of iron regulated and iron-phosphate coordinated regulatory control circuits. Conclusion In silico analyses of the genomes of acidophilic bacteria are beginning to tease apart the mechanisms that mediate iron uptake and homeostasis in low pH environments. Initial models pinpoint significant differences in abundance and diversity of iron management mechanisms between Leptospirilla and Acidithiobacilli, and begin to reveal how these two groups respond to iron cycling and iron fluctuations in naturally acidic environments and in industrial operations. Niche partitions and ecological successions between acidophilic microorganisms may be partially explained by these observed differences. Models derived from these analyses pave the way for improved hypothesis testing and well directed experimental investigation. In addition, aspects of these models should challenge investigators to evaluate alternative iron management strategies in non-acidophilic model organisms.

  15. Marine genomics

    DEFF Research Database (Denmark)

    Oliveira Ribeiro, Ângela Maria; Foote, Andrew D.; Kupczok, Anne

    2017-01-01

    Marine ecosystems occupy 71% of the surface of our planet, yet we know little about their diversity. Although the inventory of species is continually increasing, as registered by the Census of Marine Life program, only about 10% of the estimated two million marine species are known. This lag......-throughput sequencing approaches have been helping to improve our knowledge of marine biodiversity, from the rich microbial biota that forms the base of the tree of life to a wealth of plant and animal species. In this review, we present an overview of the applications of genomics to the study of marine life, from...... evolutionary biology of non-model organisms to species of commercial relevance for fishing, aquaculture and biomedicine. Instead of providing an exhaustive list of available genomic data, we rather set to present contextualized examples that best represent the current status of the field of marine genomics....

  16. Comparative genomics beyond sequence-based alignments

    DEFF Research Database (Denmark)

    Þórarinsson, Elfar; Yao, Zizhen; Wiklund, Eric D.;

    2008-01-01

    Recent computational scans for non-coding RNAs (ncRNAs) in multiple organisms have relied on existing multiple sequence alignments. However, as sequence similarity drops, a key signal of RNA structure--frequent compensating base changes--is increasingly likely to cause sequence-based alignment me...

  17. WormBase 2016: expanding to enable helminth genomic research

    Science.gov (United States)

    Howe, Kevin L.; Bolt, Bruce J.; Cain, Scott; Chan, Juancarlos; Chen, Wen J.; Davis, Paul; Done, James; Down, Thomas; Gao, Sibyl; Grove, Christian; Harris, Todd W.; Kishore, Ranjana; Lee, Raymond; Lomax, Jane; Li, Yuling; Muller, Hans-Michael; Nakamura, Cecilia; Nuin, Paulo; Paulini, Michael; Raciti, Daniela; Schindelman, Gary; Stanley, Eleanor; Tuli, Mary Ann; Van Auken, Kimberly; Wang, Daniel; Wang, Xiaodong; Williams, Gary; Wright, Adam; Yook, Karen; Berriman, Matthew; Kersey, Paul; Schedl, Tim; Stein, Lincoln; Sternberg, Paul W.

    2016-01-01

    WormBase (www.wormbase.org) is a central repository for research data on the biology, genetics and genomics of Caenorhabditis elegans and other nematodes. The project has evolved from its original remit to collect and integrate all data for a single species, and now extends to numerous nematodes, ranging from evolutionary comparators of C. elegans to parasitic species that threaten plant, animal and human health. Research activity using C. elegans as a model system is as vibrant as ever, and we have created new tools for community curation in response to the ever-increasing volume and complexity of data. To better allow users to navigate their way through these data, we have made a number of improvements to our main website, including new tools for browsing genomic features and ontology annotations. Finally, we have developed a new portal for parasitic worm genomes. WormBase ParaSite (parasite.wormbase.org) contains all publicly available nematode and platyhelminth annotated genome sequences, and is designed specifically to support helminth genomic research. PMID:26578572

  18. Changing Histopathological Diagnostics by Genome-Based Tumor Classification

    Directory of Open Access Journals (Sweden)

    Michael Kloth

    2014-05-01

    Full Text Available Traditionally, tumors are classified by histopathological criteria, i.e., based on their specific morphological appearances. Consequently, current therapeutic decisions in oncology are strongly influenced by histology rather than underlying molecular or genomic aberrations. The increase of information on molecular changes however, enabled by the Human Genome Project and the International Cancer Genome Consortium as well as the manifold advances in molecular biology and high-throughput sequencing techniques, inaugurated the integration of genomic information into disease classification. Furthermore, in some cases it became evident that former classifications needed major revision and adaption. Such adaptations are often required by understanding the pathogenesis of a disease from a specific molecular alteration, using this molecular driver for targeted and highly effective therapies. Altogether, reclassifications should lead to higher information content of the underlying diagnoses, reflecting their molecular pathogenesis and resulting in optimized and individual therapeutic decisions. The objective of this article is to summarize some particularly important examples of genome-based classification approaches and associated therapeutic concepts. In addition to reviewing disease specific markers, we focus on potentially therapeutic or predictive markers and the relevance of molecular diagnostics in disease monitoring.

  19. An integrative computational model for large-scale identification of metalloproteins in microbial genomes: a focus on iron-sulfur cluster proteins.

    Science.gov (United States)

    Estellon, Johan; Ollagnier de Choudens, Sandrine; Smadja, Myriam; Fontecave, Marc; Vandenbrouck, Yves

    2014-10-01

    Metalloproteins represent a ubiquitous group of molecules which are crucial to the survival of all living organisms. While several metal-binding motifs have been defined, it remains challenging to confidently identify metalloproteins from primary protein sequences using computational approaches alone. Here, we describe a comprehensive strategy based on a machine learning approach to design and assess a penalized generalized linear model. We used this strategy to detect members of the iron-sulfur cluster protein family. A new category of descriptors, whose profile is based on profile hidden Markov models, encoding structural information was combined with public descriptors into a linear model. The model was trained and tested on distinct datasets composed of well-characterized iron-sulfur protein sequences, and the resulting model provided higher sensitivity compared to a motif-based approach, while maintaining a good level of specificity. Analysis of this linear model allows us to detect and quantify the contribution of each descriptor, providing us with a better understanding of this complex protein family along with valuable indications for further experimental characterization. Two newly-identified proteins, YhcC and YdiJ, were functionally validated as genuine iron-sulfur proteins, confirming the prediction. The computational model was then applied to over 550 prokaryotic genomes to screen for iron-sulfur proteomes; the results are publicly available at: . This study represents a proof-of-concept for the application of a penalized linear model to identify metalloprotein superfamilies on a large-scale. The application employed here, screening for iron-sulfur proteomes, provides new candidates for further biochemical and structural analysis as well as new resources for an extensive exploration of iron-sulfuromes in the microbial world.

  20. BPhyOG: An interactive server for genome-wide inference of bacterial phylogenies based on overlapping genes

    Directory of Open Access Journals (Sweden)

    Lin Kui

    2007-07-01

    Full Text Available Abstract Background Overlapping genes (OGs in bacterial genomes are pairs of adjacent genes of which the coding sequences overlap partly or entirely. With the rapid accumulation of sequence data, many OGs in bacterial genomes have now been identified. Indeed, these might prove a consistent feature across all microbial genomes. Our previous work suggests that OGs can be considered as robust markers at the whole genome level for the construction of phylogenies. An online, interactive web server for inferring phylogenies is needed for biologists to analyze phylogenetic relationships among a set of bacterial genomes of interest. Description BPhyOG is an online interactive server for reconstructing the phylogenies of completely sequenced bacterial genomes on the basis of their shared overlapping genes. It provides two tree-reconstruction methods: Neighbor Joining (NJ and Unweighted Pair-Group Method using Arithmetic averages (UPGMA. Users can apply the desired method to generate phylogenetic trees, which are based on an evolutionary distance matrix for the selected genomes. The distance between two genomes is defined by the normalized number of their shared OG pairs. BPhyOG also allows users to browse the OGs that were used to infer the phylogenetic relationships. It provides detailed annotation for each OG pair and the features of the component genes through hyperlinks. Users can also retrieve each of the homologous OG pairs that have been determined among 177 genomes. It is a useful tool for analyzing the tree of life and overlapping genes from a genomic standpoint. Conclusion BPhyOG is a useful interactive web server for genome-wide inference of any potential evolutionary relationship among the genomes selected by users. It currently includes 177 completely sequenced bacterial genomes containing 79,855 OG pairs, the annotation and homologous OG pairs of which are integrated comprehensively. The reliability of phylogenies complemented by

  1. Introducing BASE: the Biomes of Australian Soil Environments soil microbial diversity database.

    Science.gov (United States)

    Bissett, Andrew; Fitzgerald, Anna; Meintjes, Thys; Mele, Pauline M; Reith, Frank; Dennis, Paul G; Breed, Martin F; Brown, Belinda; Brown, Mark V; Brugger, Joel; Byrne, Margaret; Caddy-Retalic, Stefan; Carmody, Bernie; Coates, David J; Correa, Carolina; Ferrari, Belinda C; Gupta, Vadakattu V S R; Hamonts, Kelly; Haslem, Asha; Hugenholtz, Philip; Karan, Mirko; Koval, Jason; Lowe, Andrew J; Macdonald, Stuart; McGrath, Leanne; Martin, David; Morgan, Matt; North, Kristin I; Paungfoo-Lonhienne, Chanyarat; Pendall, Elise; Phillips, Lori; Pirzl, Rebecca; Powell, Jeff R; Ragan, Mark A; Schmidt, Susanne; Seymour, Nicole; Snape, Ian; Stephen, John R; Stevens, Matthew; Tinning, Matt; Williams, Kristen; Yeoh, Yun Kit; Zammit, Carla M; Young, Andrew

    2016-01-01

    Microbial inhabitants of soils are important to ecosystem and planetary functions, yet there are large gaps in our knowledge of their diversity and ecology. The 'Biomes of Australian Soil Environments' (BASE) project has generated a database of microbial diversity with associated metadata across extensive environmental gradients at continental scale. As the characterisation of microbes rapidly expands, the BASE database provides an evolving platform for interrogating and integrating microbial diversity and function. BASE currently provides amplicon sequences and associated contextual data for over 900 sites encompassing all Australian states and territories, a wide variety of bioregions, vegetation and land-use types. Amplicons target bacteria, archaea and general and fungal-specific eukaryotes. The growing database will soon include metagenomics data. Data are provided in both raw sequence (FASTQ) and analysed OTU table formats and are accessed via the project's data portal, which provides a user-friendly search tool to quickly identify samples of interest. Processed data can be visually interrogated and intersected with other Australian diversity and environmental data using tools developed by the 'Atlas of Living Australia'. Developed within an open data framework, the BASE project is the first Australian soil microbial diversity database. The database will grow and link to other global efforts to explore microbial, plant, animal, and marine biodiversity. Its design and open access nature ensures that BASE will evolve as a valuable tool for documenting an often overlooked component of biodiversity and the many microbe-driven processes that are essential to sustain soil function and ecosystem services.

  2. GI-SVM: A sensitive method for predicting genomic islands based on unannotated sequence of a single genome.

    Science.gov (United States)

    Lu, Bingxin; Leong, Hon Wai

    2016-02-01

    Genomic islands (GIs) are clusters of functionally related genes acquired by lateral genetic transfer (LGT), and they are present in many bacterial genomes. GIs are extremely important for bacterial research, because they not only promote genome evolution but also contain genes that enhance adaption and enable antibiotic resistance. Many methods have been proposed to predict GI. But most of them rely on either annotations or comparisons with other closely related genomes. Hence these methods cannot be easily applied to new genomes. As the number of newly sequenced bacterial genomes rapidly increases, there is a need for methods to detect GI based solely on sequences of a single genome. In this paper, we propose a novel method, GI-SVM, to predict GIs given only the unannotated genome sequence. GI-SVM is based on one-class support vector machine (SVM), utilizing composition bias in terms of k-mer content. From our evaluations on three real genomes, GI-SVM can achieve higher recall compared with current methods, without much loss of precision. Besides, GI-SVM allows flexible parameter tuning to get optimal results for each genome. In short, GI-SVM provides a more sensitive method for researchers interested in a first-pass detection of GI in newly sequenced genomes.

  3. INTEGRATED GENOME-BASED STUDIES OF SHEWANELLA ECOPHYSIOLOGY

    Energy Technology Data Exchange (ETDEWEB)

    TIEDJE, JAMES M; KONSTANTINIDIS, KOSTAS; WORDEN, MARK

    2014-01-08

    The aim of the work reported is to study Shewanella population genomics, and to understand the evolution, ecophysiology, and speciation of Shewanella. The tasks supporting this aim are: to study genetic and ecophysiological bases defining the core and diversification of Shewanella species; to determine gene content patterns along redox gradients; and to Investigate the evolutionary processes, patterns and mechanisms of Shewanella.

  4. Identification of genomic sites for CRISPR/Cas9-based genome editing in the Vitis vinifera genome

    Science.gov (United States)

    CRISPR/Cas9 has been recently demonstrated as an effective and popular genome editing tool for modifying genomes of human, animals, microorganisms, and plants. Success of such genome editing is highly dependent on the availability of suitable target sites in the genomes to be edited. Many specific t...

  5. Impulsive Neural Networks Algorithm Based on the Artificial Genome Model

    Directory of Open Access Journals (Sweden)

    Yuan Gao

    2014-05-01

    Full Text Available To describe gene regulatory networks, this article takes the framework of the artificial genome model and proposes impulsive neural networks algorithm based on the artificial genome model. Firstly, the gene expression and the cell division tree are applied to generate spiking neurons with specific attributes, neural network structure, connection weights and specific learning rules of each neuron. Next, the gene segment duplications and divergence model are applied to design the evolutionary algorithm of impulsive neural networks at the level of the artificial genome. The dynamic changes of developmental gene regulatory networks are controlled during the whole evolutionary process. Finally, the behavior of collecting food for autonomous intelligent agent is simulated, which is driven by nerves. Experimental results demonstrate that the algorithm in this article has the evolutionary ability on large-scale impulsive neural networks

  6. The effect of genealogy-based haplotypes on genomic prediction

    DEFF Research Database (Denmark)

    Edriss, Vahid; Fernando, Rohan L.; Su, Guosheng

    2013-01-01

    Background Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression...... on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. Methods A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using...... local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (pi) of the haplotype covariates had zero effect...

  7. CFGP: a web-based, comparative fungal genomics platform.

    Science.gov (United States)

    Park, Jongsun; Park, Bongsoo; Jung, Kyongyong; Jang, Suwang; Yu, Kwangyul; Choi, Jaeyoung; Kong, Sunghyung; Park, Jaejin; Kim, Seryun; Kim, Hyojeong; Kim, Soonok; Kim, Jihyun F; Blair, Jaime E; Lee, Kwangwon; Kang, Seogchan; Lee, Yong-Hwan

    2008-01-01

    Since the completion of the Saccharomyces cerevisiae genome sequencing project in 1996, the genomes of over 80 fungal species have been sequenced or are currently being sequenced. Resulting data provide opportunities for studying and comparing fungal biology and evolution at the genome level. To support such studies, the Comparative Fungal Genomics Platform (CFGP; http://cfgp.snu.ac.kr), a web-based multifunctional informatics workbench, was developed. The CFGP comprises three layers, including the basal layer, middleware and the user interface. The data warehouse in the basal layer contains standardized genome sequences of 65 fungal species. The middleware processes queries via six analysis tools, including BLAST, ClustalW, InterProScan, SignalP 3.0, PSORT II and a newly developed tool named BLASTMatrix. The BLASTMatrix permits the identification and visualization of genes homologous to a query across multiple species. The Data-driven User Interface (DUI) of the CFGP was built on a new concept of pre-collecting data and post-executing analysis instead of the 'fill-in-the-form-and-press-SUBMIT' user interfaces utilized by most bioinformatics sites. A tool termed Favorite, which supports the management of encapsulated sequence data and provides a personalized data repository to users, is another novel feature in the DUI.

  8. Substrates and pathway of electricity generation in a nitrification-based microbial fuel cell.

    Science.gov (United States)

    Chen, Hui; Zheng, Ping; Zhang, Jiqiang; Xie, Zuofu; Ji, Junyuan; Ghulam, Abbas

    2014-06-01

    Nitrification-based microbial fuel cell (N-MFC) is a novel inorganic microbial fuel cell based on nitrification in the anode compartment. So far, little information is available on the substrates and pathway of N-MFC. The results of this study indicated that apart from the primary nitrification substrate (ammonium), the intermediates (hydroxylamine and nitrite) could also serve as anodic fuel to generate current, and the end product nitrate showed an inhibitory effect on electricity generation. Based on the research, a pathway of electricity generation was proposed for N-MFC: ammonium was oxidized first to nitrite by ammonia-oxidizing bacteria (AOB), then the nitrite in anolyte and the potassium permanganate in catholyte constituted a chemical cell to generate current. In other words, the electricity generation in N-MFC was not only supported by microbial reaction as we expected, but both biological and electrochemical reactions contributed.

  9. The molecular dimension of microbial species: 3. Comparative genomics of Synechococcus strains with different light responses and in situ diel transcription patterns of associated ecotypes in the Mushroom Spring microbial mat

    Directory of Open Access Journals (Sweden)

    Millie T. Olsen

    2015-06-01

    Full Text Available Genomes were obtained for three closely related strains of Synechococcus that are representative of putative ecotypes that predominate at different depths in the 1 mm-thick, upper-green layer in the 60°C mat of Mushroom Spring, Yellowstone National Park, and exhibit different light adaptation and acclimation responses. The genomes were compared to the published genome of a previously obtained, closely related strain from a neighboring spring, and differences in both gene content and orthologous gene alleles between high-light-adapted and low-light-adapted strains were identified. Evidence of genetic differences that relate to adaptation to light intensity and/or quality, CO2 uptake, nitrogen metabolism, organic carbon metabolism, and uptake of other nutrients were found between strains of the different putative ecotypes. In situ diel transcription patterns of genes, including genes unique to either low-light-adapted or high-light-adapted strains and different alleles of an orthologous photosystem gene, revealed that expression is fine-tuned to the different light environments experienced by ecotypes prevalent at various depths in the mat. This study suggests that strains of closely related putative ecotypes have different genomic adaptations that enable them to inhabit distinct ecological niches while living in close proximity within a microbial community.

  10. Vertical microbial community variability of carbonate-based cones may provide insight into ancient conical stromatolite formation

    Science.gov (United States)

    Bradley, James; Daille, Leslie; Trivedi, Christopher; Bojanowski, Caitlin; Nunn, Heather; Stamps, Blake; Johnson, Hope; Stevenson, Bradley; Berelson, Will; Corsetti, Frank; Spear, John

    2016-04-01

    Stromatolite morphogenesis is poorly understood, and the process by which microbial mats become mineralized is a primary question in microbialite formation. Ancient conical stromatolites are primarily carbonate-based whereas the few modern analogues in hot springs are either non-mineralized or mineralized by silica. A team from the 2015 International GeoBiology Course investigated carbonate-rich microbial cones from near Little Hot Creek (LHC), Long Valley Caldera, California, to investigate how conical stromatolites might form in a hot spring carbonate system. The cones rise up from a layered microbial mat on the east side of a 45° C pool with very low flow that is super-saturated with respect to CaCO3. Cone structures are 8-30 mm in height, are rigid and do not deform when removed from the pool. Morphological characterization through environmental scanning electronic microscopy revealed that the cone structure is maintained by a matrix of intertwining microbial filaments around carbonate grains. This matrix gives rise to cone-filaments that are arranged vertically or horizontally, and provides further stability to the cone. Preliminary 16S rRNA gene analysis indicated variability of community composition between different vertical levels of the cone. The cone tip had comparatively greater abundance of filamentous cyanobacteria including Leptolingbya, Phormidium and Isosphaera and fewer heterotrophs (e.g. Chloroflexi) compared to the cone bottom. This supports the hypothesis that cone formation may depend on the differential abundance of the microbial community and their potential functional roles. Metagenomic analyses of the cones revealed potential genes related to chemotaxis and motility. Specifically, a genomic bin identified as a member of the genus Isosphaera contained an hmp chemotaxis operon implicated in gliding motility in the cyanobacterium Nostoc punctiforme. Isosphaera is a Planctomycete shown to have phototactic capabilities, and may play a role in

  11. Microbial nitrogen cycling response to forest-based bioenergy production.

    Science.gov (United States)

    Minick, Kevan J; Strahm, Brian D; Fox, Thomas R; Sucre, Eric B; Leggett, Zakiya H

    2015-12-01

    Concern over rising atmospheric CO2 and other greenhouse gases due to fossil fuel combustion has intensified research into carbon-neutral energy production. Approximately 15.8 million ha of pine plantations exist across the southeastern United States, representing a vast land area advantageous for bioenergy production without significant landuse change or diversion of agricultural resources from food production. Furthermore, intercropping of pine with bioenergy grasses could provide annually harvestable, lignocellulosic biomass feedstocks along with production of traditional wood products. Viability of such a system hinges in part on soil nitrogen (N) availability and effects of N competition between pines and grasses on ecosystem productivity. We investigated effects of intercropping loblolly pine (Pinus taeda) with switchgrass (Panicum virgatum) on microbial N cycling processes in the Lower Coastal Plain of North Carolina, USA. Soil samples were collected from bedded rows of pine and interbed space of two treatments, composed of either volunteer native woody and herbaceous vegetation (pine-native) or pure switchgrass (pine-switchgrass) in interbeds. An in vitro 15N pool-dilution technique was employed to quantify gross N transformations at two soil depths (0-5 and 5-15 cm) on four dates in 2012-2013. At the 0-5 cm depth in beds of the pine-switchgrass treatment, gross N mineralization was two to three times higher in November and February compared to the pine-native treatment, resulting in increased NH4(+) availability. Gross and net nitrification were also significantly higher in February in the same pine beds. In interbeds of the pine-switchgrass treatment, gross N mineralization was lower from April to November, but higher in February, potentially reflecting positive effects of switchgrass root-derived C inputs during dormancy on microbial activity. These findings indicate soil N cycling and availability has increased in pine beds of the pine

  12. Evidence against hydrogen-based microbial ecosystems in basalt aquifers

    Science.gov (United States)

    Anderson; Chapelle; Lovley

    1998-08-14

    It has been proposed that hydrogen produced from basalt-ground-water interactions may serve as an energy source that supports the existence of microorganisms in the deep subsurface on Earth and possibly on other planets. However, experiments demonstrated that hydrogen is not produced from basalt at an environmentally relevant, alkaline pH. Small amounts of hydrogen were produced at a lower pH in laboratory incubations, but even this hydrogen production was transitory. Furthermore, geochemical considerations suggest that previously reported rates of hydrogen production cannot be sustained over geologically significant time frames. These findings indicate that hydrogen production from basalt-ground-water interactions may not support microbial metabolism in the subsurface.

  13. Performance of the Salt Bridge Based Microbial Fuel Cell

    Directory of Open Access Journals (Sweden)

    Maksudur R. Khan

    2012-04-01

    Full Text Available Electricity generation from readily biodegradable organic substrates accompanied by decolorization of azo dye was investigated using a Microbial fuel cell (MFC. Biodegradation was the dominant mechanism of the dye removal, and glucose was the optimal substrate for Red Cibacron-2G (RC decolorization. Batch experiments were conducted to evaluate the performance of the MFC.  As compared to traditional anaerobic technology higher decolorization efficiency was achieved by MFC. Effect of initial dye concentration and external resistance on power generation were studied. Polarization experiments were also directed to find the maximum power density. Maximum Power density of 100mW/m2 (1.04A/m2 was recorded at optimum operating conditions.

  14. Complete genome sequence of DSM 30083(T), the type strain (U5/41(T)) of Escherichia coli, and a proposal for delineating subspecies in microbial taxonomy.

    Science.gov (United States)

    Meier-Kolthoff, Jan P; Hahnke, Richard L; Petersen, Jörn; Scheuner, Carmen; Michael, Victoria; Fiebig, Anne; Rohde, Christine; Rohde, Manfred; Fartmann, Berthold; Goodwin, Lynne A; Chertkov, Olga; Reddy, Tbk; Pati, Amrita; Ivanova, Natalia N; Markowitz, Victor; Kyrpides, Nikos C; Woyke, Tanja; Göker, Markus; Klenk, Hans-Peter

    2014-01-01

    Although Escherichia coli is the most widely studied bacterial model organism and often considered to be the model bacterium per se, its type strain was until now forgotten from microbial genomics. As a part of the G enomic E ncyclopedia of B acteria and A rchaea project, we here describe the features of E. coli DSM 30083(T) together with its genome sequence and annotation as well as novel aspects of its phenotype. The 5,038,133 bp containing genome sequence includes 4,762 protein-coding genes and 175 RNA genes as well as a single plasmid. Affiliation of a set of 250 genome-sequenced E. coli strains, Shigella and outgroup strains to the type strain of E. coli was investigated using digital DNA:DNA-hybridization (dDDH) similarities and differences in genomic G+C content. As in the majority of previous studies, results show Shigella spp. embedded within E. coli and in most cases forming a single subgroup of it. Phylogenomic trees also recover the proposed E. coli phylotypes as monophyla with minor exceptions and place DSM 30083(T) in phylotype B2 with E. coli S88 as its closest neighbor. The widely used lab strain K-12 is not only genomically but also physiologically strongly different from the type strain. The phylotypes do not express a uniform level of character divergence as measured using dDDH, however, thus an alternative arrangement is proposed and discussed in the context of bacterial subspecies. Analyses of the genome sequences of a large number of E. coli strains and of strains from > 100 other bacterial genera indicate a value of 79-80% dDDH as the most promising threshold for delineating subspecies, which in turn suggests the presence of five subspecies within E. coli.

  15. Pyrosequencing-based assessment of microbial community shifts in leachate from animal carcass burial lysimeter.

    Science.gov (United States)

    Kim, Hyun Young; Seo, Jiyoung; Kim, Tae-Hun; Shim, Bomi; Cha, Seok Mun; Yu, Seungho

    2017-02-26

    This study examined the use of microbial community structure as a bio-indicator of decomposition levels. High-throughput pyrosequencing technology was used to assess the shift in microbial community of leachate from animal carcass lysimeter. The leachate samples were collected monthly for one year and a total of 164,639 pyrosequencing reads were obtained and used in the taxonomic classification and operational taxonomy units (OTUs) distribution analysis based on sequence similarity. Our results show considerable changes in the phylum-level bacterial composition, suggesting that the microbial community is a sensitive parameter affected by the burial environment. The phylum classification results showed that Proteobacteria (Pseudomonas) were the most influential taxa in earlier decomposition stage whereas Firmicutes (Clostridium, Sporanaerobacter, and Peptostreptococcus) were dominant in later stage under anaerobic conditions. The result of this study can provide useful information on a time series of leachate profiles of microbial community structures and suggest patterns of microbial diversity in livestock burial sites. In addition, this result can be applicable to predict the decomposition stages under clay loam based soil conditions of animal livestock.

  16. Genome-based discovery, structure prediction and functional analysis of cyclic lipopeptide antibiotics in Pseudomonas species

    NARCIS (Netherlands)

    Bruijn, de I.; Kock, de M.J.D.; Meng, Y.; Waard, de P.; Beek, van T.A.; Raaijmakers, J.M.

    2007-01-01

    Analysis of microbial genome sequences have revealed numerous genes involved in antibiotic biosynthesis. In Pseudomonads, several gene clusters encoding non-ribosomal peptide synthetases (NRPSs) were predicted to be involved in the synthesis of cyclic lipopeptide (CLP) antibiotics. Most of these

  17. Genome-based discovery, structure prediction and functional analysis of cyclic lipopeptide antibiotics in Pseudomonas species

    NARCIS (Netherlands)

    Bruijn, de I.; Kock, de M.J.D.; Meng, Y.; Waard, de P.; Beek, van T.A.; Raaijmakers, J.M.

    2007-01-01

    Analysis of microbial genome sequences have revealed numerous genes involved in antibiotic biosynthesis. In Pseudomonads, several gene clusters encoding non-ribosomal peptide synthetases (NRPSs) were predicted to be involved in the synthesis of cyclic lipopeptide (CLP) antibiotics. Most of these pre

  18. Development of bacteriophage-based bioluminescent bioreporters for monitoring of microbial pathogens

    Science.gov (United States)

    Ozen, Aysu; Montgomery, Kacey; Jegier, Pat; Patterson, Stacey; Daumer, Kathleen A.; Ripp, Steven A.; Garland, Jay L.; Sayler, Gary S.

    2004-03-01

    Microorganisms pose numerous problems when present in human occupied enclosed environments. Primary among these are health related hazards, manifested as infectious diseases related to contaminated drinking water, food, or air circulation systems or non-infectious allergy related complications associated with microbial metabolites (sick building syndrome). As a means towards rapid detection of microbial pathogens, we are attempting to harness the specificity of bacterial phage for their host with a modified quorum sensing amplification signal to produce quantifiable bioluminescent (lux) detection on a silicon microluminometer. The bacteriophage itself is metabolically inactive, only achieving replicative capabilities upon infection of its specific host bacterium. Bacteriophage bioluminescent bioreporters contain a genomically inserted luxI component. During an infection event, the phage genes and accompanying luxI construct are taken up by the host bacterium and transcribed, resulting in luxI expression and subsequent activation of a homoserine lactone inducible bioluminescent bioreporter. We constructed a vector carrying the luxI gene under the control of a strong E. coli promoter and cloned it into E. coli. We have shown that it can induce luminescence up to 14,000 counts per second when combined with the bioreporter strain. In their final embodiment, these sensors will be fully independent microelectronic monitors for microbial contamination, requiring only exposure of the biochip to the sample, with on-chip signal processing downloaded directly to the local area network of the environmental control system.

  19. Genomic-based-breeding tools for tropical maize improvement.

    Science.gov (United States)

    Chakradhar, Thammineni; Hindu, Vemuri; Reddy, Palakolanu Sudhakar

    2017-09-05

    Maize has traditionally been the main staple diet in the Southern Asia and Sub-Saharan Africa and widely grown by millions of resource poor small scale farmers. Approximately, 35.4 million hectares are sown to tropical maize, constituting around 59% of the developing worlds. Tropical maize encounters tremendous challenges besides poor agro-climatic situations with average yields recorded <3 tones/hectare that is far less than the average of developed countries. On the contrary to poor yields, the demand for maize as food, feed, and fuel is continuously increasing in these regions. Heterosis breeding introduced in early 90 s improved maize yields significantly, but genetic gains is still a mirage, particularly for crop growing under marginal environments. Application of molecular markers has accelerated the pace of maize breeding to some extent. The availability of array of sequencing and genotyping technologies offers unrivalled service to improve precision in maize-breeding programs through modern approaches such as genomic selection, genome-wide association studies, bulk segregant analysis-based sequencing approaches, etc. Superior alleles underlying complex traits can easily be identified and introgressed efficiently using these sequence-based approaches. Integration of genomic tools and techniques with advanced genetic resources such as nested association mapping and backcross nested association mapping could certainly address the genetic issues in maize improvement programs in developing countries. Huge diversity in tropical maize and its inherent capacity for doubled haploid technology offers advantage to apply the next generation genomic tools for accelerating production in marginal environments of tropical and subtropical world. Precision in phenotyping is the key for success of any molecular-breeding approach. This article reviews genomic technologies and their application to improve agronomic traits in tropical maize breeding has been reviewed in

  20. CRISPR/Cas9 Based Genome Editing of Penicillium chrysogenum.

    Science.gov (United States)

    Pohl, C; Kiel, J A K W; Driessen, A J M; Bovenberg, R A L; Nygård, Y

    2016-07-15

    CRISPR/Cas9 based systems have emerged as versatile platforms for precision genome editing in a wide range of organisms. Here we have developed powerful CRISPR/Cas9 tools for marker-based and marker-free genome modifications in Penicillium chrysogenum, a model filamentous fungus and industrially relevant cell factory. The developed CRISPR/Cas9 toolbox is highly flexible and allows editing of new targets with minimal cloning efforts. The Cas9 protein and the sgRNA can be either delivered during transformation, as preassembled CRISPR-Cas9 ribonucleoproteins (RNPs) or expressed from an AMA1 based plasmid within the cell. The direct delivery of the Cas9 protein with in vitro synthesized sgRNA to the cells allows for a transient method for genome engineering that may rapidly be applicable for other filamentous fungi. The expression of Cas9 from an AMA1 based vector was shown to be highly efficient for marker-free gene deletions.

  1. Use of Alignment-Free Phylogenetics for Rapid Genome Sequence-Based Typing of Helicobacter pylori Virulence Markers and Antibiotic Susceptibility

    NARCIS (Netherlands)

    van Vliet, Arnoud H M; Kusters, Johannes G.

    2015-01-01

    Whole-genome sequencing is becoming a leading technology in the typing and epidemiology of microbial pathogens, but the increase in genomic information necessitates significant investment in bioinformatic resources and expertise, and currently used methodologies struggle with genetically heterogeneo

  2. Unraveling microbial ecology of industrial-scale Kombucha fermentations by metabarcoding and culture-based methods.

    Science.gov (United States)

    Coton, Monika; Pawtowski, Audrey; Taminiau, Bernard; Burgaud, Gaëtan; Deniel, Franck; Coulloumme-Labarthe, Laurent; Fall, Abdoulaye; Daube, Georges; Coton, Emmanuel

    2017-05-01

    Kombucha, historically an Asian tea-based fermented drink, has recently become trendy in Western countries. Producers claim it bears health-enhancing properties that may come from the tea or metabolites produced by its microbiome. Despite its long history of production, microbial richness and dynamics have not been fully unraveled, especially at an industrial scale. Moreover, the impact of tea type (green or black) on microbial ecology was not studied. Here, we compared microbial communities from industrial-scale black and green tea fermentations, still traditionally carried out by a microbial biofilm, using culture-dependent and metabarcoding approaches. Dominant bacterial species belonged to Acetobacteraceae and to a lesser extent Lactobacteriaceae, while the main identified yeasts corresponded to Dekkera, Hanseniaspora and Zygosaccharomyces during all fermentations. Species richness decreased over the 8-day fermentation. Among acetic acid bacteria, Gluconacetobacter europaeus, Gluconobacter oxydans, G. saccharivorans and Acetobacter peroxydans emerged as dominant species. The main lactic acid bacteria, Oenococcus oeni, was strongly associated with green tea fermentations. Tea type did not influence yeast community, with Dekkera bruxellensis, D. anomala, Zygosaccharomyces bailii and Hanseniaspora valbyensis as most dominant. This study unraveled a distinctive core microbial community which is essential for fermentation control and could lead to Kombucha quality standardization. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  3. Genomics and Public Health: Development of Web-based Training Tools for Increasing Genomic Awareness

    OpenAIRE

    Kardia, Sharon LR; Bodzin, Jennifer; Goldenberg, Aaron; Citrin, Toby; Raup, Sarah F; Bach, Janice V

    2005-01-01

    In 2001, the Centers for Disease Control and Prevention funded three Centers for Genomics and Public Health to develop training tools for increasing genomic awareness. Over the past three years, the centers, working together with the Centers for Disease Control and Prevention's Office of Genomics and Disease Prevention, have developed tools to increase awareness of the impact genomics will have on public health practice, to provide a foundation for understanding basic genomic advances, and to...

  4. Novel extraction strategy of ribosomal RNA and genomic DNA from cheese for PCR-based investigations.

    Science.gov (United States)

    Bonaïti, Catherine; Parayre, Sandrine; Irlinger, Françoise

    2006-03-15

    Cheese microorganisms, such as bacteria and fungi, constitute a complex ecosystem that plays a central role in cheeses ripening. The molecular study of cheese microbial diversity and activity is essential but the extraction of high quality nucleic acid may be problematic: the cheese samples are characterised by a strong buffering capacity which negatively influenced the yield of the extracted rRNA. The objective of this study is to develop an effective method for the direct and simultaneous isolation of yeast and bacterial ribosomal RNA and genomic DNA from the same cheese samples. DNA isolation was based on a protocol used for nucleic acids isolation from anaerobic digestor, without preliminary washing step with the combined use of the action of chaotropic agent (acid guanidinium thiocyanate), detergents (SDS, N-lauroylsarcosine), chelating agent (EDTA) and a mechanical method (bead beating system). The DNA purification was carried out by two washing steps of phenol-chloroform. RNA was isolated successfully after the second acid extraction step by recovering it from the phenolic phase of the first acid extraction. The novel method yielded pure preparation of undegraded RNA accessible for reverse transcription-PCR. The extraction protocol of genomic DNA and rRNA was applicable to complex ecosystem of different cheese matrices.

  5. Microchemostat - microbial continuous culture in a polymer-based, instrumented microbioreactor

    DEFF Research Database (Denmark)

    Zhang, Z.; Bocazzi, P.; Choi, H. G.

    2006-01-01

    -based microbioreactor system integrated with optical density (OD), pH, and dissolved oxygen (DO) real-time measurements for continuous cultivation of microbial cells. Escherichia coli (E. coli) cells are continuously cultured in a 150 mL, membrane-aerated, well-mixed microbioreactor fed by a pressure-driven flow......In a chemostat, microbial cells reach a steady state condition at which cell biomass production, substrates and the product concentrations remain constant. These features make continuous culture a unique and powerful tool for biological and physiological research. We present a polymer...

  6. Characterization of three plant biomass-degrading microbial consortia by metagenomics- and metasecretomics-based approaches

    DEFF Research Database (Denmark)

    Jiménez, Diego Javier; Brossi, Maria Julia de Lima; Schückel, Julia

    2016-01-01

    ), switchgrass (SG-M) and corn stover (CS-M) under aerobic and mesophilic conditions. Molecular fingerprintings, bacterial 16S ribosomal RNA (rRNA) gene amplicon sequencing and metagenomic analyses showed that the three microbial consortia were taxonomically distinct. Based on the taxonomic affiliation......). The highest degradation rates of lignin (~59 %) were observed with SG-M, whereas CS-M showed a high consumption of cellulose and hemicellulose. Analyses of the carbohydrate-active enzymes in the three microbial consortia showed the dominance of glycosyl hydrolases (e.g. of families GH3, GH43, GH13, GH10, GH29...

  7. In situ microbial fuel cell-based biosensor for organic carbon

    DEFF Research Database (Denmark)

    de Jesus dos Santos Peixoto, Luciana; Min, Booki; Martins, Gilberto

    2011-01-01

    The biological oxygen demand (BOD) may be the most used test to assess the amount of pollutant organic matter in water; however, it is time and labor consuming, and is done ex-situ. A BOD biosensor based on the microbial fuel cell principle was tested for online and in situ monitoring of biodegra...

  8. An Ontology-Based GIS for Genomic Data Management of Rumen Microbes.

    Science.gov (United States)

    Jelokhani-Niaraki, Saber; Tahmoorespur, Mojtaba; Minuchehr, Zarrin; Nassiri, Mohammad Reza

    2015-03-01

    During recent years, there has been exponential growth in biological information. With the emergence of large datasets in biology, life scientists are encountering bottlenecks in handling the biological data. This study presents an integrated geographic information system (GIS)-ontology application for handling microbial genome data. The application uses a linear referencing technique as one of the GIS functionalities to represent genes as linear events on the genome layer, where users can define/change the attributes of genes in an event table and interactively see the gene events on a genome layer. Our application adopted ontology to portray and store genomic data in a semantic framework, which facilitates data-sharing among biology domains, applications, and experts. The application was developed in two steps. In the first step, the genome annotated data were prepared and stored in a MySQL database. The second step involved the connection of the database to both ArcGIS and Protégé as the GIS engine and ontology platform, respectively. We have designed this application specifically to manage the genome-annotated data of rumen microbial populations. Such a GIS-ontology application offers powerful capabilities for visualizing, managing, reusing, sharing, and querying genome-related data.

  9. Diffuse flow environments within basalt- and sediment-based hydrothermal vent ecosystems harbor specialized microbial communities.

    Science.gov (United States)

    Campbell, Barbara J; Polson, Shawn W; Zeigler Allen, Lisa; Williamson, Shannon J; Lee, Charles K; Wommack, K Eric; Cary, S Craig

    2013-01-01

    Hydrothermal vents differ both in surface input and subsurface geochemistry. The effects of these differences on their microbial communities are not clear. Here, we investigated both alpha and beta diversity of diffuse flow-associated microbial communities emanating from vents at a basalt-based hydrothermal system along the East Pacific Rise (EPR) and a sediment-based hydrothermal system, Guaymas Basin. Both Bacteria and Archaea were targeted using high throughput 16S rRNA gene pyrosequencing analyses. A unique aspect of this study was the use of a universal set of 16S rRNA gene primers to characterize total and diffuse flow-specific microbial communities from varied deep-sea hydrothermal environments. Both surrounding seawater and diffuse flow water samples contained large numbers of Marine Group I (MGI) Thaumarchaea and Gammaproteobacteria taxa previously observed in deep-sea systems. However, these taxa were geographically distinct and segregated according to type of spreading center. Diffuse flow microbial community profiles were highly differentiated. In particular, EPR dominant diffuse flow taxa were most closely associated with chemolithoautotrophs, and off axis water was dominated by heterotrophic-related taxa, whereas the opposite was true for Guaymas Basin. The diversity and richness of diffuse flow-specific microbial communities were strongly correlated to the relative abundance of Epsilonproteobacteria, proximity to macrofauna, and hydrothermal system type. Archaeal diversity was higher than or equivalent to bacterial diversity in about one third of the samples. Most diffuse flow-specific communities were dominated by OTUs associated with Epsilonproteobacteria, but many of the Guaymas Basin diffuse flow samples were dominated by either OTUs within the Planctomycetes or hyperthermophilic Archaea. This study emphasizes the unique microbial communities associated with geochemically and geographically distinct hydrothermal diffuse flow environments.

  10. Use of a similarity index based on microbial fatty acid (MFA) analysis to monitor biological wastewater treatment systems.

    Science.gov (United States)

    Son, K S; Hall, E R

    2003-09-01

    Estimating the stability of microbial community structures may be useful in advanced biological wastewater treatment system design and operation. In this research, a monitoring method using fatty acid profiles was evaluated for detecting changes in microbial community structures. For the evaluation, the operating parameters such as pH, organic loading, and chlorine addition were varied in two identical laboratory scale conventional activated sludge systems. A similarity index based on microbial fatty acid analysis was used to express the stability of microbial community structures in the systems. Experiments using a model microbial community showed that microbial compositions changed daily even under constant operating conditions and that the rate of change increased under dynamic operating conditions. Substrate changes brought about a relatively large change in a microbial community structure, eventually resulting in a very different microbial community. After only 7 days following a substrate change in a lab-scale bioreactor, the biomass exhibited only 45% similarity to the original structure. The analysis of microbial fatty acids conveys additional information, in that it could be used for the calculation of biomass concentrations in a wastewater treatment system if microbial fatty acid analyses are executed on a routine basis as a monitoring tool for biological wastewater treatment systems. The total fatty acid concentrations were about 0.61% of the biomass concentration as mixed liquor volatile suspended solid concentrations in this research.

  11. Evaluation on the Nanoscale Zero Valent Iron Based Microbial Denitrification for Nitrate Removal from Groundwater.

    Science.gov (United States)

    Peng, Lai; Liu, Yiwen; Gao, Shu-Hong; Chen, Xueming; Xin, Pei; Dai, Xiaohu; Ni, Bing-Jie

    2015-07-22

    Nanoscale zero valent iron (NZVI) based microbial denitrification has been demonstrated to be a promising technology for nitrate removal from groundwater. In this work, a mathematical model is developed to evaluate the performance of this new technology and to provide insights into the chemical and microbial interactions in the system in terms of nitrate reduction, ammonium accumulation and hydrogen turnover. The developed model integrates NZVI-based abiotic reduction of nitrate, NZVI corrosion for hydrogen production and hydrogen-based microbial denitrification and satisfactorily describes all of the nitrate and ammonium dynamics from two systems with highly different conditions. The high NZVI corrosion rate revealed by the model indicates the high reaction rate of NZVI with water due to their large specific surface area and high surface reactivity, leading to an effective microbial nitrate reduction by utilizing the produced hydrogen. The simulation results further suggest a NZVI dosing strategy (3-6 mmol/L in temperature range of 30-40 °C, 6-10 mmol/L in temperature range of 15-30 °C and 10-14 mmol/L in temperature range of 5-15 °C) during groundwater remediation to make sure a low ammonium yield and a high nitrogen removal efficiency.

  12. Vertical Microbial Community Variability of Carbonate-based Cones may Provide Insight into Formation in the Rock Record

    Science.gov (United States)

    Trivedi, C.; Bojanowski, C.; Daille, L. K.; Bradley, J.; Johnson, H.; Stamps, B. W.; Stevenson, B. S.; Berelson, W.; Corsetti, F. A.; Spear, J. R.

    2015-12-01

    Stromatolite morphogenesis is poorly understood, and the process by which microbial mats become mineralized is a primary question in microbialite formation. Ancient conical stromatolites are primarily carbonate-based whereas the few modern analogues in hot springs are either non-mineralized or mineralized by silica. A team from the 2015 International GeoBiology Course investigated carbonate-rich microbial cones from near Little Hot Creek (LHC), Long Valley Caldera, California, to investigate how conical stromatolites might form in a hot spring carbonate system. The cones are up to 3 cm tall and are found in a calm, ~45° C pool near LHC that is 4 times super-saturated with respect to CaCO3. The cones rise from a flat, layered microbial mat at the edge of the pool. Scanning electron microscopy revealed filamentous bacteria associated with calcite crystals within the cone tips. Preliminary 16S rRNA gene analysis indicated variability of community composition between different vertical levels of the cone. The cone tip had comparatively greater abundance of filamentous cyanobacteria (Leptolyngbya and Phormidium) and fewer heterotrophs (e.g. Chloroflexi) compared to the cone bottom. This supports the hypothesis that cone formation may depend on the differential abundance of the microbial community and their potential functional roles. Metagenomic analyses of the cones revealed potential genes related to chemotaxis and motility. Specifically, a genomic bin identified as a member of the genus Isosphaera contained an hmp chemotaxis operon implicated in gliding motility in the cyanobacterium Nostoc punctiforme [1]. Isosphaera is a Planctomycete shown to have phototactic capabilities [2], and may play a role in conjunction with cyanobacteria in the vertical formation of the cones. This analysis of actively growing cones indicates a complex interplay of geochemistry and microbiology that form structures which can serve as models for processes that occurred in the past and are

  13. Whole genome sequence-based serogrouping of Listeria monocytogenes isolates.

    Science.gov (United States)

    Hyden, Patrick; Pietzka, Ariane; Lennkh, Anna; Murer, Andrea; Springer, Burkhard; Blaschitz, Marion; Indra, Alexander; Huhulescu, Steliana; Allerberger, Franz; Ruppitsch, Werner; Sensen, Christoph W

    2016-10-10

    Whole genome sequencing (WGS) is currently becoming the method of choice for characterization of Listeria monocytogenes isolates in national reference laboratories (NRLs). WGS is superior with regards to accuracy, resolution and analysis speed in comparison to several other methods including serotyping, PCR, pulsed field gel electrophoresis (PFGE), multilocus sequence typing (MLST), multilocus variable number tandem repeat analysis (MLVA), and multivirulence-locus sequence typing (MVLST), which have been used thus far for the characterization of bacterial isolates (and are still important tools in reference laboratories today) to control and prevent listeriosis, one of the major sources of foodborne diseases for humans. Backward compatibility of WGS to former methods can be maintained by extraction of the respective information from WGS data. Serotyping was the first subtyping method for L. monocytogenes capable of differentiating 12 serovars and national reference laboratories still perform serotyping and PCR-based serogrouping as a first level classification method for Listeria monocytogenes surveillance. Whole genome sequence based core genome MLST analysis of a L. monocytogenes collection comprising 172 isolates spanning all 12 serotypes was performed for serogroup determination. These isolates clustered according to their serotypes and it was possible to group them either into the IIa, IIc, IVb or IIb clusters, respectively, which were generated by minimum spanning tree (MST) and neighbor joining (NJ) tree data analysis, demonstrating the power of the new approach. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

  14. Genomic and epigenetic insights into the molecular bases of heterosis.

    Science.gov (United States)

    Chen, Z Jeffrey

    2013-07-01

    Heterosis, also known as hybrid vigour, is widespread in plants and animals, but the molecular bases for this phenomenon remain elusive. Recent studies in hybrids and allopolyploids using transcriptomic, proteomic, metabolomic, epigenomic and systems biology approaches have provided new insights. Emerging genomic and epigenetic perspectives suggest that heterosis arises from allelic interactions between parental genomes, leading to altered programming of genes that promote the growth, stress tolerance and fitness of hybrids. For example, epigenetic modifications of key regulatory genes in hybrids and allopolyploids can alter complex regulatory networks of physiology and metabolism, thus modulating biomass and leading to heterosis. The conceptual advances could help to improve plant and animal productivity through the manipulation of heterosis.

  15. Genome-based bioprospecting of microbes for new therapeutics.

    Science.gov (United States)

    Zotchev, Sergey B; Sekurova, Olga N; Katz, Leonard

    2012-12-01

    Bioprospecting of natural sources for new medicines has a long and successful history, exemplified by the fact that over 50% of all drugs currently on the market are either derived from or inspired by natural products. However, development of new natural product-based therapeutics has been on the decline over the past 20 years, mainly owing to frequent re-discovery of already known compounds coupled with high costs for screening, characterization and development. With the onset of the genomic era allowing rapid sequencing and analysis of bacterial and fungal genomes, it became evident that these organisms possess 'hidden treasures' in the form of gene clusters potentially governing biosynthesis of novel biologically active compounds. This review highlights current progress in mining for and expression of these gene clusters, which may revolutionize the drug discovery pipelines in the near future.

  16. RAPD-based screening of genomic libraries for positional cloning.

    Science.gov (United States)

    Dioh, W; Tharreau, D; Lebrun, M H

    1997-12-15

    RAPD markers are frequently used for positional cloning. However, RAPD markers often contain repeated sequences which prevent genomic library screening by hybridisation. We have developed a simple RAPD analysis of genomic libraries based on the identification of cosmid pools and clones amplifying the RAPD marker of interest. Our method does not require the cloning or characterisation of the RAPD marker as it relies on the analysis of cosmid pools or clones using a simple RAPD protocol. We applied this strategy using four RAPD markers composed of single copy or repeated sequences linked to avirulence genes of the rice blast fungus Magnaporthe grisea . Cosmids containing these RAPD markers were easily and rapidly identified allowing the construction of physical contigs at these loci.

  17. Base-By-Base: Single nucleotide-level analysis of whole viral genome alignments

    Directory of Open Access Journals (Sweden)

    Tcherepanov Vasily

    2004-07-01

    Full Text Available Abstract Background With ever increasing numbers of closely related virus genomes being sequenced, it has become desirable to be able to compare two genomes at a level more detailed than gene content because two strains of an organism may share the same set of predicted genes but still differ in their pathogenicity profiles. For example, detailed comparison of multiple isolates of the smallpox virus genome (each approximately 200 kb, with 200 genes is not feasible without new bioinformatics tools. Results A software package, Base-By-Base, has been developed that provides visualization tools to enable researchers to 1 rapidly identify and correct alignment errors in large, multiple genome alignments; and 2 generate tabular and graphical output of differences between the genomes at the nucleotide level. Base-By-Base uses detailed annotation information about the aligned genomes and can list each predicted gene with nucleotide differences, display whether variations occur within promoter regions or coding regions and whether these changes result in amino acid substitutions. Base-By-Base can connect to our mySQL database (Virus Orthologous Clusters; VOCs to retrieve detailed annotation information about the aligned genomes or use information from text files. Conclusion Base-By-Base enables users to quickly and easily compare large viral genomes; it highlights small differences that may be responsible for important phenotypic differences such as virulence. It is available via the Internet using Java Web Start and runs on Macintosh, PC and Linux operating systems with the Java 1.4 virtual machine.

  18. Genomic comparisons of Brucella spp. and closely related bacteria using base compositional and proteome based methods

    Science.gov (United States)

    2010-01-01

    Background Classification of bacteria within the genus Brucella has been difficult due in part to considerable genomic homogeneity between the different species and biovars, in spite of clear differences in phenotypes. Therefore, many different methods have been used to assess Brucella taxonomy. In the current work, we examine 32 sequenced genomes from genus Brucella representing the six classical species, as well as more recently described species, using bioinformatical methods. Comparisons were made at the level of genomic DNA using oligonucleotide based methods (Markov chain based genomic signatures, genomic codon and amino acid frequencies based comparisons) and proteomes (all-against-all BLAST protein comparisons and pan-genomic analyses). Results We found that the oligonucleotide based methods gave different results compared to that of the proteome based methods. Differences were also found between the oligonucleotide based methods used. Whilst the Markov chain based genomic signatures grouped the different species in genus Brucella according to host preference, the codon and amino acid frequencies based methods reflected small differences between the Brucella species. Only minor differences could be detected between all genera included in this study using the codon and amino acid frequencies based methods. Proteome comparisons were found to be in strong accordance with current Brucella taxonomy indicating a remarkable association between gene gain or loss on one hand and mutations in marker genes on the other. The proteome based methods found greater similarity between Brucella species and Ochrobactrum species than between species within genus Agrobacterium compared to each other. In other words, proteome comparisons of species within genus Agrobacterium were found to be more diverse than proteome comparisons between species in genus Brucella and genus Ochrobactrum. Pan-genomic analyses indicated that uptake of DNA from outside genus Brucella appears to be

  19. Annotation-based genome-wide SNP discovery in the large and complex Aegilops tauschii genome using next-generation sequencing without a reference genome sequence

    Directory of Open Access Journals (Sweden)

    Luo Ming-Cheng

    2011-01-01

    Full Text Available Abstract Background Many plants have large and complex genomes with an abundance of repeated sequences. Many plants are also polyploid. Both of these attributes typify the genome architecture in the tribe Triticeae, whose members include economically important wheat, rye and barley. Large genome sizes, an abundance of repeated sequences, and polyploidy present challenges to genome-wide SNP discovery using next-generation sequencing (NGS of total genomic DNA by making alignment and clustering of short reads generated by the NGS platforms difficult, particularly in the absence of a reference genome sequence. Results An annotation-based, genome-wide SNP discovery pipeline is reported using NGS data for large and complex genomes without a reference genome sequence. Roche 454 shotgun reads with low genome coverage of one genotype are annotated in order to distinguish single-copy sequences and repeat junctions from repetitive sequences and sequences shared by paralogous genes. Multiple genome equivalents of shotgun reads of another genotype generated with SOLiD or Solexa are then mapped to the annotated Roche 454 reads to identify putative SNPs. A pipeline program package, AGSNP, was developed and used for genome-wide SNP discovery in Aegilops tauschii-the diploid source of the wheat D genome, and with a genome size of 4.02 Gb, of which 90% is repetitive sequences. Genomic DNA of Ae. tauschii accession AL8/78 was sequenced with the Roche 454 NGS platform. Genomic DNA and cDNA of Ae. tauschii accession AS75 was sequenced primarily with SOLiD, although some Solexa and Roche 454 genomic sequences were also generated. A total of 195,631 putative SNPs were discovered in gene sequences, 155,580 putative SNPs were discovered in uncharacterized single-copy regions, and another 145,907 putative SNPs were discovered in repeat junctions. These SNPs were dispersed across the entire Ae. tauschii genome. To assess the false positive SNP discovery rate, DNA

  20. When whole-genome alignments just won't work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes.

    Science.gov (United States)

    Gardner, Shea N; Hall, Barry G

    2013-01-01

    Effective use of rapid and inexpensive whole genome sequencing for microbes requires fast, memory efficient bioinformatics tools for sequence comparison. The kSNP v2 software finds single nucleotide polymorphisms (SNPs) in whole genome data. kSNP v2 has numerous improvements over kSNP v1 including SNP gene annotation; better scaling for draft genomes available as assembled contigs or raw, unassembled reads; a tool to identify the optimal value of k; distribution of packages of executables for Linux and Mac OS X for ease of installation and user-friendly use; and a detailed User Guide. SNP discovery is based on k-mer analysis, and requires no multiple sequence alignment or the selection of a single reference genome. Most target sets with hundreds of genomes complete in minutes to hours. SNP phylogenies are built by maximum likelihood, parsimony, and distance, based on all SNPs, only core SNPs, or SNPs present in some intermediate user-specified fraction of targets. The SNP-based trees that result are consistent with known taxonomy. kSNP v2 can handle many gigabases of sequence in a single run, and if one or more annotated genomes are included in the target set, SNPs are annotated with protein coding and other information (UTRs, etc.) from Genbank file(s). We demonstrate application of kSNP v2 on sets of viral and bacterial genomes, and discuss in detail analysis of a set of 68 finished E. coli and Shigella genomes and a set of the same genomes to which have been added 47 assemblies and four "raw read" genomes of H104:H4 strains from the recent European E. coli outbreak that resulted in both bloody diarrhea and hemolytic uremic syndrome (HUS), and caused at least 50 deaths.

  1. Methods for understanding microbial community structures and functions in microbial fuel cells: a review.

    Science.gov (United States)

    Zhi, Wei; Ge, Zheng; He, Zhen; Zhang, Husen

    2014-11-01

    Microbial fuel cells (MFCs) employ microorganisms to recover electric energy from organic matter. However, fundamental knowledge of electrochemically active bacteria is still required to maximize MFCs power output for practical applications. This review presents microbiological and electrochemical techniques to help researchers choose the appropriate methods for the MFCs study. Pre-genomic and genomic techniques such as 16S rRNA based phylogeny and metagenomics have provided important information in the structure and genetic potential of electrode-colonizing microbial communities. Post-genomic techniques such as metatranscriptomics allow functional characterizations of electrode biofilm communities by quantifying gene expression levels. Isotope-assisted phylogenetic analysis can further link taxonomic information to microbial metabolisms. A combination of electrochemical, phylogenetic, metagenomic, and post-metagenomic techniques offers opportunities to a better understanding of the extracellular electron transfer process, which in turn can lead to process optimization for power output.

  2. GeoChip-based insights into the microbial functional gene repertoire of marine sponges (high microbial abundance, low microbial abundance) and seawater

    KAUST Repository

    Bayer, Kristina

    2015-01-08

    The GeoChip 4.2 gene array was employed to interrogate the microbial functional gene repertoire of sponges and seawater collected from the Red Sea and the Mediterranean. Complementary amplicon sequencing confirmed the microbial community composition characteristic of high microbial abundance (HMA) and low microbial abundance (LMA) sponges. By use of GeoChip, altogether 20 273 probes encoding for 627 functional genes and representing 16 gene categories were identified. Minimum curvilinear embedding analyses revealed a clear separation between the samples. The HMA/LMA dichotomy was stronger than any possible geographic pattern, which is shown here for the first time on the level of functional genes. However, upon inspection of individual genes, very few specific differences were discernible. Differences were related to microbial ammonia oxidation, ammonification, and archaeal autotrophic carbon fixation (higher gene abundance in sponges over seawater) as well as denitrification and radiation-stress-related genes (lower gene abundance in sponges over seawater). Except for few documented specific differences the functional gene repertoire between the different sources appeared largely similar. This study expands previous reports in that functional gene convergence is not only reported between HMA and LMA sponges but also between sponges and seawater.

  3. Putting microbes into microbial-based models of soil organic matter formation

    Science.gov (United States)

    Grandy, S.

    2012-12-01

    Soil organic matter formation (SOM) is a two-step process that begins with litter decomposition and is followed by the incorporation of C into stable SOM pools. It is known that microbes play a critical role in both processes, but their specific contributions remain undefined and even the newest models of SOM formation largely 'black box' the microbial community. For example, the current paradigm predicts that litter chemical changes during decomposition are predictable based on the extent or stage of decomposition, and that chemically distinct litter types will eventually become indistinguishable after decomposition. This convergence in chemical composition is thought to occur regardless of initial differences in litter quality or variation in biological communities. Further, while microbial ecophysiology may have direct influences on the rates of SOM accumulation, chemistry and stability, how the expression of ecophysiological traits vary across environments and between microbial communities is unclear. Here, I examine the role of decomposer communities in regulating the two-step SOM formation process. First, I examine litter chemical changes and convergence during decomposition and the degree to which they are influenced by decomposer community characteristics. In contrast to the current paradigm, I show that decomposer communities strongly mediate changes in litter chemistry during decomposition, and that there is neither chemical convergence nor a consistent, predictable change in litter chemistry during decomposition. Instead, litter chemical changes are a function of the interactions between initial litter quality and the resident decomposer communities, which often vary among soil ecosystems. Second, I identify several knowledge gaps that need to be addressed in order to advance our understanding of litter incorporation into stable SOM pools, focusing on the importance of microbial physiological processes to SOM formation, including: 1) differences in

  4. Estimation of the duodenal flow of microbial nitrogen in ruminants based on the chemical composition of forages: a literature review

    NARCIS (Netherlands)

    Gosselink, J.M.J.; Poncet, C.; Dulphy, J.P.; Cone, J.W.

    2003-01-01

    The objective of this study was to evaluate the estimation of the duodenal flow of microbial nitrogen (N) in ruminants fed forage only, per kilogram of dry matter (DM) intake, which is the yield of microbial protein (YMP). The estimation was based on the chemical composition of forages. A data file

  5. Integrated Genome-Based Studies of Shewanella Ecophysiology

    Energy Technology Data Exchange (ETDEWEB)

    Andrei L. Osterman, Ph.D.

    2012-12-17

    Integration of bioinformatics and experimental techniques was applied to mapping and characterization of the key components (pathways, enzymes, transporters, regulators) of the core metabolic machinery in Shewanella oneidensis and related species with main focus was on metabolic and regulatory pathways involved in utilization of various carbon and energy sources. Among the main accomplishments reflected in ten joint publications with other participants of Shewanella Federation are: (i) A systems-level reconstruction of carbohydrate utilization pathways in the genus of Shewanella (19 species). This analysis yielded reconstruction of 18 sugar utilization pathways including 10 novel pathway variants and prediction of > 60 novel protein families of enzymes, transporters and regulators involved in these pathways. Selected functional predictions were verified by focused biochemical and genetic experiments. Observed growth phenotypes were consistent with bioinformatic predictions providing strong validation of the technology and (ii) Global genomic reconstruction of transcriptional regulons in 16 Shewanella genomes. The inferred regulatory network includes 82 transcription factors, 8 riboswitches and 6 translational attenuators. Of those, 45 regulons were inferred directly from the genome context analysis, whereas others were propagated from previously characterized regulons in other species. Selected regulatory predictions were experimentally tested. Integration of this analysis with microarray data revealed overall consistency and provided additional layer of interactions between regulons. All the results were captured in the new database RegPrecise, which is a joint development with the LBNL team. A more detailed analysis of the individual subsystems, pathways and regulons in Shewanella spp included bioinfiormatics-based prediction and experimental characterization of: (i) N-Acetylglucosamine catabolic pathway; (ii)Lactate utilization machinery; (iii) Novel Nrt

  6. Annotation-Based Whole Genomic Prediction and Selection

    DEFF Research Database (Denmark)

    Kadarmideen, Haja; Do, Duy Ngoc; Janss, Luc;

    in their contribution to estimated genomic variances and in prediction of genomic breeding values by applying SNP annotation approaches to feed efficiency. Ensembl Variant Predictor (EVP) and Pig QTL database were used as the source of genomic annotation for 60K chip. Genomic prediction was performed using the Bayes...... prove useful for less heritable traits such as diseases and fertility...

  7. Draft genome sequence of Lampropedia cohaerens strain CT6(T) isolated from arsenic rich microbial mats of a Himalayan hot water spring.

    Science.gov (United States)

    Tripathi, Charu; Mahato, Nitish K; Rani, Pooja; Singh, Yogendra; Kamra, Komal; Lal, Rup

    2016-01-01

    Lampropedia cohaerens strain CT6(T), a non-motile, aerobic and coccoid strain was isolated from arsenic rich microbial mats (temperature ~45 °C) of a hot water spring located atop the Himalayan ranges at Manikaran, India. The present study reports the first genome sequence of type strain CT6(T) of genus Lampropedia cohaerens. Sequencing data was generated using the Illumina HiSeq 2000 platform and assembled with ABySS v 1.3.5. The 3,158,922 bp genome was assembled into 41 contigs with a mean GC content of 63.5 % and 2823 coding sequences. Strain CT6(T) was found to harbour genes involved in both the Entner-Duodoroff pathway and non-phosphorylated ED pathway. Strain CT6(T) also contained genes responsible for imparting resistance to arsenic, copper, cobalt, zinc, cadmium and magnesium, providing survival advantages at a thermal location. Additionally, the presence of genes associated with biofilm formation, pyrroloquinoline-quinone production, isoquinoline degradation and mineral phosphate solubilisation in the genome demonstrate the diverse genetic potential for survival at stressed niches.

  8. Heteropolymeric triplex-based genomic assay to detect pathogens or single-nucleotide polymorphisms in human genomic samples.

    Directory of Open Access Journals (Sweden)

    Jasmine I Daksis

    Full Text Available Human genomic samples are complex and are considered difficult to assay directly without denaturation or PCR amplification. We report the use of a base-specific heteropolymeric triplex, formed by native duplex genomic target and an oligonucleotide third strand probe, to assay for low copy pathogen genomes present in a sample also containing human genomic duplex DNA, or to assay human genomic duplex DNA for Single Nucleotide Polymorphisms (SNP, without PCR amplification. Wild-type and mutant probes are used to identify triplexes containing FVL G1691A, MTHFR C677T and CFTR mutations. The specific triplex structure forms rapidly at room temperature in solution and may be detected without a separation step. YOYO-1, a fluorescent bis-intercalator, promotes and signals the formation of the specific triplex. Genomic duplexes may be assayed homogeneously with single base pair resolution. The specific triple-stranded structures of the assay may approximate homologous recombination intermediates, which various models suggest may form in either the major or minor groove of the duplex. The bases of the stable duplex target are rendered specifically reactive to the bases of the probe because of the activity of intercalated YOYO-1, which is known to decondense duplex locally 1.3 fold. This may approximate the local decondensation effected by recombination proteins such as RecA in vivo. Our assay, while involving triplex formation, is sui generis, as it is not homopurine sequence-dependent, as are "canonical triplexes". Rather, the base pair-specific heteropolymeric triplex of the assay is conformation-dependent. The highly sensitive diagnostic assay we present allows for the direct detection of base sequence in genomic duplex samples, including those containing human genomic duplex DNA, thereby bypassing the inherent problems and cost associated with conventional PCR based diagnostic assays.

  9. Microbial Murders Crime Scene Investigation: An Active Team-Based Learning Project that Enhances Student Enthusiasm and Comprehension of Clinical Microbial Pathogens.

    Science.gov (United States)

    Steel, J Jordan

    2017-01-01

    Microbial disease knowledge is a critical component of microbiology courses and is beneficial for many students' future careers. Microbiology courses traditionally cover core concepts through lectures and labs, but specific instruction on microbial diseases varies greatly depending on the instructor and course. A common project involves students researching and presenting a disease to the class. This method alone is not very effective, and course evaluations have consistently indicated that students felt they lacked adequate disease knowledge; therefore, a more hands-on and interactive disease project was developed called Microbial Murders. For this team-based project, a group of students chooses a pathogen, researches the disease, creates a "mugshot" of the pathogen, and develops a corresponding "crime scene," where a hypothetical patient has died from the microbe. Each group gives a presentation introducing the microbial pathogen, signs/symptoms, treatments, and overall characteristics. The students then visit each other's crime scenes to match the pathogen with the correct crime scene by critically thinking through the clues. This project has shown remarkable success. Surveys indicate that 73% of students thought the project helped them understand the material and 84% said it was worth their time. Student participation, excitement, understanding, and application of microbial disease knowledge have increased and are evident through an increase in course evaluations and in student assessment scores. This project is easy to implement and can be used in a wide variety of biology, microbiology, or health classes for any level (middle school through college).

  10. Colibri: a functional data base for the Escherichia coli genome.

    Science.gov (United States)

    Médigue, C; Viari, A; Hénaut, A; Danchin, A

    1993-09-01

    Several data libraries have been created to organize all the data obtained worldwide about the Escherichia coli genome. Because the known data now amount to more than 40% of the whole genome sequence, it has become necessary to organize the data in such a way that appropriate procedures can associate knowledge produced by experiments about each gene to its position on the chromosome and its relation to other relevant genes, for example. In addition, global properties of genes, affected by the introduction of new entries, should be present as appropriate description fields. A data base, implemented on Macintosh by using the data base management system 4th Dimension, is described. It is constructed around a core constituted by known contigs of E. coli sequences and links data collected in general libraries (unmodified) to data associated with evolving knowledge (with modifiable fields). Biologically significant results obtained through the coupling of appropriate procedures (learning or statistical data analysis) are presented. The data base is available through a 4th Dimension runtime and through FTP on Internet. It has been regularly updated and will be systematically linked to other E. coli data bases (M. Kroger, R. Wahl, G. Schachtel, and P. Rice, Nucleic Acids Res. 20(Suppl.):2119-2144, 1992; K. E. Rudd, W. Miller, C. Werner, J. Ostell, C. Tolstoshev, and S. G. Satterfield, Nucleic Acids Res. 19:637-647, 1991) in the near future.

  11. Genome size evolution in pufferfish: an insight from BAC clone-based Diodon holocanthus genome sequencing

    Directory of Open Access Journals (Sweden)

    Gan Xiaoni

    2010-06-01

    Full Text Available Abstract Background Variations in genome size within and between species have been observed since the 1950 s in diverse taxonomic groups. Serving as model organisms, smooth pufferfish possess the smallest vertebrate genomes. Interestingly, spiny pufferfish from its sister family have genome twice as large as smooth pufferfish. Therefore, comparative genomic analysis between smooth pufferfish and spiny pufferfish is useful for our understanding of genome size evolution in pufferfish. Results Ten BAC clones of a spiny pufferfish Diodon holocanthus were randomly selected and shotgun sequenced. In total, 776 kb of non-redundant sequences without gap representing 0.1% of the D. holocanthus genome were identified, and 77 distinct genes were predicted. In the sequenced D. holocanthus genome, 364 kb is homologous with 265 kb of the Takifugu rubripes genome, and 223 kb is homologous with 148 kb of the Tetraodon nigroviridis genome. The repetitive DNA accounts for 8% of the sequenced D. holocanthus genome, which is higher than that in the T. rubripes genome (6.89% and that in the Te. nigroviridis genome (4.66%. In the repetitive DNA, 76% is retroelements which account for 6% of the sequenced D. holocanthus genome and belong to known families of transposable elements. More than half of retroelements were distributed within genes. In the non-homologous regions, repeat element proportion in D. holocanthus genome increased to 10.6% compared with T. rubripes and increased to 9.19% compared with Te. nigroviridis. A comparison of 10 well-defined orthologous genes showed that the average intron size (566 bp in D. holocanthus genome is significantly longer than that in the smooth pufferfish genome (435 bp. Conclusion Compared with the smooth pufferfish, D. holocanthus has a low gene density and repeat elements rich genome. Genome size variation between D. holocanthus and the smooth pufferfish exhibits as length variation between homologous region and different

  12. Genomics and Public Health: Development of Web-based Training

    OpenAIRE

    Janice V. Bach, MS; Aaron Goldenberg, MA, MPH; Toby Citrin, JD; Sarah F. Raup, MPH; Jennifer Bodzin, MPH; Sharon L.R. Kardia, PhD

    2005-01-01

    In 2001, the Centers for Disease Control and Prevention funded three Centers for Genomics and Public Health to develop training tools for increasing genomic awareness. Over the past three years, the centers, working together with the Centers for Disease Control and Preventions Office of Genomics and Disease Prevention, have developed tools to increase awareness of the impact genomics will have on public health practice, to provide a foundation for understanding basic genomic advances, and to ...

  13. Microbial composition during Chinese soy sauce koji-making based on culture dependent and independent methods.

    Science.gov (United States)

    Yan, Yin-zhuo; Qian, Yu-lin; Ji, Feng-di; Chen, Jing-yu; Han, Bei-zhong

    2013-05-01

    Koji-making is a key process for production of high quality soy sauce. The microbial composition during koji-making was investigated by culture-dependent and culture-independent methods to determine predominant bacterial and fungal populations. The culture-dependent methods used were direct culture and colony morphology observation, and PCR amplification of 16S/26S rDNA fragments followed by sequencing analysis. The culture-independent method was based on the analysis of 16S/26S rDNA clone libraries. There were differences between the results obtained by different methods. However, sufficient overlap existed between the different methods to identify potentially significant microbial groups. 16 and 20 different bacterial species were identified using culture-dependent and culture-independent methods, respectively. 7 species could be identified by both methods. The most predominant bacterial genera were Weissella and Staphylococcus. Both 6 different fungal species were identified using culture-dependent and culture-independent methods, respectively. Only 3 species could be identified by both sets of methods. The most predominant fungi were Aspergillus and Candida species. This work illustrated the importance of a comprehensive polyphasic approach in the analysis of microbial composition during soy sauce koji-making, the knowledge of which will enable further optimization of microbial composition and quality control of koji to upgrade Chinese traditional soy sauce product.

  14. Soil microbial toxicity assessment of a copper-based fungicide in two contrasting soils

    Science.gov (United States)

    Dober, Melanie; Deltedesco, Evi; Jöchlinger, Lisa; Schneider, Martin; Gorfer, Markus; Bruckner, Alexander; Zechmeister-Boltenstern, Sophie; Soja, Gerhard; Zehetner, Franz; Keiblinger, Katharina Maria

    2016-04-01

    The infestation with the fungus downy mildew (Plasmopara viticola) causes dramatic losses in wine production. Copper (Cu) based fungicides have been used in viticulture since the end of the 19th century, and until today both conventional and organic viticulture strongly rely on Cu to prevent and reduce fungal diseases. Consequently, Cu has built up in many vineyard soils and it is still unclear how this affects soil functioning. The aim of the present study is the evaluation of the soil microbial toxicity of Cu contamination. Two contrasting agricultural soils, an acidic sandy soil and a calcareous loamy soil, were sampled to conduct an eco-toxicological greenhouse pot experiment. The soils were spiked with a commonly used fungicide based on copper hydroxid in seven concentrations (0, 50, 100, 200, 500, 1500 and 5000 mg Cu kg-1 soil) and Lucerne (Medicago sativa L. cultivar. Plato) was grown in the pots for 3 months. Sampling was conducted at the beginning and at the end of the study period to test copper's soil microbial toxicity in total microbial biomass and basal respiration, as well as enzyme activities, such as exoglucanase, β-glucosidase, exochitinase, phosphatase, protease, phenol-, peroxidase and urease. Additionally, DOC, TN, Cmic, Nmic, NO3 and NH4 were determined to provide further insight into the carbon and nitrogen cycle. Microbial community structure was analysed by phospholipid fatty acids (PLFAs), and ergosterol as a fungal biomarker. In addition, molecular tools were applied by extracting soil DNA and performing real time quantitative polymerase chain reaction (qPCR) and a metagenomic approach using 16S and ITS amplification and sequencing with MiSeq platform for the second sampling. Hydrolytic extracellular enzymes were not clearly affected by rising Cu concentrations, while a trend of increasing activity of oxidative enzymes (phenol- and peroxidase) was observed. Microbial respiration rate as well as the amount of Cmic and Nmic decreased with

  15. An Integrative Pathway-based Clinical-genomic Model for Cancer Survival Prediction.

    Science.gov (United States)

    Chen, Xi; Wang, Lily; Ishwaran, Hemant

    2010-09-01

    Prediction models that use gene expression levels are now being proposed for personalized treatment of cancer, but building accurate models that are easy to interpret remains a challenge. In this paper, we describe an integrative clinical-genomic approach that combines both genomic pathway and clinical information. First, we summarize information from genes in each pathway using Supervised Principal Components (SPCA) to obtain pathway-based genomic predictors. Next, we build a prediction model based on clinical variables and pathway-based genomic predictors using Random Survival Forests (RSF). Our rationale for this two-stage procedure is that the underlying disease process may be influenced by environmental exposure (measured by clinical variables) and perturbations in different pathways (measured by pathway-based genomic variables), as well as their interactions. Using two cancer microarray datasets, we show that the pathway-based clinical-genomic model outperforms gene-based clinical-genomic models, with improved prediction accuracy and interpretability.

  16. Systems biology: Functional analysis of natural microbial consortia using community proteomics.

    Science.gov (United States)

    VerBerkmoes, Nathan C; Denef, Vincent J; Hettich, Robert L; Banfield, Jillian F

    2009-03-01

    We know very little about the metabolic functioning and evolutionary dynamics of microbial communities. Recent advances in comprehensive, sequencing-based methods, however, are laying a molecular foundation for new insights into how microbial communities shape the Earth's biosphere. Here we explore the convergence of microbial ecology, genomics, biological mass spectrometry and informatics that form the new field of microbial community proteogenomics. We discuss the first applications of proteogenomics and its potential for studying the physiology, ecology and evolution of microbial populations and communities.

  17. Solution-based targeted genomic enrichment for precious DNA samples

    Directory of Open Access Journals (Sweden)

    Shearer Aiden

    2012-05-01

    Full Text Available Abstract Background Solution-based targeted genomic enrichment (TGE protocols permit selective sequencing of genomic regions of interest on a massively parallel scale. These protocols could be improved by: 1 modifying or eliminating time consuming steps; 2 increasing yield to reduce input DNA and excessive PCR cycling; and 3 enhancing reproducible. Results We developed a solution-based TGE method for downstream Illumina sequencing in a non-automated workflow, adding standard Illumina barcode indexes during the post-hybridization amplification to allow for sample pooling prior to sequencing. The method utilizes Agilent SureSelect baits, primers and hybridization reagents for the capture, off-the-shelf reagents for the library preparation steps, and adaptor oligonucleotides for Illumina paired-end sequencing purchased directly from an oligonucleotide manufacturing company. Conclusions This solution-based TGE method for Illumina sequencing is optimized for small- or medium-sized laboratories and addresses the weaknesses of standard protocols by reducing the amount of input DNA required, increasing capture yield, optimizing efficiency, and improving reproducibility.

  18. Removal Kinetics of Organic Matter and Nitrogen Using Microbial Electrochemical Based - Constructed Wetlands (iMETland)

    DEFF Research Database (Denmark)

    Ramírez Vargas, Carlos Andrés; Arias, Carlos Alberto; Carvalho, Pedro

    In recent years the combination of Constructed Wetlands and Microbial Fuel Cell (MFC), has led to an innovative set- up for wastewater treatment and energy harvesting, relaying on electrodes and external circuits (CW – MFC). Based on this approach, a new concept is being developed to create...... the Microbial Electrochemical-based Constructed Wetland (iMETland). In this system electro- active bacteria – EAB (e.g. Geobacter sp., Shewanella spp) are stimulated to release and transfer electrons to an electro-conductive material that act as unlimited electron acceptor, maximizing the substrate consumption....... The iMETland technology is still in development and therefore uncertainties still exist regarding the dynamics in the removal of pollutants, as well as in its performance along time. To elucidate these uncertainties, a benchmark study is being conducted to characterize the processes and interactions n...

  19. Identification of genomic sites for CRISPR/Cas9-based genome editing in the Vitis vinifera genome.

    Science.gov (United States)

    Wang, Yi; Liu, Xianju; Ren, Chong; Zhong, Gan-Yuan; Yang, Long; Li, Shaohua; Liang, Zhenchang

    2016-04-21

    CRISPR/Cas9 has been recently demonstrated as an effective and popular genome editing tool for modifying genomes of humans, animals, microorganisms, and plants. Success of such genome editing is highly dependent on the availability of suitable target sites in the genomes to be edited. Many specific target sites for CRISPR/Cas9 have been computationally identified for several annual model and crop species, but such sites have not been reported for perennial, woody fruit species. In this study, we identified and characterized five types of CRISPR/Cas9 target sites in the widely cultivated grape species Vitis vinifera and developed a user-friendly database for editing grape genomes in the future. A total of 35,767,960 potential CRISPR/Cas9 target sites were identified from grape genomes in this study. Among them, 22,597,817 target sites were mapped to specific genomic locations and 7,269,788 were found to be highly specific. Protospacers and PAMs were found to distribute uniformly and abundantly in the grape genomes. They were present in all the structural elements of genes with the coding region having the highest abundance. Five PAM types, TGG, AGG, GGG, CGG and NGG, were observed. With the exception of the NGG type, they were abundantly present in the grape genomes. Synteny analysis of similar genes revealed that the synteny of protospacers matched the synteny of homologous genes. A user-friendly database containing protospacers and detailed information of the sites was developed and is available for public use at the Grape-CRISPR website ( http://biodb.sdau.edu.cn/gc/index.html ). Grape genomes harbour millions of potential CRISPR/Cas9 target sites. These sites are widely distributed among and within chromosomes with predominant abundance in the coding regions of genes. We developed a publicly-accessible Grape-CRISPR database for facilitating the use of the CRISPR/Cas9 system as a genome editing tool for functional studies and molecular breeding of grapes. Among

  20. Pyrosequencing for microbial identification and characterization.

    Science.gov (United States)

    Cummings, Patrick J; Ahmed, Ray; Durocher, Jeffrey A; Jessen, Adam; Vardi, Tamar; Obom, Kristina M

    2013-08-22

    Pyrosequencing is a versatile technique that facilitates microbial genome sequencing that can be used to identify bacterial species, discriminate bacterial strains and detect genetic mutations that confer resistance to anti-microbial agents. The advantages of pyrosequencing for microbiology applications include rapid and reliable high-throughput screening and accurate identification of microbes and microbial genome mutations. Pyrosequencing involves sequencing of DNA by synthesizing the complementary strand a single base at a time, while determining the specific nucleotide being incorporated during the synthesis reaction. The reaction occurs on immobilized single stranded template DNA where the four deoxyribonucleotides (dNTP) are added sequentially and the unincorporated dNTPs are enzymatically degraded before addition of the next dNTP to the synthesis reaction. Detection of the specific base incorporated into the template is monitored by generation of chemiluminescent signals. The order of dNTPs that produce the chemiluminescent signals determines the DNA sequence of the template. The real-time sequencing capability of pyrosequencing technology enables rapid microbial identification in a single assay. In addition, the pyrosequencing instrument, can analyze the full genetic diversity of anti-microbial drug resistance, including typing of SNPs, point mutations, insertions, and deletions, as well as quantification of multiple gene copies that may occur in some anti-microbial resistance patterns.

  1. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources.

    Directory of Open Access Journals (Sweden)

    Stéphane Vuilleumier

    Full Text Available BACKGROUND: Methylotrophy describes the ability of organisms to grow on reduced organic compounds without carbon-carbon bonds. The genomes of two pink-pigmented facultative methylotrophic bacteria of the Alpha-proteobacterial genus Methylobacterium, the reference species Methylobacterium extorquens strain AM1 and the dichloromethane-degrading strain DM4, were compared. METHODOLOGY/PRINCIPAL FINDINGS: The 6.88 Mb genome of strain AM1 comprises a 5.51 Mb chromosome, a 1.26 Mb megaplasmid and three plasmids, while the 6.12 Mb genome of strain DM4 features a 5.94 Mb chromosome and two plasmids. The chromosomes are highly syntenic and share a large majority of genes, while plasmids are mostly strain-specific, with the exception of a 130 kb region of the strain AM1 megaplasmid which is syntenic to a chromosomal region of strain DM4. Both genomes contain large sets of insertion elements, many of them strain-specific, suggesting an important potential for genomic plasticity. Most of the genomic determinants associated with methylotrophy are nearly identical, with two exceptions that illustrate the metabolic and genomic versatility of Methylobacterium. A 126 kb dichloromethane utilization (dcm gene cluster is essential for the ability of strain DM4 to use DCM as the sole carbon and energy source for growth and is unique to strain DM4. The methylamine utilization (mau gene cluster is only found in strain AM1, indicating that strain DM4 employs an alternative system for growth with methylamine. The dcm and mau clusters represent two of the chromosomal genomic islands (AM1: 28; DM4: 17 that were defined. The mau cluster is flanked by mobile elements, but the dcm cluster disrupts a gene annotated as chelatase and for which we propose the name "island integration determinant" (iid. CONCLUSION/SIGNIFICANCE: These two genome sequences provide a platform for intra- and interspecies genomic comparisons in the genus Methylobacterium, and for investigations of the

  2. Reads2Type: a web application for rapid microbial taxonomy identification

    DEFF Research Database (Denmark)

    Saputra, Dhany; Rasmussen, Simon; Larsen, Mette Voldby

    2015-01-01

    genome of microbial isolates. Therefore we have developed Reads2Type, a web-based tool for taxonomy identification based on whole bacterial genome sequence data. Raw sequencing data provided by the user are mapped against a set of marker probes that are derived from currently available bacteria complete...

  3. Robust identification of orthologues and paralogues for microbial pan-genomics using GET_HOMOLOGUES: a case study of pIncA/C plasmids.

    Science.gov (United States)

    Vinuesa, Pablo; Contreras-Moreira, Bruno

    2015-01-01

    GET_HOMOLOGUES is an open-source software package written in Perl and R to define robust core- and pan-genomes by computing consensus clusters of orthologous gene families from whole-genome sequences using the bidirectional best-hit, COGtriangles, and OrthoMCL clustering algorithms. The granularity of the clusters can be fine-tuned by a user-configurable filtering strategy based on a combination of blastp pairwise alignment parameters, hmmscan-based scanning of Pfam domain composition of the proteins in each cluster, and a partial synteny criterion. We present detailed protocols to fit exponential and binomial mixture models to estimate core- and pan-genome sizes, compute pan-genome trees from the pan-genome matrix using a parsimony criterion, analyze and graphically represent the pan-genome structure, and identify lineage-specific gene families for the 12 complete pIncA/C plasmids currently available in NCBI's RefSeq. The software package, license, and detailed user manual can be downloaded for free for academic use from two mirrors: http://www.eead.csic.es/compbio/soft/gethoms.php and http://maya.ccg.unam.mx/soft/gethoms.php.

  4. Analysis of chimpanzee history based on genome sequence alignments.

    Directory of Open Access Journals (Sweden)

    Jennifer L Caswell

    2008-04-01

    Full Text Available Population geneticists often study small numbers of carefully chosen loci, but it has become possible to obtain orders of magnitude for more data from overlaps of genome sequences. Here, we generate tens of millions of base pairs of multiple sequence alignments from combinations of three western chimpanzees, three central chimpanzees, an eastern chimpanzee, a bonobo, a human, an orangutan, and a macaque. Analysis provides a more precise understanding of demographic history than was previously available. We show that bonobos and common chimpanzees were separated approximately 1,290,000 years ago, western and other common chimpanzees approximately 510,000 years ago, and eastern and central chimpanzees at least 50,000 years ago. We infer that the central chimpanzee population size increased by at least a factor of 4 since its separation from western chimpanzees, while the western chimpanzee effective population size decreased. Surprisingly, in about one percent of the genome, the genetic relationships between humans, chimpanzees, and bonobos appear to be different from the species relationships. We used PCR-based resequencing to confirm 11 regions where chimpanzees and bonobos are not most closely related. Study of such loci should provide information about the period of time 5-7 million years ago when the ancestors of humans separated from those of the chimpanzees.

  5. Kernel Based Nonlinear Dimensionality Reduction and Classification for Genomic Microarray

    Directory of Open Access Journals (Sweden)

    Lan Shu

    2008-07-01

    Full Text Available Genomic microarrays are powerful research tools in bioinformatics and modern medicinal research because they enable massively-parallel assays and simultaneous monitoring of thousands of gene expression of biological samples. However, a simple microarray experiment often leads to very high-dimensional data and a huge amount of information, the vast amount of data challenges researchers into extracting the important features and reducing the high dimensionality. In this paper, a nonlinear dimensionality reduction kernel method based locally linear embedding(LLE is proposed, and fuzzy K-nearest neighbors algorithm which denoises datasets will be introduced as a replacement to the classical LLE’s KNN algorithm. In addition, kernel method based support vector machine (SVM will be used to classify genomic microarray data sets in this paper. We demonstrate the application of the techniques to two published DNA microarray data sets. The experimental results confirm the superiority and high success rates of the presented method.

  6. CRISPR-Cas9 Based Engineering of Actinomycetal Genomes

    DEFF Research Database (Denmark)

    Tong, Yaojun; Charusanti, Pep; Zhang, Lixin

    2015-01-01

    . To facilitate the genetic manipulation of actinomycetes, we developed a highly efficient CRISPR-Cas9 system to delete gene(s) or gene cluster(s), implement precise gene replacements, and reversibly control gene expression in actinomycetes. We demonstrate our system by targeting two genes, actIORF1 (SCO5087......) and actVB (SCO5092), from the actinorhodin biosynthetic gene cluster in Streptomyces coelicolor A3(2). Our CRISPR-Cas9 system successfully inactivated the targeted genes. When no templates for homology-directed repair (HDR) were present, the site-specific DNA double-strand breaks (DSBs) introduced by Cas9....... Moreover, we developed a system to efficiently and reversibly control expression of target genes, deemed CRISPRi, based on a catalytically dead variant of Cas9 (dCas9). The CRISPR-Cas9 based system described here comprises a powerful and broadly applicable set of tools to manipulate actinomycetal genomes....

  7. The tool of microbial genomics research for interpreting the lability of permafrost carbon and potential greenhouse gas feedbacks at different scales of resolution.

    Science.gov (United States)

    Waldrop, M. P.; Machelprang, R.; Hultman, J.; Wickland, K. P.

    2012-12-01

    community composition to gain insight into greenhouse gas production. At Bonanza Creek, there are large differences in microbial community composition between intact permafrost and thermokarst bog soils. Metagenomic and metatransciptome analysis of microbial communities reveal that changes in community composition and representative functional genes reflect the dominant greenhouse gas producing processes occurring in those environments. Results indicated that the relative abundance of microbial functional groups based upon functional genes (e.g. methanogens, nitrate reducers, sulfate reducers) mirror the geochemistry of the system and pathways of potential greenhouse gas production. These data help to understand linkages between the level of detail undertaken within -omics research and the temporal scale of biogeochemical processes under examination.

  8. Using a trait-based approach to link microbial community composition and functioning to soil salinity

    Science.gov (United States)

    Rath, Kristin; Fierer, Noah; Rousk, Johannes

    2017-04-01

    Our knowledge of the dynamics structuring microbial communities and the consequences this has for soil functions is rudimentary. In particular, predictions of the response of microbial communities to environmental change and the implications for associated ecosystem processes remain elusive. Understanding how environmental factors structure microbial communities and regulate the functions they perform is key to a mechanistic understanding of how biogeochemical cycles respond to environmental change. Soil salinization is an agricultural problem in many parts of the world. The activity of soil microorganisms is reduced in saline soils compared to non-saline soil. However, soil salinity often co-varies with other factors, making it difficult to assign responses of microbial communities to direct effects of salinity. A trait-based approach allows us to connect the environmental factor salinity with the responses of microbial community composition and functioning. Salinity along a salinity gradient serves as a filter for the community trait distribution of salt tolerance, selecting for higher salt tolerance at more saline sites. This trait-environment relationship can be used to predict responses of microbial communities to environmental change. Our aims were to (i) use salinity along natural salinity gradients as an environmental filter, and (ii) link the resulting filtered trait-distributions of the communities (the trait being salt tolerance) to the community composition. Soil samples were obtained from two replicated salinity gradients along an Australian salt lake, spanning a wide range of soil salinities (0.1 dS m-1 to >50 dS m-1). In one of the two gradients salinity was correlated with pH. Community trait distributions for salt tolerance were assessed by establishing dose-dependences for extracted bacterial communities using growth rate assays. In addition, functional parameters were measured along the salt gradients. Community composition of sites was compared

  9. Microbial synthesis of iron-based nanomaterials—A review

    Indian Academy of Sciences (India)

    Abhilash; K Revati; B D Pandey

    2011-04-01

    Nanoparticles are the materials having dimensions of the order of 100 nm or less. They exhibit a high surface/volume ratio leading to different properties far different from those of the bulk materials. The development of uniform nanoparticles has been intensively pursued because of their technological and fundamental scientific importance. A number of chemical methods are available and are extensively used, but these are often energy intensive and employ toxic chemicals. An alternative approach for the synthesis of uniform nanoparticles is the biological route that occurs at ambient temperature, pressure and at neutral pH. The main aim of this review is to enlist and compare various methods of synthesis of iron-based nanoparticles with emphasis on the biological method. Biologically induced and controlled mineralization mechanisms are the two modes through which the micro-organisms synthesize iron oxide nanoparticles. In biologically induced mineralization (BIM) mode, the environmental factors like pH, pO2, pCO2, redox potential, temperature etc govern the synthesis of iron oxide nanoparticles. In contrast, biologically controlled mineralization (BCM) process initiates the micro-organism itself to control the synthesis. BIM can be observed in the Fe(III) reducing bacterial species of Shewanella, Geobacter, Thermoanaerobacter, and sulphate reducing bacterial species of Archaeoglobus fulgidus, Desulfuromonas acetoxidans, whereas BCM mode can be observed in the magnetotactic bacteria (MTB) like Magnetospirillum magnetotacticum, M. gryphiswaldense and sulphate-reducing magnetic bacteria (Desulfovibrio magneticus). Magnetite crystals formed by Fe(III)-reducing bacteria are epicellular, poorly crystalline, irregular in shapes, having a size range of 10–50 nm super-paramagnetic particles, with a saturation magnetization value ranging from 75–77 emu/g and are not aligned in chains. Magnetite crystals produced by MTB have uniform species-specific morphologies and sizes

  10. AgBase: a functional genomics resource for agriculture

    OpenAIRE

    2006-01-01

    Abstract Background Many agricultural species and their pathogens have sequenced genomes and more are in progress. Agricultural species provide food, fiber, xenotransplant tissues, biopharmaceuticals and biomedical models. Moreover, many agricultural microorganisms are human zoonoses. However, systems biology from functional genomics data is hindered in agricultural species because agricultural genome sequences have relatively poor structural and functional annotation and agricultural researc...

  11. An electrodynamics-based model for ion diffusion in microbial polysaccharides.

    Science.gov (United States)

    Liu, Chongxuan; Zachara, John M; Felmy, Andrew; Gorby, Yuri

    2004-10-10

    An electrodynamics-based model was formulated for simulation of ion diffusion in microbial polysaccharides. The fixed charges and electrostatic double layers that may associate with microbial polysaccharides and their effects on ion diffusion were explicitly built into the model. The model extends a common multicomponent ion diffusion formulation that is based on irreversible thermodynamics under a zero ionic charge flux condition, which is only applicable to the regions without fixed charges and electrostatic double layers. An efficient numerical procedure was presented to solve the differential equations in the model. The model well described key features of experimental observations of ion diffusion in negatively charged microbial polysaccharides including accelerated diffusive transport of cations, exclusion of anions, and increased rate of cation transport with increasing negative charge density. The simulated diffusive fluxes of cations and anions were consistent with a cation exchange diffusion concept in negatively charged polysaccharides at the interface of plant roots and soils; and the developed model allows to mathematically study such diffusion phenomena. An illustrative example was also provided to simulate dynamic behavior of ionic current during ion diffusion within a charged bacterial cell wall polysaccharide and the effects of the ionic current on the compression or expansion of the bacterial electrostatic double layer at the interface of the cell wall and bulk solution.

  12. Genome-scale constraint-based modeling of Geobacter metallireducens

    Directory of Open Access Journals (Sweden)

    Famili Iman

    2009-01-01

    Full Text Available Abstract Background Geobacter metallireducens was the first organism that can be grown in pure culture to completely oxidize organic compounds with Fe(III oxide serving as electron acceptor. Geobacter species, including G. sulfurreducens and G. metallireducens, are used for bioremediation and electricity generation from waste organic matter and renewable biomass. The constraint-based modeling approach enables the development of genome-scale in silico models that can predict the behavior of complex biological systems and their responses to the environments. Such a modeling approach was applied to provide physiological and ecological insights on the metabolism of G. metallireducens. Results The genome-scale metabolic model of G. metallireducens was constructed to include 747 genes and 697 reactions. Compared to the G. sulfurreducens model, the G. metallireducens metabolic model contains 118 unique reactions that reflect many of G. metallireducens' specific metabolic capabilities. Detailed examination of the G. metallireducens model suggests that its central metabolism contains several energy-inefficient reactions that are not present in the G. sulfurreducens model. Experimental biomass yield of G. metallireducens growing on pyruvate was lower than the predicted optimal biomass yield. Microarray data of G. metallireducens growing with benzoate and acetate indicated that genes encoding these energy-inefficient reactions were up-regulated by benzoate. These results suggested that the energy-inefficient reactions were likely turned off during G. metallireducens growth with acetate for optimal biomass yield, but were up-regulated during growth with complex electron donors such as benzoate for rapid energy generation. Furthermore, several computational modeling approaches were applied to accelerate G. metallireducens research. For example, growth of G. metallireducens with different electron donors and electron acceptors were studied using the genome

  13. Phylogeny and evolution of Cervidae based on complete mitochondrial genomes.

    Science.gov (United States)

    Zhang, W-Q; Zhang, M-H

    2012-03-14

    Mitochondrial DNA sequences can be used to estimate phylogenetic relationships among animal taxa and for molecular phylogenetic evolution analysis. With the development of sequencing technology, more and more mitochondrial sequences have been made available in public databases, including whole mitochondrial DNA sequences. These data have been used for phylogenetic analysis of animal species, and for studies of evolutionary processes. We made phylogenetic analyses of 19 species of Cervidae, with Bos taurus as the outgroup. We used neighbor joining, maximum likelihood, maximum parsimony, and Bayesian inference methods on whole mitochondrial genome sequences. The consensus phylogenetic trees supported monophyly of the family Cervidae; it was divided into two subfamilies, Plesiometacarpalia and Telemetacarpalia, and four tribes, Cervinae, Muntiacinae, Hydropotinae, and Odocoileinae. The divergence times in these families were estimated by phylogenetic analysis using the Bayesian method with a relaxed molecular clock method; the results were consistent with those of previous studies. We concluded that the evolutionary structure of the family Cervidae can be reconstructed by phylogenetic analysis based on whole mitochondrial genomes; this method could be used broadly in phylogenetic evolutionary analysis of animal taxa.

  14. Biosurveillance enterprise for operational awareness, a genomic-based approach for tracking pathogen virulence.

    Science.gov (United States)

    Valdivia-Granda, Willy A

    2013-11-15

    To protect our civilians and warfighters against both known and unknown pathogens, biodefense stakeholders must be able to foresee possible technological trends that could affect their threat risk assessment. However, significant flaws in how we prioritize our countermeasure-needs continue to limit their development. As recombinant biotechnology becomes increasingly simplified and inexpensive, small groups, and even individuals, can now achieve the design, synthesis, and production of pathogenic organisms for offensive purposes. Under these daunting circumstances, a reliable biosurveillance approach that supports a diversity of users could better provide early warnings about the emergence of new pathogens (both natural and manmade), reverse engineer pathogens carrying traits to avoid available countermeasures, and suggest the most appropriate detection, prophylactic, and therapeutic solutions. While impressive in data mining capabilities, real-time content analysis of social media data misses much of the complexity in the factual reality. Quality issues within freeform user-provided hashtags and biased referencing can significantly undermine our confidence in the information obtained to make critical decisions about the natural vs. intentional emergence of a pathogen. At the same time, errors in pathogen genomic records, the narrow scope of most databases, and the lack of standards and interoperability across different detection and diagnostic devices, continue to restrict the multidimensional biothreat assessment. The fragmentation of our biosurveillance efforts into different approaches has stultified attempts to implement any new foundational enterprise that is more reliable, more realistic and that avoids the scenario of the warning that comes too late. This discussion focus on the development of genomic-based decentralized medical intelligence and laboratory system to track emerging and novel microbial health threats in both military and civilian settings and

  15. Identification of Ohnolog Genes Originating from Whole Genome Duplication in Early Vertebrates, Based on Synteny Comparison across Multiple Genomes.

    Science.gov (United States)

    Singh, Param Priya; Arora, Jatin; Isambert, Hervé

    2015-07-01

    Whole genome duplications (WGD) have now been firmly established in all major eukaryotic kingdoms. In particular, all vertebrates descend from two rounds of WGDs, that occurred in their jawless ancestor some 500 MY ago. Paralogs retained from WGD, also coined 'ohnologs' after Susumu Ohno, have been shown to be typically associated with development, signaling and gene regulation. Ohnologs, which amount to about 20 to 35% of genes in the human genome, have also been shown to be prone to dominant deleterious mutations and frequently implicated in cancer and genetic diseases. Hence, identifying ohnologs is central to better understand the evolution of vertebrates and their susceptibility to genetic diseases. Early computational analyses to identify vertebrate ohnologs relied on content-based synteny comparisons between the human genome and a single invertebrate outgroup genome or within the human genome itself. These approaches are thus limited by lineage specific rearrangements in individual genomes. We report, in this study, the identification of vertebrate ohnologs based on the quantitative assessment and integration of synteny conservation between six amniote vertebrates and six invertebrate outgroups. Such a synteny comparison across multiple genomes is shown to enhance the statistical power of ohnolog identification in vertebrates compared to earlier approaches, by overcoming lineage specific genome rearrangements. Ohnolog gene families can be browsed and downloaded for three statistical confidence levels or recompiled for specific, user-defined, significance criteria at http://ohnologs.curie.fr/. In the light of the importance of WGD on the genetic makeup of vertebrates, our analysis provides a useful resource for researchers interested in gaining further insights on vertebrate evolution and genetic diseases.

  16. Identification of Ohnolog Genes Originating from Whole Genome Duplication in Early Vertebrates, Based on Synteny Comparison across Multiple Genomes.

    Directory of Open Access Journals (Sweden)

    Param Priya Singh

    2015-07-01

    Full Text Available Whole genome duplications (WGD have now been firmly established in all major eukaryotic kingdoms. In particular, all vertebrates descend from two rounds of WGDs, that occurred in their jawless ancestor some 500 MY ago. Paralogs retained from WGD, also coined 'ohnologs' after Susumu Ohno, have been shown to be typically associated with development, signaling and gene regulation. Ohnologs, which amount to about 20 to 35% of genes in the human genome, have also been shown to be prone to dominant deleterious mutations and frequently implicated in cancer and genetic diseases. Hence, identifying ohnologs is central to better understand the evolution of vertebrates and their susceptibility to genetic diseases. Early computational analyses to identify vertebrate ohnologs relied on content-based synteny comparisons between the human genome and a single invertebrate outgroup genome or within the human genome itself. These approaches are thus limited by lineage specific rearrangements in individual genomes. We report, in this study, the identification of vertebrate ohnologs based on the quantitative assessment and integration of synteny conservation between six amniote vertebrates and six invertebrate outgroups. Such a synteny comparison across multiple genomes is shown to enhance the statistical power of ohnolog identification in vertebrates compared to earlier approaches, by overcoming lineage specific genome rearrangements. Ohnolog gene families can be browsed and downloaded for three statistical confidence levels or recompiled for specific, user-defined, significance criteria at http://ohnologs.curie.fr/. In the light of the importance of WGD on the genetic makeup of vertebrates, our analysis provides a useful resource for researchers interested in gaining further insights on vertebrate evolution and genetic diseases.

  17. An acquisition account of genomic islands based on genome signature comparisons

    Directory of Open Access Journals (Sweden)

    Luyf ACM

    2005-11-01

    Full Text Available Abstract Background Recent analyses of prokaryotic genome sequences have demonstrated the important force horizontal gene transfer constitutes in genome evolution. Horizontally acquired sequences are detectable by, among others, their dinucleotide composition (genome signature dissimilarity with the host genome. Genomic islands (GIs comprise important and interesting horizontally transferred sequences, but information about acquisition events or relatedness between GIs is scarce. In Vibrio vulnificus CMCP6, 10 and 11 GIs have previously been identified in the sequenced chromosomes I and II, respectively. We assessed the compositional similarity and putative acquisition account of these GIs using the genome signature. For this analysis we developed a new algorithm, available as a web application. Results Of 21 GIs, VvI-1 and VvI-10 of chromosome I have similar genome signatures, and while artificially divided due to a linear annotation, they are adjacent on the circular chromosome and therefore comprise one GI. Similarly, GIs VvI-3 and VvI-4 of chromosome I together with the region between these two islands are compositionally similar, suggesting that they form one GI (making a total of 19 GIs in chromosome I + chromosome II. Cluster analysis assigned the 19 GIs to 11 different branches above our conservative threshold. This suggests a limited number of compositionally similar donors or intragenomic dispersion of ancestral acquisitions. Furthermore, 2 GIs of chromosome II cluster with chromosome I, while none of the 19 GIs group with chromosome II, suggesting an unidirectional dispersal of large anomalous gene clusters from chromosome I to chromosome II. Conclusion From the results, we infer 10 compositionally dissimilar donors for 19 GIs in the V. vulnificus CMCP6 genome, including chromosome I donating to chromosome II. This suggests multiple transfer events from individual donor types or from donors with similar genome signatures. Applied to

  18. Annotation-Based Whole Genomic Prediction and Selection

    DEFF Research Database (Denmark)

    Kadarmideen, Haja; Do, Duy Ngoc; Janss, Luc

    Cπ method and applied to 1,272 Duroc pigs with both genotypic and phenotypic records including residual (RFI) and daily feed intake (DFI), average daily gain (ADG) and back fat (BF)). Records were split into a training (968 pigs) and a validation dataset (304 pigs). SNPs were annotated by 14 different...... groups. Genomic prediction has accuracy comparable to an own phenotype and use of genomic prediction can be cost effective by replacing feed intake measurement. Use of genomic annotation of SNPs and QTL information had no largely significant impact on predictive accuracy for the current traits but may...... in their contribution to estimated genomic variances and in prediction of genomic breeding values by applying SNP annotation approaches to feed efficiency. Ensembl Variant Predictor (EVP) and Pig QTL database were used as the source of genomic annotation for 60K chip. Genomic prediction was performed using the Bayes...

  19. GAMOLA2, a Comprehensive Software Package for the Annotation and Curation of Draft and Complete Microbial Genomes.

    Science.gov (United States)

    Altermann, Eric; Lu, Jingli; McCulloch, Alan

    2017-01-01

    Expert curated annotation remains one of the critical steps in achieving a reliable biological relevant annotation. Here we announce the release of GAMOLA2, a user friendly and comprehensive software package to process, annotate and curate draft and complete bacterial, archaeal, and viral genomes. GAMOLA2 represents a wrapping tool to combine gene model determination, functional Blast, COG, Pfam, and TIGRfam analyses with structural predictions including detection of tRNAs, rRNA genes, non-coding RNAs, signal protein cleavage sites, transmembrane helices, CRISPR repeats and vector sequence contaminations. GAMOLA2 has already been validated in a wide range of bacterial and archaeal genomes, and its modular concept allows easy addition of further functionality in future releases. A modified and adapted version of the Artemis Genome Viewer (Sanger Institute) has been developed to leverage the additional features and underlying information provided by the GAMOLA2 analysis, and is part of the software distribution. In addition to genome annotations, GAMOLA2 features, among others, supplemental modules that assist in the creation of custom Blast databases, annotation transfers between genome versions, and the preparation of Genbank files for submission via the NCBI Sequin tool. GAMOLA2 is intended to be run under a Linux environment, whereas the subsequent visualization and manual curation in Artemis is mobile and platform independent. The development of GAMOLA2 is ongoing and community driven. New functionality can easily be added upon user requests, ensuring that GAMOLA2 provides information relevant to microbiologists. The software is available free of charge for academic use.

  20. Genome-scale comparison and constraint-based metabolic reconstruction of the facultative anaerobic Fe(III-reducer Rhodoferax ferrireducens

    Directory of Open Access Journals (Sweden)

    Daugherty Sean

    2009-09-01

    Full Text Available Abstract Background Rhodoferax ferrireducens is a metabolically versatile, Fe(III-reducing, subsurface microorganism that is likely to play an important role in the carbon and metal cycles in the subsurface. It also has the unique ability to convert sugars to electricity, oxidizing the sugars to carbon dioxide with quantitative electron transfer to graphite electrodes in microbial fuel cells. In order to expand our limited knowledge about R. ferrireducens, the complete genome sequence of this organism was further annotated and then the physiology of R. ferrireducens was investigated with a constraint-based, genome-scale in silico metabolic model and laboratory studies. Results The iterative modeling and experimental approach unveiled exciting, previously unknown physiological features, including an expanded range of substrates that support growth, such as cellobiose and citrate, and provided additional insights into important features such as the stoichiometry of the electron transport chain and the ability to grow via fumarate dismutation. Further analysis explained why R. ferrireducens is unable to grow via photosynthesis or fermentation of sugars like other members of this genus and uncovered novel genes for benzoate metabolism. The genome also revealed that R. ferrireducens is well-adapted for growth in the subsurface because it appears to be capable of dealing with a number of environmental insults, including heavy metals, aromatic compounds, nutrient limitation and oxidative stress. Conclusion This study demonstrates that combining genome-scale modeling with the annotation of a new genome sequence can guide experimental studies and accelerate the understanding of the physiology of under-studied yet environmentally relevant microorganisms.

  1. Enabling the democratization of the genomics revolution with a fully integrated web-based bioinformatics platform

    Science.gov (United States)

    Li, Po-E; Lo, Chien-Chi; Anderson, Joseph J.; Davenport, Karen W.; Bishop-Lilly, Kimberly A.; Xu, Yan; Ahmed, Sanaa; Feng, Shihai; Mokashi, Vishwesh P.; Chain, Patrick S.G.

    2017-01-01

    Continued advancements in sequencing technologies have fueled the development of new sequencing applications and promise to flood current databases with raw data. A number of factors prevent the seamless and easy use of these data, including the breadth of project goals, the wide array of tools that individually perform fractions of any given analysis, the large number of associated software/hardware dependencies, and the detailed expertise required to perform these analyses. To address these issues, we have developed an intuitive web-based environment with a wide assortment of integrated and cutting-edge bioinformatics tools in pre-configured workflows. These workflows, coupled with the ease of use of the environment, provide even novice next-generation sequencing users with the ability to perform many complex analyses with only a few mouse clicks and, within the context of the same environment, to visualize and further interrogate their results. This bioinformatics platform is an initial attempt at Empowering the Development of Genomics Expertise (EDGE) in a wide range of applications for microbial research. PMID:27899609

  2. Localized electron transfer rates and microelectrode-based enrichment of microbial communities within a phototrophic microbial mat

    Energy Technology Data Exchange (ETDEWEB)

    Babauta, Jerome T. [Washington State Univ., Pullman, WA (United States); Atci, Erhan [Washington State Univ., Pullman, WA (United States); Ha, Phuc T. [Washington State Univ., Pullman, WA (United States); Lindemann, Stephen R. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Ewing, Timothy [Washington State Univ., Pullman, WA (United States); Call, Douglas R. [Washington State Univ., Pullman, WA (United States); Fredrickson, James K. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Beyenal, Haluk [Washington State Univ., Pullman, WA (United States)

    2014-01-01

    Phototrophic microbial mats frequently exhibit sharp, light-dependent redox gradients that regulate microbial respiration on specific electron acceptors as a function of depth. In this work, a benthic phototrophic microbial mat from Hot Lake, a hypersaline, epsomitic lake located near Oroville in north-central Washington, was used to develop a microscale electrochemical method to study local electron transfer processes within the mat. To characterize the physicochemical variables influencing electron transfer, we initially quantified redox potential, pH, and dissolved oxygen gradients by depth in the mat under photic and aphotic conditions. We further demonstrated that power output of a mat fuel cell was light-dependent. To study local electron transfer processes, we deployed a microscale electrode (microelectrode) with tip size ~20 μm. To enrich a subset of microorganisms capable of interacting with the microelectrode, we anodically polarized the microelectrode at depth in the mat. Subsequently, to characterize the microelectrode- associated community and compare it to the neighboring mat community, we performed amplicon sequencing of the V1-V3 region of the 16S gene. Differences in Bray-Curtis beta diversity, illustrated by large changes in relative abundance at the phylum level, suggested successful enrichment of specific mat community members on the microelectrode surface. The microelectrode-associated community exhibited substantially reduced alpha diversity and elevated relative abundances of Prosthecochloris, Loktanella, Catellibacterium, other unclassified members of Rhodobacteraceae, Thiomicrospira, and Limnobacter, compared with the community at an equivalent depth in the mat. Our results suggest that local electron transfer to an anodically polarized microelectrode selected for a specific microbial population, with substantially more abundance and diversity of sulfur-oxidizing phylotypes compared with the neighboring mat community.

  3. ArraySearch: A Web-Based Genomic Search Engine.

    Science.gov (United States)

    Wilson, Tyler J; Ge, Steven X

    2012-01-01

    Recent advances in microarray technologies have resulted in a flood of genomics data. This large body of accumulated data could be used as a knowledge base to help researchers interpret new experimental data. ArraySearch finds statistical correlations between newly observed gene expression profiles and the huge source of well-characterized expression signatures deposited in the public domain. A search query of a list of genes will return experiments on which the genes are significantly up- or downregulated collectively. Searches can also be conducted using gene expression signatures from new experiments. This resource will empower biological researchers with a statistical method to explore expression data from their own research by comparing it with expression signatures from a large public archive.

  4. Ascaris phylogeny based on multiple whole mtDNA genomes

    DEFF Research Database (Denmark)

    Nejsum, Peter; Hawash, Mohamed B F; Betson, Martha

    2016-01-01

    Ascaris lumbricoides and A. suum are two parasitic nematodes infecting humans and pigs, respectively. There has been considerable debate as to whether Ascaris in the two hosts should be considered a single or two separate species. Previous studies identified at least three major clusters (A, B...... and C) of human and pig Ascaris based on partial cox1 sequences. In the present study, we selected major haplotypes from these different clusters to characterize their whole mitochondrial genomes for phylogenetic analysis. We also undertook coalescent simulations to investigate the evolutionary history...... events: the first one occurring early in the Neolithic period which resulted in a differentiated population of Ascaris in pigs (cluster C), the second occurring more recently (~ 900 generations ago), resulting in clusters A and B which might have been spread worldwide by human activities....

  5. FN-Identify: Novel Restriction Enzymes-Based Method for Bacterial Identification in Absence of Genome Sequencing.

    Science.gov (United States)

    Awad, Mohamed; Ouda, Osama; El-Refy, Ali; El-Feky, Fawzy A; Mosa, Kareem A; Helmy, Mohamed

    2015-01-01

    Sequencing and restriction analysis of genes like 16S rRNA and HSP60 are intensively used for molecular identification in the microbial communities. With aid of the rapid progress in bioinformatics, genome sequencing became the method of choice for bacterial identification. However, the genome sequencing technology is still out of reach in the developing countries. In this paper, we propose FN-Identify, a sequencing-free method for bacterial identification. FN-Identify exploits the gene sequences data available in GenBank and other databases and the two algorithms that we developed, CreateScheme and GeneIdentify, to create a restriction enzyme-based identification scheme. FN-Identify was tested using three different and diverse bacterial populations (members of Lactobacillus, Pseudomonas, and Mycobacterium groups) in an in silico analysis using restriction enzymes and sequences of 16S rRNA gene. The analysis of the restriction maps of the members of three groups using the fragment numbers information only or along with fragments sizes successfully identified all of the members of the three groups using a minimum of four and maximum of eight restriction enzymes. Our results demonstrate the utility and accuracy of FN-Identify method and its two algorithms as an alternative method that uses the standard microbiology laboratories techniques when the genome sequencing is not available.

  6. FN-Identify: Novel Restriction Enzymes-Based Method for Bacterial Identification in Absence of Genome Sequencing

    Science.gov (United States)

    Ouda, Osama; El-Refy, Ali; El-Feky, Fawzy A.; Mosa, Kareem A.

    2015-01-01

    Sequencing and restriction analysis of genes like 16S rRNA and HSP60 are intensively used for molecular identification in the microbial communities. With aid of the rapid progress in bioinformatics, genome sequencing became the method of choice for bacterial identification. However, the genome sequencing technology is still out of reach in the developing countries. In this paper, we propose FN-Identify, a sequencing-free method for bacterial identification. FN-Identify exploits the gene sequences data available in GenBank and other databases and the two algorithms that we developed, CreateScheme and GeneIdentify, to create a restriction enzyme-based identification scheme. FN-Identify was tested using three different and diverse bacterial populations (members of Lactobacillus, Pseudomonas, and Mycobacterium groups) in an in silico analysis using restriction enzymes and sequences of 16S rRNA gene. The analysis of the restriction maps of the members of three groups using the fragment numbers information only or along with fragments sizes successfully identified all of the members of the three groups using a minimum of four and maximum of eight restriction enzymes. Our results demonstrate the utility and accuracy of FN-Identify method and its two algorithms as an alternative method that uses the standard microbiology laboratories techniques when the genome sequencing is not available. PMID:26880910

  7. The genomics revolution and its effect on water quality

    Science.gov (United States)

    Genomic-based molecular tools are emerging as powerful laboratory methods for assessing water quality characteristics and improving our ability to assess the human health risks posed by microbial contaminants in drinking water. To a great extent, this revolution in genomics-rese...

  8. The genomics revolution and its effect on water quality

    Science.gov (United States)

    Genomic-based molecular tools are emerging as powerful laboratory methods for assessing water quality characteristics and improving our ability to assess the human health risks posed by microbial contaminants in drinking water. To a great extent, this revolution in genomics-rese...

  9. Microbial community structure in fermentation process of Shaoxing rice wine by Illumina-based metagenomic sequencing.

    Science.gov (United States)

    Xie, Guangfa; Wang, Lan; Gao, Qikang; Yu, Wenjing; Hong, Xutao; Zhao, Lingyun; Zou, Huijun

    2013-09-01

    To understand the role of the community structure of microbes in the environment in the fermentation of Shaoxing rice wine, samples collected from a wine factory were subjected to Illumina-based metagenomic sequencing. De novo assembly of the sequencing reads allowed the characterisation of more than 23 thousand microbial genes derived from 1.7 and 1.88 Gbp of sequences from two samples fermented for 5 and 30 days respectively. The microbial community structure at different fermentation times of Shaoxing rice wine was revealed, showing the different roles of the microbiota in the fermentation process of Shaoxing rice wine. The gene function of both samples was also studied in the COG database, with most genes belonging to category S (function unknown), category E (amino acid transport and metabolism) and unclassified group. The results show that both the microbial community structure and gene function composition change greatly at different time points of Shaoxing rice wine fermentation. © 2013 Society of Chemical Industry.

  10. A pyrosequencing-based analysis of microbial diversity governed by ecological conditions in the Winogradsky column.

    Science.gov (United States)

    Abbasian, Firouz; Lockington, Robin; Mallavarapu, Megharaj; Naidu, Ravi

    2015-07-01

    The Winogradsky column is used as a microcosm to mimic both the microbial diversity and the ecological relationships between the organisms in lake sediments. In this study, a pyrosequencing approach was used to obtain a more complete list of the microbial organisms present in such columns and their ratios in different layers of this microcosm. Overall, 27 different phyla in these columns were detected in these columns, most (20 phyla) belonged to bacteria. Based on this study, Proteobacteria (mostly Sphingomonadales), Cyanobacteria (mostly Oscillatoriales) and Bacteroidetes (mostly Flavobacteriales) were the dominant microorganisms in the water, middle, and bottom layers of this column, respectively. Although the majority of organism in the water layer were photoautotrophic organisms, the ratio of the phototrophic organisms decreased in the lower layers, replaced by chemoheterotrophic bacteria. Furthermore, the proportion of aerobic chemoheterotrophic bacteria was greater in the higher layers of the column in comparison to the bottom. The green and purple sulfur phototrophic bacteria inhabited the bottom and middle of these columns, with none of them found in the water layer. Although the sulfur oxidizing bacteria were the dominant chemolithotrophic bacteria in the water layer, their ratio decreases in lower layers, being replaced with nitrogen oxidizing bacteria in the middle and bottom layers. Overall, the microbial population of these layers changes from a phototrophic and aerobic chemoheterotrophic organisms in the water layer to a mostly anaerobic chemoheterotrophic population of bacteria in the bottom layers.

  11. Synthesis, characterization, anti-microbial, DNA binding and cleavage studies of Schiff base metal complexes

    Directory of Open Access Journals (Sweden)

    Poomalai Jayaseelan

    2016-09-01

    Full Text Available A novel Schiff base ligand has been prepared by the condensation between butanedione monoxime with 3,3′-diaminobenzidine. The ligand and metal complexes have been characterized by elemental analysis, UV, IR, 1H NMR, conductivity measurements, EPR and magnetic studies. The molar conductance studies of Cu(II, Ni(II, Co(II and Mn(II complexes showed non-electrolyte in nature. The ligand acts as dibasic with two N4-tetradentate sites and can coordinate with two metal ions to form binuclear complexes. The spectroscopic data of metal complexes indicated that the metal ions are complexed with azomethine nitrogen and oxyimino nitrogen atoms. The binuclear metal complexes exhibit octahedral arrangements. DNA binding properties of copper(II metal complex have been investigated by electronic absorption spectroscopy. Results suggest that the copper(II complex bind to DNA via an intercalation binding mode. The nucleolytic cleavage activities of the ligand and their complexes were assayed on CT-DNA using gel electrophoresis in the presence and absence of H2O2. The ligand showed increased nuclease activity when administered as copper complex and copper(II complex behave as efficient chemical nucleases with hydrogen peroxide activation. The anti-microbial activities and thermal studies have also been studied. In anti-microbial activity all complexes showed good anti-microbial activity higher than ligand against gram positive, gram negative bacteria and fungi.

  12. Microbial enzymes for the recycling of recalcitrant petroleum-based plastics: how far are we?

    Science.gov (United States)

    Wei, Ren; Zimmermann, Wolfgang

    2017-03-28

    Petroleum-based plastics have replaced many natural materials in their former applications. With their excellent properties, they have found widespread uses in almost every area of human life. However, the high recalcitrance of many synthetic plastics results in their long persistence in the environment, and the growing amount of plastic waste ending up in landfills and in the oceans has become a global concern. In recent years, a number of microbial enzymes capable of modifying or degrading recalcitrant synthetic polymers have been identified. They are emerging as candidates for the development of biocatalytic plastic recycling processes, by which valuable raw materials can be recovered in an environmentally sustainable way. This review is focused on microbial biocatalysts involved in the degradation of the synthetic plastics polyethylene, polystyrene, polyurethane and polyethylene terephthalate (PET). Recent progress in the application of polyester hydrolases for the recovery of PET building blocks and challenges for the application of these enzymes in alternative plastic waste recycling processes will be discussed. © 2017 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.

  13. Dynamic relationships between microbial biomass, respiration, inorganic nutrients and enzyme activities: informing enzyme based decomposition models

    Directory of Open Access Journals (Sweden)

    Daryl L Moorhead

    2013-08-01

    Full Text Available We re-examined data from a recent litter decay study to determine if additional insights could be gained to inform decomposition modeling. Rinkes et al. (2013 conducted 14-day laboratory incubations of sugar maple (Acer saccharum or white oak (Quercus alba leaves, mixed with sand (0.4% organic C content or loam (4.1% organic C. They measured microbial biomass C, carbon dioxide efflux, soil ammonium, nitrate, and phosphate concentrations, and β-glucosidase (BG, β-N-acetyl-glucosaminidase (NAG, and acid phosphatase (AP activities on days 1, 3, and 14. Analyses of relationships among variables yielded different insights than original analyses of individual variables. For example, although respiration rates per g soil were higher for loam than sand, rates per g soil C were actually higher for sand than loam, and rates per g microbial C showed little difference between treatments. Microbial biomass C peaked on day 3 when biomass-specific activities of enzymes were lowest, suggesting uptake of litter C without extracellular hydrolysis. This result refuted a common model assumption that all enzyme production is constitutive and thus proportional to biomass, and/or indicated that part of litter decay is independent of enzyme activity. The length and angle of vectors defined by ratios of enzyme activities (BG/NAG versus BG/AP represent relative microbial investments in C (length, and N and P (angle acquiring enzymes. Shorter lengths on day 3 suggested low C limitation, whereas greater lengths on day 14 suggested an increase in C limitation with decay. The soils and litter in this study generally had stronger P limitation (angles > 45˚. Reductions in vector angles to < 45˚ for sand by day 14 suggested a shift to N limitation. These relational variables inform enzyme-based models, and are usually much less ambiguous when obtained from a single study in which measurements were made on the same samples than when extrapolated from separate studies.

  14. Analysis Of Segmental Duplications In The Pig Genome Based On Next-Generation Sequencing

    DEFF Research Database (Denmark)

    Fadista, João; Bendixen, Christian

    extensively studied in other organisms, its analysis in pig has been hampered by the lack of a complete pig genome assembly. By measuring the depth of coverage of Illumina whole-genome shotgun sequencing reads of the Tabasco animal aligned to the latest pig genome assembly (Sus scrofa 10 – based also...... on Tabasco), led us to the detection of a high-resolution map of segmental duplications in the pig genome. Comparing these segments with four other Duroc animals sequenced at our institute, supplied the resources needed to describe the first genome-wide and systematic analysis of segmental duplications...

  15. GeNemo: a search engine for web-based functional genomic data

    OpenAIRE

    Zhang, Yongqing; Cao, Xiaoyi; Zhong, Sheng

    2016-01-01

    A set of new data types emerged from functional genomic assays, including ChIP-seq, DNase-seq, FAIRE-seq and others. The results are typically stored as genome-wide intensities (WIG/bigWig files) or functional genomic regions (peak/BED files). These data types present new challenges to big data science. Here, we present GeNemo, a web-based search engine for functional genomic data. GeNemo searches user-input data against online functional genomic datasets, including the entire collection of E...

  16. Global Microbial Identifier

    DEFF Research Database (Denmark)

    Wielinga, Peter; Hendriksen, Rene S.; Aarestrup, Frank Møller

    2017-01-01

    microbial identifier (GMI) initiative, aims to build a database of whole microbial genome sequencing data linked to relevant metadata, which can be used to identify microorganisms, their communities and the diseases they cause. It would be a platform for storing whole genome sequencing (WGS) data......) will likely also enable a much better understanding of the pathogenesis of the infection and the molecular basis of the host response to infection. But the full potential of these advances will only transpire if the data in this area become transferable and thereby comparable, preferably in open......-source systems. There is therefore an obvious need to develop a global system of whole microbial genome databases to aggregate, share, mine and use microbiological genomic data, to address global public health and clinical challenges, and most importantly to identify and diagnose infectious diseases. The global...

  17. Functional diversity of microbial communities in pristine aquifers inferred by PLFA- and sequencing-based approaches

    Science.gov (United States)

    Schwab, Valérie F.; Herrmann, Martina; Roth, Vanessa-Nina; Gleixner, Gerd; Lehmann, Robert; Pohnert, Georg; Trumbore, Susan; Küsel, Kirsten; Totsche, Kai U.

    2017-05-01

    Microorganisms in groundwater play an important role in aquifer biogeochemical cycles and water quality. However, the mechanisms linking the functional diversity of microbial populations and the groundwater physico-chemistry are still not well understood due to the complexity of interactions between surface and subsurface. Within the framework of Hainich (north-western Thuringia, central Germany) Critical Zone Exploratory of the Collaborative Research Centre AquaDiva, we used the relative abundances of phospholipid-derived fatty acids (PLFAs) to link specific biochemical markers within the microbial communities to the spatio-temporal changes of the groundwater physico-chemistry. The functional diversities of the microbial communities were mainly correlated with groundwater chemistry, including dissolved O2, Fet and NH4+ concentrations. Abundances of PLFAs derived from eukaryotes and potential nitrite-oxidizing bacteria (11Me16:0 as biomarker for Nitrospira moscoviensis) were high at sites with elevated O2 concentration where groundwater recharge supplies bioavailable substrates. In anoxic groundwaters more rich in Fet, PLFAs abundant in sulfate-reducing bacteria (SRB), iron-reducing bacteria and fungi increased with Fet and HCO3- concentrations, suggesting the occurrence of active iron reduction and the possible role of fungi in meditating iron solubilization and transport in those aquifer domains. In more NH4+-rich anoxic groundwaters, anammox bacteria and SRB-derived PLFAs increased with NH4+ concentration, further evidencing the dependence of the anammox process on ammonium concentration and potential links between SRB and anammox bacteria. Additional support of the PLFA-based bacterial communities was found in DNA- and RNA-based Illumina MiSeq amplicon sequencing of bacterial 16S rRNA genes, which showed high predominance of nitrite-oxidizing bacteria Nitrospira, e.g. Nitrospira moscoviensis, in oxic aquifer zones and of anammox bacteria in more NH4+-rich

  18. Phylogeography, salinity adaptations and metabolic potential of the Candidate Division KB1 Bacteria based on a partial single cell genome.

    Directory of Open Access Journals (Sweden)

    Lisa M Nigro

    2016-08-01

    Full Text Available Deep-sea hypersaline anoxic basins (DHABs and other hypersaline environments contain abundant and diverse microbial life that has adapted to these extreme conditions. The bacterial Candidate Division KB1 represents one of several uncultured groups that has been consistently observed in hypersaline microbial diversity studies. Here we report the phylogeography of KB1, its phylogenetic relationships to Candidate Division OP1 Bacteria, and its potential metabolic and osmotic stress adaptations based on a partial single cell amplified genome (SAG of KB1 from Orca Basin, the largest hypersaline seafloor brine basin in the Gulf of Mexico. Our results are consistent with the hypothesis – previously developed based on 14C incorporation experiments with mixed-species enrichments from Mediterranean seafloor brines - that KB1 has adapted its proteins to elevated intracellular salinity, but at the same time KB1 apparently imports glycine betaine; this compatible solute is potentially not limited to osmoregulation but could also serve as a carbon and energy source.

  19. Technology-Driven and Evidence-Based Genomic Analysis for Integrated Pediatric and Prenatal Genetics Evaluation

    Institute of Scientific and Technical Information of China (English)

    Yuan Wei; Fang Xu; Peining Li

    2013-01-01

    The first decade since the completion of the Human Genome Project has been marked with rapid development of genomic technologies and their immediate clinical applications.Genomic analysis using oligonucleotide array comparative genomic hybridization (aCGH) or single nucleotide polymorphism (SNP) chips has been applied to pediatric patients with developmental and intellectual disabilities (DD/ID),multiple congenital anomalies (MCA) and autistic spectrum disorders (ASD).Evaluation of analytical and clinical validities of aCGH showed > 99% sensitivity and specificity and increased analytical resolution by higher density probe coverage.Reviews of case series,multi-center comparison and large patient-control studies demonstrated a diagnostic yield of 12%-20%; approximately 60% of these abnormalities were recurrent genomic disorders.This pediatric experience has been extended toward prenatal diagnosis.A series of reports indicated approximately 10% of pregnancies with ultrasound-detected structural anomalies and normal cytogenetic findings had genomic abnormalities,and 30% of these abnormalities were syndromic genomic disorders.Evidence-based practice guidelines and standards for implementing genomic analysis and web-delivered knowledge resources for interpreting genomic findings have been established.The progress from this technology-driven and evidence-based genomic analysis provides not only opportunities to dissect disease-causing mechanisms and develop rational therapeutic interventions but also important lessons for integrating genomic sequencing into pediatric and prenatal genetic evaluation.

  20. Development of a Species-specific PCR Assay for Three Xanthomonas Species, Causing Bulb and Flower Diseases, Based on Their Genome Sequences

    Directory of Open Access Journals (Sweden)

    Chang-Gi Back

    2015-09-01

    Full Text Available In this study, we developed a species-specific PCR assay for rapid and accurate detection of three Xanthomonas species, X. axonopodis pv. poinsettiicola (XAP, X. hyacinthi (XH and X. campestris pv. zantedeschiae (XCZ, based on their draft genome sequences. XAP, XH and XCZ genomes consist of single chromosomes that contain 5,221, 4,395 and 7,986 protein coding genes, respectively. Species-specific primers were designed from variable regions of the draft genome sequence data and assessed by a PCR-based detection method. These primers were also tested for specificity against 17 allied Xanthomonas species as well as against the host DNA and the microbial community of the host surface. Three primer sets were found to be very specific and no amplification product was obtained with the host DNA and the microbial community of the host surface. In addition, a detection limit of 1 pg/μl per PCR reaction was detected when these primer sets were used to amplify corresponding bacterial DNAs. Therefore, these primer sets and the developed species-specific PCR assay represent a valuable, sensitive, and rapid diagnostic tool that can be used to detect three specific pathogens at early stages of infection and may help control diseases.

  1. The Microbiology of Subsurface, Salt-Based Nuclear Waste Repositories: Using Microbial Ecology, Bioenergetics, and Projected Conditions to Help Predict Microbial Effects on Repository Performance

    Energy Technology Data Exchange (ETDEWEB)

    Swanson, Juliet S. [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Cherkouk, Andrea [Helmholtz-Zentrum Dresden-Rossendorf, Rossendorf (Germany); Arnold, Thuro [Helmholtz-Zentrum Dresden-Rossendorf, Rossendorf (Germany); Meleshyn, Artur [Gesellschaft fur Anlagen und Reaktorsicherheit, Braunschweig (Germany); Reed, Donald T. [Los Alamos National Lab. (LANL), Los Alamos, NM (United States)

    2016-11-17

    This report summarizes the potential role of microorganisms in salt-based nuclear waste repositories using available information on the microbial ecology of hypersaline environments, the bioenergetics of survival under high ionic strength conditions, and “repository microbiology” related studies. In areas where microbial activity is in question, there may be a need to shift the research focus toward feasibility studies rather than studies that generate actual input for performance assessments. In areas where activity is not necessary to affect performance (e.g., biocolloid transport), repository-relevant data should be generated. Both approaches will lend a realistic perspective to a safety case/performance scenario that will most likely underscore the conservative value of that case.

  2. GeNemo: a search engine for web-based functional genomic data.

    Science.gov (United States)

    Zhang, Yongqing; Cao, Xiaoyi; Zhong, Sheng

    2016-07-08

    A set of new data types emerged from functional genomic assays, including ChIP-seq, DNase-seq, FAIRE-seq and others. The results are typically stored as genome-wide intensities (WIG/bigWig files) or functional genomic regions (peak/BED files). These data types present new challenges to big data science. Here, we present GeNemo, a web-based search engine for functional genomic data. GeNemo searches user-input data against online functional genomic datasets, including the entire collection of ENCODE and mouse ENCODE datasets. Unlike text-based search engines, GeNemo's searches are based on pattern matching of functional genomic regions. This distinguishes GeNemo from text or DNA sequence searches. The user can input any complete or partial functional genomic dataset, for example, a binding intensity file (bigWig) or a peak file. GeNemo reports any genomic regions, ranging from hundred bases to hundred thousand bases, from any of the online ENCODE datasets that share similar functional (binding, modification, accessibility) patterns. This is enabled by a Markov Chain Monte Carlo-based maximization process, executed on up to 24 parallel computing threads. By clicking on a search result, the user can visually compare her/his data with the found datasets and navigate the identified genomic regions. GeNemo is available at www.genemo.org.

  3. Energy recovery from waste streams with microbial fuel cell (MFC)-based technologies

    Energy Technology Data Exchange (ETDEWEB)

    Zhang, Y.

    2012-09-15

    Microbial fuel cell (MFC)-based technologies are promising technologies for direct energy production from various wastewaters and waste streams. Beside electrical power production, more emphasis is recently devoted to alternative applications such as hydrogen production, bioremediation, seawater desalination, and biosensors. Although the technologies are promising, a number of hurdles need to be overcome before that field applications are economically feasible. The main purpose of this work was to improve the performance, reduce the construction cost, and expand the application scopes of MFC-based bio-electrochemical systems. To reduce the energy cost in nitrogen removal and during the same process achieve phosphorus elimination, a sediment-type photomicrobial fuel cell was developed based on the cooperation between microalgae (Chlorella vulgaris) and electrochemically active bacteria. The main removal mechanism of nitrogen and phosphorus was algae biomass uptake, while nitrification and denitrification process contributed to part of nitrogen removal. The key factors such as algae concentration, COD/N ratios and photoperiod were systemically studied. A self-powered submersible microbial electrolysis cell was developed for in situ biohydrogen production from anaerobic reactors. The hydrogen production increased along with acetate and buffer concentration. The hydrogen production rate of 32.2 mL/L/d and yield of 1.43 mol-H2/mol-acetate were achieved. Alternate exchanging the function between the two cell units was found to be an effective approach to inhibit methanogens. A sensor, based on a submersible microbial fuel cell, was developed for in situ monitoring of microbial activity and biochemical oxygen demand in groundwater. Presence or absence of a biofilm on the anode was a decisive factor for the applicability of the sensor. Temperature, pH, conductivity and inorganic solid content were significantly affecting the sensitivity of the sensor. The sensor showed

  4. Genome-based versus gene-based theory of cancer: Possible implications for clinical practice

    Indian Academy of Sciences (India)

    Nataša Todorović-Raković

    2011-09-01

    The current state in oncology research indicates that the attempts to explain such complex process as cancerogenesis by a single or several genetic mutations were not successful enough. On the other hand, chromosomal/genomic instability – almost universal features of malignant tumours which influence a global pattern of gene expression and, subsequently, many oncogenic pathways – were often disregarded and considered nonessential to clinical application. However, a new arising field of system biology including ‘new forms’ of genome diversity such as copy number variations (CNV) and high-throughput oncogene mutation profiling now reveal all the complexity of cancer and provide the final explanation of the oncogenic pathways, based on stochastic (onco)genomic variation rather than on (onco)genic concepts.

  5. Evaluation of Microbial load and quality of milk & milk based dairy products

    Directory of Open Access Journals (Sweden)

    Suvra Das

    2015-06-01

    Full Text Available Milk has an importance as valuable & nutritious food product. So, it is essential to evaluate their microbiological quality before consumption. Our study was carried out to examine the microbial load and quality of milk & milk based products. Total 87 samples of 13 different types of milk & dairy foods were collected from different locations in Dhaka city, Bangladesh were taken to the laboratory and stored for analysis. Total viable counts, Total coliform count and fungus count was analyzed & average count of these parameters were compared to FDA microbiological standards to evaluate their quality.

  6. The FlyBase database of the Drosophila genome projects andcommunity literature

    Energy Technology Data Exchange (ETDEWEB)

    Gelbart, William; Bayraktaroglu, Leyla; Bettencourt, Brian; Campbell, Kathy; Crosby, Madeline; Emmert, David; Hradecky, Pavel; Huang,Yanmei; Letovsky, Stan; Matthews, Beverly; Russo, Susan; Schroeder,Andrew; Smutniak, Frank; Zhou, Pinglei; Zytkovicz, Mark; Ashburner,Michael; Drysdale, Rachel; de Grey, Aubrey; Foulger, Rebecca; Millburn,Gillian; Yamada, Chihiro; Kaufman, Thomas; Matthews, Kathy; Gilbert, Don; Grumbling, Gary; Strelets, Victor; Shemen, C.; Rubin, Gerald; Berman,Brian; Frise, Erwin; Gibson, Mark; Harris, Nomi; Kaminker, Josh; Lewis,Suzanna; Marshall, Brad; Misra, Sima; Mungall, Christopher; Prochnik,Simon; Richter, John; Smith, Christopher; Shu, ShengQiang; Tupy,Jonathan; Wiel, Colin

    2002-09-16

    FlyBase (http://flybase.bio.indiana.edu/) provides an integrated view of the fundamental genomic and genetic data on the major genetic model Drosophila melanogaster and related species. FlyBase has primary responsibility for the continual reannotation of the D.melanogaster genome. The ultimate goal of the reannotation effort is to decorate the euchromatic sequence of the genome with as much biological information as is available from the community and from the major genome project centers. A complete revision of the annotations of the now-finished euchromatic genomic sequence has been completed. There are many points of entry to the genome within FlyBase, most notably through maps, gene products and ontologies, structured phenotypic and gene expression data, and anatomy.

  7. A SNP based linkage map of the turkey genome reveals multiple intrachromosomal rearrangements between the Turkey and Chicken genomes

    Directory of Open Access Journals (Sweden)

    Vereijken Addie

    2010-11-01

    Full Text Available Abstract Background The turkey (Meleagris gallopavo is an important agricultural species that is the second largest contributor to the world's poultry meat production. The genomic resources of turkey provide turkey breeders with tools needed for the genetic improvement of commercial breeds of turkey for economically important traits. A linkage map of turkey is essential not only for the mapping of quantitative trait loci, but also as a framework to enable the assignment of sequence contigs to specific chromosomes. Comparative genomics with chicken provides insight into mechanisms of genome evolution and helps in identifying rare genomic events such as genomic rearrangements and duplications/deletions. Results Eighteen full sib families, comprising 1008 (35 F1 and 973 F2 birds, were genotyped for 775 single nucleotide polymorphisms (SNPs. Of the 775 SNPs, 570 were informative and used to construct a linkage map in turkey. The final map contains 531 markers in 28 linkage groups. The total genetic distance covered by these linkage groups is 2,324 centimorgans (cM with the largest linkage group (81 loci measuring 326 cM. Average marker interval for all markers across the 28 linkage groups is 4.6 cM. Comparative mapping of turkey and chicken revealed two inter-, and 57 intrachromosomal rearrangements between these two species. Conclusion Our turkey genetic map of 531 markers reveals a genome length of 2,324 cM. Our linkage map provides an improvement of previously published maps because of the more even distribution of the markers and because the map is completely based on SNP markers enabling easier and faster genotyping assays than the microsatellitemarkers used in previous linkage maps. Turkey and chicken are shown to have a highly conserved genomic structure with a relatively low number of inter-, and intrachromosomal rearrangements.

  8. Genomic analysis of plant chromosomes based on meiotic pairing

    Directory of Open Access Journals (Sweden)

    Lisete Chamma Davide

    2007-12-01

    Full Text Available This review presents the principles and applications of classical genomic analysis, with emphasis on plant breeding. The main mathematical models used to estimate the preferential chromosome pairing in diploid or polyploid, interspecific or intergenera hybrids are presented and discussed, with special reference to the applications and studies for the definition of genome relationships among species of the Poaceae family.

  9. A Genomics-Based Classification of Human Lung Tumors

    NARCIS (Netherlands)

    Seidel, Danila; Zander, Thomas; Heukamp, Lukas C.; Peifer, Martin; Bos, Marc; Fernandez-Cuesta, Lynnette; Leenders, Frauke; Lu, Xin; Ansen, Sascha; Gardizi, Masyar; Nguyen, Chau; Berg, Johannes; Russell, Prudence; Wainer, Zoe; Schildhaus, Hans-Ulrich; Rogers, Toni-Maree; Solomon, Benjamin; Pao, William; Carter, Scott L.; Getz, Gad; Hayes, D. Neil; Wilkerson, Matthew D.; Thunnissen, Erik; Travis, William D.; Perner, Sven; Wright, Gavin; Brambilla, Elisabeth; Buettner, Reinhard; Wolf, Juergen; Thomas, Roman; Gabler, Franziska; Wilkening, Ines; Mueller, Christian; Dahmen, Ilona; Menon, Roopika; Koenig, Katharina; Albus, Kerstin; Merkelbach-Bruse, Sabine; Fassunke, Jana; Schmitz, Katja; Kuenstlinger, Helen; Kleine, Michaela; Binot, Elke; Querings, Silvia; Altmueller, Janine; Boessmann, Ingelore; Nuemberg, Peter; Schneider, Peter; Bogus, Magdalena; Buettner, Reinhard; Perner, Sven; Russell, Prudence; Thunnissen, Erik; Travis, William D.; Brambilla, Elisabeth; Soltermann, Alex; Moch, Holger; Brustugun, Odd Terje; Solberg, Steinar; Lund-Iversen, Marius; Helland, Aslaug; Muley, Thomas; Hoffmann, Hans; Schnabel, Philipp A.; Chen, Yuan; Groen, Herman; Timens, Wim; Sietsma, Hannie; Clement, Joachim H.; Weder, Walter; Saenger, Joerg; Stoelben, Erich; Ludwig, Corinna; Engel-Riedel, Walburga; Smit, Egbert; Heideman, Danille A. M.; Snijders, Peter J. F.; Nogova, Lucia; Sos, Martin L.; Mattonet, Christian; Toepelt, Karin; Scheffler, Matthias; Goekkurt, Eray; Kappes, Rainer; Krueger, Stefan; Kambartel, Kato; Behringer, Dirk; Schulte, Wolfgang; Galetke, Wolfgang; Randerath, Winfried; Heldwein, Matthias; Schlesinger, Andreas; Serke, Monika; Hekmat, Khosro; Frank, Konrad F.; Schnell, Roland; Reiser, Marcel; Huenerlituerkoglu, Ali-Nuri; Schmitz, Stephan; Meffert, Lisa; Ko, Yon-Dschun; Litt-Lampe, Markus; Gerigk, Ulrich; Fricke, Rainer; Besse, Benjamin; Brambilla, Christian; Lantuejoul, Sylvie; Lorimier, Philippe; Moro-Sibilot, Denis; Cappuzzo, Federico; Ligorio, Claudia; Damiani, Stefania; Field, John K.; Hyde, Russell; Validire, Pierre; Girard, Philippe; Muscarella, Lucia A.; Fazio, Vito M.; Hallek, Michael; Soria, Jean-Charles; Carter, Scott L.; Getz, Gad; Hayes, D. Neil; Wilkerson, Matthew D.; Achter, Viktor; Lang, Ulrich; Seidel, Danila; Zander, Thomas; Heukamp, Lukas C.; Peifer, Martin; Bos, Marc; Pao, William; Travis, William D.; Brambilla, Elisabeth; Buettner, Reinhard; Wolf, Juergen; Thomas, Roman K.

    2013-01-01

    We characterized genome alterations in 1255 clinically annotated lung tumors of all histological subgroups to identify genetically defined and clinically relevant subtypes. More than 55% of all cases had at least one oncogenic genome alteration potentially amenable to specific therapeutic interventi

  10. Whole-genome sequence-based analysis of thyroid function

    DEFF Research Database (Denmark)

    Taylor, Peter N.; Porcu, Eleonora; Chew, Shelby

    2015-01-01

    Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N = 2,287). Using additional whole-genome seque...

  11. Energy recovery from waste streams with microbial fuel cell (MFC)-based technologies

    DEFF Research Database (Denmark)

    Zhang, Yifeng

    Microbial fuel cell (MFC)-based technologies are promising technologies for direct energy production from various wastewaters and waste streams. Beside electrical power production, more emphasis is recently devoted to alternative applications such as hydrogen production, bioremediation, seawater......-based bio-electrochemical systems. To reduce the energy cost in nitrogen removal and during the same process achieve phosphorus elimination, a sediment-type photomicrobial fuel cell was developed based on the cooperation between microalgae (Chlorella vulgaris) and electrochemically active bacteria. The main....... A sediment-type MFC based on two pieces of bioelectrodes was employed as a novel in situ applicable approach for nitrate/nitrite removal, as well as electricity production from eutrophic lakes. The nitrogen removal and power generation were limited by the DO level in the water and acetate level injected...

  12. CRISPR-Cas9 Based Engineering of Actinomycetal Genomes.

    Science.gov (United States)

    Tong, Yaojun; Charusanti, Pep; Zhang, Lixin; Weber, Tilmann; Lee, Sang Yup

    2015-09-18

    Bacteria of the order Actinomycetales are one of the most important sources of pharmacologically active and industrially relevant secondary metabolites. Unfortunately, many of them are still recalcitrant to genetic manipulation, which is a bottleneck for systematic metabolic engineering. To facilitate the genetic manipulation of actinomycetes, we developed a highly efficient CRISPR-Cas9 system to delete gene(s) or gene cluster(s), implement precise gene replacements, and reversibly control gene expression in actinomycetes. We demonstrate our system by targeting two genes, actIORF1 (SCO5087) and actVB (SCO5092), from the actinorhodin biosynthetic gene cluster in Streptomyces coelicolor A3(2). Our CRISPR-Cas9 system successfully inactivated the targeted genes. When no templates for homology-directed repair (HDR) were present, the site-specific DNA double-strand breaks (DSBs) introduced by Cas9 were repaired through the error-prone nonhomologous end joining (NHEJ) pathway, resulting in a library of deletions with variable sizes around the targeted sequence. If templates for HDR were provided at the same time, precise deletions of the targeted gene were observed with near 100% frequency. Moreover, we developed a system to efficiently and reversibly control expression of target genes, deemed CRISPRi, based on a catalytically dead variant of Cas9 (dCas9). The CRISPR-Cas9 based system described here comprises a powerful and broadly applicable set of tools to manipulate actinomycetal genomes.

  13. Recent advances in genome-based polyketide discovery.

    Science.gov (United States)

    Helfrich, Eric J N; Reiter, Silke; Piel, Jörn

    2014-10-01

    Polyketides are extraordinarily diverse secondary metabolites of great pharmacological value and with interesting ecological functions. The post-genomics era has led to fundamental changes in natural product research by inverting the workflow of secondary metabolite discovery. As opposed to traditional bioactivity-guided screenings, genome mining is an in silico method to screen and analyze sequenced genomes for natural product biosynthetic gene clusters. Since genes for known compounds can be recognized at the early computational stage, genome mining presents an opportunity for dereplication. This review highlights recent progress in bioinformatics, pathway engineering and chemical analytics to extract the biosynthetic secrets hidden in the genome of both well-known natural product sources as well as previously neglected bacteria.

  14. [Transcription activator-like effectors(TALEs)based genome engineering].

    Science.gov (United States)

    Zhao, Mei-Wei; Duan, Cheng-Li; Liu, Jiang

    2013-10-01

    Systematic reverse-engineering of functional genome architecture requires precise modifications of gene sequences and transcription levels. The development and application of transcription activator-like effectors(TALEs) has created a wealth of genome engineering possibilities. TALEs are a class of naturally occurring DNA-binding proteins found in the plant pathogen Xanthomonas species. The DNA-binding domain of each TALE typically consists of tandem 34-amino acid repeat modules rearranged according to a simple cipher to target new DNA sequences. Customized TALEs can be used for a wide variety of genome engineering applications, including transcriptional modulation and genome editing. Such "genome engineering" has now been established in human cells and a number of model organisms, thus opening the door to better understanding gene function in model organisms, improving traits in crop plants and treating human genetic disorders.

  15. WormBase: network access to the genome and biology of Caenorhabditis elegans.

    Science.gov (United States)

    Stein, L; Sternberg, P; Durbin, R; Thierry-Mieg, J; Spieth, J

    2001-01-01

    WormBase (http://www.wormbase.org) is a web-based resource for the Caenorhabditis elegans genome and its biology. It builds upon the existing ACeDB database of the C.elegans genome by providing data curation services, a significantly expanded range of subject areas and a user-friendly front end.

  16. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd

    Energy Technology Data Exchange (ETDEWEB)

    Fleischmann, R.D.; Adams, M.D.; White, O. [Institute for Genomic Research, Gaithersburg, MD (United States)] [and others

    1995-07-28

    An approach for genome analysis based on sequencing and assembly of unselected pieces of DNA from the whole chromosome has been applied to obtain the complete nucleotide sequence (1,830,137 base pairs) of the genome from the bacterium Haemophilus influenzae Rd. This approach eliminates the need for initial mapping efforts and is therefore applicable to the vast array of microbial species for which genome maps are unavailable. The H. influenzae Rd genome sequence (Genome Sequence DataBase accession number L42023) represents the only complete genome sequence from a free-living organism. 46 refs., 4 figs., 4 tabs.

  17. [CRISPR/Cas9-based genome editing systems and the analysis of targeted genome mutations in plants].

    Science.gov (United States)

    Xingliang, Ma; Yaoguang, Liu

    2016-02-01

    Targeted genomic editing technologies use programmable DNA nucleases to cleave genomic target sites, thus inducing targeted mutations in the genomes. The newly prevailed clustered regularly interspaced short palindromic repeats/CRISPR-associated protein 9 (CRISPR/Cas9) system that consists of the Cas9 nuclease and single guide RNA (sgRNA) has the advantages of simplicity and high efficiency as compared to other programmable DNA nuclease systems such as zinc finger nucleases (ZFNs) and transcription activator like effector nucleases (TALENs). Currently, a number of cases have been reported on the application of the CRISPR/Cas9 genomic editing technology in plants. In this review, we summarize the strategies for preparing the Cas9 and sgRNA expression constructs, the transformation method for obtaining targeted mutations, the efficiency and features of the resulting mutations and the methods for detecting or genotyping of the mutation sites. We also discuss the existing problems and perspectives of CRISPR/Cas9-based genomic editing in plants.

  18. Research progress of plant population genomics based on high-throughput sequencing.

    Science.gov (United States)

    Yunsheng, Wang

    2016-08-01

    Population genomics, a new paradigm for population genetics, combine the concepts and techniques of genomics with the theoretical system of population genetics and improve our understanding of microevolution through identification of site-specific effect and genome-wide effects using genome-wide polymorphic sites genotypeing. With the appearance and improvement of the next generation high-throughput sequencing technology, the numbers of plant species with complete genome sequences increased rapidly and large scale resequencing has also been carried out in recent years. Parallel sequencing has also been done in some plant species without complete genome sequences. These studies have greatly promoted the development of population genomics and deepened our understanding of the genetic diversity, level of linking disequilibium, selection effect, demographical history and molecular mechanism of complex traits of relevant plant population at a genomic level. In this review, I briely introduced the concept and research methods of population genomics and summarized the research progress of plant population genomics based on high-throughput sequencing. I also discussed the prospect as well as existing problems of plant population genomics in order to provide references for related studies.

  19. Analysis of the genomic homologous recombination in Theilovirus based on complete genomes

    Directory of Open Access Journals (Sweden)

    Yi Maoli

    2011-09-01

    Full Text Available Abstract At present, Theilovirus is considered to comprise four distinct serotypes, including Theiler's murine encephalomyelitis virus, Vilyuisk human encephalomyelitis virus, Thera virus, and Saffold virus. So far, there is no systematical study that investigated the genomic recombination of Theilovirus. The present study performed the phylogenetic and recombination analysis of Theilovirus over the complete genomes. Seven potentially significant recombination events were identified. However, according to the strains information and references related to the recombinants and their parental strains, four of the recombination events might happen non-naturally. These results will provide valuable hints for future research on evolution and antigenic variability of Theilovirus.

  20. Analysis of the genomic homologous recombination in Theilovirus based on complete genomes.

    Science.gov (United States)

    Sun, Guangming; Zhang, Xiaodan; Yi, Maoli; Shao, Shihe; Zhang, Wen

    2011-09-17

    At present, Theilovirus is considered to comprise four distinct serotypes, including Theiler's murine encephalomyelitis virus, Vilyuisk human encephalomyelitis virus, Thera virus, and Saffold virus. So far, there is no systematical study that investigated the genomic recombination of Theilovirus. The present study performed the phylogenetic and recombination analysis of Theilovirus over the complete genomes. Seven potentially significant recombination events were identified. However, according to the strains information and references related to the recombinants and their parental strains, four of the recombination events might happen non-naturally. These results will provide valuable hints for future research on evolution and antigenic variability of Theilovirus.

  1. Rapid determination of soil quality and earthworm impacts on soil microbial communities using fluorescence-based respirometry

    Science.gov (United States)

    Prendergast-Miller, Miranda T.; Thurston, Josh; Taylor, Joe; Helgason, Thorunn; Ashauer, Roman; Hodson, Mark E.

    2017-04-01

    We applied a fluorescence-based respirometry method currently devised for aquatic ecotoxicology studies to rapidly measure soil microbial oxygen consumption as a function of soil quality. In this study, soil was collected from an arable wheat field and the field margin. These two soil habitats are known to differ in their soil quality due to differences in their use and management as well as plant, microbial and earthworm community. The earthworm Lumbricus terrestris was incubated in arable or margin soil for three weeks. After this initial phase, a transfer experiment was then conducted to test the hypothesis that earthworm 'migration' alters soil microbial community function and diversity. In this transfer experiment, earthworms incubated in margin soil were transferred to arable soil. The converse transfer (i.e. earthworms incubated in arable soil) was also conducted. Soils of each type with no earthworms were also incubated as controls. After a further four week incubation, the impact of earthworm migration on the soil microbial community was tested by measuring oxygen consumption. Replicated soil slurry subsamples were aliquoted into individual respirometer wells (600 μl volume) on a glass 24-well microplate (Loligo Systems, Denmark) fitted with non-invasive, reusable oxygen sensor spots. The sealed microplate was then attached to an oxygen fluorescence sensor (SDR SensorDish Reader, PreSens, Germany). Oxygen consumption was measured in real-time over a 2 hr period following standard operating procedures. Soil microbial activity was measured with and without an added carbon source (glucose or cellulose, 50 mg C L-1). Using this system, we were able to differentiate between soil type, earthworm treatment and C source. Earthworm-driven impacts on soil microbial oxygen consumption were also supported by changes in soil microbial community structure and diversity revealed using DNA-based sequencing techniques. This method provides a simple and rapid system for

  2. A hybrid microbial fuel cell stack based on single and double chamber microbial fuel cells for self-sustaining pH control

    Science.gov (United States)

    Yang, Wei; Li, Jun; Ye, Dingding; Zhang, Liang; Zhu, Xun; Liao, Qiang

    2016-02-01

    Proton accumulation in the anode chamber is the major problem that affects the operational stability and electricity generation performance of double chamber microbial fuel cells (MFCs). In this study, a hybrid microbial fuel cell stack (DS-DS stack) based on single (SCMFCs) and double chamber MFCs (DCMFCs) is proposed for self-sustaining pH control in the MFC stack. It is found that the aerobic microbial oxidation of acetate by the biofilm that is attached to the air cathode of SCMFCs is responsible for the self-sustaining removal of accumulated H+ in the effluent of DCMFCs. Compared with the stack that solely consists of SCMFCs (SS-SS stack) or DCMFCs (DD-DD stack), the hybrid stack exhibits the highest electricity output performance and the most effective conversion of acetate into electricity at high power levels. Furthermore, the hybrid stack demonstrates the operation time of 15.7 ± 1.1 h when the operating voltage is above 0.8 V. This value is much higher than that of the DD-DD (8.5 ± 2.4 h) and SS-SS (8.1 ± 1.4 h) stacks, which suggests that the hybrid stack had a good operational stability.

  3. The relationship between odd- and branched-chain fatty acids and microbial nucleic acid bases in rumen.

    Science.gov (United States)

    Liu, Keyuan; Hao, Xiaoyan; Li, Yang; Luo, Guobin; Zhang, Yonggen; Xin, Hangshu

    2017-06-26

    This study aims to identify the relationship between odd- and branched-chain fatty acids (OBCFAs) and microbial nucleic acid base in the rumen, and to establish a model to accurately predict microbial protein flow by using OBCFA. To develop the regression equations, data on the rumen contents of individual cows was obtained from 2 feeding experiments. In the first experiment, 3 rumen-fistulated dry dairy cows arranged in a 3×3 Latin square were fed diets of differing forage to concentration ratios (F:C). The second experiment consisted of 9 lactating Holstein dairy cows of similar body weights at the same fetal time. For each milk stage, 3 cows with similar milk production were selected. The rumen contents were sampled at 4 time points of every two hours after morning feeding 6 h, and then to analyse the concentrations of OBCFA and microbial nucleic acid bases in the rumen samples. The ruminal bacteria nucleic acid bases were significantly influenced by feeding diets of differing forge to concentration ratios and milk stages of dairy (P acids, especially odd-chain fatty acids and C15:0 isomers, strongly correlated with the microbial nucleic acid bases in the rumen (P acid bases which established by ruminal OBCFAs contents showed a good a good predictive capacity, as indicated by reasonably low SEs and high R-squared values. This finding suggests that the rumen OBCFA composition could be used as an internal marker of rumen microbial matter.

  4. Utilization and control of ecological interactions in polymicrobial infections and community-based microbial cell factories

    DEFF Research Database (Denmark)

    Wigneswaran, Vinoth; Amador Hierro, Cristina Isabel; Jelsbak, Lotte;

    2016-01-01

    Microbial activities are most often shaped by interactions between co-existing microbes within mixed-species communities. Dissection of the molecular mechanisms of species interactions within communities is a central issue in microbial ecology, and our ability to engineer and control microbial co...

  5. [Bioanode for a microbial fuel cell based on Gluconobacter oxydans inummobilized into a polymer matrix].

    Science.gov (United States)

    Alferov, S V; Minaĭcheva, P R; Arliapov, V A; Asulian, L D; Alferov, V A; Ponomareva, O N; Reshetilov, A N

    2014-01-01

    Acetic acid bacteria Gluconobacter oxydans subsp. industrius RKM V-1280 were immobilized into a synthetic matrix based on polyvinyl alcohol modified with N-vinylpyrrolidone and used as biocatalysts for the development ofbioanodes for microbial fuel cells. The immobilization method did not significantly affect bacterial substrate specificity. Bioanodes based on immobilized bacteria functioned stably for 7 days. The maximum voltage (fuel cell signal) was reached when 100-130 µM of an electron transport mediator, 2,6-dichlorophenolindophenol, was added into the anode compartment. The fuel cell signals reached a maximum at a glucose concentration higher than 6 mM. The power output of the laboratory model of a fuel cell based on the developed bioanode reached 7 mW/m2 with the use of fermentation industry wastes as fuel.

  6. A paper-based microbial fuel cell: instant battery for disposable diagnostic devices.

    Science.gov (United States)

    Fraiwan, Arwa; Mukherjee, Sayantika; Sundermier, Steven; Lee, Hyung-Sool; Choi, Seokheun

    2013-11-15

    We present a microfabricated paper-based microbial fuel cell (MFC) generating a maximum power of 5.5 μW/cm(2). The MFC features (1) a paper-based proton exchange membrane by infiltrating sulfonated sodium polystyrene sulfonate and (2) micro-fabricated paper chambers by patterning hydrophobic barriers of photoresist. Once inoculum and catholyte were added to the MFC, a current of 74 μA was generated immediately. This paper-based MFC has the advantages of ease of use, low production cost, and high portability. The voltage produced was increased by 1.9 × when two MFC devices were stacked in series, while operating lifetime was significantly enhanced in parallel.

  7. Inhibition of soil microbial activity by nitrogen-based energetic materials.

    Science.gov (United States)

    Kuperman, Roman G; Minyard, Morgan L; Checkai, Ronald T; Sunahara, Geoffrey I; Rocheleau, Sylvie; Dodard, Sabine G; Paquet, Louise; Hawari, Jalal

    2017-05-18

    We investigated individual toxicities of the nitrogen-based energetic materials (EMs) 2,4-dinitrotoluene (2,4-DNT); 2-amino-4,6-dinitrotoluene (2-ADNT); 4-amino-2,6-dinitrotoluene (4-ADNT); and nitroglycerin (NG) on microbial activity in Sassafras sandy loam (SSL) soil, which has physicochemical characteristics that support very high qualitative relative bioavailability for organic chemicals. Batches of SSL soil for basal respiration (BR) and substrate-induced respiration (SIR) assays were separately amended with individual EMs or acetone carrier control. Total microbial biomass carbon (biomass C) was determined from CO2 production increases after addition of 2500 mg/kg of glucose-water slurry to the soil. Exposure concentrations of each EM in soil were determined using US Environmental Protection Agency method 8330A. Basal respiration was the most sensitive endpoint for assessing the effects of nitroaromatic EMs on microbial activity in SSL, whereas SIR and biomass C were more sensitive endpoints for assessing the effects of NG in soil. The orders of toxicity (from greatest to least) were 4-ADNT > 2,4-DNT = 2-ADNT > NG for BR; but for SIR and biomass C, the order of toxicity was NG > 2,4-DNT > 2-ADNT = 4-ADNT. No inhibition of SIR was found up to and including the greatest concentration of each ADNT tested in SSL. These ecotoxicological data will be helpful in identifying concentrations of contaminant EMs in soil that present acceptable ecological risks for biologically mediated processes in soil. Environ Toxicol Chem 2017;9999:1-10. Published 2017 Wiley Periodicals Inc. on behalf of SETAC. This article is a US government work and, as such, is in the public domain in the United States of America. Published 2017 Wiley Periodicals Inc., on behalf of SETAC.

  8. Compatibility of pedigree-based and marker-based relationships for single-step genomic prediction

    DEFF Research Database (Denmark)

    Christensen, Ole Fredslund

    2012-01-01

    Single-step methods for genomic prediction have recently become popular because they are conceptually simple and in practice such a method can completely replace a pedigree-based method for routine genetic evaluation. An issue with single-step methods is compatibility between the marker-based rel......Single-step methods for genomic prediction have recently become popular because they are conceptually simple and in practice such a method can completely replace a pedigree-based method for routine genetic evaluation. An issue with single-step methods is compatibility between the marker......-based relationship matrix and the pedigree-based relationship matrix. The compatibility issue involves which allele frequencies to use in the marker-based relationship matrix, and also that adjustments of this matrix to the pedigree-based relationship matrix are needed. In addition, it has been overlooked...... in the base population. Here, two ideas are explored. The first idea is to instead adjust the pedigree-based relationship matrix to be compatible to the marker-based relationship matrix, whereas the second idea is to include the likelihood for the observed markers. A single-step method is used where...

  9. Integrated Genome-Based Studies of Shewanella Ecophysiology

    Energy Technology Data Exchange (ETDEWEB)

    Zhou, Jizhong; He, Zhili

    2014-04-08

    As a part of the Shewanella Federation project, we have used integrated genomic, proteomic and computational technologies to study various aspects of energy metabolism of two Shewanella strains from a systems-level perspective.

  10. A trait-based framework for predicting when and where microbial adaptation to climate change will affect ecosystem functioning

    Science.gov (United States)

    Wallenstein, Matthew D.; Hall, Edward K.

    2012-01-01

    As the earth system changes in response to human activities, a critical objective is to predict how biogeochemical process rates (e.g. nitrification, decomposition) and ecosystem function (e.g. net ecosystem productivity) will change under future conditions. A particular challenge is that the microbial communities that drive many of these processes are capable of adapting to environmental change in ways that alter ecosystem functioning. Despite evidence that microbes can adapt to temperature, precipitation regimes, and redox fluctuations, microbial communities are typically not optimally adapted to their local environment. For example, temperature optima for growth and enzyme activity are often greater than in situ temperatures in their environment. Here we discuss fundamental constraints on microbial adaptation and suggest specific environments where microbial adaptation to climate change (or lack thereof) is most likely to alter ecosystem functioning. Our framework is based on two principal assumptions. First, there are fundamental ecological trade-offs in microbial community traits that occur across environmental gradients (in time and space). These trade-offs result in shifting of microbial function (e.g. ability to take up resources at low temperature) in response to adaptation of another trait (e.g. limiting maintenance respiration at high temperature). Second, the mechanism and level of microbial community adaptation to changing environmental parameters is a function of the potential rate of change in community composition relative to the rate of environmental change. Together, this framework provides a basis for developing testable predictions about how the rate and degree of microbial adaptation to climate change will alter biogeochemical processes in aquatic and terrestrial ecosystems across the planet.

  11. Grappling with Proteus: Population level approaches to understanding microbial diversity

    Directory of Open Access Journals (Sweden)

    Mallory J. Choudoir

    2012-09-01

    Full Text Available The emerging fields of microbial population genetics and genomics provide an avenue to study the ecological rules that govern how communities form, function, and evolve. Our struggle to understand the causes and consequences of microbial diversity stems from our inability to define ecologically and evolutionarily meaningful units of diversity. The 16S rRNA-based tools that have been so useful in charting microbial diversity may lack sufficient sensitivity to answer many questions about the ecology and evolution of microbes. Examining genetic diversity with increased resolution is vital to understanding the forces shaping community structure. Population genetic analyses enabled by whole genome sequencing, multilocus sequence analyses, or single nucleotide polymorphism analyses permit the testing of hypotheses pertaining to the geographic distribution, migration, and habitat preference of specific microbial lineages. Furthermore, these approaches can reveal patterns of gene exchange within and between populations and communities. Tools from microbial population genetics and population genomics can be used to increase the resolution with which we measure microbial diversity, enabling a focus on the scale of genetic diversity at which

  12. Base-Catalyzed Depolymerization of Solid Lignin-Rich Streams Enables Microbial Conversion

    Energy Technology Data Exchange (ETDEWEB)

    Beckham, Gregg T [National Renewable Energy Laboratory (NREL), Golden, CO (United States); Salvachua Rodriguez, Davinia [National Renewable Energy Laboratory (NREL), Golden, CO (United States); Katahira, Rui [National Renewable Energy Laboratory (NREL), Golden, CO (United States); Pleitner, Brenna P [National Renewable Energy Laboratory (NREL), Golden, CO (United States); Cleveland, Nicholas S [National Renewable Energy Laboratory (NREL), Golden, CO (United States); Nolker, Michelle L [National Renewable Energy Laboratory (NREL), Golden, CO (United States); Smith, Holly K [National Renewable Energy Laboratory (NREL), Golden, CO (United States); Rodriguez, Alberto [Sandia National Laboratories; Baidoo, Edward E. K. [Lawrence Berkeley National Laboratory; DOE Joint BioEnergy Institute; Keasling, Jay D. [Lawrence Berkeley National Laboratory; DOE Joint BioEnergy InstituteUniversity of California, Berkeley; Technical University of Denmark; Simmons, Blake A. [Sandia National Laboratories; Lawrence Berkeley National Laboratory; DOE Joint BioEnergy Institute; Technical University of Denmark; Gladden, John M. [Sandia National Laboratories; DOE Joint BioEnergy Institute

    2017-08-01

    Lignin valorization offers significant potential to enhance the economic viability of lignocellulosic biorefineries. However, because of its heterogeneous and recalcitrant nature, conversion of lignin to value-added coproducts remains a considerable technical challenge. In this study, we employ base-catalyzed depolymerization (BCD) using a process-relevant solid lignin stream produced via deacetylation, mechanical refining, and enzymatic hydrolysis to enable biological lignin conversion. BCD was conducted with the solid lignin substrate over a range of temperatures at two NaOH concentrations, and the results demonstrate that the lignin can be partially extracted and saponified at temperatures as low as 60 degrees C. At 120 degrees C and 2% NaOH, the high extent of lignin solubility was accompanied by a considerable decrease in the lignin average molecular weight and the release of lignin-derived monomers including hydroxycinnamic acids. BCD liquors were tested for microbial growth using seven aromatic-catabolizing bacteria and two yeasts. Three organisms (Pseudomonas putida KT2440, Rhodotorula mucilaginosa, and Corynebacterium glutamicum) tolerate high BCD liquor concentrations (up to 90% v/v) and rapidly consume the main lignin-derived monomers, resulting in lignin conversion of up to 15%. Furthermore, as a proof of concept, muconic acid production from a representative lignin BCD liquor was demonstrated with an engineered P. putida KT2440 strain. These results highlight the potential for a mild lignin depolymerization process to enhance the microbial conversion of solid lignin-rich biorefinery streams.

  13. Ocean microbial metagenomics

    Science.gov (United States)

    Kerkhof, Lee J.; Goodman, Robert M.

    2009-09-01

    Technology for accessing the genomic DNA of microorganisms, directly from environmental samples without prior cultivation, has opened new vistas to understanding microbial diversity and functions. Especially as applied to soils and the oceans, environments on Earth where microbial diversity is vast, metagenomics and its emergent approaches have the power to transform rapidly our understanding of environmental microbiology. Here we explore select recent applications of the metagenomic suite to ocean microbiology.

  14. Pairagon: a highly accurate, HMM-based cDNA-to-genome aligner

    DEFF Research Database (Denmark)

    Lu, David V; Brown, Randall H; Arumugam, Manimozhiyan

    2009-01-01

    MOTIVATION: The most accurate way to determine the intron-exon structures in a genome is to align spliced cDNA sequences to the genome. Thus, cDNA-to-genome alignment programs are a key component of most annotation pipelines. The scoring system used to choose the best alignment is a primary......' simulated cDNA sequences by splicing the sequences of exons in the reference genome sequences of fly and human. The complete reference genome sequences were then mutated to various degrees using a realistic mutation simulator and the perfect cDNAs were aligned to them using Pairagon and 12 other aligners...... heuristics. RESULTS: We present Pairagon, a pair hidden Markov model based cDNA-to-genome alignment program, as the most accurate aligner for sequences with high- and low-identity levels. We conducted a series of experiments testing alignment accuracy with varying sequence identity. We first created 'perfect...

  15. Acorn: A grid computing system for constraint based modeling and visualization of the genome scale metabolic reaction networks via a web interface

    Directory of Open Access Journals (Sweden)

    Bushell Michael E

    2011-05-01

    Full Text Available Abstract Background Constraint-based approaches facilitate the prediction of cellular metabolic capabilities, based, in turn on predictions of the repertoire of enzymes encoded in the genome. Recently, genome annotations have been used to reconstruct genome scale metabolic reaction networks for numerous species, including Homo sapiens, which allow simulations that provide valuable insights into topics, including predictions of gene essentiality of pathogens, interpretation of genetic polymorphism in metabolic disease syndromes and suggestions for novel approaches to microbial metabolic engineering. These constraint-based simulations are being integrated with the functional genomics portals, an activity that requires efficient implementation of the constraint-based simulations in the web-based environment. Results Here, we present Acorn, an open source (GNU GPL grid computing system for constraint-based simulations of genome scale metabolic reaction networks within an interactive web environment. The grid-based architecture allows efficient execution of computationally intensive, iterative protocols such as Flux Variability Analysis, which can be readily scaled up as the numbers of models (and users increase. The web interface uses AJAX, which facilitates efficient model browsing and other search functions, and intuitive implementation of appropriate simulation conditions. Research groups can install Acorn locally and create user accounts. Users can also import models in the familiar SBML format and link reaction formulas to major functional genomics portals of choice. Selected models and simulation results can be shared between different users and made publically available. Users can construct pathway map layouts and import them into the server using a desktop editor integrated within the system. Pathway maps are then used to visualise numerical results within the web environment. To illustrate these features we have deployed Acorn and created a

  16. PCR Based Microbial Monitor for Analysis of Recycled Water Aboard the ISSA: Issues and Prospects

    Science.gov (United States)

    Cassell, Gail H.; Lefkowitz, Elliot J.; Glass, John I.

    1995-01-01

    The monitoring of spacecraft life support systems for the presence of health threatening microorganisms is paramount for crew well being and successful completion of missions. Development of technology to monitor spacecraft recycled water based on detection and identification of the genetic material of contaminating microorganisms and viruses would be a substantial improvement over current NASA plans to monitor recycled water samples that call for the use of conventional microbiology techniques which are slow, insensitive, and labor intensive. The union of the molecular biology techniques of DNA probe hybridization and polymerase chain reaction (PCR) offers a powerful method for the detection, identification, and quantification of microorganisms and viruses. This technology is theoretically capable of assaying samples in as little as two hours with specificity and sensitivity unmatched by any other method. A major advance in probe-hybridization/PCR has come about in a technology called TaqMan(TM), which was invented by Perkin Elmer. Instrumentation using TaqMan concepts is evolving towards devices that could meet NASA's needs of size, low power use, and simplicity of operation. The chemistry and molecular biology needed to utilize these probe-hybridization/PCR instruments must evolve in parallel with the hardware. The following issues of chemistry and biology must be addressed in developing a monitor: Early in the development of a PCR-based microbial monitor it will be necessary to decide how many and which organisms does the system need the capacity to detect. We propose a set of 17 different tests that would detect groups of bacteria and fungus, as well as specific eukaryotic parasites and viruses; In order to use the great sensitivity of PCR it will be necessary to concentrate water samples using filtration. If a lower limit of detection of 1 microorganism per 100 ml is required then the microbes in a 100 ml sample must be concentrated into a volume that can be

  17. Comprehensive genome characterization of solitary fibrous tumors using high-resolution array-based comparative genomic hybridization.

    Science.gov (United States)

    Bertucci, François; Bouvier-Labit, Corinne; Finetti, Pascal; Adélaïde, José; Metellus, Philippe; Mokhtari, Karima; Decouvelaere, Anne-Valérie; Miquel, Catherine; Jouvet, Anne; Figarella-Branger, Dominique; Pedeutour, Florence; Chaffanet, Max; Birnbaum, Daniel

    2013-02-01

    Solitary fibrous tumors (SFTs) are rare spindle cell tumors with limited therapeutic options. Their molecular basis is poorly known. No consistent cytogenetic abnormality has been reported. We used high-resolution whole-genome array-based comparative genomic hybridization (Agilent 244K oligonucleotide chips) to profile 47 samples, meningeal in >75% of cases. Few copy number aberrations (CNAs) were observed. Sixty-eight percent of samples did not show any gene CNA after exclusion of probes located in regions with referenced copy number variation (CNV). Only low-level CNAs were observed. The genomic profiles were very homogeneous among samples. No molecular class was revealed by clustering of DNA copy numbers. All cases displayed a "simplex" profile. No recurrent CNA was identified. Imbalances occurring in >20%, such as the gain of 8p11.23-11.22 region, contained known CNVs. The 13q14.11-13q31.1 region (lost in 4% of cases) was the largest altered region and contained the lowest percentage of genes with referenced CNVs. A total of 425 genes without CNV showed copy number transition in at least one sample, but only but only 1 in at least 10% of samples. The genomic profiles of meningeal and extra-meningeal cases did not show any differences.

  18. Evaluation of the impact of six different DNA extraction methods for the representation of the microbial community associated with human chronic wound infections using a gel-based DNA profiling method.

    Science.gov (United States)

    Dilhari, Ayomi; Sampath, Asanga; Gunasekara, Chinthika; Fernando, Neluka; Weerasekara, Deepaka; Sissons, Chris; McBain, Andrew; Weerasekera, Manjula

    2017-09-19

    Infected chronic wounds are polymicrobial in nature which include a diverse group of aerobic and anaerobic microorganisms. Majority of these communal microorganisms are difficult to grow in vitro. DNA fingerprinting methods such as polymerase chain reaction-denaturation gradient gel electrophoresis (PCR-DGGE) facilitate the microbial profiling of complex ecosystems including infected chronic wounds. Six different DNA extraction methods were compared for profiling of the microbial community associated with chronic wound infections using PCR-DGGE. Tissue debris obtained from chronic wound ulcers of ten patients were used for DNA extraction. Total nucleic acid was extracted from each specimen using six DNA extraction methods. The yield, purity and quality of DNA was measured and used for PCR amplification targeting V2-V3 region of eubacterial 16S rRNA gene. QIAGEN DNeasy Blood and Tissue Kit (K method) produced good quality genomic DNA compared to the other five DNA extraction methods and gave a broad diversity of bacterial communities in chronic wounds. Among the five conventional methods, bead beater/phenol-chloroform based DNA extraction method with STES buffer (BP1 method) gave a yield of DNA with a high purity and resulted in a higher DGGE band diversity. Although DNA extraction using heat and NaOH had the lowest purity, DGGE revealed a higher bacterial diversity. The findings suggest that the quality and the yield of genomic DNA are influenced by the DNA extraction protocol, thus a method should be carefully selected in profiling a complex microbial community.

  19. Genomes-based phylogeny of the genus Xanthomonas

    Directory of Open Access Journals (Sweden)

    Rodriguez-R Luis M

    2012-03-01

    Full Text Available Abstract Background The genus Xanthomonas comprises several plant pathogenic bacteria affecting a wide range of hosts. Despite the economic, industrial and biological importance of Xanthomonas, the classification and phylogenetic relationships within the genus are still under active debate. Some of the relationships between pathovars and species have not been thoroughly clarified, with old pathovars becoming new species. A change in the genus name has been recently suggested for Xanthomonas albilineans, an early branching species currently located in this genus, but a thorough phylogenomic reconstruction would aid in solving these and other discrepancies in this genus. Results Here we report the results of the genome-wide analysis of DNA sequences from 989 orthologous groups from 17 Xanthomonas spp. genomes available to date, representing all major lineages within the genus. The phylogenetic and computational analyses used in this study have been automated in a Perl package designated Unus, which provides a framework for phylogenomic analyses which can be applied to other datasets at the genomic level. Unus can also be easily incorporated into other phylogenomic pipelines. Conclusions Our phylogeny agrees with previous phylogenetic topologies on the genus, but revealed that the genomes of Xanthomonas citri and Xanthomonas fuscans belong to the same species, and that of Xanthomonas albilineans is basal to the joint clade of Xanthomonas and Xylella fastidiosa. Genome reduction was identified in the species Xanthomonas vasicola in addition to the previously identified reduction in Xanthomonas albilineans. Lateral gene transfer was also observed in two gene clusters.

  20. Cas9-based genome editing in Arabidopsis and tobacco.

    Science.gov (United States)

    Li, Jian-Feng; Zhang, Dandan; Sheen, Jen

    2014-01-01

    Targeted modification of plant genome is key to elucidating and manipulating gene functions in plant research and biotechnology. The clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein (Cas) technology is emerging as a powerful genome-editing method in diverse plants that traditionally lacked facile and versatile tools for targeted genetic engineering. This technology utilizes easily reprogrammable guide RNAs (sgRNAs) to direct Streptococcus pyogenes Cas9 endonuclease to generate DNA double-stranded breaks in targeted genome sequences, which facilitates efficient mutagenesis by error-prone nonhomologous end-joining (NHEJ) or sequence replacement by homology-directed repair (HDR). In this chapter, we describe the procedure to design and evaluate dual sgRNAs for plant codon-optimized Cas9-mediated genome editing using mesophyll protoplasts as model cell systems in Arabidopsis thaliana and Nicotiana benthamiana. We also discuss future directions in sgRNA/Cas9 applications for generating targeted genome modifications and gene regulations in plants.

  1. A web-based multi-genome synteny viewer for customized data

    Directory of Open Access Journals (Sweden)

    Revanna Kashi V

    2012-08-01

    Full Text Available Abstract Background Web-based synteny visualization tools are important for sharing data and revealing patterns of complicated genome conservation and rearrangements. Such tools should allow biologists to upload genomic data for their own analysis. This requirement is critical because individual biologists are generating large amounts of genomic sequences that quickly overwhelm any centralized web resources to collect and display all those data. Recently, we published a web-based synteny viewer, GSV, which was designed to satisfy the above requirement. However, GSV can only compare two genomes at a given time. Extending the functionality of GSV to visualize multiple genomes is important to meet the increasing demand of the research community. Results We have developed a multi-Genome Synteny Viewer (mGSV. Similar to GSV, mGSV is a web-based tool that allows users to upload their own genomic data files for visualization. Multiple genomes can be presented in a single integrated view with an enhanced user interface. Users can navigate through all the selected genomes in either pairwise or multiple viewing mode to examine conserved genomic regions as well as the accompanying genome annotations. Besides serving users who manually interact with the web server, mGSV also provides Web Services for machine-to-machine communication to accept data sent by other remote resources. The entire mGSV package can also be downloaded for easy local installation. Conclusions mGSV significantly enhances the original functionalities of GSV. A web server hosting mGSV is provided at http://cas-bioinfo.cas.unt.edu/mgsv.

  2. Integrated Approach to Reconstruction of Microbial Regulatory Networks

    Energy Technology Data Exchange (ETDEWEB)

    Rodionov, Dmitry A [Sanford-Burnham Medical Research Institute; Novichkov, Pavel S [Lawrence Berkeley National Laboratory

    2013-11-04

    This project had the goal(s) of development of integrated bioinformatics platform for genome-scale inference and visualization of transcriptional regulatory networks (TRNs) in bacterial genomes. The work was done in Sanford-Burnham Medical Research Institute (SBMRI, P.I. D.A. Rodionov) and Lawrence Berkeley National Laboratory (LBNL, co-P.I. P.S. Novichkov). The developed computational resources include: (1) RegPredict web-platform for TRN inference and regulon reconstruction in microbial genomes, and (2) RegPrecise database for collection, visualization and comparative analysis of transcriptional regulons reconstructed by comparative genomics. These analytical resources were selected as key components in the DOE Systems Biology KnowledgeBase (SBKB). The high-quality data accumulated in RegPrecise will provide essential datasets of reference regulons in diverse microbes to enable automatic reconstruction of draft TRNs in newly sequenced genomes. We outline our progress toward the three aims of this grant proposal, which were: Develop integrated platform for genome-scale regulon reconstruction; Infer regulatory annotations in several groups of bacteria and building of reference collections of microbial regulons; and Develop KnowledgeBase on microbial transcriptional regulation.

  3. VibrioBase: A Model for Next-Generation Genome and Annotation Database Development

    Directory of Open Access Journals (Sweden)

    Siew Woh Choo

    2014-01-01

    Full Text Available To facilitate the ongoing research of Vibrio spp., a dedicated platform for the Vibrio research community is needed to host the fast-growing amount of genomic data and facilitate the analysis of these data. We present VibrioBase, a useful resource platform, providing all basic features of a sequence database with the addition of unique analysis tools which could be valuable for the Vibrio research community. VibrioBase currently houses a total of 252 Vibrio genomes developed in a user-friendly manner and useful to enable the analysis of these genomic data, particularly in the field of comparative genomics. Besides general data browsing features, VibrioBase offers analysis tools such as BLAST interfaces and JBrowse genome browser. Other important features of this platform include our newly developed in-house tools, the pairwise genome comparison (PGC tool, and pathogenomics profiling tool (PathoProT. The PGC tool is useful in the identification and comparative analysis of two genomes, whereas PathoProT is designed for comparative pathogenomics analysis of Vibrio strains. Both of these tools will enable researchers with little experience in bioinformatics to get meaningful information from Vibrio genomes with ease. We have tested the validity and suitability of these tools and features for use in the next-generation database development.

  4. Manifold Based Optimization for Single-Cell 3D Genome Reconstruction.

    Directory of Open Access Journals (Sweden)

    Jonas Paulsen

    2015-08-01

    Full Text Available The three-dimensional (3D structure of the genome is important for orchestration of gene expression and cell differentiation. While mapping genomes in 3D has for a long time been elusive, recent adaptations of high-throughput sequencing to chromosome conformation capture (3C techniques, allows for genome-wide structural characterization for the first time. However, reconstruction of "consensus" 3D genomes from 3C-based data is a challenging problem, since the data are aggregated over millions of cells. Recent single-cell adaptations to the 3C-technique, however, allow for non-aggregated structural assessment of genome structure, but data suffer from sparse and noisy interaction sampling. We present a manifold based optimization (MBO approach for the reconstruction of 3D genome structure from chromosomal contact data. We show that MBO is able to reconstruct 3D structures based on the chromosomal contacts, imposing fewer structural violations than comparable methods. Additionally, MBO is suitable for efficient high-throughput reconstruction of large systems, such as entire genomes, allowing for comparative studies of genomic structure across cell-lines and different species.

  5. Genomic organization, annotation, and ligand-receptor inferences of chicken chemokines and chemokine receptor genes based on comparative genomics

    Directory of Open Access Journals (Sweden)

    Sze Sing-Hoi

    2005-03-01

    Full Text Available Abstract Background Chemokines and their receptors play important roles in host defense, organogenesis, hematopoiesis, and neuronal communication. Forty-two chemokines and 19 cognate receptors have been found in the human genome. Prior to this report, only 11 chicken chemokines and 7 receptors had been reported. The objectives of this study were to systematically identify chicken chemokines and their cognate receptor genes in the chicken genome and to annotate these genes and ligand-receptor binding by a comparative genomics approach. Results Twenty-three chemokine and 14 chemokine receptor genes were identified in the chicken genome. All of the chicken chemokines contained a conserved CC, CXC, CX3C, or XC motif, whereas all the chemokine receptors had seven conserved transmembrane helices, four extracellular domains with a conserved cysteine, and a conserved DRYLAIV sequence in the second intracellular domain. The number of coding exons in these genes and the syntenies are highly conserved between human, mouse, and chicken although the amino acid sequence homologies are generally low between mammalian and chicken chemokines. Chicken genes were named with the systematic nomenclature used in humans and mice based on phylogeny, synteny, and sequence homology. Conclusion The independent nomenclature of chicken chemokines and chemokine receptors suggests that the chicken may have ligand-receptor pairings similar to mammals. All identified chicken chemokines and their cognate receptors were identified in the chicken genome except CCR9, whose ligand was not identified in this study. The organization of these genes suggests that there were a substantial number of these genes present before divergence between aves and mammals and more gene duplications of CC, CXC, CCR, and CXCR subfamilies in mammals than in aves after the divergence.

  6. Whole-genome sequence-based analysis of thyroid function

    OpenAIRE

    Taylor, Peter N; Porcu, Eleonora; Chew, Shelby; Campbell, Purdey J.; Traglia, Michela; Brown, Suzanne J.; Mullin, Benjamin H; Shihab, Hashem A.; Min, Josine; Walter, Klaudia; Memari, Yasin; Huang, Jie; Barnes, Michael R.; Beilby, John P.; Charoen, Pimphen

    2015-01-01

    Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N=2,287). Using additional whole-genome sequence and deeply imputed data sets, we report meta-analysis results for common variants (MAF≥1%) associated with TSH and FT4 (N=16,335). For TSH, we identify a novel variant in SYN2 (MAF=23.5%, P=6.15 × 1...

  7. [Current perspectives on genome-based diagnostic tests in Pediatrics].

    Science.gov (United States)

    Lay-Son, R Guillermo; León, P Luis

    2015-01-01

    Etiological diagnosis is essential in the clinical management of individual patients. Some children with complex medical conditions are subjected to numerous testing, known as "diagnostic odyssey", which often gives no conclusive results. In recent years, a revolution in genomic medicine is underway with the use of technologies that promise to increase the ability to make a diagnosis and reduce the time involved. The main advantages and limitations of genomic diagnosis, as opposed to usual methodologies are reviewed with an emphasis on Pediatrics. Copyright © 2015. Publicado por Elsevier España, S.L.U.

  8. Construction of a phylogenetic tree of photosynthetic prokaryotes based on average similarities of whole genome sequences.

    Science.gov (United States)

    Satoh, Soichirou; Mimuro, Mamoru; Tanaka, Ayumi

    2013-01-01

    Phylogenetic trees have been constructed for a wide range of organisms using gene sequence information, especially through the identification of orthologous genes that have been vertically inherited. The number of available complete genome sequences is rapidly increasing, and many tools for construction of genome trees based on whole genome sequences have been proposed. However, development of a reasonable method of using complete genome sequences for construction of phylogenetic trees has not been established. We have developed a method for construction of phylogenetic trees based on the average sequence similarities of whole genome sequences. We used this method to examine the phylogeny of 115 photosynthetic prokaryotes, i.e., cyanobacteria, Chlorobi, proteobacteria, Chloroflexi, Firmicutes and nonphotosynthetic organisms including Archaea. Although the bootstrap values for the branching order of phyla were low, probably due to lateral gene transfer and saturated mutation, the obtained tree was largely consistent with the previously reported phylogenetic trees, indicating that this method is a robust alternative to traditional phylogenetic methods.

  9. Impacts of radiation exposure on the experimental microbial ecosystem: a particle-based model simulation approach

    Energy Technology Data Exchange (ETDEWEB)

    Doi, M.; Tanaka, N.; Fuma, S.; Kawabata, Z.

    2004-07-01

    Well-designed experimental model ecosystem could be a simple reference of the actual environment and complex ecological systems. For ecological toxicity test of radiation and other environmental toxicants, we investigated and aquatic microbial ecosystem (closed microcosm) in the test tube with initial substrates,autotroph flagellate algae (Euglena, G.), heterotroph ciliate protozoa (Tetrahymena T.) and saprotroph bacteria (E, coli). These species organizes by itself to construct the ecological system, that keeps the sustainable population dynamics for more than 2 years after inoculation only by adding light diurnally and controlling temperature at 25 degree Celsius. Objective of the study is to develop the particle-based computer simulation by reviewing interactions among microbes and environment, and analyze the ecological toxicities of radiation on the microcosm by replicating experimental results in the computer simulation. (Author) 14 refs.

  10. Numerical Modeling of Microbial Fuel Cell Based on Redox Electron Mediator

    Institute of Scientific and Technical Information of China (English)

    Nanqi Ren

    2015-01-01

    To investigate the behavior of redox electron mediator and its impact to power generation of microbial fuel cell ( MFC ) , this study carries out the numerical modeling of a typical two⁃chamber MFC based on assumption of interfacial electron transfer via redox electron mediator and acetate as sole electron donor. The model simulates the development of cell voltage, current, substrate concentration, redox electron mediator concentration, polarization and power density output under defined conditions. The results demonstrate that the developed models can fit the experimental results well on a qualitative basis, and concentration of electron reduced mediator plays a dominant role in electron transfer process, and the mass transfer may constitute the limiting step when its concentration is at a relatively low level. This study not only provides a better understanding of electron redox mediator behavior during power generation, but also suggests a strategy to improve electron transfer in the anode of MFC.

  11. Microbial fuel cell based biosensor for in situ monitoring of anaerobic digestion process.

    Science.gov (United States)

    Liu, Zhidan; Liu, Jing; Zhang, Songping; Xing, Xin-Hui; Su, Zhiguo

    2011-11-01

    A wall-jet microbial fuel cell (MFC) was developed for the monitoring of anaerobic digestion (AD). This biofilm based MFC biosensor had a character of being portable, short hydraulic retention time (HRT) for sample flow through and convenient for continuous operation. The MFC was installed in the recirculation loop of an upflow anaerobic fixed-bed (UAFB) reactor in bench-scale where pH of the fermentation broth and biogas flow were monitored in real time. External disturbances to the AD were added on purpose by changing feedstock concentration, as well as process configuration. MFC signals had good correlations with online measurements (i.e. pH, gas flow rate) and offline analysis (i.e. COD) over 6-month operation. These results suggest that the MFC signal can reflect the dynamic variation of AD and can potentially be a valuable tool for monitoring and control of bioprocess.

  12. A Hydration-Based Biophysical Index for the Onset of Soil Microbial Coexistence

    OpenAIRE

    Gang Wang; Dani Or

    2012-01-01

    Mechanistic exploration of the origins of the unparalleled soil microbial biodiversity represents a vast and uncharted scientific frontier. Quantification of candidate mechanisms that promote and sustain such diversity must be linked with microbial functions and measurable biophysical interactions at appropriate scales. We report a novel microbial coexistence index (CI) that links macroscopic soil hydration conditions with microscale aquatic habitat fragmentation that impose restrictions on c...

  13. YersiniaBase: a genomic resource and analysis platform for comparative analysis of Yersinia.

    Science.gov (United States)

    Tan, Shi Yang; Dutta, Avirup; Jakubovics, Nicholas S; Ang, Mia Yang; Siow, Cheuk Chuen; Mutha, Naresh Vr; Heydari, Hamed; Wee, Wei Yee; Wong, Guat Jah; Choo, Siew Woh

    2015-01-16

    Yersinia is a Gram-negative bacteria that includes serious pathogens such as the Yersinia pestis, which causes plague, Yersinia pseudotuberculosis, Yersinia enterocolitica. The remaining species are generally considered non-pathogenic to humans, although there is evidence that at least some of these species can cause occasional infections using distinct mechanisms from the more pathogenic species. With the advances in sequencing technologies, many genomes of Yersinia have been sequenced. However, there is currently no specialized platform to hold the rapidly-growing Yersinia genomic data and to provide analysis tools particularly for comparative analyses, which are required to provide improved insights into their biology, evolution and pathogenicity. To facilitate the ongoing and future research of Yersinia, especially those generally considered non-pathogenic species, a well-defined repository and analysis platform is needed to hold the Yersinia genomic data and analysis tools for the Yersinia research community. Hence, we have developed the YersiniaBase, a robust and user-friendly Yersinia resource and analysis platform for the analysis of Yersinia genomic data. YersiniaBase has a total of twelve species and 232 genome sequences, of which the majority are Yersinia pestis. In order to smooth the process of searching genomic data in a large database, we implemented an Asynchronous JavaScript and XML (AJAX)-based real-time searching system in YersiniaBase. Besides incorporating existing tools, which include JavaScript-based genome browser (JBrowse) and Basic Local Alignment Search Tool (BLAST), YersiniaBase also has in-house developed tools: (1) Pairwise Genome Comparison tool (PGC) for comparing two user-selected genomes; (2) Pathogenomics Profiling Tool (PathoProT) for comparative pathogenomics analysis of Yersinia genomes; (3) YersiniaTree for constructing phylogenetic tree of Yersinia. We ran analyses based on the tools and genomic data in YersiniaBase and the

  14. Genomic tools development for Aquilegia: construction of a BAC-based physical map

    Directory of Open Access Journals (Sweden)

    Hodges Scott A

    2010-11-01

    Full Text Available Abstract Background The genus Aquilegia, consisting of approximately 70 taxa, is a member of the basal eudicot lineage, Ranuculales, which is evolutionarily intermediate between monocots and core eudicots, and represents a relatively unstudied clade in the angiosperm phylogenetic tree that bridges the gap between these two major plant groups. Aquilegia species are closely related and their distribution covers highly diverse habitats. These provide rich resources to better understand the genetic basis of adaptation to different pollinators and habitats that in turn leads to rapid speciation. To gain insights into the genome structure and facilitate gene identification, comparative genomics and whole-genome shotgun sequencing assembly, BAC-based genomics resources are of crucial importance. Results BAC-based genomic resources, including two BAC libraries, a physical map with anchored markers and BAC end sequences, were established from A. formosa. The physical map was composed of a total of 50,155 BAC clones in 832 contigs and 3939 singletons, covering 21X genome equivalents. These contigs spanned a physical length of 689.8 Mb (~2.3X of the genome suggesting the complex heterozygosity of the genome. A set of 197 markers was developed from ESTs induced by drought-stress, or involved in anthocyanin biosynthesis or floral development, and was integrated into the physical map. Among these were 87 genetically mapped markers that anchored 54 contigs, spanning 76.4 Mb (25.5% across the genome. Analysis of a selection of 12,086 BAC end sequences (BESs from the minimal tiling path (MTP allowed a preview of the Aquilegia genome organization, including identification of transposable elements, simple sequence repeats and gene content. Common repetitive elements previously reported in both monocots and core eudicots were identified in Aquilegia suggesting the value of this genome in connecting the two major plant clades. Comparison with sequenced plant genomes

  15. Isolation and Characterization of Methyl Parathion-degrading Bacteria Based on Microbial Sensor Construction

    Directory of Open Access Journals (Sweden)

    GENG Fang-fang

    2014-12-01

    Full Text Available Methyl parathion (MP, a kind of typical organophosphates pesticides (OPs, is widely used as agricultural insecticides. However, due to their neurotoxic effects on humans, the elimination of OPs has become increasingly important. Microbial sensors are consisted of biological components and transducers. Owing to their attractive advantages including low cost, easy of miniaturization and excellent selectivity, they have been widely used for environmental analysis. In this paper, four novel bacterial strains capable of utilizing methyl parathion as the sole carbon source were isolated from pesticide contaminated soils. These four isolates were identified based on morphological characteristics and 16S rRNA gene sequences analysis, and their capability of degrading methyl parathion were investigated by high performance liquid chromatography. The highest degrading efficiency strain was selected for further study of degrading mechanism. The results indicated that degradation rate of these four strains were all over 78% after incubation at 30 ℃, pH 7.0 for 7 d with the original concentration of methyl parathion 50 mg·L-1. The highest degradation rate was up to 100%. 16S rRNA gene sequences indicated that strain MP-6 was affiliated into the genus klebsiella. The LC-MS results indicated that methyl parathion was hydrolyzed to dimethyl thiophosphoric acid and p-nitrophenol by MP-6. A little of p-nitrophenol molecules could be further metabolized to 4-nitrocatechol and 1, 2, 4-benzenetrio. The results indicated that based on detecting the potential signal of intermediate product p-nitrophenol, the strain MP-6 could be used to construct microbial sensors for determination of organophosphorus pesticides in environment.

  16. Identification of novel biomass-degrading enzymes from genomic dark matter: Populating genomic sequence space with functional annotation.

    Science.gov (United States)

    Piao, Hailan; Froula, Jeff; Du, Changbin; Kim, Tae-Wan; Hawley, Erik R; Bauer, Stefan; Wang, Zhong; Ivanova, Nathalia; Clark, Douglas S; Klenk, Hans-Peter; Hess, Matthias

    2014-08-01

    Although recent nucleotide sequencing technologies have significantly enhanced our understanding of microbial genomes, the function of ∼35% of genes identified in a genome currently remains unknown. To improve the understanding of microbial genomes and consequently of microbial processes it will be crucial to assign a function to this "genomic dark matter." Due to the urgent need for additional carbohydrate-active enzymes for improved production of transportation fuels from lignocellulosic biomass, we screened the genomes of more than 5,500 microorganisms for hypothetical proteins that are located in the proximity of already known cellulases. We identified, synthesized and expressed a total of 17 putative cellulase genes with insufficient sequence similarity to currently known cellulases to be identified as such using traditional sequence annotation techniques that rely on significant sequence similarity. The recombinant proteins of the newly identified putative cellulases were subjected to enzymatic activity assays to verify their hydrolytic activity towards cellulose and lignocellulosic biomass. Eleven (65%) of the tested enzymes had significant activity towards at least one of the substrates. This high success rate highlights that a gene context-based approach can be used to assign function to genes that are otherwise categorized as "genomic dark matter" and to identify biomass-degrading enzymes that have little sequence similarity to already known cellulases. The ability to assign function to genes that have no related sequence representatives with functional annotation will be important to enhance our understanding of microbial processes and to identify microbial proteins for a wide range of applications.

  17. A stackable, two-chambered, paper-based microbial fuel cell.

    Science.gov (United States)

    Fraiwan, Arwa; Choi, Seokheun

    2016-09-15

    We developed a stackable and integrable paper-based microbial fuel cell (MFC) for potentially powering on-chip paper-based devices. Four MFCs were prepared on a T-shaped filter paper which was eventually folded three times to connect these MFCs in series. Each MFC was fabricated by sandwiching multifunctional paper layers for two-chambered fuel cell configuration. One drop of bacteria-containing anolyte into the anodic inlet and another drop of potassium ferricyanide for cathodic reaction flowed through patterned fluidic pathways within the paper matrix, both vertically and horizontally, reaching each of the four MFCs and filling the reservoir of each device. Bacterial respiration then transferred electrons to the anode, which traveled across an external load to the cathode where they combined with protons. The MFC stack connected in series generated a high power density (1.2μW/cm(2)), which is two orders of magnitude higher than the previous report on the paper-based MFC stack. This work will represent the fusion of the art of origami and paper-based MFC technology, which could provide a paradigm shift for the architecture and design of paper-based batteries. Copyright © 2016 Elsevier B.V. All rights reserved.

  18. Whole genome homology-based identification of candidate genes ...

    African Journals Online (AJOL)

    Josephine Erhiakporeh

    2016-07-06

    Jul 6, 2016 ... identification of a set of 75 candidate genes (42, 22 and 11 from Arabidopsis, potato and tomato, ... understanding on the genetic basis of drought tolerance by using the .... Comparative genomics and genes expression assay ... Primer code ... physiological and molecular responses to drought stress.

  19. The ethical introduction of genome-based information and technologies into public health.

    Science.gov (United States)

    Howard, H C; Swinnen, E; Douw, K; Vondeling, H; Cassiman, J-J; Cambon-Thomsen, A; Borry, P

    2013-01-01

    With the human genome project running from 1989 until its completion in 2003, and the incredible advances in sequencing technology and in bioinformatics during the last decade, there has been a shift towards an increase focus on studying common complex disorders which develop due to the interplay of many different genes as well as environmental factors. Although some susceptibility genes have been identified in some populations for disorders such as cancer, diabetes and cardiovascular diseases, the integration of this information into the health care system has proven to be much more problematic than for single gene disorders. Furthermore, with the 1000$ genome supposedly just around the corner, and whole genome sequencing gradually being integrated into research protocols as well as in the clinical context, there is a strong push for the uptake of additional genomic testing. Indeed, the advent of public health genomics, wherein genomics would be integrated in all aspects of health care and public health, should be taken seriously. Although laudable, these advances also bring with them a slew of ethical and social issues that challenge the normative frameworks used in clinical genetics until now. With this in mind, we highlight herein 5 principles that are used as a primer to discuss the ethical introduction of genome-based information and genome-based technologies into public health.

  20. Microbial Enzymes: Tools for Biotechnological Processes

    Directory of Open Access Journals (Sweden)

    Jose L. Adrio

    2014-01-01

    Full Text Available Microbial enzymes are of great importance in the development of industrial bioprocesses. Current applications are focused on many different markets including pulp and paper, leather, detergents and textiles, pharmaceuticals, chemical, food and beverages, biofuels, animal feed and personal care, among others. Today there is a need for new, improved or/and more versatile enzymes in order to develop more novel, sustainable and economically competitive production processes. Microbial diversity and modern molecular techniques, such as metagenomics and genomics, are being used to discover new microbial enzymes whose catalytic properties can be improved/modified by different strategies based on rational, semi-rational and random directed evolution. Most industrial enzymes are recombinant forms produced in bacteria and fungi.

  1. Microbial enzymes: tools for biotechnological processes.

    Science.gov (United States)

    Adrio, Jose L; Demain, Arnold L

    2014-01-16

    Microbial enzymes are of great importance in the development of industrial bioprocesses. Current applications are focused on many different markets including pulp and paper, leather, detergents and textiles, pharmaceuticals, chemical, food and beverages, biofuels, animal feed and personal care, among others. Today there is a need for new, improved or/and more versatile enzymes in order to develop more novel, sustainable and economically competitive production processes. Microbial diversity and modern molecular techniques, such as metagenomics and genomics, are being used to discover new microbial enzymes whose catalytic properties can be improved/modified by different strategies based on rational, semi-rational and random directed evolution. Most industrial enzymes are recombinant forms produced in bacteria and fungi.

  2. A comprehensive 1000 Genomes-based genome-wide association meta-analysis of coronary artery disease

    Science.gov (United States)

    Kyriakou, Theodosios; Nelson, Christopher P; Hopewell, Jemma C; Webb, Thomas R; Zeng, Lingyao; Dehghan, Abbas; Alver, Maris; Armasu, Sebastian M; Auro, Kirsi; Bjonnes, Andrew; Chasman, Daniel I; Chen, Shufeng; Ford, Ian; Franceschini, Nora; Gieger, Christian; Grace, Christopher; Gustafsson, Stefan; Huang, Jie; Hwang, Shih-Jen; Kim, Yun Kyoung; Kleber, Marcus E; Lau, King Wai; Lu, Xiangfeng; Lu, Yingchang; Lyytikäinen, Leo-Pekka; Mihailov, Evelin; Morrison, Alanna C; Pervjakova, Natalia; Qu, Liming; Rose, Lynda M; Salfati, Elias; Saxena, Richa; Scholz, Markus; Smith, Albert V; Tikkanen, Emmi; Uitterlinden, Andre; Yang, Xueli; Zhang, Weihua; Zhao, Wei; de Andrade, Mariza; de Vries, Paul S; van Zuydam, Natalie R; Anand, Sonia S; Bertram, Lars; Beutner, Frank; Dedoussis, George; Frossard, Philippe; Gauguier, Dominique; Goodall, Alison H; Gottesman, Omri; Haber, Marc; Han, Bok-Ghee; Huang, Jianfeng; Jalilzadeh, Shapour; Kessler, Thorsten; König, Inke R; Lannfelt, Lars; Lieb, Wolfgang; Lind, Lars; Lindgren, Cecilia M; Lokki, Marja-Liisa; Magnusson, Patrik K; Mallick, Nadeem H; Mehra, Narinder; Meitinger, Thomas; Memon, Fazal-ur-Rehman; Morris, Andrew P; Nieminen, Markku S; Pedersen, Nancy L; Peters, Annette; Rallidis, Loukianos S; Rasheed, Asif; Samuel, Maria; Shah, Svati H; Sinisalo, Juha; Stirrups, Kathleen E; Trompet, Stella; Wang, Laiyuan; Zaman, Khan S; Ardissino, Diego; Boerwinkle, Eric; Borecki, Ingrid B; Bottinger, Erwin P; Buring, Julie E; Chambers, John C; Collins, Rory; Cupples, L Adrienne; Danesh, John; Demuth, Ilja; Elosua, Roberto; Epstein, Stephen E; Esko, Tõnu; Feitosa, Mary F; Franco, Oscar H; Franzosi, Maria Grazia; Granger, Christopher B; Gu, Dongfeng; Gudnason, Vilmundur; Hall, Alistair S; Hamsten, Anders; Harris, Tamara B; Hazen, Stanley L; Hengstenberg, Christian; Hofman, Albert; Ingelsson, Erik; Iribarren, Carlos; Jukema, J Wouter; Karhunen, Pekka J; Kim, Bong-Jo; Kooner, Jaspal S; Kullo, Iftikhar J; Lehtimäki, Terho; Loos, Ruth J F; Melander, Olle; Metspalu, Andres; März, Winfried; Palmer, Colin N; Perola, Markus; Quertermous, Thomas; Rader, Daniel J; Ridker, Paul M; Ripatti, Samuli; Roberts, Robert; Salomaa, Veikko; Sanghera, Dharambir K; Schwartz, Stephen M; Seedorf, Udo; Stewart, Alexandre F; Stott, David J; Thiery, Joachim; Zalloua, Pierre A; O’Donnell, Christopher J; Reilly, Muredach P; Assimes, Themistocles L; Thompson, John R; Erdmann, Jeanette; Clarke, Robert; Watkins, Hugh; Kathiresan, Sekar; McPherson, Ruth; Deloukas, Panos; Schunkert, Heribert; Samani, Nilesh J; Farrall, Martin

    2015-01-01

    Existing knowledge of genetic variants affecting risk of coronary artery disease (CAD) is largely based on genome-wide association studies (GWAS) analysis of common SNPs. Leveraging phased haplotypes from the 1000 Genomes Project, we report a GWAS meta-analysis of 185 thousand CAD cases and controls, interrogating 6.7 million common (MAF>0.05) as well as 2.7 million low frequency (0.005

  3. Microarray-based analysis of survival of soil microbial community during ozonation

    Energy Technology Data Exchange (ETDEWEB)

    Wang, Jian; Van Nostrand, Joy D.; He, Zhili; Wu, Liyou; Deng, Ye; Zhang, Xu; Zhou, Jizhong; Li, Guanghe

    2010-05-17

    A 15 h ozonation was performed on bioremediated soil to remove recalcitrant residual oil. To monitor the survival of indigenous microorganisms in the soil during in-situ chemical oxidation(ISCO) culturing and a functional genearray, GeoChip, was used to examine the functional genes and structure of the microbial community during ozonation (0h, 2h, 4h, 6h, 10hand15h). Breakthrough ozonation decreased the population of cultivable heterotrophic bacteria by about 3 orders of magnitude. The total functional gene abundance and diversity decreased during ozonation, as the number of functional genes was reduced by 48percent after 15 h. However, functional genes were evenly distributed during ozonation as judged by the Shannon-Weaver Evenness index. A sharp decrease in gene number was observed in the first 6 h of ozonation followed by a slower decrease in the next 9 h, which was consistent with microbial populations measured by a culture based method. Functional genes involved in carbon, nitrogen, phosphors and sulfur cycling, metal resistance and organic remediation were detected in all samples. Though the pattern of gene categories detected was similar for all time points, hierarchica lcluster of all functional genes and major functional categories all showed a time-serial pattern. Bacteria, archaea and fungi decreased by 96.1percent, 95.1percent and 91.3percent, respectively, after 15 h ozonation. Delta proteobacteria, which were reduced by 94.3percent, showed the highest resistance to ozonation while Actinobacteria, reduced by 96.3percent, showed the lowest resistance. Microorganisms similar to Rhodothermus, Obesumbacterium, Staphylothermus, Gluconobacter, and Enterococcus were dominant at all time points. Functional genes related to petroleum degradation decreased 1~;;2 orders of magnitude. Most of the key functional genes were still detected after ozonation, allowing a rapid recovery of the microbial community after ozonation. While ozone had a large impact on the

  4. Microarray-based analysis of survival of soil microbial community during ozonation

    Energy Technology Data Exchange (ETDEWEB)

    Wang, Jian; Van Nostrand, Joy D.; He, Zhili; Wu, Liyou; Deng, Ye; Zhang, Xu; Zhou, Jizhong; Li, Guanghe

    2010-05-17

    A 15 h ozonation was performed on bioremediated soil to remove recalcitrant residual oil. To monitor the survival of indigenous microorganisms in the soil during in-situ chemical oxidation(ISCO) culturing and a functional genearray, GeoChip, was used to examine the functional genes and structure of the microbial community during ozonation (0h, 2h, 4h, 6h, 10hand15h). Breakthrough ozonation decreased the population of cultivable heterotrophic bacteria by about 3 orders of magnitude. The total functional gene abundance and diversity decreased during ozonation, as the number of functional genes was reduced by 48percent after 15 h. However, functional genes were evenly distributed during ozonation as judged by the Shannon-Weaver Evenness index. A sharp decrease in gene number was observed in the first 6 h of ozonation followed by a slower decrease in the next 9 h, which was consistent with microbial populations measured by a culture based method. Functional genes involved in carbon, nitrogen, phosphors and sulfur cycling, metal resistance and organic remediation were detected in all samples. Though the pattern of gene categories detected was similar for all time points, hierarchica lcluster of all functional genes and major functional categories all showed a time-serial pattern. Bacteria, archaea and fungi decreased by 96.1percent, 95.1percent and 91.3percent, respectively, after 15 h ozonation. Delta proteobacteria, which were reduced by 94.3percent, showed the highest resistance to ozonation while Actinobacteria, reduced by 96.3percent, showed the lowest resistance. Microorganisms similar to Rhodothermus, Obesumbacterium, Staphylothermus, Gluconobacter, and Enterococcus were dominant at all time points. Functional genes related to petroleum degradation decreased 1~;;2 orders of magnitude. Most of the key functional genes were still detected after ozonation, allowing a rapid recovery of the microbial community after ozonation. While ozone had a large impact on the

  5. Engineered microbial biosensors based on bacterial two-component systems as synthetic biotechnology platforms in bioremediation and biorefinery.

    Science.gov (United States)

    Ravikumar, Sambandam; Baylon, Mary Grace; Park, Si Jae; Choi, Jong-Il

    2017-04-14

    Two-component regulatory systems (TCRSs) mediate cellular response by coupling sensing and regulatory mechanisms. TCRSs are comprised of a histidine kinase (HK), which serves as a sensor, and a response regulator, which regulates expression of the effector gene after being phosphorylated by HK. Using these attributes, bacterial TCRSs can be engineered to design microbial systems for different applications. This review focuses on the current advances in TCRS-based biosensors and on the design of microbial systems for bioremediation and their potential application in biorefinery.

  6. Bio-grout based on microbially induced sand solidification by means of asparaginase activity.

    Science.gov (United States)

    Li, Mengmeng; Fu, Qing-Long; Zhang, Qiuzhuo; Achal, Varenyam; Kawasaki, Satoru

    2015-11-03

    Bio-grout, a new ground improvement method, has been recently developed to improve the mechanical properties, decrease the permeability of porous materials, reinforce or repair cementitious materials and modify the properties of soil or sand. Bio-grout production depends on microbially induced calcite precipitation (MICP), which is driven mainly by an enzyme, urease. However, urease-based MICP process produces excessive ammonia, in addition to secondary pollution generated by urea that is used as substrate in it. In the present study, we reported asparaginase-based MICP process for sand bio-grout development using Bacillus megaterium, and results were also compared with urease-based bio-grouts. The asparaginase activity led to significantly less ammonia production compared to urease without compromising with desired properties of a novel grout. The UCS of bio-grout was obtained at 980 kPa, while the permeability was decreased substantially. The mineralogical composition of precipitated substance was identified as calcite using XRD and the crystal morphology was observed under SEM. The mass percentage of calcite in bio-grout was calculated by thermogravimetric analysis and XCT verified calcite precipitation in it. The results confirmed that biocalcification by means of bacterial asparaginase is a potential solution for geotechnical problems. The asparaginase-based MICP process could be of wider acceptance in future.

  7. Bio-grout based on microbially induced sand solidification by means of asparaginase activity

    Science.gov (United States)

    Li, Mengmeng; Fu, Qing-Long; Zhang, Qiuzhuo; Achal, Varenyam; Kawasaki, Satoru

    2015-11-01

    Bio-grout, a new ground improvement method, has been recently developed to improve the mechanical properties, decrease the permeability of porous materials, reinforce or repair cementitious materials and modify the properties of soil or sand. Bio-grout production depends on microbially induced calcite precipitation (MICP), which is driven mainly by an enzyme, urease. However, urease-based MICP process produces excessive ammonia, in addition to secondary pollution generated by urea that is used as substrate in it. In the present study, we reported asparaginase-based MICP process for sand bio-grout development using Bacillus megaterium, and results were also compared with urease-based bio-grouts. The asparaginase activity led to significantly less ammonia production compared to urease without compromising with desired properties of a novel grout. The UCS of bio-grout was obtained at 980 kPa, while the permeability was decreased substantially. The mineralogical composition of precipitated substance was identified as calcite using XRD and the crystal morphology was observed under SEM. The mass percentage of calcite in bio-grout was calculated by thermogravimetric analysis and XCT verified calcite precipitation in it. The results confirmed that biocalcification by means of bacterial asparaginase is a potential solution for geotechnical problems. The asparaginase-based MICP process could be of wider acceptance in future.

  8. Microbial transformations 59: first kilogram scale asymmetric microbial Baeyer-Villiger oxidation with optimized productivity using a resin-based in situ SFPR strategy.

    Science.gov (United States)

    Hilker, Iris; Wohlgemuth, Roland; Alphand, Véronique; Furstoss, Roland

    2005-12-20

    This study is demonstrating the scale up of asymmetric microbial Baeyer-Villiger oxidation of racemic bicyclo[3.2.0]hept-2-en-6-one (1) to the kilogram scale using a 50 L bioreactor. The process has been optimized with respect to bottlenecks identified in downscaled experiments. A high productivity was obtained combining a resin-based in situ substrate feeding and product removal methodology (in situ SFPR), a glycerol feed control, and an improved oxygenation device (using a sintered-metal sparger). As expected both regioisomeric lactones [(-)-(1S,5R)-2 and (-)-(1R,5S)-3] were obtained in nearly enantiopure form (ee > 98%) and good yield. This represents the first example of such an asymmetric Baeyer-Villiger biooxidation reaction ever operated at that scale. This novel resin-based in situ SFPR technology therefore clearly opens the way to further (industrial) upscaling of this highly valuable (asymmetric) reaction.

  9. The Photochemical Oxidation of Siderite That Drove Hydrogen Based Microbial Redox Reactions in The Archean Biosphere

    Science.gov (United States)

    Kim, J. D.; Yee, N.; Falkowski, P. G.

    2012-12-01

    Hydrogen is the most abundant element in the universe and molecular hydrogen (H2) is a rich source of electron in a mildly reducing environment for microbial redox reactions, such as anoxygenic photosynthesis and methanogenesis. Subaerial volcanoes, ocean crust serpentinization and mid-ocean ridge volcanoes have been believed to be the major source of the hydrogen flux to the atmosphere. Although ferrous ion (Fe2+) photooxidation has been proposed as an alternative mechanism by which hydrogen gas was produced, ferruginous water in contact with a CO2-bearing atmosphere is supersaturated with respect to FeCO3 (siderite), thus the precipitation of siderite would have been thermodynamically favored in the Archean environment. Siderite is the critical mineral component of the oldest fossilized microbial mat. It has also been inferred as a component of chemical sedimentary protolith in the >3750 Ma Nuvvuagittuq supracrustal belt, Canada and the presence of siderite in the protolith suggests the occurrence of siderite extends to Hadean time. Analyses of photooxidation of siderite suggest a significant flux of hydrogen in the early atmosphere. Our estimate of the hydrogen production rate under Archean solar flux is approximately 50 times greater than the estimated hydrogen production rate by the volcanic activity based on a previous report (Tian et al. Science 2005). Our analyses on siderite photooxidation also suggest a mechanism by which banded iron formation (BIF) was formed. The photooxidation transforms siderite to magnetite/maghemite (spinnel iron oxide), while oxygenic oxidation of siderite leads to goethite, and subsequently to hematite (Fe3+2O3) upon dehydration. We will discuss the photochemical reaction, which was once one of the most ubiquitous photochemical reactions before the rise of oxygen in the atmosphere. Photooxidation of siderite over time by UV light From left to right: UV oxidized siderite, pristine siderite, oxidized siderite by oxygen

  10. A contig-based strategy for the genome-wide discovery of microRNAs without complete genome resources.

    Directory of Open Access Journals (Sweden)

    Jun-Zhi Wen

    Full Text Available MicroRNAs (miRNAs are important regulators of many cellular processes and exist in a wide range of eukaryotes. High-throughput sequencing is a mainstream method of miRNA identification through which it is possible to obtain the complete small RNA profile of an organism. Currently, most approaches to miRNA identification rely on a reference genome for the prediction of hairpin structures. However, many species of economic and phylogenetic importance are non-model organisms without complete genome sequences, and this limits miRNA discovery. Here, to overcome this limitation, we have developed a contig-based miRNA identification strategy. We applied this method to a triploid species of edible banana (GCTCV-119, Musa spp. AAA group and identified 180 pre-miRNAs and 314 mature miRNAs, which is three times more than those were predicted by the available dataset-based methods (represented by EST+GSS. Based on the recently published miRNA data set of Musa acuminate, the recall rate and precision of our strategy are estimated to be 70.6% and 92.2%, respectively, significantly better than those of EST+GSS-based strategy (10.2% and 50.0%, respectively. Our novel, efficient and cost-effective strategy facilitates the study of the functional and evolutionary role of miRNAs, as well as miRNA-based molecular breeding, in non-model species of economic or evolutionary interest.

  11. Similarity-based disease risk assessment for personal genomes: proof of concept.

    Science.gov (United States)

    Woo, Jung Hoon; Lai, Albert M; Chung, Wendy K; Weng, Chunhua

    2011-01-01

    The increasing availability of personal genome data has led to escalating needs by consumers to understand the implications of their gene sequences. At present, poorly integrated genetic knowledge has not met these needs. This proof-of-concept study proposes a similarity-based approach to assess the disease risk predisposition for personal genomes. We hypothesize that the semantic similarity between a personal genome and a disease can indicate the disease risks in the person. We developed a knowledge network that integrates existing knowledge of genes, diseases, and symptoms from six sources using the Semantic Web standard, Resource Description Framework (RDF). We then used latent relationships between genes and diseases derived from our knowledge network to measure the semantic similarity between a personal genome and a genetic disease. For demonstration, we showed the feasibility of assessing the disease risks in one personal genome and discussed related methodology issues.

  12. Multilevel Genomics-Based Taxonomy of Renal Cell Carcinoma

    Directory of Open Access Journals (Sweden)

    Fengju Chen

    2016-03-01

    Full Text Available On the basis of multidimensional and comprehensive molecular characterization (including DNA methalylation and copy number, RNA, and protein expression, we classified 894 renal cell carcinomas (RCCs of various histologic types into nine major genomic subtypes. Site of origin within the nephron was one major determinant in the classification, reflecting differences among clear cell, chromophobe, and papillary RCC. Widespread molecular changes associated with TFE3 gene fusion or chromatin modifier genes were present within a specific subtype and spanned multiple subtypes. Differences in patient survival and in alteration of specific pathways (including hypoxia, metabolism, MAP kinase, NRF2-ARE, Hippo, immune checkpoint, and PI3K/AKT/mTOR could further distinguish the subtypes. Immune checkpoint markers and molecular signatures of T cell infiltrates were both highest in the subtype associated with aggressive clear cell RCC. Differences between the genomic subtypes suggest that therapeutic strategies could be tailored to each RCC disease subset.

  13. Biomarker-based classification of bacterial and fungal whole-blood infections in a genome-wide expression study

    Directory of Open Access Journals (Sweden)

    Andreas eDix

    2015-03-01

    Full Text Available Sepsis is a clinical syndrome that can be caused by bacteria or fungi. Early knowledge on the nature of the causative agent is a prerequisite for targeted anti-microbial therapy. Besides currently used detection methods like blood culture and PCR-based assays, the analysis of the transcriptional response of the host to infecting organisms holds great promise. In this study, we aim to examine the transcriptional footprint of infections caused by the bacterial pathogens Staphylococcus aureus and Escherichia coli and the fungal pathogens Candida albicans and Aspergillus fumigatus in a human whole-blood model. Moreover, we use the expression information to build a random forest classifier to classify if a sample contains a bacterial, fungal, or mock-infection. After normalizing the transcription intensities using stably expressed reference genes, we filtered the gene set for biomarkers of bacterial or fungal blood infections. This selection is based on differential expression and an additional gene relevance measure. In this way, we identified 38 biomarker genes, including IL6, SOCS3, and IRG1 which were already associated to sepsis by other studies. Using these genes, we trained the classifier and assessed its performance. It yielded a 96% accuracy (sensitivities >93%, specificities >97% for a 10-fold stratified cross-validation and a 92% accuracy (sensitivities and specificities >83% for an additional test dataset comprising Cryptococcus neoformans infections. Furthermore, the classifier is robust to Gaussian noise, indicating correct class predictions on datasets of new species. In conclusion, this genome-wide approach demonstrates an effective feature selection process in combination with the construction of a well-performing classification model. Further analyses of genes with pathogen-dependent expression patterns can provide insights into the systemic host responses, which may lead to new anti-microbial therapeutic advances.

  14. Genome-based comparative analyses of Antarctic and temperate species of Paenibacillus.

    Directory of Open Access Journals (Sweden)

    Melissa Dsouza

    Full Text Available Antarctic soils represent a unique environment characterised by extremes of temperature, salinity, elevated UV radiation, low nutrient and low water content. Despite the harshness of this environment, members of 15 bacterial phyla have been identified in soils of the Ross Sea Region (RSR. However, the survival mechanisms and ecological roles of these phyla are largely unknown. The aim of this study was to investigate whether strains of Paenibacillus darwinianus owe their resilience to substantial genomic changes. For this, genome-based comparative analyses were performed on three P. darwinianus strains, isolated from gamma-irradiated RSR soils, together with nine temperate, soil-dwelling Paenibacillus spp. The genome of each strain was sequenced to over 1,000-fold coverage, then assembled into contigs totalling approximately 3 Mbp per genome. Based on the occurrence of essential, single-copy genes, genome completeness was estimated at approximately 88%. Genome analysis revealed between 3,043-3,091 protein-coding sequences (CDSs, primarily associated with two-component systems, sigma factors, transporters, sporulation and genes induced by cold-shock, oxidative and osmotic stresses. These comparative analyses provide an insight into the metabolic potential of P. darwinianus, revealing potential adaptive mechanisms for survival in Antarctic soils. However, a large proportion of these mechanisms were also identified in temperate Paenibacillus spp., suggesting that these mechanisms are beneficial for growth and survival in a range of soil environments. These analyses have also revealed that the P. darwinianus genomes contain significantly fewer CDSs and have a lower paralogous content. Notwithstanding the incompleteness of the assemblies, the large differences in genome sizes, determined by the number of genes in paralogous clusters and the CDS content, are indicative of genome content scaling. Finally, these sequences are a resource for further

  15. Evolutionary insights from suffix array-based genome sequence analysis

    Indian Academy of Sciences (India)

    Anindya Poddar; Nagasuma Chandra; Madhavi Ganapathiraju; K Sekar; Judith Klein-Seetharaman; Raj Reddy; N Balakrishnan

    2007-08-01

    Gene and protein sequence analyses, central components of studies in modern biology are easily amenable to string matching and pattern recognition algorithms. The growing need of analysing whole genome sequences more efficiently and thoroughly, has led to the emergence of new computational methods. Suffix trees and suffix arrays are data structures, well known in many other areas and are highly suited for sequence analysis too. Here we report an improvement to the design of construction of suffix arrays. Enhancement in versatility and scalability, enabled by this approach, is demonstrated through the use of real-life examples. The scalability of the algorithm to whole genomes renders it suitable to address many biologically interesting problems. One example is the evolutionary insight gained by analysing unigrams, bi-grams and higher n-grams, indicating that the genetic code has a direct influence on the overall composition of the genome. Further, different proteomes have been analysed for the coverage of the possible peptide space, which indicate that as much as a quarter of the total space at the tetra-peptide level is left un-sampled in prokaryotic organisms, although almost all tri-peptides can be seen in one protein or another in a proteome. Besides, distinct patterns begin to emerge for the counts of particular tetra and higher peptides, indicative of a ‘meaning’ for tetra and higher n-grams. The toolkit has also been used to demonstrate the usefulness of identifying repeats in whole proteomes efficiently. As an example, 16 members of one COG, coded by the genome of Mycobacterium tuberculosis H37Rv have been found to contain a repeating sequence of 300 amino acids.

  16. Semi-specific Microbacterium phyllosphaerae-based microbial sensor for biochemical oxygen demand measurements in dairy wastewater.

    Science.gov (United States)

    Kibena, Elo; Raud, Merlin; Jõgi, Eerik; Kikas, Timo

    2013-04-01

    Although the long incubation time of biochemical oxygen demand (BOD7) measurements has been addressed by the use of microbial biosensors, the resulting sensor-BOD values gained from the measurements with specific industrial wastewaters still underestimates the BOD value of such samples. This research aims to provide fast and more accurate BOD measurements in the dairy wastewater samples. Unlike municipal wastewater, wastewater from the dairy industry contains many substrates that are not easily accessible to a majority of microorganisms. Therefore, a bacterial culture, Microbacterium phyllosphaerae, isolated from dairy wastewater was used to construct a semi-specific microbial biosensor. A universal microbial biosensor based on Pseudomonas fluorescens, which has a wide substrate spectrum but is nonspecific to dairy wastewater, was used as a comparison. BOD biosensors were calibrated with OECD synthetic wastewater, and experiments with different synthetic and actual wastewater samples were carried out. Results show that the semi-specific M. phyllosphaerae-based microbial biosensor is more sensitive towards wastewaters that contain milk derivates and butter whey than the P. fluorescens-based biosensor. Although the M. phyllosphaerae biosensor underestimates the BOD7 value of actual dairy wastewaters by 25-32%, this bacterial culture is more suitable for BOD monitoring in dairy wastewater than P. fluorescens, which underestimated the same samples by 46-61%.

  17. PBrowse: a web-based platform for real-time collaborative exploration of genomic data.

    Science.gov (United States)

    Szot, Peter S; Yang, Andrian; Wang, Xin; Parsania, Chirag; Röhm, Uwe; Wong, Koon Ho; Ho, Joshua W K

    2017-05-19

    Genome browsers are widely used for individually exploring various types of genomic data. A handful of genome browsers offer limited tools for collaboration among multiple users. Here, we describe PBrowse, an integrated real-time collaborative genome browser that enables multiple users to simultaneously view and access genomic data, thereby harnessing the wisdom of the crowd. PBrowse is based on the Dalliance genome browser and has a re-designed user and data management system with novel collaborative functionalities, including real-time collaborative view, track comment and an integrated group chat feature. Through the Distributed Annotation Server protocol, PBrowse can easily access a wide range of publicly available genomic data, such as the ENCODE data sets. We argue that PBrowse represents a paradigm shift from using a genome browser as a static data visualization tool to a platform that enables real-time human-human interaction and knowledge exchange in a collaborative setting. PBrowse is available at http://pbrowse.victorchang.edu.au, and its source code is available via an open source BSD 3 license at http://github.com/VCCRI/PBrowse. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Analysis of the relationship between genomic GC Content and patterns of base usage, codon usage and amino acid usage in prokaryotes: similar GC content adopts similar compositional frequencies regardless of the phylogenetic lineages.

    Directory of Open Access Journals (Sweden)

    Hui-Qi Zhou

    Full Text Available The GC contents of 2670 prokaryotic genomes that belong to diverse phylogenetic lineages were analyzed in this paper. These genomes had GC contents that ranged from 13.5% to 74.9%. We analyzed the distance of base frequencies at the three codon positions, codon frequencies, and amino acid compositions across genomes with respect to the differences in the GC content of these prokaryotic species. We found that although the phylogenetic lineages were remote among some species, a similar genomic GC content forced them to adopt similar base usage patterns at the three codon positions, codon usage patterns, and amino acid usage patterns. Our work demonstrates that in prokaryotic genomes: a base usage, codon usage, and amino acid usage change with GC content with a linear correlation; b the distance of each usage has a linear correlation with the GC content difference; and c GC content is more essential than phylogenetic lineage in determining base usage, codon usage, and amino acid usage. This work is exceptional in that we adopted intuitively graphic methods for all analyses, and we used these analyses to examine as many as 2670 prokaryotes. We hope that this work is helpful for understanding common features in the organization of microbial genomes.

  19. Genome-wide prediction, display and refinement of binding sites with information theory-based models

    Directory of Open Access Journals (Sweden)

    Leeder J Steven

    2003-09-01

    Full Text Available Abstract Background We present Delila-genome, a software system for identification, visualization and analysis of protein binding sites in complete genome sequences. Binding sites are predicted by scanning genomic sequences with information theory-based (or user-defined weight matrices. Matrices are refined by adding experimentally-defined binding sites to published binding sites. Delila-Genome was used to examine the accuracy of individual information contents of binding sites detected with refined matrices as a measure of the strengths of the corresponding protein-nucleic acid interactions. The software can then be used to predict novel sites by rescanning the genome with the refined matrices. Results Parameters for genome scans are entered using a Java-based GUI interface and backend scripts in Perl. Multi-processor CPU load-sharing minimized the average response time for scans of different chromosomes. Scans of human genome assemblies required 4–6 hours for transcription factor binding sites and 10–19 hours for splice sites, respectively, on 24- and 3-node Mosix and Beowulf clusters. Individual binding sites are displayed either as high-resolution sequence walkers or in low-resolution custom tracks in the UCSC genome browser. For large datasets, we applied a data reduction strategy that limited displays of binding sites exceeding a threshold information content to specific chromosomal regions within or adjacent to genes. An HTML document is produced listing binding sites ranked by binding site strength or chromosomal location hyperlinked to the UCSC custom track, other annotation databases and binding site sequences. Post-genome scan tools parse binding site annotations of selected chromosome intervals and compare the results of genome scans using different weight matrices. Comparisons of multiple genome scans can display binding sites that are unique to each scan and identify sites with significantly altered binding strengths

  20. Event-based text mining for biology and functional genomics

    Science.gov (United States)

    Thompson, Paul; Nawaz, Raheel; McNaught, John; Kell, Douglas B.

    2015-01-01

    The assessment of genome function requires a mapping between genome-derived entities and biochemical reactions, and the biomedical literature represents a rich source of information about reactions between biological components. However, the increasingly rapid growth in the volume of literature provides both a challenge and an opportunity for researchers to isolate information about reactions of interest in a timely and efficient manner. In response, recent text mining research in the biology domain has been largely focused on the identification and extraction of ‘events’, i.e. categorised, structured representations of relationships between biochemical entities, from the literature. Functional genomics analyses necessarily encompass events as so defined. Automatic event extraction systems facilitate the development of sophisticated semantic search applications, allowing researchers to formulate structured queries over extracted events, so as to specify the exact types of reactions to be retrieved. This article provides an overview of recent research into event extraction. We cover annotated corpora on which systems are trained, systems that achieve state-of-the-art performance and details of the community shared tasks that have been instrumental in increasing the quality, coverage and scalability of recent systems. Finally, several concrete applications of event extraction are covered, together with emerging directions of research. PMID:24907365

  1. Event-based text mining for biology and functional genomics.

    Science.gov (United States)

    Ananiadou, Sophia; Thompson, Paul; Nawaz, Raheel; McNaught, John; Kell, Douglas B

    2015-05-01

    The assessment of genome function requires a mapping between genome-derived entities and biochemical reactions, and the biomedical literature represents a rich source of information about reactions between biological components. However, the increasingly rapid growth in the volume of literature provides both a challenge and an opportunity for researchers to isolate information about reactions of interest in a timely and efficient manner. In response, recent text mining research in the biology domain has been largely focused on the identification and extraction of 'events', i.e. categorised, structured representations of relationships between biochemical entities, from the literature. Functional genomics analyses necessarily encompass events as so defined. Automatic event extraction systems facilitate the development of sophisticated semantic search applications, allowing researchers to formulate structured queries over extracted events, so as to specify the exact types of reactions to be retrieved. This article provides an overview of recent research into event extraction. We cover annotated corpora on which systems are trained, systems that achieve state-of-the-art performance and details of the community shared tasks that have been instrumental in increasing the quality, coverage and scalability of recent systems. Finally, several concrete applications of event extraction are covered, together with emerging directions of research.

  2. Generation of a BAC-based physical map of the melon genome

    Directory of Open Access Journals (Sweden)

    Puigdomènech Pere

    2010-05-01

    Full Text Available Abstract Background Cucumis melo (melon belongs to the Cucurbitaceae family, whose economic importance among horticulture crops is second only to Solanaceae. Melon has high intra-specific genetic variation, morphologic diversity and a small genome size (450 Mb, which make this species suitable for a great variety of molecular and genetic studies that can lead to the development of tools for breeding varieties of the species. A number of genetic and genomic resources have already been developed, such as several genetic maps and BAC genomic libraries. These tools are essential for the construction of a physical map, a valuable resource for map-based cloning, comparative genomics and assembly of whole genome sequencing data. However, no physical map of any Cucurbitaceae has yet been developed. A project has recently been started to sequence the complete melon genome following a whole-genome shotgun strategy, which makes use of massive sequencing data. A BAC-based melon physical map will be a useful tool to help assemble and refine the draft genome data that is being produced. Results A melon physical map was constructed using a 5.7 × BAC library and a genetic map previously developed in our laboratories. High-information-content fingerprinting (HICF was carried out on 23,040 BAC clones, digesting with five restriction enzymes and SNaPshot labeling, followed by contig assembly with FPC software. The physical map has 1,355 contigs and 441 singletons, with an estimated physical length of 407 Mb (0.9 × coverage of the genome and the longest contig being 3.2 Mb. The anchoring of 845 BAC clones to 178 genetic markers (100 RFLPs, 76 SNPs and 2 SSRs also allowed the genetic positioning of 183 physical map contigs/singletons, representing 55 Mb (12% of the melon genome, to individual chromosomal loci. The melon FPC database is available for download at http://melonomics.upv.es/static/files/public/physical_map/. Conclusions Here we report the construction

  3. Applications of Genome-based Science in Shaping Citrus Industries of the World (JGI Seventh Annual User Meeting, 2012: Genomics of Energy and Environment)

    Energy Technology Data Exchange (ETDEWEB)

    Gmitter Jr, Fred [University of Florida

    2012-03-21

    Fred Gmitter from the University of Florida on "Applications of Genome-based Science in Shaping the Future of the World's Citrus Industries" at the 7th Annual Genomics of Energy & Environment Meeting on March 21, 2012 in Walnut Creek, California.

  4. Stabilising metal(loid)s in soil with iron and aluminium-based products: microbial, biochemical and plant growth impact.

    Science.gov (United States)

    Garau, Giovanni; Silvetti, Margherita; Castaldi, Paola; Mele, Elena; Deiana, Pietrino; Deiana, Salvatore

    2014-06-15

    Four iron and aluminium-based products, including red mud (RM), hematite (Fe2O3), an iron-rich water treatment residual (Fe-WTR) and amorphous Al hydroxide (Al-OH), were evaluated for their effectiveness at stabilising As and heavy metals (i.e. Cd, Cu, Pb, Zn) in a circumneutral contaminated soil [As (2105 mg kg(-1)), Cd (18 mg kg(-1)), Cu (264 mg kg(-1)), Pb (710 mg kg(-1)), Zn (522 mg kg(-1))]. Treatment impacts on soil microbial and biochemical features (i.e. microbial biomass-C, microbial counts, 16S rRNA PCR-TTGE of culturable bacteria, dehydrogenase, urease and β-glucosidase activity, Biolog derived parameters-AWCD and richness) as well as bean (Phaseolus vulgaris) and wheat (Triticum vulgare) growth were also assessed. After 6 months equilibration, all the amendments (application rate 3% w/w) but RM reduced labile As while only Al-OH reduced the concentration of water-soluble heavy metals. Despite the highest bioavailability of contaminants, most of the soil mi