WorldWideScience

Sample records for metagenomic metatranscriptomic metaproteomic

  1. Metagenomic Taxonomy-Guided Database-Searching Strategy for Improving Metaproteomic Analysis.

    Xiao, Jinqiu; Tanca, Alessandro; Jia, Ben; Yang, Runqing; Wang, Bo; Zhang, Yu; Li, Jing

    2018-04-06

    Metaproteomics provides a direct measure of the functional information by investigating all proteins expressed by a microbiota. However, due to the complexity and heterogeneity of microbial communities, it is very hard to construct a sequence database suitable for a metaproteomic study. Using a public database, researchers might not be able to identify proteins from poorly characterized microbial species, while a sequencing-based metagenomic database may not provide adequate coverage for all potentially expressed protein sequences. To address this challenge, we propose a metagenomic taxonomy-guided database-search strategy (MT), in which a merged database is employed, consisting of both taxonomy-guided reference protein sequences from public databases and proteins from metagenome assembly. By applying our MT strategy to a mock microbial mixture, about two times as many peptides were detected as with the metagenomic database only. According to the evaluation of the reliability of taxonomic attribution, the rate of misassignments was comparable to that obtained using an a priori matched database. We also evaluated the MT strategy with a human gut microbial sample, and we found 1.7 times as many peptides as using a standard metagenomic database. In conclusion, our MT strategy allows the construction of databases able to provide high sensitivity and precision in peptide identification in metaproteomic studies, enabling the detection of proteins from poorly characterized species within the microbiota.

  2. Bioinformatics tools for quantitative and functional metagenome and metatranscriptome data analysis in microbes.

    Niu, Sheng-Yong; Yang, Jinyu; McDermaid, Adam; Zhao, Jing; Kang, Yu; Ma, Qin

    2017-05-08

    Metagenomic and metatranscriptomic sequencing approaches are more frequently being used to link microbiota to important diseases and ecological changes. Many analyses have been used to compare the taxonomic and functional profiles of microbiota across habitats or individuals. While a large portion of metagenomic analyses focus on species-level profiling, some studies use strain-level metagenomic analyses to investigate the relationship between specific strains and certain circumstances. Metatranscriptomic analysis provides another important insight into activities of genes by examining gene expression levels of microbiota. Hence, combining metagenomic and metatranscriptomic analyses will help understand the activity or enrichment of a given gene set, such as drug-resistant genes among microbiome samples. Here, we summarize existing bioinformatics tools of metagenomic and metatranscriptomic data analysis, the purpose of which is to assist researchers in deciding the appropriate tools for their microbiome studies. Additionally, we propose an Integrated Meta-Function mapping pipeline to incorporate various reference databases and accelerate functional gene mapping procedures for both metagenomic and metatranscriptomic analyses. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  3. Metagenome and Metatranscriptome Analyses Using Protein Family Profiles.

    Cuncong Zhong

    2016-07-01

    Full Text Available Analyses of metagenome data (MG and metatranscriptome data (MT are often challenged by a paucity of complete reference genome sequences and the uneven/low sequencing depth of the constituent organisms in the microbial community, which respectively limit the power of reference-based alignment and de novo sequence assembly. These limitations make accurate protein family classification and abundance estimation challenging, which in turn hamper downstream analyses such as abundance profiling of metabolic pathways, identification of differentially encoded/expressed genes, and de novo reconstruction of complete gene and protein sequences from the protein family of interest. The profile hidden Markov model (HMM framework enables the construction of very useful probabilistic models for protein families that allow for accurate modeling of position specific matches, insertions, and deletions. We present a novel homology detection algorithm that integrates banded Viterbi algorithm for profile HMM parsing with an iterative simultaneous alignment and assembly computational framework. The algorithm searches a given profile HMM of a protein family against a database of fragmentary MG/MT sequencing data and simultaneously assembles complete or near-complete gene and protein sequences of the protein family. The resulting program, HMM-GRASPx, demonstrates superior performance in aligning and assembling homologs when benchmarked on both simulated marine MG and real human saliva MG datasets. On real supragingival plaque and stool MG datasets that were generated from healthy individuals, HMM-GRASPx accurately estimates the abundances of the antimicrobial resistance (AMR gene families and enables accurate characterization of the resistome profiles of these microbial communities. For real human oral microbiome MT datasets, using the HMM-GRASPx estimated transcript abundances significantly improves detection of differentially expressed (DE genes. Finally, HMM

  4. Metagenomic analysis of microbial communities and beyond

    Schreiber, Lars

    2014-01-01

    From small clone libraries to large next-generation sequencing datasets – the field of community genomics or metagenomics has developed tremendously within the last years. This chapter will summarize some of these developments and will also highlight pitfalls of current metagenomic analyses...... heterologous expression of metagenomic DNA fragments to discover novel metabolic functions. Lastly, the chapter will shortly discuss the meta-analysis of gene expression of microbial communities, more precisely metatranscriptomics and metaproteomics....

  5. Metagenomics, metatranscriptomics and single cell genomics reveal functional response of active Oceanospirillales to Gulf oil spill

    Mason, Olivia U.; Hazen, Terry C.; Borglin, Sharon; Chain, Patrick S. G.; Dubinsky, Eric A.; Fortney, Julian L.; Han, James; Holman, Hoi-Ying N.; Hultman, Jenni; Lamendella, Regina; Mackelprang, Rachel; Malfatti, Stephanie; Tom, Lauren M.; Tringe, Susannah G.; Woyke, Tanja; Zhou, Jizhong; Rubin, Edward M.; Jansson, Janet K.

    2012-06-12

    The Deepwater Horizon oil spill in the Gulf of Mexico resulted in a deep-sea hydrocarbon plume that caused a shift in the indigenous microbial community composition with unknown ecological consequences. Early in the spill history, a bloom of uncultured, thus uncharacterized, members of the Oceanospirillales was previously detected, but their role in oil disposition was unknown. Here our aim was to determine the functional role of the Oceanospirillales and other active members of the indigenous microbial community using deep sequencing of community DNA and RNA, as well as single-cell genomics. Shotgun metagenomic and metatranscriptomic sequencing revealed that genes for motility, chemotaxis and aliphatic hydrocarbon degradation were significantly enriched and expressed in the hydrocarbon plume samples compared with uncontaminated seawater collected from plume depth. In contrast, although genes coding for degradation of more recalcitrant compounds, such as benzene, toluene, ethylbenzene, total xylenes and polycyclic aromatic hydrocarbons, were identified in the metagenomes, they were expressed at low levels, or not at all based on analysis of the metatranscriptomes. Isolation and sequencing of two Oceanospirillales single cells revealed that both cells possessed genes coding for n-alkane and cycloalkane degradation. Specifically, the near-complete pathway for cyclohexane oxidation in the Oceanospirillales single cells was elucidated and supported by both metagenome and metatranscriptome data. The draft genome also included genes for chemotaxis, motility and nutrient acquisition strategies that were also identified in the metagenomes and metatranscriptomes. These data point towards a rapid response of members of the Oceanospirillales to aliphatic hydrocarbons in the deep sea.

  6. Integrated Metagenomics/Metaproteomics Reveals Human Host-Microbiota Signatures of Crohn's Disease

    Darzi, Youssef; Mongodin, Emmanuel F.; Pan, Chongle; Shah, Manesh; Halfvarson, Jonas; Tysk, Curt; Henrissat, Bernard; Raes, Jeroen; Verberkmoes, Nathan C.; Jansson, Janet K.

    2012-01-01

    Crohn's disease (CD) is an inflammatory bowel disease of complex etiology, although dysbiosis of the gut microbiota has been implicated in chronic immune-mediated inflammation associated with CD. Here we combined shotgun metagenomic and metaproteomic approaches to identify potential functional signatures of CD in stool samples from six twin pairs that were either healthy, or that had CD in the ileum (ICD) or colon (CCD). Integration of these omics approaches revealed several genes, proteins, and pathways that primarily differentiated ICD from healthy subjects, including depletion of many proteins in ICD. In addition, the ICD phenotype was associated with alterations in bacterial carbohydrate metabolism, bacterial-host interactions, as well as human host-secreted enzymes. This eco-systems biology approach underscores the link between the gut microbiota and functional alterations in the pathophysiology of Crohn's disease and aids in identification of novel diagnostic targets and disease specific biomarkers. PMID:23209564

  7. Integrated metagenomics/metaproteomics reveals human host-microbiota signatures of Crohn's disease.

    Alison R Erickson

    Full Text Available Crohn's disease (CD is an inflammatory bowel disease of complex etiology, although dysbiosis of the gut microbiota has been implicated in chronic immune-mediated inflammation associated with CD. Here we combined shotgun metagenomic and metaproteomic approaches to identify potential functional signatures of CD in stool samples from six twin pairs that were either healthy, or that had CD in the ileum (ICD or colon (CCD. Integration of these omics approaches revealed several genes, proteins, and pathways that primarily differentiated ICD from healthy subjects, including depletion of many proteins in ICD. In addition, the ICD phenotype was associated with alterations in bacterial carbohydrate metabolism, bacterial-host interactions, as well as human host-secreted enzymes. This eco-systems biology approach underscores the link between the gut microbiota and functional alterations in the pathophysiology of Crohn's disease and aids in identification of novel diagnostic targets and disease specific biomarkers.

  8. FMAP: Functional Mapping and Analysis Pipeline for metagenomics and metatranscriptomics studies.

    Kim, Jiwoong; Kim, Min Soo; Koh, Andrew Y; Xie, Yang; Zhan, Xiaowei

    2016-10-10

    Given the lack of a complete and comprehensive library of microbial reference genomes, determining the functional profile of diverse microbial communities is challenging. The available functional analysis pipelines lack several key features: (i) an integrated alignment tool, (ii) operon-level analysis, and (iii) the ability to process large datasets. Here we introduce our open-sourced, stand-alone functional analysis pipeline for analyzing whole metagenomic and metatranscriptomic sequencing data, FMAP (Functional Mapping and Analysis Pipeline). FMAP performs alignment, gene family abundance calculations, and statistical analysis (three levels of analyses are provided: differentially-abundant genes, operons and pathways). The resulting output can be easily visualized with heatmaps and functional pathway diagrams. FMAP functional predictions are consistent with currently available functional analysis pipelines. FMAP is a comprehensive tool for providing functional analysis of metagenomic/metatranscriptomic sequencing data. With the added features of integrated alignment, operon-level analysis, and the ability to process large datasets, FMAP will be a valuable addition to the currently available functional analysis toolbox. We believe that this software will be of great value to the wider biology and bioinformatics communities.

  9. Multi-omics approach to elucidate the gut microbiota activity: Metaproteomics and metagenomics connection.

    Guirro, Maria; Costa, Andrea; Gual-Grau, Andreu; Mayneris-Perxachs, Jordi; Torrell, Helena; Herrero, Pol; Canela, Núria; Arola, Lluís

    2018-02-10

    Over the last few years, the application of high-throughput meta-omics methods has provided great progress in improving the knowledge of the gut ecosystem and linking its biodiversity to host health conditions, offering complementary support to classical microbiology. Gut microbiota plays a crucial role in relevant diseases such as obesity or cardiovascular disease (CVD), and its regulation is closely influenced by several factors, such as dietary composition. In fact, polyphenol-rich diets are the most palatable treatment to prevent hypertension associated with CVD, although the polyphenol-microbiota interactions have not been completely elucidated. For this reason, the aim of this study was to evaluate microbiota effect in obese rats supplemented by hesperidin, after being fed with cafeteria or standard diet, using a multi meta-omics approaches combining strategy of metagenomics and metaproteomics analysis. We reported that cafeteria diet induces obesity, resulting in changes in the microbiota composition, which are related to functional alterations at proteome level. In addition, hesperidin supplementation alters microbiota diversity and also proteins involved in important metabolic pathways. Overall, going deeper into strategies to integrate omics sciences is necessary to understand the complex relationships between the host, gut microbiota, and diet. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  10. Microbial community structure and dynamics in thermophilic composting viewed through metagenomics and metatranscriptomics

    Antunes, Luciana Principal; Martins, Layla Farage; Pereira, Roberta Verciano; Thomas, Andrew Maltez; Barbosa, Deibs; Lemos, Leandro Nascimento; Silva, Gianluca Major Machado; Moura, Livia Maria Silva; Epamino, George Willian Condomitti; Digiampietri, Luciano Antonio; Lombardi, Karen Cristina; Ramos, Patricia Locosque; Quaggio, Ronaldo Bento; de Oliveira, Julio Cezar Franco; Pascon, Renata Castiglioni; Cruz, João Batista da; da Silva, Aline Maria; Setubal, João Carlos

    2016-01-01

    Composting is a promising source of new organisms and thermostable enzymes that may be helpful in environmental management and industrial processes. Here we present results of metagenomic- and metatranscriptomic-based analyses of a large composting operation in the São Paulo Zoo Park. This composting exhibits a sustained thermophilic profile (50 °C to 75 °C), which seems to preclude fungal activity. The main novelty of our study is the combination of time-series sampling with shotgun DNA, 16S rRNA gene amplicon, and metatranscriptome high-throughput sequencing, enabling an unprecedented detailed view of microbial community structure, dynamics, and function in this ecosystem. The time-series data showed that the turning procedure has a strong impact on the compost microbiota, restoring to a certain extent the population profile seen at the beginning of the process; and that lignocellulosic biomass deconstruction occurs synergistically and sequentially, with hemicellulose being degraded preferentially to cellulose and lignin. Moreover, our sequencing data allowed near-complete genome reconstruction of five bacterial species previously found in biomass-degrading environments and of a novel biodegrading bacterial species, likely a new genus in the order Bacillales. The data and analyses provided are a rich source for additional investigations of thermophilic composting microbiology. PMID:27941956

  11. Anaerobic digestion of the microalga Spirulina at extreme alkaline conditions: biogas production, metagenome, and metatranscriptome

    Nolla-Ardèvol, Vímac; Strous, Marc; Tegetmeyer, Halina E.

    2015-01-01

    A haloalkaline anaerobic microbial community obtained from soda lake sediments was used to inoculate anaerobic reactors for the production of methane rich biogas. The microalga Spirulina was successfully digested by the haloalkaline microbial consortium at alkaline conditions (pH 10, 2.0 M Na+). Continuous biogas production was observed and the obtained biogas was rich in methane, up to 96%. Alkaline medium acted as a CO2 scrubber which resulted in low amounts of CO2 and no traces of H2S in the produced biogas. A hydraulic retention time (HRT) of 15 days and 0.25 g Spirulina L−1 day−1 organic loading rate (OLR) were identified as the optimal operational parameters. Metagenomic and metatranscriptomic analysis showed that the hydrolysis of the supplied substrate was mainly carried out by Bacteroidetes of the “ML635J-40 aquatic group” while the hydrogenotrophic pathway was the main producer of methane in a methanogenic community dominated by Methanocalculus. PMID:26157422

  12. Ecology of Subglacial Lake Vostok (Antarctica, Based on Metagenomic/Metatranscriptomic Analyses of Accretion Ice

    Tom D'Elia

    2013-03-01

    Full Text Available Lake Vostok is the largest of the nearly 400 subglacial Antarctic lakes and has been continuously buried by glacial ice for 15 million years. Extreme cold, heat (from possible hydrothermal activity, pressure (from the overriding glacier and dissolved oxygen (delivered by melting meteoric ice, in addition to limited nutrients and complete darkness, combine to produce one of the most extreme environments on Earth. Metagenomic/metatranscriptomic analyses of ice that accreted over a shallow embayment and over the southern main lake basin indicate the presence of thousands of species of organisms (94% Bacteria, 6% Eukarya, and two Archaea. The predominant bacterial sequences were closest to those from species of Firmicutes, Proteobacteria and Actinobacteria, while the predominant eukaryotic sequences were most similar to those from species of ascomycetous and basidiomycetous Fungi. Based on the sequence data, the lake appears to contain a mixture of autotrophs and heterotrophs capable of performing nitrogen fixation, nitrogen cycling, carbon fixation and nutrient recycling. Sequences closest to those of psychrophiles and thermophiles indicate a cold lake with possible hydrothermal activity. Sequences most similar to those from marine and aquatic species suggest the presence of marine and freshwater regions.

  13. Metatranscriptomic and functional metagenomic analysis of methylphosphonate utilization by marine bacteria

    Asuncion eMartinez

    2013-11-01

    Full Text Available Aerobic degradation of methylphosphonate (MPn by marine bacterioplankton has been hypothesized to contribute significantly to the ocean’s methane supersaturation, yet little is known about MPn utilization by marine microbes. To identify the microbial taxa and metabolic functions associated with MPn-driven methane production we performed parallel metagenomic, metatranscriptomic, and functional screening of microcosm perturbation experiments using surface water collected in North Pacific Subtropical Gyre. In nutrient amended microcosms containing MPn, a substrate-driven microbial succession occurred. Initially, the addition of glucose and nitrate resulted in a bloom of Vibrionales and a transcriptional profile dominated by glucose-specific PTS transport and polyhydroxyalkanoate biosynthesis. Transcripts associated with phosphorus (P acquisition were also overrepresented and suggested that the addition of glucose and nitrate had driven the community to P depletion. At this point, a second community shift occurred characterized by the increase in C-P lyase containing microbes of the Vibrionales and Rhodobacterales orders. Transcripts associated with C-P lyase components were among the most highly expressed at the community level, and only C-P lyase clusters were recovered in a functional screen for MPn utilization, consistent with this pathway being responsible for the majority, if not all the methane accumulation we observed. Our results identify specific bacterioplankton taxa that can utilize MPn aerobically under conditions of P limitation using the C-P lyase pathway, and thereby elicit a significant increase in the dissolved methane concentration.

  14. Comparative metagenomic and metatranscriptomic analyses of microbial communities in acid mine drainage.

    Chen, Lin-xing; Hu, Min; Huang, Li-nan; Hua, Zheng-shuang; Kuang, Jia-liang; Li, Sheng-jin; Shu, Wen-sheng

    2015-07-01

    The microbial communities in acid mine drainage have been extensively studied to reveal their roles in acid generation and adaption to this environment. Lacking, however, are integrated community- and organism-wide comparative gene transcriptional analyses that could reveal the response and adaptation mechanisms of these extraordinary microorganisms to different environmental conditions. In this study, comparative metagenomics and metatranscriptomics were performed on microbial assemblages collected from four geochemically distinct acid mine drainage (AMD) sites. Taxonomic analysis uncovered unexpectedly high microbial biodiversity of these extremely acidophilic communities, and the abundant taxa of Acidithiobacillus, Leptospirillum and Acidiphilium exhibited high transcriptional activities. Community-wide comparative analyses clearly showed that the AMD microorganisms adapted to the different environmental conditions via regulating the expression of genes involved in multiple in situ functional activities, including low-pH adaptation, carbon, nitrogen and phosphate assimilation, energy generation, environmental stress resistance, and other functions. Organism-wide comparative analyses of the active taxa revealed environment-dependent gene transcriptional profiles, especially the distinct strategies used by Acidithiobacillus ferrivorans and Leptospirillum ferrodiazotrophum in nutrients assimilation and energy generation for survival under different conditions. Overall, these findings demonstrate that the gene transcriptional profiles of AMD microorganisms are closely related to the site physiochemical characteristics, providing clues into the microbial response and adaptation mechanisms in the oligotrophic, extremely acidic environments.

  15. Ecological roles of dominant and rare prokaryotes in acid mine drainage revealed by metagenomics and metatranscriptomics.

    Hua, Zheng-Shuang; Han, Yu-Jiao; Chen, Lin-Xing; Liu, Jun; Hu, Min; Li, Sheng-Jin; Kuang, Jia-Liang; Chain, Patrick S G; Huang, Li-Nan; Shu, Wen-Sheng

    2015-06-01

    High-throughput sequencing is expanding our knowledge of microbial diversity in the environment. Still, understanding the metabolic potentials and ecological roles of rare and uncultured microbes in natural communities remains a major challenge. To this end, we applied a 'divide and conquer' strategy that partitioned a massive metagenomic data set (>100 Gbp) into subsets based on K-mer frequency in sequence assembly to a low-diversity acid mine drainage (AMD) microbial community and, by integrating with an additional metatranscriptomic assembly, successfully obtained 11 draft genomes most of which represent yet uncultured and/or rare taxa (relative abundance 90%) and its metabolic potentials and gene expression profile, providing initial molecular insights into the ecological role of these lesser known, but potentially important, microorganisms in the AMD environment. Gene transcriptional analysis of the active taxa revealed major metabolic capabilities executed in situ, including carbon- and nitrogen-related metabolisms associated with syntrophic interactions, iron and sulfur oxidation, which are key in energy conservation and AMD generation, and the mechanisms of adaptation and response to the environmental stresses (heavy metals, low pH and oxidative stress). Remarkably, nitrogen fixation and sulfur oxidation were performed by the rare taxa, indicating their critical roles in the overall functioning and assembly of the AMD community. Our study demonstrates the potential of the 'divide and conquer' strategy in high-throughput sequencing data assembly for genome reconstruction and functional partitioning analysis of both dominant and rare species in natural microbial assemblages.

  16. Metatranscriptomic and metagenomic description of the bacterial nitrogen metabolism in waste water wet oxidation effluents

    Julien Crovadore

    2017-10-01

    Full Text Available Anaerobic digestion is a common method for reducing the amount of sludge solids in used waters and enabling biogas production. The wet oxidation process (WOX improves anaerobic digestion by converting carbon into methane through oxidation of organic compounds. WOX produces effluents rich in ammonia, which must be removed to maintain the activity of methanogens. Ammonia removal from WOX could be biologically operated by aerobic granules. To this end, granulation experiments were conducted in 2 bioreactors containing an activated sludge (AS. For the first time, the dynamics of the microbial community structure and the expression levels of 7 enzymes of the nitrogen metabolism in such active microbial communities were followed in regard to time by metagenomics and metatranscriptomics. It was shown that bacterial communities adapt to the wet oxidation effluent by increasing the expression level of the nitrogen metabolism, suggesting that these biological activities could be a less costly alternative for the elimination of ammonia, resulting in a reduction of the use of chemicals and energy consumption in sewage plants. This study reached a strong sequencing depth (from 4.4 to 7.6 Gb and enlightened a yet unknown diversity of the microorganisms involved in the nitrogen pathway. Moreover, this approach revealed the abundance and expression levels of specialised enzymes involved in nitrification, denitrification, ammonification, dissimilatory nitrate reduction to ammonium (DNRA and nitrogen fixation processes in AS. Keywords: Applied sciences, Biological sciences, Environmental science, Genetics, Microbiology

  17. Ecological and genetic interactions between cyanobacteria and viruses in a low-oxygen mat community inferred through metagenomics and metatranscriptomics.

    Voorhies, Alexander A; Eisenlord, Sarah D; Marcus, Daniel N; Duhaime, Melissa B; Biddanda, Bopaiah A; Cavalcoli, James D; Dick, Gregory J

    2016-02-01

    Metagenomic and metatranscriptomic sequencing was conducted on cyanobacterial mats of the Middle Island Sinkhole (MIS), Lake Huron. Metagenomic data from 14 samples collected over 5 years were used to reconstruct genomes of two genotypes of a novel virus, designated PhV1 type A and PhV1 type B. Both viral genotypes encode and express nblA, a gene involved in degrading phycobilisomes, which are complexes of pigmented proteins that harvest light for photosynthesis. Phylogenetic analysis indicated that the viral-encoded nblA is derived from the host cyanobacterium, Phormidium MIS-PhA. The cyanobacterial host also has two complete CRISPR (clustered regularly interspaced short palindromic repeats) systems that serve as defence mechanisms for bacteria and archaea against viruses and plasmids. One 45 bp CRISPR spacer from Phormidium had 100% nucleotide identity to PhV1 type B, but this region was absent from PhV1 type A. Transcripts from PhV1 and the Phormidium CRISPR loci were detected in all six metatranscriptomic data sets (three during the day and three at night), indicating that both are transcriptionally active in the environment. These results reveal ecological and genetic interactions between viruses and cyanobacteria at MIS, highlighting the value of parallel analysis of viruses and hosts in understanding ecological interactions in natural communities. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.

  18. An Integrated Metagenomics/Metaproteomics Investigation of the Microbial Communities and Enzymes in Solid-state Fermentation of Pu-erh tea

    Zhao, Ming; Zhang, Dong-lian; Su, Xiao-qin; Duan, Shuang-mei; Wan, Jin-qiong; Yuan, Wen-xia; Liu, Ben-ying; Ma, Yan; Pan, Ying-hong

    2015-01-01

    Microbial enzymes during solid-state fermentation (SSF), which play important roles in the food, chemical, pharmaceutical and environmental fields, remain relatively unknown. In this work, the microbial communities and enzymes in SSF of Pu-erh tea, a well-known traditional Chinese tea, were investigated by integrated metagenomics/metaproteomics approach. The dominant bacteria and fungi were identified as Proteobacteria (48.42%) and Aspergillus (94.98%), through pyrosequencing-based analyses of the bacterial 16S and fungal 18S rRNA genes, respectively. In total, 335 proteins with at least two unique peptides were identified and classified into 28 Biological Processes and 35 Molecular Function categories using a metaproteomics analysis. The integration of metagenomics and metaproteomics data demonstrated that Aspergillus was dominant fungus and major host of identified proteins (50.45%). Enzymes involved in the degradation of the plant cell wall were identified and associated with the soft-rotting of tea leaves. Peroxiredoxins, catalase and peroxidases were associated with the oxidation of catechins. In conclusion, this work greatly advances our understanding of the SSF of Pu-erh tea and provides a powerful tool for studying SSF mechanisms, especially in relation to the microbial communities present. PMID:25974221

  19. Microbiota composition, gene pool and its expression in Gir cattle (Bos indicus) rumen under different forage diets using metagenomic and metatranscriptomic approaches.

    Pandit, Ramesh J; Hinsu, Ankit T; Patel, Shriram H; Jakhesara, Subhash J; Koringa, Prakash G; Bruno, Fosso; Psifidi, Androniki; Shah, S V; Joshi, Chaitanya G

    2018-03-09

    Zebu (Bos indicus) is a domestic cattle species originating from the Indian subcontinent and now widely domesticated on several continents. In this study, we were particularly interested in understanding the functionally active rumen microbiota of an important Zebu breed, the Gir, under different dietary regimes. Metagenomic and metatranscriptomic data were compared at various taxonomic levels to elucidate the differential microbial population and its functional dynamics in Gir cattle rumen under different roughage dietary regimes. Different proportions of roughage rather than the type of roughage (dry or green) modulated microbiome composition and the expression of its gene pool. Fibre degrading bacteria (i.e. Clostridium, Ruminococcus, Eubacterium, Butyrivibrio, Bacillus and Roseburia) were higher in the solid fraction of rumen (Pcomparison of metagenomic shotgun and metatranscriptomic sequencing appeared to be a much richer source of information compared to conventional metagenomic analysis. Copyright © 2018 Elsevier GmbH. All rights reserved.

  20. Using metagenomics and metatranscriptomics to study specific bacterial species involved in biological phosphorus removal from wastewater

    Albertsen, Mads; McIlroy, Simon Jon; Stokholm-Bjerregaard, Mikkel

    an enrichment of the target organisms in laboratory scale reactors under controlled conditions. We demonstrate that it is now easy and affordable to extract genomes of all the dominant organisms from reactors due to reduced micro-diversity and further use these to examine their individual gene expression...... profiles by metatranscriptomics. To demonstrate this we revisited the bacteria involved in enhanced biological phosphorus removal (EBPR) from wastewater treatment plants. The EBPR process is used all over the world, has a large body of information regarding the underlying microbiology, and is often studied...

  1. MerCat: a versatile k-mer counter and diversity estimator for database-independent property analysis obtained from metagenomic and/or metatranscriptomic sequencing data

    White, Richard A.; Panyala, Ajay R.; Glass, Kevin A.; Colby, Sean M.; Glaesemann, Kurt R.; Jansson, Georg C.; Jansson, Janet K.

    2017-02-21

    MerCat is a parallel, highly scalable and modular property software package for robust analysis of features in next-generation sequencing data. MerCat inputs include assembled contigs and raw sequence reads from any platform resulting in feature abundance counts tables. MerCat allows for direct analysis of data properties without reference sequence database dependency commonly used by search tools such as BLAST and/or DIAMOND for compositional analysis of whole community shotgun sequencing (e.g. metagenomes and metatranscriptomes).

  2. Metagenomic and Metatranscriptomic Analyses of Diverse Watermelon Cultivars Reveal the Role of Fruit Associated Microbiome in Carbohydrate Metabolism and Ripening of Mature Fruits

    Thangasamy Saminathan

    2018-01-01

    Full Text Available The plant microbiome is a key determinant of plant health and productivity, and changes in the plant microbiome can alter the tolerance to biotic and abiotic stresses and the quality of end produce. Little is known about the microbial diversity and its effect on carbohydrate metabolism in ripe fruits. In this study, we aimed to understand the diversity and function of microorganisms in relation to carbohydrate metabolism of ripe watermelon fruits. We used 16S metagenomics and RNAseq metatranscriptomics for analysis of red (PI459074, Congo, and SDRose and yellow fruit-flesh cultivars (PI227202, PI435990, and JBush of geographically and metabolically diverse watermelon cultivars. Metagenomics data showed that Proteobacteria were abundant in SDRose and PI227202, whereas Cyanobacteria were most abundant in Congo and PI4559074. In the case of metatranscriptome data, Proteobacteria was the most abundant in all cultivars. High expression of genes linked to infectious diseases and the expression of peptidoglycan hydrolases associated to pathogenicity of eukaryotic hosts was observed in SDRose, which could have resulted in low microbial diversity in this cultivar. The presence of GH28, associated with polygalacturonase activity in JBush and SDRose could be related to cell wall modifications including de-esterification and depolymerization, and consequent loss of galacturonic acid and neutral sugars. Moreover, based on the KEGG annotation of the expressed genes, nine α-galactosidase genes involved in key processes of galactosyl oligosaccharide metabolism, such as raffinose family were identified and galactose metabolism pathway was reconstructed. Results of this study underline the links between the host and fruit-associated microbiome in carbohydrate metabolism of the ripe fruits. The cultivar difference in watermelon reflects the quantum and diversity of the microbiome, which would benefit watermelon and other plant breeders aiming at the holobiont

  3. Metagenomic and Metatranscriptomic Analyses Reveal the Structure and Dynamics of a Dechlorinating Community Containing Dehalococcoides mccartyi and Corrinoid-Providing Microorganisms under Cobalamin-Limited Conditions

    Men, Yujie; Yu, Ke; Bælum, Jacob; Gao, Ying; Tremblay, Julien; Prestat, Emmanuel; Stenuit, Ben; Tringe, Susannah G.; Jansson, Janet; Zhang, Tong; Alvarez-Cohen, Lisa; Liu, Shuang-Jiang

    2017-02-10

    ABSTRACT

    The aim of this study is to obtain a systems-level understanding of the interactions betweenDehalococcoidesand corrinoid-supplying microorganisms by analyzing community structures and functional compositions, activities, and dynamics in trichloroethene (TCE)-dechlorinating enrichments. Metagenomes and metatranscriptomes of the dechlorinating enrichments with and without exogenous cobalamin were compared. Seven putative draft genomes were binned from the metagenomes. At an early stage (2 days), more transcripts of genes in theVeillonellaceaebin-genome were detected in the metatranscriptome of the enrichment without exogenous cobalamin than in the one with the addition of cobalamin. Among these genes, sporulation-related genes exhibited the highest differential expression when cobalamin was not added, suggesting a possible release route of corrinoids from corrinoid producers. Other differentially expressed genes include those involved in energy conservation and nutrient transport (including cobalt transport). The most highly expressed corrinoidde novobiosynthesis pathway was also assigned to theVeillonellaceaebin-genome. Targeted quantitative PCR (qPCR) analyses confirmed higher transcript abundances of those corrinoid biosynthesis genes in the enrichment without exogenous cobalamin than in the enrichment with cobalamin. Furthermore, the corrinoid salvaging and modification pathway ofDehalococcoideswas upregulated in response to the cobalamin stress. This study provides important insights into the microbial interactions and roles played by members of dechlorinating communities under cobalamin-limited conditions.

    IMPORTANCEThe key

  4. The rumen microbial metaproteome as revealed by SDS-PAGE.

    Snelling, Timothy J; Wallace, R John

    2017-01-07

    Ruminal digestion is carried out by large numbers of bacteria, archaea, protozoa and fungi. Understanding the microbiota is important because ruminal fermentation dictates the efficiency of feed utilisation by the animal and is also responsible for major emissions of the greenhouse gas, methane. Recent metagenomic and metatranscriptomic studies have helped to elucidate many features of the composition and activity of the microbiota. The metaproteome provides complementary information to these other -omics technologies. The aim of this study was to explore the metaproteome of bovine and ovine ruminal digesta using 2D SDS-PAGE. Digesta samples were taken via ruminal fistulae and by gastric intubation, or at slaughter, and stored in glycerol at -80 °C. A protein extraction protocol was developed to maximise yield and representativeness of the protein content. The proteome of ruminal digesta taken from dairy cows fed a high concentrate diet was dominated by a few very highly expressed proteins, which were identified by LC-MS/MS to be structural proteins, such as actin and α- and β-tubulins, derived from ciliate protozoa. Removal of protozoa from digesta before extraction of proteins revealed the prokaryotic metaproteome, which was dominated by enzymes involved in glycolysis, such as glyceraldehyde-3-phosphate dehydrogenase, phosphoenolpyruvate carboxykinase, phosphoglycerate kinase and triosephosphate isomerase. The enzymes were predominantly from the Firmicutes and Bacteroidetes phyla. Enzymes from methanogenic archaea were also abundant, consistent with the importance of methane formation in the rumen. Gels from samples from dairy cows fed a high proportion of grass silage were consistently obscured by co-staining of humic compounds. Samples from beef cattle and fattening lambs receiving a predominantly concentrate diet produced clearer gels, but the pattern of spots was inconsistent between samples, making comparisons difficult. This work demonstrated for the

  5. MetaPro-IQ: a universal metaproteomic approach to studying human and mouse gut microbiota.

    Zhang, Xu; Ning, Zhibin; Mayne, Janice; Moore, Jasmine I; Li, Jennifer; Butcher, James; Deeke, Shelley Ann; Chen, Rui; Chiang, Cheng-Kang; Wen, Ming; Mack, David; Stintzi, Alain; Figeys, Daniel

    2016-06-24

    The gut microbiota has been shown to be closely associated with human health and disease. While next-generation sequencing can be readily used to profile the microbiota taxonomy and metabolic potential, metaproteomics is better suited for deciphering microbial biological activities. However, the application of gut metaproteomics has largely been limited due to the low efficiency of protein identification. Thus, a high-performance and easy-to-implement gut metaproteomic approach is required. In this study, we developed a high-performance and universal workflow for gut metaproteome identification and quantification (named MetaPro-IQ) by using the close-to-complete human or mouse gut microbial gene catalog as database and an iterative database search strategy. An average of 38 and 33 % of the acquired tandem mass spectrometry (MS) spectra was confidently identified for the studied mouse stool and human mucosal-luminal interface samples, respectively. In total, we accurately quantified 30,749 protein groups for the mouse metaproteome and 19,011 protein groups for the human metaproteome. Moreover, the MetaPro-IQ approach enabled comparable identifications with the matched metagenome database search strategy that is widely used but needs prior metagenomic sequencing. The response of gut microbiota to high-fat diet in mice was then assessed, which showed distinct metaproteome patterns for high-fat-fed mice and identified 849 proteins as significant responders to high-fat feeding in comparison to low-fat feeding. We present MetaPro-IQ, a metaproteomic approach for highly efficient intestinal microbial protein identification and quantification, which functions as a universal workflow for metaproteomic studies, and will thus facilitate the application of metaproteomics for better understanding the functions of gut microbiota in health and disease.

  6. ATLAS (Automatic Tool for Local Assembly Structures) - A Comprehensive Infrastructure for Assembly, Annotation, and Genomic Binning of Metagenomic and Metaranscripomic Data

    White, Richard A.; Brown, Joseph M.; Colby, Sean M.; Overall, Christopher C.; Lee, Joon-Yong; Zucker, Jeremy D.; Glaesemann, Kurt R.; Jansson, Georg C.; Jansson, Janet K.

    2017-03-02

    ATLAS (Automatic Tool for Local Assembly Structures) is a comprehensive multiomics data analysis pipeline that is massively parallel and scalable. ATLAS contains a modular analysis pipeline for assembly, annotation, quantification and genome binning of metagenomics and metatranscriptomics data and a framework for reference metaproteomic database construction. ATLAS transforms raw sequence data into functional and taxonomic data at the microbial population level and provides genome-centric resolution through genome binning. ATLAS provides robust taxonomy based on majority voting of protein coding open reading frames rolled-up at the contig level using modified lowest common ancestor (LCA) analysis. ATLAS provides robust taxonomy based on majority voting of protein coding open reading frames rolled-up at the contig level using modified lowest common ancestor (LCA) analysis. ATLAS is user-friendly, easy install through bioconda maintained as open-source on GitHub, and is implemented in Snakemake for modular customizable workflows.

  7. Metaproteomics: extracting and mining proteome information to characterize metabolic activities in microbial communities.

    Abraham, Paul E; Giannone, Richard J; Xiong, Weili; Hettich, Robert L

    2014-06-17

    Contemporary microbial ecology studies usually employ one or more "omics" approaches to investigate the structure and function of microbial communities. Among these, metaproteomics aims to characterize the metabolic activities of the microbial membership, providing a direct link between the genetic potential and functional metabolism. The successful deployment of metaproteomics research depends on the integration of high-quality experimental and bioinformatic techniques for uncovering the metabolic activities of a microbial community in a way that is complementary to other "meta-omic" approaches. The essential, quality-defining informatics steps in metaproteomics investigations are: (1) construction of the metagenome, (2) functional annotation of predicted protein-coding genes, (3) protein database searching, (4) protein inference, and (5) extraction of metabolic information. In this article, we provide an overview of current bioinformatic approaches and software implementations in metaproteome studies in order to highlight the key considerations needed for successful implementation of this powerful community-biology tool. Copyright © 2014 John Wiley & Sons, Inc.

  8. Shotgun metaproteomics of the human distal gut microbiota

    VerBerkmoes, N.C.; Russell, A.L.; Shah, M.; Godzik, A.; Rosenquist, M.; Halfvarsson, J.; Lefsrud, M.G.; Apajalahti, J.; Tysk, C.; Hettich, R.L.; Jansson, Janet K.

    2008-10-15

    The human gut contains a dense, complex and diverse microbial community, comprising the gut microbiome. Metagenomics has recently revealed the composition of genes in the gut microbiome, but provides no direct information about which genes are expressed or functioning. Therefore, our goal was to develop a novel approach to directly identify microbial proteins in fecal samples to gain information about the genes expressed and about key microbial functions in the human gut. We used a non-targeted, shotgun mass spectrometry-based whole community proteomics, or metaproteomics, approach for the first deep proteome measurements of thousands of proteins in human fecal samples, thus demonstrating this approach on the most complex sample type to date. The resulting metaproteomes had a skewed distribution relative to the metagenome, with more proteins for translation, energy production and carbohydrate metabolism when compared to what was earlier predicted from metagenomics. Human proteins, including antimicrobial peptides, were also identified, providing a non-targeted glimpse of the host response to the microbiota. Several unknown proteins represented previously undescribed microbial pathways or host immune responses, revealing a novel complex interplay between the human host and its associated microbes.

  9. COMAN: a web server for comprehensive metatranscriptomics analysis.

    Ni, Yueqiong; Li, Jun; Panagiotou, Gianni

    2016-08-11

    Microbiota-oriented studies based on metagenomic or metatranscriptomic sequencing have revolutionised our understanding on microbial ecology and the roles of both clinical and environmental microbes. The analysis of massive metatranscriptomic data requires extensive computational resources, a collection of bioinformatics tools and expertise in programming. We developed COMAN (Comprehensive Metatranscriptomics Analysis), a web-based tool dedicated to automatically and comprehensively analysing metatranscriptomic data. COMAN pipeline includes quality control of raw reads, removal of reads derived from non-coding RNA, followed by functional annotation, comparative statistical analysis, pathway enrichment analysis, co-expression network analysis and high-quality visualisation. The essential data generated by COMAN are also provided in tabular format for additional analysis and integration with other software. The web server has an easy-to-use interface and detailed instructions, and is freely available at http://sbb.hku.hk/COMAN/ CONCLUSIONS: COMAN is an integrated web server dedicated to comprehensive functional analysis of metatranscriptomic data, translating massive amount of reads to data tables and high-standard figures. It is expected to facilitate the researchers with less expertise in bioinformatics in answering microbiota-related biological questions and to increase the accessibility and interpretation of microbiota RNA-Seq data.

  10. Illuminating structural proteins in viral "dark matter" with metaproteomics.

    Brum, Jennifer R; Ignacio-Espinoza, J Cesar; Kim, Eun-Hae; Trubl, Gareth; Jones, Robert M; Roux, Simon; VerBerkmoes, Nathan C; Rich, Virginia I; Sullivan, Matthew B

    2016-03-01

    Viruses are ecologically important, yet environmental virology is limited by dominance of unannotated genomic sequences representing taxonomic and functional "viral dark matter." Although recent analytical advances are rapidly improving taxonomic annotations, identifying functional dark matter remains problematic. Here, we apply paired metaproteomics and dsDNA-targeted metagenomics to identify 1,875 virion-associated proteins from the ocean. Over one-half of these proteins were newly functionally annotated and represent abundant and widespread viral metagenome-derived protein clusters (PCs). One primarily unannotated PC dominated the dataset, but structural modeling and genomic context identified this PC as a previously unidentified capsid protein from multiple uncultivated tailed virus families. Furthermore, four of the five most abundant PCs in the metaproteome represent capsid proteins containing the HK97-like protein fold previously found in many viruses that infect all three domains of life. The dominance of these proteins within our dataset, as well as their global distribution throughout the world's oceans and seas, supports prior hypotheses that this HK97-like protein fold is the most abundant biological structure on Earth. Together, these culture-independent analyses improve virion-associated protein annotations, facilitate the investigation of proteins within natural viral communities, and offer a high-throughput means of illuminating functional viral dark matter.

  11. MetaComp: comprehensive analysis software for comparative meta-omics including comparative metagenomics.

    Zhai, Peng; Yang, Longshu; Guo, Xiao; Wang, Zhe; Guo, Jiangtao; Wang, Xiaoqi; Zhu, Huaiqiu

    2017-10-02

    During the past decade, the development of high throughput nucleic sequencing and mass spectrometry analysis techniques have enabled the characterization of microbial communities through metagenomics, metatranscriptomics, metaproteomics and metabolomics data. To reveal the diversity of microbial communities and interactions between living conditions and microbes, it is necessary to introduce comparative analysis based upon integration of all four types of data mentioned above. Comparative meta-omics, especially comparative metageomics, has been established as a routine process to highlight the significant differences in taxon composition and functional gene abundance among microbiota samples. Meanwhile, biologists are increasingly concerning about the correlations between meta-omics features and environmental factors, which may further decipher the adaptation strategy of a microbial community. We developed a graphical comprehensive analysis software named MetaComp comprising a series of statistical analysis approaches with visualized results for metagenomics and other meta-omics data comparison. This software is capable to read files generated by a variety of upstream programs. After data loading, analyses such as multivariate statistics, hypothesis testing of two-sample, multi-sample as well as two-group sample and a novel function-regression analysis of environmental factors are offered. Here, regression analysis regards meta-omic features as independent variable and environmental factors as dependent variables. Moreover, MetaComp is capable to automatically choose an appropriate two-group sample test based upon the traits of input abundance profiles. We further evaluate the performance of its choice, and exhibit applications for metagenomics, metaproteomics and metabolomics samples. MetaComp, an integrative software capable for applying to all meta-omics data, originally distills the influence of living environment on microbial community by regression analysis

  12. Detection of large numbers of novel sequences in the metatranscriptomes of complex marine microbial communities.

    Gilbert, Jack A; Field, Dawn; Huang, Ying; Edwards, Rob; Li, Weizhong; Gilna, Paul; Joint, Ian

    2008-08-22

    Sequencing the expressed genetic information of an ecosystem (metatranscriptome) can provide information about the response of organisms to varying environmental conditions. Until recently, metatranscriptomics has been limited to microarray technology and random cloning methodologies. The application of high-throughput sequencing technology is now enabling access to both known and previously unknown transcripts in natural communities. We present a study of a complex marine metatranscriptome obtained from random whole-community mRNA using the GS-FLX Pyrosequencing technology. Eight samples, four DNA and four mRNA, were processed from two time points in a controlled coastal ocean mesocosm study (Bergen, Norway) involving an induced phytoplankton bloom producing a total of 323,161,989 base pairs. Our study confirms the finding of the first published metatranscriptomic studies of marine and soil environments that metatranscriptomics targets highly expressed sequences which are frequently novel. Our alternative methodology increases the range of experimental options available for conducting such studies and is characterized by an exceptional enrichment of mRNA (99.92%) versus ribosomal RNA. Analysis of corresponding metagenomes confirms much higher levels of assembly in the metatranscriptomic samples and a far higher yield of large gene families with >100 members, approximately 91% of which were novel. This study provides further evidence that metatranscriptomic studies of natural microbial communities are not only feasible, but when paired with metagenomic data sets, offer an unprecedented opportunity to explore both structure and function of microbial communities--if we can overcome the challenges of elucidating the functions of so many never-seen-before gene families.

  13. Snapshot of the Eukaryotic Gene Expression in Muskoxen Rumen—A Metatranscriptomic Approach

    O'Toole, Nicholas; Barboza, Perry S.; Ungerfeld, Emilio; Leigh, Mary Beth; Selinger, L. Brent; Butler, Greg; Tsang, Adrian; McAllister, Tim A.; Forster, Robert J.

    2011-01-01

    Background Herbivores rely on digestive tract lignocellulolytic microorganisms, including bacteria, fungi and protozoa, to derive energy and carbon from plant cell wall polysaccharides. Culture independent metagenomic studies have been used to reveal the genetic content of the bacterial species within gut microbiomes. However, the nature of the genes encoded by eukaryotic protozoa and fungi within these environments has not been explored using metagenomic or metatranscriptomic approaches. Methodology/Principal Findings In this study, a metatranscriptomic approach was used to investigate the functional diversity of the eukaryotic microorganisms within the rumen of muskoxen (Ovibos moschatus), with a focus on plant cell wall degrading enzymes. Polyadenylated RNA (mRNA) was sequenced on the Illumina Genome Analyzer II system and 2.8 gigabases of sequences were obtained and 59129 contigs assembled. Plant cell wall degrading enzyme modules including glycoside hydrolases, carbohydrate esterases and polysaccharide lyases were identified from over 2500 contigs. These included a number of glycoside hydrolase family 6 (GH6), GH48 and swollenin modules, which have rarely been described in previous gut metagenomic studies. Conclusions/Significance The muskoxen rumen metatranscriptome demonstrates a much higher percentage of cellulase enzyme discovery and an 8.7x higher rate of total carbohydrate active enzyme discovery per gigabase of sequence than previous rumen metagenomes. This study provides a snapshot of eukaryotic gene expression in the muskoxen rumen, and identifies a number of candidate genes coding for potentially valuable lignocellulolytic enzymes. PMID:21655220

  14. Snapshot of the eukaryotic gene expression in muskoxen rumen--a metatranscriptomic approach.

    Meng Qi

    Full Text Available BACKGROUND: Herbivores rely on digestive tract lignocellulolytic microorganisms, including bacteria, fungi and protozoa, to derive energy and carbon from plant cell wall polysaccharides. Culture independent metagenomic studies have been used to reveal the genetic content of the bacterial species within gut microbiomes. However, the nature of the genes encoded by eukaryotic protozoa and fungi within these environments has not been explored using metagenomic or metatranscriptomic approaches. METHODOLOGY/PRINCIPAL FINDINGS: In this study, a metatranscriptomic approach was used to investigate the functional diversity of the eukaryotic microorganisms within the rumen of muskoxen (Ovibos moschatus, with a focus on plant cell wall degrading enzymes. Polyadenylated RNA (mRNA was sequenced on the Illumina Genome Analyzer II system and 2.8 gigabases of sequences were obtained and 59129 contigs assembled. Plant cell wall degrading enzyme modules including glycoside hydrolases, carbohydrate esterases and polysaccharide lyases were identified from over 2500 contigs. These included a number of glycoside hydrolase family 6 (GH6, GH48 and swollenin modules, which have rarely been described in previous gut metagenomic studies. CONCLUSIONS/SIGNIFICANCE: The muskoxen rumen metatranscriptome demonstrates a much higher percentage of cellulase enzyme discovery and an 8.7x higher rate of total carbohydrate active enzyme discovery per gigabase of sequence than previous rumen metagenomes. This study provides a snapshot of eukaryotic gene expression in the muskoxen rumen, and identifies a number of candidate genes coding for potentially valuable lignocellulolytic enzymes.

  15. An integrated metagenome and -proteome analysis of the microbial community residing in a biogas production plant.

    Ortseifen, Vera; Stolze, Yvonne; Maus, Irena; Sczyrba, Alexander; Bremges, Andreas; Albaum, Stefan P; Jaenicke, Sebastian; Fracowiak, Jochen; Pühler, Alfred; Schlüter, Andreas

    2016-08-10

    To study the metaproteome of a biogas-producing microbial community, fermentation samples were taken from an agricultural biogas plant for microbial cell and protein extraction and corresponding metagenome analyses. Based on metagenome sequence data, taxonomic community profiling was performed to elucidate the composition of bacterial and archaeal sub-communities. The community's cytosolic metaproteome was represented in a 2D-PAGE approach. Metaproteome databases for protein identification were compiled based on the assembled metagenome sequence dataset for the biogas plant analyzed and non-corresponding biogas metagenomes. Protein identification results revealed that the corresponding biogas protein database facilitated the highest identification rate followed by other biogas-specific databases, whereas common public databases yielded insufficient identification rates. Proteins of the biogas microbiome identified as highly abundant were assigned to the pathways involved in methanogenesis, transport and carbon metabolism. Moreover, the integrated metagenome/-proteome approach enabled the examination of genetic-context information for genes encoding identified proteins by studying neighboring genes on the corresponding contig. Exemplarily, this approach led to the identification of a Methanoculleus sp. contig encoding 16 methanogenesis-related gene products, three of which were also detected as abundant proteins within the community's metaproteome. Thus, metagenome contigs provide additional information on the genetic environment of identified abundant proteins. Copyright © 2016 Elsevier B.V. All rights reserved.

  16. Metaproteomics Identifies the Protein Machinery Involved in Metal and Radionuclide Reduction in Subsurface Microbiomes and Elucidates Mechanisms and U(VI) Reduction Immobilization

    Pfiffner, Susan M. [Univ. of Tennessee, Knoxville, TN (United States); Löffler, Frank [Univ. of Tennessee, Knoxville, TN (United States); Ritalahti, Kirsti [Univ. of Tennessee, Knoxville, TN (United States); Sayler, Gary [Univ. of Tennessee, Knoxville, TN (United States); Layton, Alice [Univ. of Tennessee, Knoxville, TN (United States); Hettich, Robert [Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)

    2015-08-31

    The overall goal for this funded project was to develop and exploit environmental metaproteomics tools to identify biomarkers for monitoring microbial activity affecting U speciation at U-contaminated sites, correlate metaproteomics profiles with geochemical parameters and U(VI) reduction activity (or lack thereof), elucidate mechanisms contributing to U(VI) reduction, and provide remediation project managers with additional information to make science-based site management decisions for achieving cleanup goals more efficiently. Although significant progress has been made in elucidating the microbiology contribution to metal and radionuclide reduction, the cellular components, pathway(s), and mechanisms involved in U trans-formation remain poorly understood. Recent advances in (meta)proteomics technology enable detailed studies of complex samples, including environmental samples, which differ between sites and even show considerable variability within the same site (e.g., the Oak Ridge IFRC site). Additionally, site-specific geochemical conditions affect microbial activity and function, suggesting generalized assessment and interpretations may not suffice. This research effort integrated current understanding of the microbiology and biochemistry of U(VI) reduction and capitalize on advances in proteomics technology made over the past few years. Field-related analyses used Oak Ridge IFRC field ground water samples from locations where slow-release substrate biostimulation has been implemented to accelerate in situ U(VI) reduction rates. Our overarching hypothesis was that the metabolic signature in environmental samples, as deciphered by the metaproteome measurements, would show a relationship with U(VI) reduction activity. Since metaproteomic and metagenomic characterizations were computationally challenging and time-consuming, we used a tiered approach that combines database mining, controlled laboratory studies, U(VI) reduction activity measurements, phylogenetic

  17. Unipept web services for metaproteomics analysis.

    Mesuere, Bart; Willems, Toon; Van der Jeugt, Felix; Devreese, Bart; Vandamme, Peter; Dawyndt, Peter

    2016-06-01

    Unipept is an open source web application that is designed for metaproteomics analysis with a focus on interactive datavisualization. It is underpinned by a fast index built from UniProtKB and the NCBI taxonomy that enables quick retrieval of all UniProt entries in which a given tryptic peptide occurs. Unipept version 2.4 introduced web services that provide programmatic access to the metaproteomics analysis features. This enables integration of Unipept functionality in custom applications and data processing pipelines. The web services are freely available at http://api.unipept.ugent.be and are open sourced under the MIT license. Unipept@ugent.be Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  18. Metagenomic and metatranscriptomic analysis of the microbiome of watermelon fruits

    The plant microbiome is a key determinant of plant health and productivity, and alteration of the plant microbiome can increase the quality of agricultural products. Little is known about the microbial population in fruit development of plants. In this study, we aimed to understand the function of m...

  19. Organic matter processing by microbial communities throughout the Atlantic water column as revealed by metaproteomics

    Bergauer, Kristin; Fernandez-Guerra, Antonio; Garcia, Juan A L

    2018-01-01

    The phylogenetic composition of the heterotrophic microbial community is depth stratified in the oceanic water column down to abyssopelagic layers. In the layers below the euphotic zone, it has been suggested that heterotrophic microbes rely largely on solubilized particulate organic matter...... as a carbon and energy source rather than on dissolved organic matter. To decipher whether changes in the phylogenetic composition with depth are reflected in changes in the bacterial and archaeal transporter proteins, we generated an extensive metaproteomic and metagenomic dataset of microbial communities...... collected from 100- to 5,000-m depth in the Atlantic Ocean. By identifying which compounds of the organic matter pool are absorbed, transported, and incorporated into microbial cells, intriguing insights into organic matter transformation in the deep ocean emerged. On average, solute transporters accounted...

  20. Metaproteomic analysis of human gut microbiota: where are we heading?

    Lee, Pey Yee; Chin, Siok-Fong; Neoh, Hui-Min; Jamal, Rahman

    2017-06-12

    The human gut is home to complex microbial populations that change dynamically in response to various internal and external stimuli. The gut microbiota provides numerous functional benefits that are crucial for human health but in the setting of a disturbed equilibrium, the microbial community can cause deleterious outcomes such as diseases and cancers. Characterization of the functional activities of human gut microbiota is fundamental to understand their roles in human health and disease. Metaproteomics, which refers to the study of the entire protein collection of the microbial community in a given sample is an emerging area of research that provides informative details concerning functional aspects of the microbiota. In this mini review, we present a summary of the progress of metaproteomic analysis for studying the functional role of gut microbiota. This is followed by an overview of the experimental approaches focusing on fecal specimen for metaproteomics and is concluded by a discussion on the challenges and future directions of metaproteomic research.

  1. Alignment-free Transcriptomic and Metatranscriptomic Comparison Using Sequencing Signatures with Variable Length Markov Chains.

    Liao, Weinan; Ren, Jie; Wang, Kun; Wang, Shun; Zeng, Feng; Wang, Ying; Sun, Fengzhu

    2016-11-23

    The comparison between microbial sequencing data is critical to understand the dynamics of microbial communities. The alignment-based tools analyzing metagenomic datasets require reference sequences and read alignments. The available alignment-free dissimilarity approaches model the background sequences with Fixed Order Markov Chain (FOMC) yielding promising results for the comparison of microbial communities. However, in FOMC, the number of parameters grows exponentially with the increase of the order of Markov Chain (MC). Under a fixed high order of MC, the parameters might not be accurately estimated owing to the limitation of sequencing depth. In our study, we investigate an alternative to FOMC to model background sequences with the data-driven Variable Length Markov Chain (VLMC) in metatranscriptomic data. The VLMC originally designed for long sequences was extended to apply to high-throughput sequencing reads and the strategies to estimate the corresponding parameters were developed. The flexible number of parameters in VLMC avoids estimating the vast number of parameters of high-order MC under limited sequencing depth. Different from the manual selection in FOMC, VLMC determines the MC order adaptively. Several beta diversity measures based on VLMC were applied to compare the bacterial RNA-Seq and metatranscriptomic datasets. Experiments show that VLMC outperforms FOMC to model the background sequences in transcriptomic and metatranscriptomic samples. A software pipeline is available at https://d2vlmc.codeplex.com.

  2. Insights from quantitative metaproteomics and protein-stable isotope probing into microbial ecology.

    von Bergen, Martin; Jehmlich, Nico; Taubert, Martin; Vogt, Carsten; Bastida, Felipe; Herbst, Florian-Alexander; Schmidt, Frank; Richnow, Hans-Hermann; Seifert, Jana

    2013-10-01

    The recent development of metaproteomics has enabled the direct identification and quantification of expressed proteins from microbial communities in situ, without the need for microbial enrichment. This became possible by (1) significant increases in quality and quantity of metagenome data and by improvements of (2) accuracy and (3) sensitivity of modern mass spectrometers (MS). The identification of physiologically relevant enzymes can help to understand the role of specific species within a community or an ecological niche. Beside identification, relative and absolute quantitation is also crucial. We will review label-free and label-based methods of quantitation in MS-based proteome analysis and the contribution of quantitative proteome data to microbial ecology. Additionally, approaches of protein-based stable isotope probing (protein-SIP) for deciphering community structures are reviewed. Information on the species-specific metabolic activity can be obtained when substrates or nutrients are labeled with stable isotopes in a protein-SIP approach. The stable isotopes ((13)C, (15)N, (36)S) are incorporated into proteins and the rate of incorporation can be used for assessing the metabolic activity of the corresponding species. We will focus on the relevance of the metabolic and phylogenetic information retrieved with protein-SIP studies and for detecting and quantifying the carbon flux within microbial consortia. Furthermore, the combination of protein-SIP with established tools in microbial ecology such as other stable isotope probing techniques are discussed.

  3. Prospects and challenges for fungal metatranscriptomics of complex communities

    Cheryl R. Kuske; Cedar N. Hesse; Jean F. Challacombe; Daniel Cullen; Joshua R. Herr; Rebecca C. Mueller; Adrian Tsang; Rytas Vilgalys

    2015-01-01

    The ability to extract and purify messenger RNA directly from plants, decomposing organic matter and soil, followed by highthroughput sequencing of the pool of expressed genes, has spawned the emerging research area of metatranscriptomics. Each metatranscriptome provides a snapshot of the composition and relative abundance of actively transcribed genes, and thus...

  4. Coupled RNA-SIP and metatranscriptomics of active chemolithoautotrophic communities at a deep-sea hydrothermal vent.

    Fortunato, Caroline S; Huber, Julie A

    2016-08-01

    The chemolithoautotrophic microbial community of the rocky subseafloor potentially provides a large amount of organic carbon to the deep ocean, yet our understanding of the activity and metabolic complexity of subseafloor organisms remains poorly described. A combination of metagenomic, metatranscriptomic, and RNA stable isotope probing (RNA-SIP) analyses were used to identify the metabolic potential, expression patterns, and active autotrophic bacteria and archaea and their pathways present in low-temperature hydrothermal fluids from Axial Seamount, an active submarine volcano. Metagenomic and metatranscriptomic results showed the presence of genes and transcripts for sulfur, hydrogen, and ammonium oxidation, oxygen respiration, denitrification, and methanogenesis, as well as multiple carbon fixation pathways. In RNA-SIP experiments across a range of temperatures under reducing conditions, the enriched (13)C fractions showed differences in taxonomic and functional diversity. At 30 °C and 55 °C, Epsilonproteobacteria were dominant, oxidizing hydrogen and primarily reducing nitrate. Methanogenic archaea were also present at 55 °C, and were the only autotrophs present at 80 °C. Correspondingly, the predominant CO2 fixation pathways changed from the reductive tricarboxylic acid (rTCA) cycle to the reductive acetyl-CoA pathway with increasing temperature. By coupling RNA-SIP with meta-omics, this study demonstrates the presence and activity of distinct chemolithoautotrophic communities across a thermal gradient of a deep-sea hydrothermal vent.

  5. The metatranscriptome of a deep-sea hydrothermal plume is dominated by water column methanotrophs and lithotrophs.

    Lesniewski, Ryan A; Jain, Sunit; Anantharaman, Karthik; Schloss, Patrick D; Dick, Gregory J

    2012-12-01

    Microorganisms mediate geochemical processes in deep-sea hydrothermal vent plumes, which are a conduit for transfer of elements and energy from the subsurface to the oceans. Despite this important microbial influence on marine geochemistry, the ecology and activity of microbial communities in hydrothermal plumes is largely unexplored. Here, we use a coordinated metagenomic and metatranscriptomic approach to compare microbial communities in Guaymas Basin hydrothermal plumes to background waters above the plume and in the adjacent Carmen Basin. Despite marked increases in plume total RNA concentrations (3-4 times) and microbially mediated manganese oxidation rates (15-125 times), plume and background metatranscriptomes were dominated by the same groups of methanotrophs and chemolithoautotrophs. Abundant community members of Guaymas Basin seafloor environments (hydrothermal sediments and chimneys) were not prevalent in the plume metatranscriptome. De novo metagenomic assembly was used to reconstruct genomes of abundant populations, including Marine Group I archaea, Methylococcaceae, SAR324 Deltaproteobacteria and SUP05 Gammaproteobacteria. Mapping transcripts to these genomes revealed abundant expression of genes involved in the chemolithotrophic oxidation of ammonia (amo), methane (pmo) and sulfur (sox). Whereas amo and pmo gene transcripts were abundant in both plume and background, transcripts of sox genes for sulfur oxidation from SUP05 groups displayed a 10-20-fold increase in plumes. We conclude that the biogeochemistry of Guaymas Basin hydrothermal plumes is mediated by microorganisms that are derived from seawater rather than from seafloor hydrothermal environments such as chimneys or sediments, and that hydrothermal inputs serve as important electron donors for primary production in the deep Gulf of California.

  6. Metagenomic and proteomic analyses to elucidate the mechanism of anaerobic benzene degradation

    Abu Laban, Nidal [Helmholtz (Germany)

    2011-07-01

    This paper presents the mechanism of anaerobic benzene degradation using metagenomic and proteomic analyses. The objective of the study is to find out the microbes and biochemistry involved in benzene degradation. Hypotheses are proposed for the initial activation mechanism of benzene under anaerobic conditions. Two methods for degradation, molecular characterization and identification of benzene-degrading enzymes, are described. The physiological and molecular characteristics of iron-reducing enrichment culture are given and the process is detailed. Metagenome analysis of iron-reducing culture is presented using a pie chart. From the metagenome analysis of benzene-degrading culture, putative mobile element genes were identified in the aromatic-degrading configurations. Metaproteomic analysis of iron-reducing cultures and the anaerobic benzene degradation pathway are also elucidated. From the study, it can be concluded that gram-positive bacteria are involved in benzene degradation under iron-reducing conditions and that the catalysis mechanism of putative anaerobic benzene carboxylase needs further investigation.

  7. Metatranscriptomic census of active protists in soils.

    Geisen, Stefan; Tveit, Alexander T; Clark, Ian M; Richter, Andreas; Svenning, Mette M; Bonkowski, Michael; Urich, Tim

    2015-10-01

    The high numbers and diversity of protists in soil systems have long been presumed, but their true diversity and community composition have remained largely concealed. Traditional cultivation-based methods miss a majority of taxa, whereas molecular barcoding approaches employing PCR introduce significant biases in reported community composition of soil protists. Here, we applied a metatranscriptomic approach to assess the protist community in 12 mineral and organic soil samples from different vegetation types and climatic zones using small subunit ribosomal RNA transcripts as marker. We detected a broad diversity of soil protists spanning across all known eukaryotic supergroups and revealed a strikingly different community composition than shown before. Protist communities differed strongly between sites, with Rhizaria and Amoebozoa dominating in forest and grassland soils, while Alveolata were most abundant in peat soils. The Amoebozoa were comprised of Tubulinea, followed with decreasing abundance by Discosea, Variosea and Mycetozoa. Transcripts of Oomycetes, Apicomplexa and Ichthyosporea suggest soil as reservoir of parasitic protist taxa. Further, Foraminifera and Choanoflagellida were ubiquitously detected, showing that these typically marine and freshwater protists are autochthonous members of the soil microbiota. To the best of our knowledge, this metatranscriptomic study provides the most comprehensive picture of active protist communities in soils to date, which is essential to target the ecological roles of protists in the complex soil system.

  8. Microbial metatranscriptomics in a permanent marine oxygen minimum zone.

    Stewart, Frank J; Ulloa, Osvaldo; DeLong, Edward F

    2012-01-01

    Simultaneous characterization of taxonomic composition, metabolic gene content and gene expression in marine oxygen minimum zones (OMZs) has potential to broaden perspectives on the microbial and biogeochemical dynamics in these environments. Here, we present a metatranscriptomic survey of microbial community metabolism in the Eastern Tropical South Pacific OMZ off northern Chile. Community RNA was sampled in late austral autumn from four depths (50, 85, 110, 200 m) extending across the oxycline and into the upper OMZ. Shotgun pyrosequencing of cDNA yielded 180,000 to 550,000 transcript sequences per depth. Based on functional gene representation, transcriptome samples clustered apart from corresponding metagenome samples from the same depth, highlighting the discrepancies between metabolic potential and actual transcription. BLAST-based characterizations of non-ribosomal RNA sequences revealed a dominance of genes involved with both oxidative (nitrification) and reductive (anammox, denitrification) components of the marine nitrogen cycle. Using annotations of protein-coding genes as proxies for taxonomic affiliation, we observed depth-specific changes in gene expression by key functional taxonomic groups. Notably, transcripts most closely matching the genome of the ammonia-oxidizing archaeon Nitrosopumilus maritimus dominated the transcriptome in the upper three depths, representing one in five protein-coding transcripts at 85 m. In contrast, transcripts matching the anammox bacterium Kuenenia stuttgartiensis dominated at the core of the OMZ (200 m; 1 in 12 protein-coding transcripts). The distribution of N. maritimus-like transcripts paralleled that of transcripts matching ammonia monooxygenase genes, which, despite being represented by both bacterial and archaeal sequences in the community DNA, were dominated (> 99%) by archaeal sequences in the RNA, suggesting a substantial role for archaeal nitrification in the upper OMZ. These data, as well as those

  9. Genome-centric metatranscriptomes and ecological roles of the active microbial populations during cellulosic biomass anaerobic digestion.

    Jia, Yangyang; Ng, Siu-Kin; Lu, Hongyuan; Cai, Mingwei; Lee, Patrick K H

    2018-01-01

    Although anaerobic digestion for biogas production is used worldwide in treatment processes to recover energy from carbon-rich waste such as cellulosic biomass, the activities and interactions among the microbial populations that perform anaerobic digestion deserve further investigations, especially at the population genome level. To understand the cellulosic biomass-degrading potentials in two full-scale digesters, this study examined five methanogenic enrichment cultures derived from the digesters that anaerobically digested cellulose or xylan for more than 2 years under 35 or 55 °C conditions. Metagenomics and metatranscriptomics were used to capture the active microbial populations in each enrichment culture and reconstruct their meta-metabolic network and ecological roles. 107 population genomes were reconstructed from the five enrichment cultures using a differential coverage binning approach, of which only a subset was highly transcribed in the metatranscriptomes. Phylogenetic and functional convergence of communities by enrichment condition and phase of fermentation was observed for the highly transcribed populations in the metatranscriptomes. In the 35 °C cultures grown on cellulose, Clostridium cellulolyticum -related and Ruminococcus -related bacteria were identified as major hydrolyzers and primary fermenters in the early growth phase, while Clostridium leptum -related bacteria were major secondary fermenters and potential fatty acid scavengers in the late growth phase. While the meta-metabolism and trophic roles of the cultures were similar, the bacterial populations performing each function were distinct between the enrichment conditions. Overall, a population genome-centric view of the meta-metabolism and functional roles of key active players in anaerobic digestion of cellulosic biomass was obtained. This study represents a major step forward towards understanding the microbial functions and interactions at population genome level during the

  10. The human oral metaproteome reveals potential biomarkers for caries disease

    Belda-Ferre, Pedro; Williamson, James; Simón-Soro, Áurea

    2015-01-01

    metabolism and immune response. We applied multivariate analysis in order to find the minimum set of proteins that better allows discrimination of healthy and caries-affected dental plaque samples, detecting seven bacterial and five human protein functions that allow determining the health status......Tooth decay is considered the most prevalent human disease worldwide. We present the first metaproteomic study of the oral biofilm, using different mass spectrometry approaches that have allowed us to quantify individual peptides in healthy and caries-bearing individuals. A total of 7771 bacterial...... and 853 human proteins were identified in 17 individuals, which provide the first available protein repertoire of human dental plaque. Actinomyces and Coryneybacterium represent a large proportion of the protein activity followed by Rothia and Streptococcus. Those four genera account for 60-90% of total...

  11. Metatranscriptomics of the human gut microbiome

    Sicheritz-Pontén, Thomas

    2011-01-01

    Our ‘other’ genome is the collective genetic information in all of the microorganisms that are living on and within us. Collectively known as the microbiome, these microbial cells outnumber human cells in the body by more than 10 to 1, and the genes carried by these organisms outnumber the genes ...... that there is a division of labor between the bacterial species in the human gut microbiome.......Our ‘other’ genome is the collective genetic information in all of the microorganisms that are living on and within us. Collectively known as the microbiome, these microbial cells outnumber human cells in the body by more than 10 to 1, and the genes carried by these organisms outnumber the genes...... in the human genome by more than 100 to 1. How these organisms contribute to and affect human health is poorly understood, but the emerging field of metagenomics promises a more comprehensive and complete understanding of the human microbiome. In the European-funded Metagenomics of the Human Intestinal Tract...

  12. Integrated metabolism in sponge-microbe symbiosis revealed by genome-centered metatranscriptomics.

    Moitinho-Silva, Lucas; Díez-Vives, Cristina; Batani, Giampiero; Esteves, Ana Is; Jahn, Martin T; Thomas, Torsten

    2017-07-01

    Despite an increased understanding of functions in sponge microbiomes, the interactions among the symbionts and between symbionts and host are not well characterized. Here we reconstructed the metabolic interactions within the sponge Cymbastela concentrica microbiome in the context of functional features of symbiotic diatoms and the host. Three genome bins (CcPhy, CcNi and CcThau) were recovered from metagenomic data of C. concentrica, belonging to the proteobacterial family Phyllobacteriaceae, the Nitrospira genus and the thaumarchaeal order Nitrosopumilales. Gene expression was estimated by mapping C. concentrica metatranscriptomic reads. Our analyses indicated that CcPhy is heterotrophic, while CcNi and CcThau are chemolithoautotrophs. CcPhy expressed many transporters for the acquisition of dissolved organic compounds, likely available through the sponge's filtration activity and symbiotic carbon fixation. Coupled nitrification by CcThau and CcNi was reconstructed, supported by the observed close proximity of the cells in fluorescence in situ hybridization. CcPhy facultative anaerobic respiration and assimilation by diatoms may consume the resulting nitrate. Transcriptional analysis of diatom and sponge functions indicated that these organisms are likely sources of organic compounds, for example, creatine/creatinine and dissolved organic carbon, for other members of the symbiosis. Our results suggest that organic nitrogen compounds, for example, creatine, creatinine, urea and cyanate, fuel the nitrogen cycle within the sponge. This study provides an unprecedented view of the metabolic interactions within sponge-microbe symbiosis, bridging the gap between cell- and community-level knowledge.

  13. Automated and Accurate Estimation of Gene Family Abundance from Shotgun Metagenomes.

    Stephen Nayfach

    2015-11-01

    Full Text Available Shotgun metagenomic DNA sequencing is a widely applicable tool for characterizing the functions that are encoded by microbial communities. Several bioinformatic tools can be used to functionally annotate metagenomes, allowing researchers to draw inferences about the functional potential of the community and to identify putative functional biomarkers. However, little is known about how decisions made during annotation affect the reliability of the results. Here, we use statistical simulations to rigorously assess how to optimize annotation accuracy and speed, given parameters of the input data like read length and library size. We identify best practices in metagenome annotation and use them to guide the development of the Shotgun Metagenome Annotation Pipeline (ShotMAP. ShotMAP is an analytically flexible, end-to-end annotation pipeline that can be implemented either on a local computer or a cloud compute cluster. We use ShotMAP to assess how different annotation databases impact the interpretation of how marine metagenome and metatranscriptome functional capacity changes across seasons. We also apply ShotMAP to data obtained from a clinical microbiome investigation of inflammatory bowel disease. This analysis finds that gut microbiota collected from Crohn's disease patients are functionally distinct from gut microbiota collected from either ulcerative colitis patients or healthy controls, with differential abundance of metabolic pathways related to host-microbiome interactions that may serve as putative biomarkers of disease.

  14. Validation of two ribosomal RNA removal methods for microbial metatranscriptomics

    He, Shaomei; Wurtzel, Omri; Singh, Kanwar; Froula, Jeff L; Yilmaz, Suzan; Tringe, Susannah G; Wang, Zhong; Chen, Feng; Lindquist, Erika A; Sorek, Rotem; Hugenholtz, Philip

    2010-10-01

    The predominance of rRNAs in the transcriptome is a major technical challenge in sequence-based analysis of cDNAs from microbial isolates and communities. Several approaches have been applied to deplete rRNAs from (meta)transcriptomes, but no systematic investigation of potential biases introduced by any of these approaches has been reported. Here we validated the effectiveness and fidelity of the two most commonly used approaches, subtractive hybridization and exonuclease digestion, as well as combinations of these treatments, on two synthetic five-microorganism metatranscriptomes using massively parallel sequencing. We found that the effectiveness of rRNA removal was a function of community composition and RNA integrity for these treatments. Subtractive hybridization alone introduced the least bias in relative transcript abundance, whereas exonuclease and in particular combined treatments greatly compromised mRNA abundance fidelity. Illumina sequencing itself also can compromise quantitative data analysis by introducing a G+C bias between runs.

  15. Metagenomics at Grass Roots

    CAMERA (Community Cyber-infrastructure for Advanced Mi- crobial Ecology .... Acidobacteria known to metabolize a variety of car- bon sources .... [7] J Nesme et al., Back to the future of soil metagenomics, Frontiers in Microbi- ology, Vol.7 ...

  16. Metagenomics at Grass Roots

    Metagenomics is a robust, interdisciplinary approach for studyingmicrobial community composition, function, and dynamics.It typically involves a core of molecular biology, microbiology,ecology, statistics, and computational biology. Excitingoutcomes anticipated from these studies include unravelingof complex interactions ...

  17. Genome-resolved metaproteomic characterization of preterm infant gut microbiota development reveals species-specific metabolic shifts and variabilities during early life.

    Xiong, Weili; Brown, Christopher T; Morowitz, Michael J; Banfield, Jillian F; Hettich, Robert L

    2017-07-10

    Establishment of the human gut microbiota begins at birth. This early-life microbiota development can impact host physiology during infancy and even across an entire life span. However, the functional stability and population structure of the gut microbiota during initial colonization remain poorly understood. Metaproteomics is an emerging technology for the large-scale characterization of metabolic functions in complex microbial communities (gut microbiota). We applied a metagenome-informed metaproteomic approach to study the temporal and inter-individual differences of metabolic functions during microbial colonization of preterm human infants' gut. By analyzing 30 individual fecal samples, we identified up to 12,568 protein groups for each of four infants, including both human and microbial proteins. With genome-resolved matched metagenomics, proteins were confidently identified at the species/strain level. The maximum percentage of the proteome detected for the abundant organisms was ~45%. A time-dependent increase in the relative abundance of microbial versus human proteins suggested increasing microbial colonization during the first few weeks of early life. We observed remarkable variations and temporal shifts in the relative protein abundances of each organism in these preterm gut communities. Given the dissimilarity of the communities, only 81 microbial EggNOG orthologous groups and 57 human proteins were observed across all samples. These conserved microbial proteins were involved in carbohydrate, energy, amino acid and nucleotide metabolism while conserved human proteins were related to immune response and mucosal maturation. We identified seven proteome clusters for the communities and showed infant gut proteome profiles were unstable across time and not individual-specific. Applying a gut-specific metabolic module (GMM) analysis, we found that gut communities varied primarily in the contribution of nutrient (carbohydrates, lipids, and amino acids

  18. Microbial metatranscriptomics in a permanent marine oxygen minimum zone

    Stewart, Frank J.; Ulloa, Osvaldo; DeLong, Edward

    2010-01-01

    Simultaneous characterization of taxonomic composition, metabolic gene content and gene expression in marine oxygen minimum zones (OMZs) has potential to broaden perspectives on the microbial and biogeochemical dynamics in these environments. Here, we present a metatranscriptomic survey of microbial community metabolism in the Eastern Tropical South Pacific OMZ off northern Chile. Community RNA was sampled in late austral autumn from four depths (50, 85, 110, 200 m) extending across the oxycl...

  19. Metatranscriptomic analyses of honey bee colonies.

    Tozkar, Cansu Ö; Kence, Meral; Kence, Aykut; Huang, Qiang; Evans, Jay D

    2015-01-01

    Honey bees face numerous biotic threats from viruses to bacteria, fungi, protists, and mites. Here we describe a thorough analysis of microbes harbored by worker honey bees collected from field colonies in geographically distinct regions of Turkey. Turkey is one of the World's most important centers of apiculture, harboring five subspecies of Apis mellifera L., approximately 20% of the honey bee subspecies in the world. We use deep ILLUMINA-based RNA sequencing to capture RNA species for the honey bee and a sampling of all non-endogenous species carried by bees. After trimming and mapping these reads to the honey bee genome, approximately 10% of the sequences (9-10 million reads per library) remained. These were then mapped to a curated set of public sequences containing ca. Sixty megabase-pairs of sequence representing known microbial species associated with honey bees. Levels of key honey bee pathogens were confirmed using quantitative PCR screens. We contrast microbial matches across different sites in Turkey, showing new country recordings of Lake Sinai virus, two Spiroplasma bacterium species, symbionts Candidatus Schmidhempelia bombi, Frischella perrara, Snodgrassella alvi, Gilliamella apicola, Lactobacillus spp.), neogregarines, and a trypanosome species. By using metagenomic analysis, this study also reveals deep molecular evidence for the presence of bacterial pathogens (Melissococcus plutonius, Paenibacillus larvae), Varroa destructor-1 virus, Sacbrood virus, and fungi. Despite this effort we did not detect KBV, SBPV, Tobacco ringspot virus, VdMLV (Varroa Macula like virus), Acarapis spp., Tropilaeleps spp. and Apocephalus (phorid fly). We discuss possible impacts of management practices and honey bee subspecies on microbial retinues. The described workflow and curated microbial database will be generally useful for microbial surveys of healthy and declining honey bees.

  20. A primer on metagenomics.

    John C Wooley

    2010-02-01

    Full Text Available Metagenomics is a discipline that enables the genomic study of uncultured microorganisms. Faster, cheaper sequencing technologies and the ability to sequence uncultured microbes sampled directly from their habitats are expanding and transforming our view of the microbial world. Distilling meaningful information from the millions of new genomic sequences presents a serious challenge to bioinformaticians. In cultured microbes, the genomic data come from a single clone, making sequence assembly and annotation tractable. In metagenomics, the data come from heterogeneous microbial communities, sometimes containing more than 10,000 species, with the sequence data being noisy and partial. From sampling, to assembly, to gene calling and function prediction, bioinformatics faces new demands in interpreting voluminous, noisy, and often partial sequence data. Although metagenomics is a relative newcomer to science, the past few years have seen an explosion in computational methods applied to metagenomic-based research. It is therefore not within the scope of this article to provide an exhaustive review. Rather, we provide here a concise yet comprehensive introduction to the current computational requirements presented by metagenomics, and review the recent progress made. We also note whether there is software that implements any of the methods presented here, and briefly review its utility. Nevertheless, it would be useful if readers of this article would avail themselves of the comment section provided by this journal, and relate their own experiences. Finally, the last section of this article provides a few representative studies illustrating different facets of recent scientific discoveries made using metagenomics.

  1. MetaGOmics: A Web-Based Tool for Peptide-Centric Functional and Taxonomic Analysis of Metaproteomics Data.

    Riffle, Michael; May, Damon H; Timmins-Schiffman, Emma; Mikan, Molly P; Jaschob, Daniel; Noble, William Stafford; Nunn, Brook L

    2017-12-27

    Metaproteomics is the characterization of all proteins being expressed by a community of organisms in a complex biological sample at a single point in time. Applications of metaproteomics range from the comparative analysis of environmental samples (such as ocean water and soil) to microbiome data from multicellular organisms (such as the human gut). Metaproteomics research is often focused on the quantitative functional makeup of the metaproteome and which organisms are making those proteins. That is: What are the functions of the currently expressed proteins? How much of the metaproteome is associated with those functions? And, which microorganisms are expressing the proteins that perform those functions? However, traditional protein-centric functional analysis is greatly complicated by the large size, redundancy, and lack of biological annotations for the protein sequences in the database used to search the data. To help address these issues, we have developed an algorithm and web application (dubbed "MetaGOmics") that automates the quantitative functional (using Gene Ontology) and taxonomic analysis of metaproteomics data and subsequent visualization of the results. MetaGOmics is designed to overcome the shortcomings of traditional proteomics analysis when used with metaproteomics data. It is easy to use, requires minimal input, and fully automates most steps of the analysis-including comparing the functional makeup between samples. MetaGOmics is freely available at https://www.yeastrc.org/metagomics/.

  2. MetaGOmics: A Web-Based Tool for Peptide-Centric Functional and Taxonomic Analysis of Metaproteomics Data

    Michael Riffle

    2017-12-01

    Full Text Available Metaproteomics is the characterization of all proteins being expressed by a community of organisms in a complex biological sample at a single point in time. Applications of metaproteomics range from the comparative analysis of environmental samples (such as ocean water and soil to microbiome data from multicellular organisms (such as the human gut. Metaproteomics research is often focused on the quantitative functional makeup of the metaproteome and which organisms are making those proteins. That is: What are the functions of the currently expressed proteins? How much of the metaproteome is associated with those functions? And, which microorganisms are expressing the proteins that perform those functions? However, traditional protein-centric functional analysis is greatly complicated by the large size, redundancy, and lack of biological annotations for the protein sequences in the database used to search the data. To help address these issues, we have developed an algorithm and web application (dubbed “MetaGOmics” that automates the quantitative functional (using Gene Ontology and taxonomic analysis of metaproteomics data and subsequent visualization of the results. MetaGOmics is designed to overcome the shortcomings of traditional proteomics analysis when used with metaproteomics data. It is easy to use, requires minimal input, and fully automates most steps of the analysis—including comparing the functional makeup between samples. MetaGOmics is freely available at https://www.yeastrc.org/metagomics/.

  3. Critical Assessment of Metagenome Interpretation

    Sczyrba, Alexander; Hofmann, Peter; Belmann, Peter

    2017-01-01

    Methods for assembly, taxonomic profiling and binning are key to interpreting metagenome data, but a lack of consensus about benchmarking complicates performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchma...

  4. Characterizing the metatranscriptomic profile of archaeal metabolic genes at deep-sea hydrothermal vents in the Mid-Cayman Rise

    Galambos, D.; Reveillaud, J. C.; Anderson, R.; Huber, J. A.

    2017-12-01

    Deep-sea hydrothermal vent systems host a wide diversity of bacteria, archaea and viruses. Although the geochemical conditions at these vents are well-documented, the relative metabolic activity of microbial lineages, especially among archaea, remains poorly characterized. The deep, slow-spreading Mid-Cayman Rise, which hosts the mafic-influenced Piccard and ultramafic-influenced Von Damm vent fields, allows for the comparison of vent sites with different geochemical characteristics. Previous metagenomic work indicated that despite the distinct geochemistry at Von Damm and Piccard, the functional profile of microbial communities between the two sites was similar. We examined relative metabolic gene activity using a metatranscriptomic analysis and observed functional similarity between Von Damm and Piccard, which is consistent with previous results. Notably, the relative expression of the methyl-coenzyme M reductase (mcr) gene was elevated in both vent fields. Additionally, we analyzed the ratio of RNA expression to DNA abundance of fifteen archaeal metagenome-assembled genomes (MAGs) across the two fields. Previous work showed higher archaeal diversity at Von Damm; our results indicate relatively even expression among archaeal lineages at Von Damm. In contrast, we observed lower archaeal diversity at Piccard, but individual archaeal lineages were very highly expressed; Thermoprotei showed elevated transcriptional activity, which is consistent with higher temperatures and sulfur levels at Piccard. At both Von Damm and Piccard, specific Methanococcus lineages were more highly expressed than others. Future analyses will more closely examine metabolic genes in these Methanococcus MAGs to determine why some lineages are more active at a vent field than others. We will conduct further statistical analyses to determine whether significant differences exist between Von Damm and Piccard and whether there are correlations between geochemical metadata and metabolic gene or

  5. Beyond biodiversity: fish metagenomes.

    Alba Ardura

    Full Text Available Biodiversity and intra-specific genetic diversity are interrelated and determine the potential of a community to survive and evolve. Both are considered together in Prokaryote communities treated as metagenomes or ensembles of functional variants beyond species limits.Many factors alter biodiversity in higher Eukaryote communities, and human exploitation can be one of the most important for some groups of plants and animals. For example, fisheries can modify both biodiversity and genetic diversity (intra specific. Intra-specific diversity can be drastically altered by overfishing. Intense fishing pressure on one stock may imply extinction of some genetic variants and subsequent loss of intra-specific diversity. The objective of this study was to apply a metagenome approach to fish communities and explore its value for rapid evaluation of biodiversity and genetic diversity at community level. Here we have applied the metagenome approach employing the barcoding target gene coi as a model sequence in catch from four very different fish assemblages exploited by fisheries: freshwater communities from the Amazon River and northern Spanish rivers, and marine communities from the Cantabric and Mediterranean seas.Treating all sequences obtained from each regional catch as a biological unit (exploited community we found that metagenomic diversity indices of the Amazonian catch sample here examined were lower than expected. Reduced diversity could be explained, at least partially, by overexploitation of the fish community that had been independently estimated by other methods.We propose using a metagenome approach for estimating diversity in Eukaryote communities and early evaluating genetic variation losses at multi-species level.

  6. Beyond biodiversity: fish metagenomes.

    Ardura, Alba; Planes, Serge; Garcia-Vazquez, Eva

    2011-01-01

    Biodiversity and intra-specific genetic diversity are interrelated and determine the potential of a community to survive and evolve. Both are considered together in Prokaryote communities treated as metagenomes or ensembles of functional variants beyond species limits.Many factors alter biodiversity in higher Eukaryote communities, and human exploitation can be one of the most important for some groups of plants and animals. For example, fisheries can modify both biodiversity and genetic diversity (intra specific). Intra-specific diversity can be drastically altered by overfishing. Intense fishing pressure on one stock may imply extinction of some genetic variants and subsequent loss of intra-specific diversity. The objective of this study was to apply a metagenome approach to fish communities and explore its value for rapid evaluation of biodiversity and genetic diversity at community level. Here we have applied the metagenome approach employing the barcoding target gene coi as a model sequence in catch from four very different fish assemblages exploited by fisheries: freshwater communities from the Amazon River and northern Spanish rivers, and marine communities from the Cantabric and Mediterranean seas.Treating all sequences obtained from each regional catch as a biological unit (exploited community) we found that metagenomic diversity indices of the Amazonian catch sample here examined were lower than expected. Reduced diversity could be explained, at least partially, by overexploitation of the fish community that had been independently estimated by other methods.We propose using a metagenome approach for estimating diversity in Eukaryote communities and early evaluating genetic variation losses at multi-species level.

  7. Comparative metatranscriptomics reveals decline of a neustonic planktonic population

    Mojib, Nazia; Thimma, Manjula; Kumaran, M.; Sougrat, Rachid; Irigoien, Xabier

    2016-01-01

    The neuston layer in tropical seas provides a good model to study the effects of increased levels of different stressors (e.g., temperature, ultraviolet radiation and Trichodesmium blooms). Here, we use a comparative in situ metatranscriptomics approach to reveal the functional genomic composition of metabolically active neustonic mesozooplankton community in response to the summer conditions in the Red Sea. The neustonic population exhibited changes in composition and abundance with a significant decline in copepods and appendicularia in July, when Trichodesmium cells were more abundant along with high temperatures and UV-B radiation. Nearly 23,000 genes were differentially expressed at the community level when the metatranscriptomes of the neustonic zooplankton were compared in April, July, and October. On a wider Phylum level, the genes related to oxidative phosphorylation, carbon, nucleotides, amino acids, and lipids were significantly overrepresented in both arthropods and chordates in April and October. On organism level for copepods, expression of genes responsive to oxidative stress, defense against bacteria, immune response, and virus reproduction were increased along with the observed increased appearance of copepod carcasses in the samples collected during July. The differences in expression correspond either to secondary effects of the Trichodesmium bloom or more likely to the increased UV-B radiation in July. Given the dearth of information on the zooplankton gene expression in response to environmental stimuli, our study provides the first transcriptome landscape of the mesozooplankton community during a period of increased mortality of the copepod and appendicularia population.

  8. Comparative metatranscriptomics reveals decline of a neustonic planktonic population

    Mojib, Nazia

    2016-10-20

    The neuston layer in tropical seas provides a good model to study the effects of increased levels of different stressors (e.g., temperature, ultraviolet radiation and Trichodesmium blooms). Here, we use a comparative in situ metatranscriptomics approach to reveal the functional genomic composition of metabolically active neustonic mesozooplankton community in response to the summer conditions in the Red Sea. The neustonic population exhibited changes in composition and abundance with a significant decline in copepods and appendicularia in July, when Trichodesmium cells were more abundant along with high temperatures and UV-B radiation. Nearly 23,000 genes were differentially expressed at the community level when the metatranscriptomes of the neustonic zooplankton were compared in April, July, and October. On a wider Phylum level, the genes related to oxidative phosphorylation, carbon, nucleotides, amino acids, and lipids were significantly overrepresented in both arthropods and chordates in April and October. On organism level for copepods, expression of genes responsive to oxidative stress, defense against bacteria, immune response, and virus reproduction were increased along with the observed increased appearance of copepod carcasses in the samples collected during July. The differences in expression correspond either to secondary effects of the Trichodesmium bloom or more likely to the increased UV-B radiation in July. Given the dearth of information on the zooplankton gene expression in response to environmental stimuli, our study provides the first transcriptome landscape of the mesozooplankton community during a period of increased mortality of the copepod and appendicularia population.

  9. Combining metagenomics with metaproteomics and stable isotope probing reveals metabolic pathways used by a naturally occurring marine methylotroph

    Grob, Carolina; Taubert, Martin; Howat, Alexandra M.

    2015-01-01

    A variety of culture-independent techniques have been developed that can be used in conjunction with culture-dependent physiological and metabolic studies of key microbial organisms in order to better understand how the activity of natural populations influences and regulates all major......, we retrieved virtually the whole genome of this bacterium and determined its metabolic potential. Through protein-stable isotope probing, the RuMP cycle was established as the main carbon assimilation pathway, and the classical methanol dehydrogenase-encoding gene mxaF, as well as three out of four...... identified xoxF homologues were found to be expressed. This proof-of-concept study is the first in which the culture-independent techniques of DNA-SIP and protein-SIP have been used to characterize the metabolism of a naturally occurring Methylophaga-like bacterium in the marine environment (i...

  10. Soil metagenomics and tropical soil productivity

    Garrett, Karen A.

    2009-01-01

    This presentation summarizes research in the soil metagenomics cross cutting research activity. Soil metagenomics studies soil microbial communities as contributors to soil health.C CCRA-4 (Soil Metagenomics)

  11. Metaproteomics of cellulose methanisation under thermophilic conditions reveals a surprisingly high proteolytic activity.

    Lü, Fan; Bize, Ariane; Guillot, Alain; Monnet, Véronique; Madigou, Céline; Chapleur, Olivier; Mazéas, Laurent; He, Pinjing; Bouchez, Théodore

    2014-01-01

    Cellulose is the most abundant biopolymer on Earth. Optimising energy recovery from this renewable but recalcitrant material is a key issue. The metaproteome expressed by thermophilic communities during cellulose anaerobic digestion was investigated in microcosms. By multiplying the analytical replicates (65 protein fractions analysed by MS/MS) and relying solely on public protein databases, more than 500 non-redundant protein functions were identified. The taxonomic community structure as inferred from the metaproteomic data set was in good overall agreement with 16S rRNA gene tag pyrosequencing and fluorescent in situ hybridisation analyses. Numerous functions related to cellulose and hemicellulose hydrolysis and fermentation catalysed by bacteria related to Caldicellulosiruptor spp. and Clostridium thermocellum were retrieved, indicating their key role in the cellulose-degradation process and also suggesting their complementary action. Despite the abundance of acetate as a major fermentation product, key methanogenesis enzymes from the acetoclastic pathway were not detected. In contrast, enzymes from the hydrogenotrophic pathway affiliated to Methanothermobacter were almost exclusively identified for methanogenesis, suggesting a syntrophic acetate oxidation process coupled to hydrogenotrophic methanogenesis. Isotopic analyses confirmed the high dominance of the hydrogenotrophic methanogenesis. Very surprising was the identification of an abundant proteolytic activity from Coprothermobacter proteolyticus strains, probably acting as scavenger and/or predator performing proteolysis and fermentation. Metaproteomics thus appeared as an efficient tool to unravel and characterise metabolic networks as well as ecological interactions during methanisation bioprocesses. More generally, metaproteomics provides direct functional insights at a limited cost, and its attractiveness should increase in the future as sequence databases are growing exponentially.

  12. Metaproteome analysis of endodontic infections in association with different clinical conditions.

    José Claudio Provenzano

    Full Text Available Analysis of the metaproteome of microbial communities is important to provide an insight of community physiology and pathogenicity. This study evaluated the metaproteome of endodontic infections associated with acute apical abscesses and asymptomatic apical periodontitis lesions. Proteins persisting or expressed after root canal treatment were also evaluated. Finally, human proteins associated with these infections were identified. Samples were taken from root canals of teeth with asymptomatic apical periodontitis before and after chemomechanical treatment using either NaOCl or chlorhexidine as the irrigant. Samples from abscesses were taken by aspiration of the purulent exudate. Clinical samples were processed for analysis of the exoproteome by using two complementary mass spectrometry platforms: nanoflow liquid chromatography coupled with linear ion trap quadrupole Velos Orbitrap and liquid chromatography-quadrupole time-of-flight. A total of 308 proteins of microbial origin were identified. The number of proteins in abscesses was higher than in asymptomatic cases. In canals irrigated with chlorhexidine, the number of identified proteins decreased substantially, while in the NaOCl group the number of proteins increased. The large majority of microbial proteins found in endodontic samples were related to metabolic and housekeeping processes, including protein synthesis, energy metabolism and DNA processes. Moreover, several other proteins related to pathogenicity and resistance/survival were found, including proteins involved with adhesion, biofilm formation and antibiotic resistance, stress proteins, exotoxins, invasins, proteases and endopeptidases (mostly in abscesses, and an archaeal protein linked to methane production. The majority of human proteins detected were related to cellular processes and metabolism, as well as immune defense. Interrogation of the metaproteome of endodontic microbial communities provides information on the

  13. Metaproteome analysis of endodontic infections in association with different clinical conditions.

    Provenzano, José Claudio; Siqueira, José F; Rôças, Isabela N; Domingues, Romênia R; Paes Leme, Adriana F; Silva, Márcia R S

    2013-01-01

    Analysis of the metaproteome of microbial communities is important to provide an insight of community physiology and pathogenicity. This study evaluated the metaproteome of endodontic infections associated with acute apical abscesses and asymptomatic apical periodontitis lesions. Proteins persisting or expressed after root canal treatment were also evaluated. Finally, human proteins associated with these infections were identified. Samples were taken from root canals of teeth with asymptomatic apical periodontitis before and after chemomechanical treatment using either NaOCl or chlorhexidine as the irrigant. Samples from abscesses were taken by aspiration of the purulent exudate. Clinical samples were processed for analysis of the exoproteome by using two complementary mass spectrometry platforms: nanoflow liquid chromatography coupled with linear ion trap quadrupole Velos Orbitrap and liquid chromatography-quadrupole time-of-flight. A total of 308 proteins of microbial origin were identified. The number of proteins in abscesses was higher than in asymptomatic cases. In canals irrigated with chlorhexidine, the number of identified proteins decreased substantially, while in the NaOCl group the number of proteins increased. The large majority of microbial proteins found in endodontic samples were related to metabolic and housekeeping processes, including protein synthesis, energy metabolism and DNA processes. Moreover, several other proteins related to pathogenicity and resistance/survival were found, including proteins involved with adhesion, biofilm formation and antibiotic resistance, stress proteins, exotoxins, invasins, proteases and endopeptidases (mostly in abscesses), and an archaeal protein linked to methane production. The majority of human proteins detected were related to cellular processes and metabolism, as well as immune defense. Interrogation of the metaproteome of endodontic microbial communities provides information on the physiology and

  14. A metaproteomic approach to study human-microbial ecosystems at the mucosal luminal interface.

    Xiaoxiao Li

    Full Text Available Aberrant interactions between the host and the intestinal bacteria are thought to contribute to the pathogenesis of many digestive diseases. However, studying the complex ecosystem at the human mucosal-luminal interface (MLI is challenging and requires an integrative systems biology approach. Therefore, we developed a novel method integrating lavage sampling of the human mucosal surface, high-throughput proteomics, and a unique suite of bioinformatic and statistical analyses. Shotgun proteomic analysis of secreted proteins recovered from the MLI confirmed the presence of both human and bacterial components. To profile the MLI metaproteome, we collected 205 mucosal lavage samples from 38 healthy subjects, and subjected them to high-throughput proteomics. The spectral data were subjected to a rigorous data processing pipeline to optimize suitability for quantitation and analysis, and then were evaluated using a set of biostatistical tools. Compared to the mucosal transcriptome, the MLI metaproteome was enriched for extracellular proteins involved in response to stimulus and immune system processes. Analysis of the metaproteome revealed significant individual-related as well as anatomic region-related (biogeographic features. Quantitative shotgun proteomics established the identity and confirmed the biogeographic association of 49 proteins (including 3 functional protein networks demarcating the proximal and distal colon. This robust and integrated proteomic approach is thus effective for identifying functional features of the human mucosal ecosystem, and a fresh understanding of the basic biology and disease processes at the MLI.

  15. Metatranscriptomics reveals the molecular mechanism of large granule formation in granular anammox reactor

    Bagchi, Samik; Lamendella, Regina; Strutt, Steven; Van Loosdrecht, Mark C. M.; Saikaly, Pascal

    2016-01-01

    to formation of large granules. Size distribution analysis revealed the spatial distribution of granules in which large granules having higher abundance of anammox bacteria (genus Brocadia) dominated the bottom biomass. Metatranscriptomics analysis detected all

  16. The YNP metagenome project

    Inskeep, William P.; Jay, Zackary J.; Tringe, Susannah G.

    2013-01-01

    The Yellowstone geothermal complex contains over 10,000 diverse geothermal features that host numerous phylogenetically deeply rooted and poorly understood archaea, bacteria, and viruses. Microbial communities in high-temperature environments are generally less diverse than soil, marine, sediment......, and environmental variables. Twenty geochemically distinct geothermal ecosystems representing a broad spectrum of Yellowstone hot-spring environments were used for metagenomic and geochemical analysis and included approximately equal numbers of: (1) phototrophic mats, (2) “filamentous streamer” communities, and (3...

  17. RNA extraction from decaying wood for (meta)transcriptomic analyses.

    Adamo, Martino; Voyron, Samuele; Girlanda, Mariangela; Marmeisse, Roland

    2017-10-01

    Wood decomposition is a key step of the terrestrial carbon cycle and is of economic importance. It is essentially a microbiological process performed by fungi and to an unknown extent by bacteria. To gain access to the genes expressed by the diverse microbial communities participating in wood decay, we developed an RNA extraction protocol from this recalcitrant material rich in polysaccharides and phenolic compounds. This protocol was implemented on 22 wood samples representing as many tree species from 11 plant families in the Angiosperms and Gymnosperms. RNA was successfully extracted from all samples and converted into cDNAs from which were amplified both fungal and bacterial protein coding genes, including genes encoding hydrolytic enzymes participating in lignocellulose hydrolysis. This protocol applicable to a wide range of decomposing wood types represents a first step towards a metatranscriptomic analysis of wood degradation under natural conditions.

  18. Databases of the marine metagenomics

    Mineta, Katsuhiko

    2015-10-28

    The metagenomic data obtained from marine environments is significantly useful for understanding marine microbial communities. In comparison with the conventional amplicon-based approach of metagenomics, the recent shotgun sequencing-based approach has become a powerful tool that provides an efficient way of grasping a diversity of the entire microbial community at a sampling point in the sea. However, this approach accelerates accumulation of the metagenome data as well as increase of data complexity. Moreover, when metagenomic approach is used for monitoring a time change of marine environments at multiple locations of the seawater, accumulation of metagenomics data will become tremendous with an enormous speed. Because this kind of situation has started becoming of reality at many marine research institutions and stations all over the world, it looks obvious that the data management and analysis will be confronted by the so-called Big Data issues such as how the database can be constructed in an efficient way and how useful knowledge should be extracted from a vast amount of the data. In this review, we summarize the outline of all the major databases of marine metagenome that are currently publically available, noting that database exclusively on marine metagenome is none but the number of metagenome databases including marine metagenome data are six, unexpectedly still small. We also extend our explanation to the databases, as reference database we call, that will be useful for constructing a marine metagenome database as well as complementing important information with the database. Then, we would point out a number of challenges to be conquered in constructing the marine metagenome database.

  19. Linking metatranscriptomic to bioremediation processes of oil contaminated marine sediments

    Cuny, P.; Atkinson, A.; Léa, S.; Guasco, S.; Jezequel, R.; Armougom, F.; Michotey, V.; Bonin, P.; Militon, C.

    2016-02-01

    Oil-derived hydrocarbons are one major source of pollution of marine ecosystems. In coastal marine areas they tend to accumulate in the sediment where they can impact the benthic communities. Oil hydrocarbons biodegradation by microorganisms is known to be one of the prevalent processes acting in the removal of these contaminants from sediments. The redox oscillation regimes generated by bioturbation, and the efficiency of metabolic coupling between functional groups associated to these specific redox regimes, are probably determinant factors controlling hydrocarbon biodegradation. Metatranscriptomic analysis appears like a promising approach to shed new light on the metabolic processes involved in the response of microbial communities to oil contamination in such oxic/anoxic oscillating environments. In the framework of the DECAPAGE project (ANR CESA-2011-006 01), funded by the French National Agency for Research, the metatranscriptomes (RNA-seq) of oil contaminated or not (Ural blend crude oil, 5 000 ppm) and bioturbated or not (addition of the common burrowing organism Hediste diversicolor, 1000 ind/m2) mudflat sediments, incubated in microcosms during 4 months at 19±1°C, were compared. The analysis of active microbial communities by SSU rRNA barcoding shows that the main observable changes are due to the presence of H. diversicolor. On the contrary, oil addition is the main factor explaining the observed changes in the genes expression patterns with 1949 genes specifically up or down-regulated (which is the case of only 245 genes when only H. diversicolor worms are added). In particular, the oil contamination leads to a marked overexpression (i) of benzyl- and alkylsuccinate synthase genes (ass and bss) that are involved in the anaerobic metabolism of aromatics (toluene) and alkanes, respectively and, (ii) of genes coding for nucleotide excision repair exonucleases indicating that DNA repair processes are also activated.

  20. Microbiota and Metatranscriptome Changes Accompanying the Onset of Gingivitis

    2018-01-01

    ABSTRACT Over half of adults experience gingivitis, a mild yet treatable form of periodontal disease caused by the overgrowth of oral microbes. Left untreated, gingivitis can progress to a more severe and irreversible disease, most commonly chronic periodontitis. While periodontal diseases are associated with a shift in the oral microbiota composition, it remains unclear how this shift impacts microbiota function early in disease progression. Here, we analyzed the transition from health to gingivitis through both 16S v4-v5 rRNA amplicon and metatranscriptome sequencing of subgingival plaque samples from individuals undergoing an experimental gingivitis treatment. Beta-diversity analysis of 16S rRNA reveals that samples cluster based on disease severity and patient but not by oral hygiene status. Significant shifts in the abundance of several genera occurred during disease transition, suggesting a dysbiosis due to development of gingivitis. Comparing taxonomic abundance with transcriptomic activity revealed concordance of bacterial diversity composition between the two quantification assays in samples originating from both healthy and diseased teeth. Metatranscriptome sequencing analysis indicates that during the early stages of transition to gingivitis, a number of virulence-related transcripts were significantly differentially expressed in individual and across pooled patient samples. Upregulated genes include those involved in proteolytic and nucleolytic processes, while expression levels of those involved in surface structure assembly and other general virulence functions leading to colonization or adaptation within the host are more dynamic. These findings help characterize the transition from health to periodontal disease and identify genes associated with early disease. PMID:29666288

  1. Microbiota and Metatranscriptome Changes Accompanying the Onset of Gingivitis

    Emily M. Nowicki

    2018-04-01

    Full Text Available Over half of adults experience gingivitis, a mild yet treatable form of periodontal disease caused by the overgrowth of oral microbes. Left untreated, gingivitis can progress to a more severe and irreversible disease, most commonly chronic periodontitis. While periodontal diseases are associated with a shift in the oral microbiota composition, it remains unclear how this shift impacts microbiota function early in disease progression. Here, we analyzed the transition from health to gingivitis through both 16S v4-v5 rRNA amplicon and metatranscriptome sequencing of subgingival plaque samples from individuals undergoing an experimental gingivitis treatment. Beta-diversity analysis of 16S rRNA reveals that samples cluster based on disease severity and patient but not by oral hygiene status. Significant shifts in the abundance of several genera occurred during disease transition, suggesting a dysbiosis due to development of gingivitis. Comparing taxonomic abundance with transcriptomic activity revealed concordance of bacterial diversity composition between the two quantification assays in samples originating from both healthy and diseased teeth. Metatranscriptome sequencing analysis indicates that during the early stages of transition to gingivitis, a number of virulence-related transcripts were significantly differentially expressed in individual and across pooled patient samples. Upregulated genes include those involved in proteolytic and nucleolytic processes, while expression levels of those involved in surface structure assembly and other general virulence functions leading to colonization or adaptation within the host are more dynamic. These findings help characterize the transition from health to periodontal disease and identify genes associated with early disease.

  2. Metagenome Assembly at the DOE JGI (Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Chain, Patrick

    2011-10-13

    Patrick Chain of DOE JGI at LANL, Co-Chair of the Metagenome-specific Assembly session, on Metagenome Assembly at the DOE JGIat the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  3. Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes

    White, Richard Allen; Bottos, Eric M.; Roy Chowdhury, Taniya; Zucker, Jeremy D.; Brislawn, Colin J.; Nicora, Carrie D.; Fansler, Sarah J.; Glaesemann, Kurt R.; Glass, Kevin; Jansson, Janet K.; Langille, Morgan

    2016-06-28

    ABSTRACT

    Soil metagenomics has been touted as the “grand challenge” for metagenomics, as the high microbial diversity and spatial heterogeneity of soils make them unamenable to current assembly platforms. Here, we aimed to improve soil metagenomic sequence assembly by applying the Moleculo synthetic long-read sequencing technology. In total, we obtained 267 Gbp of raw sequence data from a native prairie soil; these data included 109.7 Gbp of short-read data (~100 bp) from the Joint Genome Institute (JGI), an additional 87.7 Gbp of rapid-mode read data (~250 bp), plus 69.6 Gbp (>1.5 kbp) from Moleculo sequencing. The Moleculo data alone yielded over 5,600 reads of >10 kbp in length, and over 95% of the unassembled reads mapped to contigs of >1.5 kbp. Hybrid assembly of all data resulted in more than 10,000 contigs over 10 kbp in length. We mapped three replicate metatranscriptomes derived from the same parent soil to the Moleculo subassembly and found that 95% of the predicted genes, based on their assignments to Enzyme Commission (EC) numbers, were expressed. The Moleculo subassembly also enabled binning of >100 microbial genome bins. We obtained via direct binning the first complete genome, that of “CandidatusPseudomonas sp. strain JKJ-1” from a native soil metagenome. By mapping metatranscriptome sequence reads back to the bins, we found that several bins corresponding to low-relative-abundanceAcidobacteriawere highly transcriptionally active, whereas bins corresponding to high-relative-abundanceVerrucomicrobiawere not. These results demonstrate that Moleculo sequencing provides a significant advance for resolving complex soil microbial communities.

    IMPORTANCESoil microorganisms carry out key processes for life on our planet, including cycling of carbon and other nutrients and supporting growth of plants. However, there is poor molecular-level understanding of their

  4. Metaproteomics of Microbiota in Naturally Fermented Soybean Paste, Da-jiang.

    Zhang, Ping; Zhang, Pengfei; Xie, Mengxi; An, Feiyu; Qiu, Boshu; Wu, Rina

    2018-05-01

    Da-jiang is a typical traditional fermented soybean product in China. At present, the proteins in da-jiang are needed to be explored. The composition and species of microbial proteins in traditional fermented da-jiang were analyzed by metaproteomics based on sodium dodecyl sulfonate-polyacrylamide gel electrophoresis (SDS-PAGE) and liquid chromatography-mass spectrometry/mass spectrometry (LC-MS/MS). The results showed that the number and variety of microbial proteins in the traditional fermented da-jiang from different regions were different. The production site influences the fermentation in da-jiang. Then we analyzed the functions of the microbial proteins identified in da-jiang, and found that they were mainly involved in the process of protein synthesis, glycometabolism and nucleic acid synthesis. In addtion, we compared the proteins composition in different da-jiang. There are 51 common proteins of naturally fermented da-jiang, and 25 common microbial sources. The main commonly microbial sources of fungal proteins are Saccharomyces cerevisiae and Schizosaccharomyces; the main commonly microbial sources of bacterial proteins are Enterococcus faecalis, Leuconostoc mesenteroides, Acinetobacter baumannii, and Bacillus subtilis. These common microbes play the predominant role in da-jiang fermentation. The present results help us to understand the fermentation of da-jiang and improve the quality and safety of final products in the future. The study illustrated metaproteome of microbiota in traditional fermented soybean paste, da-jiang, by sodium dodecyl sulfonate-polyacrylamide gel electrophoresis (SDS-PAGE) and liquid chromatography-mass spectrometry/mass spectrometry (LC-MS/MS). A method of extracting metaproteome from microbiota in da-jiang was attempted. The findings help to understand the fermentation of da-jiang and improve the quality and safety of da-jiang in fermented industry. © 2018 Institute of Food Technologists®.

  5. Microbiota and Metatranscriptome Changes Accompanying the Onset of Gingivitis.

    Nowicki, Emily M; Shroff, Raghav; Singleton, Jacqueline A; Renaud, Diane E; Wallace, Debra; Drury, Julie; Zirnheld, Jolene; Colleti, Brock; Ellington, Andrew D; Lamont, Richard J; Scott, David A; Whiteley, Marvin

    2018-04-17

    Over half of adults experience gingivitis, a mild yet treatable form of periodontal disease caused by the overgrowth of oral microbes. Left untreated, gingivitis can progress to a more severe and irreversible disease, most commonly chronic periodontitis. While periodontal diseases are associated with a shift in the oral microbiota composition, it remains unclear how this shift impacts microbiota function early in disease progression. Here, we analyzed the transition from health to gingivitis through both 16S v4-v5 rRNA amplicon and metatranscriptome sequencing of subgingival plaque samples from individuals undergoing an experimental gingivitis treatment. Beta-diversity analysis of 16S rRNA reveals that samples cluster based on disease severity and patient but not by oral hygiene status. Significant shifts in the abundance of several genera occurred during disease transition, suggesting a dysbiosis due to development of gingivitis. Comparing taxonomic abundance with transcriptomic activity revealed concordance of bacterial diversity composition between the two quantification assays in samples originating from both healthy and diseased teeth. Metatranscriptome sequencing analysis indicates that during the early stages of transition to gingivitis, a number of virulence-related transcripts were significantly differentially expressed in individual and across pooled patient samples. Upregulated genes include those involved in proteolytic and nucleolytic processes, while expression levels of those involved in surface structure assembly and other general virulence functions leading to colonization or adaptation within the host are more dynamic. These findings help characterize the transition from health to periodontal disease and identify genes associated with early disease. IMPORTANCE Although more than 50% of adults have some form of periodontal disease, there remains a significant gap in our understanding of its underlying cause. We initiated this study in order to

  6. Assembling large, complex environmental metagenomes

    Howe, A. C. [Michigan State Univ., East Lansing, MI (United States). Microbiology and Molecular Genetics, Plant Soil and Microbial Sciences; Jansson, J. [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Earth Sciences Division; Malfatti, S. A. [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Tringe, S. G. [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Tiedje, J. M. [Michigan State Univ., East Lansing, MI (United States). Microbiology and Molecular Genetics, Plant Soil and Microbial Sciences; Brown, C. T. [Michigan State Univ., East Lansing, MI (United States). Microbiology and Molecular Genetics, Computer Science and Engineering

    2012-12-28

    The large volumes of sequencing data required to sample complex environments deeply pose new challenges to sequence analysis approaches. De novo metagenomic assembly effectively reduces the total amount of data to be analyzed but requires significant computational resources. We apply two pre-assembly filtering approaches, digital normalization and partitioning, to make large metagenome assemblies more computationaly tractable. Using a human gut mock community dataset, we demonstrate that these methods result in assemblies nearly identical to assemblies from unprocessed data. We then assemble two large soil metagenomes from matched Iowa corn and native prairie soils. The predicted functional content and phylogenetic origin of the assembled contigs indicate significant taxonomic differences despite similar function. The assembly strategies presented are generic and can be extended to any metagenome; full source code is freely available under a BSD license.

  7. Microbial metaproteomics for characterizing the range of metabolic functions and activities of human gut microbiota.

    Xiong, Weili; Abraham, Paul E; Li, Zhou; Pan, Chongle; Hettich, Robert L

    2015-10-01

    The human gastrointestinal tract is a complex, dynamic ecosystem that consists of a carefully tuned balance of human host and microbiota membership. The microbiome is not merely a collection of opportunistic parasites, but rather provides important functions to the host that are absolutely critical to many aspects of health, including nutrient transformation and absorption, drug metabolism, pathogen defense, and immune system development. Microbial metaproteomics provides the ability to characterize the human gut microbiota functions and metabolic activities at a remarkably deep level, revealing information about microbiome development and stability as well as their interactions with their human host. Generally, microbial and human proteins can be extracted and then measured by high performance MS-based proteomics technology. Here, we review the field of human gut microbiome metaproteomics, with a focus on the experimental and informatics considerations involved in characterizing systems ranging from low-complexity model gut microbiota in gnotobiotic mice, to the emerging gut microbiome in the GI tract of newborn human infants, and finally to an established gut microbiota in human adults. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  8. Metaproteomics of Colonic Microbiota Unveils Discrete Protein Functions among Colitic Mice and Control Groups.

    Moon, Clara; Stupp, Gregory S; Su, Andrew I; Wolan, Dennis W

    2018-02-01

    Metaproteomics can greatly assist established high-throughput sequencing methodologies to provide systems biological insights into the alterations of microbial protein functionalities correlated with disease-associated dysbiosis of the intestinal microbiota. Here, the authors utilize the well-characterized murine T cell transfer model of colitis to find specific changes within the intestinal luminal proteome associated with inflammation. MS proteomic analysis of colonic samples permitted the identification of ≈10 000-12 000 unique peptides that corresponded to 5610 protein clusters identified across three groups, including the colitic Rag1 -/- T cell recipients, isogenic Rag1 -/- controls, and wild-type mice. The authors demonstrate that the colitic mice exhibited a significant increase in Proteobacteria and Verrucomicrobia and show that such alterations in the microbial communities contributed to the enrichment of specific proteins with transcription and translation gene ontology terms. In combination with 16S sequencing, the authors' metaproteomics-based microbiome studies provide a foundation for assessing alterations in intestinal luminal protein functionalities in a robust and well-characterized mouse model of colitis, and set the stage for future studies to further explore the functional mechanisms of altered protein functionalities associated with dysbiosis and inflammation. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  9. Metagenomic and metatranscriptomic analysis of saliva reveals disease-associated microbiota in patients with periodontitis and dental caries

    Belstrøm, Daniel; Constancias, Florentin; Liu, Yang

    2017-01-01

    relative abundance of traditional periodontal pathogens such as Porphyromonas gingivalis and Filifactor alocis and salivary microbial activity of F. alocis was associated with periodontitis. Significantly higher relative abundance of caries-associated bacteria such as Streptococcus mutans and Lactobacillus...

  10. Comparative metagenomics of the Red Sea

    Mineta, Katsuhiko

    2016-01-01

    started monthly samplings of the metagenomes in the Red Sea under KAUST-CCF project. In collaboration with Kitasato University, we also collected the metagenome data from the ocean in Japan, which shows contrasting features to the Red Sea. Therefore

  11. Marine metagenomics as a source for bioprospecting

    Kodzius, Rimantas; Gojobori, Takashi

    2015-01-01

    This review summarizes usage of genome-editing technologies for metagenomic studies; these studies are used to retrieve and modify valuable microorganisms for production, particularly in marine metagenomics. Organisms may be cultivable

  12. Web Resources for Metagenomics Studies

    Pravin Dudhagara

    2015-10-01

    Full Text Available The development of next-generation sequencing (NGS platforms spawned an enormous volume of data. This explosion in data has unearthed new scalability challenges for existing bioinformatics tools. The analysis of metagenomic sequences using bioinformatics pipelines is complicated by the substantial complexity of these data. In this article, we review several commonly-used online tools for metagenomics data analysis with respect to their quality and detail of analysis using simulated metagenomics data. There are at least a dozen such software tools presently available in the public domain. Among them, MGRAST, IMG/M, and METAVIR are the most well-known tools according to the number of citations by peer-reviewed scientific media up to mid-2015. Here, we describe 12 online tools with respect to their web link, annotation pipelines, clustering methods, online user support, and availability of data storage. We have also done the rating for each tool to screen more potential and preferential tools and evaluated five best tools using synthetic metagenome. The article comprehensively deals with the contemporary problems and the prospects of metagenomics from a bioinformatics viewpoint.

  13. Phyllosphere Metaproteomes of Trees from the Brazilian Atlantic Forest Show High Levels of Functional Redundancy.

    Lambais, M R; Barrera, S E; Santos, E C; Crowley, D E; Jumpponen, A

    2017-01-01

    The phyllosphere of the Brazilian Atlantic Forest has been estimated to contain several million bacterial species that are associated with approximately 20000 plant species. Despite the high bacterial diversity in the phyllosphere, the function of these microorganisms and the mechanisms driving their community assembly are largely unknown. In this study, we characterized the bacterial communities in the phyllospheres of four tree species of the Atlantic Forest (Mollinedia schottiana, Ocotea dispersa, Ocotea teleiandra, and Tabebuia serratifolia) and their metaproteomes to examine the basic protein functional groups expressed in the phyllosphere. Bacterial community analyses using 16S rRNA gene sequencing confirmed prior observations that plant species harbor distinct bacterial communities and that plants of the same taxon have more similar communities than more distantly related taxa. Using LC-ESI-Q-TOF, we identified 216 nonredundant proteins, based on 3503 peptide mass spectra. Most protein families were shared among the phyllosphere communities, suggesting functional redundancy despite differences in the species compositions of the bacterial communities. Proteins involved in glycolysis and anaerobic carbohydrate metabolism, solute transport, protein metabolism, cell motility, stress and antioxidant responses, nitrogen metabolism, and iron homeostasis were among the most frequently detected. In contrast to prior studies on crop plants and Arabidopsis, a low abundance of OTUs related to Methylobacterium and no proteins associated with the metabolism of one-carbon molecules were detected in the phyllospheres of the tree species studied here. Our data suggest that even though the phyllosphere bacterial communities of different tree species are phylogenetically diverse, their metaproteomes are functionally convergent with respect to traits required for survival on leaf surfaces.

  14. Metagenomic Analysis of Dairy Bacteriophages

    Muhammed, Musemma K.; Kot, Witold; Neve, Horst

    2017-01-01

    Despite their huge potential for characterizing the biodiversity of phages, metagenomic studies are currently not available for dairy bacteriophages, partly due to the lack of a standard procedure for phage extraction. We optimized an extraction method that allows to remove the bulk protein from...

  15. Studying Microbial Mat Functioning Amidst "Unexpected Diversity": Methodological Approaches and Initial Results from Metatranscriptomes of Mats Over Diel cycles, iTags from Long Term Manipulations, and Biogeochemical Cycling in Simplified Microbial Mats Constructed from Cultures

    Bebout, B.; Bebout, L. E.; Detweiler, A. M.; Everroad, R. C.; Lee, J.; Pett-Ridge, J.; Weber, P. K.

    2014-12-01

    Microbial mats are famously amongst the most diverse microbial ecosystems on Earth, inhabiting some of the most inclement environments known, including hypersaline, dry, hot, cold, nutrient poor, and high UV environments. The high microbial diversity of microbial mats makes studies of microbial ecology notably difficult. To address this challenge, we have been using a combination of metagenomics, metatranscriptomics, iTags and culture-based simplified microbial mats to study biogeochemical cycling (H2 production, N2 fixation, and fermentation) in microbial mats collected from Elkhorn Slough, Monterey Bay, California. Metatranscriptomes of microbial mats incubated over a diel cycle have revealed that a number of gene systems activate only during the day in Cyanobacteria, while the remaining appear to be constitutive. The dominant cyanobacterium in the mat (Microcoleus chthonoplastes) expresses several pathways for nitrogen scavenging undocumented in cultured strains, as well as the expression of two starch storage and utilization cycles. Community composition shifts in response to long term manipulations of mats were assessed using iTags. Changes in community diversity were observed as hydrogen fluxes increased in response to a lowering of sulfate concentrations. To produce simplified microbial mats, we have isolated members of 13 of the 15 top taxa from our iTag libraries into culture. Simplified microbial mats and simple co-cultures and consortia constructed from these isolates reproduce many of the natural patterns of biogeochemical cycling in the parent natural microbial mats, but against a background of far lower overall diversity, simplifying studies of changes in gene expression (over the short term), interactions between community members, and community composition changes (over the longer term), in response to environmental forcing.

  16. Metatranscriptomic analysis of diverse microbial communities reveals core metabolic pathways and microbiome-specific functionality.

    Jiang, Yue; Xiong, Xuejian; Danska, Jayne; Parkinson, John

    2016-01-12

    Metatranscriptomics is emerging as a powerful technology for the functional characterization of complex microbial communities (microbiomes). Use of unbiased RNA-sequencing can reveal both the taxonomic composition and active biochemical functions of a complex microbial community. However, the lack of established reference genomes, computational tools and pipelines make analysis and interpretation of these datasets challenging. Systematic studies that compare data across microbiomes are needed to demonstrate the ability of such pipelines to deliver biologically meaningful insights on microbiome function. Here, we apply a standardized analytical pipeline to perform a comparative analysis of metatranscriptomic data from diverse microbial communities derived from mouse large intestine, cow rumen, kimchi culture, deep-sea thermal vent and permafrost. Sequence similarity searches allowed annotation of 19 to 76% of putative messenger RNA (mRNA) reads, with the highest frequency in the kimchi dataset due to its relatively low complexity and availability of closely related reference genomes. Metatranscriptomic datasets exhibited distinct taxonomic and functional signatures. From a metabolic perspective, we identified a common core of enzymes involved in amino acid, energy and nucleotide metabolism and also identified microbiome-specific pathways such as phosphonate metabolism (deep sea) and glycan degradation pathways (cow rumen). Integrating taxonomic and functional annotations within a novel visualization framework revealed the contribution of different taxa to metabolic pathways, allowing the identification of taxa that contribute unique functions. The application of a single, standard pipeline confirms that the rich taxonomic and functional diversity observed across microbiomes is not simply an artefact of different analysis pipelines but instead reflects distinct environmental influences. At the same time, our findings show how microbiome complexity and availability of

  17. Efficiency of RNA extraction from selected bacteria in the context of biogas production and metatranscriptomics.

    Stark, Lucy; Giersch, Tina; Wünschiers, Röbbe

    2014-10-01

    Understanding the microbial population in anaerobic digestion is an essential task to increase efficient substrate use and process stability. The metabolic state, represented e.g. by the transcriptome, of a fermenting system can help to find markers for monitoring industrial biogas production to prevent failures or to model the whole process. Advances in next-generation sequencing make transcriptomes accessible for large-scale analyses. In order to analyze the metatranscriptome of a mixed-species sample, isolation of high-quality RNA is the first step. However, different extraction methods may yield different efficiencies in different species. Especially in mixed-species environmental samples, unbiased isolation of transcripts is important for meaningful conclusions. We applied five different RNA-extraction protocols to nine taxonomic diverse bacterial species. Chosen methods are based on various lysis and extraction principles. We found that the extraction efficiency of different methods depends strongly on the target organism. RNA isolation of gram-positive bacteria was characterized by low yield whilst from gram-negative species higher concentrations can be obtained. Transferring our results to mixed-species investigations, such as metatranscriptomics with biofilms or biogas plants, leads to the conclusion that particular microorganisms might be over- or underrepresented depending on the method applied. Special care must be taken when using such metatranscriptomics data for, e.g. process modeling. Copyright © 2013 Elsevier Ltd. All rights reserved.

  18. A de-novo-assembly-based Data Analysis Pipeline for Plant Obligate Parasite Metatranscriptomic Studies

    Li Guo

    2016-07-01

    Full Text Available Current and emerging plant diseases caused by obligate parasitic microbes such as rusts, downy mildews, and powdery mildews threaten worldwide crop production and food safety. These obligate parasites are typically unculturable in the laboratory, posing technical challenges to characterize them at the genetic and genomic level. Here we have developed a data analysis pipeline integrating several bioinformatic software programs. This pipeline facilitates rapid gene discovery and expression analysis of a plant host and its obligate parasite simultaneously by next generation sequencing of mixed host and pathogen RNA (i.e. metatranscriptomics. We applied this pipeline to metatranscriptomic sequencing data of sweet basil (Ocimum basilicum and its obligate downy mildew parasite Peronospora belbahrii, both lacking a sequenced genome. Even with a single data point, we were able to identify both candidate host defense genes and pathogen virulence genes that are highly expressed during infection. This demonstrates the power of this pipeline for identifying genes important in host-pathogen interactions without prior genomic information for either the plant host or the obligate biotrophic pathogen. The simplicity of this pipeline makes it accessible to researchers with limited computational skills and applicable to metatranscriptomic data analysis in a wide range of plant-obligate-parasite systems.

  19. Metagenomics and the protein universe

    Godzik, Adam

    2011-01-01

    Metagenomics sequencing projects have dramatically increased our knowledge of the protein universe and provided over one-half of currently known protein sequences; they have also introduced a much broader phylogenetic diversity into the protein databases. The full analysis of metagenomic datasets is only beginning, but it has already led to the discovery of thousands of new protein families, likely representing novel functions specific to given environments. At the same time, a deeper analysis of such novel families, including experimental structure determination of some representatives, suggests that most of them represent distant homologs of already characterized protein families, and thus most of the protein diversity present in the new environments are due to functional divergence of the known protein families rather than the emergence of new ones. PMID:21497084

  20. Exploration of noncoding sequences in metagenomes.

    Fabián Tobar-Tosse

    Full Text Available Environment-dependent genomic features have been defined for different metagenomes, whose genes and their associated processes are related to specific environments. Identification of ORFs and their functional categories are the most common methods for association between functional and environmental features. However, this analysis based on finding ORFs misses noncoding sequences and, therefore, some metagenome regulatory or structural information could be discarded. In this work we analyzed 23 whole metagenomes, including coding and noncoding sequences using the following sequence patterns: (G+C content, Codon Usage (Cd, Trinucleotide Usage (Tn, and functional assignments for ORF prediction. Herein, we present evidence of a high proportion of noncoding sequences discarded in common similarity-based methods in metagenomics, and the kind of relevant information present in those. We found a high density of trinucleotide repeat sequences (TRS in noncoding sequences, with a regulatory and adaptive function for metagenome communities. We present associations between trinucleotide values and gene function, where metagenome clustering correlate with microorganism adaptations and kinds of metagenomes. We propose here that noncoding sequences have relevant information to describe metagenomes that could be considered in a whole metagenome analysis in order to improve their organization, classification protocols, and their relation with the environment.

  1. Challenges and Opportunities of Airborne Metagenomics

    Behzad, H.; Gojobori, Takashi; Mineta, K.

    2015-01-01

    microorganisms. Airborne metagenomic studies could also lead to discoveries of novel genes and metabolic pathways relevant to meteorological and industrial applications, environmental bioremediation, and biogeochemical cycles.

  2. Marine metagenomics as a source for bioprospecting

    Kodzius, Rimantas

    2015-08-12

    This review summarizes usage of genome-editing technologies for metagenomic studies; these studies are used to retrieve and modify valuable microorganisms for production, particularly in marine metagenomics. Organisms may be cultivable or uncultivable. Metagenomics is providing especially valuable information for uncultivable samples. The novel genes, pathways and genomes can be deducted. Therefore, metagenomics, particularly genome engineering and system biology, allows for the enhancement of biological and chemical producers and the creation of novel bioresources. With natural resources rapidly depleting, genomics may be an effective way to efficiently produce quantities of known and novel foods, livestock feed, fuels, pharmaceuticals and fine or bulk chemicals.

  3. Integrative Workflows for Metagenomic Analysis

    Efthymios eLadoukakis

    2014-11-01

    Full Text Available The rapid evolution of all sequencing technologies, described by the term Next Generation Sequencing (NGS, have revolutionized metagenomic analysis. They constitute a combination of high-throughput analytical protocols, coupled to delicate measuring techniques, in order to potentially discover, properly assemble and map allelic sequences to the correct genomes, achieving particularly high yields for only a fraction of the cost of traditional processes (i.e. Sanger. From a bioinformatic perspective, this boils down to many gigabytes of data being generated from each single sequencing experiment, rendering the management or even the storage, critical bottlenecks with respect to the overall analytical endeavor. The enormous complexity is even more aggravated by the versatility of the processing steps available, represented by the numerous bioinformatic tools that are essential, for each analytical task, in order to fully unveil the genetic content of a metagenomic dataset. These disparate tasks range from simple, nonetheless non-trivial, quality control of raw data to exceptionally complex protein annotation procedures, requesting a high level of expertise for their proper application or the neat implementation of the whole workflow. Furthermore, a bioinformatic analysis of such scale, requires grand computational resources, imposing as the sole realistic solution, the utilization of cloud computing infrastructures. In this review article we discuss different, integrative, bioinformatic solutions available, which address the aforementioned issues, by performing a critical assessment of the available automated pipelines for data management, quality control and annotation of metagenomic data, embracing various, major sequencing technologies and applications.

  4. Exploring neighborhoods in the metagenome universe.

    Aßhauer, Kathrin P; Klingenberg, Heiner; Lingner, Thomas; Meinicke, Peter

    2014-07-14

    The variety of metagenomes in current databases provides a rapidly growing source of information for comparative studies. However, the quantity and quality of supplementary metadata is still lagging behind. It is therefore important to be able to identify related metagenomes by means of the available sequence data alone. We have studied efficient sequence-based methods for large-scale identification of similar metagenomes within a database retrieval context. In a broad comparison of different profiling methods we found that vector-based distance measures are well-suitable for the detection of metagenomic neighbors. Our evaluation on more than 1700 publicly available metagenomes indicates that for a query metagenome from a particular habitat on average nine out of ten nearest neighbors represent the same habitat category independent of the utilized profiling method or distance measure. While for well-defined labels a neighborhood accuracy of 100% can be achieved, in general the neighbor detection is severely affected by a natural overlap of manually annotated categories. In addition, we present results of a novel visualization method that is able to reflect the similarity of metagenomes in a 2D scatter plot. The visualization method shows a similarly high accuracy in the reduced space as compared with the high-dimensional profile space. Our study suggests that for inspection of metagenome neighborhoods the profiling methods and distance measures can be chosen to provide a convenient interpretation of results in terms of the underlying features. Furthermore, supplementary metadata of metagenome samples in the future needs to comply with readily available ontologies for fine-grained and standardized annotation. To make profile-based k-nearest-neighbor search and the 2D-visualization of the metagenome universe available to the research community, we included the proposed methods in our CoMet-Universe server for comparative metagenome analysis.

  5. Current and future resources for functional metagenomics

    Kathy Nguyen Lam

    2015-10-01

    Full Text Available Functional metagenomics is a powerful experimental approach for studying gene function, starting from the extracted DNA of mixed microbial populations. A functional approach relies on the construction and screening of metagenomic libraries – physical libraries that contain DNA cloned from environmental metagenomes. The information obtained from functional metagenomics can help in future annotations of gene function and serve as a complement to sequence-based metagenomics. In this Perspective, we begin by summarizing the technical challenges of constructing metagenomic libraries and emphasize their value as resources. We then discuss libraries constructed using the popular cloning vector, pCC1FOS, and highlight the strengths and shortcomings of this system, alongside possible strategies to maximize existing pCC1FOS-based libraries by screening in diverse hosts. Finally, we discuss the known bias of libraries constructed from human gut and marine water samples, present results that suggest bias may also occur for soil libraries, and consider factors that bias metagenomic libraries in general. We anticipate that discussion of current resources and limitations will advance tools and technologies for functional metagenomics research.

  6. Back to the Future of Soil Metagenomics.\

    Nesme J, J.; Achouak, W.; Agathos SN, S.N.; Bailey, M.; Baldrian, Petr; Brunel, D.; Frostegård, Å.; Heulin, T.; Jansson JK, J.K.; Jurkevitch, E.; Kruus, K.L.; Kowalchuk, G.A.; Lagares, A.; Lapin-Scott, H.M.; Lemanceau, P.; Le Paslier, D.; Mandic-Mulec, I.; Murrell, J.C.; Myrold, D.D.; Nalin, R.; Nannipieri, P.; Neufeld, J.D.; O'Gara, F.; Parnell, J.J.; Pühler, A.; Pylro, V.; Ramos, J.L.; Roesch, L.F.; Schloter, M.; Schleper, C.; Sczyrba, A.; Sessitsch, A.; Sjöling, S.; Sørensen, J.; Sørensen, S.J.; Tebbe, C.C.; Topp, E.; Tsiamis, G.; van Elsas, J.D.; van Keulen, G.; Widmer, F.; Wagner, M.; Zhang, T.; Zhang, X.; Zhao, L; Zhu, Y-G.; Vogel, T.M.; Simonet, P.

    2016-01-01

    Roč. 7, FEB 10 (2016), s. 73 ISSN 1664-302X Institutional support: RVO:61388971 Keywords : metagenomic * soil microbiology; terrestrial microbiology * metagenomic; soil microbiology; terrestrial microbiology Subject RIV: EE - Microbiology, Virology Impact factor: 4.076, year: 2016

  7. Metagenomic applications in environmental monitoring and bioremediation.

    Techtmann, Stephen M; Hazen, Terry C

    2016-10-01

    With the rapid advances in sequencing technology, the cost of sequencing has dramatically dropped and the scale of sequencing projects has increased accordingly. This has provided the opportunity for the routine use of sequencing techniques in the monitoring of environmental microbes. While metagenomic applications have been routinely applied to better understand the ecology and diversity of microbes, their use in environmental monitoring and bioremediation is increasingly common. In this review we seek to provide an overview of some of the metagenomic techniques used in environmental systems biology, addressing their application and limitation. We will also provide several recent examples of the application of metagenomics to bioremediation. We discuss examples where microbial communities have been used to predict the presence and extent of contamination, examples of how metagenomics can be used to characterize the process of natural attenuation by unculturable microbes, as well as examples detailing the use of metagenomics to understand the impact of biostimulation on microbial communities.

  8. Insights into Microalga and Bacteria Interactions of Selected Phycosphere Biofilms Using Metagenomic, Transcriptomic, and Proteomic Approaches

    Ines Krohn-Molt

    2017-10-01

    Full Text Available Microalga are of high relevance for the global carbon cycling and it is well-known that they are associated with a microbiota. However, it remains unclear, if the associated microbiota, often found in phycosphere biofilms, is specific for the microalga strains and which role individual bacterial taxa play. Here we provide experimental evidence that Chlorella saccharophila, Scenedesmus quadricauda, and Micrasterias crux-melitensis, maintained in strain collections, are associated with unique and specific microbial populations. Deep metagenome sequencing, binning approaches, secretome analyses in combination with RNA-Seq data implied fundamental differences in the gene expression profiles of the microbiota associated with the different microalga. Our metatranscriptome analyses indicates that the transcriptionally most active bacteria with respect to key genes commonly involved in plant–microbe interactions in the Chlorella (Trebouxiophyceae and Scenedesmus (Chlorophyceae strains belong to the phylum of the α-Proteobacteria. In contrast, in the Micrasterias (Zygnematophyceae phycosphere biofilm bacteria affiliated with the phylum of the Bacteroidetes showed the highest gene expression rates. We furthermore show that effector molecules known from plant–microbe interactions as inducers for the innate immunity are already of relevance at this evolutionary early plant-microbiome level.

  9. Nodeomics: pathogen detection in vertebrate lymph nodes using meta-transcriptomics.

    Nicola E Wittekindt

    Full Text Available The ongoing emergence of human infections originating from wildlife highlights the need for better knowledge of the microbial community in wildlife species where traditional diagnostic approaches are limited. Here we evaluate the microbial biota in healthy mule deer (Odocoileus hemionus by analyses of lymph node meta-transcriptomes. cDNA libraries from five individuals and two pools of samples were prepared from retropharyngeal lymph node RNA enriched for polyadenylated RNA and sequenced using Roche-454 Life Sciences technology. Protein-coding and 16S ribosomal RNA (rRNA sequences were taxonomically profiled using protein and rRNA specific databases. Representatives of all bacterial phyla were detected in the seven libraries based on protein-coding transcripts indicating that viable microbiota were present in lymph nodes. Residents of skin and rumen, and those ubiquitous in mule deer habitat dominated classifiable bacterial species. Based on detection of both rRNA and protein-coding transcripts, we identified two new proteobacterial species; a Helicobacter closely related to Helicobacter cetorum in the Helicobacter pylori/Helicobacter acinonychis complex and an Acinetobacter related to Acinetobacter schindleri. Among viruses, a novel gamma retrovirus and other members of the Poxviridae and Retroviridae were identified. We additionally evaluated bacterial diversity by amplicon sequencing the hypervariable V6 region of 16S rRNA and demonstrate that overall taxonomic diversity is higher with the meta-transcriptomic approach. These data provide the most complete picture to date of the microbial diversity within a wildlife host. Our research advances the use of meta-transcriptomics to study microbiota in wildlife tissues, which will facilitate detection of novel organisms with pathogenic potential to human and animals.

  10. Human milk metagenome: a functional capacity analysis

    2013-01-01

    Background Human milk contains a diverse population of bacteria that likely influences colonization of the infant gastrointestinal tract. Recent studies, however, have been limited to characterization of this microbial community by 16S rRNA analysis. In the present study, a metagenomic approach using Illumina sequencing of a pooled milk sample (ten donors) was employed to determine the genera of bacteria and the types of bacterial open reading frames in human milk that may influence bacterial establishment and stability in this primal food matrix. The human milk metagenome was also compared to that of breast-fed and formula-fed infants’ feces (n = 5, each) and mothers’ feces (n = 3) at the phylum level and at a functional level using open reading frame abundance. Additionally, immune-modulatory bacterial-DNA motifs were also searched for within human milk. Results The bacterial community in human milk contained over 360 prokaryotic genera, with sequences aligning predominantly to the phyla of Proteobacteria (65%) and Firmicutes (34%), and the genera of Pseudomonas (61.1%), Staphylococcus (33.4%) and Streptococcus (0.5%). From assembled human milk-derived contigs, 30,128 open reading frames were annotated and assigned to functional categories. When compared to the metagenome of infants’ and mothers’ feces, the human milk metagenome was less diverse at the phylum level, and contained more open reading frames associated with nitrogen metabolism, membrane transport and stress response (P milk metagenome also contained a similar occurrence of immune-modulatory DNA motifs to that of infants’ and mothers’ fecal metagenomes. Conclusions Our results further expand the complexity of the human milk metagenome and enforce the benefits of human milk ingestion on the microbial colonization of the infant gut and immunity. Discovery of immune-modulatory motifs in the metagenome of human milk indicates more exhaustive analyses of the functionality of the human

  11. Metaproteome analysis to determine the metabolically active part of a thermophilic microbial community producing biogas from agricultural biomass.

    Hanreich, Angelika; Heyer, Robert; Benndorf, Dirk; Rapp, Erdmann; Pioch, Markus; Reichl, Udo; Klocke, Michael

    2012-07-01

    Complex consortia of microorganisms are responsible for biogas production. A lot of information about the taxonomic structure and enzymatic potential of such communities has been collected by a variety of gene-based approaches, yet little is known about which of all the assumable metabolic pathways are active throughout the process of biogas formation. To tackle this problem, we established a protocol for the metaproteomic analysis of samples taken from biogas reactors fed with agricultural biomass. In contrast to previous studies where an anaerobic digester was fed with synthetic wastewater, the complex matrix in this study required the extraction of proteins with liquid phenol and the application of paper bridge loading for 2-dimensional gel electrophoresis. Proteins were subjected to nanoHPLC (high-performance liquid chromatography) coupled to tandem mass spectrometry for characterization. Several housekeeping proteins as well as methanogenesis-related enzymes were identified by a MASCOT search and de novo sequencing, which proved the feasibility of our approach. The establishment of such an approach is the basis for further metaproteomic studies of biogas-producing communities. In particular, the apparent status of metabolic activities within the communities can be monitored. The knowledge collected from such experiments could lead to further improvements of biogas production.

  12. Metatranscriptomics reveal differences in in situ energy and nitrogen metabolism among hydrothermal vent snail symbionts.

    Sanders, J G; Beinart, R A; Stewart, F J; Delong, E F; Girguis, P R

    2013-08-01

    Despite the ubiquity of chemoautotrophic symbioses at hydrothermal vents, our understanding of the influence of environmental chemistry on symbiont metabolism is limited. Transcriptomic analyses are useful for linking physiological poise to environmental conditions, but recovering samples from the deep sea is challenging, as the long recovery times can change expression profiles before preservation. Here, we present a novel, in situ RNA sampling and preservation device, which we used to compare the symbiont metatranscriptomes associated with Alviniconcha, a genus of vent snail, in which specific host-symbiont combinations are predictably distributed across a regional geochemical gradient. Metatranscriptomes of these symbionts reveal key differences in energy and nitrogen metabolism relating to both environmental chemistry (that is, the relative expression of genes) and symbiont phylogeny (that is, the specific pathways employed). Unexpectedly, dramatic differences in expression of transposases and flagellar genes suggest that different symbiont types may also have distinct life histories. These data further our understanding of these symbionts' metabolic capabilities and their expression in situ, and suggest an important role for symbionts in mediating their hosts' interaction with regional-scale differences in geochemistry.

  13. Metatranscriptomics reveals the molecular mechanism of large granule formation in granular anammox reactor

    Bagchi, Samik

    2016-06-20

    Granules enriched with anammox bacteria are essential in enhancing the treatment of ammonia-rich wastewater, but little is known about how anammox bacteria grow and multiply inside granules. Here, we combined metatranscriptomics, quantitative PCR and 16S rRNA gene sequencing to study the changes in community composition, metabolic gene content and gene expression in a granular anammox reactor with the objective of understanding the molecular mechanism of anammox growth and multiplication that led to formation of large granules. Size distribution analysis revealed the spatial distribution of granules in which large granules having higher abundance of anammox bacteria (genus Brocadia) dominated the bottom biomass. Metatranscriptomics analysis detected all the essential transcripts for anammox metabolism. During the later stage of reactor operation, higher expression of ammonia and nitrite transport proteins and key metabolic enzymes mainly in the bottom large granules facilitated anammox bacteria activity. The high activity resulted in higher growth and multiplication of anammox bacteria and expanded the size of the granules. This conceptual model for large granule formation proposed here may assist in the future design of anammox processes for mainstream wastewater treatment.

  14. Diatoms dominate the eukaryotic metatranscriptome during spring in coastal 'dead zone' sediments.

    Broman, Elias; Sachpazidou, Varvara; Dopson, Mark; Hylander, Samuel

    2017-10-11

    An important characteristic of marine sediments is the oxygen concentration that affects many central metabolic processes. There has been a widespread increase in hypoxia in coastal systems (referred to as 'dead zones') mainly caused by eutrophication. Hence, it is central to understand the metabolism and ecology of eukaryotic life in sediments during changing oxygen conditions. Therefore, we sampled coastal 'dead zone' Baltic Sea sediment during autumn and spring, and analysed the eukaryotic metatranscriptome from field samples and after incubation in the dark under oxic or anoxic conditions. Bacillariophyta (diatoms) dominated the eukaryotic metatranscriptome in spring and were also abundant during autumn. A large fraction of the diatom RNA reads was associated with the photosystems suggesting a constitutive expression in darkness. Microscope observation showed intact diatom cells and these would, if hatched, represent a significant part of the pelagic phytoplankton biomass. Oxygenation did not significantly change the relative proportion of diatoms nor resulted in any major shifts in metabolic 'signatures'. By contrast, diatoms rapidly responded when exposed to light suggesting that light is limiting diatom development in hypoxic sediments. Hence, it is suggested that diatoms in hypoxic sediments are on 'standby' to exploit the environment if they reach suitable habitats. © 2017 The Author(s).

  15. Tapping uncultured microorganisms through metagenomics for drug ...

    African Journal of Biotechnology ... Microorganisms are major source of bioactive natural products, and several ... This review highlights the recent methodologies, limitations, and applications of metagenomics for the discovery of new drugs.

  16. Tapping uncultured microorganisms through metagenomics for drug ...

    bdelnasser

    reached the market using this new technology. For these reasons and others, the interest in natural products has ..... Functional metagenomic library screening strategy ..... Bertrand H, Poly F, Van VT, Lombard N, Nalin R, Vogel TM, Simonet P.

  17. Comparative metagenomics of the Red Sea

    Mineta, Katsuhiko

    2016-01-26

    Metagenome produces a tremendous amount of data that comes from the organisms living in the environments. This big data enables us to examine not only microbial genes but also the community structure, interaction and adaptation mechanisms at the specific location and condition. The Red Sea has several unique characteristics such as high salinity, high temperature and low nutrition. These features must contribute to form the unique microbial community during the evolutionary process. Since 2014, we started monthly samplings of the metagenomes in the Red Sea under KAUST-CCF project. In collaboration with Kitasato University, we also collected the metagenome data from the ocean in Japan, which shows contrasting features to the Red Sea. Therefore, the comparative metagenomics of those data provides a comprehensive view of the Red Sea microbes, leading to identify key microbes, genes and networks related to those environmental differences.

  18. Challenges and Opportunities of Airborne Metagenomics

    Behzad, Hayedeh; Gojobori, Takashi; Mineta, Katsuhiko

    2015-01-01

    Recent metagenomic studies of environments, such as marine and soil, have significantly enhanced our understanding of the diverse microbial communities living in these habitats and their essential roles in sustaining vast ecosystems. The increase in the number of publications related to soil and marine metagenomics is in sharp contrast to those of air, yet airborne microbes are thought to have significant impacts on many aspects of our lives from their potential roles in atmospheric events su...

  19. Impact of Dietary Resistant Starch on the Human Gut Microbiome, Metaproteome, and Metabolome.

    Maier, Tanja V; Lucio, Marianna; Lee, Lang Ho; VerBerkmoes, Nathan C; Brislawn, Colin J; Bernhardt, Jörg; Lamendella, Regina; McDermott, Jason E; Bergeron, Nathalie; Heinzmann, Silke S; Morton, James T; González, Antonio; Ackermann, Gail; Knight, Rob; Riedel, Katharina; Krauss, Ronald M; Schmitt-Kopplin, Philippe; Jansson, Janet K

    2017-10-17

    Diet can influence the composition of the human microbiome, and yet relatively few dietary ingredients have been systematically investigated with respect to their impact on the functional potential of the microbiome. Dietary resistant starch (RS) has been shown to have health benefits, but we lack a mechanistic understanding of the metabolic processes that occur in the gut during digestion of RS. Here, we collected samples during a dietary crossover study with diets containing large or small amounts of RS. We determined the impact of RS on the gut microbiome and metabolic pathways in the gut, using a combination of "omics" approaches, including 16S rRNA gene sequencing, metaproteomics, and metabolomics. This multiomics approach captured changes in the abundance of specific bacterial species, proteins, and metabolites after a diet high in resistant starch (HRS), providing key insights into the influence of dietary interventions on the gut microbiome. The combined data showed that a high-RS diet caused an increase in the ratio of Firmicutes to Bacteroidetes , including increases in relative abundances of some specific members of the Firmicutes and concurrent increases in enzymatic pathways and metabolites involved in lipid metabolism in the gut. IMPORTANCE This work was undertaken to obtain a mechanistic understanding of the complex interplay between diet and the microorganisms residing in the intestine. Although it is known that gut microbes play a key role in digestion of the food that we consume, the specific contributions of different microorganisms are not well understood. In addition, the metabolic pathways and resultant products of metabolism during digestion are highly complex. To address these knowledge gaps, we used a combination of molecular approaches to determine the identities of the microorganisms in the gut during digestion of dietary starch as well as the metabolic pathways that they carry out. Together, these data provide a more complete picture of

  20. Impact of Dietary Resistant Starch on the Human Gut Microbiome, Metaproteome, and Metabolome

    Maier, Tanja V.; Lucio, Marianna; Lee, Lang Ho; VerBerkmoes, Nathan C.; Brislawn, Colin J.; Bernhardt, Jörg; Lamendella, Regina; McDermott, Jason E.; Bergeron, Nathalie; Heinzmann, Silke S.; Morton, James T.; González, Antonio; Ackermann, Gail; Knight, Rob; Riedel, Katharina; Krauss, Ronald M.; Schmitt-Kopplin, Philippe; Jansson, Janet K.; Moran, Mary Ann

    2017-10-17

    ABSTRACT

    Diet can influence the composition of the human microbiome, and yet relatively few dietary ingredients have been systematically investigated with respect to their impact on the functional potential of the microbiome. Dietary resistant starch (RS) has been shown to have health benefits, but we lack a mechanistic understanding of the metabolic processes that occur in the gut during digestion of RS. Here, we collected samples during a dietary crossover study with diets containing large or small amounts of RS. We determined the impact of RS on the gut microbiome and metabolic pathways in the gut, using a combination of “omics” approaches, including 16S rRNA gene sequencing, metaproteomics, and metabolomics. This multiomics approach captured changes in the abundance of specific bacterial species, proteins, and metabolites after a diet high in resistant starch (HRS), providing key insights into the influence of dietary interventions on the gut microbiome. The combined data showed that a high-RS diet caused an increase in the ratio ofFirmicutestoBacteroidetes, including increases in relative abundances of some specific members of theFirmicutesand concurrent increases in enzymatic pathways and metabolites involved in lipid metabolism in the gut.

    IMPORTANCEThis work was undertaken to obtain a mechanistic understanding of the complex interplay between diet and the microorganisms residing in the intestine. Although it is known that gut microbes play a key role in digestion of the food that we consume, the specific contributions of different microorganisms are not well understood. In addition, the metabolic pathways and resultant products of metabolism during digestion are highly complex. To address these knowledge gaps, we used a combination of molecular approaches to determine the identities of the microorganisms in the gut during digestion of dietary starch as well as the

  1. Challenges and Opportunities of Airborne Metagenomics

    Behzad, H.

    2015-05-06

    Recent metagenomic studies of environments, such as marine and soil, have significantly enhanced our understanding of the diverse microbial communities living in these habitats and their essential roles in sustaining vast ecosystems. The increase in the number of publications related to soil and marine metagenomics is in sharp contrast to those of air, yet airborne microbes are thought to have significant impacts on many aspects of our lives from their potential roles in atmospheric events such as cloud formation, precipitation, and atmospheric chemistry to their major impact on human health. In this review, we will discuss the current progress in airborne metagenomics, with a special focus on exploring the challenges and opportunities of undertaking such studies. The main challenges of conducting metagenomic studies of airborne microbes are as follows: 1) Low density of microorganisms in the air, 2) efficient retrieval of microorganisms from the air, 3) variability in airborne microbial community composition, 4) the lack of standardized protocols and methodologies, and 5) DNA sequencing and bioinformatics-related challenges. Overcoming these challenges could provide the groundwork for comprehensive analysis of airborne microbes and their potential impact on the atmosphere, global climate, and our health. Metagenomic studies offer a unique opportunity to examine viral and bacterial diversity in the air and monitor their spread locally or across the globe, including threats from pathogenic microorganisms. Airborne metagenomic studies could also lead to discoveries of novel genes and metabolic pathways relevant to meteorological and industrial applications, environmental bioremediation, and biogeochemical cycles.

  2. Interactive metagenomic visualization in a Web browser

    Phillippy Adam M

    2011-09-01

    Full Text Available Abstract Background A critical output of metagenomic studies is the estimation of abundances of taxonomical or functional groups. The inherent uncertainty in assignments to these groups makes it important to consider both their hierarchical contexts and their prediction confidence. The current tools for visualizing metagenomic data, however, omit or distort quantitative hierarchical relationships and lack the facility for displaying secondary variables. Results Here we present Krona, a new visualization tool that allows intuitive exploration of relative abundances and confidences within the complex hierarchies of metagenomic classifications. Krona combines a variant of radial, space-filling displays with parametric coloring and interactive polar-coordinate zooming. The HTML5 and JavaScript implementation enables fully interactive charts that can be explored with any modern Web browser, without the need for installed software or plug-ins. This Web-based architecture also allows each chart to be an independent document, making them easy to share via e-mail or post to a standard Web server. To illustrate Krona's utility, we describe its application to various metagenomic data sets and its compatibility with popular metagenomic analysis tools. Conclusions Krona is both a powerful metagenomic visualization tool and a demonstration of the potential of HTML5 for highly accessible bioinformatic visualizations. Its rich and interactive displays facilitate more informed interpretations of metagenomic analyses, while its implementation as a browser-based application makes it extremely portable and easily adopted into existing analysis packages. Both the Krona rendering code and conversion tools are freely available under a BSD open-source license, and available from: http://krona.sourceforge.net.

  3. Metaproteomics of saliva identifies human protein markers specific for individuals with periodontitis and dental caries compared to orally healthy controls

    Belstrøm, Daniel; Jersie-Christensen, Rosa R; Lyon, David

    2016-01-01

    BACKGROUND: The composition of the salivary microbiota has been reported to differentiate between patients with periodontitis, dental caries and orally healthy individuals. To identify characteristics of diseased and healthy saliva we thus wanted to compare saliva metaproteomes from patients...... with periodontitis and dental caries to healthy individuals. METHODS: Stimulated saliva samples were collected from 10 patients with periodontitis, 10 patients with dental caries and 10 orally healthy individuals. The proteins in the saliva samples were subjected to denaturing buffer and digested enzymatically...... and inflammatory markers in periodontitis and dental caries compared to healthy controls. Bacterial proteome profiles and functional annotation were very similar in health and disease. CONCLUSIONS: Overexpression of proteins related to the complement system and inflammation seems to correlate with oral disease...

  4. A meta-proteomics approach to study the interspecies interactions affecting microbial biofilm development in a model community

    Herschend, Jakob; Damholt, Zacharias Brimnes Visby; Marquard, Andrea Marion

    2017-01-01

    Microbial biofilms are omnipresent in nature and relevant to a broad spectrum of industries ranging from bioremediation and food production to biomedical applications. To date little is understood about how multi-species biofilm communities develop and function on a molecular level, due to the co......Microbial biofilms are omnipresent in nature and relevant to a broad spectrum of industries ranging from bioremediation and food production to biomedical applications. To date little is understood about how multi-species biofilm communities develop and function on a molecular level, due...... to the complexity of these biological systems. Here we apply a meta-proteomics approach to investigate the mechanisms influencing biofilm formation in a model consortium of four bacterial soil isolates; Stenotrophomonas rhizophila, Xanthomonas retroflexus, Microbacterium oxydans and Paenibacillus amylolyticus...

  5. Metaproteomics reveals major microbial players and their biodegradation functions in a large-scale aerobic composting plant

    Liu, Dongming; Li, Mingxiao; Xi, Beidou; Zhao, Yue; Wei, Zimin; Song, Caihong; Zhu, Chaowei

    2015-01-01

    Composting is an appropriate management alternative for municipal solid waste; however, our knowledge about the microbial regulation of this process is still scare. We employed metaproteomics to elucidate the main biodegradation pathways in municipal solid waste composting system across the main phases in a large-scale composting plant. The investigation of microbial succession revealed that Bacillales, Actinobacteria and Saccharomyces increased significantly with respect to abundance in composting process. The key microbiologic population for cellulose degradation in different composting stages was different. Fungi were found to be the main producers of cellulase in earlier phase. However, the cellulolytic fungal communities were gradually replaced by a purely bacterial one in active phase, which did not support the concept that the thermophilic fungi are active through the thermophilic phase. The effective decomposition of cellulose required the synergy between bacteria and fungi in the curing phase. PMID:25989417

  6. Marine Metagenome as A Resource for Novel Enzymes

    Alma’ abadi, Amani D.; Gojobori, Takashi; Mineta, Katsuhiko

    2015-01-01

    the metagenomics approach has many limitations, it is expected to provide not only scientific insights but also economic benefits, especially in industry. This review highlights the importance of metagenomics in mining microbial lipases, as an example, by using

  7. Metaproteomic identification of diazotrophic methanotrophs and their localization in root tissues of field-grown rice plants.

    Bao, Zhihua; Okubo, Takashi; Kubota, Kengo; Kasahara, Yasuhiro; Tsurumaru, Hirohito; Anda, Mizue; Ikeda, Seishi; Minamisawa, Kiwamu

    2014-08-01

    In a previous study by our group, CH4 oxidation and N2 fixation were simultaneously activated in the roots of wild-type rice plants in a paddy field with no N input; both processes are likely controlled by a rice gene for microbial symbiosis. The present study examined which microorganisms in rice roots were responsible for CH4 oxidation and N2 fixation under the field conditions. Metaproteomic analysis of root-associated bacteria from field-grown rice (Oryza sativa Nipponbare) revealed that nitrogenase complex-containing nitrogenase reductase (NifH) and the alpha subunit (NifD) and beta subunit (NifK) of dinitrogenase were mainly derived from type II methanotrophic bacteria of the family Methylocystaceae, including Methylosinus spp. Minor nitrogenase proteins such as Methylocella, Bradyrhizobium, Rhodopseudomonas, and Anaeromyxobacter were also detected. Methane monooxygenase proteins (PmoCBA and MmoXYZCBG) were detected in the same bacterial group of the Methylocystaceae. Because these results indicated that Methylocystaceae members mediate both CH4 oxidation and N2 fixation, we examined their localization in rice tissues by using catalyzed reporter deposition-fluorescence in situ hybridization (CARD-FISH). The methanotrophs were localized around the epidermal cells and vascular cylinder in the root tissues of the field-grown rice plants. Our metaproteomics and CARD-FISH results suggest that CH4 oxidation and N2 fixation are performed mainly by type II methanotrophs of the Methylocystaceae, including Methylosinus spp., inhabiting the vascular bundles and epidermal cells of rice roots. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  8. Customized workflow development and data modularization concepts for RNA-Sequencing and metatranscriptome experiments.

    Lott, Steffen C; Wolfien, Markus; Riege, Konstantin; Bagnacani, Andrea; Wolkenhauer, Olaf; Hoffmann, Steve; Hess, Wolfgang R

    2017-11-10

    RNA-Sequencing (RNA-Seq) has become a widely used approach to study quantitative and qualitative aspects of transcriptome data. The variety of RNA-Seq protocols, experimental study designs and the characteristic properties of the organisms under investigation greatly affect downstream and comparative analyses. In this review, we aim to explain the impact of structured pre-selection, classification and integration of best-performing tools within modularized data analysis workflows and ready-to-use computing infrastructures towards experimental data analyses. We highlight examples for workflows and use cases that are presented for pro-, eukaryotic and mixed dual RNA-Seq (meta-transcriptomics) experiments. In addition, we are summarizing the expertise of the laboratories participating in the project consortium "Structured Analysis and Integration of RNA-Seq experiments" (de.STAIR) and its integration with the Galaxy-workbench of the RNA Bioinformatics Center (RBC). Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.

  9. Viral Metagenomics: MetaView Software

    Zhou, C; Smith, J

    2007-10-22

    The purpose of this report is to design and develop a tool for analysis of raw sequence read data from viral metagenomics experiments. The tool should compare read sequences of known viral nucleic acid sequence data and enable a user to attempt to determine, with some degree of confidence, what virus groups may be present in the sample. This project was conducted in two phases. In phase 1 we surveyed the literature and examined existing metagenomics tools to educate ourselves and to more precisely define the problem of analyzing raw read data from viral metagenomic experiments. In phase 2 we devised an approach and built a prototype code and database. This code takes viral metagenomic read data in fasta format as input and accesses all complete viral genomes from Kpath for sequence comparison. The system executes at the UNIX command line, producing output that is stored in an Oracle relational database. We provide here a description of the approach we came up with for handling un-assembled, short read data sets from viral metagenomics experiments. We include a discussion of the current MetaView code capabilities and additional functionality that we believe should be added, should additional funding be acquired to continue the work.

  10. Preliminary High-Throughput Metagenome Assembly

    Dusheyko, Serge; Furman, Craig; Pangilinan, Jasmyn; Shapiro, Harris; Tu, Hank

    2007-03-26

    Metagenome data sets present a qualitatively different assembly problem than traditional single-organism whole-genome shotgun (WGS) assembly. The unique aspects of such projects include the presence of a potentially large number of distinct organisms and their representation in the data set at widely different fractions. In addition, multiple closely related strains could be present, which would be difficult to assemble separately. Failure to take these issues into account can result in poor assemblies that either jumble together different strains or which fail to yield useful results. The DOE Joint Genome Institute has sequenced a number of metagenomic projects and plans to considerably increase this number in the coming year. As a result, the JGI has a need for high-throughput tools and techniques for handling metagenome projects. We present the techniques developed to handle metagenome assemblies in a high-throughput environment. This includes a streamlined assembly wrapper, based on the JGI?s in-house WGS assembler, Jazz. It also includes the selection of sensible defaults targeted for metagenome data sets, as well as quality control automation for cleaning up the raw results. While analysis is ongoing, we will discuss preliminary assessments of the quality of the assembly results (http://fames.jgi-psf.org).

  11. Shotgun metagenomic data streams: surfing without fear

    Berendzen, Joel R [Los Alamos National Laboratory

    2010-12-06

    Timely information about bio-threat prevalence, consequence, propagation, attribution, and mitigation is needed to support decision-making, both routinely and in a crisis. One DNA sequencer can stream 25 Gbp of information per day, but sampling strategies and analysis techniques are needed to turn raw sequencing power into actionable knowledge. Shotgun metagenomics can enable biosurveillance at the level of a single city, hospital, or airplane. Metagenomics characterizes viruses and bacteria from complex environments such as soil, air filters, or sewage. Unlike targeted-primer-based sequencing, shotgun methods are not blind to sequences that are truly novel, and they can measure absolute prevalence. Shotgun metagenomic sampling can be non-invasive, efficient, and inexpensive while being informative. We have developed analysis techniques for shotgun metagenomic sequencing that rely upon phylogenetic signature patterns. They work by indexing local sequence patterns in a manner similar to web search engines. Our methods are laptop-fast and favorable scaling properties ensure they will be sustainable as sequencing methods grow. We show examples of application to soil metagenomic samples.

  12. The metatranscriptome of the rhesus macaque: investigating potential causes of idiopathic chronic diarrhea

    The study of the gut microbiome—the collection of microbes within the intestinal tract and the genes they express—is growing in popularity as associations are found between diet, gut microbiome activity, and host health and disease. However, current metagenomic and ribosomal profiling approaches are...

  13. Metatranscriptomes reveal functional variation in diatom communities from the Antarctic Peninsula

    Pearson, Gareth A

    2015-04-14

    Functional genomics of diatom-dominated communities fromthe Antarctic Peninsula was studied using comparative metatranscriptomics. Samples obtained from diatom-rich communities in the Bransfield Strait, the western Weddell Sea and sea ice in the Bellingshausen Sea/Wilkins Ice Shelf yielded more than 500K pyrosequencing reads that were combined to produce a global metatranscriptome assembly. Multi-gene phylogenies recovered three distinct communities, and diatom-assigned contigs further indicated little read-sharing between communities, validating an assembly-based annotation and analysis approach. Although functional analysis recovered a core of abundant shared annotations that were expressed across the three diatom communities, over 40% of annotations (but accounting for <10% of sequences) were community-specific. The two pelagic communities differed in their expression of N-metabolism and acquisition genes, which was almost absent in post-bloom conditions in the Weddell Sea community, while enrichment of transporters for ammonia and urea in Bransfield Strait diatoms suggests a physiological stance towards acquisition of reduced N-sources. The depletion of carbohydrate and energy metabolism pathways in sea ice relative to pelagic communities, together with increased light energy dissipation (via LHCSR proteins), photorespiration, and NO3 - uptake and utilization all pointed to irradiance stress and/or inorganic carbon limitation within sea ice. Ice-binding proteins and cold-shock transcription factors were also enriched in sea ice diatoms. Surprisingly, the abundance of gene transcripts for the translational machinery tracked decreasing environmental temperature across only a 4 °C range, possibly reflecting constraints on translational efficiency and protein production in cold environments. © 2015 International Society for Microbial Ecology All rights reserved.

  14. Metatranscriptomes reveal functional variation in diatom communities from the Antarctic Peninsula

    Pearson, Gareth A; Lago-Leston, Asuncion; Cá novas, Fernando; Cox, Cymon J; Verret, Frederic; Lasternas, Sebastian; Duarte, Carlos M.; Agusti, Susana; Serrã o, Ester A

    2015-01-01

    Functional genomics of diatom-dominated communities fromthe Antarctic Peninsula was studied using comparative metatranscriptomics. Samples obtained from diatom-rich communities in the Bransfield Strait, the western Weddell Sea and sea ice in the Bellingshausen Sea/Wilkins Ice Shelf yielded more than 500K pyrosequencing reads that were combined to produce a global metatranscriptome assembly. Multi-gene phylogenies recovered three distinct communities, and diatom-assigned contigs further indicated little read-sharing between communities, validating an assembly-based annotation and analysis approach. Although functional analysis recovered a core of abundant shared annotations that were expressed across the three diatom communities, over 40% of annotations (but accounting for <10% of sequences) were community-specific. The two pelagic communities differed in their expression of N-metabolism and acquisition genes, which was almost absent in post-bloom conditions in the Weddell Sea community, while enrichment of transporters for ammonia and urea in Bransfield Strait diatoms suggests a physiological stance towards acquisition of reduced N-sources. The depletion of carbohydrate and energy metabolism pathways in sea ice relative to pelagic communities, together with increased light energy dissipation (via LHCSR proteins), photorespiration, and NO3 - uptake and utilization all pointed to irradiance stress and/or inorganic carbon limitation within sea ice. Ice-binding proteins and cold-shock transcription factors were also enriched in sea ice diatoms. Surprisingly, the abundance of gene transcripts for the translational machinery tracked decreasing environmental temperature across only a 4 °C range, possibly reflecting constraints on translational efficiency and protein production in cold environments. © 2015 International Society for Microbial Ecology All rights reserved.

  15. Metaproteomics: Harnessing the power of high performance mass spectrometry to identify the suite of proteins that control metabolic activities in microbial communities

    Hettich, Robert L.; Pan, Chongle; Chourey, Karuna; Giannone, Richard J.

    2013-01-01

    Summary The availability of extensive genome information for many different microbes, including unculturable species in mixed communities from environmental samples, has enabled systems-biology interrogation by providing a means to access genomic, transcriptomic, and proteomic information. To this end, metaproteomics exploits the power of high performance mass spectrometry for extensive characterization of the complete suite of proteins expressed by a microbial community in an environmental sample. PMID:23469896

  16. FANTOM: Functional and taxonomic analysis of metagenomes

    Sanli Kemal

    2013-02-01

    Full Text Available Abstract Background Interpretation of quantitative metagenomics data is important for our understanding of ecosystem functioning and assessing differences between various environmental samples. There is a need for an easy to use tool to explore the often complex metagenomics data in taxonomic and functional context. Results Here we introduce FANTOM, a tool that allows for exploratory and comparative analysis of metagenomics abundance data integrated with metadata information and biological databases. Importantly, FANTOM can make use of any hierarchical database and it comes supplied with NCBI taxonomic hierarchies as well as KEGG Orthology, COG, PFAM and TIGRFAM databases. Conclusions The software is implemented in Python, is platform independent, and is available at http://www.sysbio.se/Fantom.

  17. A catalog of the mouse gut metagenome

    Xiao, Liang; Feng, Qiang; Liang, Suisha

    2015-01-01

    laboratories and fed either a low-fat or high-fat diet. Similar to the human gut microbiome, >99% of the cataloged genes are bacterial. We identified 541 metagenomic species and defined a core set of 26 metagenomic species found in 95% of the mice. The mouse gut microbiome is functionally similar to its human......We established a catalog of the mouse gut metagenome comprising ∼2.6 million nonredundant genes by sequencing DNA from fecal samples of 184 mice. To secure high microbiome diversity, we used mouse strains of diverse genetic backgrounds, from different providers, kept in different housing...... counterpart, with 95.2% of its Kyoto Encyclopedia of Genes and Genomes (KEGG) orthologous groups in common. However, only 4.0% of the mouse gut microbial genes were shared (95% identity, 90% coverage) with those of the human gut microbiome. This catalog provides a useful reference for future studies....

  18. Challenges and opportunities of airborne metagenomics.

    Behzad, Hayedeh; Gojobori, Takashi; Mineta, Katsuhiko

    2015-05-06

    Recent metagenomic studies of environments, such as marine and soil, have significantly enhanced our understanding of the diverse microbial communities living in these habitats and their essential roles in sustaining vast ecosystems. The increase in the number of publications related to soil and marine metagenomics is in sharp contrast to those of air, yet airborne microbes are thought to have significant impacts on many aspects of our lives from their potential roles in atmospheric events such as cloud formation, precipitation, and atmospheric chemistry to their major impact on human health. In this review, we will discuss the current progress in airborne metagenomics, with a special focus on exploring the challenges and opportunities of undertaking such studies. The main challenges of conducting metagenomic studies of airborne microbes are as follows: 1) Low density of microorganisms in the air, 2) efficient retrieval of microorganisms from the air, 3) variability in airborne microbial community composition, 4) the lack of standardized protocols and methodologies, and 5) DNA sequencing and bioinformatics-related challenges. Overcoming these challenges could provide the groundwork for comprehensive analysis of airborne microbes and their potential impact on the atmosphere, global climate, and our health. Metagenomic studies offer a unique opportunity to examine viral and bacterial diversity in the air and monitor their spread locally or across the globe, including threats from pathogenic microorganisms. Airborne metagenomic studies could also lead to discoveries of novel genes and metabolic pathways relevant to meteorological and industrial applications, environmental bioremediation, and biogeochemical cycles. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  19. Metagenomic Detection Methods in Biopreparedness Outbreak Scenarios

    Karlsson, Oskar Erik; Hansen, Trine; Knutsson, Rickard

    2013-01-01

    In the field of diagnostic microbiology, rapid molecular methods are critically important for detecting pathogens. With rapid and accurate detection, preventive measures can be put in place early, thereby preventing loss of life and further spread of a disease. From a preparedness perspective...... of a clinical sample, creating a metagenome, in a single week of laboratory work. As new technologies emerge, their dissemination and capacity building must be facilitated, and criteria for use, as well as guidelines on how to report results, must be established. This article focuses on the use of metagenomics...

  20. Gene Prediction in Metagenomic Fragments with Deep Learning

    Shao-Wu Zhang

    2017-01-01

    Full Text Available Next generation sequencing technologies used in metagenomics yield numerous sequencing fragments which come from thousands of different species. Accurately identifying genes from metagenomics fragments is one of the most fundamental issues in metagenomics. In this article, by fusing multifeatures (i.e., monocodon usage, monoamino acid usage, ORF length coverage, and Z-curve features and using deep stacking networks learning model, we present a novel method (called Meta-MFDL to predict the metagenomic genes. The results with 10 CV and independent tests show that Meta-MFDL is a powerful tool for identifying genes from metagenomic fragments.

  1. Metagenomics as a Tool for Enzyme Discovery: Hydrolytic Enzymes from Marine-Related Metagenomes.

    Popovic, Ana; Tchigvintsev, Anatoly; Tran, Hai; Chernikova, Tatyana N; Golyshina, Olga V; Yakimov, Michail M; Golyshin, Peter N; Yakunin, Alexander F

    2015-01-01

    This chapter discusses metagenomics and its application for enzyme discovery, with a focus on hydrolytic enzymes from marine metagenomic libraries. With less than one percent of culturable microorganisms in the environment, metagenomics, or the collective study of community genetics, has opened up a rich pool of uncharacterized metabolic pathways, enzymes, and adaptations. This great untapped pool of genes provides the particularly exciting potential to mine for new biochemical activities or novel enzymes with activities tailored to peculiar sets of environmental conditions. Metagenomes also represent a huge reservoir of novel enzymes for applications in biocatalysis, biofuels, and bioremediation. Here we present the results of enzyme discovery for four enzyme activities, of particular industrial or environmental interest, including esterase/lipase, glycosyl hydrolase, protease and dehalogenase.

  2. Assembly of viral genomes from metagenomes

    Saskia L Smits

    2014-12-01

    Full Text Available Viral infections remain a serious global health issue. Metagenomic approaches are increasingly used in the detection of novel viral pathogens but also to generate complete genomes of uncultivated viruses. In silico identification of complete viral genomes from sequence data would allow rapid phylogenetic characterization of these new viruses. Often, however, complete viral genomes are not recovered, but rather several distinct contigs derived from a single entity, some of which have no sequence homology to any known proteins. De novo assembly of single viruses from a metagenome is challenging, not only because of the lack of a reference genome, but also because of intrapopulation variation and uneven or insufficient coverage. Here we explored different assembly algorithms, remote homology searches, genome-specific sequence motifs, k-mer frequency ranking, and coverage profile binning to detect and obtain viral target genomes from metagenomes. All methods were tested on 454-generated sequencing datasets containing three recently described RNA viruses with a relatively large genome which were divergent to previously known viruses from the viral families Rhabdoviridae and Coronaviridae. Depending on specific characteristics of the target virus and the metagenomic community, different assembly and in silico gap closure strategies were successful in obtaining near complete viral genomes.

  3. Assembly of viral genomes from metagenomes

    S.L. Smits (Saskia); R. Bodewes (Rogier); A. Ruiz-Gonzalez (Aritz); V. Baumgärtner (Volkmar); M.P.G. Koopmans D.V.M. (Marion); A.D.M.E. Osterhaus (Albert); A. Schürch (Anita)

    2014-01-01

    textabstractViral infections remain a serious global health issue. Metagenomic approaches are increasingly used in the detection of novel viral pathogens but also to generate complete genomes of uncultivated viruses. In silico identification of complete viral genomes from sequence data would allow

  4. Tentacle: distributed quantification of genes in metagenomes.

    Boulund, Fredrik; Sjögren, Anders; Kristiansson, Erik

    2015-01-01

    In metagenomics, microbial communities are sequenced at increasingly high resolution, generating datasets with billions of DNA fragments. Novel methods that can efficiently process the growing volumes of sequence data are necessary for the accurate analysis and interpretation of existing and upcoming metagenomes. Here we present Tentacle, which is a novel framework that uses distributed computational resources for gene quantification in metagenomes. Tentacle is implemented using a dynamic master-worker approach in which DNA fragments are streamed via a network and processed in parallel on worker nodes. Tentacle is modular, extensible, and comes with support for six commonly used sequence aligners. It is easy to adapt Tentacle to different applications in metagenomics and easy to integrate into existing workflows. Evaluations show that Tentacle scales very well with increasing computing resources. We illustrate the versatility of Tentacle on three different use cases. Tentacle is written for Linux in Python 2.7 and is published as open source under the GNU General Public License (v3). Documentation, tutorials, installation instructions, and the source code are freely available online at: http://bioinformatics.math.chalmers.se/tentacle.

  5. Bracken: estimating species abundance in metagenomics data

    Jennifer Lu

    2017-01-01

    Full Text Available Metagenomic experiments attempt to characterize microbial communities using high-throughput DNA sequencing. Identification of the microorganisms in a sample provides information about the genetic profile, population structure, and role of microorganisms within an environment. Until recently, most metagenomics studies focused on high-level characterization at the level of phyla, or alternatively sequenced the 16S ribosomal RNA gene that is present in bacterial species. As the cost of sequencing has fallen, though, metagenomics experiments have increasingly used unbiased shotgun sequencing to capture all the organisms in a sample. This approach requires a method for estimating abundance directly from the raw read data. Here we describe a fast, accurate new method that computes the abundance at the species level using the reads collected in a metagenomics experiment. Bracken (Bayesian Reestimation of Abundance after Classification with KrakEN uses the taxonomic assignments made by Kraken, a very fast read-level classifier, along with information about the genomes themselves to estimate abundance at the species level, the genus level, or above. We demonstrate that Bracken can produce accurate species- and genus-level abundance estimates even when a sample contains multiple near-identical species.

  6. Investigation of the activity of the microorganisms in a Reblochon-style cheese by metatranscriptomic analysis

    Christophe eMonnet

    2016-04-01

    Full Text Available The microbial communities in cheeses are composed of varying bacteria, yeasts, and molds, which contribute to the development of their typical sensory properties. In situ studies are needed to better understand their growth and activity during cheese ripening. Our objective was to investigate the activity of the microorganisms used for manufacturing a surface-ripened cheese by means of metatranscriptomic analysis. The cheeses were produced using two lactic acid bacteria (Streptococcus thermophilus and Lactobacillus delbrueckii ssp. bulgaricus, one ripening bacterium (Brevibacterium aurantiacum, and two yeasts (Debaryomyces hansenii and Geotrichum candidum. RNA was extracted from the cheese rinds and, after depletion of most ribosomal RNA, sequencing was performed using a short-read sequencing technology that generated approximately 75 million reads per sample. Except for Brevibacterium aurantiacum, which failed to grow in the cheeses, a large number of CDS reads were generated for the inoculated species, making it possible to investigate their individual transcriptome over time. From day 5 to day 35, G. candidum accounted for the largest proportion of CDS reads, suggesting that this species was the most active. Only minor changes occurred in the transcriptomes of the lactic acid bacteria. For the two yeasts, we compared the expression of genes involved in the catabolism of lactose, galactose, lactate, amino acids and free fatty acids. During ripening, genes involved in ammonia assimilation and galactose catabolism were down-regulated in the two species. Genes involved in amino acid catabolism were up-regulated in G. candidum from day 14 to day 35, whereas in D. hansenii, they were up-regulated mainly at day 35, suggesting that this species catabolized the cheese amino acids later. In addition, after 35 days of ripening, there was a down-regulation of genes involved in the electron transport chain, suggesting a lower cellular activity. The

  7. Meta-Transcriptomic Analysis of a Chromate-Reducing Aquifer Microbial Community

    Beller, H. R.; Brodie, E. L.; Han, R.; Karaoz, U.

    2010-12-01

    A major challenge for microbial ecology that has become more tractable in the advent of new molecular techniques is characterizing gene expression in complex microbial communities. We are using meta-transcriptomic analysis to characterize functional changes in an aquifer-derived, chromate-reducing microbial community as it transitions through various electron-accepting conditions. We inoculated anaerobic microcosms with groundwater from the Cr-contaminated Hanford 100H site and supplemented them with lactate and electron acceptors present at the site, namely, nitrate, sulfate, and Fe(III). The microcosms progressed successively through various electron-accepting conditions (e.g., denitrifying, sulfate-reducing, and ferric iron-reducing conditions, as well as nitrate-dependent, chemolithotrophic Fe(II)-oxidizing conditions). Cr(VI) was rapidly reduced initially and again upon further Cr(VI) amendments. Extensive geochemical sampling and analysis (e.g., lactate, acetate, chloride, nitrate, nitrite, sulfate, dissolved Cr(VI), total Fe(II)), RNA/DNA harvesting, and PhyloChip analyses were conducted. Methods were developed for removal of rRNA from total RNA in preparation for meta-transcriptome sequencing. To date, samples representing denitrifying and fermentative/sulfate-reducing conditions have been sequenced using 454 Titanium technology. Of the non-rRNA related reads for the denitrifying sample (which was also actively reducing chromate), ca. 8% were associated with denitrification and ca. 0.9% were associated with chromate resistance/transport, in contrast to the fermentative/sulfate-reducing sample (in which chromate had already been reduced), which had zero reads associated with either of these categories but many predicted proteins associated with sulfate-reducing bacteria. We observed sequences for key functional transcripts that were unique at the nucleotide level compared to the GenBank non-redundant database [such as L-lactate dehydrogenase (iron

  8. Metaproteomics and metabolomics analyses of chronically petroleum-polluted sites reveal the importance of general anaerobic processes uncoupled with degradation.

    Bargiela, Rafael; Herbst, Florian-Alexander; Martínez-Martínez, Mónica; Seifert, Jana; Rojo, David; Cappello, Simone; Genovese, María; Crisafi, Francesca; Denaro, Renata; Chernikova, Tatyana N; Barbas, Coral; von Bergen, Martin; Yakimov, Michail M; Ferrer, Manuel; Golyshin, Peter N

    2015-10-01

    Crude oil is one of the most important natural assets for humankind, yet it is a major environmental pollutant, notably in marine environments. One of the largest crude oil polluted areas in the word is the semi-enclosed Mediterranean Sea, in which the metabolic potential of indigenous microbial populations towards the large-scale chronic pollution is yet to be defined, particularly in anaerobic and micro-aerophilic sites. Here, we provide an insight into the microbial metabolism in sediments from three chronically polluted marine sites along the coastline of Italy: the Priolo oil terminal/refinery site (near Siracuse, Sicily), harbour of Messina (Sicily) and shipwreck of MT Haven (near Genoa). Using shotgun metaproteomics and community metabolomics approaches, the presence of 651 microbial proteins and 4776 metabolite mass features have been detected in these three environments, revealing a high metabolic heterogeneity between the investigated sites. The proteomes displayed the prevalence of anaerobic metabolisms that were not directly related with petroleum biodegradation, indicating that in the absence of oxygen, biodegradation is significantly suppressed. This suppression was also suggested by examining the metabolome patterns. The proteome analysis further highlighted the metabolic coupling between methylotrophs and sulphate reducers in oxygen-depleted petroleum-polluted sediments. © 2015 The Authors. PROTEOMICS published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  9. Metaproteomics and metabolomics analyses of chronically petroleum‐polluted sites reveal the importance of general anaerobic processes uncoupled with degradation

    Bargiela, Rafael; Herbst, Florian‐Alexander; Martínez‐Martínez, Mónica; Seifert, Jana; Rojo, David; Cappello, Simone; Genovese, María; Crisafi, Francesca; Denaro, Renata; Chernikova, Tatyana N.; Barbas, Coral; von Bergen, Martin; Yakimov, Michail M.; Golyshin, Peter N.

    2015-01-01

    Crude oil is one of the most important natural assets for humankind, yet it is a major environmental pollutant, notably in marine environments. One of the largest crude oil polluted areas in the word is the semi‐enclosed Mediterranean Sea, in which the metabolic potential of indigenous microbial populations towards the large‐scale chronic pollution is yet to be defined, particularly in anaerobic and micro‐aerophilic sites. Here, we provide an insight into the microbial metabolism in sediments from three chronically polluted marine sites along the coastline of Italy: the Priolo oil terminal/refinery site (near Siracuse, Sicily), harbour of Messina (Sicily) and shipwreck of MT Haven (near Genoa). Using shotgun metaproteomics and community metabolomics approaches, the presence of 651 microbial proteins and 4776 metabolite mass features have been detected in these three environments, revealing a high metabolic heterogeneity between the investigated sites. The proteomes displayed the prevalence of anaerobic metabolisms that were not directly related with petroleum biodegradation, indicating that in the absence of oxygen, biodegradation is significantly suppressed. This suppression was also suggested by examining the metabolome patterns. The proteome analysis further highlighted the metabolic coupling between methylotrophs and sulphate reducers in oxygen‐depleted petroleum‐polluted sediments. PMID:26201687

  10. Solar Radiation Stress in Natural Acidophilic Biofilms of Euglena mutabilis Revealed by Metatranscriptomics and PAM Fluorometry.

    Puente-Sánchez, Fernando; Olsson, Sanna; Gómez-Rodriguez, Manuel; Souza-Egipsy, Virginia; Altamirano-Jeschke, Maria; Amils, Ricardo; Parro, Victor; Aguilera, Angeles

    2016-02-01

    The daily photosynthetic performance of a natural biofilm of the extreme acidophilic Euglena mutabilis from Río Tinto (SW, Spain) under full solar radiation was analyzed by means of pulse amplitude-modulated (PAM) fluorescence measurements and metatrascriptomic analysis. Natural E. mutabilis biofilms undergo large-scale transcriptomic reprogramming during midday due to a dynamic photoinhibition and solar radiation stress. Photoinhibition is due to UV radiation and not to light intensity, as revealed by PAM fluorometry analysis. In order to minimize the negative effects of solar radiation, our data supports the presence of a circadian rhythm in this euglenophyte that increases their opportunity to survive. Differential gene expression throughout the day (at 12:00, 20:00 and night) was monitored by massive Illumina parallel sequencing of metatranscriptomic libraries. The transcription pattern was altered in genes involved in Photosystem II stability and repair, UV damaged DNA repair, non-photochemical quenching and oxidative stress, supporting the photoinhibition detected by PAM fluorometry at midday. Copyright © 2016 Elsevier GmbH. All rights reserved.

  11. Metatranscriptome analysis of the reef-building coral Orbicella faveolata indicates holobiont response to coral disease

    Daniels, Camille Arian

    2015-09-11

    White Plague Disease (WPD) is implicated in coral reef decline in the Caribbean and is characterized by microbial community shifts in coral mucus and tissue. Studies thus far have focused on assessing microbial communities or the identification of specific pathogens, yet few have addressed holobiont response across metaorganism compartments in coral disease. Here, we report on the first metatranscriptomic assessment of the coral host, algal symbiont, and microbial compartment in order to survey holobiont structure and function in healthy and diseased samples from Orbicella faveolata collected at reef sites off Puerto Rico. Our data indicate holobiont-wide as well as compartment-specific responses to WPD. Gene expression changes in the diseased coral host involved proteins playing a role in innate immunity, cytoskeletal integrity, cell adhesion, oxidative stress, chemical defense, and retroelements. In contrast, the algal symbiont showed comparatively few expression changes, but of large magnitude, of genes related to stress, photosynthesis, and metal transport. Concordant with the coral host response, the bacterial compartment showed increased abundance of heat shock proteins, genes related to oxidative stress, DNA repair, and potential retroelement activity. Importantly, analysis of the expressed bacterial gene functions establishes the participation of multiple bacterial families in WPD pathogenesis and also suggests a possible involvement of viruses and/or phages in structuring the bacterial assemblage. In this study, we implement an experimental approach to partition the coral holobiont and resolve compartment- and taxa-specific responses in order to understand metaorganism function in coral disease.

  12. Metatranscriptome Analysis of Fig Flowers Provides Insights into Potential Mechanisms for Mutualism Stability and Gall Induction.

    Ellen O Martinson

    Full Text Available A striking property of the mutualism between figs and their pollinating wasps is that wasps consistently oviposit in the inner flowers of the fig syconium, which develop into galls that house developing larvae. Wasps typically do not use the outer ring of flowers, which develop into seeds. To better understand differences between gall and seed flowers, we used a metatranscriptomic approach to analyze eukaryotic gene expression within fig flowers at the time of oviposition choice and early gall development. Consistent with the unbeatable seed hypothesis, we found significant differences in gene expression between gall- and seed flowers in receptive syconia prior to oviposition. In particular, transcripts assigned to flavonoids and carbohydrate metabolism were significantly up-regulated in gall flowers relative to seed flowers. In response to oviposition, gall flowers significantly up-regulated the expression of chalcone synthase, which previously has been connected to gall formation in other plants. We propose several genes encoding proteins with signal peptides or associations with venom of other Hymenoptera as candidate genes for gall initiation or growth. This study simultaneously evaluates the gene expression profile of both mutualistic partners in a plant-insect mutualism and provides insight into a possible stability mechanism in the ancient fig-fig wasp association.

  13. Metatranscriptome analysis of the reef-buidling coral Orbicella faveolata indicates holobiont response to coral disease

    Camille eDaniels

    2015-09-01

    Full Text Available White Plague Disease (WPD is implicated in coral reef decline in the Caribbean and is characterized by microbial community shifts in coral mucus and tissue. Studies thus far have focused on assessing microbial communities or the identification of specific pathogens, yet few have addressed holobiont response across metaorganism compartments in coral disease. Here, we report on the first metatranscriptomic assessment of the coral host, algal symbiont, and microbial compartment in order to survey holobiont structure and function in healthy and diseased samples from Orbicella faveolata collected at reef sites off Puerto Rico. Our data indicate metaorganism-wide as well as compartment-specific responses to WPD. Gene expression changes in the diseased coral host involved proteins playing a role in innate immunity, cytoskeletal integrity, cell adhesion, oxidative stress, chemical defense, and retroelements. In contrast, the algal symbiont showed comparatively few expression changes, but of large magnitude, of genes related to stress, photosynthesis, and metal transport. Concordant with the coral host response, the bacterial compartment showed increased abundance of heat shock proteins, genes related to oxidative stress, DNA repair, and potential retroelement activity. Importantly, analysis of the expressed bacterial gene functions establishes the participation of multiple bacterial families in WPD pathogenesis and also suggests a possible involvement of viruses and/or phages in structuring the bacterial assemblage. In this study, we implement an experimental approach to partition the coral holobiont and resolve compartment- and taxa-specific responses in order to understand metaorganism function in coral disease.

  14. Metatranscriptome analysis of the reef-building coral Orbicella faveolata indicates holobiont response to coral disease

    Daniels, Camille Arian; Baumgarten, Sebastian; Yum, Lauren; Michell, Craig; Bayer, Till; Arif, Chatchanit; Roder, Cornelia; Weil, Ernesto; Voolstra, Christian R.

    2015-01-01

    White Plague Disease (WPD) is implicated in coral reef decline in the Caribbean and is characterized by microbial community shifts in coral mucus and tissue. Studies thus far have focused on assessing microbial communities or the identification of specific pathogens, yet few have addressed holobiont response across metaorganism compartments in coral disease. Here, we report on the first metatranscriptomic assessment of the coral host, algal symbiont, and microbial compartment in order to survey holobiont structure and function in healthy and diseased samples from Orbicella faveolata collected at reef sites off Puerto Rico. Our data indicate holobiont-wide as well as compartment-specific responses to WPD. Gene expression changes in the diseased coral host involved proteins playing a role in innate immunity, cytoskeletal integrity, cell adhesion, oxidative stress, chemical defense, and retroelements. In contrast, the algal symbiont showed comparatively few expression changes, but of large magnitude, of genes related to stress, photosynthesis, and metal transport. Concordant with the coral host response, the bacterial compartment showed increased abundance of heat shock proteins, genes related to oxidative stress, DNA repair, and potential retroelement activity. Importantly, analysis of the expressed bacterial gene functions establishes the participation of multiple bacterial families in WPD pathogenesis and also suggests a possible involvement of viruses and/or phages in structuring the bacterial assemblage. In this study, we implement an experimental approach to partition the coral holobiont and resolve compartment- and taxa-specific responses in order to understand metaorganism function in coral disease.

  15. Metagenomic studies of the Red Sea.

    Behzad, Hayedeh; Ibarra, Martin Augusto; Mineta, Katsuhiko; Gojobori, Takashi

    2016-02-01

    Metagenomics has significantly advanced the field of marine microbial ecology, revealing the vast diversity of previously unknown microbial life forms in different marine niches. The tremendous amount of data generated has enabled identification of a large number of microbial genes (metagenomes), their community interactions, adaptation mechanisms, and their potential applications in pharmaceutical and biotechnology-based industries. Comparative metagenomics reveals that microbial diversity is a function of the local environment, meaning that unique or unusual environments typically harbor novel microbial species with unique genes and metabolic pathways. The Red Sea has an abundance of unique characteristics; however, its microbiota is one of the least studied among marine environments. The Red Sea harbors approximately 25 hot anoxic brine pools, plus a vibrant coral reef ecosystem. Physiochemical studies describe the Red Sea as an oligotrophic environment that contains one of the warmest and saltiest waters in the world with year-round high UV radiations. These characteristics are believed to have shaped the evolution of microbial communities in the Red Sea. Over-representation of genes involved in DNA repair, high-intensity light responses, and osmoregulation were found in the Red Sea metagenomic databases suggesting acquisition of specific environmental adaptation by the Red Sea microbiota. The Red Sea brine pools harbor a diverse range of halophilic and thermophilic bacterial and archaeal communities, which are potential sources of enzymes for pharmaceutical and biotechnology-based application. Understanding the mechanisms of these adaptations and their function within the larger ecosystem could also prove useful in light of predicted global warming scenarios where global ocean temperatures are expected to rise by 1-3°C in the next few decades. In this review, we provide an overview of the published metagenomic studies that were conducted in the Red Sea, and

  16. Laboratory procedures to generate viral metagenomes.

    Thurber, Rebecca V; Haynes, Matthew; Breitbart, Mya; Wegley, Linda; Rohwer, Forest

    2009-01-01

    This collection of laboratory protocols describes the steps to collect viruses from various samples with the specific aim of generating viral metagenome sequence libraries (viromes). Viral metagenomics, the study of uncultured viral nucleic acid sequences from different biomes, relies on several concentration, purification, extraction, sequencing and heuristic bioinformatic methods. No single technique can provide an all-inclusive approach, and therefore the protocols presented here will be discussed in terms of hypothetical projects. However, care must be taken to individualize each step depending on the source and type of viral-particles. This protocol is a description of the processes we have successfully used to: (i) concentrate viral particles from various types of samples, (ii) eliminate contaminating cells and free nucleic acids and (iii) extract, amplify and purify viral nucleic acids. Overall, a sample can be processed to isolate viral nucleic acids suitable for high-throughput sequencing in approximately 1 week.

  17. Genomics and metagenomics in medical microbiology.

    Padmanabhan, Roshan; Mishra, Ajay Kumar; Raoult, Didier; Fournier, Pierre-Edouard

    2013-12-01

    Over the last two decades, sequencing tools have evolved from laborious time-consuming methodologies to real-time detection and deciphering of genomic DNA. Genome sequencing, especially using next generation sequencing (NGS) has revolutionized the landscape of microbiology and infectious disease. This deluge of sequencing data has not only enabled advances in fundamental biology but also helped improve diagnosis, typing of pathogen, virulence and antibiotic resistance detection, and development of new vaccines and culture media. In addition, NGS also enabled efficient analysis of complex human micro-floras, both commensal, and pathological, through metagenomic methods, thus helping the comprehension and management of human diseases such as obesity. This review summarizes technological advances in genomics and metagenomics relevant to the field of medical microbiology. Copyright © 2013 Elsevier B.V. All rights reserved.

  18. Construction and screening of marine metagenomic libraries.

    Weiland, Nancy; Löscher, Carolin; Metzger, Rebekka; Schmitz, Ruth

    2010-01-01

    Marine microbial communities are highly diverse and have evolved during extended evolutionary processes of physiological adaptations under the influence of a variety of ecological conditions and selection pressures. They harbor an enormous diversity of microbes with still unknown and probably new physiological characteristics. Besides, the surfaces of marine multicellular organisms are typically covered by a consortium of epibiotic bacteria and act as barriers, where diverse interactions between microorganisms and hosts take place. Thus, microbial diversity in the water column of the oceans and the microbial consortia on marine tissues of multicellular organisms are rich sources for isolating novel bioactive compounds and genes. Here we describe the sampling, construction of large-insert metagenomic libraries from marine habitats and exemplarily one function based screen of metagenomic clones.

  19. An Experimental Metagenome Data Management and AnalysisSystem

    Markowitz, Victor M.; Korzeniewski, Frank; Palaniappan, Krishna; Szeto, Ernest; Ivanova, Natalia N.; Kyrpides, Nikos C.; Hugenholtz, Philip

    2006-03-01

    The application of shotgun sequencing to environmental samples has revealed a new universe of microbial community genomes (metagenomes) involving previously uncultured organisms. Metagenome analysis, which is expected to provide a comprehensive picture of the gene functions and metabolic capacity of microbial community, needs to be conducted in the context of a comprehensive data management and analysis system. We present in this paper IMG/M, an experimental metagenome data management and analysis system that is based on the Integrated Microbial Genomes (IMG) system. IMG/M provides tools and viewers for analyzing both metagenomes and isolate genomes individually or in a comparative context.

  20. MetaQUAST: evaluation of metagenome assemblies.

    Mikheenko, Alla; Saveliev, Vladislav; Gurevich, Alexey

    2016-04-01

    During the past years we have witnessed the rapid development of new metagenome assembly methods. Although there are many benchmark utilities designed for single-genome assemblies, there is no well-recognized evaluation and comparison tool for metagenomic-specific analogues. In this article, we present MetaQUAST, a modification of QUAST, the state-of-the-art tool for genome assembly evaluation based on alignment of contigs to a reference. MetaQUAST addresses such metagenome datasets features as (i) unknown species content by detecting and downloading reference sequences, (ii) huge diversity by giving comprehensive reports for multiple genomes and (iii) presence of highly relative species by detecting chimeric contigs. We demonstrate MetaQUAST performance by comparing several leading assemblers on one simulated and two real datasets. http://bioinf.spbau.ru/metaquast aleksey.gurevich@spbu.ru Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  1. Phylogenetic convolutional neural networks in metagenomics.

    Fioravanti, Diego; Giarratano, Ylenia; Maggio, Valerio; Agostinelli, Claudio; Chierici, Marco; Jurman, Giuseppe; Furlanello, Cesare

    2018-03-08

    Convolutional Neural Networks can be effectively used only when data are endowed with an intrinsic concept of neighbourhood in the input space, as is the case of pixels in images. We introduce here Ph-CNN, a novel deep learning architecture for the classification of metagenomics data based on the Convolutional Neural Networks, with the patristic distance defined on the phylogenetic tree being used as the proximity measure. The patristic distance between variables is used together with a sparsified version of MultiDimensional Scaling to embed the phylogenetic tree in a Euclidean space. Ph-CNN is tested with a domain adaptation approach on synthetic data and on a metagenomics collection of gut microbiota of 38 healthy subjects and 222 Inflammatory Bowel Disease patients, divided in 6 subclasses. Classification performance is promising when compared to classical algorithms like Support Vector Machines and Random Forest and a baseline fully connected neural network, e.g. the Multi-Layer Perceptron. Ph-CNN represents a novel deep learning approach for the classification of metagenomics data. Operatively, the algorithm has been implemented as a custom Keras layer taking care of passing to the following convolutional layer not only the data but also the ranked list of neighbourhood of each sample, thus mimicking the case of image data, transparently to the user.

  2. A retrospective metagenomics approach to studying Blastocystis.

    Andersen, Lee O'Brien; Bonde, Ida; Nielsen, Henrik Bjørn; Stensvold, Christen Rune

    2015-07-01

    Blastocystis is a common single-celled intestinal parasitic genus, comprising several subtypes. Here, we screened data obtained by metagenomic analysis of faecal DNA for Blastocystis by searching for subtype-specific genes in coabundance gene groups, which are groups of genes that covary across a selection of 316 human faecal samples, hence representing genes originating from a single subtype. The 316 faecal samples were from 236 healthy individuals, 13 patients with Crohn's disease (CD) and 67 patients with ulcerative colitis (UC). The prevalence of Blastocystis was 20.3% in the healthy individuals and 14.9% in patients with UC. Meanwhile, Blastocystis was absent in patients with CD. Individuals with intestinal microbiota dominated by Bacteroides were much less prone to having Blastocystis-positive stool (Matthew's correlation coefficient = -0.25, P < 0.0001) than individuals with Ruminococcus- and Prevotella-driven enterotypes. This is the first study to investigate the relationship between Blastocystis and communities of gut bacteria using a metagenomics approach. The study serves as an example of how it is possible to retrospectively investigate microbial eukaryotic communities in the gut using metagenomic datasets targeting the bacterial component of the intestinal microbiome and the interplay between these microbial communities. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  3. Bayesian mixture analysis for metagenomic community profiling.

    Morfopoulou, Sofia; Plagnol, Vincent

    2015-09-15

    Deep sequencing of clinical samples is now an established tool for the detection of infectious pathogens, with direct medical applications. The large amount of data generated produces an opportunity to detect species even at very low levels, provided that computational tools can effectively profile the relevant metagenomic communities. Data interpretation is complicated by the fact that short sequencing reads can match multiple organisms and by the lack of completeness of existing databases, in particular for viral pathogens. Here we present metaMix, a Bayesian mixture model framework for resolving complex metagenomic mixtures. We show that the use of parallel Monte Carlo Markov chains for the exploration of the species space enables the identification of the set of species most likely to contribute to the mixture. We demonstrate the greater accuracy of metaMix compared with relevant methods, particularly for profiling complex communities consisting of several related species. We designed metaMix specifically for the analysis of deep transcriptome sequencing datasets, with a focus on viral pathogen detection; however, the principles are generally applicable to all types of metagenomic mixtures. metaMix is implemented as a user friendly R package, freely available on CRAN: http://cran.r-project.org/web/packages/metaMix sofia.morfopoulou.10@ucl.ac.uk Supplementary data are available at Bionformatics online. © The Author 2015. Published by Oxford University Press.

  4. Comparative metatranscriptomics identifies molecular bases for the physiological responses of phytoplankton to varying iron availability.

    Marchetti, Adrian; Schruth, David M; Durkin, Colleen A; Parker, Micaela S; Kodner, Robin B; Berthiaume, Chris T; Morales, Rhonda; Allen, Andrew E; Armbrust, E Virginia

    2012-02-07

    In vast expanses of the oceans, growth of large phytoplankton such as diatoms is limited by iron availability. Diatoms respond almost immediately to the delivery of iron and rapidly compose the majority of phytoplankton biomass. The molecular bases underlying the subsistence of diatoms in iron-poor waters and the plankton community dynamics that follow iron resupply remain largely unknown. Here we use comparative metatranscriptomics to identify changes in gene expression associated with iron-stimulated growth of diatoms and other eukaryotic plankton. A microcosm iron-enrichment experiment using mixed-layer waters from the northeastern Pacific Ocean resulted in increased proportions of diatom transcripts and reduced proportions of transcripts from most other taxa within 98 h after iron addition. Hundreds of diatom genes were differentially expressed in the iron-enriched community compared with the iron-limited community; transcripts of diatom genes required for synthesis of photosynthesis and chlorophyll components, nitrate assimilation and the urea cycle, and synthesis of carbohydrate storage compounds were significantly overrepresented. Transcripts of genes encoding rhodopsins in eukaryotic phytoplankton were significantly underrepresented following iron enrichment, suggesting rhodopsins help cells cope with low-iron conditions. Oceanic diatoms appear to display a distinctive transcriptional response to iron enrichment that allows chemical reduction of available nitrogen and carbon sources along with a continued dependence on iron-free photosynthetic proteins rather than substituting for iron-containing functional equivalents present within their gene repertoire. This ability of diatoms to divert their newly acquired iron toward nitrate assimilation may underlie why diatoms consistently dominate iron enrichments in high-nitrate, low-chlorophyll regions.

  5. Dissection of Microbial Community Functions during a Cyanobacterial Bloom in the Baltic Sea via Metatranscriptomics

    Carlo Berg

    2018-02-01

    Full Text Available Marine and brackish surface waters are highly dynamic habitats that undergo repeated seasonal variations in microbial community composition and function throughout time. While succession of the various microbial groups has been well investigated, little is known about the underlying gene-expression of the microbial community. We investigated microbial interactions via metatranscriptomics over a spring to fall seasonal cycle in the brackish Baltic Sea surface waters, a temperate brackish water ecosystem periodically promoting massive cyanobacterial blooms, which have implications for primary production, nutrient cycling, and expansion of hypoxic zones. Network analysis of the gene expression of all microbes from 0.22 to 200 μm in size and of the major taxonomic groups dissected the seasonal cycle into four components that comprised genes peaking during different periods of the bloom. Photoautotrophic nitrogen-fixing Cyanobacteria displayed the highest connectivity among the microbes, in contrast to chemoautotrophic ammonia-oxidizing Thaumarchaeota, while heterotrophs dominated connectivity among pre- and post-bloom peaking genes. The network was also composed of distinct functional connectivities, with an early season balance between carbon metabolism and ATP synthesis shifting to a dominance of ATP synthesis during the bloom, while carbon degradation, specifically through the glyoxylate shunt, characterized the post-bloom period, driven by Alphaproteobacteria as well as by Gammaproteobacteria of the SAR86 and SAR92 clusters. Our study stresses the exceptionally strong biotic driving force executed by cyanobacterial blooms on associated microbial communities in the Baltic Sea and highlights the impact cyanobacterial blooms have on functional microbial community composition.

  6. A Metaproteomics Approach to Elucidate Host and Pathogen Protein Expression during Catheter-Associated Urinary Tract Infections (CAUTIs)

    Lassek, Christian; Burghartz, Melanie; Chaves-Moreno, Diego; Otto, Andreas; Hentschker, Christian; Fuchs, Stephan; Bernhardt, Jörg; Jauregui, Ruy; Neubauer, Rüdiger; Becher, Dörte; Pieper, Dietmar H.; Jahn, Martina; Jahn, Dieter; Riedel, Katharina

    2015-01-01

    Long-term catheterization inevitably leads to a catheter-associated bacteriuria caused by multispecies bacterial biofilms growing on and in the catheters. The overall goal of the presented study was (1) to unravel bacterial community structure and function of such a uropathogenic biofilm and (2) to elucidate the interplay between bacterial virulence and the human immune system within the urine. To this end, a metaproteomics approach combined with in vitro proteomics analyses was employed to investigate both, the pro- and eukaryotic protein inventory. Our proteome analyses demonstrated that the biofilm of the investigated catheter is dominated by three bacterial species, that is, Pseudomonas aeruginosa, Morganella morganii, and Bacteroides sp., and identified iron limitation as one of the major challenges in the bladder environment. In vitro proteome analysis of P. aeruginosa and M. morganii isolated from the biofilm revealed that these opportunistic pathogens are able to overcome iron restriction via the production of siderophores and high expression of corresponding receptors. Notably, a comparison of in vivo and in vitro protein profiles of P. aeruginosa and M. morganii also indicated that the bacteria employ different strategies to adapt to the urinary tract. Although P. aeruginosa seems to express secreted and surface-exposed proteases to escape the human innate immune system and metabolizes amino acids, M. morganii is able to take up sugars and to degrade urea. Most interestingly, a comparison of urine protein profiles of three long-term catheterized patients and three healthy control persons demonstrated the elevated level of proteins associated with neutrophils, macrophages, and the complement system in the patient's urine, which might point to a specific activation of the innate immune system in response to biofilm-associated urinary tract infections. We thus hypothesize that the often asymptomatic nature of catheter-associated urinary tract infections

  7. SmashCommunity: A metagenomic annotation and analysis tool

    Arumugam, Manimozhiyan; Harrington, Eoghan D; Foerstner, Konrad U

    2010-01-01

    the quantitative phylogenetic and functional compositions of metagenomes, to compare compositions of multiple metagenomes and to produce intuitive visual representations of such analyses. AVAILABILITY: SmashCommunity is freely available at http://www.bork.embl.de/software/smash CONTACT: bork@embl.de....

  8. Antibiotic Resistome: Improving Detection and Quantification Accuracy for Comparative Metagenomics.

    Elbehery, Ali H A; Aziz, Ramy K; Siam, Rania

    2016-04-01

    The unprecedented rise of life-threatening antibiotic resistance (AR), combined with the unparalleled advances in DNA sequencing of genomes and metagenomes, has pushed the need for in silico detection of the resistance potential of clinical and environmental metagenomic samples through the quantification of AR genes (i.e., genes conferring antibiotic resistance). Therefore, determining an optimal methodology to quantitatively and accurately assess AR genes in a given environment is pivotal. Here, we optimized and improved existing AR detection methodologies from metagenomic datasets to properly consider AR-generating mutations in antibiotic target genes. Through comparative metagenomic analysis of previously published AR gene abundance in three publicly available metagenomes, we illustrate how mutation-generated resistance genes are either falsely assigned or neglected, which alters the detection and quantitation of the antibiotic resistome. In addition, we inspected factors influencing the outcome of AR gene quantification using metagenome simulation experiments, and identified that genome size, AR gene length, total number of metagenomics reads and selected sequencing platforms had pronounced effects on the level of detected AR. In conclusion, our proposed improvements in the current methodologies for accurate AR detection and resistome assessment show reliable results when tested on real and simulated metagenomic datasets.

  9. Unlocking the potential of metagenomics through replicated experimental design

    Knight, R.; Jansson, J.; Field, D.; Fierer, N.; Desai, N.; Fuhrman, J.A.; Hugenholtz, P.; Van der Lelie, D.; Meyer, F.; Stevens, R.; Bailey, M.J.; Gordon, J.I.; Kowalchuk, G.A.; Gilbert, J.A.

    2012-01-01

    Metagenomics holds enormous promise for discovering novel enzymes and organisms that are biomarkers or drivers of processes relevant to disease, industry and the environment. In the past two years, we have seen a paradigm shift in metagenomics to the application of cross-sectional and longitudinal

  10. Unlocking the potential of metagenomics through replicated experimental design.

    Knight, R.; Jansson, J.; Field, D.; Fierer, N.; Desai, N.; Fuhrman, J.A.; Hugenholtz, P.; van der Lelie, D.; Meyer, F.; Stevens, R.; Bailey, M.J.; Gordon, J.I.; Kowalchuk, G.A.; Gilbert, J.A.

    2012-01-01

    Metagenomics holds enormous promise for discovering novel enzymes and organisms that are biomarkers or drivers of processes relevant to disease, industry and the environment. In the past two years, we have seen a paradigm shift in metagenomics to the application of cross-sectional and longitudinal

  11. Cross-cutting activities: Soil quality and soil metagenomics

    Motavalli, Peter P.; Garrett, Karen A.

    2008-01-01

    This presentation reports on the work of the SANREM CRSP cross-cutting activities "Assessing and Managing Soil Quality for Sustainable Agricultural Systems" and "Soil Metagenomics to Construct Indicators of Soil Degradation." The introduction gives an overview of the extensiveness of soil degradation globally and defines soil quality. The objectives of the soil quality cross cutting activity are: CCRA-4 (Soil Metagenomics)

  12. Critical Assessment of Metagenome Interpretation – a benchmark of computational metagenomics software

    Sczyrba, Alexander; Hofmann, Peter; Belmann, Peter; Koslicki, David; Janssen, Stefan; Dröge, Johannes; Gregor, Ivan; Majda, Stephan; Fiedler, Jessika; Dahms, Eik; Bremges, Andreas; Fritz, Adrian; Garrido-Oter, Ruben; Jørgensen, Tue Sparholt; Shapiro, Nicole; Blood, Philip D.; Gurevich, Alexey; Bai, Yang; Turaev, Dmitrij; DeMaere, Matthew Z.; Chikhi, Rayan; Nagarajan, Niranjan; Quince, Christopher; Meyer, Fernando; Balvočiūtė, Monika; Hansen, Lars Hestbjerg; Sørensen, Søren J.; Chia, Burton K. H.; Denis, Bertrand; Froula, Jeff L.; Wang, Zhong; Egan, Robert; Kang, Dongwan Don; Cook, Jeffrey J.; Deltel, Charles; Beckstette, Michael; Lemaitre, Claire; Peterlongo, Pierre; Rizk, Guillaume; Lavenier, Dominique; Wu, Yu-Wei; Singer, Steven W.; Jain, Chirag; Strous, Marc; Klingenberg, Heiner; Meinicke, Peter; Barton, Michael; Lingner, Thomas; Lin, Hsin-Hung; Liao, Yu-Chieh; Silva, Genivaldo Gueiros Z.; Cuevas, Daniel A.; Edwards, Robert A.; Saha, Surya; Piro, Vitor C.; Renard, Bernhard Y.; Pop, Mihai; Klenk, Hans-Peter; Göker, Markus; Kyrpides, Nikos C.; Woyke, Tanja; Vorholt, Julia A.; Schulze-Lefert, Paul; Rubin, Edward M.; Darling, Aaron E.; Rattei, Thomas; McHardy, Alice C.

    2018-01-01

    In metagenome analysis, computational methods for assembly, taxonomic profiling and binning are key components facilitating downstream biological data interpretation. However, a lack of consensus about benchmarking datasets and evaluation metrics complicates proper performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on datasets of unprecedented complexity and realism. Benchmark metagenomes were generated from ~700 newly sequenced microorganisms and ~600 novel viruses and plasmids, including genomes with varying degrees of relatedness to each other and to publicly available ones and representing common experimental setups. Across all datasets, assembly and genome binning programs performed well for species represented by individual genomes, while performance was substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below the family level. Parameter settings substantially impacted performances, underscoring the importance of program reproducibility. While highlighting current challenges in computational metagenomics, the CAMI results provide a roadmap for software selection to answer specific research questions. PMID:28967888

  13. Metagenomics and Bioinformatics in Microbial Ecology: Current Status and Beyond.

    Hiraoka, Satoshi; Yang, Ching-Chia; Iwasaki, Wataru

    2016-09-29

    Metagenomic approaches are now commonly used in microbial ecology to study microbial communities in more detail, including many strains that cannot be cultivated in the laboratory. Bioinformatic analyses make it possible to mine huge metagenomic datasets and discover general patterns that govern microbial ecosystems. However, the findings of typical metagenomic and bioinformatic analyses still do not completely describe the ecology and evolution of microbes in their environments. Most analyses still depend on straightforward sequence similarity searches against reference databases. We herein review the current state of metagenomics and bioinformatics in microbial ecology and discuss future directions for the field. New techniques will allow us to go beyond routine analyses and broaden our knowledge of microbial ecosystems. We need to enrich reference databases, promote platforms that enable meta- or comprehensive analyses of diverse metagenomic datasets, devise methods that utilize long-read sequence information, and develop more powerful bioinformatic methods to analyze data from diverse perspectives.

  14. Metaproteomics analysis of the functional insights into microbial communities of combined hydrogen and methane production by anaerobic fermentation from reed straw.

    Xuan Jia

    Full Text Available A metaproteomic approach was used to analyse the proteins expressed and provide functional evidence of key metabolic pathways in the combined production of hydrogen and methane by anaerobic fermentation (CHMP-AF for reed straw utilisation. The functions and structures of bacteria and archaea populations show significant succession in the CHMP-AF process. There are many kinds of bacterial functional proteins, mainly belonging to phyla Firmicutes, Proteobacteria, Actinobacteria and Bacteroidetes, that are involved in carbohydrate metabolism, energy metabolism, lipid metabolism, and amino acid metabolism. Ferredoxin-NADP reductase, present in bacteria in genus Azotobacter, is an important enzyme for NADH/NAD+ equilibrium regulation in hydrogen production. The archaeal functional proteins are mainly involved in methane metabolism in energy metabolism, such as acetyl-CoA decarboxylase, and methyl-coenzyme M reductase, and the acetic acid pathway exhibited the highest proportion of the total. The archaea of genus Methanosarcina in phylum Euryarchaeota can produce methane under the effect of multi-functional proteins through acetic acid, CO2 reduction, and methyl nutrient pathways. The study demonstrates metaproteomics as a new way of uncovering community functional and metabolic activity. The combined information was used to identify the metabolic pathways and organisms crucial for lignocellulosic biomass degradation and biogas production. This also regulates the process from its protein levels and improves the efficiency of biogas production using reed straw biomass.

  15. Symbiotic Interplay of Fungi, Algae, and Bacteria within the Lung Lichen Lobaria pulmonaria L. Hoffm. as Assessed by State-of-the-Art Metaproteomics.

    Eymann, Christine; Lassek, Christian; Wegner, Uwe; Bernhardt, Jörg; Fritsch, Ole Arno; Fuchs, Stephan; Otto, Andreas; Albrecht, Dirk; Schiefelbein, Ulf; Cernava, Tomislav; Aschenbrenner, Ines; Berg, Gabriele; Grube, Martin; Riedel, Katharina

    2017-06-02

    Lichens are recognized by macroscopic structures formed by a heterotrophic fungus, the mycobiont, which hosts internal autotrophic photosynthetic algal and/or cyanobacterial partners, referred to as the photobiont. We analyzed the structure and functionality of the entire lung lichen Lobaria pulmonaria L. Hoffm. collected from two different sites by state-of-the-art metaproteomics. In addition to the green algae and the ascomycetous fungus, a lichenicolous fungus as well as a complex prokaryotic community (different from the cyanobacteria) was found, the latter dominated by methanotrophic Rhizobiales. Various partner-specific proteins could be assigned to the different lichen symbionts, for example, fungal proteins involved in vesicle transport, algal proteins functioning in photosynthesis, cyanobacterial nitrogenase and GOGAT involved in nitrogen fixation, and bacterial enzymes responsible for methanol/C1-compound metabolism as well as CO-detoxification. Structural and functional information on proteins expressed by the lichen community complemented and extended our recent symbiosis model depicting the functional multiplayer network of single holobiont partners.1 Our new metaproteome analysis strongly supports the hypothesis (i) that interactions within the self-supporting association are multifaceted and (ii) that the strategy of functional diversification within the single lichen partners may support the longevity of L. pulmonaria under certain ecological conditions.

  16. Metaproteomics analysis of the functional insights into microbial communities of combined hydrogen and methane production by anaerobic fermentation from reed straw

    Yang, Yang; Wang, Yong

    2017-01-01

    A metaproteomic approach was used to analyse the proteins expressed and provide functional evidence of key metabolic pathways in the combined production of hydrogen and methane by anaerobic fermentation (CHMP-AF) for reed straw utilisation. The functions and structures of bacteria and archaea populations show significant succession in the CHMP-AF process. There are many kinds of bacterial functional proteins, mainly belonging to phyla Firmicutes, Proteobacteria, Actinobacteria and Bacteroidetes, that are involved in carbohydrate metabolism, energy metabolism, lipid metabolism, and amino acid metabolism. Ferredoxin-NADP reductase, present in bacteria in genus Azotobacter, is an important enzyme for NADH/NAD+ equilibrium regulation in hydrogen production. The archaeal functional proteins are mainly involved in methane metabolism in energy metabolism, such as acetyl-CoA decarboxylase, and methyl-coenzyme M reductase, and the acetic acid pathway exhibited the highest proportion of the total. The archaea of genus Methanosarcina in phylum Euryarchaeota can produce methane under the effect of multi-functional proteins through acetic acid, CO2 reduction, and methyl nutrient pathways. The study demonstrates metaproteomics as a new way of uncovering community functional and metabolic activity. The combined information was used to identify the metabolic pathways and organisms crucial for lignocellulosic biomass degradation and biogas production. This also regulates the process from its protein levels and improves the efficiency of biogas production using reed straw biomass. PMID:28817657

  17. Exploiting HPC Platforms for Metagenomics: Challenges and Opportunities (MICW - Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Canon, Shane

    2011-10-12

    DOE JGI's Zhong Wang, chair of the High-performance Computing session, gives a brief introduction before Berkeley Lab's Shane Canon talks about "Exploiting HPC Platforms for Metagenomics: Challenges and Opportunities" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  18. Analysis of composition-based metagenomic classification.

    Higashi, Susan; Barreto, André da Motta Salles; Cantão, Maurício Egidio; de Vasconcelos, Ana Tereza Ribeiro

    2012-01-01

    An essential step of a metagenomic study is the taxonomic classification, that is, the identification of the taxonomic lineage of the organisms in a given sample. The taxonomic classification process involves a series of decisions. Currently, in the context of metagenomics, such decisions are usually based on empirical studies that consider one specific type of classifier. In this study we propose a general framework for analyzing the impact that several decisions can have on the classification problem. Instead of focusing on any specific classifier, we define a generic score function that provides a measure of the difficulty of the classification task. Using this framework, we analyze the impact of the following parameters on the taxonomic classification problem: (i) the length of n-mers used to encode the metagenomic sequences, (ii) the similarity measure used to compare sequences, and (iii) the type of taxonomic classification, which can be conventional or hierarchical, depending on whether the classification process occurs in a single shot or in several steps according to the taxonomic tree. We defined a score function that measures the degree of separability of the taxonomic classes under a given configuration induced by the parameters above. We conducted an extensive computational experiment and found out that reasonable values for the parameters of interest could be (i) intermediate values of n, the length of the n-mers; (ii) any similarity measure, because all of them resulted in similar scores; and (iii) the hierarchical strategy, which performed better in all of the cases. As expected, short n-mers generate lower configuration scores because they give rise to frequency vectors that represent distinct sequences in a similar way. On the other hand, large values for n result in sparse frequency vectors that represent differently metagenomic fragments that are in fact similar, also leading to low configuration scores. Regarding the similarity measure, in

  19. De-MetaST-BLAST: a tool for the validation of degenerate primer sets and data mining of publicly available metagenomes.

    Christopher A Gulvik

    Full Text Available Development and use of primer sets to amplify nucleic acid sequences of interest is fundamental to studies spanning many life science disciplines. As such, the validation of primer sets is essential. Several computer programs have been created to aid in the initial selection of primer sequences that may or may not require multiple nucleotide combinations (i.e., degeneracies. Conversely, validation of primer specificity has remained largely unchanged for several decades, and there are currently few available programs that allows for an evaluation of primers containing degenerate nucleotide bases. To alleviate this gap, we developed the program De-MetaST that performs an in silico amplification using user defined nucleotide sequence dataset(s and primer sequences that may contain degenerate bases. The program returns an output file that contains the in silico amplicons. When De-MetaST is paired with NCBI's BLAST (De-MetaST-BLAST, the program also returns the top 10 nr NCBI database hits for each recovered in silico amplicon. While the original motivation for development of this search tool was degenerate primer validation using the wealth of nucleotide sequences available in environmental metagenome and metatranscriptome databases, this search tool has potential utility in many data mining applications.

  20. Coral-zooxanthellae meta-transcriptomics reveals integrated response to pollutant stress.

    Gust, Kurt A; Najar, Fares Z; Habib, Tanwir; Lotufo, Guilherme R; Piggot, Alan M; Fouke, Bruce W; Laird, Jennifer G; Wilbanks, Mitchell S; Rawat, Arun; Indest, Karl J; Roe, Bruce A; Perkins, Edward J

    2014-07-12

    Corals represent symbiotic meta-organisms that require harmonization among the coral animal, photosynthetic zooxanthellae and associated microbes to survive environmental stresses. We investigated integrated-responses among coral and zooxanthellae in the scleractinian coral Acropora formosa in response to an emerging marine pollutant, the munitions constituent, 1,3,5-trinitro-1,3,5 triazine (RDX; 5 day exposures to 0 (control), 0.5, 0.9, 1.8, 3.7, and 7.2 mg/L, measured in seawater). RDX accumulated readily in coral soft tissues with bioconcentration factors ranging from 1.1 to 1.5. Next-generation sequencing of a normalized meta-transcriptomic library developed for the eukaryotic components of the A. formosa coral holobiont was leveraged to conduct microarray-based global transcript expression analysis of integrated coral/zooxanthellae responses to the RDX exposure. Total differentially expressed transcripts (DET) increased with increasing RDX exposure concentrations as did the proportion of zooxanthellae DET relative to the coral animal. Transcriptional responses in the coral demonstrated higher sensitivity to RDX compared to zooxanthellae where increased expression of gene transcripts coding xenobiotic detoxification mechanisms (i.e. cytochrome P450 and UDP glucuronosyltransferase 2 family) were initiated at the lowest exposure concentration. Increased expression of these detoxification mechanisms was sustained at higher RDX concentrations as well as production of a physical barrier to exposure through a 40% increase in mucocyte density at the maximum RDX exposure. At and above the 1.8 mg/L exposure concentration, DET coding for genes involved in central energy metabolism, including photosynthesis, glycolysis and electron-transport functions, were decreased in zooxanthellae although preliminary data indicated that zooxanthellae densities were not affected. In contrast, significantly increased transcript expression for genes involved in cellular energy production

  1. Metatranscriptome Sequencing Reveals Insights into the Gene Expression and Functional Potential of Rumen Wall Bacteria

    Evelyne Mann

    2018-01-01

    Full Text Available Microbiota of the rumen wall constitute an important niche of rumen microbial ecology and their composition has been elucidated in different ruminants during the last years. However, the knowledge about the function of rumen wall microbes is still limited. Rumen wall biopsies were taken from three fistulated dairy cows under a standard forage-based diet and after 4 weeks of high concentrate feeding inducing a subacute rumen acidosis (SARA. Extracted RNA was used for metatranscriptome sequencing using Illumina HiSeq sequencing technology. The gene expression of the rumen wall microbial community was analyzed by mapping 35 million sequences against the Kyoto Encyclopedia for Genes and Genomes (KEGG database and determining differentially expressed genes. A total of 1,607 functional features were assigned with high expression of genes involved in central metabolism, galactose, starch and sucrose metabolism. The glycogen phosphorylase (EC:2.4.1.1 which degrades (1->4-alpha-D-glucans was among the highest expressed genes being transcribed by 115 bacterial genera. Energy metabolism genes were also highly expressed, including the pyruvate orthophosphate dikinase (EC:2.7.9.1 involved in pyruvate metabolism, which was covered by 177 genera. Nitrogen metabolism genes, in particular glutamate dehydrogenase (EC:1.4.1.4, glutamine synthetase (EC:6.3.1.2 and glutamate synthase (EC:1.4.1.13, EC:1.4.1.14 were also found to be highly expressed and prove rumen wall microbiota to be actively involved in providing host-relevant metabolites for exchange across the rumen wall. In addition, we found all four urease subunits (EC:3.5.1.5 transcribed by members of the genera Flavobacterium, Corynebacterium, Helicobacter, Clostridium, and Bacillus, and the dissimilatory sulfate reductase (EC 1.8.99.5 dsrABC, which is responsible for the reduction of sulfite to sulfide. We also provide in situ evidence for cellulose and cellobiose degradation, a key step in fiber-rich feed

  2. Metatranscriptomics reveals the diversity of genes expressed by eukaryotes in forest soils.

    Coralie Damon

    Full Text Available Eukaryotic organisms play essential roles in the biology and fertility of soils. For example the micro and mesofauna contribute to the fragmentation and homogenization of plant organic matter, while its hydrolysis is primarily performed by the fungi. To get a global picture of the activities carried out by soil eukaryotes we sequenced 2×10,000 cDNAs synthesized from polyadenylated mRNA directly extracted from soils sampled in beech (Fagus sylvatica and spruce (Picea abies forests. Taxonomic affiliation of both cDNAs and 18S rRNA sequences showed a dominance of sequences from fungi (up to 60% and metazoans while protists represented less than 12% of the 18S rRNA sequences. Sixty percent of cDNA sequences from beech forest soil and 52% from spruce forest soil had no homologs in the GenBank/EMBL/DDJB protein database. A Gene Ontology term was attributed to 39% and 31.5% of the spruce and beech soil sequences respectively. Altogether 2076 sequences were putative homologs to different enzyme classes participating to 129 KEGG pathways among which several were implicated in the utilisation of soil nutrients such as nitrogen (ammonium, amino acids, oligopeptides, sugars, phosphates and sulfate. Specific annotation of plant cell wall degrading enzymes identified enzymes active on major polymers (cellulose, hemicelluloses, pectin, lignin and glycoside hydrolases represented 0.5% (beech soil-0.8% (spruce soil of the cDNAs. Other sequences coding enzymes active on organic matter (extracellular proteases, lipases, a phytase, P450 monooxygenases were identified, thus underlining the biotechnological potential of eukaryotic metatranscriptomes. The phylogenetic affiliation of 12 full-length carbohydrate active enzymes showed that most of them were distantly related to sequences from known fungi. For example, a putative GH45 endocellulase was closely associated to molluscan sequences, while a GH7 cellobiohydrolase was closest to crustacean sequences, thus

  3. Metagenome Fragment Classification Using -Mer Frequency Profiles

    Gail Rosen

    2008-01-01

    Full Text Available A vast amount of microbial sequencing data is being generated through large-scale projects in ecology, agriculture, and human health. Efficient high-throughput methods are needed to analyze the mass amounts of metagenomic data, all DNA present in an environmental sample. A major obstacle in metagenomics is the inability to obtain accuracy using technology that yields short reads. We construct the unique -mer frequency profiles of 635 microbial genomes publicly available as of February 2008. These profiles are used to train a naive Bayes classifier (NBC that can be used to identify the genome of any fragment. We show that our method is comparable to BLAST for small 25 bp fragments but does not have the ambiguity of BLAST's tied top scores. We demonstrate that this approach is scalable to identify any fragment from hundreds of genomes. It also performs quite well at the strain, species, and genera levels and achieves strain resolution despite classifying ubiquitous genomic fragments (gene and nongene regions. Cross-validation analysis demonstrates that species-accuracy achieves 90% for highly-represented species containing an average of 8 strains. We demonstrate that such a tool can be used on the Sargasso Sea dataset, and our analysis shows that NBC can be further enhanced.

  4. New Bacterial Phytase through Metagenomic Prospection

    Nathálya Farias

    2018-02-01

    Full Text Available Alkaline phytases from uncultured microorganisms, which hydrolyze phytate to less phosphorylated myo-inositols and inorganic phosphate, have great potential as additives in agricultural industry. The development of metagenomics has stemmed from the ineluctable evidence that as-yet-uncultured microorganisms represent the vast majority of organisms in most environments on earth. In this study, a gene encoding a phytase was cloned from red rice crop residues and castor bean cake using a metagenomics strategy. The amino acid identity between this gene and its closest published counterparts is lower than 60%. The phytase was named PhyRC001 and was biochemically characterized. This recombinant protein showed activity on sodium phytate, indicating that PhyRC001 is a hydrolase enzyme. The enzymatic activity was optimal at a pH of 7.0 and at a temperature of 35 °C. β-propeller phytases possess great potential as feed additives because they are the only type of phytase with high activity at neutral pH. Therefore, to explore and exploit the underlying mechanism for β-propeller phytase functions could be of great benefit to biotechnology.

  5. Comparative Metatranscriptomics of Wheat Rhizosphere Microbiomes in Disease Suppressive and Non-suppressive Soils for Rhizoctonia solani AG8

    Helen L. Hayden

    2018-05-01

    Full Text Available The soilborne fungus Rhizoctonia solani anastomosis group (AG 8 is a major pathogen of grain crops resulting in substantial production losses. In the absence of resistant cultivars of wheat or barley, a sustainable and enduring method for disease control may lie in the enhancement of biological disease suppression. Evidence of effective biological control of R. solani AG8 through disease suppression has been well documented at our study site in Avon, South Australia. A comparative metatranscriptomic approach was applied to assess the taxonomic and functional characteristics of the rhizosphere microbiome of wheat plants grown in adjacent fields which are suppressive and non-suppressive to the plant pathogen R. solani AG8. Analysis of 12 rhizosphere metatranscriptomes (six per field was undertaken using two bioinformatic approaches involving unassembled and assembled reads. Differential expression analysis showed the dominant taxa in the rhizosphere based on mRNA annotation were Arthrobacter spp. and Pseudomonas spp. for non-suppressive samples and Stenotrophomonas spp. and Buttiauxella spp. for the suppressive samples. The assembled metatranscriptome analysis identified more differentially expressed genes than the unassembled analysis in the comparison of suppressive and non-suppressive samples. Suppressive samples showed greater expression of a polyketide cyclase, a terpenoid biosynthesis backbone gene (dxs and many cold shock proteins (csp. Non-suppressive samples were characterised by greater expression of antibiotic genes such as non-heme chloroperoxidase (cpo which is involved in pyrrolnitrin synthesis, and phenazine biosynthesis family protein F (phzF and its transcriptional activator protein (phzR. A large number of genes involved in detoxifying reactive oxygen species (ROS and superoxide radicals (sod, cat, ahp, bcp, gpx1, trx were also expressed in the non-suppressive rhizosphere samples most likely in response to the infection of wheat

  6. Effective Analysis of NGS Metagenomic Data with Ultra-Fast Clustering Algorithms (MICW - Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Li, Weizhong

    2011-10-12

    San Diego Supercomputer Center's Weizhong Li on "Effective Analysis of NGS Metagenomic Data with Ultra-fast Clustering Algorithms" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  7. Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software.

    Sczyrba, Alexander; Hofmann, Peter; Belmann, Peter; Koslicki, David; Janssen, Stefan; Dröge, Johannes; Gregor, Ivan; Majda, Stephan; Fiedler, Jessika; Dahms, Eik; Bremges, Andreas; Fritz, Adrian; Garrido-Oter, Ruben; Jørgensen, Tue Sparholt; Shapiro, Nicole; Blood, Philip D; Gurevich, Alexey; Bai, Yang; Turaev, Dmitrij; DeMaere, Matthew Z; Chikhi, Rayan; Nagarajan, Niranjan; Quince, Christopher; Meyer, Fernando; Balvočiūtė, Monika; Hansen, Lars Hestbjerg; Sørensen, Søren J; Chia, Burton K H; Denis, Bertrand; Froula, Jeff L; Wang, Zhong; Egan, Robert; Don Kang, Dongwan; Cook, Jeffrey J; Deltel, Charles; Beckstette, Michael; Lemaitre, Claire; Peterlongo, Pierre; Rizk, Guillaume; Lavenier, Dominique; Wu, Yu-Wei; Singer, Steven W; Jain, Chirag; Strous, Marc; Klingenberg, Heiner; Meinicke, Peter; Barton, Michael D; Lingner, Thomas; Lin, Hsin-Hung; Liao, Yu-Chieh; Silva, Genivaldo Gueiros Z; Cuevas, Daniel A; Edwards, Robert A; Saha, Surya; Piro, Vitor C; Renard, Bernhard Y; Pop, Mihai; Klenk, Hans-Peter; Göker, Markus; Kyrpides, Nikos C; Woyke, Tanja; Vorholt, Julia A; Schulze-Lefert, Paul; Rubin, Edward M; Darling, Aaron E; Rattei, Thomas; McHardy, Alice C

    2017-11-01

    Methods for assembly, taxonomic profiling and binning are key to interpreting metagenome data, but a lack of consensus about benchmarking complicates performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on highly complex and realistic data sets, generated from ∼700 newly sequenced microorganisms and ∼600 novel viruses and plasmids and representing common experimental setups. Assembly and genome binning programs performed well for species represented by individual genomes but were substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below family level. Parameter settings markedly affected performance, underscoring their importance for program reproducibility. The CAMI results highlight current challenges but also provide a roadmap for software selection to answer specific research questions.

  8. Comparative Metagenomics of Freshwater Microbial Communities

    Hemme, Chris; Deng, Ye; Tu, Qichao; Fields, Matthew; Gentry, Terry; Wu, Liyou; Tringe, Susannah; Watson, David; He, Zhili; Hazen, Terry; Tiedje, James; Rubin, Eddy; Zhou, Jizhong

    2010-01-01

    Previous analyses of a microbial metagenome from uranium and nitric-acid contaminated groundwater (FW106) showed significant environmental effects resulting from the rapid introduction of multiple contaminants. Effects include a massive loss of species and strain biodiversity, accumulation of toxin resistant genes in the metagenome and lateral transfer of toxin resistance genes between community members. To better understand these results in an ecological context, a second metagenome from a pristine groundwater system located along the same geological strike was sequenced and analyzed (FW301). It is hypothesized that FW301 approximates the ancestral FW106 community based on phylogenetic profiles and common geological parameters; however, even if is not the case, the datasets still permit comparisons between healthy and stressed groundwater ecosystems. Complex carbohydrate metabolism has been almost entirely lost in the stressed ecosystem. In contrast, the pristine system encodes a wide diversity of complex carbohydrate metabolism systems, suggesting that carbon turnover is very rapid and less leaky in the healthy groundwater system. FW301 encodes many (∼160+) carbon monoxide dehydrogenase genes while FW106 encodes none. This result suggests that the community is frequently exposed to oxygen from aerated rainwater percolating into the subsurface, with a resulting high rate of carbon metabolism and CO production. When oxygen levels fall, the CO then serves as a major carbon source for the community. FW301 appears to be capable of CO2 fixation via the reductive carboxylase (reverse TCA) cycle and possibly acetogenesis, activities; these activities are lacking in the heterotrophic FW106 system which relies exclusively on respiration of nitrate and/or oxygen for energy production. FW301 encodes a complete set of B12 biosynthesis pathway at high abundance suggesting the use of sodium gradients for energy production in the healthy groundwater community. Overall

  9. Single Cell and Metagenomic Assemblies: Biology Drives Technical Choices and Goals (Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Stepanauskas, Ramunas

    2011-10-13

    DOE JGI's Tanja Woyke, chair of the Single Cells and Metagenomes session, delivers an introduction, followed by Bigelow Laboratory's Ramunas Stepanauskas on "Single Cell and Metagenomic Assemblies: Biology Drives Technical Choices and Goals" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  10. Unraveling Core Functional Microbiota in Traditional Solid-State Fermentation by High-Throughput Amplicons and Metatranscriptomics Sequencing.

    Song, Zhewei; Du, Hai; Zhang, Yan; Xu, Yan

    2017-01-01

    Fermentation microbiota is specific microorganisms that generate different types of metabolites in many productions. In traditional solid-state fermentation, the structural composition and functional capacity of the core microbiota determine the quality and quantity of products. As a typical example of food fermentation, Chinese Maotai-flavor liquor production involves a complex of various microorganisms and a wide variety of metabolites. However, the microbial succession and functional shift of the core microbiota in this traditional food fermentation remain unclear. Here, high-throughput amplicons (16S rRNA gene amplicon sequencing and internal transcribed space amplicon sequencing) and metatranscriptomics sequencing technologies were combined to reveal the structure and function of the core microbiota in Chinese soy sauce aroma type liquor production. In addition, ultra-performance liquid chromatography and headspace-solid phase microextraction-gas chromatography-mass spectrometry were employed to provide qualitative and quantitative analysis of the major flavor metabolites. A total of 10 fungal and 11 bacterial genera were identified as the core microbiota. In addition, metatranscriptomic analysis revealed pyruvate metabolism in yeasts (genera Pichia, Schizosaccharomyces, Saccharomyces , and Zygosaccharomyces ) and lactic acid bacteria (genus Lactobacillus ) classified into two stages in the production of flavor components. Stage I involved high-level alcohol (ethanol) production, with the genus Schizosaccharomyces serving as the core functional microorganism. Stage II involved high-level acid (lactic acid and acetic acid) production, with the genus Lactobacillus serving as the core functional microorganism. The functional shift from the genus Schizosaccharomyces to the genus Lactobacillus drives flavor component conversion from alcohol (ethanol) to acid (lactic acid and acetic acid) in Chinese Maotai-flavor liquor production. Our findings provide insight into

  11. Unraveling Core Functional Microbiota in Traditional Solid-State Fermentation by High-Throughput Amplicons and Metatranscriptomics Sequencing

    Zhewei Song

    2017-07-01

    Full Text Available Fermentation microbiota is specific microorganisms that generate different types of metabolites in many productions. In traditional solid-state fermentation, the structural composition and functional capacity of the core microbiota determine the quality and quantity of products. As a typical example of food fermentation, Chinese Maotai-flavor liquor production involves a complex of various microorganisms and a wide variety of metabolites. However, the microbial succession and functional shift of the core microbiota in this traditional food fermentation remain unclear. Here, high-throughput amplicons (16S rRNA gene amplicon sequencing and internal transcribed space amplicon sequencing and metatranscriptomics sequencing technologies were combined to reveal the structure and function of the core microbiota in Chinese soy sauce aroma type liquor production. In addition, ultra-performance liquid chromatography and headspace-solid phase microextraction-gas chromatography-mass spectrometry were employed to provide qualitative and quantitative analysis of the major flavor metabolites. A total of 10 fungal and 11 bacterial genera were identified as the core microbiota. In addition, metatranscriptomic analysis revealed pyruvate metabolism in yeasts (genera Pichia, Schizosaccharomyces, Saccharomyces, and Zygosaccharomyces and lactic acid bacteria (genus Lactobacillus classified into two stages in the production of flavor components. Stage I involved high-level alcohol (ethanol production, with the genus Schizosaccharomyces serving as the core functional microorganism. Stage II involved high-level acid (lactic acid and acetic acid production, with the genus Lactobacillus serving as the core functional microorganism. The functional shift from the genus Schizosaccharomyces to the genus Lactobacillus drives flavor component conversion from alcohol (ethanol to acid (lactic acid and acetic acid in Chinese Maotai-flavor liquor production. Our findings provide

  12. Analysis and comparison of very large metagenomes with fast clustering and functional annotation

    Li Weizhong

    2009-10-01

    Full Text Available Abstract Background The remarkable advance of metagenomics presents significant new challenges in data analysis. Metagenomic datasets (metagenomes are large collections of sequencing reads from anonymous species within particular environments. Computational analyses for very large metagenomes are extremely time-consuming, and there are often many novel sequences in these metagenomes that are not fully utilized. The number of available metagenomes is rapidly increasing, so fast and efficient metagenome comparison methods are in great demand. Results The new metagenomic data analysis method Rapid Analysis of Multiple Metagenomes with a Clustering and Annotation Pipeline (RAMMCAP was developed using an ultra-fast sequence clustering algorithm, fast protein family annotation tools, and a novel statistical metagenome comparison method that employs a unique graphic interface. RAMMCAP processes extremely large datasets with only moderate computational effort. It identifies raw read clusters and protein clusters that may include novel gene families, and compares metagenomes using clusters or functional annotations calculated by RAMMCAP. In this study, RAMMCAP was applied to the two largest available metagenomic collections, the "Global Ocean Sampling" and the "Metagenomic Profiling of Nine Biomes". Conclusion RAMMCAP is a very fast method that can cluster and annotate one million metagenomic reads in only hundreds of CPU hours. It is available from http://tools.camera.calit2.net/camera/rammcap/.

  13. Metagenomic mining of feruloyl esterases from termite enteric flora

    Rashamuse, K

    2014-01-01

    Full Text Available A metagenome expression library was created from Trinervitermes trinervoides termite hindgut symbionts and subsequently screened for feruloyl esterase (FAE) activities, resulting in seven recombinant fosmids conferring feruloyl esterase phenotypes...

  14. Towards diagnostic metagenomics of Campylobacter in fecal samples

    Andersen, Sandra Christine; Kiil, Kristoffer; Harder, Christoffer Bugge

    2017-01-01

    The development of diagnostic metagenomics is driven by the need for universal, culture-independent methods for detection and characterization of pathogens to substitute the time-consuming, organism-specific, and often culture-based laboratory procedures for epidemiological source-tracing. Some...... of the challenges in diagnostic metagenomics are, that it requires a great next-generation sequencing depth and unautomated data analysis. DNA from human fecal samples spiked with 7.75 × 101-7.75 × 107 colony forming unit (CFU)/ml Campylobacter jejuni and chicken fecal samples spiked with 1 × 102-1 × 106 CFU...... Campylobacter in all the clinical samples. Sensitivity in diagnostic metagenomics is improving and has reached a clinically relevant level. There are still challenges to overcome before real-time diagnostic metagenomics can replace quantitative polymerase chain reaction (qPCR) or culture-based surveillance...

  15. Oral Metagenomic Biomarkers in Rheumatoid Arthritis

    2017-09-01

    individuals with rheumatoid arthritis (RA). The goal is to test the  hypothesis that oral microbiome and metagenomic analyses will allow  us  to identify new...biomarkers  that are  useful  for the diagnosis of early RA and/or biomarkers that help to predict the efficacy of  specific therapeutic interventions... RNA  microbiome analysis as well as whole genome shotgun sequencing.  Upon completion of these aims, any identified bacterial biomarkers may be

  16. FY11 Report on Metagenome Analysis using Pathogen Marker Libraries

    Gardner, Shea N. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Allen, Jonathan E. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); McLoughlin, Kevin S. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Slezak, Tom [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

    2011-06-02

    A method, sequence library, and software suite was invented to rapidly assess whether any member of a pre-specified list of threat organisms or their near neighbors is present in a metagenome. The system was designed to handle mega- to giga-bases of FASTA-formatted raw sequence reads from short or long read next generation sequencing platforms. The approach is to pre-calculate a viral and a bacterial "Pathogen Marker Library" (PML) containing sub-sequences specific to pathogens or their near neighbors. A list of expected matches comparing every bacterial or viral genome against the PML sequences is also pre-calculated. To analyze a metagenome, reads are compared to the PML, and observed PML-metagenome matches are compared to the expected PML-genome matches, and the ratio of observed relative to expected matches is reported. In other words, a 3-way comparison among the PML, metagenome, and existing genome sequences is used to quickly assess which (if any) species included in the PML is likely to be present in the metagenome, based on available sequence data. Our tests showed that the species with the most PML matches correctly indicated the organism sequenced for empirical metagenomes consisting of a cultured, relatively pure isolate. These runs completed in 1 minute to 3 hours on 12 CPU (1 thread/CPU), depending on the metagenome and PML. Using more threads on the same number of CPU resulted in speed improvements roughly proportional to the number of threads. Simulations indicated that detection sensitivity depends on both sequencing coverage levels for a species and the size of the PML: species were correctly detected even at ~0.003x coverage by the large PMLs, and at ~0.03x coverage by the smaller PMLs. Matches to true positive species were 3-4 orders of magnitude higher than to false positives. Simulations with short reads (36 nt and ~260 nt) showed that species were usually detected for metagenome coverage above 0.005x and coverage in the PML above 0.05x, and

  17. Expanding the marine virosphere using metagenomics.

    Carolina Megumi Mizuno

    Full Text Available Viruses infecting prokaryotic cells (phages are the most abundant entities of the biosphere and contain a largely uncharted wealth of genomic diversity. They play a critical role in the biology of their hosts and in ecosystem functioning at large. The classical approaches studying phages require isolation from a pure culture of the host. Direct sequencing approaches have been hampered by the small amounts of phage DNA present in most natural habitats and the difficulty in applying meta-omic approaches, such as annotation of small reads and assembly. Serendipitously, it has been discovered that cellular metagenomes of highly productive ocean waters (the deep chlorophyll maximum contain significant amounts of viral DNA derived from cells undergoing the lytic cycle. We have taken advantage of this phenomenon to retrieve metagenomic fosmids containing viral DNA from a Mediterranean deep chlorophyll maximum sample. This method allowed description of complete genomes of 208 new marine phages. The diversity of these genomes was remarkable, contributing 21 genomic groups of tailed bacteriophages of which 10 are completely new. Sequence based methods have allowed host assignment to many of them. These predicted hosts represent a wide variety of important marine prokaryotic microbes like members of SAR11 and SAR116 clades, Cyanobacteria and also the newly described low GC Actinobacteria. A metavirome constructed from the same habitat showed that many of the new phage genomes were abundantly represented. Furthermore, other available metaviromes also indicated that some of the new phages are globally distributed in low to medium latitude ocean waters. The availability of many genomes from the same sample allows a direct approach to viral population genomics confirming the remarkable mosaicism of phage genomes.

  18. Metagenomic Sequencing of an In Vitro-Simulated Microbial Community

    Morgan, Jenna L.; Darling, Aaron E.; Eisen, Jonathan A.

    2009-12-01

    Background: Microbial life dominates the earth, but many species are difficult or even impossible to study under laboratory conditions. Sequencing DNA directly from the environment, a technique commonly referred to as metagenomics, is an important tool for cataloging microbial life. This culture-independent approach involves collecting samples that include microbes in them, extracting DNA from the samples, and sequencing the DNA. A sample may contain many different microorganisms, macroorganisms, and even free-floating environmental DNA. A fundamental challenge in metagenomics has been estimating the abundance of organisms in a sample based on the frequency with which the organism's DNA was observed in reads generated via DNA sequencing. Methodology/Principal Findings: We created mixtures of ten microbial species for which genome sequences are known. Each mixture contained an equal number of cells of each species. We then extracted DNA from the mixtures, sequenced the DNA, and measured the frequency with which genomic regions from each organism was observed in the sequenced DNA. We found that the observed frequency of reads mapping to each organism did not reflect the equal numbers of cells that were known to be included in each mixture. The relative organism abundances varied significantly depending on the DNA extraction and sequencing protocol utilized. Conclusions/Significance: We describe a new data resource for measuring the accuracy of metagenomic binning methods, created by in vitro-simulation of a metagenomic community. Our in vitro simulation can be used to complement previous in silico benchmark studies. In constructing a synthetic community and sequencing its metagenome, we encountered several sources of observation bias that likely affect most metagenomic experiments to date and present challenges for comparative metagenomic studies. DNA preparation methods have a particularly profound effect in our study, implying that samples prepared with

  19. Exploration of Metagenome Assemblies with an Interactive Visualization Tool

    Cantor, Michael; Nordberg, Henrik; Smirnova, Tatyana; Andersen, Evan; Tringe, Susannah; Hess, Matthias; Dubchak, Inna

    2014-07-09

    Metagenomics, one of the fastest growing areas of modern genomic science, is the genetic profiling of the entire community of microbial organisms present in an environmental sample. Elviz is a web-based tool for the interactive exploration of metagenome assemblies. Elviz can be used with publicly available data sets from the Joint Genome Institute or with custom user-loaded assemblies. Elviz is available at genome.jgi.doe.gov/viz

  20. Multiple comparative metagenomics using multiset k-mer counting

    Gaëtan Benoit

    2016-11-01

    Full Text Available Background Large scale metagenomic projects aim to extract biodiversity knowledge between different environmental conditions. Current methods for comparing microbial communities face important limitations. Those based on taxonomical or functional assignation rely on a small subset of the sequences that can be associated to known organisms. On the other hand, de novo methods, that compare the whole sets of sequences, either do not scale up on ambitious metagenomic projects or do not provide precise and exhaustive results. Methods These limitations motivated the development of a new de novo metagenomic comparative method, called Simka. This method computes a large collection of standard ecological distances by replacing species counts by k-mer counts. Simka scales-up today’s metagenomic projects thanks to a new parallel k-mer counting strategy on multiple datasets. Results Experiments on public Human Microbiome Project datasets demonstrate that Simka captures the essential underlying biological structure. Simka was able to compute in a few hours both qualitative and quantitative ecological distances on hundreds of metagenomic samples (690 samples, 32 billions of reads. We also demonstrate that analyzing metagenomes at the k-mer level is highly correlated with extremely precise de novo comparison techniques which rely on all-versus-all sequences alignment strategy or which are based on taxonomic profiling.

  1. Evaluation of ddRADseq for reduced representation metagenome sequencing

    Michael Y. Liu

    2017-09-01

    Full Text Available Background Profiling of microbial communities via metagenomic shotgun sequencing has enabled researches to gain unprecedented insight into microbial community structure and the functional roles of community members. This study describes a method and basic analysis for a metagenomic adaptation of the double digest restriction site associated DNA sequencing (ddRADseq protocol for reduced representation metagenome profiling. Methods This technique takes advantage of the sequence specificity of restriction endonucleases to construct an Illumina-compatible sequencing library containing DNA fragments that are between a pair of restriction sites located within close proximity. This results in a reduced sequencing library with coverage breadth that can be tuned by size selection. We assessed the performance of the metagenomic ddRADseq approach by applying the full method to human stool samples and generating sequence data. Results The ddRADseq data yields a similar estimate of community taxonomic profile as obtained from shotgun metagenome sequencing of the same human stool samples. No obvious bias with respect to genomic G + C content and the estimated relative species abundance was detected. Discussion Although ddRADseq does introduce some bias in taxonomic representation, the bias is likely to be small relative to DNA extraction bias. ddRADseq appears feasible and could have value as a tool for metagenome-wide association studies.

  2. A Bioinformatician's Guide to Metagenomics

    Kunin, Victor; Copeland, Alex; Lapidus, Alla; Mavromatis, Konstantinos; Hugenholtz, Philip

    2008-08-01

    As random shotgun metagenomic projects proliferate and become the dominant source of publicly available sequence data, procedures for best practices in their execution and analysis become increasingly important. Based on our experience at the Joint Genome Institute, we describe step-by-step the chain of decisions accompanying a metagenomic project from the viewpoint of a bioinformatician. We guide the reader through a standard workflow for a metagenomic project beginning with pre-sequencing considerations such as community composition and sequence data type that will greatly influence downstream analyses. We proceed with recommendations for sampling and data generation including sample and metadata collection, community profiling, construction of shotgun libraries and sequencing strategies. We then discuss the application of generic sequence processing steps (read preprocessing, assembly, and gene prediction and annotation) to metagenomic datasets by contrast to genome projects. Different types of data analyses particular to metagenomes are then presented including binning, dominant population analysis and gene-centric analysis. Finally data management systems and issues are presented and discussed. We hope that this review will assist bioinformaticians and biologists in making better-informed decisions on their journey during a metagenomic project.

  3. Introduction to Metagenomics at DOE JGI (Opening Remarks for the Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Kyrpides, Nikos [DOE JGI

    2011-10-12

    After a quick introduction by DOE JGI Director Eddy Rubin, DOE JGI's Nikos Kyrpides delivers the opening remarks at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011

  4. Metagenomic analysis of phosphorus removing sludgecommunities

    Garcia Martin, Hector; Ivanova, Natalia; Kunin, Victor; Warnecke,Falk; Barry, Kerrie; McHardy, Alice C.; Yeates, Christine; He, Shaomei; Salamov, Asaf; Szeto, Ernest; Dalin, Eileen; Putnam, Nik; Shapiro, HarrisJ.; Pangilinan, Jasmyn L.; Rigoutsos, Isidore; Kyrpides, Nikos C.; Blackall, Linda Louise; McMahon, Katherine D.; Hugenholtz, Philip

    2006-02-01

    Enhanced Biological Phosphorus Removal (EBPR) is not wellunderstood at the metabolic level despite being one of the best-studiedmicrobially-mediated industrial processes due to its ecological andeconomic relevance. Here we present a metagenomic analysis of twolab-scale EBPR sludges dominated by the uncultured bacterium, "CandidatusAccumulibacter phosphatis." This analysis resolves several controversiesin EBPR metabolic models and provides hypotheses explaining the dominanceof A. phosphatis in this habitat, its lifestyle outside EBPR and probablecultivation requirements. Comparison of the same species from differentEBPR sludges highlights recent evolutionary dynamics in the A. phosphatisgenome that could be linked to mechanisms for environmental adaptation.In spite of an apparent lack of phylogenetic overlap in the flankingcommunities of the two sludges studied, common functional themes werefound, at least one of them complementary to the inferred metabolism ofthe dominant organism. The present study provides a much-needed blueprintfor a systems-level understanding of EBPR and illustrates thatmetagenomics enables detailed, often novel, insights into evenwell-studied biological systems.

  5. OTU analysis using metagenomic shotgun sequencing data.

    Xiaolin Hao

    Full Text Available Because of technological limitations, the primer and amplification biases in targeted sequencing of 16S rRNA genes have veiled the true microbial diversity underlying environmental samples. However, the protocol of metagenomic shotgun sequencing provides 16S rRNA gene fragment data with natural immunity against the biases raised during priming and thus the potential of uncovering the true structure of microbial community by giving more accurate predictions of operational taxonomic units (OTUs. Nonetheless, the lack of statistically rigorous comparison between 16S rRNA gene fragments and other data types makes it difficult to interpret previously reported results using 16S rRNA gene fragments. Therefore, in the present work, we established a standard analysis pipeline that would help confirm if the differences in the data are true or are just due to potential technical bias. This pipeline is built by using simulated data to find optimal mapping and OTU prediction methods. The comparison between simulated datasets revealed a relationship between 16S rRNA gene fragments and full-length 16S rRNA sequences that a 16S rRNA gene fragment having a length >150 bp provides the same accuracy as a full-length 16S rRNA sequence using our proposed pipeline, which could serve as a good starting point for experimental design and making the comparison between 16S rRNA gene fragment-based and targeted 16S rRNA sequencing-based surveys possible.

  6. Denoising PCR-amplified metagenome data

    Rosen Michael J

    2012-10-01

    Full Text Available Abstract Background PCR amplification and high-throughput sequencing theoretically enable the characterization of the finest-scale diversity in natural microbial and viral populations, but each of these methods introduces random errors that are difficult to distinguish from genuine biological diversity. Several approaches have been proposed to denoise these data but lack either speed or accuracy. Results We introduce a new denoising algorithm that we call DADA (Divisive Amplicon Denoising Algorithm. Without training data, DADA infers both the sample genotypes and error parameters that produced a metagenome data set. We demonstrate performance on control data sequenced on Roche’s 454 platform, and compare the results to the most accurate denoising software currently available, AmpliconNoise. Conclusions DADA is more accurate and over an order of magnitude faster than AmpliconNoise. It eliminates the need for training data to establish error parameters, fully utilizes sequence-abundance information, and enables inclusion of context-dependent PCR error rates. It should be readily extensible to other sequencing platforms such as Illumina.

  7. Unsupervised Two-Way Clustering of Metagenomic Sequences

    Shruthi Prabhakara

    2012-01-01

    Full Text Available A major challenge facing metagenomics is the development of tools for the characterization of functional and taxonomic content of vast amounts of short metagenome reads. The efficacy of clustering methods depends on the number of reads in the dataset, the read length and relative abundances of source genomes in the microbial community. In this paper, we formulate an unsupervised naive Bayes multispecies, multidimensional mixture model for reads from a metagenome. We use the proposed model to cluster metagenomic reads by their species of origin and to characterize the abundance of each species. We model the distribution of word counts along a genome as a Gaussian for shorter, frequent words and as a Poisson for longer words that are rare. We employ either a mixture of Gaussians or mixture of Poissons to model reads within each bin. Further, we handle the high-dimensionality and sparsity associated with the data, by grouping the set of words comprising the reads, resulting in a two-way mixture model. Finally, we demonstrate the accuracy and applicability of this method on simulated and real metagenomes. Our method can accurately cluster reads as short as 100 bps and is robust to varying abundances, divergences and read lengths.

  8. MetaStorm: A Public Resource for Customizable Metagenomics Annotation.

    Arango-Argoty, Gustavo; Singh, Gargi; Heath, Lenwood S; Pruden, Amy; Xiao, Weidong; Zhang, Liqing

    2016-01-01

    Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/), which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution.

  9. MetaStorm: A Public Resource for Customizable Metagenomics Annotation.

    Gustavo Arango-Argoty

    Full Text Available Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/, which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution.

  10. Metagenomic analysis of permafrost microbial community response to thaw

    Mackelprang, R.; Waldrop, M.P.; DeAngelis, K.M.; David, M.M.; Chavarria, K.L.; Blazewicz, S.J.; Rubin, E.M.; Jansson, J.K.

    2011-07-01

    We employed deep metagenomic sequencing to determine the impact of thaw on microbial phylogenetic and functional genes and related this data to measurements of methane emissions. Metagenomics, the direct sequencing of DNA from the environment, allows for the examination of whole biochemical pathways and associated processes, as opposed to individual pieces of the metabolic puzzle. Our metagenome analyses revealed that during transition from a frozen to a thawed state there were rapid shifts in many microbial, phylogenetic and functional gene abundances and pathways. After one week of incubation at 5°C, permafrost metagenomes converged to be more similar to each other than while they were frozen. We found that multiple genes involved in cycling of C and nitrogen shifted rapidly during thaw. We also constructed the first draft genome from a complex soil metagenome, which corresponded to a novel methanogen. Methane previously accumulated in permafrost was released during thaw and subsequently consumed by methanotrophic bacteria. Together these data point towards the importance of rapid cycling of methane and nitrogen in thawing permafrost.

  11. MetaStorm: A Public Resource for Customizable Metagenomics Annotation

    Arango-Argoty, Gustavo; Singh, Gargi; Heath, Lenwood S.; Pruden, Amy; Xiao, Weidong; Zhang, Liqing

    2016-01-01

    Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/), which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution. PMID:27632579

  12. Meta-IDBA: a de Novo assembler for metagenomic data.

    Peng, Yu; Leung, Henry C M; Yiu, S M; Chin, Francis Y L

    2011-07-01

    Next-generation sequencing techniques allow us to generate reads from a microbial environment in order to analyze the microbial community. However, assembling of a set of mixed reads from different species to form contigs is a bottleneck of metagenomic research. Although there are many assemblers for assembling reads from a single genome, there are no assemblers for assembling reads in metagenomic data without reference genome sequences. Moreover, the performances of these assemblers on metagenomic data are far from satisfactory, because of the existence of common regions in the genomes of subspecies and species, which make the assembly problem much more complicated. We introduce the Meta-IDBA algorithm for assembling reads in metagenomic data, which contain multiple genomes from different species. There are two core steps in Meta-IDBA. It first tries to partition the de Bruijn graph into isolated components of different species based on an important observation. Then, for each component, it captures the slight variants of the genomes of subspecies from the same species by multiple alignments and represents the genome of one species, using a consensus sequence. Comparison of the performances of Meta-IDBA and existing assemblers, such as Velvet and Abyss for different metagenomic datasets shows that Meta-IDBA can reconstruct longer contigs with similar accuracy. Meta-IDBA toolkit is available at our website http://www.cs.hku.hk/~alse/metaidba. chin@cs.hku.hk.

  13. The potential of viral metagenomics in blood transfusion safety.

    Sauvage, V; Gomez, J; Boizeau, L; Laperche, S

    2017-09-01

    Thanks to the significant advent of high throughput sequencing in the last ten years, it is now possible via metagenomics to define the spectrum of the microbial sequences present in human blood samples. Therefore, metagenomics sequencing appears as a promising approach for the identification and global surveillance of new, emerging and/or unexpected viruses that could impair blood transfusion safety. However, despite considerable advantages compared to the traditional methods of pathogen identification, this non-targeted approach presents several drawbacks including a lack of sensitivity and sequence contaminant issues. With further improvements, especially to increase sensitivity, metagenomics sequencing should become in a near future an additional diagnostic tool in infectious disease field and especially in blood transfusion safety. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  14. Functional Metagenomic Investigations of the Human Intestinal Microbiota

    Moore, Aimee M.; Munck, Christian; Sommer, Morten Otto Alexander

    2011-01-01

    The human intestinal microbiota encode multiple critical functions impacting human health, including metabolism of dietary substrate, prevention of pathogen invasion, immune system modulation, and provision of a reservoir of antibiotic resistance genes accessible to pathogens. The complexity...... microorganisms, but relatively recently applied to the study of the human commensal microbiota. Metagenomic functional screens characterize the functional capacity of a microbial community, independent of identity to known genes, by subjecting the metagenome to functional assays in a genetically tractable host....... Here we highlight recent work applying this technique to study the functional diversity of the intestinal microbiota, and discuss how an approach combining high-throughput sequencing, cultivation, and metagenomic functional screens can improve our understanding of interactions between this complex...

  15. An integrated catalog of reference genes in the human gut microbiome

    Li, Junhua; Jia, Huijue; Cai, Xianghang

    2014-01-01

    Many analyses of the human gut microbiome depend on a catalog of reference genes. Existing catalogs for the human gut microbiome are based on samples from single cohorts or on reference genomes or protein sequences, which limits coverage of global microbiome diversity. Here we combined 249 newly...... signatures. This expanded catalog should facilitate quantitative characterization of metagenomic, metatranscriptomic and metaproteomic data from the gut microbiome to understand its variation across populations in human health and disease.......) comprising 9,879,896 genes. The catalog includes close-to-complete sets of genes for most gut microbes, which are also of considerably higher quality than in previous catalogs. Analyses of a group of samples from Chinese and Danish individuals using the catalog revealed country-specific gut microbial...

  16. Brain Meta-Transcriptomics from Harbor Seals to Infer the Role of the Microbiome and Virome in a Stranding Event.

    Rosales, Stephanie M; Thurber, Rebecca Vega

    2015-01-01

    Marine diseases are becoming more frequent, and tools for identifying pathogens and disease reservoirs are needed to help prevent and mitigate epizootics. Meta-transcriptomics provides insights into disease etiology by cataloguing and comparing sequences from suspected pathogens. This method is a powerful approach to simultaneously evaluate both the viral and bacterial communities, but few studies have applied this technique in marine systems. In 2009 seven harbor seals, Phoca vitulina, stranded along the California coast from a similar brain disease of unknown cause of death (UCD). We evaluated the differences between the virome and microbiome of UCDs and harbor seals with known causes of death. Here we determined that UCD stranded animals had no viruses in their brain tissue. However, in the bacterial community, we identified Burkholderia and Coxiella burnetii as important pathogens associated with this stranding event. Burkholderia were 100% prevalent and ~2.8 log2 fold more abundant in the UCD animals. Further, while C. burnetii was found in only 35.7% of all samples, it was highly abundant (~94% of the total microbial community) in a single individual. In this harbor seal, C. burnetii showed high transcription rates of invading and translation genes, implicating it in the pathogenesis of this animal. Based on these data we propose that Burkholderia taxa and C. burnetii are potentially important opportunistic neurotropic pathogens in UCD stranded harbor seals.

  17. Metatranscriptomics Reveals the Functions and Enzyme Profiles of the Microbial Community in Chinese Nong-Flavor Liquor Starter

    Yuhong Huang

    2017-09-01

    Full Text Available Chinese liquor is one of the world's best-known distilled spirits and is the largest spirit category by sales. The unique and traditional solid-state fermentation technology used to produce Chinese liquor has been in continuous use for several thousand years. The diverse and dynamic microbial community in a liquor starter is the main contributor to liquor brewing. However, little is known about the ecological distribution and functional importance of these community members. In this study, metatranscriptomics was used to comprehensively explore the active microbial community members and key transcripts with significant functions in the liquor starter production process. Fungi were found to be the most abundant and active community members. A total of 932 carbohydrate-active enzymes, including highly expressed auxiliary activity family 9 and 10 proteins, were identified at 62°C under aerobic conditions. Some potential thermostable enzymes were identified at 50, 62, and 25°C (mature stage. Increased content and overexpressed key enzymes involved in glycolysis and starch, pyruvate and ethanol metabolism were detected at 50 and 62°C. The key enzymes of the citrate cycle were up-regulated at 62°C, and their abundant derivatives are crucial for flavor generation. Here, the metabolism and functional enzymes of the active microbial communities in NF liquor starter were studied, which could pave the way to initiate improvements in liquor quality and to discover microbes that produce novel enzymes or high-value added products.

  18. Brain Meta-Transcriptomics from Harbor Seals to Infer the Role of the Microbiome and Virome in a Stranding Event.

    Stephanie M Rosales

    Full Text Available Marine diseases are becoming more frequent, and tools for identifying pathogens and disease reservoirs are needed to help prevent and mitigate epizootics. Meta-transcriptomics provides insights into disease etiology by cataloguing and comparing sequences from suspected pathogens. This method is a powerful approach to simultaneously evaluate both the viral and bacterial communities, but few studies have applied this technique in marine systems. In 2009 seven harbor seals, Phoca vitulina, stranded along the California coast from a similar brain disease of unknown cause of death (UCD. We evaluated the differences between the virome and microbiome of UCDs and harbor seals with known causes of death. Here we determined that UCD stranded animals had no viruses in their brain tissue. However, in the bacterial community, we identified Burkholderia and Coxiella burnetii as important pathogens associated with this stranding event. Burkholderia were 100% prevalent and ~2.8 log2 fold more abundant in the UCD animals. Further, while C. burnetii was found in only 35.7% of all samples, it was highly abundant (~94% of the total microbial community in a single individual. In this harbor seal, C. burnetii showed high transcription rates of invading and translation genes, implicating it in the pathogenesis of this animal. Based on these data we propose that Burkholderia taxa and C. burnetii are potentially important opportunistic neurotropic pathogens in UCD stranded harbor seals.

  19. An algorithm for detecting eukaryotic sequences in metagenomic ...

    species but also from accidental contamination from the genome of eukaryotic host cells. The latter scenario generally occurs in the case of host-associated metagenomes, e.g. microbes living in human gut. In such cases, one needs to identify and remove contaminating host DNA sequences, since the latter sequences will ...

  20. SPHINX--an algorithm for taxonomic binning of metagenomic sequences.

    Mohammed, Monzoorul Haque; Ghosh, Tarini Shankar; Singh, Nitin Kumar; Mande, Sharmila S

    2011-01-01

    Compared with composition-based binning algorithms, the binning accuracy and specificity of alignment-based binning algorithms is significantly higher. However, being alignment-based, the latter class of algorithms require enormous amount of time and computing resources for binning huge metagenomic datasets. The motivation was to develop a binning approach that can analyze metagenomic datasets as rapidly as composition-based approaches, but nevertheless has the accuracy and specificity of alignment-based algorithms. This article describes a hybrid binning approach (SPHINX) that achieves high binning efficiency by utilizing the principles of both 'composition'- and 'alignment'-based binning algorithms. Validation results with simulated sequence datasets indicate that SPHINX is able to analyze metagenomic sequences as rapidly as composition-based algorithms. Furthermore, the binning efficiency (in terms of accuracy and specificity of assignments) of SPHINX is observed to be comparable with results obtained using alignment-based algorithms. A web server for the SPHINX algorithm is available at http://metagenomics.atc.tcs.com/SPHINX/.

  1. Finding the needles in the meta-genome haystack

    Kowalchuk, G.A.; Speksnijder, A.G.C.L.; Zhang, K.; Goodman, R.M.; Veen, van J.A.

    2007-01-01

    In the collective genomes (the metagenome) of the microorganisms inhabiting the Earth's diverse environments is written the history of life on this planet. New molecular tools developed and used for the past 15 years by microbial ecologists are facilitating the extraction, cloning, screening, and

  2. The microbiome of Brazilian mangrove sediments as revealed by metagenomics

    Andreote, Fernando Dini; Jiménez Avella, Diego; Chaves, Diego; Dias, Armando Cavalcante Franco; Luvizotto, Danice Mazzer; Dini-Andreote, Francisco; Fasanella, Cristiane Cipola; Lopez, Maryeimy Varon; Baena, Sandra; Taketani, Rodrigo Gouvêa; de Melo, Itamar Soares

    2012-01-01

    Here we embark in a deep metagenomic survey that revealed the taxonomic and potential metabolic pathways aspects of mangrove sediment microbiology. The extraction of DNA from sediment samples and the direct application of pyrosequencing resulted in approximately 215 Mb of data from four distinct

  3. A probabilistic model to recover individual genomes from metagenomes

    J. Dröge (Johannes); A. Schönhuth (Alexander); A.C. McHardy (Alice)

    2017-01-01

    textabstractShotgun metagenomics of microbial communities reveal information about strains of relevance for applications in medicine, biotechnology and ecology. Recovering their genomes is a crucial but very challenging step due to the complexity of the underlying biological system and technical

  4. A feruloyl esterase derived from a leachate metagenome library

    Rashamuse, K

    2012-01-01

    Full Text Available A feruloyl esterase encoding gene (designated fae6), derived from a leachate metagenomic library, was cloned and the nucleotide sequence of the insert DNA determined. Translational analysis revealed that fae6 consists of a 515 amino acid polypeptide...

  5. Marine Metagenome as A Resource for Novel Enzymes

    Alma’abadi, Amani D.

    2015-11-10

    More than 99% of identified prokaryotes, including many from the marine environment, cannot be cultured in the laboratory. This lack of capability restricts our knowledge of microbial genetics and community ecology. Metagenomics, the culture-independent cloning of environmental DNAs that are isolated directly from an environmental sample, has already provided a wealth of information about the uncultured microbial world. It has also facilitated the discovery of novel biocatalysts by allowing researchers to probe directly into a huge diversity of enzymes within natural microbial communities. Recent advances in these studies have led to great interest in recruiting microbial enzymes for the development of environmentally-friendly industry. Although the metagenomics approach has many limitations, it is expected to provide not only scientific insights but also economic benefits, especially in industry. This review highlights the importance of metagenomics in mining microbial lipases, as an example, by using high-throughput techniques. In addition, we discuss challenges in the metagenomics as an important part of bioinformatics analysis in big data.

  6. Comprehensive benchmarking and ensemble approaches for metagenomic classifiers.

    McIntyre, Alexa B R; Ounit, Rachid; Afshinnekoo, Ebrahim; Prill, Robert J; Hénaff, Elizabeth; Alexander, Noah; Minot, Samuel S; Danko, David; Foox, Jonathan; Ahsanuddin, Sofia; Tighe, Scott; Hasan, Nur A; Subramanian, Poorani; Moffat, Kelly; Levy, Shawn; Lonardi, Stefano; Greenfield, Nick; Colwell, Rita R; Rosen, Gail L; Mason, Christopher E

    2017-09-21

    One of the main challenges in metagenomics is the identification of microorganisms in clinical and environmental samples. While an extensive and heterogeneous set of computational tools is available to classify microorganisms using whole-genome shotgun sequencing data, comprehensive comparisons of these methods are limited. In this study, we use the largest-to-date set of laboratory-generated and simulated controls across 846 species to evaluate the performance of 11 metagenomic classifiers. Tools were characterized on the basis of their ability to identify taxa at the genus, species, and strain levels, quantify relative abundances of taxa, and classify individual reads to the species level. Strikingly, the number of species identified by the 11 tools can differ by over three orders of magnitude on the same datasets. Various strategies can ameliorate taxonomic misclassification, including abundance filtering, ensemble approaches, and tool intersection. Nevertheless, these strategies were often insufficient to completely eliminate false positives from environmental samples, which are especially important where they concern medically relevant species. Overall, pairing tools with different classification strategies (k-mer, alignment, marker) can combine their respective advantages. This study provides positive and negative controls, titrated standards, and a guide for selecting tools for metagenomic analyses by comparing ranges of precision, accuracy, and recall. We show that proper experimental design and analysis parameters can reduce false positives, provide greater resolution of species in complex metagenomic samples, and improve the interpretation of results.

  7. Metaviz: interactive statistical and visual analysis of metagenomic data.

    Wagner, Justin; Chelaru, Florin; Kancherla, Jayaram; Paulson, Joseph N; Zhang, Alexander; Felix, Victor; Mahurkar, Anup; Elmqvist, Niklas; Corrada Bravo, Héctor

    2018-04-06

    Large studies profiling microbial communities and their association with healthy or disease phenotypes are now commonplace. Processed data from many of these studies are publicly available but significant effort is required for users to effectively organize, explore and integrate it, limiting the utility of these rich data resources. Effective integrative and interactive visual and statistical tools to analyze many metagenomic samples can greatly increase the value of these data for researchers. We present Metaviz, a tool for interactive exploratory data analysis of annotated microbiome taxonomic community profiles derived from marker gene or whole metagenome shotgun sequencing. Metaviz is uniquely designed to address the challenge of browsing the hierarchical structure of metagenomic data features while rendering visualizations of data values that are dynamically updated in response to user navigation. We use Metaviz to provide the UMD Metagenome Browser web service, allowing users to browse and explore data for more than 7000 microbiomes from published studies. Users can also deploy Metaviz as a web service, or use it to analyze data through the metavizr package to interoperate with state-of-the-art analysis tools available through Bioconductor. Metaviz is free and open source with the code, documentation and tutorials publicly accessible.

  8. Marine Metagenome as A Resource for Novel Enzymes

    Amani D. Alma’abadi

    2015-10-01

    Full Text Available More than 99% of identified prokaryotes, including many from the marine environment, cannot be cultured in the laboratory. This lack of capability restricts our knowledge of microbial genetics and community ecology. Metagenomics, the culture-independent cloning of environmental DNAs that are isolated directly from an environmental sample, has already provided a wealth of information about the uncultured microbial world. It has also facilitated the discovery of novel biocatalysts by allowing researchers to probe directly into a huge diversity of enzymes within natural microbial communities. Recent advances in these studies have led to a great interest in recruiting microbial enzymes for the development of environmentally-friendly industry. Although the metagenomics approach has many limitations, it is expected to provide not only scientific insights but also economic benefits, especially in industry. This review highlights the importance of metagenomics in mining microbial lipases, as an example, by using high-throughput techniques. In addition, we discuss challenges in the metagenomics as an important part of bioinformatics analysis in big data.

  9. A human gut microbial gene catalogue established by metagenomic sequencing

    dos Santos, Marcelo Bertalan Quintanilha; Sicheritz-Pontén, Thomas; Nielsen, Henrik Bjørn

    2010-01-01

    To understand the impact of gut microbes on human health and well-being it is crucial to assess their genetic potential. Here we describe the Illumina-based metagenomic sequencing, assembly and characterization of 3.3 million non-redundant microbial genes, derived from 576.7 gigabases of sequence...

  10. Functional Metagenomic Investigations of the Human Intestinal Microbiota

    Aimee Marguerite Moore

    2011-10-01

    Full Text Available The human intestinal microbiota encode multiple critical functions impacting human health, including, metabolism of dietary substrate, prevention of pathogen invasion, immune system modulation, and provision of a reservoir of antibiotic resistance genes accessible to pathogens. The complexity of this microbial community, its recalcitrance to standard cultivation and the immense diversity of its encoded genes has necessitated the development of novel molecular, microbiological, and genomic tools. Functional metagenomics is one such culture-independent technique used for decades to study environmental microorganisms but relatively recently applied to the study of the human commensal microbiota. Metagenomic functional screens characterize the functional capacity of a microbial community independent of identity to known genes by subjecting the metagenome to functional assays in a genetically tractable host. Here we highlight recent work applying this technique to study the functional diversity of the intestinal microbiota, and discuss how an approach combining high-throughput sequencing, cultivation, and metagenomic functional screens can improve our understanding of interactions between this complex community and its human host.

  11. Comparative analysis of metagenomes of Italian top soil improvers

    Gigliucci, Federica; Brambilla, Gianfranco; Tozzoli, Rosangela; Michelacci, Valeria; Morabito, Stefano

    2017-01-01

    Biosolids originating from Municipal Waste Water Treatment Plants are proposed as top soil improvers (TSI) for their beneficial input of organic carbon on agriculture lands. Their use to amend soil is controversial, as it may lead to the presence of emerging hazards of anthropogenic or animal origin in the environment devoted to food production. In this study, we used a shotgun metagenomics sequencing as a tool to perform a characterization of the hazards related with the TSIs. The samples showed the presence of many virulence genes associated to different diarrheagenic E. coli pathotypes as well as of different antimicrobial resistance-associated genes. The genes conferring resistance to Fluoroquinolones was the most relevant class of antimicrobial resistance genes observed in all the samples tested. To a lesser extent traits associated with the resistance to Methicillin in Staphylococci and genes conferring resistance to Streptothricin, Fosfomycin and Vancomycin were also identified. The most represented metal resistance genes were cobalt-zinc-cadmium related, accounting for 15–50% of the sequence reads in the different metagenomes out of the total number of those mapping on the class of resistance to compounds determinants. Moreover the taxonomic analysis performed by comparing compost-based samples and biosolids derived from municipal sewage-sludges treatments divided the samples into separate populations, based on the microbiota composition. The results confirm that the metagenomics is efficient to detect genomic traits associated with pathogens and antimicrobial resistance in complex matrices and this approach can be efficiently used for the traceability of TSI samples using the microorganisms’ profiles as indicators of their origin. - Highlights: • Sludge- and green- based biosolids analysed by metagenomics. • Biosolids may introduce microbial hazards in the food chain. • Metagenomics enables tracking biosolids’ sources.

  12. Metagenomic frameworks for monitoring antibiotic resistance in aquatic environments.

    Port, Jesse A; Cullen, Alison C; Wallace, James C; Smith, Marissa N; Faustman, Elaine M

    2014-03-01

    High-throughput genomic technologies offer new approaches for environmental health monitoring, including metagenomic surveillance of antibiotic resistance determinants (ARDs). Although natural environments serve as reservoirs for antibiotic resistance genes that can be transferred to pathogenic and human commensal bacteria, monitoring of these determinants has been infrequent and incomplete. Furthermore, surveillance efforts have not been integrated into public health decision making. We used a metagenomic epidemiology-based approach to develop an ARD index that quantifies antibiotic resistance potential, and we analyzed this index for common modal patterns across environmental samples. We also explored how metagenomic data such as this index could be conceptually framed within an early risk management context. We analyzed 25 published data sets from shotgun pyrosequencing projects. The samples consisted of microbial community DNA collected from marine and freshwater environments across a gradient of human impact. We used principal component analysis to identify index patterns across samples. We observed significant differences in the overall index and index subcategory levels when comparing ecosystems more proximal versus distal to human impact. The selection of different sequence similarity thresholds strongly influenced the index measurements. Unique index subcategory modes distinguished the different metagenomes. Broad-scale screening of ARD potential using this index revealed utility for framing environmental health monitoring and surveillance. This approach holds promise as a screening tool for establishing baseline ARD levels that can be used to inform and prioritize decision making regarding management of ARD sources and human exposure routes. Port JA, Cullen AC, Wallace JC, Smith MN, Faustman EM. 2014. Metagenomic frameworks for monitoring antibiotic resistance in aquatic environments. Environ Health Perspect 122:222–228; http://dx.doi.org/10.1289/ehp

  13. Functional metagenomics to decipher food-microbe-host crosstalk.

    Larraufie, Pierre; de Wouters, Tomas; Potocki-Veronese, Gabrielle; Blottière, Hervé M; Doré, Joël

    2015-02-01

    The recent developments of metagenomics permit an extremely high-resolution molecular scan of the intestinal microbiota giving new insights and opening perspectives for clinical applications. Beyond the unprecedented vision of the intestinal microbiota given by large-scale quantitative metagenomics studies, such as the EU MetaHIT project, functional metagenomics tools allow the exploration of fine interactions between food constituents, microbiota and host, leading to the identification of signals and intimate mechanisms of crosstalk, especially between bacteria and human cells. Cloning of large genome fragments, either from complex intestinal communities or from selected bacteria, allows the screening of these biological resources for bioactivity towards complex plant polymers or functional food such as prebiotics. This permitted identification of novel carbohydrate-active enzyme families involved in dietary fibre and host glycan breakdown, and highlighted unsuspected bacterial players at the top of the intestinal microbial food chain. Similarly, exposure of fractions from genomic and metagenomic clones onto human cells engineered with reporter systems to track modulation of immune response, cell proliferation or cell metabolism has allowed the identification of bioactive clones modulating key cell signalling pathways or the induction of specific genes. This opens the possibility to decipher mechanisms by which commensal bacteria or candidate probiotics can modulate the activity of cells in the intestinal epithelium or even in distal organs such as the liver, adipose tissue or the brain. Hence, in spite of our inability to culture many of the dominant microbes of the human intestine, functional metagenomics open a new window for the exploration of food-microbe-host crosstalk.

  14. Comparative analysis of metagenomes of Italian top soil improvers

    Gigliucci, Federica, E-mail: Federica.gigliucci@libero.it [Department of Veterinary Public Health and Food Safety, Istituto Superiore di Sanità, Viale Regina Elena, 299 00161 Rome (Italy); Department of Sciences, University Roma,Tre, Viale Marconi, 446, 00146 Rome (Italy); Brambilla, Gianfranco; Tozzoli, Rosangela; Michelacci, Valeria; Morabito, Stefano [Department of Veterinary Public Health and Food Safety, Istituto Superiore di Sanità, Viale Regina Elena, 299 00161 Rome (Italy)

    2017-05-15

    Biosolids originating from Municipal Waste Water Treatment Plants are proposed as top soil improvers (TSI) for their beneficial input of organic carbon on agriculture lands. Their use to amend soil is controversial, as it may lead to the presence of emerging hazards of anthropogenic or animal origin in the environment devoted to food production. In this study, we used a shotgun metagenomics sequencing as a tool to perform a characterization of the hazards related with the TSIs. The samples showed the presence of many virulence genes associated to different diarrheagenic E. coli pathotypes as well as of different antimicrobial resistance-associated genes. The genes conferring resistance to Fluoroquinolones was the most relevant class of antimicrobial resistance genes observed in all the samples tested. To a lesser extent traits associated with the resistance to Methicillin in Staphylococci and genes conferring resistance to Streptothricin, Fosfomycin and Vancomycin were also identified. The most represented metal resistance genes were cobalt-zinc-cadmium related, accounting for 15–50% of the sequence reads in the different metagenomes out of the total number of those mapping on the class of resistance to compounds determinants. Moreover the taxonomic analysis performed by comparing compost-based samples and biosolids derived from municipal sewage-sludges treatments divided the samples into separate populations, based on the microbiota composition. The results confirm that the metagenomics is efficient to detect genomic traits associated with pathogens and antimicrobial resistance in complex matrices and this approach can be efficiently used for the traceability of TSI samples using the microorganisms’ profiles as indicators of their origin. - Highlights: • Sludge- and green- based biosolids analysed by metagenomics. • Biosolids may introduce microbial hazards in the food chain. • Metagenomics enables tracking biosolids’ sources.

  15. Metagenomics, metaMicrobesOnline and Kbase Data Integration (MICW - Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Dehal, Paramvir

    2011-10-12

    Berkeley Lab's Paramvir Dehal on "Managing and Storing large Datasets in MicrobesOnline, metaMicrobesOnline and the DOE Knowledgebase" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  16. Meta-transcriptomics indicates biotic cross-tolerance in willow trees cultivated on petroleum hydrocarbon contaminated soil.

    Gonzalez, Emmanuel; Brereton, Nicholas J B; Marleau, Julie; Guidi Nissim, Werther; Labrecque, Michel; Pitre, Frederic E; Joly, Simon

    2015-10-12

    High concentrations of petroleum hydrocarbon (PHC) pollution can be hazardous to human health and leave soils incapable of supporting agricultural crops. A cheap solution, which can help restore biodiversity and bring land back to productivity, is cultivation of high biomass yielding willow trees. However, the genetic mechanisms which allow these fast-growing trees to tolerate PHCs are as yet unclear. Salix purpurea 'Fish Creek' trees were pot-grown in soil from a former petroleum refinery, either lacking or enriched with C10-C50 PHCs. De novo assembled transcriptomes were compared between tree organs and impartially annotated without a priori constraint to any organism. Over 45% of differentially expressed genes originated from foreign organisms, the majority from the two-spotted spidermite, Tetranychus urticae. Over 99% of T. urticae transcripts were differentially expressed with greater abundance in non-contaminated trees. Plant transcripts involved in the polypropanoid pathway, including phenylalanine ammonia-lyase (PAL), had greater expression in contaminated trees whereas most resistance genes showed higher expression in non-contaminated trees. The impartial approach to annotation of the de novo transcriptomes, allowing for the possibility for multiple species identification, was essential for interpretation of the crop's response treatment. The meta-transcriptomic pattern of expression suggests a cross-tolerance mechanism whereby abiotic stress resistance systems provide improved biotic resistance. These findings highlight a valuable but complex biotic and abiotic stress response to real-world, multidimensional contamination which could, in part, help explain why crops such as willow can produce uniquely high biomass yields on challenging marginal land.

  17. Study of the Metatranscriptome of Eight Social and Solitary Wild Bee Species Reveals Novel Viruses and Bee Parasites.

    Schoonvaere, Karel; Smagghe, Guy; Francis, Frédéric; de Graaf, Dirk C

    2018-01-01

    Bees are associated with a remarkable diversity of microorganisms, including unicellular parasites, bacteria, fungi, and viruses. The application of next-generation sequencing approaches enables the identification of this rich species composition as well as the discovery of previously unknown associations. Using high-throughput polyadenylated ribonucleic acid (RNA) sequencing, we investigated the metatranscriptome of eight wild bee species ( Andrena cineraria, Andrena fulva, Andrena haemorrhoa, Bombus terrestris, Bombus cryptarum, Bombus pascuorum, Osmia bicornis , and Osmia cornuta ) sampled from four different localities in Belgium. Across the RNA sequencing libraries, 88-99% of the taxonomically informative reads were of the host transcriptome. Four viruses with homology to insect pathogens were found including two RNA viruses (belonging to the families Iflaviridae and Tymoviridae that harbor already viruses of honey bees), a double stranded DNA virus (family Nudiviridae ) and a single stranded DNA virus (family Parvoviridae ). In addition, we found genomic sequences of 11 unclassified arthropod viruses (related to negeviruses, sobemoviruses, totiviruses, rhabdoviruses, and mononegaviruses), seven plant pathogenic viruses, and one fungal virus. Interestingly, nege-like viruses appear to be widespread, host-specific, and capable of attaining high copy numbers inside bees. Next to viruses, three novel parasite associations were discovered in wild bees, including Crithidia pragensis and a tubulinosematid and a neogregarine parasite. Yeasts of the genus Metschnikowia were identified in solitary bees. This study gives a glimpse of the microorganisms and viruses associated with social and solitary wild bees and demonstrates that their diversity exceeds by far the subset of species first discovered in honey bees.

  18. Study of the Metatranscriptome of Eight Social and Solitary Wild Bee Species Reveals Novel Viruses and Bee Parasites

    Karel Schoonvaere

    2018-02-01

    Full Text Available Bees are associated with a remarkable diversity of microorganisms, including unicellular parasites, bacteria, fungi, and viruses. The application of next-generation sequencing approaches enables the identification of this rich species composition as well as the discovery of previously unknown associations. Using high-throughput polyadenylated ribonucleic acid (RNA sequencing, we investigated the metatranscriptome of eight wild bee species (Andrena cineraria, Andrena fulva, Andrena haemorrhoa, Bombus terrestris, Bombus cryptarum, Bombus pascuorum, Osmia bicornis, and Osmia cornuta sampled from four different localities in Belgium. Across the RNA sequencing libraries, 88–99% of the taxonomically informative reads were of the host transcriptome. Four viruses with homology to insect pathogens were found including two RNA viruses (belonging to the families Iflaviridae and Tymoviridae that harbor already viruses of honey bees, a double stranded DNA virus (family Nudiviridae and a single stranded DNA virus (family Parvoviridae. In addition, we found genomic sequences of 11 unclassified arthropod viruses (related to negeviruses, sobemoviruses, totiviruses, rhabdoviruses, and mononegaviruses, seven plant pathogenic viruses, and one fungal virus. Interestingly, nege-like viruses appear to be widespread, host-specific, and capable of attaining high copy numbers inside bees. Next to viruses, three novel parasite associations were discovered in wild bees, including Crithidia pragensis and a tubulinosematid and a neogregarine parasite. Yeasts of the genus Metschnikowia were identified in solitary bees. This study gives a glimpse of the microorganisms and viruses associated with social and solitary wild bees and demonstrates that their diversity exceeds by far the subset of species first discovered in honey bees.

  19. Metatranscriptomic analysis of a high-sulfide aquatic spring reveals insights into sulfur cycling and unexpected aerobic metabolism

    Anne M. Spain

    2015-09-01

    Full Text Available Zodletone spring is a sulfide-rich spring in southwestern Oklahoma characterized by shallow, microoxic, light-exposed spring water overlaying anoxic sediments. Previously, culture-independent 16S rRNA gene based diversity surveys have revealed that Zodletone spring source sediments harbor a highly diverse microbial community, with multiple lineages putatively involved in various sulfur-cycling processes. Here, we conducted a metatranscriptomic survey of microbial populations in Zodletone spring source sediments to characterize the relative prevalence and importance of putative phototrophic, chemolithotrophic, and heterotrophic microorganisms in the sulfur cycle, the identity of lineages actively involved in various sulfur cycling processes, and the interaction between sulfur cycling and other geochemical processes at the spring source. Sediment samples at the spring’s source were taken at three different times within a 24-h period for geochemical analyses and RNA sequencing. In depth mining of datasets for sulfur cycling transcripts revealed major sulfur cycling pathways and taxa involved, including an unexpected potential role of Actinobacteria in sulfide oxidation and thiosulfate transformation. Surprisingly, transcripts coding for the cyanobacterial Photosystem II D1 protein, methane monooxygenase, and terminal cytochrome oxidases were encountered, indicating that genes for oxygen production and aerobic modes of metabolism are actively being transcribed, despite below-detectable levels (<1 µM of oxygen in source sediment. Results highlight transcripts involved in sulfur, methane, and oxygen cycles, propose that oxygenic photosynthesis could support aerobic methane and sulfide oxidation in anoxic sediments exposed to sunlight, and provide a viewpoint of microbial metabolic lifestyles under conditions similar to those seen during late Archaean and Proterozoic eons.

  20. DOE JGI Quality Metrics; Approaches to Scaling and Improving Metagenome Assembly (Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Copeland, Alex; Brown, C. Titus

    2011-10-13

    DOE JGI's Alex Copeland on "DOE JGI Quality Metrics" and Michigan State University's C. Titus Brown on "Approaches to Scaling and Improving Metagenome Assembly" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  1. Evaluation of the Cow Rumen Metagenome: Assembly by Single Copy Gene Analysis and Single Cell Genome Assemblies (Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Sczyrba, Alex

    2011-10-13

    DOE JGI's Alex Sczyrba on "Evaluation of the Cow Rumen Metagenome" and "Assembly by Single Copy Gene Analysis and Single Cell Genome Assemblies" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  2. MetaVelvet: An Extension of Velvet Assembler to de novo Metagenome Assembly from Short Sequence Reads (Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Sakakibara, Yasumbumi

    2011-10-13

    Keio University's Yasumbumi Sakakibara on "MetaVelvet: An Extension of Velvet Assembler to de novo Metagenome Assembly from Short Sequence Reads" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  3. Metagenomics: The Next Culture-Independent Game Changer

    Jessica D. Forbes

    2017-07-01

    Full Text Available A trend towards the abandonment of obtaining pure culture isolates in frontline laboratories is at a crossroads with the ability of public health agencies to perform their basic mandate of foodborne disease surveillance and response. The implementation of culture-independent diagnostic tests (CIDTs including nucleic acid and antigen-based assays for acute gastroenteritis is leaving public health agencies without laboratory evidence to link clinical cases to each other and to food or environmental substances. This limits the efficacy of public health epidemiology and surveillance as well as outbreak detection and investigation. Foodborne outbreaks have the potential to remain undetected or have insufficient evidence to support source attribution and may inadvertently increase the incidence of foodborne diseases. Next-generation sequencing of pure culture isolates in clinical microbiology laboratories has the potential to revolutionize the fields of food safety and public health. Metagenomics and other ‘omics’ disciplines could provide the solution to a cultureless future in clinical microbiology, food safety and public health. Data mining of information obtained from metagenomics assays can be particularly useful for the identification of clinical causative agents or foodborne contamination, detection of AMR and/or virulence factors, in addition to providing high-resolution subtyping data. Thus, metagenomics assays may provide a universal test for clinical diagnostics, foodborne pathogen detection, subtyping and investigation. This information has the potential to reform the field of enteric disease diagnostics and surveillance and also infectious diseases as a whole. The aim of this review will be to present the current state of CIDTs in diagnostic and public health laboratories as they relate to foodborne illness and food safety. Moreover, we will also discuss the diagnostic and subtyping utility and concomitant bias limitations of

  4. [Mini review] metagenomic studies of the Red Sea

    Behzad, Hayedeh; Ibarra, Martin Augusto; Mineta, Katsuhiko; Gojobori, Takashi

    2015-01-01

    Metagenomics has significantly advanced the field of marine microbial ecology, revealing the vast diversity of previously unknown microbial life forms in different marine niches. The tremendous amount of data generated has enabled identification of a large number of microbial genes (metagenomes), their community interactions, adaptation mechanisms, and their potential applications in pharmaceutical and biotechnology-based industries. Comparative metagenomics reveals that microbial diversity is a function of the local environment, meaning that unique or unusual environments typically harbor novel microbial species with unique genes and metabolic pathways. The Red Sea has an abundance of unique characteristics; however, its microbiota is one of the least studied amongst marine environments. The Red Sea harbors approximately 25 hot anoxic brine pools, plus a vibrant coral reef ecosystem. Physiochemical studies describe the Red Sea as an oligotrophic environment that contains one of the warmest and saltiest waters in the world with year-round high UV radiations. These characteristics are believed to have shaped the evolution of microbial communities in the Red Sea. Over-representation of genes involved in DNA repair, high-intensity light responses, and osmolyte C1 oxidation were found in the Red Sea metagenomic databases suggesting acquisition of specific environmental adaptation by the Red Sea microbiota. The Red Sea brine pools harbor a diverse range of halophilic and thermophilic bacterial and archaeal communities, which are potential sources of enzymes for pharmaceutical and biotechnology-based application. Understanding the mechanisms of these adaptations and their function within the larger ecosystem could also prove useful in light of predicted global warming scenarios where global ocean temperatures are expected to rise by 1–3 °C in the next few decades. In this review, we provide an overview of the published metagenomic studies that were conducted in the

  5. [Mini review] metagenomic studies of the Red Sea

    Behzad, Hayedeh

    2015-10-23

    Metagenomics has significantly advanced the field of marine microbial ecology, revealing the vast diversity of previously unknown microbial life forms in different marine niches. The tremendous amount of data generated has enabled identification of a large number of microbial genes (metagenomes), their community interactions, adaptation mechanisms, and their potential applications in pharmaceutical and biotechnology-based industries. Comparative metagenomics reveals that microbial diversity is a function of the local environment, meaning that unique or unusual environments typically harbor novel microbial species with unique genes and metabolic pathways. The Red Sea has an abundance of unique characteristics; however, its microbiota is one of the least studied amongst marine environments. The Red Sea harbors approximately 25 hot anoxic brine pools, plus a vibrant coral reef ecosystem. Physiochemical studies describe the Red Sea as an oligotrophic environment that contains one of the warmest and saltiest waters in the world with year-round high UV radiations. These characteristics are believed to have shaped the evolution of microbial communities in the Red Sea. Over-representation of genes involved in DNA repair, high-intensity light responses, and osmolyte C1 oxidation were found in the Red Sea metagenomic databases suggesting acquisition of specific environmental adaptation by the Red Sea microbiota. The Red Sea brine pools harbor a diverse range of halophilic and thermophilic bacterial and archaeal communities, which are potential sources of enzymes for pharmaceutical and biotechnology-based application. Understanding the mechanisms of these adaptations and their function within the larger ecosystem could also prove useful in light of predicted global warming scenarios where global ocean temperatures are expected to rise by 1–3 °C in the next few decades. In this review, we provide an overview of the published metagenomic studies that were conducted in the

  6. Metagenomic sequence of saline desert microbiota from wild ass sanctuary, Little Rann of Kutch, Gujarat, India.

    Patel, Rajesh; Mevada, Vishal; Prajapati, Dhaval; Dudhagara, Pravin; Koringa, Prakash; Joshi, C G

    2015-03-01

    We report Metagenome from the saline desert soil sample of Little Rann of Kutch, Gujarat State, India. Metagenome consisted of 633,760 sequences with size 141,307,202 bp and 56% G + C content. Metagenome sequence data are available at EBI under EBI Metagenomics database with accession no. ERP005612. Community metagenomics revealed total 1802 species belonged to 43 different phyla with dominating Marinobacter (48.7%) and Halobacterium (4.6%) genus in bacterial and archaeal domain respectively. Remarkably, 18.2% sequences in a poorly characterized group and 4% gene for various stress responses along with versatile presence of commercial enzyme were evident in a functional metagenome analysis.

  7. High throughtput comparisons and profiling of metagenomes for industrially relevant enzymes

    Alam, Intikhab

    2016-01-26

    More and more genomes and metagenomes are being sequenced since the advent of Next Generation Sequencing Technologies (NGS). Many metagenomic samples are collected from a variety of environments, each exhibiting a different environmental profile, e.g. temperature, environmental chemistry, etc… These metagenomes can be profiled to unearth enzymes relevant to several industries based on specific enzyme properties such as ability to work on extreme conditions, such as extreme temperatures, salinity, anaerobically, etc.. In this work, we present the DMAP platform comprising of a high-throughput metagenomic annotation pipeline and a data-warehouse for comparisons and profiling across large number of metagenomes. We developed two reference databases for profiling of important genes, one containing enzymes related to different industries and the other containing genes with potential bioactivity roles. In this presentation we describe an example analysis of a large number of publicly available metagenomic sample from TARA oceans study (Science 2015) that covers significant part of world oceans.

  8. Metagenomes from two microbial consortia associated with Santa Barbara seep oil.

    Hawley, Erik R; Malfatti, Stephanie A; Pagani, Ioanna; Huntemann, Marcel; Chen, Amy; Foster, Brian; Copeland, Alexander; del Rio, Tijana Glavina; Pati, Amrita; Jansson, Janet R; Gilbert, Jack A; Tringe, Susannah Green; Lorenson, Thomas D; Hess, Matthias

    2014-12-01

    The metagenomes from two microbial consortia associated with natural oils seeping into the Pacific Ocean offshore the coast of Santa Barbara (California, USA) were determined to complement already existing metagenomes generated from microbial communities associated with hydrocarbons that pollute the marine ecosystem. This genomics resource article is the first of two publications reporting a total of four new metagenomes from oils that seep into the Santa Barbara Channel. Copyright © 2014 Elsevier B.V. All rights reserved.

  9. A Statistical Framework for the Functional Analysis of Metagenomes

    Sharon, Itai; Pati, Amrita; Markowitz, Victor; Pinter, Ron Y.

    2008-10-01

    Metagenomic studies consider the genetic makeup of microbial communities as a whole, rather than their individual member organisms. The functional and metabolic potential of microbial communities can be analyzed by comparing the relative abundance of gene families in their collective genomic sequences (metagenome) under different conditions. Such comparisons require accurate estimation of gene family frequencies. They present a statistical framework for assessing these frequencies based on the Lander-Waterman theory developed originally for Whole Genome Shotgun (WGS) sequencing projects. They also provide a novel method for assessing the reliability of the estimations which can be used for removing seemingly unreliable measurements. They tested their method on a wide range of datasets, including simulated genomes and real WGS data from sequencing projects of whole genomes. Results suggest that their framework corrects inherent biases in accepted methods and provides a good approximation to the true statistics of gene families in WGS projects.

  10. Metagenomic species profiling using universal phylogenetic marker genes

    Sunagawa, Shinichi; Mende, Daniel R; Zeller, Georg

    2013-01-01

    To quantify known and unknown microorganisms at species-level resolution using shotgun sequencing data, we developed a method that establishes metagenomic operational taxonomic units (mOTUs) based on single-copy phylogenetic marker genes. Applied to 252 human fecal samples, the method revealed th...... that on average 43% of the species abundance and 58% of the richness cannot be captured by current reference genome-based methods. An implementation of the method is available at http://www.bork.embl.de/software/mOTU/.......To quantify known and unknown microorganisms at species-level resolution using shotgun sequencing data, we developed a method that establishes metagenomic operational taxonomic units (mOTUs) based on single-copy phylogenetic marker genes. Applied to 252 human fecal samples, the method revealed...

  11. Extremozymes from metagenome: Potential applications in food processing.

    Khan, Mahejibin; Sathya, T A

    2017-06-12

    The long-established use of enzymes for food processing and product formulation has resulted in an increased enzyme market compounding to 7.0% annual growth rate. Advancements in molecular biology and recognition that enzymes with specific properties have application for industrial production of infant, baby and functional foods boosted research toward sourcing the genes of microorganisms for enzymes with distinctive properties. In this regard, functional metagenomics for extremozymes has gained attention on the premise that such enzymes can catalyze specific reactions. Hence, metagenomics that can isolate functional genes of unculturable extremophilic microorganisms has expanded attention as a promising tool. Developments in this field of research in relation to food sector are reviewed.

  12. Metagenome of a Versatile Chemolithoautotroph from Expanding Oceanic Dead Zones

    Walsh, David A.; Zaikova, Elena; Howes, Charles L.; Song, Young; Wright, Jody; Tringe, Susannah G.; Tortell, Philippe D.; Hallam, Steven J.

    2009-07-15

    Oxygen minimum zones (OMZs), also known as oceanic"dead zones", are widespread oceanographic features currently expanding due to global warming and coastal eutrophication. Although inhospitable to metazoan life, OMZs support a thriving but cryptic microbiota whose combined metabolic activity is intimately connected to nutrient and trace gas cycling within the global ocean. Here we report time-resolved metagenomic analyses of a ubiquitous and abundant but uncultivated OMZ microbe (SUP05) closely related to chemoautotrophic gill symbionts of deep-sea clams and mussels. The SUP05 metagenome harbors a versatile repertoire of genes mediating autotrophic carbon assimilation, sulfur-oxidation and nitrate respiration responsive to a wide range of water column redox states. Thus, SUP05 plays integral roles in shaping nutrient and energy flow within oxygen-deficient oceanic waters via carbon sequestration, sulfide detoxification and biological nitrogen loss with important implications for marine productivity and atmospheric greenhouse control.

  13. Diverse circovirus-like genome architectures revealed by environmental metagenomics.

    Rosario, Karyna; Duffy, Siobain; Breitbart, Mya

    2009-10-01

    Single-stranded DNA (ssDNA) viruses with circular genomes are the smallest viruses known to infect eukaryotes. The present study identified 10 novel genomes similar to ssDNA circoviruses through data-mining of public viral metagenomes. The metagenomic libraries included samples from reclaimed water and three different marine environments (Chesapeake Bay, British Columbia coastal waters and Sargasso Sea). All the genomes have similarities to the replication (Rep) protein of circoviruses; however, only half have genomic features consistent with known circoviruses. Some of the genomes exhibit a mixture of genomic features associated with different families of ssDNA viruses (i.e. circoviruses, geminiviruses and parvoviruses). Unique genome architectures and phylogenetic analysis of the Rep protein suggest that these viruses belong to novel genera and/or families. Investigating the complex community of ssDNA viruses in the environment can lead to the discovery of divergent species and help elucidate evolutionary links between ssDNA viruses.

  14. Metagenomic analysis of the airborne environment in urban spaces.

    Be, Nicholas A; Thissen, James B; Fofanov, Viacheslav Y; Allen, Jonathan E; Rojas, Mark; Golovko, George; Fofanov, Yuriy; Koshinsky, Heather; Jaing, Crystal J

    2015-02-01

    The organisms in aerosol microenvironments, especially densely populated urban areas, are relevant to maintenance of public health and detection of potential epidemic or biothreat agents. To examine aerosolized microorganisms in this environment, we performed sequencing on the material from an urban aerosol surveillance program. Whole metagenome sequencing was applied to DNA extracted from air filters obtained during periods from each of the four seasons. The composition of bacteria, plants, fungi, invertebrates, and viruses demonstrated distinct temporal shifts. Bacillus thuringiensis serovar kurstaki was detected in samples known to be exposed to aerosolized spores, illustrating the potential utility of this approach for identification of intentionally introduced microbial agents. Together, these data demonstrate the temporally dependent metagenomic complexity of urban aerosols and the potential of genomic analytical techniques for biosurveillance and monitoring of threats to public health.

  15. A metagenomic framework for the study of airborne microbial communities.

    Yooseph, Shibu; Andrews-Pfannkoch, Cynthia; Tenney, Aaron; McQuaid, Jeff; Williamson, Shannon; Thiagarajan, Mathangi; Brami, Daniel; Zeigler-Allen, Lisa; Hoffman, Jeff; Goll, Johannes B; Fadrosh, Douglas; Glass, John; Adams, Mark D; Friedman, Robert; Venter, J Craig

    2013-01-01

    Understanding the microbial content of the air has important scientific, health, and economic implications. While studies have primarily characterized the taxonomic content of air samples by sequencing the 16S or 18S ribosomal RNA gene, direct analysis of the genomic content of airborne microorganisms has not been possible due to the extremely low density of biological material in airborne environments. We developed sampling and amplification methods to enable adequate DNA recovery to allow metagenomic profiling of air samples collected from indoor and outdoor environments. Air samples were collected from a large urban building, a medical center, a house, and a pier. Analyses of metagenomic data generated from these samples reveal airborne communities with a high degree of diversity and different genera abundance profiles. The identities of many of the taxonomic groups and protein families also allows for the identification of the likely sources of the sampled airborne bacteria.

  16. Toward a Standards-Compliant Genomic and Metagenomic Publication Record

    Garrity, GM; Field, D; Kyrpides, N

    2008-01-01

    Increasingly, we are aware as a community of the growing need to manage the avalanche of genomic and metagenomic data, in addition to related data types like ribosomal RNA and barcode sequences, in a way that tightly integrates contextual data with traditional literature in a machine-readable way...... is in the midst of a publishing revolution. This revolution is marked by a growing shift away from a traditional dichotomy between "journal articles" and "database entries" and an increasing adoption of hybrid models of collecting and disseminating scientific information. With respect to genomes and metagenomes...... or communities) such as the call by the GSC for a central repository of Standard Operating Procedures describing the genomic annotation pipelines of the major sequencing centers. We argue that such an "eJournal," published under the Open Access paradigm by the GSC, could be an attractive publishing forum...

  17. Construction and Screening of Marine Metagenomic Large Insert Libraries.

    Weiland-Bräuer, Nancy; Langfeldt, Daniela; Schmitz, Ruth A

    2017-01-01

    The marine environment covers more than 70 % of the world's surface. Marine microbial communities are highly diverse and have evolved during extended evolutionary processes of physiological adaptations under the influence of a variety of ecological conditions and selection pressures. They harbor an enormous diversity of microbes with still unknown and probably new physiological characteristics. In the past, marine microbes, mostly bacteria of microbial consortia attached to marine tissues of multicellular organisms, have proven to be a rich source of highly potent bioactive compounds, which represent a considerable number of drug candidates. However, to date, the biodiversity of marine microbes and the versatility of their bioactive compounds and metabolites have not been fully explored. This chapter describes sampling in the marine environment, construction of metagenomic large insert libraries from marine habitats, and exemplarily one function based screen of metagenomic clones for identification of quorum quenching activities.

  18. Metagenomes provide valuable comparative information on soil microeukaryotes

    Jacquiod, Samuel Jehan Auguste; Stenbæk, Jonas; Santos, Susana

    2016-01-01

    has been identified. Our analyses suggest that publicly available metagenome data can provide valuable information on soil microeukaryotes for comparative purposes when handled appropriately, complementing the current view provided by ribosomal amplicon sequencing methods......., providing microbiologists with substantial amounts of accessible information. We took advantage of public metagenomes in order to investigate microeukaryote communities in a well characterized grassland soil. The data gathered allowed the evaluation of several factors impacting the community structure......, including the DNA extraction method, the database choice and also the annotation procedure. While most studies on soil microeukaryotes are based on sequencing of PCR-amplified taxonomic markers (18S rRNA genes, ITS regions), this work represents, to our knowledge, the first report based solely...

  19. The new science of metagenomics: revealing the secrets of our microbial planet

    Committee on Metagenomics: Challenges and Functional Applications, National Research Council

    2007-01-01

    .... The emerging field of metagenomics offers a new way of exploring the microbial world that will transform modern microbiology and lead to practical applications in medicine, agriculture, alternative...

  20. Genome signature analysis of thermal virus metagenomes reveals Archaea and thermophilic signatures

    Pride David T

    2008-09-01

    Full Text Available Abstract Background Metagenomic analysis provides a rich source of biological information for otherwise intractable viral communities. However, study of viral metagenomes has been hampered by its nearly complete reliance on BLAST algorithms for identification of DNA sequences. We sought to develop algorithms for examination of viral metagenomes to identify the origin of sequences independent of BLAST algorithms. We chose viral metagenomes obtained from two hot springs, Bear Paw and Octopus, in Yellowstone National Park, as they represent simple microbial populations where comparatively large contigs were obtained. Thermal spring metagenomes have high proportions of sequences without significant Genbank homology, which has hampered identification of viruses and their linkage with hosts. To analyze each metagenome, we developed a method to classify DNA fragments using genome signature-based phylogenetic classification (GSPC, where metagenomic fragments are compared to a database of oligonucleotide signatures for all previously sequenced Bacteria, Archaea, and viruses. Results From both Bear Paw and Octopus hot springs, each assembled contig had more similarity to other metagenome contigs than to any sequenced microbial genome based on GSPC analysis, suggesting a genome signature common to each of these extreme environments. While viral metagenomes from Bear Paw and Octopus share some similarity, the genome signatures from each locale are largely unique. GSPC using a microbial database predicts most of the Octopus metagenome has archaeal signatures, while bacterial signatures predominate in Bear Paw; a finding consistent with those of Genbank BLAST. When using a viral database, the majority of the Octopus metagenome is predicted to belong to archaeal virus Families Globuloviridae and Fuselloviridae, while none of the Bear Paw metagenome is predicted to belong to archaeal viruses. As expected, when microbial and viral databases are combined, each of

  1. Genome signature analysis of thermal virus metagenomes reveals Archaea and thermophilic signatures.

    Pride, David T; Schoenfeld, Thomas

    2008-09-17

    Metagenomic analysis provides a rich source of biological information for otherwise intractable viral communities. However, study of viral metagenomes has been hampered by its nearly complete reliance on BLAST algorithms for identification of DNA sequences. We sought to develop algorithms for examination of viral metagenomes to identify the origin of sequences independent of BLAST algorithms. We chose viral metagenomes obtained from two hot springs, Bear Paw and Octopus, in Yellowstone National Park, as they represent simple microbial populations where comparatively large contigs were obtained. Thermal spring metagenomes have high proportions of sequences without significant Genbank homology, which has hampered identification of viruses and their linkage with hosts. To analyze each metagenome, we developed a method to classify DNA fragments using genome signature-based phylogenetic classification (GSPC), where metagenomic fragments are compared to a database of oligonucleotide signatures for all previously sequenced Bacteria, Archaea, and viruses. From both Bear Paw and Octopus hot springs, each assembled contig had more similarity to other metagenome contigs than to any sequenced microbial genome based on GSPC analysis, suggesting a genome signature common to each of these extreme environments. While viral metagenomes from Bear Paw and Octopus share some similarity, the genome signatures from each locale are largely unique. GSPC using a microbial database predicts most of the Octopus metagenome has archaeal signatures, while bacterial signatures predominate in Bear Paw; a finding consistent with those of Genbank BLAST. When using a viral database, the majority of the Octopus metagenome is predicted to belong to archaeal virus Families Globuloviridae and Fuselloviridae, while none of the Bear Paw metagenome is predicted to belong to archaeal viruses. As expected, when microbial and viral databases are combined, each of the Octopus and Bear Paw metagenomic contigs

  2. Rapid and efficient method to extract metagenomic DNA from estuarine sediments.

    Shamim, Kashif; Sharma, Jaya; Dubey, Santosh Kumar

    2017-07-01

    Metagenomic DNA from sediments of selective estuaries of Goa, India was extracted using a simple, fast, efficient and environment friendly method. The recovery of pure metagenomic DNA from our method was significantly high as compared to other well-known methods since the concentration of recovered metagenomic DNA ranged from 1185.1 to 4579.7 µg/g of sediment. The purity of metagenomic DNA was also considerably high as the ratio of absorbance at 260 and 280 nm ranged from 1.88 to 1.94. Therefore, the recovered metagenomic DNA was directly used to perform various molecular biology experiments viz. restriction digestion, PCR amplification, cloning and metagenomic library construction. This clearly proved that our protocol for metagenomic DNA extraction using silica gel efficiently removed the contaminants and prevented shearing of the metagenomic DNA. Thus, this modified method can be used to recover pure metagenomic DNA from various estuarine sediments in a rapid, efficient and eco-friendly manner.

  3. Metagenome-derived haloalkane dehalogenases with novel catalytic properties

    Kotík, Michael; Vaňáček, P.; Kuňka, A.; Prokop, Z.; Dambrovský, J.

    2017-01-01

    Roč. 101, č. 16 (2017), s. 6385-6397 ISSN 0175-7598 R&D Projects: GA ČR GAP504/10/0137; GA MŠk(CZ) LM2015047; GA MŠk(CZ) LM2015055 Institutional support: RVO:61388971 Keywords : Haloalkane dehalogenase * Metagenomic DNA * Heterologous production Subject RIV: CE - Biochemistry OBOR OECD: Biochemistry and molecular biology Impact factor: 3.420, year: 2016

  4. Bioprospecting metagenomics of decaying wood: mining for new glycoside hydrolases

    Li Luen-Luen

    2011-08-01

    Full Text Available Abstract Background To efficiently deconstruct recalcitrant plant biomass to fermentable sugars in industrial processes, biocatalysts of higher performance and lower cost are required. The genetic diversity found in the metagenomes of natural microbial biomass decay communities may harbor such enzymes. Our goal was to discover and characterize new glycoside hydrolases (GHases from microbial biomass decay communities, especially those from unknown or never previously cultivated microorganisms. Results From the metagenome sequences of an anaerobic microbial community actively decaying poplar biomass, we identified approximately 4,000 GHase homologs. Based on homology to GHase families/activities of interest and the quality of the sequences, candidates were selected for full-length cloning and subsequent expression. As an alternative strategy, a metagenome expression library was constructed and screened for GHase activities. These combined efforts resulted in the cloning of four novel GHases that could be successfully expressed in Escherichia coli. Further characterization showed that two enzymes showed significant activity on p-nitrophenyl-α-L-arabinofuranoside, one enzyme had significant activity against p-nitrophenyl-β-D-glucopyranoside, and one enzyme showed significant activity against p-nitrophenyl-β-D-xylopyranoside. Enzymes were also tested in the presence of ionic liquids. Conclusions Metagenomics provides a good resource for mining novel biomass degrading enzymes and for screening of cellulolytic enzyme activities. The four GHases that were cloned may have potential application for deconstruction of biomass pretreated with ionic liquids, as they remain active in the presence of up to 20% ionic liquid (except for 1-ethyl-3-methylimidazolium diethyl phosphate. Alternatively, ionic liquids might be used to immobilize or stabilize these enzymes for minimal solvent processing of biomass.

  5. Forest harvesting reduces the soil metagenomic potential for biomass decomposition.

    Cardenas, Erick; Kranabetter, J M; Hope, Graeme; Maas, Kendra R; Hallam, Steven; Mohn, William W

    2015-11-01

    Soil is the key resource that must be managed to ensure sustainable forest productivity. Soil microbial communities mediate numerous essential ecosystem functions, and recent studies show that forest harvesting alters soil community composition. From a long-term soil productivity study site in a temperate coniferous forest in British Columbia, 21 forest soil shotgun metagenomes were generated, totaling 187 Gb. A method to analyze unassembled metagenome reads from the complex community was optimized and validated. The subsequent metagenome analysis revealed that, 12 years after forest harvesting, there were 16% and 8% reductions in relative abundances of biomass decomposition genes in the organic and mineral soil layers, respectively. Organic and mineral soil layers differed markedly in genetic potential for biomass degradation, with the organic layer having greater potential and being more strongly affected by harvesting. Gene families were disproportionately affected, and we identified 41 gene families consistently affected by harvesting, including families involved in lignin, cellulose, hemicellulose and pectin degradation. The results strongly suggest that harvesting profoundly altered below-ground cycling of carbon and other nutrients at this site, with potentially important consequences for forest regeneration. Thus, it is important to determine whether these changes foreshadow long-term changes in forest productivity or resilience and whether these changes are broadly characteristic of harvested forests.

  6. Challenges of the Unknown: Clinical Application of Microbial Metagenomics

    Graham Rose

    2015-01-01

    Full Text Available Availability of fast, high throughput and low cost whole genome sequencing holds great promise within public health microbiology, with applications ranging from outbreak detection and tracking transmission events to understanding the role played by microbial communities in health and disease. Within clinical metagenomics, identifying microorganisms from a complex and host enriched background remains a central computational challenge. As proof of principle, we sequenced two metagenomic samples, a known viral mixture of 25 human pathogens and an unknown complex biological model using benchtop technology. The datasets were then analysed using a bioinformatic pipeline developed around recent fast classification methods. A targeted approach was able to detect 20 of the viruses against a background of host contamination from multiple sources and bacterial contamination. An alternative untargeted identification method was highly correlated with these classifications, and over 1,600 species were identified when applied to the complex biological model, including several species captured at over 50% genome coverage. In summary, this study demonstrates the great potential of applying metagenomics within the clinical laboratory setting and that this can be achieved using infrastructure available to nondedicated sequencing centres.

  7. Bioinformatic approaches reveal metagenomic characterization of soil microbial community.

    Zhuofei Xu

    Full Text Available As is well known, soil is a complex ecosystem harboring the most prokaryotic biodiversity on the Earth. In recent years, the advent of high-throughput sequencing techniques has greatly facilitated the progress of soil ecological studies. However, how to effectively understand the underlying biological features of large-scale sequencing data is a new challenge. In the present study, we used 33 publicly available metagenomes from diverse soil sites (i.e. grassland, forest soil, desert, Arctic soil, and mangrove sediment and integrated some state-of-the-art computational tools to explore the phylogenetic and functional characterizations of the microbial communities in soil. Microbial composition and metabolic potential in soils were comprehensively illustrated at the metagenomic level. A spectrum of metagenomic biomarkers containing 46 taxa and 33 metabolic modules were detected to be significantly differential that could be used as indicators to distinguish at least one of five soil communities. The co-occurrence associations between complex microbial compositions and functions were inferred by network-based approaches. Our results together with the established bioinformatic pipelines should provide a foundation for future research into the relation between soil biodiversity and ecosystem function.

  8. BeerDeCoded: the open beer metagenome project.

    Sobel, Jonathan; Henry, Luc; Rotman, Nicolas; Rando, Gianpaolo

    2017-01-01

    Next generation sequencing has radically changed research in the life sciences, in both academic and corporate laboratories. The potential impact is tremendous, yet a majority of citizens have little or no understanding of the technological and ethical aspects of this widespread adoption. We designed BeerDeCoded as a pretext to discuss the societal issues related to genomic and metagenomic data with fellow citizens, while advancing scientific knowledge of the most popular beverage of all. In the spirit of citizen science, sample collection and DNA extraction were carried out with the participation of non-scientists in the community laboratory of Hackuarium, a not-for-profit organisation that supports unconventional research and promotes the public understanding of science. The dataset presented herein contains the targeted metagenomic profile of 39 bottled beers from 5 countries, based on internal transcribed spacer (ITS) sequencing of fungal species. A preliminary analysis reveals the presence of a large diversity of wild yeast species in commercial brews. With this project, we demonstrate that coupling simple laboratory procedures that can be carried out in a non-professional environment with state-of-the-art sequencing technologies and targeted metagenomic analyses, can lead to the detection and identification of the microbial content in bottled beer.

  9. PhyloSift: phylogenetic analysis of genomes and metagenomes.

    Darling, Aaron E; Jospin, Guillaume; Lowe, Eric; Matsen, Frederick A; Bik, Holly M; Eisen, Jonathan A

    2014-01-01

    Like all organisms on the planet, environmental microbes are subject to the forces of molecular evolution. Metagenomic sequencing provides a means to access the DNA sequence of uncultured microbes. By combining DNA sequencing of microbial communities with evolutionary modeling and phylogenetic analysis we might obtain new insights into microbiology and also provide a basis for practical tools such as forensic pathogen detection. In this work we present an approach to leverage phylogenetic analysis of metagenomic sequence data to conduct several types of analysis. First, we present a method to conduct phylogeny-driven Bayesian hypothesis tests for the presence of an organism in a sample. Second, we present a means to compare community structure across a collection of many samples and develop direct associations between the abundance of certain organisms and sample metadata. Third, we apply new tools to analyze the phylogenetic diversity of microbial communities and again demonstrate how this can be associated to sample metadata. These analyses are implemented in an open source software pipeline called PhyloSift. As a pipeline, PhyloSift incorporates several other programs including LAST, HMMER, and pplacer to automate phylogenetic analysis of protein coding and RNA sequences in metagenomic datasets generated by modern sequencing platforms (e.g., Illumina, 454).

  10. PhyloSift: phylogenetic analysis of genomes and metagenomes

    Aaron E. Darling

    2014-01-01

    Full Text Available Like all organisms on the planet, environmental microbes are subject to the forces of molecular evolution. Metagenomic sequencing provides a means to access the DNA sequence of uncultured microbes. By combining DNA sequencing of microbial communities with evolutionary modeling and phylogenetic analysis we might obtain new insights into microbiology and also provide a basis for practical tools such as forensic pathogen detection.In this work we present an approach to leverage phylogenetic analysis of metagenomic sequence data to conduct several types of analysis. First, we present a method to conduct phylogeny-driven Bayesian hypothesis tests for the presence of an organism in a sample. Second, we present a means to compare community structure across a collection of many samples and develop direct associations between the abundance of certain organisms and sample metadata. Third, we apply new tools to analyze the phylogenetic diversity of microbial communities and again demonstrate how this can be associated to sample metadata.These analyses are implemented in an open source software pipeline called PhyloSift. As a pipeline, PhyloSift incorporates several other programs including LAST, HMMER, and pplacer to automate phylogenetic analysis of protein coding and RNA sequences in metagenomic datasets generated by modern sequencing platforms (e.g., Illumina, 454.

  11. Reconstruction of ribosomal RNA genes from metagenomic data.

    Lu Fan

    Full Text Available Direct sequencing of environmental DNA (metagenomics has a great potential for describing the 16S rRNA gene diversity of microbial communities. However current approaches using this 16S rRNA gene information to describe community diversity suffer from low taxonomic resolution or chimera problems. Here we describe a new strategy that involves stringent assembly and data filtering to reconstruct full-length 16S rRNA genes from metagenomicpyrosequencing data. Simulations showed that reconstructed 16S rRNA genes provided a true picture of the community diversity, had minimal rates of chimera formation and gave taxonomic resolution down to genus level. The strategy was furthermore compared to PCR-based methods to determine the microbial diversity in two marine sponges. This showed that about 30% of the abundant phylotypes reconstructed from metagenomic data failed to be amplified by PCR. Our approach is readily applicable to existing metagenomic datasets and is expected to lead to the discovery of new microbial phylotypes.

  12. MOCAT: a metagenomics assembly and gene prediction toolkit.

    Kultima, Jens Roat; Sunagawa, Shinichi; Li, Junhua; Chen, Weineng; Chen, Hua; Mende, Daniel R; Arumugam, Manimozhiyan; Pan, Qi; Liu, Binghang; Qin, Junjie; Wang, Jun; Bork, Peer

    2012-01-01

    MOCAT is a highly configurable, modular pipeline for fast, standardized processing of single or paired-end sequencing data generated by the Illumina platform. The pipeline uses state-of-the-art programs to quality control, map, and assemble reads from metagenomic samples sequenced at a depth of several billion base pairs, and predict protein-coding genes on assembled metagenomes. Mapping against reference databases allows for read extraction or removal, as well as abundance calculations. Relevant statistics for each processing step can be summarized into multi-sheet Excel documents and queryable SQL databases. MOCAT runs on UNIX machines and integrates seamlessly with the SGE and PBS queuing systems, commonly used to process large datasets. The open source code and modular architecture allow users to modify or exchange the programs that are utilized in the various processing steps. Individual processing steps and parameters were benchmarked and tested on artificial, real, and simulated metagenomes resulting in an improvement of selected quality metrics. MOCAT can be freely downloaded at http://www.bork.embl.de/mocat/.

  13. MOCAT: a metagenomics assembly and gene prediction toolkit.

    Jens Roat Kultima

    Full Text Available MOCAT is a highly configurable, modular pipeline for fast, standardized processing of single or paired-end sequencing data generated by the Illumina platform. The pipeline uses state-of-the-art programs to quality control, map, and assemble reads from metagenomic samples sequenced at a depth of several billion base pairs, and predict protein-coding genes on assembled metagenomes. Mapping against reference databases allows for read extraction or removal, as well as abundance calculations. Relevant statistics for each processing step can be summarized into multi-sheet Excel documents and queryable SQL databases. MOCAT runs on UNIX machines and integrates seamlessly with the SGE and PBS queuing systems, commonly used to process large datasets. The open source code and modular architecture allow users to modify or exchange the programs that are utilized in the various processing steps. Individual processing steps and parameters were benchmarked and tested on artificial, real, and simulated metagenomes resulting in an improvement of selected quality metrics. MOCAT can be freely downloaded at http://www.bork.embl.de/mocat/.

  14. Cyclodipeptides from metagenomic library of a japanese marine sponge

    He, Rui; Wang, Bochu; Zhub, Liancai, E-mail: wangbc2000@126.com [Bioengineering College, Chongqing University, Chongqing, (China); Wang, Manyuan [School of Traditional Chinese Medicine, Capital University of Medical Sciences, Beijing (China); Wakimoto, Toshiyuki; Abe, Ikuro, E-mail: abei@mol.f.u-tokyo.ac.jp [Graduate School of Pharmaceutical Sciences, The University of Tokyo, Tokyo (Japan)

    2013-12-01

    Culture-independent metagenomics is an attractive and promising approach to explore unique bioactive small molecules from marine sponges harboring uncultured symbiotic microbes. Therefore, we conducted functional screening of the metagenomic library constructed from the Japanese marine sponge Discodermia calyx. Bioassay-guided fractionation of plate culture extract of antibacterial clone pDC113 afforded eleven cyclodipeptides: Cyclo(l-Thr-l-Leu) (1), Cyclo(l-Val-d-Pro) (2), Cyclo(l-Ile-d-Pro) (3), Cyclo(l-Leu-l-Pro) (4), Cyclo(l-Val-l-Leu) (5), Cyclo(l-Leu-l-Ile) (6), Cyclo(l-Leu-l-Leu) (7), Cyclo(l-Phe-l-Tyr) (8), Cyclo(l-Trp-l-Pro) (9), Cyclo(l-Val-l-Trp) (10) and Cyclo(l-Ile-l-Trp) (11). To the best of our knowledge, these are first cyclodepeptides isolated from metagenomic library. Sequence analysis suggested that isolated cyclodipeptides were not synthesized by nonribosomal peptide synthetases and there was no significant indication of cyclodipeptide synthetases. (author)

  15. Culture-independent discovery of natural products from soil metagenomes.

    Katz, Micah; Hover, Bradley M; Brady, Sean F

    2016-03-01

    Bacterial natural products have proven to be invaluable starting points in the development of many currently used therapeutic agents. Unfortunately, traditional culture-based methods for natural product discovery have been deemphasized by pharmaceutical companies due in large part to high rediscovery rates. Culture-independent, or "metagenomic," methods, which rely on the heterologous expression of DNA extracted directly from environmental samples (eDNA), have the potential to provide access to metabolites encoded by a large fraction of the earth's microbial biosynthetic diversity. As soil is both ubiquitous and rich in bacterial diversity, it is an appealing starting point for culture-independent natural product discovery efforts. This review provides an overview of the history of soil metagenome-driven natural product discovery studies and elaborates on the recent development of new tools for sequence-based, high-throughput profiling of environmental samples used in discovering novel natural product biosynthetic gene clusters. We conclude with several examples of these new tools being employed to facilitate the recovery of novel secondary metabolite encoding gene clusters from soil metagenomes and the subsequent heterologous expression of these clusters to produce bioactive small molecules.

  16. Cyclodipeptides from metagenomic library of a japanese marine sponge

    He, Rui; Wang, Bochu; Zhub, Liancai; Wang, Manyuan; Wakimoto, Toshiyuki; Abe, Ikuro

    2013-01-01

    Culture-independent metagenomics is an attractive and promising approach to explore unique bioactive small molecules from marine sponges harboring uncultured symbiotic microbes. Therefore, we conducted functional screening of the metagenomic library constructed from the Japanese marine sponge Discodermia calyx. Bioassay-guided fractionation of plate culture extract of antibacterial clone pDC113 afforded eleven cyclodipeptides: Cyclo(l-Thr-l-Leu) (1), Cyclo(l-Val-d-Pro) (2), Cyclo(l-Ile-d-Pro) (3), Cyclo(l-Leu-l-Pro) (4), Cyclo(l-Val-l-Leu) (5), Cyclo(l-Leu-l-Ile) (6), Cyclo(l-Leu-l-Leu) (7), Cyclo(l-Phe-l-Tyr) (8), Cyclo(l-Trp-l-Pro) (9), Cyclo(l-Val-l-Trp) (10) and Cyclo(l-Ile-l-Trp) (11). To the best of our knowledge, these are first cyclodepeptides isolated from metagenomic library. Sequence analysis suggested that isolated cyclodipeptides were not synthesized by nonribosomal peptide synthetases and there was no significant indication of cyclodipeptide synthetases. (author)

  17. Parallel metatranscriptome analyses of host and symbiont gene expression in the gut of the termite Reticulitermes flavipes

    Zhou Xuguo

    2009-10-01

    Full Text Available Abstract Background Termite lignocellulose digestion is achieved through a collaboration of host plus prokaryotic and eukaryotic symbionts. In the present work, we took a combined host and symbiont metatranscriptomic approach for investigating the digestive contributions of host and symbiont in the lower termite Reticulitermes flavipes. Our approach consisted of parallel high-throughput sequencing from (i a host gut cDNA library and (ii a hindgut symbiont cDNA library. Subsequently, we undertook functional analyses of newly identified phenoloxidases with potential importance as pretreatment enzymes in industrial lignocellulose processing. Results Over 10,000 expressed sequence tags (ESTs were sequenced from the 2 libraries that aligned into 6,555 putative transcripts, including 171 putative lignocellulase genes. Sequence analyses provided insights in two areas. First, a non-overlapping complement of host and symbiont (prokaryotic plus protist glycohydrolase gene families known to participate in cellulose, hemicellulose, alpha carbohydrate, and chitin degradation were identified. Of these, cellulases are contributed by host plus symbiont genomes, whereas hemicellulases are contributed exclusively by symbiont genomes. Second, a diverse complement of previously unknown genes that encode proteins with homology to lignase, antioxidant, and detoxification enzymes were identified exclusively from the host library (laccase, catalase, peroxidase, superoxide dismutase, carboxylesterase, cytochrome P450. Subsequently, functional analyses of phenoloxidase activity provided results that were strongly consistent with patterns of laccase gene expression. In particular, phenoloxidase activity and laccase gene expression are mostly restricted to symbiont-free foregut plus salivary gland tissues, and phenoloxidase activity is inducible by lignin feeding. Conclusion To our knowledge, this is the first time that a dual host-symbiont transcriptome sequencing effort

  18. In-depth resistome analysis by targeted metagenomics.

    Lanza, Val F; Baquero, Fernando; Martínez, José Luís; Ramos-Ruíz, Ricardo; González-Zorn, Bruno; Andremont, Antoine; Sánchez-Valenzuela, Antonio; Ehrlich, Stanislav Dusko; Kennedy, Sean; Ruppé, Etienne; van Schaik, Willem; Willems, Rob J; de la Cruz, Fernando; Coque, Teresa M

    2018-01-15

    Antimicrobial resistance is a major global health challenge. Metagenomics allows analyzing the presence and dynamics of "resistomes" (the ensemble of genes encoding antimicrobial resistance in a given microbiome) in disparate microbial ecosystems. However, the low sensitivity and specificity of available metagenomic methods preclude the detection of minority populations (often present below their detection threshold) and/or the identification of allelic variants that differ in the resulting phenotype. Here, we describe a novel strategy that combines targeted metagenomics using last generation in-solution capture platforms, with novel bioinformatics tools to establish a standardized framework that allows both quantitative and qualitative analyses of resistomes. We developed ResCap, a targeted sequence capture platform based on SeqCapEZ (NimbleGene) technology, which includes probes for 8667 canonical resistance genes (7963 antibiotic resistance genes and 704 genes conferring resistance to metals or biocides), and 2517 relaxase genes (plasmid markers) and 78,600 genes homologous to the previous identified targets (47,806 for antibiotics and 30,794 for biocides or metals). Its performance was compared with metagenomic shotgun sequencing (MSS) for 17 fecal samples (9 humans, 8 swine). ResCap significantly improves MSS to detect "gene abundance" (from 2.0 to 83.2%) and "gene diversity" (26 versus 14.9 genes unequivocally detected per sample per million of reads; the number of reads unequivocally mapped increasing up to 300-fold by using ResCap), which were calculated using novel bioinformatic tools. ResCap also facilitated the analysis of novel genes potentially involved in the resistance to antibiotics, metals, biocides, or any combination thereof. ResCap, the first targeted sequence capture, specifically developed to analyze resistomes, greatly enhances the sensitivity and specificity of available metagenomic methods and offers the possibility to analyze genes

  19. NeSSM: a Next-generation Sequencing Simulator for Metagenomics.

    Ben Jia

    Full Text Available BACKGROUND: Metagenomics can reveal the vast majority of microbes that have been missed by traditional cultivation-based methods. Due to its extremely wide range of application areas, fast metagenome sequencing simulation systems with high fidelity are in great demand to facilitate the development and comparison of metagenomics analysis tools. RESULTS: We present here a customizable metagenome simulation system: NeSSM (Next-generation Sequencing Simulator for Metagenomics. Combining complete genomes currently available, a community composition table, and sequencing parameters, it can simulate metagenome sequencing better than existing systems. Sequencing error models based on the explicit distribution of errors at each base and sequencing coverage bias are incorporated in the simulation. In order to improve the fidelity of simulation, tools are provided by NeSSM to estimate the sequencing error models, sequencing coverage bias and the community composition directly from existing metagenome sequencing data. Currently, NeSSM supports single-end and pair-end sequencing for both 454 and Illumina platforms. In addition, a GPU (graphics processing units version of NeSSM is also developed to accelerate the simulation. By comparing the simulated sequencing data from NeSSM with experimental metagenome sequencing data, we have demonstrated that NeSSM performs better in many aspects than existing popular metagenome simulators, such as MetaSim, GemSIM and Grinder. The GPU version of NeSSM is more than one-order of magnitude faster than MetaSim. CONCLUSIONS: NeSSM is a fast simulation system for high-throughput metagenome sequencing. It can be helpful to develop tools and evaluate strategies for metagenomics analysis and it's freely available for academic users at http://cbb.sjtu.edu.cn/~ccwei/pub/software/NeSSM.php.

  20. A metagenomic snapshot of taxonomic and functional diversity in an alpine glacier cryoconite ecosystem

    Edwards, Arwyn; Pachebat, Justin A; Swain, Martin; Hegarty, Matt; Rassner, Sara M E; Hodson, Andrew J; Irvine-Fynn, Tristram D L; Sattler, Birgit

    2013-01-01

    Cryoconite is a microbe–mineral aggregate which darkens the ice surface of glaciers. Microbial process and marker gene PCR-dependent measurements reveal active and diverse cryoconite microbial communities on polar glaciers. Here, we provide the first report of a cryoconite metagenome and culture-independent study of alpine cryoconite microbial diversity. We assembled 1.2 Gbp of metagenomic DNA sequenced using an Illumina HiScanSQ from cryoconite holes across the ablation zone of Rotmoosferner in the Austrian Alps. The metagenome revealed a bacterially-dominated community, with Proteobacteria (62% of bacterial-assigned contigs) and Bacteroidetes (14%) considerably more abundant than Cyanobacteria (2.5%). Streptophyte DNA dominated the eukaryotic metagenome. Functional genes linked to N, Fe, S and P cycling illustrated an acquisitive trend and a nitrogen cycle based upon efficient ammonia recycling. A comparison of 32 metagenome datasets revealed a similarity in functional profiles between the cryoconite and metagenomes characterized from other cold microbe–mineral aggregates. Overall, the metagenomic snapshot reveals the cryoconite ecosystem of this alpine glacier as dependent on scavenging carbon and nutrients from allochthonous sources, in particular mosses transported by wind from ice-marginal habitats, consistent with net heterotrophy indicated by productivity measurements. A transition from singular snapshots of cryoconite metagenomes to comparative analyses is advocated. (letter)

  1. BioCreative Workshops for DOE Genome Sciences: Text Mining for Metagenomics

    Wu, Cathy H. [Univ. of Delaware, Newark, DE (United States). Center for Bioinformatics and Computational Biology; Hirschman, Lynette [The MITRE Corporation, Bedford, MA (United States)

    2016-10-29

    The objective of this project was to host BioCreative workshops to define and develop text mining tasks to meet the needs of the Genome Sciences community, focusing on metadata information extraction in metagenomics. Following the successful introduction of metagenomics at the BioCreative IV workshop, members of the metagenomics community and BioCreative communities continued discussion to identify candidate topics for a BioCreative metagenomics track for BioCreative V. Of particular interest was the capture of environmental and isolation source information from text. The outcome was to form a “community of interest” around work on the interactive EXTRACT system, which supported interactive tagging of environmental and species data. This experiment is included in the BioCreative V virtual issue of Database. In addition, there was broad participation by members of the metagenomics community in the panels held at BioCreative V, leading to valuable exchanges between the text mining developers and members of the metagenomics research community. These exchanges are reflected in a number of the overview and perspective pieces also being captured in the BioCreative V virtual issue. Overall, this conversation has exposed the metagenomics researchers to the possibilities of text mining, and educated the text mining developers to the specific needs of the metagenomics community.

  2. Beyond research: a primer for considerations on using viral metagenomics in the field and clinic

    Hall, Richard J; Draper, Jenny L; Nielsen, Fiona G G; Dutilh, Bas E

    2015-01-01

    Powered by recent advances in next-generation sequencing technologies, metagenomics has already unveiled vast microbial biodiversity in a range of environments, and is increasingly being applied in clinics for difficult-to-diagnose cases. It can be tempting to suggest that metagenomics could be used

  3. A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes

    Dutilh, Bas E; Cassman, Noriko; McNair, Katelyn; Sanchez, Savannah E; Silva, Genivaldo G Z; Boling, Lance; Barr, Jeremy J; Speth, Daan R; Seguritan, Victor; Aziz, Ramy K; Felts, Ben; Dinsdale, Elizabeth A; Mokili, John L; Edwards, Robert A

    2014-01-01

    Metagenomics, or sequencing of the genetic material from a complete microbial community, is a promising tool to discover novel microbes and viruses. Viral metagenomes typically contain many unknown sequences. Here we describe the discovery of a previously unidentified bacteriophage present in the

  4. Introduction to Metagenomics at DOE JGI: Program Overview and Program Informatics (Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Tringe, Susannah

    2011-10-12

    Susannah Tringe of the DOE Joint Genome Institute talks about the Program Overview and Program Informatics at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  5. Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes.

    Nielsen, H Bjørn; Almeida, Mathieu; Juncker, Agnieszka Sierakowska; Rasmussen, Simon; Li, Junhua; Sunagawa, Shinichi; Plichta, Damian R; Gautier, Laurent; Pedersen, Anders G; Le Chatelier, Emmanuelle; Pelletier, Eric; Bonde, Ida; Nielsen, Trine; Manichanh, Chaysavanh; Arumugam, Manimozhiyan; Batto, Jean-Michel; Quintanilha Dos Santos, Marcelo B; Blom, Nikolaj; Borruel, Natalia; Burgdorf, Kristoffer S; Boumezbeur, Fouad; Casellas, Francesc; Doré, Joël; Dworzynski, Piotr; Guarner, Francisco; Hansen, Torben; Hildebrand, Falk; Kaas, Rolf S; Kennedy, Sean; Kristiansen, Karsten; Kultima, Jens Roat; Léonard, Pierre; Levenez, Florence; Lund, Ole; Moumen, Bouziane; Le Paslier, Denis; Pons, Nicolas; Pedersen, Oluf; Prifti, Edi; Qin, Junjie; Raes, Jeroen; Sørensen, Søren; Tap, Julien; Tims, Sebastian; Ussery, David W; Yamada, Takuji; Renault, Pierre; Sicheritz-Ponten, Thomas; Bork, Peer; Wang, Jun; Brunak, Søren; Ehrlich, S Dusko

    2014-08-01

    Most current approaches for analyzing metagenomic data rely on comparisons to reference genomes, but the microbial diversity of many environments extends far beyond what is covered by reference databases. De novo segregation of complex metagenomic data into specific biological entities, such as particular bacterial strains or viruses, remains a largely unsolved problem. Here we present a method, based on binning co-abundant genes across a series of metagenomic samples, that enables comprehensive discovery of new microbial organisms, viruses and co-inherited genetic entities and aids assembly of microbial genomes without the need for reference sequences. We demonstrate the method on data from 396 human gut microbiome samples and identify 7,381 co-abundance gene groups (CAGs), including 741 metagenomic species (MGS). We use these to assemble 238 high-quality microbial genomes and identify affiliations between MGS and hundreds of viruses or genetic entities. Our method provides the means for comprehensive profiling of the diversity within complex metagenomic samples.

  6. Mining the metagenome of activated biomass of an industrial wastewater treatment plant by a novel method.

    Sharma, Nandita; Tanksale, Himgouri; Kapley, Atya; Purohit, Hemant J

    2012-12-01

    Metagenomic libraries herald the era of magnifying the microbial world, tapping into the vast metabolic potential of uncultivated microbes, and enhancing the rate of discovery of novel genes and pathways. In this paper, we describe a method that facilitates the extraction of metagenomic DNA from activated sludge of an industrial wastewater treatment plant and its use in mining the metagenome via library construction. The efficiency of this method was demonstrated by the large representation of the bacterial genome in the constructed metagenomic libraries and by the functional clones obtained. The BAC library represented 95.6 times the bacterial genome, while, the pUC library represented 41.7 times the bacterial genome. Twelve clones in the BAC library demonstrated lipolytic activity, while four clones demonstrated dioxygenase activity. Four clones in pUC library tested positive for cellulase activity. This method, using FTA cards, not only can be used for library construction, but can also store the metagenome at room temperature.

  7. MG-Digger: an automated pipeline to search for giant virus-related sequences in metagenomes

    Jonathan eVerneau

    2016-03-01

    Full Text Available The number of metagenomic studies conducted each year is growing dramatically. Storage and analysis of such big data is difficult and time-consuming. Interestingly, analysis shows that environmental and human metagenomes include a significant amount of non-annotated sequences, representing a ‘dark matter’. We established a bioinformatics pipeline that automatically detects metagenome reads matching query sequences from a given set and applied this tool to the detection of sequences matching large and giant DNA viral members of the proposed order Megavirales or virophages. A total of 1,045 environmental and human metagenomes (≈ 1 Terabase pairs were collected, processed and stored on our bioinformatics server. In addition, nucleotide and protein sequences from 93 Megavirales representatives, including 19 giant viruses of amoeba, and five virophages, were collected. The pipeline was generated by scripts written in Python language and entitled MG-Digger. Metagenomes previously found to contain megavirus-like sequences were tested as controls. MG-Digger was able to annotate hundreds of metagenome sequences as best matching those of giant viruses. These sequences were most often found to be similar to phycodnavirus or mimivirus sequences, but included reads related to recently available pandoraviruses, Pithovirus sibericum, and faustoviruses. Compared to other tools, MG-Digger combined stand-alone use on Linux or Windows operating systems through a user-friendly interface, implementation of ready-to-use customized metagenome databases and query sequence databases, adjustable parameters for BLAST searches, and creation of output files containing selected reads with best match identification. Compared to Metavir 2, a reference tool in viral metagenome analysis, MG-Digger detected 8% more true positive Megavirales-related reads in a control metagenome. The present work shows that massive, automated and recurrent analyses of metagenomes are

  8. Combining gene prediction methods to improve metagenomic gene annotation

    Rosen Gail L

    2011-01-01

    Full Text Available Abstract Background Traditional gene annotation methods rely on characteristics that may not be available in short reads generated from next generation technology, resulting in suboptimal performance for metagenomic (environmental samples. Therefore, in recent years, new programs have been developed that optimize performance on short reads. In this work, we benchmark three metagenomic gene prediction programs and combine their predictions to improve metagenomic read gene annotation. Results We not only analyze the programs' performance at different read-lengths like similar studies, but also separate different types of reads, including intra- and intergenic regions, for analysis. The main deficiencies are in the algorithms' ability to predict non-coding regions and gene edges, resulting in more false-positives and false-negatives than desired. In fact, the specificities of the algorithms are notably worse than the sensitivities. By combining the programs' predictions, we show significant improvement in specificity at minimal cost to sensitivity, resulting in 4% improvement in accuracy for 100 bp reads with ~1% improvement in accuracy for 200 bp reads and above. To correctly annotate the start and stop of the genes, we find that a consensus of all the predictors performs best for shorter read lengths while a unanimous agreement is better for longer read lengths, boosting annotation accuracy by 1-8%. We also demonstrate use of the classifier combinations on a real dataset. Conclusions To optimize the performance for both prediction and annotation accuracies, we conclude that the consensus of all methods (or a majority vote is the best for reads 400 bp and shorter, while using the intersection of GeneMark and Orphelia predictions is the best for reads 500 bp and longer. We demonstrate that most methods predict over 80% coding (including partially coding reads on a real human gut sample sequenced by Illumina technology.

  9. Assembling the Marine Metagenome, One Cell at a Time

    Woyke, Tanja; Xie, Gary; Copeland, Alex; Gonzalez, Jose M.; Han, Cliff; Kiss, Hajnalka; Saw, Jimmy H.; Senin, Pavel; Yang, Chi; Chatterji, Sourav; Cheng, Jan-Fang; Eisen, Jonathan A.; Sieracki, Michael E.; Stepanauskas, Ramunas

    2010-06-24

    The difficulty associated with the cultivation of most microorganisms and the complexity of natural microbial assemblages, such as marine plankton or human microbiome, hinder genome reconstruction of representative taxa using cultivation or metagenomic approaches. Here we used an alternative, single cell sequencing approach to obtain high-quality genome assemblies of two uncultured, numerically significant marine microorganisms. We employed fluorescence-activated cell sorting and multiple displacement amplification to obtain hundreds of micrograms of genomic DNA from individual, uncultured cells of two marine flavobacteria from the Gulf of Maine that were phylogenetically distant from existing cultured strains. Shotgun sequencing and genome finishing yielded 1.9 Mbp in 17 contigs and 1.5 Mbp in 21 contigs for the two flavobacteria, with estimated genome recoveries of about 91percent and 78percent, respectively. Only 0.24percent of the assembling sequences were contaminants and were removed from further analysis using rigorous quality control. In contrast to all cultured strains of marine flavobacteria, the two single cell genomes were excellent Global Ocean Sampling (GOS) metagenome fragment recruiters, demonstrating their numerical significance in the ocean. The geographic distribution of GOS recruits along the Northwest Atlantic coast coincided with ocean surface currents. Metabolic reconstruction indicated diverse potential energy sources, including biopolymer degradation, proteorhodopsin photometabolism, and hydrogen oxidation. Compared to cultured relatives, the two uncultured flavobacteria have small genome sizes, few non-coding nucleotides, and few paralogous genes, suggesting adaptations to narrow ecological niches. These features may have contributed to the abundance of the two taxa in specific regions of the ocean, and may have hindered their cultivation. We demonstrate the power of single cell DNA sequencing to generate reference genomes of uncultured

  10. Separating metagenomic short reads into genomes via clustering

    Tanaseichuk Olga

    2012-09-01

    Full Text Available Abstract Background The metagenomics approach allows the simultaneous sequencing of all genomes in an environmental sample. This results in high complexity datasets, where in addition to repeats and sequencing errors, the number of genomes and their abundance ratios are unknown. Recently developed next-generation sequencing (NGS technologies significantly improve the sequencing efficiency and cost. On the other hand, they result in shorter reads, which makes the separation of reads from different species harder. Among the existing computational tools for metagenomic analysis, there are similarity-based methods that use reference databases to align reads and composition-based methods that use composition patterns (i.e., frequencies of short words or l-mers to cluster reads. Similarity-based methods are unable to classify reads from unknown species without close references (which constitute the majority of reads. Since composition patterns are preserved only in significantly large fragments, composition-based tools cannot be used for very short reads, which becomes a significant limitation with the development of NGS. A recently proposed algorithm, AbundanceBin, introduced another method that bins reads based on predicted abundances of the genomes sequenced. However, it does not separate reads from genomes of similar abundance levels. Results In this work, we present a two-phase heuristic algorithm for separating short paired-end reads from different genomes in a metagenomic dataset. We use the observation that most of the l-mers belong to unique genomes when l is sufficiently large. The first phase of the algorithm results in clusters of l-mers each of which belongs to one genome. During the second phase, clusters are merged based on l-mer repeat information. These final clusters are used to assign reads. The algorithm could handle very short reads and sequencing errors. It is initially designed for genomes with similar abundance levels and then

  11. Fast and sensitive taxonomic classification for metagenomics with Kaiju

    Menzel, Peter; Ng, Kim Lee; Krogh, Anders

    2016-01-01

    heuristic. We show in a genome exclusion study that Kaiju can classify more reads with higher sensitivity and similar precision compared to fast k-mer based classifiers, especially in genera that are underrepresented in reference databases. We also demonstrate that Kaiju classifies more than twice as many...... reads in ten real metagenomes compared to programs based on genomic k-mers. Kaiju can process up to millions of reads per minute, and its memory footprint is below 5 GB of RAM, allowing the analysis on a standard PC. The program is available under the GPL3 license at: github.com/bioinformatics-centre/kaiju...

  12. deFUME: Dynamic exploration of functional metagenomic sequencing data

    van der Helm, Eric; Geertz-Hansen, Henrik Marcus; Genee, Hans Jasper

    2015-01-01

    is time consuming and constitutes a major bottleneck for experimental researchers in the field. Here we present the deFUME web server, an easy-to-use web-based interface for processing, annotation and visualization of functional metagenomics sequencing data, tailored to meet the requirements of non......-bioinformaticians. The web-server integrates multiple analysis steps into one single workflow: read assembly, open reading frame prediction, and annotation with BLAST, InterPro and GO classifiers. Analysis results are visualized in an online dynamic web-interface. The deFUME webserver provides a fast track from raw sequence...

  13. Comparative metagenomics of eight geographically remote terrestrial hot springs

    Menzel, Peter; Islin, Sóley Ruth; Rike, Anne Gunn

    2015-01-01

    Hot springs are natural habitats for thermophilic Archaea and Bacteria. In this paper, we present the metagenomic analysis of eight globally distributed terrestrial hot springs from China, Iceland, Italy, Russia, and the USA with a temperature range between 61 and 92 (∘)C and pH between 1.8 and 7....... A comparison of the biodiversity and community composition generally showed a decrease in biodiversity with increasing temperature and decreasing pH. Another important factor shaping microbial diversity of the studied sites was the abundance of organic substrates. Several species of the Crenarchaeal order...

  14. Binning sequences using very sparse labels within a metagenome

    Halgamuge Saman K

    2008-04-01

    Full Text Available Abstract Background In metagenomic studies, a process called binning is necessary to assign contigs that belong to multiple species to their respective phylogenetic groups. Most of the current methods of binning, such as BLAST, k-mer and PhyloPythia, involve assigning sequence fragments by comparing sequence similarity or sequence composition with already-sequenced genomes that are still far from comprehensive. We propose a semi-supervised seeding method for binning that does not depend on knowledge of completed genomes. Instead, it extracts the flanking sequences of highly conserved 16S rRNA from the metagenome and uses them as seeds (labels to assign other reads based on their compositional similarity. Results The proposed seeding method is implemented on an unsupervised Growing Self-Organising Map (GSOM, and called Seeded GSOM (S-GSOM. We compared it with four well-known semi-supervised learning methods in a preliminary test, separating random-length prokaryotic sequence fragments sampled from the NCBI genome database. We identified the flanking sequences of the highly conserved 16S rRNA as suitable seeds that could be used to group the sequence fragments according to their species. S-GSOM showed superior performance compared to the semi-supervised methods tested. Additionally, S-GSOM may also be used to visually identify some species that do not have seeds. The proposed method was then applied to simulated metagenomic datasets using two different confidence threshold settings and compared with PhyloPythia, k-mer and BLAST. At the reference taxonomic level Order, S-GSOM outperformed all k-mer and BLAST results and showed comparable results with PhyloPythia for each of the corresponding confidence settings, where S-GSOM performed better than PhyloPythia in the ≥ 10 reads datasets and comparable in the ≥ 8 kb benchmark tests. Conclusion In the task of binning using semi-supervised learning methods, results indicate S-GSOM to be the best of

  15. Metagenomics and development of the gut microbiota in infants

    Vallès, Y.; Gosalbes, M. J.; de Vries, Lisbeth Elvira

    2012-01-01

    Clin Microbiol Infect 2012; 18 (Suppl. 4): 21–26 The establishment of a balanced intestinal microbiota is essential for numerous aspects of human health, yet the microbial colonization of the gastrointestinal tract of infants is both complex and highly variable among individuals. In addition......, the gastrointestinal tract microbiota is often exposed to antibiotics, and may be an important reservoir of resistant strains and of transferable resistance genes from early infancy. We are investigating by means of diverse metagenomic approaches several areas of microbiota development in infants, including...

  16. Functional metagenomic profiling of intestinal microbiome in extreme ageing

    Rampelli, Simone; Candela, Marco; Turroni, Silvia; Biagi, Elena; Collino, Sebastiano; Franceschi, Claudio; O'Toole, Paul W; Brigidi, Patrizia

    2013-01-01

    Age-related alterations in human gut microbiota composition have been thoroughly described, but a detailed functional description of the intestinal bacterial coding capacity is still missing. In order to elucidate the contribution of the gut metagenome to the complex mosaic of human longevity, we applied shotgun sequencing to total fecal bacterial DNA in a selection of samples belonging to a well-characterized human ageing cohort. The age-related trajectory of the human gut microbiome was characterized by loss of genes for shortchain fatty acid production and an overall decrease in the saccharolytic potential, while proteolytic functions were more abundant than in the intestinal metagenome of younger adults. This altered functional profile was associated with a relevant enrichment in “pathobionts”, i.e. opportunistic pro-inflammatory bacteria generally present in the adult gut ecosystem in low numbers. Finally, as a signature for long life we identified 116 microbial genes that significantly correlated with ageing. Collectively, our data emphasize the relationship between intestinal bacteria and human metabolism, by detailing the modifications in the gut microbiota as a consequence of and/or promoter of the physiological changes occurring in the human host upon ageing. PMID:24334635

  17. Centrifuge: rapid and sensitive classification of metagenomic sequences.

    Kim, Daehwan; Song, Li; Breitwieser, Florian P; Salzberg, Steven L

    2016-12-01

    Centrifuge is a novel microbial classification engine that enables rapid, accurate, and sensitive labeling of reads and quantification of species on desktop computers. The system uses an indexing scheme based on the Burrows-Wheeler transform (BWT) and the Ferragina-Manzini (FM) index, optimized specifically for the metagenomic classification problem. Centrifuge requires a relatively small index (4.2 GB for 4078 bacterial and 200 archaeal genomes) and classifies sequences at very high speed, allowing it to process the millions of reads from a typical high-throughput DNA sequencing run within a few minutes. Together, these advances enable timely and accurate analysis of large metagenomics data sets on conventional desktop computers. Because of its space-optimized indexing schemes, Centrifuge also makes it possible to index the entire NCBI nonredundant nucleotide sequence database (a total of 109 billion bases) with an index size of 69 GB, in contrast to k-mer-based indexing schemes, which require far more extensive space. © 2016 Kim et al.; Published by Cold Spring Harbor Laboratory Press.

  18. Quantitative metagenomics reveals unique gut microbiome biomarkers in ankylosing spondylitis.

    Wen, Chengping; Zheng, Zhijun; Shao, Tiejuan; Liu, Lin; Xie, Zhijun; Le Chatelier, Emmanuelle; He, Zhixing; Zhong, Wendi; Fan, Yongsheng; Zhang, Linshuang; Li, Haichang; Wu, Chunyan; Hu, Changfeng; Xu, Qian; Zhou, Jia; Cai, Shunfeng; Wang, Dawei; Huang, Yun; Breban, Maxime; Qin, Nan; Ehrlich, Stanislav Dusko

    2017-07-27

    The assessment and characterization of the gut microbiome has become a focus of research in the area of human autoimmune diseases. Ankylosing spondylitis is an inflammatory autoimmune disease and evidence showed that ankylosing spondylitis may be a microbiome-driven disease. To investigate the relationship between the gut microbiome and ankylosing spondylitis, a quantitative metagenomics study based on deep shotgun sequencing was performed, using gut microbial DNA from 211 Chinese individuals. A total of 23,709 genes and 12 metagenomic species were shown to be differentially abundant between ankylosing spondylitis patients and healthy controls. Patients were characterized by a form of gut microbial dysbiosis that is more prominent than previously reported cases with inflammatory bowel disease. Specifically, the ankylosing spondylitis patients demonstrated increases in the abundance of Prevotella melaninogenica, Prevotella copri, and Prevotella sp. C561 and decreases in Bacteroides spp. It is noteworthy that the Bifidobacterium genus, which is commonly used in probiotics, accumulated in the ankylosing spondylitis patients. Diagnostic algorithms were established using a subset of these gut microbial biomarkers. Alterations of the gut microbiome are associated with development of ankylosing spondylitis. Our data suggest biomarkers identified in this study might participate in the pathogenesis or development process of ankylosing spondylitis, providing new leads for the development of new diagnostic tools and potential treatments.

  19. Microbial survival strategies in ancient permafrost: insights from metagenomics.

    Mackelprang, Rachel; Burkert, Alexander; Haw, Monica; Mahendrarajah, Tara; Conaway, Christopher H; Douglas, Thomas A; Waldrop, Mark P

    2017-10-01

    In permafrost (perennially frozen ground) microbes survive oligotrophic conditions, sub-zero temperatures, low water availability and high salinity over millennia. Viable life exists in permafrost tens of thousands of years old but we know little about the metabolic and physiological adaptations to the challenges presented by life in frozen ground over geologic time. In this study we asked whether increasing age and the associated stressors drive adaptive changes in community composition and function. We conducted deep metagenomic and 16 S rRNA gene sequencing across a Pleistocene permafrost chronosequence from 19 000 to 33 000 years before present (kyr). We found that age markedly affected community composition and reduced diversity. Reconstruction of paleovegetation from metagenomic sequence suggests vegetation differences in the paleo record are not responsible for shifts in community composition and function. Rather, we observed shifts consistent with long-term survival strategies in extreme cryogenic environments. These include increased reliance on scavenging detrital biomass, horizontal gene transfer, chemotaxis, dormancy, environmental sensing and stress response. Our results identify traits that may enable survival in ancient cryoenvironments with no influx of energy or new materials.

  20. Functional metagenomic profiling of intestinal microbiome in extreme ageing.

    Rampelli, Simone; Candela, Marco; Turroni, Silvia; Biagi, Elena; Collino, Sebastiano; Franceschi, Claudio; O'Toole, Paul W; Brigidi, Patrizia

    2013-12-01

    Age-related alterations in human gut microbiota composition have been thoroughly described, but a detailed functional description of the intestinal bacterial coding capacity is still missing. In order to elucidate the contribution of the gut metagenome to the complex mosaic of human longevity, we applied shotgun sequencing to total fecal bacterial DNA in a selection of samples belonging to a well-characterized human ageing cohort. The age-related trajectory of the human gut microbiome was characterized by loss of genes for shortchain fatty acid production and an overall decrease in the saccharolytic potential, while proteolytic functions were more abundant than in the intestinal metagenome of younger adults. This altered functional profile was associated with a relevant enrichment in "pathobionts", i.e. opportunistic pro-inflammatory bacteria generally present in the adult gut ecosystem in low numbers. Finally, as a signature for long life we identified 116 microbial genes that significantly correlated with ageing. Collectively, our data emphasize the relationship between intestinal bacteria and human metabolism, by detailing the modifications in the gut microbiota as a consequence of and/or promoter of the physiological changes occurring in the human host upon ageing.

  1. Genomic and metagenomic technologies to explore the antibiotic resistance mobilome.

    Martínez, José L; Coque, Teresa M; Lanza, Val F; de la Cruz, Fernando; Baquero, Fernando

    2017-01-01

    Antibiotic resistance is a relevant problem for human health that requires global approaches to establish a deep understanding of the processes of acquisition, stabilization, and spread of resistance among human bacterial pathogens. Since natural (nonclinical) ecosystems are reservoirs of resistance genes, a health-integrated study of the epidemiology of antibiotic resistance requires the exploration of such ecosystems with the aim of determining the role they may play in the selection, evolution, and spread of antibiotic resistance genes, involving the so-called resistance mobilome. High-throughput sequencing techniques allow an unprecedented opportunity to describe the genetic composition of a given microbiome without the need to subculture the organisms present inside. However, bioinformatic methods for analyzing this bulk of data, mainly with respect to binning each resistance gene with the organism hosting it, are still in their infancy. Here, we discuss how current genomic methodologies can serve to analyze the resistance mobilome and its linkage with different bacterial genomes and metagenomes. In addition, we describe the drawbacks of current methodologies for analyzing the resistance mobilome, mainly in cases of complex microbiotas, and discuss the possibility of implementing novel tools to improve our current metagenomic toolbox. © 2016 New York Academy of Sciences.

  2. Comparative metagenome of a stream impacted by the urbanization phenomenon

    Julliane Dutra Medeiros

    Full Text Available Abstract Rivers and streams are important reservoirs of freshwater for human consumption. These ecosystems are threatened by increasing urbanization, because raw sewage discharged into them alters their nutrient content and may affect the composition of their microbial community. In the present study, we investigate the taxonomic and functional profile of the microbial community in an urban lotic environment. Samples of running water were collected at two points in the São Pedro stream: an upstream preserved and non-urbanized area, and a polluted urbanized area with discharged sewage. The metagenomic DNA was sequenced by pyrosequencing. Differences were observed in the community composition at the two sites. The non-urbanized area was overrepresented by genera of ubiquitous microbes that act in the maintenance of environments. In contrast, the urbanized metagenome was rich in genera pathogenic to humans. The functional profile indicated that the microbes act on the metabolism of methane, nitrogen and sulfur, especially in the urbanized area. It was also found that virulence/defense (antibiotic resistance and metal resistance and stress response-related genes were disseminated in the urbanized environment. The structure of the microbial community was altered by uncontrolled anthropic interference, highlighting the selective pressure imposed by high loads of urban sewage discharged into freshwater environments.

  3. WebMGA: a customizable web server for fast metagenomic sequence analysis.

    Wu, Sitao; Zhu, Zhengwei; Fu, Liming; Niu, Beifang; Li, Weizhong

    2011-09-07

    The new field of metagenomics studies microorganism communities by culture-independent sequencing. With the advances in next-generation sequencing techniques, researchers are facing tremendous challenges in metagenomic data analysis due to huge quantity and high complexity of sequence data. Analyzing large datasets is extremely time-consuming; also metagenomic annotation involves a wide range of computational tools, which are difficult to be installed and maintained by common users. The tools provided by the few available web servers are also limited and have various constraints such as login requirement, long waiting time, inability to configure pipelines etc. We developed WebMGA, a customizable web server for fast metagenomic analysis. WebMGA includes over 20 commonly used tools such as ORF calling, sequence clustering, quality control of raw reads, removal of sequencing artifacts and contaminations, taxonomic analysis, functional annotation etc. WebMGA provides users with rapid metagenomic data analysis using fast and effective tools, which have been implemented to run in parallel on our local computer cluster. Users can access WebMGA through web browsers or programming scripts to perform individual analysis or to configure and run customized pipelines. WebMGA is freely available at http://weizhongli-lab.org/metagenomic-analysis. WebMGA offers to researchers many fast and unique tools and great flexibility for complex metagenomic data analysis.

  4. MALINA: a web service for visual analytics of human gut microbiota whole-genome metagenomic reads.

    Tyakht, Alexander V; Popenko, Anna S; Belenikin, Maxim S; Altukhov, Ilya A; Pavlenko, Alexander V; Kostryukova, Elena S; Selezneva, Oksana V; Larin, Andrei K; Karpova, Irina Y; Alexeev, Dmitry G

    2012-12-07

    MALINA is a web service for bioinformatic analysis of whole-genome metagenomic data obtained from human gut microbiota sequencing. As input data, it accepts metagenomic reads of various sequencing technologies, including long reads (such as Sanger and 454 sequencing) and next-generation (including SOLiD and Illumina). It is the first metagenomic web service that is capable of processing SOLiD color-space reads, to authors' knowledge. The web service allows phylogenetic and functional profiling of metagenomic samples using coverage depth resulting from the alignment of the reads to the catalogue of reference sequences which are built into the pipeline and contain prevalent microbial genomes and genes of human gut microbiota. The obtained metagenomic composition vectors are processed by the statistical analysis and visualization module containing methods for clustering, dimension reduction and group comparison. Additionally, the MALINA database includes vectors of bacterial and functional composition for human gut microbiota samples from a large number of existing studies allowing their comparative analysis together with user samples, namely datasets from Russian Metagenome project, MetaHIT and Human Microbiome Project (downloaded from http://hmpdacc.org). MALINA is made freely available on the web at http://malina.metagenome.ru. The website is implemented in JavaScript (using Ext JS), Microsoft .NET Framework, MS SQL, Python, with all major browsers supported.

  5. WebMGA: a customizable web server for fast metagenomic sequence analysis

    Niu Beifang

    2011-09-01

    Full Text Available Abstract Background The new field of metagenomics studies microorganism communities by culture-independent sequencing. With the advances in next-generation sequencing techniques, researchers are facing tremendous challenges in metagenomic data analysis due to huge quantity and high complexity of sequence data. Analyzing large datasets is extremely time-consuming; also metagenomic annotation involves a wide range of computational tools, which are difficult to be installed and maintained by common users. The tools provided by the few available web servers are also limited and have various constraints such as login requirement, long waiting time, inability to configure pipelines etc. Results We developed WebMGA, a customizable web server for fast metagenomic analysis. WebMGA includes over 20 commonly used tools such as ORF calling, sequence clustering, quality control of raw reads, removal of sequencing artifacts and contaminations, taxonomic analysis, functional annotation etc. WebMGA provides users with rapid metagenomic data analysis using fast and effective tools, which have been implemented to run in parallel on our local computer cluster. Users can access WebMGA through web browsers or programming scripts to perform individual analysis or to configure and run customized pipelines. WebMGA is freely available at http://weizhongli-lab.org/metagenomic-analysis. Conclusions WebMGA offers to researchers many fast and unique tools and great flexibility for complex metagenomic data analysis.

  6. Computational workflow for the fine-grained analysis of metagenomic samples.

    Pérez-Wohlfeil, Esteban; Arjona-Medina, Jose A; Torreno, Oscar; Ulzurrun, Eugenia; Trelles, Oswaldo

    2016-10-25

    The field of metagenomics, defined as the direct genetic analysis of uncultured samples of genomes contained within an environmental sample, is gaining increasing popularity. The aim of studies of metagenomics is to determine the species present in an environmental community and identify changes in the abundance of species under different conditions. Current metagenomic analysis software faces bottlenecks due to the high computational load required to analyze complex samples. A computational open-source workflow has been developed for the detailed analysis of metagenomes. This workflow provides new tools and datafile specifications that facilitate the identification of differences in abundance of reads assigned to taxa (mapping), enables the detection of reads of low-abundance bacteria (producing evidence of their presence), provides new concepts for filtering spurious matches, etc. Innovative visualization ideas for improved display of metagenomic diversity are also proposed to better understand how reads are mapped to taxa. Illustrative examples are provided based on the study of two collections of metagenomes from faecal microbial communities of adult female monozygotic and dizygotic twin pairs concordant for leanness or obesity and their mothers. The proposed workflow provides an open environment that offers the opportunity to perform the mapping process using different reference databases. Additionally, this workflow shows the specifications of the mapping process and datafile formats to facilitate the development of new plugins for further post-processing. This open and extensible platform has been designed with the aim of enabling in-depth analysis of metagenomic samples and better understanding of the underlying biological processes.

  7. Challenges and opportunities in understanding microbial communities with metagenome assembly (accompanied by IPython Notebook tutorial)

    Howe, Adina; Chain, Patrick S. G.

    2015-01-01

    Metagenomic investigations hold great promise for informing the genetics, physiology, and ecology of environmental microorganisms. Current challenges for metagenomic analysis are related to our ability to connect the dots between sequencing reads, their population of origin, and their encoding functions. Assembly-based methods reduce dataset size by extending overlapping reads into larger contiguous sequences (contigs), providing contextual information for genetic sequences that does not rely on existing references. These methods, however, tend to be computationally intensive and are again challenged by sequencing errors as well as by genomic repeats While numerous tools have been developed based on these methodological concepts, they present confounding choices and training requirements to metagenomic investigators. To help with accessibility to assembly tools, this review also includes an IPython Notebook metagenomic assembly tutorial. This tutorial has instructions for execution any operating system using Amazon Elastic Cloud Compute and guides users through downloading, assembly, and mapping reads to contigs of a mock microbiome metagenome. Despite its challenges, metagenomic analysis has already revealed novel insights into many environments on Earth. As software, training, and data continue to emerge, metagenomic data access and its discoveries will to grow. PMID:26217314

  8. Challenges and opportunities in understanding microbial communities with metagenome assembly (accompanied by IPython Notebook tutorial

    Adina eHowe

    2015-07-01

    Full Text Available Metagenomic investigations hold great promise for informing the genetics, physiology, and ecology of environmental microorganisms. Current challenges for metagenomic analysis are related to our ability to connect the dots between sequencing reads, their population of origin, and their encoding functions. Assembly-based methods reduce dataset size by extending overlapping reads into larger contiguous sequences (contigs, providing contextual information for genetic sequences that does not rely on existing references. These methods, however, tend to be computationally intensive and are again challenged by sequencing errors as well as by genomic repeats While numerous tools have been developed based on these methodological concepts, they present confounding choices and training requirements to metagenomic investigators. To help with accessibility to assembly tools, this review also includes an IPython Notebook metagenomic assembly tutorial. This tutorial has instructions for execution any operating system using Amazon Elastic Cloud Compute and guides users through downloading, assembly, and mapping reads to contigs of a mock microbiome metagenome. Despite its challenges, metagenomic analysis has already revealed novel insights into many environments on Earth. As software, training, and data continue to emerge, metagenomic data access and its discoveries will to grow.

  9. Gene prediction in metagenomic fragments: A large scale machine learning approach

    Morgenstern Burkhard

    2008-04-01

    Full Text Available Abstract Background Metagenomics is an approach to the characterization of microbial genomes via the direct isolation of genomic sequences from the environment without prior cultivation. The amount of metagenomic sequence data is growing fast while computational methods for metagenome analysis are still in their infancy. In contrast to genomic sequences of single species, which can usually be assembled and analyzed by many available methods, a large proportion of metagenome data remains as unassembled anonymous sequencing reads. One of the aims of all metagenomic sequencing projects is the identification of novel genes. Short length, for example, Sanger sequencing yields on average 700 bp fragments, and unknown phylogenetic origin of most fragments require approaches to gene prediction that are different from the currently available methods for genomes of single species. In particular, the large size of metagenomic samples requires fast and accurate methods with small numbers of false positive predictions. Results We introduce a novel gene prediction algorithm for metagenomic fragments based on a two-stage machine learning approach. In the first stage, we use linear discriminants for monocodon usage, dicodon usage and translation initiation sites to extract features from DNA sequences. In the second stage, an artificial neural network combines these features with open reading frame length and fragment GC-content to compute the probability that this open reading frame encodes a protein. This probability is used for the classification and scoring of gene candidates. With large scale training, our method provides fast single fragment predictions with good sensitivity and specificity on artificially fragmented genomic DNA. Additionally, this method is able to predict translation initiation sites accurately and distinguishes complete from incomplete genes with high reliability. Conclusion Large scale machine learning methods are well-suited for gene

  10. Metagenomes obtained by "deep sequencing" - what do they tell about the EBPR communities?

    Albertsen, Mads; Saunders, Aaron Marc; Nielsen, Kåre Lehmann

    2013-01-01

    Metagenomics enables studies of the genomic potential of complex microbial communities by sequencing bulk genomic DNA directly from the environment. Knowledge of the genetic potential of a community can be used to formulate and test ecological hypotheses about stability and performance...... demonstrate that metagenomics can be used as a powerful tool for system wide characterization of the EBPR community as well as for a deeper understanding of the function of specific community members. Furthermore, we discuss and illustrate some of the general pitfalls in metagenomics and stress the need...

  11. A novel genome signature based on inter-nucleotide distances profiles for visualization of metagenomic data

    Xie, Xian-Hua; Yu, Zu-Guo; Ma, Yuan-Lin; Han, Guo-Sheng; Anh, Vo

    2017-09-01

    There has been a growing interest in visualization of metagenomic data. The present study focuses on the visualization of metagenomic data using inter-nucleotide distances profile. We first convert the fragment sequences into inter-nucleotide distances profiles. Then we analyze these profiles by principal component analysis. Finally the principal components are used to obtain the 2-D scattered plot according to their source of species. We name our method as inter-nucleotide distances profiles (INP) method. Our method is evaluated on three benchmark data sets used in previous published papers. Our results demonstrate that the INP method is good, alternative and efficient for visualization of metagenomic data.

  12. A highly optimized grid deployment: the metagenomic analysis example.

    Aparicio, Gabriel; Blanquer, Ignacio; Hernández, Vicente

    2008-01-01

    Computational resources and computationally expensive processes are two topics that are not growing at the same ratio. The availability of large amounts of computing resources in Grid infrastructures does not mean that efficiency is not an important issue. It is necessary to analyze the whole process to improve partitioning and submission schemas, especially in the most critical experiments. This is the case of metagenomic analysis, and this text shows the work done in order to optimize a Grid deployment, which has led to a reduction of the response time and the failure rates. Metagenomic studies aim at processing samples of multiple specimens to extract the genes and proteins that belong to the different species. In many cases, the sequencing of the DNA of many microorganisms is hindered by the impossibility of growing significant samples of isolated specimens. Many bacteria cannot survive alone, and require the interaction with other organisms. In such cases, the information of the DNA available belongs to different kinds of organisms. One important stage in Metagenomic analysis consists on the extraction of fragments followed by the comparison and analysis of their function stage. By the comparison to existing chains, whose function is well known, fragments can be classified. This process is computationally intensive and requires of several iterations of alignment and phylogeny classification steps. Source samples reach several millions of sequences, which could reach up to thousands of nucleotides each. These sequences are compared to a selected part of the "Non-redundant" database which only implies the information from eukaryotic species. From this first analysis, a refining process is performed and alignment analysis is restarted from the results. This process implies several CPU years. The article describes and analyzes the difficulties to fragment, automate and check the above operations in current Grid production environments. This environment has been

  13. High throughtput comparisons and profiling of metagenomes for industrially relevant enzymes

    Alam, Intikhab

    2016-01-01

    .g. temperature, environmental chemistry, etc… These metagenomes can be profiled to unearth enzymes relevant to several industries based on specific enzyme properties such as ability to work on extreme conditions, such as extreme temperatures, salinity

  14. IDENTIFICATION OF AVIAN-SPECIFIC FECAL METAGENOMIC SEQUENCES USING GENOME FRAGMENT ENRICHMENTS

    Sequence analysis of microbial genomes has provided biologists the opportunity to compare genetic differences between closely related microorganisms. While random sequencing has also been used to study natural microbial communities, metagenomic comparisons via sequencing analysis...

  15. ELIXIR pilot action: Marine metagenomics – towards a domain specific set of sustainable services

    Robertsen, Espen Mikal; Denise, Hubert; Mitchell, Alex; Finn, Robert D.; Bongo, Lars Ailo; Willassen, Nils Peder

    2017-01-01

    Metagenomics, the study of genetic material recovered directly from environmental samples, has the potential to provide insight into the structure and function of heterogeneous microbial communities.  There has been an increased use of metagenomics to discover and understand the diverse biosynthetic capacities of marine microbes, thereby allowing them to be exploited for industrial, food, and health care products. This ELIXIR pilot action was motivated by the need to establish dedicated data resources and harmonized metagenomics pipelines for the marine domain, in order to enhance the exploration and exploitation of marine genetic resources. In this paper, we summarize some of the results from the ELIXIR pilot action “Marine metagenomics – towards user centric services”. PMID:28620454

  16. ELIXIR pilot action: Marine metagenomics - towards a domain specific set of sustainable services.

    Robertsen, Espen Mikal; Denise, Hubert; Mitchell, Alex; Finn, Robert D; Bongo, Lars Ailo; Willassen, Nils Peder

    2017-01-01

    Metagenomics, the study of genetic material recovered directly from environmental samples, has the potential to provide insight into the structure and function of heterogeneous microbial communities.  There has been an increased use of metagenomics to discover and understand the diverse biosynthetic capacities of marine microbes, thereby allowing them to be exploited for industrial, food, and health care products. This ELIXIR pilot action was motivated by the need to establish dedicated data resources and harmonized metagenomics pipelines for the marine domain, in order to enhance the exploration and exploitation of marine genetic resources. In this paper, we summarize some of the results from the ELIXIR pilot action "Marine metagenomics - towards user centric services".

  17. A deep gold mine metagenome as a source of novel esterases

    Jane

    2011-07-04

    Jul 4, 2011 ... small metagenome library from the deep mine biofilm provided two esterolytic clones, ...... tuberosum) tubers, and its occurrence as genotype effect: processing .... diversity in freshwater sediment of a shallow eutrophic lake by.

  18. Experimental Design and Bioinformatics Analysis for the Application of Metagenomics in Environmental Sciences and Biotechnology.

    Ju, Feng; Zhang, Tong

    2015-11-03

    Recent advances in DNA sequencing technologies have prompted the widespread application of metagenomics for the investigation of novel bioresources (e.g., industrial enzymes and bioactive molecules) and unknown biohazards (e.g., pathogens and antibiotic resistance genes) in natural and engineered microbial systems across multiple disciplines. This review discusses the rigorous experimental design and sample preparation in the context of applying metagenomics in environmental sciences and biotechnology. Moreover, this review summarizes the principles, methodologies, and state-of-the-art bioinformatics procedures, tools and database resources for metagenomics applications and discusses two popular strategies (analysis of unassembled reads versus assembled contigs/draft genomes) for quantitative or qualitative insights of microbial community structure and functions. Overall, this review aims to facilitate more extensive application of metagenomics in the investigation of uncultured microorganisms, novel enzymes, microbe-environment interactions, and biohazards in biotechnological applications where microbial communities are engineered for bioenergy production, wastewater treatment, and bioremediation.

  19. Metagenomic approaches to exploit the biotechnological potential of the microbial consortia of marine sponges.

    Kennedy, Jonathan; Marchesi, Julian R; Dobson, Alan D W

    2007-05-01

    Natural products isolated from sponges are an important source of new biologically active compounds. However, the development of these compounds into drugs has been held back by the difficulties in achieving a sustainable supply of these often-complex molecules for pre-clinical and clinical development. Increasing evidence implicates microbial symbionts as the source of many of these biologically active compounds, but the vast majority of the sponge microbial community remain uncultured. Metagenomics offers a biotechnological solution to this supply problem. Metagenomes of sponge microbial communities have been shown to contain genes and gene clusters typical for the biosynthesis of biologically active natural products. Heterologous expression approaches have also led to the isolation of secondary metabolism gene clusters from uncultured microbial symbionts of marine invertebrates and from soil metagenomic libraries. Combining a metagenomic approach with heterologous expression holds much promise for the sustainable exploitation of the chemical diversity present in the sponge microbial community.

  20. Use of simulated data sets to evaluate the fidelity of metagenomic processing methods

    Mavromatis, K [U.S. Department of Energy, Joint Genome Institute; Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Barry, Kerrie [U.S. Department of Energy, Joint Genome Institute; Shapiro, Harris [U.S. Department of Energy, Joint Genome Institute; Goltsman, Eugene [U.S. Department of Energy, Joint Genome Institute; McHardy, Alice C. [IBM T. J. Watson Research Center; Rigoutsos, Isidore [IBM T. J. Watson Research Center; Salamov, Asaf [U.S. Department of Energy, Joint Genome Institute; Korzeniewski, Frank [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Grigoriev, Igor [U.S. Department of Energy, Joint Genome Institute; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute

    2007-01-01

    Metagenomics is a rapidly emerging field of research for studying microbial communities. To evaluate methods presently used to process metagenomic sequences, we constructed three simulated data sets of varying complexity by combining sequencing reads randomly selected from 113 isolate genomes. These data sets were designed to model real metagenomes in terms of complexity and phylogenetic composition. We assembled sampled reads using three commonly used genome assemblers (Phrap, Arachne and JAZZ), and predicted genes using two popular gene-finding pipelines (fgenesb and CRITICA/GLIMMER). The phylogenetic origins of the assembled contigs were predicted using one sequence similarity-based ( blast hit distribution) and two sequence composition-based (PhyloPythia, oligonucleotide frequencies) binning methods. We explored the effects of the simulated community structure and method combinations on the fidelity of each processing step by comparison to the corresponding isolate genomes. The simulated data sets are available online to facilitate standardized benchmarking of tools for metagenomic analysis.

  1. A viral metagenomic approach on a nonmetagenomic experiment

    Bovo, Samuele; Mazzoni, Gianluca; Ribani, Anisa

    2017-01-01

    Shot-gun next generation sequencing (NGS) on whole DNA extracted from specimens collected from mammals often produces reads that are not mapped (i.e. unmapped reads) on the host reference genome and that are usually discarded as by-products of the experiments. In this study, we mined Ion Torrent...... reads obtained by sequencing DNA isolated from archived blood samples collected from 100 performance tested Italian Large White pigs. Two reduced representation libraries were prepared from two DNA pools constructed each from 50 equimolar DNA samples. Bioinformatic analyses were carried out to mine...... unmapped reads on the reference pig genome that were obtained from the two NGS datasets. In silico analyses included read mapping and sequence assembly approaches for a viral metagenomic analysis using the NCBI Viral Genome Resource. Our approach identified sequences matching several viruses...

  2. Symbiosis insights through metagenomic analysis of a microbialconsortium

    Woyke, Tanja; Teeling, Hanno; Ivanova, Natalia N.; Hunteman,Marcel; Richter, Michael; Gloeckner, Frank Oliver; Boffelli, Dario; Barry, Kerrie W.; Shapiro, Harris J.; Anderson, Iain J.; Szeto, Ernest; Kyrpides, Nikos C.; Mussmann, Marc; Amann, Rudolf; Bergin, Claudia; Ruehland, Caroline; Rubin, Edward M.; Dubilier, Nicole

    2006-09-01

    Symbioses between bacteria and eukaryotes are ubiquitous, yet our understanding of the interactions driving these associations is hampered by our inability to cultivate most host-associated microbes. Here, we used a metagenomic approach to describe four co-occurring symbionts from the marine oligochaete Olavius algarvensis, a worm lacking a mouth, gut, and nephridia. Shotgun sequencing and metabolic pathway reconstruction revealed that the symbionts are sulfur-oxidizing and sulfate-reducing bacteria, all of which are capable of carbon fixation, providing the host with multiple sources of nutrition. Molecular evidence for the uptake and recycling of worm waste products by the symbionts suggests how the worm could eliminate its excretory system, an adaptation unique among annelid worms. We propose a model which describes how the versatile metabolism within this symbiotic consortium provides the host with an optimal energy supply as it shuttles between the upper oxic and lower anoxic coastal sediments which it inhabits.

  3. Metagenomic approach for discovering new pathogens in infection disease outbreaks

    Emanuela Giombini

    2011-09-01

    Full Text Available Viruses represent the most abundant biological components on earth.They can be found in every environment, from deep layers of oceans to animal bodies.Although several viruses have been isolated and sequenced, in each environment there are millions of different types of viruses that have not been identified yet.The advent of nextgeneration sequencing technologies with their high throughput capabilities make possible to study in a single experiment all the community of microorganisms present in a particular sample “microbioma”.They made more feasible the application of the metagenomic approach, by which it is also possible to discover and identify new pathogens, that may pose a threat to public health.This paper summarizes the most recent applications of nextgeneration sequencing to discover new viral pathogens during the occurrence of infection disease outbreaks.

  4. MetaPhinder-Identifying Bacteriophage Sequences in Metagenomic Data Sets

    Jurtz, Vanessa Isabell; Villarroel, Julia; Lund, Ole

    2016-01-01

    genome structure of many bacteriophages. The method is demonstrated to outperform both BLAST methods based on single hits and methods based on k-mer comparisons. MetaPhinder is available as a web service at the Center for Genomic Epidemiology https://cge.cbs.dtu.dk/services/MetaPhinder/, while the source...... and understand them. Here we present MetaPhinder, a method to identify assembled genomic fragments (i.e. contigs) of phage origin in metage-nomic data sets. The method is based on a comparison to a database of whole genome bacteriophage sequences, integrating hits to multiple genomes to accomodate for the mosaic...... code can be downloaded from https://bitbucket.org/genomicepidemiology/metaphinder or https://github.com/vanessajurtz/MetaPhinder....

  5. Quantitative metagenomic analyses based on average genome size normalization

    Frank, Jeremy Alexander; Sørensen, Søren Johannes

    2011-01-01

    provide not just a census of the community members but direct information on metabolic capabilities and potential interactions among community members. Here we introduce a method for the quantitative characterization and comparison of microbial communities based on the normalization of metagenomic data...... marine sources using both conventional small-subunit (SSU) rRNA gene analyses and our quantitative method to calculate the proportion of genomes in each sample that are capable of a particular metabolic trait. With both environments, to determine what proportion of each community they make up and how......). These analyses demonstrate how genome proportionality compares to SSU rRNA gene relative abundance and how factors such as average genome size and SSU rRNA gene copy number affect sampling probability and therefore both types of community analysis....

  6. Metagenomic Analysis of Microbial Symbionts in a Gutless Worm

    Woyke, Tanja; Teeling, Hanno; Ivanova, Natalia N.; Hunteman, Marcel; Richter, Michael; Gloeckner, Frank Oliver; Boeffelli, Dario; Barry, Kerrie W.; Shapiro, Harris J.; Anderson, Iain J.; Szeto, Ernest; Kyrpides, Nikos C.; Mussmann, Marc; Amann, Rudolf; Bergin, Claudia; Ruehland, Caroline; Rubin, Edward M.; Dubilier, Nicole

    2006-05-01

    Symbioses between bacteria and eukaryotes are ubiquitous, yet our understanding of the interactions driving these associations is hampered by our inability to cultivate most host-associated microbes. Here we use a metagenomic approach to describe four co-occurring symbionts from the marine oligochaete Olavius algarvensis, a worm lacking a mouth, gut and nephridia. Shotgun sequencing and metabolic pathway reconstruction revealed that the symbionts are sulphur-oxidizing and sulphate-reducing bacteria, all of which are capable of carbon fixation, thus providing the host with multiple sources of nutrition. Molecular evidence for the uptake and recycling of worm waste products by the symbionts suggests how the worm could eliminate its excretory system, an adaptation unique among annelid worms. We propose a model that describes how the versatile metabolism within this symbiotic consortium provides the host with an optimal energy supply as it shuttles between the upper oxic and lower anoxic coastal sediments that it inhabits.

  7. Data Management in Metagenomics: A Risk Management Approach

    Filipe Ferreira

    2014-07-01

    Full Text Available In eScience, where vast data collections are processed in scientific workflows, new risks and challenges are emerging. Those challenges are changing the eScience paradigm, mainly regarding digital preservation and scientific workflows. To address specific concerns with data management in these scenarios, the concept of the Data Management Plan was established, serving as a tool for enabling digital preservation in eScience research projects. We claim risk management can be jointly used with a Data Management Plan, so new risks and challenges can be easily tackled. Therefore, we propose an analysis process for eScience projects using a Data Management Plan and ISO 31000 in order to create a Risk Management Plan that can complement the Data Management Plan. The motivation, requirements and validation of this proposal are explored in the MetaGen-FRAME project, focused in Metagenomics.

  8. Mining anaerobic digester consortia metagenomes for secreted carbohydrate active enzymes

    Wilkens, Casper; Busk, Peter Kamp; Pilgaard, Bo

    thermophilic and mesophilic ADs a wide variety of carbohydrate active enzyme functions were discovered in the metagenomic sequencing of the microbial consortia. The most dominating type of glycoside hydrolases were β-glucosidases (up to 27%), α-amylases (up to 10%), α-glucosidases (up to 8%), α......, and food wastes (Alvarado et al., 2014). The processes and the roles of the microorganisms that are involved in biomass conversion and methane production in ADs are still not fully understood. We are investigating thermophilic and mesophilic ADs that use wastewater surplus sludge for methane production...... was done with the Peptide Pattern Recognition (PPR) program (Busk and Lange, 2013), which is a novel non-alignment based approach that can predict function of e.g. CAZymes. PPR identifies a set of short conserved sequences, which can be used as a finger print when mining genomes for novel enzymes. In both...

  9. Assessment of metagenomic assembly using simulated next generation sequencing data

    Mende, Daniel R; Waller, Alison S; Sunagawa, Shinichi

    2012-01-01

    with platform-specific (Sanger, pyrosequencing, Illumina) base-error models, and simulated metagenomes of differing community complexities. We first evaluated the effect of rigorous quality control on Illumina data. Although quality filtering removed a large proportion of the data, it greatly improved...... the accuracy and contig lengths of resulting assemblies. We then compared the quality-trimmed Illumina assemblies to those from Sanger and pyrosequencing. For the simple community (10 genomes) all sequencing technologies assembled a similar amount and accurately represented the expected functional composition...... the Sanger reads still represented the overall functional composition reasonably well. We further examined the effect of scaffolding of contigs using paired-end Illumina reads. It dramatically increased contig lengths of the simple community and yielded minor improvements to the more complex communities...

  10. A retrospective metagenomics approach to studying Blastocystis

    Andersen, Lee O'Brien; Bonde, Ida; Nielsen, Henrik Bjørn

    2015-01-01

    a selection of 316 human faecal samples, hence representing genes originating from a single subtype. The 316 faecal samples were from 236 healthy individuals, 13 patients with Crohn's disease (CD) and 67 patients with ulcerative colitis (UC). The prevalence of Blastocystis was 20.3% in the healthy individuals......Blastocystis is a common single-celled intestinal parasitic genus, comprising several subtypes. Here, we screened data obtained by metagenomic analysis of faecal DNA for Blastocystis by searching for subtype-specific genes in coabundance gene groups, which are groups of genes that covary across...... and 14.9% in patients with UC. Meanwhile, Blastocystis was absent in patients with CD. Individuals with intestinal microbiota dominated by Bacteroides were much less prone to having Blastocystis-positive stool (Matthew's correlation coefficient = -0.25, P

  11. Metagenomic recovery of phage genomes of uncultured freshwater actinobacteria.

    Ghai, Rohit; Mehrshad, Maliheh; Mizuno, Carolina Megumi; Rodriguez-Valera, Francisco

    2017-01-01

    Low-GC Actinobacteria are among the most abundant and widespread microbes in freshwaters and have largely resisted all cultivation efforts. Consequently, their phages have remained totally unknown. In this work, we have used deep metagenomic sequencing to assemble eight complete genomes of the first tailed phages that infect freshwater Actinobacteria. Their genomes encode the actinobacterial-specific transcription factor whiB, frequently found in mycobacteriophages and also in phages infecting marine pelagic Actinobacteria. Its presence suggests a common and widespread strategy of modulation of host transcriptional machinery upon infection via this transcriptional switch. We present evidence that some whiB-carrying phages infect the acI lineage of Actinobacteria. At least one of them encodes the ADP-ribosylating component of the widespread bacterial AB toxins family (for example, clostridial toxin). We posit that the presence of this toxin reflects a 'trojan horse' strategy, providing protection at the population level to the abundant host microbes against eukaryotic predators.

  12. A Metagenomic Survey of Serpentinites and Nearby Soils in Taiwan

    Li, K. Y.; Hsu, Y. W.; Chen, Y. W.; Huang, T. Y.; Shih, Y. J.; Chen, J. S.; Hsu, B. M.

    2016-12-01

    The serpentinite of Taiwan is originated from the subduction zone of the Eurasian plate and the Philippine Sea plate. Many small bodies of serpentinite are scattered around the lands of the East Rift Valley, which are also one of the major agricultural areas in Taiwan. Since microbial communities play a role both on weathering process and soil recovery, uncovering the microbial compositions in serpentinites and surrounding soils may help people to understand the roles of microorganisms on serpentinites during the nature weathering process. In this study, microorganisms growing on the surface of serpentinites, in the surrounding soil, and agriculture soils that are miles of horizontal distance away from serpentinite were collected. Next generation sequencing (NGS) was carried out to examine the metagenomics of uncultured microbial community in these samples. The metagenomics were further clustered into operational taxonomic units (OTUs) to analyze relative abundance, heatmap of OTUs, and principal coordinates analysis (PCoA). Our data revealed the different types of geographic material had their own distinct structures of microbial community. In serpentinites, the heatmaps based on the phylogenetic pattern showed that the OTUs distributions were similar in phyla of Bacteroidetes, Cyanobacteria, Proteobacteria, Verrucomicrobia, and WPS-1/WPS-2. On the other hand, the heatmaps of phylogenetic pattern of agriculture soils showed that the OTUs distributions in phyla of Chloroflexi, Acidobacteria, Actinobacteria, WPS-1/WPS-2, and Proteobacteria were similar. In soil nearby the serpentinite, some clusters of OTUs in phyla of Bacteroidetes, Cyanobacteria, and WPS-1/WPS-2 have disappeared. Our data provided evidence regarding kinetic evolutions of microbial communities in different geographic materials.

  13. Metagenomic exploration of viruses throughout the Indian Ocean.

    Shannon J Williamson

    Full Text Available The characterization of global marine microbial taxonomic and functional diversity is a primary goal of the Global Ocean Sampling Expedition. As part of this study, 19 water samples were collected aboard the Sorcerer II sailing vessel from the southern Indian Ocean in an effort to more thoroughly understand the lifestyle strategies of the microbial inhabitants of this ultra-oligotrophic region. No investigations of whole virioplankton assemblages have been conducted on waters collected from the Indian Ocean or across multiple size fractions thus far. Therefore, the goals of this study were to examine the effect of size fractionation on viral consortia structure and function and understand the diversity and functional potential of the Indian Ocean virome. Five samples were selected for comprehensive metagenomic exploration; and sequencing was performed on the microbes captured on 3.0-, 0.8- and 0.1 µm membrane filters as well as the viral fraction (<0.1 µm. Phylogenetic approaches were also used to identify predicted proteins of viral origin in the larger fractions of data from all Indian Ocean samples, which were included in subsequent metagenomic analyses. Taxonomic profiling of viral sequences suggested that size fractionation of marine microbial communities enriches for specific groups of viruses within the different size classes and functional characterization further substantiated this observation. Functional analyses also revealed a relative enrichment for metabolic proteins of viral origin that potentially reflect the physiological condition of host cells in the Indian Ocean including those involved in nitrogen metabolism and oxidative phosphorylation. A novel classification method, MGTAXA, was used to assess virus-host relationships in the Indian Ocean by predicting the taxonomy of putative host genera, with Prochlorococcus, Acanthochlois and members of the SAR86 cluster comprising the most abundant predictions. This is the first study

  14. Metagenomics for the discovery of novel biosurfactants of environmental interest from marine ecosystems.

    Jackson, Stephen A; Borchert, Erik; O'Gara, Fergal; Dobson, Alan D W

    2015-06-01

    Research focused on the search for new biosurfactants aims to replace chemical surfactants, which while being cost-effective are ecologically undesirable. Metagenomics can lead to discovery of novel biosurfactants, tackling issues of low production yields. Recent successes include the heterologous production of biosurfactants. The dearth of biosurfactants discovered to date through metagenomics is puzzling given that good screening systems and heterologous host systems are available. Copyright © 2015 Elsevier Ltd. All rights reserved.

  15. Beyond research: a primer for considerations on using viral metagenomics in the field and clinic

    Hall, Richard J.; Draper, Jenny L.; Nielsen, Fiona G. G.; Dutilh, Bas E.

    2015-01-01

    Powered by recent advances in next-generation sequencing technologies, metagenomics has already unveiled vast microbial biodiversity in a range of environments, and is increasingly being applied in clinics for difficult-to-diagnose cases. It can be tempting to suggest that metagenomics could be used as a “universal test” for all pathogens without the need to conduct lengthy serial testing using specific assays. While this is an exciting prospect, there are issues that need to be addressed bef...

  16. Quantitative Field Testing Rotylenchulus reniformis DNA from Metagenomic Samples Isolated Directly from Soil

    Showmaker, Kurt; Lawrence, Gary W.; Lu, Shien; Balbalian, Clarissa; Klink, Vincent P.

    2011-01-01

    A quantitative PCR procedure targeting the β-tubulin gene determined the number of Rotylenchulus reniformis Linford & Oliveira 1940 in metagenomic DNA samples isolated from soil. Of note, this outcome was in the presence of other soil-dwelling plant parasitic nematodes including its sister genus Helicotylenchus Steiner, 1945. The methodology provides a framework for molecular diagnostics of nematodes from metagenomic DNA isolated directly from soil. PMID:22194958

  17. A Novel Prosthetic Joint Infection Pathogen, Mycoplasma salivarium, Identified by Metagenomic Shotgun Sequencing.

    Thoendel, Matthew; Jeraldo, Patricio; Greenwood-Quaintance, Kerryl E; Chia, Nicholas; Abdel, Matthew P; Steckelberg, James M; Osmon, Douglas R; Patel, Robin

    2017-07-15

    Defining the microbial etiology of culture-negative prosthetic joint infection (PJI) can be challenging. Metagenomic shotgun sequencing is a new tool to identify organisms undetected by conventional methods. We present a case where metagenomics was used to identify Mycoplasma salivarium as a novel PJI pathogen in a patient with hypogammaglobulinemia. © The Author 2017. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail: journals.permissions@oup.com.

  18. Genetic variability of psychrotolerant Acidithiobacillus ferrivorans revealed by (meta)genomic analysis.

    González, Carolina; Yanquepe, María; Cardenas, Juan Pablo; Valdes, Jorge; Quatrini, Raquel; Holmes, David S; Dopson, Mark

    2014-11-01

    Acidophilic microorganisms inhabit low pH environments such as acid mine drainage that is generated when sulfide minerals are exposed to air. The genome sequence of the psychrotolerant Acidithiobacillus ferrivorans SS3 was compared to a metagenome from a low temperature acidic stream dominated by an A. ferrivorans-like strain. Stretches of genomic DNA characterized by few matches to the metagenome, termed 'metagenomic islands', encoded genes associated with metal efflux and pH homeostasis. The metagenomic islands were enriched in mobile elements such as phage proteins, transposases, integrases and in one case, predicted to be flanked by truncated tRNAs. Cus gene clusters predicted to be involved in copper efflux and further Cus-like RND systems were predicted to be located in metagenomic islands and therefore, constitute part of the flexible gene complement of the species. Phylogenetic analysis of Cus clusters showed both lineage specificity within the Acidithiobacillus genus as well as niche specificity associated with an acidic environment. The metagenomic islands also contained a predicted copper efflux P-type ATPase system and a polyphosphate kinase potentially involved in polyphosphate mediated copper resistance. This study identifies genetic variability of low temperature acidophiles that likely reflects metal resistance selective pressures in the copper rich environment. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.

  19. Diversity Indices as Measures of Functional Annotation Methods in Metagenomics Studies

    Jankovic, Boris R.

    2016-01-26

    Applications of high-throughput techniques in metagenomics studies produce massive amounts of data. Fragments of genomic, transcriptomic and proteomic molecules are all found in metagenomics samples. Laborious and meticulous effort in sequencing and functional annotation are then required to, amongst other objectives, reconstruct a taxonomic map of the environment that metagenomics samples were taken from. In addition to computational challenges faced by metagenomics studies, the analysis is further complicated by the presence of contaminants in the samples, potentially resulting in skewed taxonomic analysis. The functional annotation in metagenomics can utilize all available omics data and therefore different methods that are associated with a particular type of data. For example, protein-coding DNA, non-coding RNA or ribosomal RNA data can be used in such an analysis. These methods would have their advantages and disadvantages and the question of comparison among them naturally arises. There are several criteria that can be used when performing such a comparison. Loosely speaking, methods can be evaluated in terms of computational complexity or in terms of the expected biological accuracy. We propose that the concept of diversity that is used in the ecosystems and species diversity studies can be successfully used in evaluating certain aspects of the methods employed in metagenomics studies. We show that when applying the concept of Hill’s diversity, the analysis of variations in the diversity order provides valuable clues into the robustness of methods used in the taxonomical analysis.

  20. New Hydrocarbon Degradation Pathways in the Microbial Metagenome from Brazilian Petroleum Reservoirs

    Sierra-García, Isabel Natalia; Correa Alvarez, Javier; Pantaroto de Vasconcellos, Suzan; Pereira de Souza, Anete; dos Santos Neto, Eugenio Vaz; de Oliveira, Valéria Maia

    2014-01-01

    Current knowledge of the microbial diversity and metabolic pathways involved in hydrocarbon degradation in petroleum reservoirs is still limited, mostly due to the difficulty in recovering the complex community from such an extreme environment. Metagenomics is a valuable tool to investigate the genetic and functional diversity of previously uncultured microorganisms in natural environments. Using a function-driven metagenomic approach, we investigated the metabolic abilities of microbial communities in oil reservoirs. Here, we describe novel functional metabolic pathways involved in the biodegradation of aromatic compounds in a metagenomic library obtained from an oil reservoir. Although many of the deduced proteins shared homology with known enzymes of different well-described aerobic and anaerobic catabolic pathways, the metagenomic fragments did not contain the complete clusters known to be involved in hydrocarbon degradation. Instead, the metagenomic fragments comprised genes belonging to different pathways, showing novel gene arrangements. These results reinforce the potential of the metagenomic approach for the identification and elucidation of new genes and pathways in poorly studied environments and contribute to a broader perspective on the hydrocarbon degradation processes in petroleum reservoirs. PMID:24587220

  1. Metatranscriptome analysis of fungal strains Penicillium camemberti and Geotrichum candidum reveal cheese matrix breakdown and potential development of sensory properties of ripened Camembert-type cheese.

    Lessard, Marie-Hélène; Viel, Catherine; Boyle, Brian; St-Gelais, Daniel; Labrie, Steve

    2014-03-26

    Camembert-type cheese ripening is driven mainly by fungal microflora including Geotrichum candidum and Penicillium camemberti. These species are major contributors to the texture and flavour of typical bloomy rind cheeses. Biochemical studies showed that G. candidum reduces bitterness, enhances sulphur flavors through amino acid catabolism and has an impact on rind texture, firmness and thickness, while P. camemberti is responsible for the white and bloomy aspect of the rind, and produces enzymes involved in proteolysis and lipolysis activities. However, very little is known about the genetic determinants that code for these activities and their expression profile over time during the ripening process. The metatranscriptome of an industrial Canadian Camembert-type cheese was studied at seven different sampling days over 77 days of ripening. A database called CamemBank01 was generated, containing a total of 1,060,019 sequence tags (reads) assembled in 7916 contigs. Sequence analysis revealed that 57% of the contigs could be affiliated to molds, 16% originated from yeasts, and 27% could not be identified. According to the functional annotation performed, the predominant processes during Camembert ripening include gene expression, energy-, carbohydrate-, organic acid-, lipid- and protein- metabolic processes, cell growth, and response to different stresses. Relative expression data showed that these functions occurred mostly in the first two weeks of the ripening period. These data provide further advances in our knowledge about the biological activities of the dominant ripening microflora of Camembert cheese and will help select biological markers to improve cheese quality assessment.

  2. Feasibility of Metatranscriptome Analysis from Infant Gut Microbiota: Adaptation to Solid Foods Results in Increased Activity of Firmicutes at Six Months

    Floor Hugenholtz

    2017-01-01

    Full Text Available Newborns are rapidly colonized by microbes and their intestinal tracts contain highly dynamic and rapidly developing microbial communities in the first months of life. In this study, we describe the feasibility of isolating mRNA from rapidly processed faecal samples and applying deep RNA-Seq analysis to provide insight into the active contributors of the microbial community in early life. Specific attention is given to the impact of removing rRNA from the mRNA on the phylogenetic and transcriptional profiling and its analysis depth. A breastfed baby was followed in the first six months of life during adaptation to solid food, dairy products, and formula. It was found that, in the weaning period, the total transcriptional activity of Actinobacteria, mainly represented by Bifidobacterium, decreased while that of Firmicutes increased over time. Moreover, Firmicutes and Actinobacteria, including the canonical Bifidobacteria as well as Collinsella, were found to be important contributors to carbohydrate fermentation and vitamin biosynthesis in the infant intestine. Finally, the expression of Lactobacillus rhamnosus-like genes was detected, likely following transfer from the mother who consumed L. rhamnosus GG. The study indicates that metatranscriptome analysis of the infant gut microbiota is feasible on infant stool samples and can be used to provide insight into the core activities of the developing community.

  3. Chronic Meningitis Investigated via Metagenomic Next-Generation Sequencing

    O’Donovan, Brian D.; Gelfand, Jeffrey M.; Sample, Hannah A.; Chow, Felicia C.; Betjemann, John P.; Shah, Maulik P.; Richie, Megan B.; Gorman, Mark P.; Hajj-Ali, Rula A.; Calabrese, Leonard H.; Zorn, Kelsey C.; Chow, Eric D.; Greenlee, John E.; Blum, Jonathan H.; Green, Gary; Khan, Lillian M.; Banerji, Debarko; Langelier, Charles; Bryson-Cahn, Chloe; Harrington, Whitney; Lingappa, Jairam R.; Shanbhag, Niraj M.; Green, Ari J.; Brew, Bruce J.; Soldatos, Ariane; Strnad, Luke; Doernberg, Sarah B.; Jay, Cheryl A.; Douglas, Vanja; Josephson, S. Andrew; DeRisi, Joseph L.

    2018-01-01

    Importance Identifying infectious causes of subacute or chronic meningitis can be challenging. Enhanced, unbiased diagnostic approaches are needed. Objective To present a case series of patients with diagnostically challenging subacute or chronic meningitis using metagenomic next-generation sequencing (mNGS) of cerebrospinal fluid (CSF) supported by a statistical framework generated from mNGS of control samples from the environment and from patients who were noninfectious. Design, Setting, and Participants In this case series, mNGS data obtained from the CSF of 94 patients with noninfectious neuroinflammatory disorders and from 24 water and reagent control samples were used to develop and implement a weighted scoring metric based on z scores at the species and genus levels for both nucleotide and protein alignments to prioritize and rank the mNGS results. Total RNA was extracted for mNGS from the CSF of 7 participants with subacute or chronic meningitis who were recruited between September 2013 and March 2017 as part of a multicenter study of mNGS pathogen discovery among patients with suspected neuroinflammatory conditions. The neurologic infections identified by mNGS in these 7 participants represented a diverse array of pathogens. The patients were referred from the University of California, San Francisco Medical Center (n = 2), Zuckerberg San Francisco General Hospital and Trauma Center (n = 2), Cleveland Clinic (n = 1), University of Washington (n = 1), and Kaiser Permanente (n = 1). A weighted z score was used to filter out environmental contaminants and facilitate efficient data triage and analysis. Main Outcomes and Measures Pathogens identified by mNGS and the ability of a statistical model to prioritize, rank, and simplify mNGS results. Results The 7 participants ranged in age from 10 to 55 years, and 3 (43%) were female. A parasitic worm (Taenia solium, in 2 participants), a virus (HIV-1), and 4 fungi (Cryptococcus neoformans

  4. Metagenomic analysis of viral diversity in respiratory samples from patients with respiratory tract infections in Kuwait.

    Madi, Nada; Al-Nakib, Widad; Mustafa, Abu Salim; Habibi, Nazima

    2018-03-01

    A metagenomic approach based on target independent next-generation sequencing has become a known method for the detection of both known and novel viruses in clinical samples. This study aimed to use the metagenomic sequencing approach to characterize the viral diversity in respiratory samples from patients with respiratory tract infections. We have investigated 86 respiratory samples received from various hospitals in Kuwait between 2015 and 2016 for the diagnosis of respiratory tract infections. A metagenomic approach using the next-generation sequencer to characterize viruses was used. According to the metagenomic analysis, an average of 145, 019 reads were identified, and 2% of these reads were of viral origin. Also, metagenomic analysis of the viral sequences revealed many known respiratory viruses, which were detected in 30.2% of the clinical samples. Also, sequences of non-respiratory viruses were detected in 14% of the clinical samples, while sequences of non-human viruses were detected in 55.8% of the clinical samples. The average genome coverage of the viruses was 12% with the highest genome coverage of 99.2% for respiratory syncytial virus, and the lowest was 1% for torque teno midi virus 2. Our results showed 47.7% agreement between multiplex Real-Time PCR and metagenomics sequencing in the detection of respiratory viruses in the clinical samples. Though there are some difficulties in using this method to clinical samples such as specimen quality, these observations are indicative of the promising utility of the metagenomic sequencing approach for the identification of respiratory viruses in patients with respiratory tract infections. © 2017 Wiley Periodicals, Inc.

  5. Comparative fecal metagenomics unveils unique functional capacity of the swine gut

    Martinson John

    2011-05-01

    Full Text Available Abstract Background Uncovering the taxonomic composition and functional capacity within the swine gut microbial consortia is of great importance to animal physiology and health as well as to food and water safety due to the presence of human pathogens in pig feces. Nonetheless, limited information on the functional diversity of the swine gut microbiome is available. Results Analysis of 637, 722 pyrosequencing reads (130 megabases generated from Yorkshire pig fecal DNA extracts was performed to help better understand the microbial diversity and largely unknown functional capacity of the swine gut microbiome. Swine fecal metagenomic sequences were annotated using both MG-RAST and JGI IMG/M-ER pipelines. Taxonomic analysis of metagenomic reads indicated that swine fecal microbiomes were dominated by Firmicutes and Bacteroidetes phyla. At a finer phylogenetic resolution, Prevotella spp. dominated the swine fecal metagenome, while some genes associated with Treponema and Anareovibrio species were found to be exclusively within the pig fecal metagenomic sequences analyzed. Functional analysis revealed that carbohydrate metabolism was the most abundant SEED subsystem, representing 13% of the swine metagenome. Genes associated with stress, virulence, cell wall and cell capsule were also abundant. Virulence factors associated with antibiotic resistance genes with highest sequence homology to genes in Bacteroidetes, Clostridia, and Methanosarcina were numerous within the gene families unique to the swine fecal metagenomes. Other abundant proteins unique to the distal swine gut shared high sequence homology to putative carbohydrate membrane transporters. Conclusions The results from this metagenomic survey demonstrated the presence of genes associated with resistance to antibiotics and carbohydrate metabolism suggesting that the swine gut microbiome may be shaped by husbandry practices.

  6. MP3: a software tool for the prediction of pathogenic proteins in genomic and metagenomic data.

    Gupta, Ankit; Kapil, Rohan; Dhakan, Darshan B; Sharma, Vineet K

    2014-01-01

    The identification of virulent proteins in any de-novo sequenced genome is useful in estimating its pathogenic ability and understanding the mechanism of pathogenesis. Similarly, the identification of such proteins could be valuable in comparing the metagenome of healthy and diseased individuals and estimating the proportion of pathogenic species. However, the common challenge in both the above tasks is the identification of virulent proteins since a significant proportion of genomic and metagenomic proteins are novel and yet unannotated. The currently available tools which carry out the identification of virulent proteins provide limited accuracy and cannot be used on large datasets. Therefore, we have developed an MP3 standalone tool and web server for the prediction of pathogenic proteins in both genomic and metagenomic datasets. MP3 is developed using an integrated Support Vector Machine (SVM) and Hidden Markov Model (HMM) approach to carry out highly fast, sensitive and accurate prediction of pathogenic proteins. It displayed Sensitivity, Specificity, MCC and accuracy values of 92%, 100%, 0.92 and 96%, respectively, on blind dataset constructed using complete proteins. On the two metagenomic blind datasets (Blind A: 51-100 amino acids and Blind B: 30-50 amino acids), it displayed Sensitivity, Specificity, MCC and accuracy values of 82.39%, 97.86%, 0.80 and 89.32% for Blind A and 71.60%, 94.48%, 0.67 and 81.86% for Blind B, respectively. In addition, the performance of MP3 was validated on selected bacterial genomic and real metagenomic datasets. To our knowledge, MP3 is the only program that specializes in fast and accurate identification of partial pathogenic proteins predicted from short (100-150 bp) metagenomic reads and also performs exceptionally well on complete protein sequences. MP3 is publicly available at http://metagenomics.iiserb.ac.in/mp3/index.php.

  7. Strain-Level Metagenomic Analysis of the Fermented Dairy Beverage Nunu Highlights Potential Food Safety Risks.

    Walsh, Aaron M; Crispie, Fiona; Daari, Kareem; O'Sullivan, Orla; Martin, Jennifer C; Arthur, Cornelius T; Claesson, Marcus J; Scott, Karen P; Cotter, Paul D

    2017-08-15

    The rapid detection of pathogenic strains in food products is essential for the prevention of disease outbreaks. It has already been demonstrated that whole-metagenome shotgun sequencing can be used to detect pathogens in food but, until recently, strain-level detection of pathogens has relied on whole-metagenome assembly, which is a computationally demanding process. Here we demonstrated that three short-read-alignment-based methods, i.e., MetaMLST, PanPhlAn, and StrainPhlAn, could accurately and rapidly identify pathogenic strains in spinach metagenomes that had been intentionally spiked with Shiga toxin-producing Escherichia coli in a previous study. Subsequently, we employed the methods, in combination with other metagenomics approaches, to assess the safety of nunu, a traditional Ghanaian fermented milk product that is produced by the spontaneous fermentation of raw cow milk. We showed that nunu samples were frequently contaminated with bacteria associated with the bovine gut and, worryingly, we detected putatively pathogenic E. coli and Klebsiella pneumoniae strains in a subset of nunu samples. Ultimately, our work establishes that short-read-alignment-based bioinformatics approaches are suitable food safety tools, and we describe a real-life example of their utilization. IMPORTANCE Foodborne pathogens are responsible for millions of illnesses each year. Here we demonstrate that short-read-alignment-based bioinformatics tools can accurately and rapidly detect pathogenic strains in food products by using shotgun metagenomics data. The methods used here are considerably faster than both traditional culturing methods and alternative bioinformatics approaches that rely on metagenome assembly; therefore, they can potentially be used for more high-throughput food safety testing. Overall, our results suggest that whole-metagenome sequencing can be used as a practical food safety tool to prevent diseases or to link outbreaks to specific food products. Copyright

  8. Computational workflow for the fine-grained analysis of metagenomic samples

    Esteban Pérez-Wohlfeil

    2016-10-01

    Full Text Available Abstract Background The field of metagenomics, defined as the direct genetic analysis of uncultured samples of genomes contained within an environmental sample, is gaining increasing popularity. The aim of studies of metagenomics is to determine the species present in an environmental community and identify changes in the abundance of species under different conditions. Current metagenomic analysis software faces bottlenecks due to the high computational load required to analyze complex samples. Results A computational open-source workflow has been developed for the detailed analysis of metagenomes. This workflow provides new tools and datafile specifications that facilitate the identification of differences in abundance of reads assigned to taxa (mapping, enables the detection of reads of low-abundance bacteria (producing evidence of their presence, provides new concepts for filtering spurious matches, etc. Innovative visualization ideas for improved display of metagenomic diversity are also proposed to better understand how reads are mapped to taxa. Illustrative examples are provided based on the study of two collections of metagenomes from faecal microbial communities of adult female monozygotic and dizygotic twin pairs concordant for leanness or obesity and their mothers. Conclusions The proposed workflow provides an open environment that offers the opportunity to perform the mapping process using different reference databases. Additionally, this workflow shows the specifications of the mapping process and datafile formats to facilitate the development of new plugins for further post-processing. This open and extensible platform has been designed with the aim of enabling in-depth analysis of metagenomic samples and better understanding of the underlying biological processes.

  9. VSEARCH: a versatile open source tool for metagenomics.

    Rognes, Torbjørn; Flouri, Tomáš; Nichols, Ben; Quince, Christopher; Mahé, Frédéric

    2016-01-01

    VSEARCH is an open source and free of charge multithreaded 64-bit tool for processing and preparing metagenomics, genomics and population genomics nucleotide sequence data. It is designed as an alternative to the widely used USEARCH tool (Edgar, 2010) for which the source code is not publicly available, algorithm details are only rudimentarily described, and only a memory-confined 32-bit version is freely available for academic use. When searching nucleotide sequences, VSEARCH uses a fast heuristic based on words shared by the query and target sequences in order to quickly identify similar sequences, a similar strategy is probably used in USEARCH. VSEARCH then performs optimal global sequence alignment of the query against potential target sequences, using full dynamic programming instead of the seed-and-extend heuristic used by USEARCH. Pairwise alignments are computed in parallel using vectorisation and multiple threads. VSEARCH includes most commands for analysing nucleotide sequences available in USEARCH version 7 and several of those available in USEARCH version 8, including searching (exact or based on global alignment), clustering by similarity (using length pre-sorting, abundance pre-sorting or a user-defined order), chimera detection (reference-based or de novo ), dereplication (full length or prefix), pairwise alignment, reverse complementation, sorting, and subsampling. VSEARCH also includes commands for FASTQ file processing, i.e., format detection, filtering, read quality statistics, and merging of paired reads. Furthermore, VSEARCH extends functionality with several new commands and improvements, including shuffling, rereplication, masking of low-complexity sequences with the well-known DUST algorithm, a choice among different similarity definitions, and FASTQ file format conversion. VSEARCH is here shown to be more accurate than USEARCH when performing searching, clustering, chimera detection and subsampling, while on a par with USEARCH for paired

  10. VSEARCH: a versatile open source tool for metagenomics

    Torbjørn Rognes

    2016-10-01

    Full Text Available Background VSEARCH is an open source and free of charge multithreaded 64-bit tool for processing and preparing metagenomics, genomics and population genomics nucleotide sequence data. It is designed as an alternative to the widely used USEARCH tool (Edgar, 2010 for which the source code is not publicly available, algorithm details are only rudimentarily described, and only a memory-confined 32-bit version is freely available for academic use. Methods When searching nucleotide sequences, VSEARCH uses a fast heuristic based on words shared by the query and target sequences in order to quickly identify similar sequences, a similar strategy is probably used in USEARCH. VSEARCH then performs optimal global sequence alignment of the query against potential target sequences, using full dynamic programming instead of the seed-and-extend heuristic used by USEARCH. Pairwise alignments are computed in parallel using vectorisation and multiple threads. Results VSEARCH includes most commands for analysing nucleotide sequences available in USEARCH version 7 and several of those available in USEARCH version 8, including searching (exact or based on global alignment, clustering by similarity (using length pre-sorting, abundance pre-sorting or a user-defined order, chimera detection (reference-based or de novo, dereplication (full length or prefix, pairwise alignment, reverse complementation, sorting, and subsampling. VSEARCH also includes commands for FASTQ file processing, i.e., format detection, filtering, read quality statistics, and merging of paired reads. Furthermore, VSEARCH extends functionality with several new commands and improvements, including shuffling, rereplication, masking of low-complexity sequences with the well-known DUST algorithm, a choice among different similarity definitions, and FASTQ file format conversion. VSEARCH is here shown to be more accurate than USEARCH when performing searching, clustering, chimera detection and subsampling

  11. Glucose-tolerant β-glucosidase retrieved from the metagenome

    Taku eUchiyama

    2015-06-01

    Full Text Available β-glucosidases (BGLs hydrolyze cellooligosaccharides to glucose and play a crucial role in the enzymatic saccharification of cellulosic biomass. Despite their significance for the production of glucose, most identified BGLs are commonly inhibited by low (~mM concentrations of glucose. Therefore, BGLs that are insensitive to glucose inhibition have great biotechnological merit. We applied a metagenomic approach to screen for such rare glucose-tolerant BGLs. A metagenomic library was created in Escherichia coli (approximately 10,000 colonies and grown on LB agar plates containing 5-bromo-4-chloro-3-indolyl-β-D-glucoside, yielding 828 positive (blue colonies. These were then arrayed in 96-well plates, grown in LB, and secondarily screened for activity in the presence of 10% (w/v glucose. Seven glucose-tolerant clones were identified, each of which contained a single bgl gene. The genes were classified into two groups, differing by two nucleotides. The deduced amino acid sequences of these genes were identical (452 aa and found to belong to the glycosyl hydrolase family 1. The recombinant protein (Ks5A7 was overproduced in E. coli as a C-terminal 6 × His-tagged protein and purified to apparent homogeneity. The molecular mass of the purified Ks5A7 was determined to be 54 kDa by SDS-PAGE, and 160 kDa by gel filtration analysis. The enzyme was optimally active at 45°C and pH 5.0–6.5 and retained full or 1.5–2-fold enhanced activity in the presence of 0.1–0.5 M glucose. It had a low KM (78 µM with p-nitrophenyl β-D-glucoside; 0.36 mM with cellobiose and high Vmax (91 µmol min-1 mg-1 with p-nitrophenyl β-D-glucoside; 155 µmol min-1 mg-1 with cellobiose among known glucose-tolerant BGLs and was free from substrate (0.1 M cellobiose inhibition. The efficient use of Ks5A7 in conjunction with Trichoderma reesei cellulases in enzymatic saccharification of alkaline-treated rice straw was demonstrated by increased production of glucose.

  12. A Metagenomic Survey of Limestone Hill in Taiwan

    Hsu, Y. W.; Li, K. Y.; Chen, Y. W.; Huang, T. Y.; Chen, W. J.; Shih, Y. J.; Chen, J. S.; Fan, C. W.; Hsu, B. M.

    2016-12-01

    The limestone of Narro-Sky in Tainliao, Taiwan is of Pleistocene reef limestones interbedded in clastic layers that covered the Takangshan anticlines. Understanding how microbial relative abundance was changed in response to changes of environmental factors may contribute to better comprehension of roles that microorganisms play in altering the landscape structures. In this study, microorganisms growing on the wall of limestone, in the water dripping from the limestone wall and of soil underneath the wall were collected from different locations where the environmental factors such as daytime illumination, humidity, or pH are different. Next generation sequencing (NGS) was carried out to examine the compositions and richness of microbial community. The metagenomics were clustered into operational taxonomic units (OTUs) to analyze relative abundance, diversities and principal coordinates analysis (PCoA). Our results showed the soil sample has the highest alpha diversity while water sample has the lowest. Four major phyla, which are Proteobacteria, Acidobacteria, Actinobacteria, and Cyanobacteria, account for 80 % of total microbial biomass in all groups. Cyanobacteria were found most abundantly in limestone wall instead of water or soil of weathering limestone. The PCoA dimensional patterns of each phylum showed a trace of microbial community dynamic changes, which might be affected by environmental factors. This study provides the insights to understand how environmental factors worked together with microbial community to shape landscape structures.

  13. WGSQuikr: fast whole-genome shotgun metagenomic classification.

    David Koslicki

    Full Text Available With the decrease in cost and increase in output of whole-genome shotgun technologies, many metagenomic studies are utilizing this approach in lieu of the more traditional 16S rRNA amplicon technique. Due to the large number of relatively short reads output from whole-genome shotgun technologies, there is a need for fast and accurate short-read OTU classifiers. While there are relatively fast and accurate algorithms available, such as MetaPhlAn, MetaPhyler, PhyloPythiaS, and PhymmBL, these algorithms still classify samples in a read-by-read fashion and so execution times can range from hours to days on large datasets. We introduce WGSQuikr, a reconstruction method which can compute a vector of taxonomic assignments and their proportions in the sample with remarkable speed and accuracy. We demonstrate on simulated data that WGSQuikr is typically more accurate and up to an order of magnitude faster than the aforementioned classification algorithms. We also verify the utility of WGSQuikr on real biological data in the form of a mock community. WGSQuikr is a Whole-Genome Shotgun QUadratic, Iterative, K-mer based Reconstruction method which extends the previously introduced 16S rRNA-based algorithm Quikr. A MATLAB implementation of WGSQuikr is available at: http://sourceforge.net/projects/wgsquikr.

  14. The microbiome of Brazilian mangrove sediments as revealed by metagenomics.

    Fernando Dini Andreote

    Full Text Available Here we embark in a deep metagenomic survey that revealed the taxonomic and potential metabolic pathways aspects of mangrove sediment microbiology. The extraction of DNA from sediment samples and the direct application of pyrosequencing resulted in approximately 215 Mb of data from four distinct mangrove areas (BrMgv01 to 04 in Brazil. The taxonomic approaches applied revealed the dominance of Deltaproteobacteria and Gammaproteobacteria in the samples. Paired statistical analysis showed higher proportions of specific taxonomic groups in each dataset. The metabolic reconstruction indicated the possible occurrence of processes modulated by the prevailing conditions found in mangrove sediments. In terms of carbon cycling, the sequences indicated the prevalence of genes involved in the metabolism of methane, formaldehyde, and carbon dioxide. With respect to the nitrogen cycle, evidence for sequences associated with dissimilatory reduction of nitrate, nitrogen immobilization, and denitrification was detected. Sequences related to the production of adenylsulfate, sulfite, and H(2S were relevant to the sulphur cycle. These data indicate that the microbial core involved in methane, nitrogen, and sulphur metabolism consists mainly of Burkholderiaceae, Planctomycetaceae, Rhodobacteraceae, and Desulfobacteraceae. Comparison of our data to datasets from soil and sea samples resulted in the allotment of the mangrove sediments between those samples. The results of this study add valuable data about the composition of microbial communities in mangroves and also shed light on possible transformations promoted by microbial organisms in mangrove sediments.

  15. The microbiome of Brazilian mangrove sediments as revealed by metagenomics.

    Andreote, Fernando Dini; Jiménez, Diego Javier; Chaves, Diego; Dias, Armando Cavalcante Franco; Luvizotto, Danice Mazzer; Dini-Andreote, Francisco; Fasanella, Cristiane Cipola; Lopez, Maryeimy Varon; Baena, Sandra; Taketani, Rodrigo Gouvêa; de Melo, Itamar Soares

    2012-01-01

    Here we embark in a deep metagenomic survey that revealed the taxonomic and potential metabolic pathways aspects of mangrove sediment microbiology. The extraction of DNA from sediment samples and the direct application of pyrosequencing resulted in approximately 215 Mb of data from four distinct mangrove areas (BrMgv01 to 04) in Brazil. The taxonomic approaches applied revealed the dominance of Deltaproteobacteria and Gammaproteobacteria in the samples. Paired statistical analysis showed higher proportions of specific taxonomic groups in each dataset. The metabolic reconstruction indicated the possible occurrence of processes modulated by the prevailing conditions found in mangrove sediments. In terms of carbon cycling, the sequences indicated the prevalence of genes involved in the metabolism of methane, formaldehyde, and carbon dioxide. With respect to the nitrogen cycle, evidence for sequences associated with dissimilatory reduction of nitrate, nitrogen immobilization, and denitrification was detected. Sequences related to the production of adenylsulfate, sulfite, and H(2)S were relevant to the sulphur cycle. These data indicate that the microbial core involved in methane, nitrogen, and sulphur metabolism consists mainly of Burkholderiaceae, Planctomycetaceae, Rhodobacteraceae, and Desulfobacteraceae. Comparison of our data to datasets from soil and sea samples resulted in the allotment of the mangrove sediments between those samples. The results of this study add valuable data about the composition of microbial communities in mangroves and also shed light on possible transformations promoted by microbial organisms in mangrove sediments.

  16. Biogeographic partitioning of Southern Ocean microorganisms revealed by metagenomics.

    Wilkins, David; Lauro, Federico M; Williams, Timothy J; Demaere, Matthew Z; Brown, Mark V; Hoffman, Jeffrey M; Andrews-Pfannkoch, Cynthia; McQuaid, Jeffrey B; Riddle, Martin J; Rintoul, Stephen R; Cavicchioli, Ricardo

    2013-05-01

    We performed a metagenomic survey (6.6 Gbp of 454 sequence data) of Southern Ocean (SO) microorganisms during the austral summer of 2007-2008, examining the genomic signatures of communities across a latitudinal transect from Hobart (44°S) to the Mertz Glacier, Antarctica (67°S). Operational taxonomic units (OTUs) of the SAR11 and SAR116 clades and the cyanobacterial genera Prochlorococcus and Synechococcus were strongly overrepresented north of the Polar Front (PF). Conversely, OTUs of the Gammaproteobacterial Sulfur Oxidizer-EOSA-1 (GSO-EOSA-1) complex, the phyla Bacteroidetes and Verrucomicrobia and order Rhodobacterales were characteristic of waters south of the PF. Functions enriched south of the PF included a range of transporters, sulfur reduction and histidine degradation to glutamate, while branched-chain amino acid transport, nucleic acid biosynthesis and methionine salvage were overrepresented north of the PF. The taxonomic and functional characteristics suggested a shift of primary production from cyanobacteria in the north to eukaryotic phytoplankton in the south, and reflected the different trophic statuses of the two regions. The study provides a new level of understanding about SO microbial communities, describing the contrasting taxonomic and functional characteristics of microbial assemblages either side of the PF. © 2012 Society for Applied Microbiology and Blackwell Publishing Ltd.

  17. Comparative Metagenomics of Eight Geographically Remote Terrestrial Hot Springs.

    Menzel, Peter; Gudbergsdóttir, Sóley Ruth; Rike, Anne Gunn; Lin, Lianbing; Zhang, Qi; Contursi, Patrizia; Moracci, Marco; Kristjansson, Jakob K; Bolduc, Benjamin; Gavrilov, Sergey; Ravin, Nikolai; Mardanov, Andrey; Bonch-Osmolovskaya, Elizaveta; Young, Mark; Krogh, Anders; Peng, Xu

    2015-08-01

    Hot springs are natural habitats for thermophilic Archaea and Bacteria. In this paper, we present the metagenomic analysis of eight globally distributed terrestrial hot springs from China, Iceland, Italy, Russia, and the USA with a temperature range between 61 and 92 (∘)C and pH between 1.8 and 7. A comparison of the biodiversity and community composition generally showed a decrease in biodiversity with increasing temperature and decreasing pH. Another important factor shaping microbial diversity of the studied sites was the abundance of organic substrates. Several species of the Crenarchaeal order Thermoprotei were detected, whereas no single bacterial species was found in all samples, suggesting a better adaptation of certain archaeal species to different thermophilic environments. Two hot springs show high abundance of Acidithiobacillus, supporting the idea of a true thermophilic Acidithiobacillus species that can thrive in hyperthermophilic environments. Depending on the sample, up to 58 % of sequencing reads could not be assigned to a known phylum, reinforcing the fact that a large number of microorganisms in nature, including those thriving in hot environments remain to be isolated and characterized.

  18. Microbial community profiling of human saliva using shotgun metagenomic sequencing.

    Nur A Hasan

    Full Text Available Human saliva is clinically informative of both oral and general health. Since next generation shotgun sequencing (NGS is now widely used to identify and quantify bacteria, we investigated the bacterial flora of saliva microbiomes of two healthy volunteers and five datasets from the Human Microbiome Project, along with a control dataset containing short NGS reads from bacterial species representative of the bacterial flora of human saliva. GENIUS, a system designed to identify and quantify bacterial species using unassembled short NGS reads was used to identify the bacterial species comprising the microbiomes of the saliva samples and datasets. Results, achieved within minutes and at greater than 90% accuracy, showed more than 175 bacterial species comprised the bacterial flora of human saliva, including bacteria known to be commensal human flora but also Haemophilus influenzae, Neisseria meningitidis, Streptococcus pneumoniae, and Gamma proteobacteria. Basic Local Alignment Search Tool (BLASTn analysis in parallel, reported ca. five times more species than those actually comprising the in silico sample. Both GENIUS and BLAST analyses of saliva samples identified major genera comprising the bacterial flora of saliva, but GENIUS provided a more precise description of species composition, identifying to strain in most cases and delivered results at least 10,000 times faster. Therefore, GENIUS offers a facile and accurate system for identification and quantification of bacterial species and/or strains in metagenomic samples.

  19. Identifying airborne fungi in Seoul, Korea using metagenomics.

    Oh, Seung-Yoon; Fong, Jonathan J; Park, Myung Soo; Chang, Limseok; Lim, Young Woon

    2014-06-01

    Fungal spores are widespread and common in the atmosphere. In this study, we use a metagenomic approach to study the fungal diversity in six total air samples collected from April to May 2012 in Seoul, Korea. This springtime period is important in Korea because of the peak in fungal spore concentration and Asian dust storms, although the year of this study (2012) was unique in that were no major Asian dust events. Clustering sequences for operational taxonomic unit (OTU) identification recovered 1,266 unique OTUs in the combined dataset, with between 223᾿96 OTUs present in individual samples. OTUs from three fungal phyla were identified. For Ascomycota, Davidiella (anamorph: Cladosporium) was the most common genus in all samples, often accounting for more than 50% of all sequences in a sample. Other common Ascomycota genera identified were Alternaria, Didymella, Khuskia, Geosmitha, Penicillium, and Aspergillus. While several Basidiomycota genera were observed, Chytridiomycota OTUs were only present in one sample. Consistency was observed within sampling days, but there was a large shift in species composition from Ascomycota dominant to Basidiomycota dominant in the middle of the sampling period. This marked change may have been caused by meteorological events. A potential set of 40 allergy-inducing genera were identified, accounting for a large proportion of the diversity present (22.5᾿7.2%). Our study identifies high fungal diversity and potentially high levels of fungal allergens in springtime air of Korea, and provides a good baseline for future comparisons with Asian dust storms.

  20. Benchmarking of gene prediction programs for metagenomic data.

    Yok, Non; Rosen, Gail

    2010-01-01

    This manuscript presents the most rigorous benchmarking of gene annotation algorithms for metagenomic datasets to date. We compare three different programs: GeneMark, MetaGeneAnnotator (MGA) and Orphelia. The comparisons are based on their performances over simulated fragments from one hundred species of diverse lineages. We defined four different types of fragments; two types come from the inter- and intra-coding regions and the other types are from the gene edges. Hoff et al. used only 12 species in their comparison; therefore, their sample is too small to represent an environmental sample. Also, no predecessors has separately examined fragments that contain gene edges as opposed to intra-coding regions. General observations in our results are that performances of all these programs improve as we increase the length of the fragment. On the other hand, intra-coding fragments of our data show low annotation error in all of the programs if compared to the gene edge fragments. Overall, we found an upper-bound performance by combining all the methods.

  1. Key roles for freshwater Actinobacteria revealed by deep metagenomic sequencing.

    Ghai, Rohit; Mizuno, Carolina Megumi; Picazo, Antonio; Camacho, Antonio; Rodriguez-Valera, Francisco

    2014-12-01

    Freshwater ecosystems are critical but fragile environments directly affecting society and its welfare. However, our understanding of genuinely freshwater microbial communities, constrained by our capacity to manipulate its prokaryotic participants in axenic cultures, remains very rudimentary. Even the most abundant components, freshwater Actinobacteria, remain largely unknown. Here, applying deep metagenomic sequencing to the microbial community of a freshwater reservoir, we were able to circumvent this traditional bottleneck and reconstruct de novo seven distinct streamlined actinobacterial genomes. These genomes represent three new groups of photoheterotrophic, planktonic Actinobacteria. We describe for the first time genomes of two novel clades, acMicro (Micrococcineae, related to Luna2,) and acAMD (Actinomycetales, related to acTH1). Besides, an aggregate of contigs belonged to a new branch of the Acidimicrobiales. All are estimated to have small genomes (approximately 1.2 Mb), and their GC content varied from 40 to 61%. One of the Micrococcineae genomes encodes a proteorhodopsin, a rhodopsin type reported for the first time in Actinobacteria. The remarkable potential capacity of some of these genomes to transform recalcitrant plant detrital material, particularly lignin-derived compounds, suggests close linkages between the terrestrial and aquatic realms. Moreover, abundances of Actinobacteria correlate inversely to those of Cyanobacteria that are responsible for prolonged and frequently irretrievable damage to freshwater ecosystems. This suggests that they might serve as sentinels of impending ecological catastrophes. © 2014 John Wiley & Sons Ltd.

  2. Metagenomic screening for aromatic compound-responsive transcriptional regulators.

    Taku Uchiyama

    Full Text Available We applied a metagenomics approach to screen for transcriptional regulators that sense aromatic compounds. The library was constructed by cloning environmental DNA fragments into a promoter-less vector containing green fluorescence protein. Fluorescence-based screening was then performed in the presence of various aromatic compounds. A total of 12 clones were isolated that fluoresced in response to salicylate, 3-methyl catechol, 4-chlorocatechol and chlorohydroquinone. Sequence analysis revealed at least 1 putative transcriptional regulator, excluding 1 clone (CHLO8F. Deletion analysis identified compound-specific transcriptional regulators; namely, 8 LysR-types, 2 two-component-types and 1 AraC-type. Of these, 9 representative clones were selected and their reaction specificities to 18 aromatic compounds were investigated. Overall, our transcriptional regulators were functionally diverse in terms of both specificity and induction rates. LysR- and AraC- type regulators had relatively narrow specificities with high induction rates (5-50 fold, whereas two-component-types had wide specificities with low induction rates (3 fold. Numerous transcriptional regulators have been deposited in sequence databases, but their functions remain largely unknown. Thus, our results add valuable information regarding the sequence-function relationship of transcriptional regulators.

  3. Metagenomic analysis reveals presence of Treponema denticola in a tissue biopsy of the Iceman.

    Frank Maixner

    Full Text Available Ancient hominoid genome studies can be regarded by definition as metagenomic analyses since they represent a mixture of both hominoid and microbial sequences in an environment. Here, we report the molecular detection of the oral spirochete Treponema denticola in ancient human tissue biopsies of the Iceman, a 5,300-year-old Copper Age natural ice mummy. Initially, the metagenomic data of the Iceman's genomic survey was screened for bacterial ribosomal RNA (rRNA specific reads. Through ranking the reads by abundance a relatively high number of rRNA reads most similar to T. denticola was detected. Mapping of the metagenome sequences against the T. denticola genome revealed additional reads most similar to this opportunistic pathogen. The DNA damage pattern of specifically mapped reads suggests an ancient origin of these sequences. The haematogenous spread of bacteria of the oral microbiome often reported in the recent literature could already explain the presence of metagenomic reads specific for T. denticola in the Iceman's bone biopsy. We extended, however, our survey to an Iceman gingival tissue sample and a mouth swab sample and could thereby detect T. denticola and Porphyrimonas gingivalis, another important member of the human commensal oral microflora. Taken together, this study clearly underlines the opportunity to detect disease-associated microorganisms when applying metagenomics-enabled approaches on datasets of ancient human remains.

  4. Metagenomics of the Svalbard reindeer rumen microbiome reveals abundance of polysaccharide utilization loci.

    Phillip B Pope

    Full Text Available Lignocellulosic biomass remains a largely untapped source of renewable energy predominantly due to its recalcitrance and an incomplete understanding of how this is overcome in nature. We present here a compositional and comparative analysis of metagenomic data pertaining to a natural biomass-converting ecosystem adapted to austere arctic nutritional conditions, namely the rumen microbiome of Svalbard reindeer (Rangifer tarandus platyrhynchus. Community analysis showed that deeply-branched cellulolytic lineages affiliated to the Bacteroidetes and Firmicutes are dominant, whilst sequence binning methods facilitated the assemblage of metagenomic sequence for a dominant and novel Bacteroidales clade (SRM-1. Analysis of unassembled metagenomic sequence as well as metabolic reconstruction of SRM-1 revealed the presence of multiple polysaccharide utilization loci-like systems (PULs as well as members of more than 20 glycoside hydrolase and other carbohydrate-active enzyme families targeting various polysaccharides including cellulose, xylan and pectin. Functional screening of cloned metagenome fragments revealed high cellulolytic activity and an abundance of PULs that are rich in endoglucanases (GH5 but devoid of other common enzymes thought to be involved in cellulose degradation. Combining these results with known and partly re-evaluated metagenomic data strongly indicates that much like the human distal gut, the digestive system of herbivores harbours high numbers of deeply branched and as-yet uncultured members of the Bacteroidetes that depend on PUL-like systems for plant biomass degradation.

  5. Structural and Functional Insights from the Metagenome of an Acidic Hot Spring Microbial Planktonic Community in the Colombian Andes

    Jiménez Avella, Diego; Dini Andreote, Fernando; Chaves, Diego; Montaña, José Salvador; Osorio-Forero, Cesar; Junca, Howard; Zambrano, María Mercedes; Baena, Sandra

    2012-01-01

    A taxonomic and annotated functional description of microbial life was deduced from 53 Mb of metagenomic sequence retrieved from a planktonic fraction of the Neotropical high Andean (3,973 meters above sea level) acidic hot spring El Coquito (EC). A classification of unassembled metagenomic reads

  6. Biofilm-Growing Bacteria Involved in the Corrosion of Concrete Wastewater Pipes: Protocols for Comparative Metagenomic Analyses

    Advances in high-throughput next-generation sequencing (NGS) technology for direct sequencing of environmental DNA (i.e. shotgun metagenomics) is transforming the field of microbiology. NGS technologies are now regularly being applied in comparative metagenomic studies, which pr...

  7. Re-Analysis of Metagenomic Sequences from Acute Flaccidmyelitis Patients Reveals Alternatives to Enterovirus D68 Infection

    2015-07-13

    caused in some cases by infection with enterovirus D68. We found that among the patients whose symptoms were previously attributed to enterovirus D68...distribution is unlimited. Re-analysis of metagenomic sequences from acute flaccidmyelitis patients reveals alternatives to enterovirus D68...Street Baltimore, MD 21218 -2685 ABSTRACT Re-analysis of metagenomic sequences from acute flaccidmyelitis patients reveals alternatives to enterovirus

  8. Development of high-throughput phenotyping of metagenomic clones from the human gut microbiome for modulation of eukaryotic cell growth.

    Gloux, Karine; Leclerc, Marion; Iliozer, Harout; L'Haridon, René; Manichanh, Chaysavanh; Corthier, Gérard; Nalin, Renaud; Blottière, Hervé M; Doré, Joël

    2007-06-01

    Metagenomic libraries derived from human intestinal microbiota (20,725 clones) were screened for epithelial cell growth modulation. Modulatory clones belonging to the four phyla represented among the metagenomic libraries were identified (hit rate, 0.04 to 8.7% depending on the screening cutoff). Several candidate loci were identified by transposon mutagenesis and subcloning.

  9. A Delphi Technology Foresight Study: Mapping Social Construction of Scientific Evidence on Metagenomics Tests for Water Safety.

    Stanislav Birko

    Full Text Available Access to clean water is a grand challenge in the 21st century. Water safety testing for pathogens currently depends on surrogate measures such as fecal indicator bacteria (e.g., E. coli. Metagenomics concerns high-throughput, culture-independent, unbiased shotgun sequencing of DNA from environmental samples that might transform water safety by detecting waterborne pathogens directly instead of their surrogates. Yet emerging innovations such as metagenomics are often fiercely contested. Innovations are subject to shaping/construction not only by technology but also social systems/values in which they are embedded, such as experts' attitudes towards new scientific evidence. We conducted a classic three-round Delphi survey, comprised of 107 questions. A multidisciplinary expert panel (n = 24 representing the continuum of discovery scientists and policymakers evaluated the emergence of metagenomics tests. To the best of our knowledge, we report here the first Delphi foresight study of experts' attitudes on (1 the top 10 priority evidentiary criteria for adoption of metagenomics tests for water safety, (2 the specific issues critical to governance of metagenomics innovation trajectory where there is consensus or dissensus among experts, (3 the anticipated time lapse from discovery to practice of metagenomics tests, and (4 the role and timing of public engagement in development of metagenomics tests. The ability of a test to distinguish between harmful and benign waterborne organisms, analytical/clinical sensitivity, and reproducibility were the top three evidentiary criteria for adoption of metagenomics. Experts agree that metagenomic testing will provide novel information but there is dissensus on whether metagenomics will replace the current water safety testing methods or impact the public health end points (e.g., reduction in boil water advisories. Interestingly, experts view the publics relevant in a "downstream capacity" for adoption of

  10. Marine metagenomics: strategies for the discovery of novel enzymes with biotechnological applications from marine environments

    Dobson Alan DW

    2008-08-01

    Full Text Available Abstract Metagenomic based strategies have previously been successfully employed as powerful tools to isolate and identify enzymes with novel biocatalytic activities from the unculturable component of microbial communities from various terrestrial environmental niches. Both sequence based and function based screening approaches have been employed to identify genes encoding novel biocatalytic activities and metabolic pathways from metagenomic libraries. While much of the focus to date has centred on terrestrial based microbial ecosystems, it is clear that the marine environment has enormous microbial biodiversity that remains largely unstudied. Marine microbes are both extremely abundant and diverse; the environments they occupy likewise consist of very diverse niches. As culture-dependent methods have thus far resulted in the isolation of only a tiny percentage of the marine microbiota the application of metagenomic strategies holds great potential to study and exploit the enormous microbial biodiversity which is present within these marine environments.

  11. Autotrophic microbe metagenomes and metabolic pathways differentiate adjacent red sea brine pools

    Wang, Yong

    2013-04-29

    In the Red Sea, two neighboring deep-sea brine pools, Atlantis II and Discovery, have been studied extensively, and the results have shown that the temperature and concentrations of metal and methane in Atlantis II have increased over the past decades. Therefore, we investigated changes in the microbial community and metabolic pathways. Here, we compared the metagenomes of the two pools to each other and to those of deep-sea water samples. Archaea were generally absent in the Atlantis II metagenome; Bacteria in the metagenome were typically heterotrophic and depended on aromatic compounds and other extracellular organic carbon compounds as indicated by enrichment of the related metabolic pathways. In contrast, autotrophic Archaea capable of CO2 fixation and methane oxidation were identified in Discovery but not in Atlantis II. Our results suggest that hydrothermal conditions and metal precipitation in the Atlantis II pool have resulted in elimination of the autotrophic community and methanogens.

  12. High-resolution metagenomics targets major functional types in complex microbial communities

    Kalyuzhnaya, Marina G.; Lapidus, Alla; Ivanova, Natalia; Copeland, Alex C.; McHardy, Alice C.; Szeto, Ernest; Salamov, Asaf; Grigoriev, Igor V.; Suciu, Dominic; Levine, Samuel R.; Markowitz, Victor M.; Rigoutsos, Isidore; Tringe, Susannah G.; Bruce, David C.; Richardson, Paul M.; Lidstrom, Mary E.; Chistoserdova, Ludmila

    2009-08-01

    Most microbes in the biosphere remain uncultured and unknown. Whole genome shotgun (WGS) sequencing of environmental DNA (metagenomics) allows glimpses into genetic and metabolic potentials of natural microbial communities. However, in communities of high complexity metagenomics fail to link specific microbes to specific ecological functions. To overcome this limitation, we selectively targeted populations involved in oxidizing single-carbon (C{sub 1}) compounds in Lake Washington (Seattle, USA) by labeling their DNA via stable isotope probing (SIP), followed by WGS sequencing. Metagenome analysis demonstrated specific sequence enrichments in response to different C{sub 1} substrates, highlighting ecological roles of individual phylotypes. We further demonstrated the utility of our approach by extracting a nearly complete genome of a novel methylotroph Methylotenera mobilis, reconstructing its metabolism and conducting genome-wide analyses. This approach allowing high-resolution genomic analysis of ecologically relevant species has the potential to be applied to a wide variety of ecosystems.

  13. Novel polyhydroxyalkanoate copolymers produced in Pseudomonas putida by metagenomic polyhydroxyalkanoate synthases.

    Cheng, Jiujun; Charles, Trevor C

    2016-09-01

    Bacterially produced biodegradable polyhydroxyalkanoates (PHAs) with versatile properties can be achieved using different PHA synthases (PhaCs). This work aims to expand the diversity of known PhaCs via functional metagenomics and demonstrates the use of these novel enzymes in PHA production. Complementation of a PHA synthesis-deficient Pseudomonas putida strain with a soil metagenomic cosmid library retrieved 27 clones expressing either class I, class II, or unclassified PHA synthases, and many did not have close sequence matches to known PhaCs. The composition of PHA produced by these clones was dependent on both the supplied growth substrates and the nature of the PHA synthase, with various combinations of short-chain-length (SCL) and medium-chain-length (MCL) PHA. These data demonstrate the ability to isolate diverse genes for PHA synthesis by functional metagenomics and their use for the production of a variety of PHA polymer and copolymer mixtures.

  14. A sampling and metagenomic sequencing-based methodology for monitoring antimicrobial resistance in swine herds

    Munk, Patrick; Dalhoff Andersen, Vibe; de Knegt, Leonardo

    2016-01-01

    Objectives Reliable methods for monitoring antimicrobial resistance (AMR) in livestock and other reservoirs are essential to understand the trends, transmission and importance of agricultural resistance. Quantification of AMR is mostly done using culture-based techniques, but metagenomic read...... mapping shows promise for quantitative resistance monitoring. Methods We evaluated the ability of: (i) MIC determination for Escherichia coli; (ii) cfu counting of E. coli; (iii) cfu counting of aerobic bacteria; and (iv) metagenomic shotgun sequencing to predict expected tetracycline resistance based...... cultivation-based techniques in terms of predicting expected tetracycline resistance based on antimicrobial consumption. Our metagenomic approach had sufficient resolution to detect antimicrobial-induced changes to individual resistance gene abundances. Pen floor manure samples were found to represent rectal...

  15. Untangling Genomes from Metagenomes: Revealing an Uncultured Class of Marine Euryarchaeota

    Iverson, Vaughn; Morris, Robert M.; Frazar, Christian D.; Berthiaume, Chris T.; Morales, Rhonda L.; Armbrust, E. Virginia

    2012-02-01

    Ecosystems are shaped by complex communities of mostly unculturable microbes. Metagenomes provide a fragmented view of such communities, but the ecosystem functions of major groups of organisms remain mysterious. To better characterize members of these communities, we developed methods to reconstruct genomes directly from mate-paired short-read metagenomes. We closed a genome representing the as-yet uncultured marine group II Euryarchaeota, assembled de novo from 1.7% of a metagenome sequenced from surface seawater. The genome describes a motile, photo-heterotrophic cell focused on degradation of protein and lipids and clarifies the origin of proteorhodopsin. It also demonstrates that high-coverage mate-paired sequence can overcome assembly difficulties caused by interstrain variation in complex microbial communities, enabling inference of ecosystem functions for uncultured members.

  16. MetaBAT: Metagenome Binning based on Abundance and Tetranucleotide frequence

    Kang, Dongwan; Froula, Jeff; Egan, Rob; Wang, Zhong

    2014-03-21

    Grouping large fragments assembled from shotgun metagenomic sequences to deconvolute complex microbial communities, or metagenome binning, enables the study of individual organisms and their interactions. Here we developed automated metagenome binning software, called MetaBAT, which integrates empirical probabilistic distances of genome abundance and tetranucleotide frequency. On synthetic datasets MetaBAT on average achieves 98percent precision and 90percent recall at the strain level with 281 near complete unique genomes. Applying MetaBAT to a human gut microbiome data set we recovered 176 genome bins with 92percent precision and 80percent recall. Further analyses suggest MetaBAT is able to recover genome fragments missed in reference genomes up to 19percent, while 53 genome bins are novel. In summary, we believe MetaBAT is a powerful tool to facilitate comprehensive understanding of complex microbial communities.

  17. Abundance profiling of specific gene groups using precomputed gut metagenomes yields novel biological hypotheses.

    Konstantin Yarygin

    Full Text Available The gut microbiota is essentially a multifunctional bioreactor within a human being. The exploration of its enormous metabolic potential provides insights into the mechanisms underlying microbial ecology and interactions with the host. The data obtained using "shotgun" metagenomics capture information about the whole spectrum of microbial functions. However, each new study presenting new sequencing data tends to extract only a little of the information concerning the metabolic potential and often omits specific functions. A meta-analysis of the available data with an emphasis on biomedically relevant gene groups can unveil new global trends in the gut microbiota. As a step toward the reuse of metagenomic data, we developed a method for the quantitative profiling of user-defined groups of genes in human gut metagenomes. This method is based on the quick analysis of a gene coverage matrix obtained by pre-mapping the metagenomic reads to a global gut microbial catalogue. The method was applied to profile the abundance of several gene groups related to antibiotic resistance, phages, biosynthesis clusters and carbohydrate degradation in 784 metagenomes from healthy populations worldwide and patients with inflammatory bowel diseases and obesity. We discovered country-wise functional specifics in gut resistome and virome compositions. The most distinct features of the disease microbiota were found for Crohn's disease, followed by ulcerative colitis and obesity. Profiling of the genes belonging to crAssphage showed that its abundance varied across the world populations and was not associated with clinical status. We demonstrated temporal resilience of crAssphage and the influence of the sample preparation protocol on its detected abundance. Our approach offers a convenient method to add value to accumulated "shotgun" metagenomic data by helping researchers state and assess novel biological hypotheses.

  18. Gene identification and protein classification in microbial metagenomic sequence data via incremental clustering

    Li Weizhong

    2008-04-01

    Full Text Available Abstract Background The identification and study of proteins from metagenomic datasets can shed light on the roles and interactions of the source organisms in their communities. However, metagenomic datasets are characterized by the presence of organisms with varying GC composition, codon usage biases etc., and consequently gene identification is challenging. The vast amount of sequence data also requires faster protein family classification tools. Results We present a computational improvement to a sequence clustering approach that we developed previously to identify and classify protein coding genes in large microbial metagenomic datasets. The clustering approach can be used to identify protein coding genes in prokaryotes, viruses, and intron-less eukaryotes. The computational improvement is based on an incremental clustering method that does not require the expensive all-against-all compute that was required by the original approach, while still preserving the remote homology detection capabilities. We present evaluations of the clustering approach in protein-coding gene identification and classification, and also present the results of updating the protein clusters from our previous work with recent genomic and metagenomic sequences. The clustering results are available via CAMERA, (http://camera.calit2.net. Conclusion The clustering paradigm is shown to be a very useful tool in the analysis of microbial metagenomic data. The incremental clustering method is shown to be much faster than the original approach in identifying genes, grouping sequences into existing protein families, and also identifying novel families that have multiple members in a metagenomic dataset. These clusters provide a basis for further studies of protein families.

  19. Natural history bycatch: a pipeline for identifying metagenomic sequences in RADseq data

    Iris Holmes

    2018-04-01

    Full Text Available Background Reduced representation genomic datasets are increasingly becoming available from a variety of organisms. These datasets do not target specific genes, and so may contain sequences from parasites and other organisms present in the target tissue sample. In this paper, we demonstrate that (1 RADseq datasets can be used for exploratory analysis of tissue-specific metagenomes, and (2 tissue collections house complete metagenomic communities, which can be investigated and quantified by a variety of techniques. Methods We present an exploratory method for mining metagenomic “bycatch” sequences from a range of host tissue types. We use a combination of the pyRAD assembly pipeline, NCBI’s blastn software, and custom R scripts to isolate metagenomic sequences from RADseq type datasets. Results When we focus on sequences that align with existing references in NCBI’s GenBank, we find that between three and five percent of identifiable double-digest restriction site associated DNA (ddRAD sequences from host tissue samples are from phyla to contain known blood parasites. In addition to tissue samples, we examine ddRAD sequences from metagenomic DNA extracted snake and lizard hind-gut samples. We find that the sequences recovered from these samples match with expected bacterial and eukaryotic gut microbiome phyla. Discussion Our results suggest that (1 museum tissue banks originally collected for host DNA archiving are also preserving valuable parasite and microbiome communities, (2 that publicly available RADseq datasets may include metagenomic sequences that could be explored, and (3 that restriction site approaches are a useful exploratory technique to identify microbiome lineages that could be missed by primer-based approaches.

  20. Understanding Aquatic Rhizosphere Processes Through Metabolomics and Metagenomics Approach

    Lee, Yong Jian; Mynampati, Kalyan; Drautz, Daniela; Arumugam, Krithika; Williams, Rohan; Schuster, Stephan; Kjelleberg, Staffan; Swarup, Sanjay

    2013-04-01

    The aquatic rhizosphere is a region around the roots of aquatic plants. Many studies focusing on terrestrial rhizosphere have led to a good understanding of the interactions between the roots, its exudates and its associated rhizobacteria. The rhizosphere of free-floating roots, however, is a different habitat that poses several additional challenges, including rapid diffusion rates of signals and nutrient molecules, which are further influenced by the hydrodynamic forces. These can lead to rapid diffusion and complicates the studying of diffusible factors from both plant and/or rhizobacterial origins. These plant systems are being increasingly used for self purification of water bodies to provide sustainable solution. A better understanding of these processes will help in improving their performance for ecological engineering of freshwater systems. The same principles can also be used to improve the yield of hydroponic cultures. Novel toolsets and approaches are needed to investigate the processes occurring in the aquatic rhizosphere. We are interested in understanding the interaction between root exudates and the complex microbial communities that are associated with the roots, using a systems biology approach involving metabolomics and metagenomics. With this aim, we have developed a RhizoFlowCell (RFC) system that provides a controlled study of aquatic plants, observed the root biofilms, collect root exudates and subject the rhizosphere system to changes in various chemical or physical perturbations. As proof of concept, we have used RFC to test the response of root exudation patterns of Pandanus amaryllifolius after exposure to the pollutant naphthalene. Complexity of root exudates in the aquatic rhizosphere was captured using this device and analysed using LC-qTOF-MS. The highly complex metabolomic profile allowed us to study the dynamics of the response of roots to varying levels of naphthalene. The metabolic profile changed within 5mins after spiking with

  1. Vinasse fertirrigation alters soil resistome dynamics: an analysis based on metagenomic profiles.

    Braga, Lucas P P; Alves, Rafael F; Dellias, Marina T F; Navarrete, Acacio A; Basso, Thiago O; Tsai, Siu M

    2017-01-01

    Every year around 300 Gl of vinasse, a by-product of ethanol distillation in sugarcane mills, are flushed into more than 9 Mha of sugarcane cropland in Brazil. This practice links fermentation waste management to fertilization for plant biomass production, and it is known as fertirrigation. Here we evaluate public datasets of soil metagenomes mining for changes in antibiotic resistance genes (ARGs) of soils from sugarcane mesocosms repeatedly amended with vinasse. The metagenomes were annotated using the ResFam database. We found that the abundance of open read frames (ORFs) annotated as ARGs changed significantly across 43 different families ( p -value resistome.

  2. Technical Report: Benchmarking for Quasispecies Abundance Inference with Confidence Intervals from Metagenomic Sequence Data

    McLoughlin, K. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

    2016-01-22

    The software application “MetaQuant” was developed by our group at Lawrence Livermore National Laboratory (LLNL). It is designed to profile microbial populations in a sample using data from whole-genome shotgun (WGS) metagenomic DNA sequencing. Several other metagenomic profiling applications have been described in the literature. We ran a series of benchmark tests to compare the performance of MetaQuant against that of a few existing profiling tools, using real and simulated sequence datasets. This report describes our benchmarking procedure and results.

  3. Tuning the performance of a natural treatment process using metagenomics for improved trace organic chemical attenuation

    Drewes, Jorg

    2014-02-01

    By utilizing high-throughput sequencing and metagenomics, this study revealed how the microbial community characteristics including composition, diversity, as well as functional genes in managed aquifer recharge (MAR) systems can be tuned to enhance removal of trace organic chemicals of emerging concern (CECs). Increasing the humic content of the primary substrate resulted in higher microbial diversity. Lower concentrations and a higher humic content of the primary substrate promoted the attenuation of biodegradable CECs in laboratory and field MAR systems. Metagenomic results indicated that the metabolic capabilities of xenobiotic biodegradation were significantly promoted for the microbiome under carbon-starving conditions. © IWA Publishing 2014.

  4. Constructing and Screening a Metagenomic Library of a Cold and Alkaline Extreme Environment.

    Glaring, Mikkel A; Vester, Jan K; Stougaard, Peter

    2017-01-01

    Natural cold or alkaline environments are common on Earth. A rare combination of these two extremes is found in the permanently cold (less than 6 °C) and alkaline (pH above 10) ikaite columns in the Ikka Fjord in Southern Greenland. Bioprospecting efforts have established the ikaite columns as a source of bacteria and enzymes adapted to these conditions. They have also highlighted the limitations of cultivation-based methods in this extreme environment and metagenomic approaches may provide access to novel extremophilic enzymes from the uncultured majority of bacteria. Here, we describe the construction and screening of a metagenomic library of the prokaryotic community inhabiting the ikaite columns.

  5. Deployment and Preparation of Metagenomic Analysis on the EELA Grid

    Aparicio, G.; Blanquer, I.; Hernandez, V.; Pignatelli, M.; Tamames, J.

    2007-01-01

    In many cases, the sequencing of the DNA of many microorganisms is hindered by the impossibility of growing significant samples of isolated specimens. Many bacteria cannot survive alone, and require the interaction with other organisms. In such cases, the information of the DNA available belongs to different kinds of organisms. Metagenomic studies aim at processing samples of multiple specimens to extract the genes and proteins that belong to the different species. This can be achieved through a process of extraction of fragment, comparison and analysis of the function. By the comparison to existing chains, whose function is well known, fragments can be classified. This process is computationally expensive and requires several iterations of alignment and phylogeny classification steps. Source samples reach several millions of sequences, which could reach up to thousands of nucleotides each. These sequences are compared to a selected part of the N on-redundant d atabase which only implies the information from eukaryotic species. From this first analysis, a refining process is performed and alignment analysis is restarted from the results. This process implies several CPU years. An environment has been developed to fragment, automate and check the above operations. This environment has been tuned-up from an experimental study which has tested the most efficient and reliable resources, the optimal job size, and the data transference and database reindexation overhead. The environment should re-submit faulty jobs, detect endless tasks and ensure that the results are correctly retrieved and work flow synchronised. The paper will give an outline on the structure of the system, and the preparation steps performed to deal with this experiment. (Author)

  6. Metagenomics as a preliminary screen for antimicrobial bioprospecting

    Al Amoudi, Soha

    2016-09-16

    Since the composition of soil directs the diversity of the contained microbiome and its potential to produce bioactive compounds, many studies has been focused on sediment types with unique features characteristic of extreme environments. However, not much is known about the potential of microbiomes that inhabit the highly saline and hot Red Sea lagoons. This case study explores mangrove mud and the microbial mat of sediments collected from the Rabigh harbor lagoon and Al Kharrar lagoon for antimicrobial bioprospecting. Rabigh harbor lagoon appears the better location, and the best sediment type for this purpose is mangrove mud. On the other hand, Al Kharrar lagoon displayed increased anaerobic hydrocarbon degradation and an abundance of bacterial DNA associated with antibiotic resistance. Moreover, our findings show an identical shift in phyla associated with historic hydrocarbon contamination exposure reported in previous studies (that is, enrichment of Gamma-and Delta-proteobacteria), but we also report that bacterial DNA sequences associated with antibiotic synthesis enzymes are derived from Gamma-, Delta-and Alpha-proteobacteria. This suggests that selection pressure associated with hydrocarbon contamination tend to enrich the bacterial classes DNA associated with antibiotic synthesis enzymes. Although Actinobacteria tends to be the common target for research when it comes to antimicrobial bioprospecting, our study suggests that Firmicutes (Bacilli and Clostridia), Bacteroidetes, Cyanobacteria, and Proteobacteria should be antimicrobial bioprospecting targets as well. To the best of our knowledge, this is the first metagenomic study that analyzed the microbiomes in Red Sea lagoons for antimicrobial bioprospecting. (C) 2016 The Authors. Published by Elsevier B.V.

  7. Dynamic processes of the microbiota - from metagenomics to biofilms

    Wingreen, Ned

    The extent, origin, and impact of microbial diversity is a central question in biology. We expect that physical processes contribute to this diversity, but we are only beginning to explore the nature of these interactions. I will briefly discuss two approaches to this question, one based on metagenomics the other on observation of bacterial biofilms. First, I will address the challenge of identifying the constituents of microbial systems by presenting a new approach to analyzing community sequencing data that identifies microbial subpopulations while avoiding problematic clustering-based methods. Using data from a time-series study of human tongue microbiota, we were able to resolve within the standard definition of a ``species'' up to 20 ecologically distinct subpopulations with tag sequences differing by as little as one nucleotide (99.2% similarity). This fine resolution allowed us decouple sequence similarity from dynamical similarity, and to resolve dynamics on multiple time scales, including the slow appearance and disappearance of strains over months. Second, I will present recent results on the growth and competition of bacteria within biofilms. We imaged the growth ofliving biofilms of Vibrio choleraefrom single founder cells to ten thousand cells at single cell spatial resolution and with temporal resolution of one cell cycle. We discovered a transition from a branched 2D colony to a dense 3D cluster, in which cells at the biofilm center exhibit collective vertical alignment and local nematic packing. Our results suggest that biofilm cells exploit mechanics to simultaneously achieve strong surface adhesion, access to 3D space, resistance to invasion, and dominance over surface territory.

  8. Composition and Metabolic Activities of the Bacterial Community in Shrimp Sauce at the Flavor-Forming Stage of Fermentation As Revealed by Metatranscriptome and 16S rRNA Gene Sequencings.

    Duan, Shan; Hu, Xiaoxi; Li, Mengru; Miao, Jianyin; Du, Jinghe; Wu, Rongli

    2016-03-30

    The bacterial community and the metabolic activities involved at the flavor-forming stage during the fermentation of shrimp sauce were investigated using metatranscriptome and 16S rRNA gene sequencings. Results showed that the abundance of Tetragenococcus was 95.1%. Tetragenococcus halophilus was identified in 520 of 588 transcripts annotated in the Nr database. Activation of the citrate cycle and oxidative phosphorylation, along with the absence of lactate dehydrogenase gene expression, in T. halophilus suggests that T. halophilus probably underwent aerobic metabolism during shrimp sauce fermentation. The metabolism of amino acids, production of peptidase, and degradation of limonene and pinene were very active in T. halophilus. Carnobacterium, Pseudomonas, Escherichia, Staphylococcus, Bacillus, and Clostridium were also metabolically active, although present in very small populations. Enterococcus, Abiotrophia, Streptococcus, and Lactobacillus were detected in metatranscriptome sequencing, but not in 16S rRNA gene sequencing. Many minor taxa showed no gene expression, suggesting that they were in dormant status.

  9. Metagenome reveals potential microbial degradation of hydrocarbon coupled with sulfate reduction in an oil-immersed chimney from Guaymas Basin

    Ying eHe

    2013-06-01

    Full Text Available Deep-sea hydrothermal vent chimneys contain a high diversity of microorganisms, yet the metabolic activity and the ecological functions of the microbial communities remain largely unexplored. In this study, a metagenomic approach was applied to characterize the metabolic potential in a Guaymas hydrothermal vent chimney and to conduct comparative genomic analysis among a variety of environments with sequenced metagenomes. Complete clustering of functional gene categories with a comparative metagenomic approach showed that this Guaymas chimney metagenome was clustered most closely with a chimney metagenome from Juan de Fuca. All chimney samples were enriched with genes involved in recombination and repair, chemotaxis and flagellar assembly, highlighting their roles in coping with the fluctuating extreme deep-sea environments. A high proportion of transposases was observed in all the metagenomes from deep-sea chimneys, supporting the previous hypothesis that horizontal gene transfer may be common in the deep-sea vent chimney biosphere. In the Guaymas chimney metagenome, thermophilic sulfate reducing microorganisms including bacteria and archaea were found predominant, and genes coding for the degradation of refractory organic compounds such as cellulose, lipid, pullullan, as well as a few hydrocarbons including toluene, ethylbenzene and o-xylene were identified. Therefore, this oil-immersed chimney supported a thermophilic microbial community capable of oxidizing a range of hydrocarbons that served as electron donors for sulphate reduction under anaerobic conditions.

  10. Metagenomic Characterization of the Human Intestinal Microbiota in Fecal Samples from STEC-Infected Patients

    Gigliucci, Federica; von Meijenfeldt, F A Bastiaan; Knijn, Arnold; Michelacci, Valeria; Scavia, Gaia; Minelli, Fabio; Dutilh, Bas E|info:eu-repo/dai/nl/304546313; Ahmad, Hamideh M; Raangs, Gerwin C; Friedrich, Alex W; Rossen, John W A; Morabito, Stefano

    2018-01-01

    The human intestinal microbiota is a homeostatic ecosystem with a remarkable impact on human health and the disruption of this equilibrium leads to an increased susceptibility to infection by numerous pathogens. In this study, we used shotgun metagenomic sequencing and two different bioinformatic

  11. Metagenome sequencing of the microbial community of two Brazilian anthropogenic Amazon dark earth sites, Brazil.

    Lemos, Leandro Nascimento; de Souza, Rosineide Cardoso; de Souza Cannavan, Fabiana; Patricio, André; Pylro, Victor Satler; Hanada, Rogério Eiji; Mui, Tsai Siu

    2016-12-01

    The Anthropogenic Amazon Dark Earth soil is considered one of the world's most fertile soils. These soils differs from conventional Amazon soils because its higher organic content concentration. Here we describe the metagenome sequencing of microbial communities of two sites of Anthropogenic Amazon Dark Earth soils from Amazon Rainforest, Brazil. The raw sequence data are stored under Short Read Accession number: PRJNA344917.

  12. Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes

    Nielsen, Henrik Bjørn; Almeida, Mathieu; Juncker, Agnieszka

    2014-01-01

    of microbial genomes without the need for reference sequences. We demonstrate the method on data from 396 human gut microbiome samples and identify 7,381 co-abundance gene groups (CAGs), including 741 metagenomic species (MGS). We use these to assemble 238 high-quality microbial genomes and identify...

  13. Metabolic model for the filamentous ‘Candidatus Microthrix parvicella’ based on genomic and metagenomic analyses

    McIlroy, Simon Jon; Kristiansen, Rikke; Albertsen, Mads

    2013-01-01

    acids as triacylglycerols. Utilisation of trehalose and/or polyphosphate stores or partial oxidation of long-chain fatty acids may supply the energy required for anaerobic lipid uptake and storage. Comparing the genome sequence of this isolate with metagenomes from two full-scale wastewater treatment...

  14. From cultured to uncultured genome sequences: metagenomics and modeling microbial ecosystems.

    Garza, Daniel R; Dutilh, Bas E

    2015-11-01

    Microorganisms and the viruses that infect them are the most numerous biological entities on Earth and enclose its greatest biodiversity and genetic reservoir. With strength in their numbers, these microscopic organisms are major players in the cycles of energy and matter that sustain all life. Scientists have only scratched the surface of this vast microbial world through culture-dependent methods. Recent developments in generating metagenomes, large random samples of nucleic acid sequences isolated directly from the environment, are providing comprehensive portraits of the composition, structure, and functioning of microbial communities. Moreover, advances in metagenomic analysis have created the possibility of obtaining complete or nearly complete genome sequences from uncultured microorganisms, providing important means to study their biology, ecology, and evolution. Here we review some of the recent developments in the field of metagenomics, focusing on the discovery of genetic novelty and on methods for obtaining uncultured genome sequences, including through the recycling of previously published datasets. Moreover we discuss how metagenomics has become a core scientific tool to characterize eco-evolutionary patterns of microbial ecosystems, thus allowing us to simultaneously discover new microbes and study their natural communities. We conclude by discussing general guidelines and challenges for modeling the interactions between uncultured microorganisms and viruses based on the information contained in their genome sequences. These models will significantly advance our understanding of the functioning of microbial ecosystems and the roles of microbes in the environment.

  15. Ten years of maintaining and expanding a microbial genome and metagenome analysis system.

    Markowitz, Victor M; Chen, I-Min A; Chu, Ken; Pati, Amrita; Ivanova, Natalia N; Kyrpides, Nikos C

    2015-11-01

    Launched in March 2005, the Integrated Microbial Genomes (IMG) system is a comprehensive data management system that supports multidimensional comparative analysis of genomic data. At the core of the IMG system is a data warehouse that contains genome and metagenome datasets sequenced at the Joint Genome Institute or provided by scientific users, as well as public genome datasets available at the National Center for Biotechnology Information Genbank sequence data archive. Genomes and metagenome datasets are processed using IMG's microbial genome and metagenome sequence data processing pipelines and are integrated into the data warehouse using IMG's data integration toolkits. Microbial genome and metagenome application specific data marts and user interfaces provide access to different subsets of IMG's data and analysis toolkits. This review article revisits IMG's original aims, highlights key milestones reached by the system during the past 10 years, and discusses the main challenges faced by a rapidly expanding system, in particular the complexity of maintaining such a system in an academic setting with limited budgets and computing and data management infrastructure. Copyright © 2015 Elsevier Ltd. All rights reserved.

  16. myPhyloDB: a local web server for the storage and analysis of metagenomics data

    myPhyloDB is a user-friendly personal database with a browser-interface designed to facilitate the storage, processing, analysis, and distribution of metagenomics data. MyPhyloDB archives raw sequencing files, and allows for easy selection of project(s)/sample(s) of any combination from all availab...

  17. Estimating DNA coverage and abundance in metagenomes using a gamma approximation

    Hooper, Sean D; Dalevi, Daniel; Pati, Amrita; Mavromatis, Konstantinos; Ivanova, Natalia N; Kyrpides, Nikos C

    2010-01-01

    Shotgun sequencing generates large numbers of short DNA reads from either an isolated organism or, in the case of metagenomics projects, from the aggregate genome of a microbial community. These reads are then assembled based on overlapping sequences into larger, contiguous sequences (contigs). The feasibility of assembly and the coverage achieved (reads per nucleotide or distinct sequence of nucleotides) depend on several factors: the number of reads sequenced, the read length and the relative abundances of their source genomes in the microbial community. A low coverage suggests that most of the genomic DNA in the sample has not been sequenced, but it is often difficult to estimate either the extent of the uncaptured diversity or the amount of additional sequencing that would be most efficacious. In this work, we regard a metagenome as a population of DNA fragments (bins), each of which may be covered by one or more reads. We employ a gamma distribution to model this bin population due to its flexibility and ease of use. When a gamma approximation can be found that adequately fits the data, we may estimate the number of bins that were not sequenced and that could potentially be revealed by additional sequencing. We evaluated the performance of this model using simulated metagenomes and demonstrate its applicability on three recent metagenomic datasets.

  18. Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

    Dai, Zhimin; Guo, Xue; Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

    2014-01-01

    Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.

  19. Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

    Zhimin Dai

    Full Text Available Biological nitrogen fixation is an essential function of acid mine drainage (AMD microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.

  20. Possibilities and obstacles in recovery of genomes from elusive microbes in complex metagenomes

    Karst, Søren Michael; Albertsen, Mads; Nielsen, Jeppe Lund

    Representative genomes provide an entry point for understanding a given ecosystem. The genomes themselves give insights in the metabolic potential and possible role of the bacteria in the ecosystem, as well as being essential when applying other omics based techniques. Metagenomics and single cel...

  1. Resolving the Complexity of Human Skin Metagenomes Using Single-Molecule Sequencing

    Yu-Chih Tsai

    2016-02-01

    Full Text Available Deep metagenomic shotgun sequencing has emerged as a powerful tool to interrogate composition and function of complex microbial communities. Computational approaches to assemble genome fragments have been demonstrated to be an effective tool for de novo reconstruction of genomes from these communities. However, the resultant “genomes” are typically fragmented and incomplete due to the limited ability of short-read sequence data to assemble complex or low-coverage regions. Here, we use single-molecule, real-time (SMRT sequencing to reconstruct a high-quality, closed genome of a previously uncharacterized Corynebacterium simulans and its companion bacteriophage from a skin metagenomic sample. Considerable improvement in assembly quality occurs in hybrid approaches incorporating short-read data, with even relatively small amounts of long-read data being sufficient to improve metagenome reconstruction. Using short-read data to evaluate strain variation of this C. simulans in its skin community at single-nucleotide resolution, we observed a dominant C. simulans strain with moderate allelic heterozygosity throughout the population. We demonstrate the utility of SMRT sequencing and hybrid approaches in metagenome quantitation, reconstruction, and annotation.

  2. Resolving the Complexity of Human Skin Metagenomes Using Single-Molecule Sequencing

    Tsai, Yu-Chih; Deming, Clayton; Segre, Julia A.; Kong, Heidi H.; Korlach, Jonas

    2016-01-01

    ABSTRACT Deep metagenomic shotgun sequencing has emerged as a powerful tool to interrogate composition and function of complex microbial communities. Computational approaches to assemble genome fragments have been demonstrated to be an effective tool for de novo reconstruction of genomes from these communities. However, the resultant “genomes” are typically fragmented and incomplete due to the limited ability of short-read sequence data to assemble complex or low-coverage regions. Here, we use single-molecule, real-time (SMRT) sequencing to reconstruct a high-quality, closed genome of a previously uncharacterized Corynebacterium simulans and its companion bacteriophage from a skin metagenomic sample. Considerable improvement in assembly quality occurs in hybrid approaches incorporating short-read data, with even relatively small amounts of long-read data being sufficient to improve metagenome reconstruction. Using short-read data to evaluate strain variation of this C. simulans in its skin community at single-nucleotide resolution, we observed a dominant C. simulans strain with moderate allelic heterozygosity throughout the population. We demonstrate the utility of SMRT sequencing and hybrid approaches in metagenome quantitation, reconstruction, and annotation. PMID:26861018

  3. Enrichment allows identification of diverse, rare elements in metagenomic resistome-virulome sequencing.

    Noyes, Noelle R; Weinroth, Maggie E; Parker, Jennifer K; Dean, Chris J; Lakin, Steven M; Raymond, Robert A; Rovira, Pablo; Doster, Enrique; Abdo, Zaid; Martin, Jennifer N; Jones, Kenneth L; Ruiz, Jaime; Boucher, Christina A; Belk, Keith E; Morley, Paul S

    2017-10-17

    Shotgun metagenomic sequencing is increasingly utilized as a tool to evaluate ecological-level dynamics of antimicrobial resistance and virulence, in conjunction with microbiome analysis. Interest in use of this method for environmental surveillance of antimicrobial resistance and pathogenic microorganisms is also increasing. In published metagenomic datasets, the total of all resistance- and virulence-related sequences accounts for enrichment system that incorporates unique molecular indices to count DNA molecules and correct for enrichment bias. The use of the bait-capture and enrichment system significantly increased on-target sequencing of the resistome-virulome, enabling detection of an additional 1441 gene accessions and revealing a low-abundance portion of the resistome-virulome that was more diverse and compositionally different than that detected by more traditional metagenomic assays. The low-abundance portion of the resistome-virulome also contained resistance genes with public health importance, such as extended-spectrum betalactamases, that were not detected using traditional shotgun metagenomic sequencing. In addition, the use of the bait-capture and enrichment system enabled identification of rare resistance gene haplotypes that were used to discriminate between sample origins. These results demonstrate that the rare resistome-virulome contains valuable and unique information that can be utilized for both surveillance and population genetic investigations of resistance. Access to the rare resistome-virulome using the bait-capture and enrichment system validated in this study can greatly advance our understanding of microbiome-resistome dynamics.

  4. Metagenomic data of fungal internal transcribed spacer from serofluid dish, a traditional Chinese fermented food

    Peng Chen

    2016-03-01

    Full Text Available Serofluid dish (or Jiangshui, in Chinese, a traditional food in the Chinese culture for thousands of years, is made from vegetables by fermentation. In this work, microorganism community of the fermented serofluid dish was investigated by the culture-independent method. The metagenomic data in this article contains the sequences of fungal internal transcribed spacer (ITS regions of rRNA genes from 12 different serofluid dish samples. The metagenome comprised of 50,865 average raw reads with an average of 8,958,220 bp and G + C content is 45.62%. This is the first report on metagenomic data of fungal ITS from serofluid dish employing Illumina platform to profile the fungal communities of this little known fermented food from Gansu Province, China. The Metagenomic data of fungal internal transcribed spacer can be accessed at NCBI, SRA database accession no. SRP067411. Keywords: Serofluid dish, Jiangshui, Fungal ITS, Cultivation-independent, Microbial diversity

  5. IDENTIFICATION OF CHICKEN-SPECIFIC FECAL MICROBIAL SEQUENCES USING A METAGENOMIC APPROACH

    In this study, we applied a genome fragment enrichment (GFE) method to select for genomic regions that differ between different fecal metagenomes. Competitive DNA hybridizations were performed between chicken fecal DNA and pig fecal DNA (C-P) and between chicken fecal DNA and an ...

  6. Evaluation of a pooled strategy for high-throughput sequencing of cosmid clones from metagenomic libraries.

    Lam, Kathy N; Hall, Michael W; Engel, Katja; Vey, Gregory; Cheng, Jiujun; Neufeld, Josh D; Charles, Trevor C

    2014-01-01

    High-throughput sequencing methods have been instrumental in the growing field of metagenomics, with technological improvements enabling greater throughput at decreased costs. Nonetheless, the economy of high-throughput sequencing cannot be fully leveraged in the subdiscipline of functional metagenomics. In this area of research, environmental DNA is typically cloned to generate large-insert libraries from which individual clones are isolated, based on specific activities of interest. Sequence data are required for complete characterization of such clones, but the sequencing of a large set of clones requires individual barcode-based sample preparation; this can become costly, as the cost of clone barcoding scales linearly with the number of clones processed, and thus sequencing a large number of metagenomic clones often remains cost-prohibitive. We investigated a hybrid Sanger/Illumina pooled sequencing strategy that omits barcoding altogether, and we evaluated this strategy by comparing the pooled sequencing results to reference sequence data obtained from traditional barcode-based sequencing of the same set of clones. Using identity and coverage metrics in our evaluation, we show that pooled sequencing can generate high-quality sequence data, without producing problematic chimeras. Though caveats of a pooled strategy exist and further optimization of the method is required to improve recovery of complete clone sequences and to avoid circumstances that generate unrecoverable clone sequences, our results demonstrate that pooled sequencing represents an effective and low-cost alternative for sequencing large sets of metagenomic clones.

  7. Metagenomic analysis of bacterial community structure and diversity of lignocellulolytic bacteria in Vietnamese native goat rumen

    Do, Huyen Thi; Dao, Khoa Trong; Nguyen, Viet Khanh Hoang; Le Ngoc, Giang; Nguyen, Phuong Thi Mai; Le, Lam Tung; Phung, Nguyet Thu; M. van Straalen, Nico; Roelofs, Dick; Truong, Hai Nam

    2017-01-01

    Objective: In a previous study, analysis of Illumina sequenced metagenomic DNA data of bacteria in Vietnamese goats' rumen showed a high diversity of putative lignocellulolytic genes. In this study, taxonomy speculation of microbial community and lignocellulolytic bacteria population in the rumen

  8. Validation of Metagenomic Next-Generation Sequencing Tests for Universal Pathogen Detection.

    Schlaberg, Robert; Chiu, Charles Y; Miller, Steve; Procop, Gary W; Weinstock, George

    2017-06-01

    - Metagenomic sequencing can be used for detection of any pathogens using unbiased, shotgun next-generation sequencing (NGS), without the need for sequence-specific amplification. Proof-of-concept has been demonstrated in infectious disease outbreaks of unknown causes and in patients with suspected infections but negative results for conventional tests. Metagenomic NGS tests hold great promise to improve infectious disease diagnostics, especially in immunocompromised and critically ill patients. - To discuss challenges and provide example solutions for validating metagenomic pathogen detection tests in clinical laboratories. A summary of current regulatory requirements, largely based on prior guidance for NGS testing in constitutional genetics and oncology, is provided. - Examples from 2 separate validation studies are provided for steps from assay design, and validation of wet bench and bioinformatics protocols, to quality control and assurance. - Although laboratory and data analysis workflows are still complex, metagenomic NGS tests for infectious diseases are increasingly being validated in clinical laboratories. Many parallels exist to NGS tests in other fields. Nevertheless, specimen preparation, rapidly evolving data analysis algorithms, and incomplete reference sequence databases are idiosyncratic to the field of microbiology and often overlooked.

  9. Insights into resistome and stress responses genes in Bubalus bubalis rumen through metagenomic analysis.

    Reddy, Bhaskar; Singh, Krishna M; Patel, Amrutlal K; Antony, Ancy; Panchasara, Harshad J; Joshi, Chaitanya G

    2014-10-01

    Buffalo rumen microbiota experience variety of diets and represents a huge reservoir of mobilome, resistome and stress responses. However, knowledge of metagenomic responses to such conditions is still rudimentary. We analyzed the metagenomes of buffalo rumen in the liquid and solid phase of the rumen biomaterial from river buffalo adapted to varying proportion of concentrate to green or dry roughages, using high-throughput sequencing to know the occurrence of antibiotics resistance genes, genetic exchange between bacterial population and environmental reservoirs. A total of 3914.94 MB data were generated from all three treatments group. The data were analysed with Metagenome rapid annotation system tools. At phyla level, Bacteroidetes were dominant in all the treatments followed by Firmicutes. Genes coding for functional responses to stress (oxidative stress and heat shock proteins) and resistome genes (resistance to antibiotics and toxic compounds, phages, transposable elements and pathogenicity islands) were prevalent in similar proportion in liquid and solid fraction of rumen metagenomes. The fluoroquinolone resistance, MDR efflux pumps and Methicillin resistance genes were broadly distributed across 11, 9, and 14 bacterial classes, respectively. Bacteria responsible for phages replication and prophages and phage packaging and rlt-like streptococcal phage genes were mostly assigned to phyla Bacteroides, Firmicutes and proteaobacteria. Also, more reads matching the sigma B genes were identified in the buffalo rumen. This study underscores the presence of diverse mechanisms of adaptation to different diet, antibiotics and other stresses in buffalo rumen, reflecting the proportional representation of major bacterial groups.

  10. Rhizosphere microbiome metagenomics of gray mangroves (Avicennia marina) in the Red Sea

    Alzubaidy, Hanin S.; Essack, Magbubah; Malas, Tareq Majed Yasin; Bokhari, Ameerah; Motwalli, Olaa Amin; Kamanu, Frederick Kinyua; Jamhor, Suhaiza; Mokhtar, Noor Azlin; Antunes, Andre; Simoes, Marta; Alam, Intikhab; Bougouffa, Salim; Lafi, Feras Fawzi; Bajic, Vladimir B.; Archer, John A.C.

    2015-01-01

    To our knowledge, this is the first metagenomic study on the microbiome of mangroves in the Red Sea, and the first application of unbiased 454-pyrosequencing to study the rhizosphere microbiome associated with A. marina. Our results provide the first insights into the range of functions and microbial diversity in the rhizosphere and soil sediments of gray mangrove (A. marina) in the Red Sea.

  11. Identification of Nitrogen-Fixing Genes and Gene Clusters from Metagenomic Library of Acid Mine Drainage

    Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

    2014-01-01

    Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community. PMID:24498417

  12. Diversity Indices as Measures of Functional Annotation Methods in Metagenomics Studies

    Jankovic, Boris R.

    2016-01-01

    in the ecosystems and species diversity studies can be successfully used in evaluating certain aspects of the methods employed in metagenomics studies. We show that when applying the concept of Hill’s diversity, the analysis of variations in the diversity order

  13. Metagenome Analyses of Corroded Concrete Wastewater Pipe Biofilms Reveals a Complex Microbial System

    Analysis of whole-metagenome pyrosequencing data and 16S rRNA gene clone libraries was used to determine microbial composition and functional genes associated with biomass harvested from crown (top) and invert (bottom) sections of a corroded wastewater pipe. Taxonomic and functio...

  14. Diagnosis of Fatal Human Case of St. Louis Encephalitis Virus Infection by Metagenomic Sequencing, California, 2016.

    Chiu, Charles Y; Coffey, Lark L; Murkey, Jamie; Symmes, Kelly; Sample, Hannah A; Wilson, Michael R; Naccache, Samia N; Arevalo, Shaun; Somasekar, Sneha; Federman, Scot; Stryke, Doug; Vespa, Paul; Schiller, Gary; Messenger, Sharon; Humphries, Romney; Miller, Steve; Klausner, Jeffrey D

    2017-10-01

    We used unbiased metagenomic next-generation sequencing to diagnose a fatal case of meningoencephalitis caused by St. Louis encephalitis virus in a patient from California in September 2016. This case is associated with the recent 2015-2016 reemergence of this virus in the southwestern United States.

  15. Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome

    Pinto, Ameet J.; Sharp, Jonathan O.; Yoder, Michael J.; Almstrand, Robert

    2016-01-01

    Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome.

  16. Draft Genome Sequence of a Novel Desulfobacteraceae Member from a Sulfate-Reducing Bioreactor Metagenome

    Almstrand, Robert; Pinto, Ameet J.; Figueroa, Linda A.; Sharp, Jonathan O.

    2016-01-01

    Sulfate-reducing bacteria are important players in the global sulfur cycle and of considerable commercial interest. The draft genome sequence of a sulfate-reducing bacterium of the family Desulfobacteraceae, assembled from a sulfate-reducing bioreactor metagenome, indicates that heavy-metal? and acid-resistance traits of this organism may be of importance for its application in acid mine drainage mitigation.

  17. Identification of a novel bat papillomavirus by metagenomics.

    Herman Tse

    Full Text Available The discovery of novel viruses in animals expands our knowledge of viral diversity and potentially emerging zoonoses. High-throughput sequencing (HTS technology gives millions or even billions of sequence reads per run, allowing a comprehensive survey of the genetic content within a sample without prior nucleic acid amplification. In this study, we screened 156 rectal swab samples from apparently healthy bats (n = 96, pigs (n = 9, cattles (n = 9, stray dogs (n = 11, stray cats (n = 11 and monkeys (n = 20 using a HTS metagenomics approach. The complete genome of a novel papillomavirus (PV, Miniopterus schreibersii papillomavirus type 1 (MscPV1, with L1 of 60% nucleotide identity to Canine papillomavirus (CPV6, was identified in a specimen from a Common Bent-wing Bat (M. schreibersii. It is about 7.5kb in length, with a G+C content of 45.8% and a genomic organization similar to that of other PVs. Despite the higher nucleotide identity between the genomes of MscPV1 and CPV6, maximum-likelihood phylogenetic analysis of the L1 gene sequence showed that MscPV1 and Erethizon dorsatum papillomavirus (EdPV1 are most closely related. Estimated divergence time of MscPV1 from the EdPV1/MscPV1 common ancestor was approximately 60.2-91.9 millions of years ago, inferred under strict clocks using the L1 and E1 genes. The estimates were limited by the lack of reliable calibration points from co-divergence because of possible host shifts. As the nucleotide sequence of this virus only showed limited similarity with that of related animal PVs, the conventional approach of PCR using consensus primers would be unlikely to have detected the novel virus in the sample. Unlike the first bat papillomavirus RaPV1, MscPV1 was found in an asymptomatic bat with no apparent mucosal or skin lesions whereas RaPV1 was detected in the basosquamous carcinoma of a fruit bat Rousettus aegyptiacus. We propose MscPV1 as the first member of the novel Dyolambda-papillomavirus genus.

  18. Metagenomic analysis of the microbiomes in ruminants and other herbivores

    Morrison, M.; Adams, S.E.; Nelson, K.E.; Attwood, G.T.

    2005-01-01

    Many conceptual breakthroughs in the life sciences would not have been possible without first developing techniques and instrumentation to investigate biological processes and molecules. In 1995, The Institute for Genomic Research (TIGR) completely sequenced, assembled and published the fist genome of a free-living organism, that of Haemophilus influenzae Rd. This milestone in scientific achievement has allowed microbiologists to progress from a reductionist approach of studying one gene at a time to the examination of microbial biology from an organismal perspective, using a combination of existing and newly developed (bio)chemical and computational (in silico) approaches. These fields of investigation are often defined with an 'omics' suffix. Hence, genomics refers to the holistic examination of the genetic blueprint that a microbe has acquired, at that point in evolutionary time, to support its lifestyle. Transcriptomics, proteomics and metabolomics refer to a similar level of analysis at the RNA, protein and metabolite levels, respectively. Furthermore, the latest advances in sequencing technologies and cloning vectors better enable a detailed examination of the structure and function of microbial communities, including those organisms that cannot readily be cultured, and we refer to the integrative use of the following methods as the basis of an emerging scientific discipline referred to as metagenomics: 1. Bacterial artificial chromosome and fosmid cloning technologies: Community genomic DNA is cloned in large fragments (>50-150 kilobases [kb]) to create libraries of bacterial artificial chromosomes (BACs), or smaller fragments (∼40 kb) are cloned into fosmid vectors. These libraries can then be screened by DNA- and activity-based screens for genes encoding any number of particular functions including hydrolytic and other enzymes central to schemes of carbon sequestration. 2. High throughput DNA sequencing and bioinformatics: Both BAC and fosmid libraries

  19. The binning of metagenomic contigs for microbial physiology of mixed cultures.

    Strous, Marc; Kraft, Beate; Bisdorf, Regina; Tegetmeyer, Halina E

    2012-01-01

    So far, microbial physiology has dedicated itself mainly to pure cultures. In nature, cross feeding and competition are important aspects of microbial physiology and these can only be addressed by studying complete communities such as enrichment cultures. Metagenomic sequencing is a powerful tool to characterize such mixed cultures. In the analysis of metagenomic data, well established algorithms exist for the assembly of short reads into contigs and for the annotation of predicted genes. However, the binning of the assembled contigs or unassembled reads is still a major bottleneck and required to understand how the overall metabolism is partitioned over different community members. Binning consists of the clustering of contigs or reads that apparently originate from the same source population. In the present study eight metagenomic samples from the same habitat, a laboratory enrichment culture, were sequenced. Each sample contained 13-23 Mb of assembled contigs and up to eight abundant populations. Binning was attempted with existing methods but they were found to produce poor results, were slow, dependent on non-standard platforms or produced errors. A new binning procedure was developed based on multivariate statistics of tetranucleotide frequencies combined with the use of interpolated Markov models. Its performance was evaluated by comparison of the results between samples with BLAST and in comparison to existing algorithms for four publicly available metagenomes and one previously published artificial metagenome. The accuracy of the new approach was comparable or higher than existing methods. Further, it was up to a 100 times faster. It was implemented in Java Swing as a complete open source graphical binning application available for download and further development (http://sourceforge.net/projects/metawatt).

  20. The binning of metagenomic contigs for microbial physiology of mixed cultures

    Marc eStrous

    2012-12-01

    Full Text Available So far, microbial physiology has dedicated itself mainly to pure cultures. In nature, cross feeding and competition are important aspects of microbial physiology and these can only be addressed by studying complete communities such as enrichment cultures. Metagenomic sequencing is a powerful tool to characterize such mixed cultures. In the analysis of metagenomic data, well established algorithms exist for the assembly of short reads into contigs and for the annotation of predicted genes. However, the binning of the assembled contigs or unassembled reads is still a major bottleneck and required to understand how the overall metabolism is partitioned over different community members. Binning consists of the clustering of contigs or reads that apparently originate from the same source population.In the present study eight metagenomic samples originating from the same habitat, a laboratory enrichment culture, were sequenced. Each sample contained 13-23 Mb of assembled contigs and up to eight abundant populations. Binning was attempted with existing methods but they were found to produce poor results, were slow, dependent on non-standard platforms or produced errors. A new binning procedure was developed based on multivariate statistics of tetranucleotide frequencies combined with the use of interpolated Markov models. Its performance was evaluated by comparison of the results between samples with BLAST and in comparison to exisiting algorithms for four publicly available metagenomes and one previously published artificial metagenome. The accuracy of the new approach was comparable or higher than existing methods. Further, it was up to a hunderd times faster. It was implemented in Java Swing as a complete open source graphical binning application available for download and further development (http://sourceforge.net/projects/metawatt.

  1. A metagenomic approach to decipher the indigenous microbial communities of arsenic contaminated groundwater of Assam

    Saurav Das

    2017-06-01

    Full Text Available Metagenomic approach was used to understand the structural and functional diversity present in arsenic contaminated groundwater of the Ganges Brahmaputra Delta aquifer system. A metagene dataset (coded as TTGW1 of 89,171 sequences (totaling 125,449,864 base pairs with an average length of 1406 bps was annotated. About 74,478 sequences containing 101,948 predicted protein coding regions passed the quality control. Taxonomical classification revealed abundance of bacteria that accounted for 98.3% of the microbial population of the metagenome. Eukaryota had an abundance of 1.1% followed by archea that showed 0.4% abundance. In phylum based classification, Proteobacteria was dominant (62.6% followed by Bacteroidetes (11.7%, Planctomycetes (7.7%, Verrucomicrobia (5.6%, Actinobacteria (3.7% and Firmicutes (1.9%. The Clusters of Orthologous Groups (COGs analysis indicated that the protein regulating the metabolic functions constituted a high percentage (18,199 reads; 39.3% of the whole metagenome followed by the proteins regulating the cellular processes (22.3%. About 0.07% sequences of the whole metagenome were related to genes coding for arsenic resistant mechanisms. Nearly 50% sequences of these coded for the arsenate reductase enzyme (EC. 1.20.4.1, the dominant enzyme of ars operon. Proteins associated with iron acquisition and metabolism were coded by 2% of the metagenome as revealed through SEED analysis. Our study reveals the microbial diversity and provides an insight into the functional aspect of the genes that might play crucial role in arsenic geocycle in contaminated ground water of Assam.

  2. Metagenomic analyses of bacteria on human hairs: a qualitative assessment for applications in forensic science.

    Tridico, Silvana R; Murray, Dáithí C; Addison, Jayne; Kirkbride, Kenneth P; Bunce, Michael

    2014-01-01

    Mammalian hairs are one of the most ubiquitous types of trace evidence collected in the course of forensic investigations. However, hairs that are naturally shed or that lack roots are problematic substrates for DNA profiling; these hair types often contain insufficient nuclear DNA to yield short tandem repeat (STR) profiles. Whilst there have been a number of initial investigations evaluating the value of metagenomics analyses for forensic applications (e.g. examination of computer keyboards), there have been no metagenomic evaluations of human hairs-a substrate commonly encountered during forensic practice. This present study attempts to address this forensic capability gap, by conducting a qualitative assessment into the applicability of metagenomic analyses of human scalp and pubic hair. Forty-two DNA extracts obtained from human scalp and pubic hairs generated a total of 79,766 reads, yielding 39,814 reads post control and abundance filtering. The results revealed the presence of unique combinations of microbial taxa that can enable discrimination between individuals and signature taxa indigenous to female pubic hairs. Microbial data from a single co-habiting couple added an extra dimension to the study by suggesting that metagenomic analyses might be of evidentiary value in sexual assault cases when other associative evidence is not present. Of all the data generated in this study, the next-generation sequencing (NGS) data generated from pubic hair held the most potential for forensic applications. Metagenomic analyses of human hairs may provide independent data to augment other forensic results and possibly provide association between victims of sexual assault and offender when other associative evidence is absent. Based on results garnered in the present study, we believe that with further development, bacterial profiling of hair will become a valuable addition to the forensic toolkit.

  3. Biotechnological applications of functional metagenomics in the food and pharmaceutical industries.

    Coughlan, Laura M; Cotter, Paul D; Hill, Colin; Alvarez-Ordóñez, Avelino

    2015-01-01

    Microorganisms are found throughout nature, thriving in a vast range of environmental conditions. The majority of them are unculturable or difficult to culture by traditional methods. Metagenomics enables the study of all microorganisms, regardless of whether they can be cultured or not, through the analysis of genomic data obtained directly from an environmental sample, providing knowledge of the species present, and allowing the extraction of information regarding the functionality of microbial communities in their natural habitat. Function-based screenings, following the cloning and expression of metagenomic DNA in a heterologous host, can be applied to the discovery of novel proteins of industrial interest encoded by the genes of previously inaccessible microorganisms. Functional metagenomics has considerable potential in the food and pharmaceutical industries, where it can, for instance, aid (i) the identification of enzymes with desirable technological properties, capable of catalyzing novel reactions or replacing existing chemically synthesized catalysts which may be difficult or expensive to produce, and able to work under a wide range of environmental conditions encountered in food and pharmaceutical processing cycles including extreme conditions of temperature, pH, osmolarity, etc; (ii) the discovery of novel bioactives including antimicrobials active against microorganisms of concern both in food and medical settings; (iii) the investigation of industrial and societal issues such as antibiotic resistance development. This review article summarizes the state-of-the-art functional metagenomic methods available and discusses the potential of functional metagenomic approaches to mine as yet unexplored environments to discover novel genes with biotechnological application in the food and pharmaceutical industries.

  4. Variability in metagenomic samples from the Puget Sound: Relationship to temporal and anthropogenic impacts.

    James C Wallace

    Full Text Available Whole-metagenome sequencing (WMS has emerged as a powerful tool to assess potential public health risks in marine environments by measuring changes in microbial community structure and function in uncultured bacteria. In addition to monitoring public health risks such as antibiotic resistance determinants, it is essential to measure predictors of microbial variation in order to identify natural versus anthropogenic factors as well as to evaluate reproducibility of metagenomic measurements.This study expands our previous metagenomic characterization of Puget Sound by sampling new nearshore environments including the Duwamish River, an EPA superfund site, and the Hood Canal, an area characterized by highly variable oxygen levels. We also resampled a wastewater treatment plant, nearshore and open ocean sites introducing a longitudinal component measuring seasonal and locational variations and establishing metagenomics sampling reproducibility. Microbial composition from samples collected in the open sound were highly similar within the same season and location across different years, while nearshore samples revealed multi-fold seasonal variation in microbial composition and diversity. Comparisons with recently sequenced predominant marine bacterial genomes helped provide much greater species level taxonomic detail compared to our previous study. Antibiotic resistance determinants and pollution and detoxification indicators largely grouped by location showing minor seasonal differences. Metal resistance, oxidative stress and detoxification systems showed no increase in samples proximal to an EPA superfund site indicating a lack of ecosystem adaptation to anthropogenic impacts. Taxonomic analysis of common sewage influent families showed a surprising similarity between wastewater treatment plant and open sound samples suggesting a low-level but pervasive sewage influent signature in Puget Sound surface waters. Our study shows reproducibility of

  5. Exploration of soil metagenome diversity for prospection of enzymes involved in lignocellulosic biomass conversion

    Alvarez, T.M.; Squina, F.M. [Laboratorio Nacional de Luz Sincrotron (LNLS), Campinas, SP (Brazil); Paixao, D.A.A.; Franco Cairo, J.P.L.; Buchli, F.; Ruller, R. [Laboratorio Nacional de Ciencia e Tecnologia do Bioetanol (CTBE), Campinas, SP (Brazil); Prade, R. [Oklahoma State University, Sillwater, OK (United States)

    2012-07-01

    Full text: Metagenomics allows access to genetic information encoded in DNA of microorganisms recalcitrant to cultivation. They represent a reservoir of novel biocatalyst with potential application in environmental friendly techniques aiming to overcome the dependence on fossil fuels and also to diminish air and water pollution. The focus of our work is the generation of a tool kit of lignocellulolytic enzymes from soil metagenome, which could be used for second generation ethanol production. Environmental samples were collected at a sugarcane field after harvesting, where it is expected that the microbial population involved on lignocellulose degradation was enriched due to the presence of straws covering the soil. Sugarcane Bagasse-Degrading-Soil (SBDS) metagenome was massively-parallel-454-Roche-sequenced. We identified a full repertoire of genes with significant match to glycosyl hydrolases catalytic domain and carbohydrate-binding modules. Soil metagenomics libraries cloned into pUC19 were screened through functional assays. CMC-agar screening resulted in positive clones, revealing new cellulases coding genes. Through a CMC-zymogram it was possible to observe that one of these genes, nominated as E-1, corresponds to an enzyme that is secreted to the extracellular medium, suggesting that the cloned gene carried the original signal peptide. Enzymatic assays and analysis through capillary electrophoresis showed that E-1 was able to cleave internal glycosidic bonds of cellulose. New rounds of functional screenings through chromogenic substrates are being conducted aiming the generation of a library of lignocellulolytic enzymes derived from soil metagenome, which may become key component for development of second generation biofuels. (author)

  6. Potential and pitfalls of eukaryotic metagenome skimming: a test case for lichens.

    Greshake, Bastian; Zehr, Simonida; Dal Grande, Francesco; Meiser, Anjuli; Schmitt, Imke; Ebersberger, Ingo

    2016-03-01

    Whole-genome shotgun sequencing of multispecies communities using only a single library layout is commonly used to assess taxonomic and functional diversity of microbial assemblages. Here, we investigate to what extent such metagenome skimming approaches are applicable for in-depth genomic characterizations of eukaryotic communities, for example lichens. We address how to best assemble a particular eukaryotic metagenome skimming data, what pitfalls can occur, and what genome quality can be expected from these data. To facilitate a project-specific benchmarking, we introduce the concept of twin sets, simulated data resembling the outcome of a particular metagenome sequencing study. We show that the quality of genome reconstructions depends essentially on assembler choice. Individual tools, including the metagenome assemblers Omega and MetaVelvet, are surprisingly sensitive to low and uneven coverages. In combination with the routine of assembly parameter choice to optimize the assembly N50 size, these tools can preclude an entire genome from the assembly. In contrast, MIRA, an all-purpose overlap assembler, and SPAdes, a multisized de Bruijn graph assembler, facilitate a comprehensive view on the individual genomes across a wide range of coverage ratios. Testing assemblers on a real-world metagenome skimming data from the lichen Lasallia pustulata demonstrates the applicability of twin sets for guiding method selection. Furthermore, it reveals that the assembly outcome for the photobiont Trebouxia sp. falls behind the a priori expectation given the simulations. Although the underlying reasons remain still unclear, this highlights that further studies on this organism require special attention during sequence data generation and downstream analysis. © 2015 John Wiley & Sons Ltd.

  7. Biotechnological applications of functional metagenomics in the food and pharmaceutical industries

    Laura M Coughlan

    2015-06-01

    Full Text Available Microorganisms are found throughout nature, thriving in a vast range of environmental conditions. The majority of them are unculturable or difficult to culture by traditional methods. Metagenomics enables the study of all microorganisms, regardless of whether they can be cultured or not, through the analysis of genomic data obtained directly from an environmental sample, providing knowledge of the species present and allowing the extraction of information regarding the functionality of microbial communities in their natural habitat. Function-based screenings, following the cloning and expression of metagenomic DNA in a heterologous host, can be applied to the discovery of novel proteins of industrial interest encoded by the genes of previously inaccessible microorganisms. Functional metagenomics has considerable potential in the food and pharmaceutical industries, where it can, for instance, aid (i the identification of enzymes with desirable technological properties, capable of catalysing novel reactions or replacing existing chemically synthesized catalysts which may be difficult or expensive to produce, and able to work under a wide range of environmental conditions encountered in food and pharmaceutical processing cycles including extreme conditions of temperature, pH, osmolarity, etc; (ii the discovery of novel bioactives including antimicrobials active against microorganisms of concern both in food and medical settings; (iii the investigation of industrial and societal issues such as antibiotic resistance development. This review article summarizes the state-of-the-art functional metagenomic methods available and discusses the potential of functional metagenomic approaches to mine as yet unexplored environments to discover novel genes with biotechnological application in the food and pharmaceutical industries.

  8. Genome diversity of marine phages recovered from Mediterranean metagenomes: Size matters.

    Mario López-Pérez

    2017-09-01

    Full Text Available Marine viruses play a critical role not only in the global geochemical cycles but also in the biology and evolution of their hosts. Despite their importance, viral diversity remains underexplored mostly due to sampling and cultivation challenges. Direct sequencing approaches such as viromics has provided new insights into the marine viral world. As a complementary approach, we analysed 24 microbial metagenomes (>0.2 μm size range obtained from six sites in the Mediterranean Sea that vary by depth, season and filter used to retrieve the fraction. Filter-size comparison showed a significant number of viral sequences that were retained on the larger-pore filters and were different from those found in the viral fraction from the same sample, indicating that some important viral information is missing using only assembly from viromes. Besides, we were able to describe 1,323 viral genomic fragments that were more than 10Kb in length, of which 36 represented complete viral genomes including some of them retrieved from a cross-assembly from different metagenomes. Host prediction based on sequence methods revealed new phage groups belonging to marine prokaryotes like SAR11, Cyanobacteria or SAR116. We also identified the first complete virophage from deep seawater and a new endemic clade of the recently discovered Marine group II Euryarchaeota virus. Furthermore, analysis of viral distribution using metagenomes and viromes indicated that most of the new phages were found exclusively in the Mediterranean Sea and some of them, mostly the ones recovered from deep metagenomes, do not recruit in any database probably indicating higher variability and endemicity in Mediterranean bathypelagic waters. Together these data provide the first detailed picture of genomic diversity, spatial and depth variations of viral communities within the Mediterranean Sea using metagenome assembly.

  9. Construction of a dairy microbial genome catalog opens new perspectives for the metagenomic analysis of dairy fermented products

    Almeida, Mathieu; Hebert, Agnes; Abraham, Anne-Laure

    2014-01-01

    Background: Microbial communities of traditional cheeses are complex and insufficiently characterized. The origin, safety and functional role in cheese making of these microbial communities are still not well understood. Metagenomic analysis of these communities by high throughput shotgun sequenc...

  10. An enrichment of CRISPR and other defense-related features in marine sponge-associated microbial metagenomes

    Hannes Horn

    2016-11-01

    Full Text Available Many marine sponges are populated by dense and taxonomically diverse microbial consortia. We employed a metagenomics approach to unravel the differences in the functional gene repertoire among three Mediterranean sponge species, Petrosia ficiformis, Sarcotragus foetidus, Aplysina aerophoba and seawater. Different signatures were observed between sponge and seawater metagenomes with regard to microbial community composition, GC content, and estimated bacterial genome size. Our analysis showed further a pronounced repertoire for defense systems in sponge metagenomes. Specifically, Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR, restriction modification, DNA phosphorothioation and phage growth limitation systems were enriched in sponge metagenomes. These data suggest that defense is an important functional trait for an existence within sponges that requires mechanisms to defend against foreign DNA from microorganisms and viruses. This study contributes to an understanding of the evolutionary arms race between viruses/phages and bacterial genomes and it sheds light on the bacterial defenses that have evolved in the context of the sponge holobiont.

  11. Scalability of Comparative Analysis, Novel Algorithms and Tools (MICW - Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Mavrommatis, Kostas

    2011-10-12

    DOE JGI's Kostas Mavrommatis, chair of the Scalability of Comparative Analysis, Novel Algorithms and Tools panel, at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  12. Genome Assembly Forensics: Metrics for Assessing Assembly Correctness (Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Pop, Mihai

    2011-10-13

    University of Maryland's Mihai Pop on Genome Assembly Forensics: Metrics for Assessing Assembly Correctness at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  13. MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets

    Wu, Yu-Wei [Joint BioEnergy Inst. (JBEI), Emeryville, CA (United States); Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Simmons, Blake A. [Joint BioEnergy Inst. (JBEI), Emeryville, CA (United States); Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Singer, Steven W. [Joint BioEnergy Inst. (JBEI), Emeryville, CA (United States); Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)

    2015-10-29

    The recovery of genomes from metagenomic datasets is a critical step to defining the functional roles of the underlying uncultivated populations. We previously developed MaxBin, an automated binning approach for high-throughput recovery of microbial genomes from metagenomes. Here, we present an expanded binning algorithm, MaxBin 2.0, which recovers genomes from co-assembly of a collection of metagenomic datasets. Tests on simulated datasets revealed that MaxBin 2.0 is highly accurate in recovering individual genomes, and the application of MaxBin 2.0 to several metagenomes from environmental samples demonstrated that it could achieve two complementary goals: recovering more bacterial genomes compared to binning a single sample as well as comparing the microbial community composition between different sampling environments. Availability and implementation: MaxBin 2.0 is freely available at http://sourceforge.net/projects/maxbin/ under BSD license. Supplementary information: Supplementary data are available at Bioinformatics online.

  14. Memory Efficient Sequence Analysis Using Compressed Data Structures (Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Simpson, Jared

    2011-10-13

    Wellcome Trust Sanger Institute's Jared Simpson on Memory efficient sequence analysis using compressed data structures at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  15. Sequencing Single Cell Microbial Genomes with Microfluidic Amplifications Tools (MICW - Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Quake, Steve

    2011-10-12

    Stanford University's Steve Quake on "Sequencing Single Cell Microbial Genomes with Microfluidic Amplification Tools" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  16. Functional Metagenomics: Construction and High-Throughput Screening of Fosmid Libraries for Discovery of Novel Carbohydrate-Active Enzymes.

    Ufarté, Lisa; Bozonnet, Sophie; Laville, Elisabeth; Cecchini, Davide A; Pizzut-Serin, Sandra; Jacquiod, Samuel; Demanèche, Sandrine; Simonet, Pascal; Franqueville, Laure; Veronese, Gabrielle Potocki

    2016-01-01

    Activity-based metagenomics is one of the most efficient approaches to boost the discovery of novel biocatalysts from the huge reservoir of uncultivated bacteria. In this chapter, we describe a highly generic procedure of metagenomic library construction and high-throughput screening for carbohydrate-active enzymes. Applicable to any bacterial ecosystem, it enables the swift identification of functional enzymes that are highly efficient, alone or acting in synergy, to break down polysaccharides and oligosaccharides.

  17. Metagenomic investigation of the microbial diversity in a chrysotile asbestos mine pit pond, Lowell, Vermont, USA

    Heather E. Driscoll

    2016-12-01

    Full Text Available Here we report on a metagenomics investigation of the microbial diversity in a serpentine-hosted aquatic habitat created by chrysotile asbestos mining activity at the Vermont Asbestos Group (VAG Mine in northern Vermont, USA. The now-abandoned VAG Mine on Belvidere Mountain in the towns of Eden and Lowell includes three open-pit quarries, a flooded pit, mill buildings, roads, and >26 million metric tons of eroding mine waste that contribute alkaline mine drainage to the surrounding watershed. Metagenomes and water chemistry originated from aquatic samples taken at three depths (0.5 m, 3.5 m, and 25 m along the water column at three distinct, offshore sites within the mine's flooded pit (near 44°46′00.7673″, −72°31′36.2699″; UTM NAD 83 Zone 18 T 0695720 E, 4960030 N. Whole metagenome shotgun Illumina paired-end sequences were quality trimmed and analyzed based on a translated nucleotide search of NCBI-NR protein database and lowest common ancestor taxonomic assignments. Our results show strata within the pit pond water column can be distinguished by taxonomic composition and distribution, pH, temperature, conductivity, light intensity, and concentrations of dissolved oxygen. At the phylum level, metagenomes from 0.5 m and 3.5 m contained a similar distribution of taxa and were dominated by Actinobacteria (46% and 53% of reads, respectively, Proteobacteria (45% and 38%, respectively, and Bacteroidetes (7% in both. The metagenomes from 25 m showed a greater diversity of phyla and a different distribution of reads than the two upper strata: Proteobacteria (60%, Actinobacteria (18%, Planctomycetes, (10%, Bacteroidetes (5% and Cyanobacteria (2.5%, Armatimonadetes (<1%, Verrucomicrobia (<1%, Firmicutes (<1%, and Nitrospirae (<1%. Raw metagenome sequence data from each sample reside in NCBI's Short Read Archive (SRA ID: SRP056095 and are accessible through NCBI BioProject PRJNA277916.

  18. Resolving prokaryotic taxonomy without rRNA: longer oligonucleotide word lengths improve genome and metagenome taxonomic classification.

    Alsop, Eric B; Raymond, Jason

    2013-01-01

    Oligonucleotide signatures, especially tetranucleotide signatures, have been used as method for homology binning by exploiting an organism's inherent biases towards the use of specific oligonucleotide words. Tetranucleotide signatures have been especially useful in environmental metagenomics samples as many of these samples contain organisms from poorly classified phyla which cannot be easily identified using traditional homology methods, including NCBI BLAST. This study examines oligonucleotide signatures across 1,424 completed genomes from across the tree of life, substantially expanding upon previous work. A comprehensive analysis of mononucleotide through nonanucleotide word lengths suggests that longer word lengths substantially improve the classification of DNA fragments across a range of sizes of relevance to high throughput sequencing. We find that, at present, heptanucleotide signatures represent an optimal balance between prediction accuracy and computational time for resolving taxonomy using both genomic and metagenomic fragments. We directly compare the ability of tetranucleotide and heptanucleotide world lengths (tetranucleotide signatures are the current standard for oligonucleotide word usage analyses) for taxonomic binning of metagenome reads. We present evidence that heptanucleotide word lengths consistently provide more taxonomic resolving power, particularly in distinguishing between closely related organisms that are often present in metagenomic samples. This implies that longer oligonucleotide word lengths should replace tetranucleotide signatures for most analyses. Finally, we show that the application of longer word lengths to metagenomic datasets leads to more accurate taxonomic binning of DNA scaffolds and have the potential to substantially improve taxonomic assignment and assembly of metagenomic data.

  19. Resolving prokaryotic taxonomy without rRNA: longer oligonucleotide word lengths improve genome and metagenome taxonomic classification.

    Eric B Alsop

    Full Text Available Oligonucleotide signatures, especially tetranucleotide signatures, have been used as method for homology binning by exploiting an organism's inherent biases towards the use of specific oligonucleotide words. Tetranucleotide signatures have been especially useful in environmental metagenomics samples as many of these samples contain organisms from poorly classified phyla which cannot be easily identified using traditional homology methods, including NCBI BLAST. This study examines oligonucleotide signatures across 1,424 completed genomes from across the tree of life, substantially expanding upon previous work. A comprehensive analysis of mononucleotide through nonanucleotide word lengths suggests that longer word lengths substantially improve the classification of DNA fragments across a range of sizes of relevance to high throughput sequencing. We find that, at present, heptanucleotide signatures represent an optimal balance between prediction accuracy and computational time for resolving taxonomy using both genomic and metagenomic fragments. We directly compare the ability of tetranucleotide and heptanucleotide world lengths (tetranucleotide signatures are the current standard for oligonucleotide word usage analyses for taxonomic binning of metagenome reads. We present evidence that heptanucleotide word lengths consistently provide more taxonomic resolving power, particularly in distinguishing between closely related organisms that are often present in metagenomic samples. This implies that longer oligonucleotide word lengths should replace tetranucleotide signatures for most analyses. Finally, we show that the application of longer word lengths to metagenomic datasets leads to more accurate taxonomic binning of DNA scaffolds and have the potential to substantially improve taxonomic assignment and assembly of metagenomic data.

  20. Gut metagenomes of type 2 diabetic patients have characteristic single-nucleotide polymorphism distribution in Bacteroides coprocola.

    Chen, Yaowen; Li, Zongcheng; Hu, Shuofeng; Zhang, Jian; Wu, Jiaqi; Shao, Ningsheng; Bo, Xiaochen; Ni, Ming; Ying, Xiaomin

    2017-02-01

    Gut microbes play a critical role in human health and disease, and researchers have begun to characterize their genomes, the so-called gut metagenome. Thus far, metagenomics studies have focused on genus- or species-level composition and microbial gene sets, while strain-level composition and single-nucleotide polymorphism (SNP) have been overlooked. The gut metagenomes of type 2 diabetes (T2D) patients have been found to be enriched with butyrate-producing bacteria and sulfate reduction functions. However, it is not known whether the gut metagenomes of T2D patients have characteristic strain patterns or SNP distributions. We downloaded public gut metagenome datasets from 170 T2D patients and 174 healthy controls and performed a systematic comparative analysis of their metagenome SNPs. We found that Bacteroides coprocola, whose relative abundance did not differ between the groups, had a characteristic distribution of SNPs in the T2D patient group. We identified 65 genes, all in B. coprocola, that had remarkably different enrichment of SNPs. The first and sixth ranked genes encode glycosyl hydrolases (GenBank accession EDU99824.1 and EDV02301.1). Interestingly, alpha-glucosidase, which is also a glycosyl hydrolase located in the intestine, is an important drug target of T2D. These results suggest that different strains of B. coprocola may have different roles in human gut and a specific set of B. coprocola strains are correlated with T2D.

  1. Metagenomic analysis of microbial communities yields insight into impacts of nanoparticle design

    Metch, Jacob W.; Burrows, Nathan D.; Murphy, Catherine J.; Pruden, Amy; Vikesland, Peter J.

    2018-01-01

    Next-generation DNA sequencing and metagenomic analysis provide powerful tools for the environmentally friendly design of nanoparticles. Herein we demonstrate this approach using a model community of environmental microbes (that is, wastewater-activated sludge) dosed with gold nanoparticles of varying surface coatings and morphologies. Metagenomic analysis was highly sensitive in detecting the microbial community response to gold nanospheres and nanorods with either cetyltrimethylammonium bromide or polyacrylic acid surface coatings. We observed that the gold-nanoparticle morphology imposes a stronger force in shaping the microbial community structure than does the surface coating. Trends were consistent in terms of the compositions of both taxonomic and functional genes, which include antibiotic resistance genes, metal resistance genes and gene-transfer elements associated with cell stress that are relevant to public health. Given that nanoparticle morphology remained constant, the potential influence of gold dissolution was minimal. Surface coating governed the nanoparticle partitioning between the bioparticulate and aqueous phases.

  2. Characterization of Bacterial Hydrocarbon Degradation Potential in the Red Sea Through Metagenomic and Cultivation Methods

    Bianchi, Patrick

    2018-01-01

    The focus of this thesis is on the characterization at the metagenomic level of the water column of the Red Sea and on the isolation and characterization of novel hydrocarbon-degrading species and genomes adapted to the unique environmental characteristics of the basin. The presence of metabolic genes responsible of both linear and aromatic hydrocarbon degradation has been evaluated from a metagenomic survey and a meta-analysis of already available datasets. In parallel, water column-based microcosms have been established with crude oil as the sole carbon source, with aim to isolate potential novel bacterial species and provide new genome-based insights on the hydrocarbon degradation potential available in the Red Sea.

  3. Ancient DNA analysis identifies marine mollusc shells as new metagenomic archives of the past

    Der Sarkissian, Clio; Pichereau, Vianney; Dupont, Catherine

    2017-01-01

    Marine mollusc shells enclose a wealth of information on coastal organisms and their environment. Their life history traits as well as (palaeo-) environmental conditions, including temperature, food availability, salinity and pollution, can be traced through the analysis of their shell (micro...... extraction, high-throughput shotgun DNA sequencing and metagenomic analyses to marine mollusc shells spanning the last ~7,000 years. We report successful DNA extraction from shells, including a variety of ancient specimens, and find that DNA recovery is highly dependent on their biomineral structure......, carbonate layer preservation and disease state. We demonstrate positive taxonomic identification of mollusc species using a combination of mitochondrial DNA genomes, barcodes, genome-scale data and metagenomic approaches. We also find shell biominerals to contain a diversity of microbial DNA from the marine...

  4. Metagenomic identification of active methanogens and methanotrophs in serpentinite springs of the Voltri Massif, Italy

    William J. Brazelton

    2017-01-01

    Full Text Available The production of hydrogen and methane by geochemical reactions associated with the serpentinization of ultramafic rocks can potentially support subsurface microbial ecosystems independent of the photosynthetic biosphere. Methanogenic and methanotrophic microorganisms are abundant in marine hydrothermal systems heavily influenced by serpentinization, but evidence for methane-cycling archaea and bacteria in continental serpentinite springs has been limited. This report provides metagenomic and experimental evidence for active methanogenesis and methanotrophy by microbial communities in serpentinite springs of the Voltri Massif, Italy. Methanogens belonging to family Methanobacteriaceae and methanotrophic bacteria belonging to family Methylococcaceae were heavily enriched in three ultrabasic springs (pH 12. Metagenomic data also suggest the potential for hydrogen oxidation, hydrogen production, carbon fixation, fermentation, and organic acid metabolism in the ultrabasic springs. The predicted metabolic capabilities are consistent with an active subsurface ecosystem supported by energy and carbon liberated by geochemical reactions within the serpentinite rocks of the Voltri Massif.

  5. The Human Gut Antibiotic Resistome in the Metagenomic Era: Progress and Perspectives

    Yongfei Hu

    2016-04-01

    Full Text Available The human gut is populated by a vast number of bacteria, which play a critical role in human health. In recent years, attention has focused on the gut bacteria as a reservoir of antibiotic resistance genes (ARGs. Both culture-dependent and culture-independent methods have been applied to investigate numerous ARGs, collectively called the antibiotic resistome, harbored by gut bacteria. This has led to an increased understanding of the overall profile of the gut antibiotic resistome, although it remains incompletely understood. In this review, we summarize the recent research findings on the human gut antibiotic resistome, with an emphasis on progress achieved using the culture-independent metagenomic strategy. We also describe the features of different available ARG databases used for annotation in metagenomic analysis, discuss the potential problems and limitations in current research, and suggest several directions for future investigation.

  6. GenomePeek—an online tool for prokaryotic genome and metagenome analysis

    Katelyn McNair

    2015-06-01

    Full Text Available As more and more prokaryotic sequencing takes place, a method to quickly and accurately analyze this data is needed. Previous tools are mainly designed for metagenomic analysis and have limitations; such as long runtimes and significant false positive error rates. The online tool GenomePeek (edwards.sdsu.edu/GenomePeek was developed to analyze both single genome and metagenome sequencing files, quickly and with low error rates. GenomePeek uses a sequence assembly approach where reads to a set of conserved genes are extracted, assembled and then aligned against the highly specific reference database. GenomePeek was found to be faster than traditional approaches while still keeping error rates low, as well as offering unique data visualization options.

  7. Metagenomics Study on the Polymorphism of Gut Microbiota and Their Function on Human Health

    Feng, Qiang

    diversity and functional complexity of the gut microbiome. Facilitated by the Next Generation Sequencing (NGS) technologies and the progress of bioinformatics in the past decade, we have acquired substantial achievements in metagenomic studies on human gut microbiome and established the fundamentals of our...... understanding of the interactions between gut microbes and human body, and also the importance of this interaction on human health. As one of the milestones, the first integrated gene catalog in the human gut microbiome was constructed in 2010 in the scheme of the Metagenomics of Human Intestinal Tract (Meta......’ are shared in the population. These microorganisms participate in various metabolic pathways and activities of the immune system and the nervous system of our bodies,and have fundamental impacts on our health. For example, an association study between gut microbiome and type 2 diabetes (T2D) highlighted...

  8. Toward molecular trait-based ecology through integration of biogeochemical, geographical and metagenomic data

    Raes, Jeroen; Letunic, Ivica; Yamada, Takuji

    2011-01-01

    Using metagenomic 'parts lists' to infer global patterns on microbial ecology remains a significant challenge. To deduce important ecological indicators such as environmental adaptation, molecular trait dispersal, diversity variation and primary production from the gene pool of an ecosystem, we...... integrated 25 ocean metagenomes with geographical, meteorological and geophysicochemical data. We find that climatic factors (temperature, sunlight) are the major determinants of the biomolecular repertoire of each sample and the main limiting factor on functional trait dispersal (absence of biogeographic...... provincialism). Molecular functional richness and diversity show a distinct latitudinal gradient peaking at 20° N and correlate with primary production. The latter can also be predicted from the molecular functional composition of an environmental sample. Together, our results show that the functional community...

  9. Using Short-Term Enrichments and Metagenomics to Obtain Genomes from uncultured Activated Sludge Microorganisms

    Karst, Søren Michael; Nielsen, Per Halkjær; Albertsen, Mads

    is that they depend on system-specific reference genomes in order to analyze the vast amounts of data (Albertsen et al., 2012). This limits the application of -omics to environments for which a comprehensive catalogue of reference genomes exists e.g. the human gut. Several strategies for obtaining microbial genomes...... exist today, but their ability to obtain complete genomes from complex microbial communities on a large scale is still inadequate (Lasken, 2012). In theory, conventional metagenomics should be able to recover genomes from complex communities, but in practice the approach is hampered by the presence...... of microdiversity. This leads to fragmented and chimeric de novo assemblies, which prevent the extraction of complete genomes. The new approach presented here involves reducing the impact of microdiversity and increasing genome extraction efficiency by what we term “metagenome triangulation”. The microdiversity...

  10. Applying Shannon's information theory to bacterial and phage genomes and metagenomes

    Akhter, Sajia; Bailey, Barbara A.; Salamon, Peter; Aziz, Ramy K.; Edwards, Robert A.

    2013-01-01

    All sequence data contain inherent information that can be measured by Shannon's uncertainty theory. Such measurement is valuable in evaluating large data sets, such as metagenomic libraries, to prioritize their analysis and annotation, thus saving computational resources. Here, Shannon's index of complete phage and bacterial genomes was examined. The information content of a genome was found to be highly dependent on the genome length, GC content, and sequence word size. In metagenomic sequences, the amount of information correlated with the number of matches found by comparison to sequence databases. A sequence with more information (higher uncertainty) has a higher probability of being significantly similar to other sequences in the database. Measuring uncertainty may be used for rapid screening for sequences with matches in available database, prioritizing computational resources, and indicating which sequences with no known similarities are likely to be important for more detailed analysis.

  11. Metagenomic Analysis of Chicken Gut Microbiota for Improving Metabolism and Health of Chickens — A Review

    Ki Young Choi

    2015-09-01

    Full Text Available Chicken is a major food source for humans, hence it is important to understand the mechanisms involved in nutrient absorption in chicken. In the gastrointestinal tract (GIT, the microbiota plays a central role in enhancing nutrient absorption and strengthening the immune system, thereby affecting both growth and health of chicken. There is little information on the diversity and functions of chicken GIT microbiota, its impact on the host, and the interactions between the microbiota and host. Here, we review the recent metagenomic strategies to analyze the chicken GIT microbiota composition and its functions related to improving metabolism and health. We summarize methodology of metagenomics in order to obtain bacterial taxonomy and functional inferences of the GIT microbiota and suggest a set of indicator genes for monitoring and manipulating the microbiota to promote host health in future.

  12. Stable isotope probing in the metagenomics era: a bridge towards improved bioremediation

    Uhlik, Ondrej; Leewis, Mary-Cathrine; Strejcek, Michal; Musilova, Lucie; Mackova, Martina; Leigh, Mary Beth; Macek, Tomas

    2012-01-01

    Microbial biodegradation and biotransformation reactions are essential to most bioremediation processes, yet the specific organisms, genes, and mechanisms involved are often not well understood. Stable isotope probing (SIP) enables researchers to directly link microbial metabolic capability to phylogenetic and metagenomic information within a community context by tracking isotopically labeled substances into phylogenetically and functionally informative biomarkers. SIP is thus applicable as a tool for the identification of active members of the microbial community and associated genes integral to the community functional potential, such as biodegradative processes. The rapid evolution of SIP over the last decade and integration with metagenomics provides researchers with a much deeper insight into potential biodegradative genes, processes, and applications, thereby enabling an improved mechanistic understanding that can facilitate advances in the field of bioremediation. PMID:23022353

  13. Metagenomics as a tool to obtain full genomes of process-critical bacteria in engineered systems

    Albertsen, Mads; Hugenholtz, Philip; Tyson, Gene W.

    of the community. The assembled genomes include many of the process-critical bacteria involved in wastewater treatment, such as Competibacter, Tetrasphaera and TM7. The approach is not limited to different extraction methods, but can be applied to any treatment that results in different relative abundance......Bacteria play a pivotal role in engineered systems such as wastewater treatment plants. Obtaining genomes of the bacteria provides the genetic potential of the system and also allows studies of in situ functions through transcriptomics and proteomics. Hence, it enables correlations of operational......, the sequencing of bulk genomic DNA from environmental samples, has the potential to provide genomes of this uncultured majority. However, so far only few bacterial genomes have been obtained from metagenomic data. In this study we present a new approach to obtain individual genomes from metagenomes. We deeply...

  14. Reconstruction of diverse verrucomicrobial genomes from metagenome datasets of freshwater reservoirs

    Cabello-Yeves, P.J.; Ghai, Rohit; Mehrshad, Maliheh; Picazo, A.; Camacho, A.; Rodriguez-Valera, F.

    2017-01-01

    Roč. 8, Nov (2017), č. článku 2131. ISSN 1664-302X R&D Projects: GA ČR GA17-04828S Grant - others:AV ČR(CZ) L200961651 Institutional support: RVO:60077344 Keywords : freshwater Verrucomicrobia * metagenomics * rhodopsin * nitrogen fixation * genome streamlining Subject RIV: EE - Microbiology, Virology OBOR OECD: Microbiology Impact factor: 4.076, year: 2016

  15. Draft Genome Sequence of Uncultured SAR324 Bacterium lautmerah10, Binned from a Red Sea Metagenome

    Haroon, Mohamed; Thompson, Luke R.; Stingl, Ulrich

    2016-01-01

    A draft genome of SAR324 bacterium lautmerah10 was assembled from a metagenome of a surface water sample from the Red Sea, Saudi Arabia. The genome is more complete and has a higher G+C content than that of previously sequenced SAR324 representatives. Its genomic information shows a versatile metabolism that confers an advantage to SAR324, which is reflected in its distribution throughout different depths of the marine water column.

  16. Data on gut metagenomes of the patients with alcoholic dependence syndrome and alcoholic liver cirrhosis

    Alexander V. Tyakht

    2017-04-01

    Full Text Available Alcoholism is associated with significant changes in gut microbiota composition. Metagenomic sequencing allows to assess the altered abundance levels of bacterial taxa and genes in a culture-independent way. We collected 99 stool samples from the patients with alcoholic dependence syndrome (n=72 and alcoholic liver cirrhosis (n=27. Each of the samples was surveyed using “shotgun” (whole-genome sequencing on SOLiD platform. The reads are deposited in the ENA (project ID: PRJEB18041.

  17. Shotgun pyrosequencing metagenomic analyses of dusts from swine confinement and grain facilities.

    Boissy, Robert J; Romberger, Debra J; Roughead, William A; Weissenburger-Moser, Lisa; Poole, Jill A; LeVan, Tricia D

    2014-01-01

    Inhalation of agricultural dusts causes inflammatory reactions and symptoms such as headache, fever, and malaise, which can progress to chronic airway inflammation and associated diseases, e.g. asthma, chronic bronchitis, chronic obstructive pulmonary disease, and hypersensitivity pneumonitis. Although in many agricultural environments feed particles are the major constituent of these dusts, the inflammatory responses that they provoke are likely attributable to particle-associated bacteria, archaebacteria, fungi, and viruses. In this study, we performed shotgun pyrosequencing metagenomic analyses of DNA from dusts from swine confinement facilities or grain elevators, with comparisons to dusts from pet-free households. DNA sequence alignment showed that 19% or 62% of shotgun pyrosequencing metagenomic DNA sequence reads from swine facility or household dusts, respectively, were of swine or human origin, respectively. In contrast only 2% of such reads from grain elevator dust were of mammalian origin. These metagenomic shotgun reads of mammalian origin were excluded from our analyses of agricultural dust microbiota. The ten most prevalent bacterial taxa identified in swine facility compared to grain elevator or household dust were comprised of 75%, 16%, and 42% gram-positive organisms, respectively. Four of the top five swine facility dust genera were assignable (Clostridium, Lactobacillus, Ruminococcus, and Eubacterium, ranging from 4% to 19% relative abundance). The relative abundances of these four genera were lower in dust from grain elevators or pet-free households. These analyses also highlighted the predominance in swine facility dust of Firmicutes (70%) at the phylum level, Clostridia (44%) at the Class level, and Clostridiales at the Order level (41%). In summary, shotgun pyrosequencing metagenomic analyses of agricultural dusts show that they differ qualitatively and quantitatively at the level of microbial taxa present, and that the bioinformatic analyses

  18. Shotgun pyrosequencing metagenomic analyses of dusts from swine confinement and grain facilities.

    Robert J Boissy

    Full Text Available Inhalation of agricultural dusts causes inflammatory reactions and symptoms such as headache, fever, and malaise, which can progress to chronic airway inflammation and associated diseases, e.g. asthma, chronic bronchitis, chronic obstructive pulmonary disease, and hypersensitivity pneumonitis. Although in many agricultural environments feed particles are the major constituent of these dusts, the inflammatory responses that they provoke are likely attributable to particle-associated bacteria, archaebacteria, fungi, and viruses. In this study, we performed shotgun pyrosequencing metagenomic analyses of DNA from dusts from swine confinement facilities or grain elevators, with comparisons to dusts from pet-free households. DNA sequence alignment showed that 19% or 62% of shotgun pyrosequencing metagenomic DNA sequence reads from swine facility or household dusts, respectively, were of swine or human origin, respectively. In contrast only 2% of such reads from grain elevator dust were of mammalian origin. These metagenomic shotgun reads of mammalian origin were excluded from our analyses of agricultural dust microbiota. The ten most prevalent bacterial taxa identified in swine facility compared to grain elevator or household dust were comprised of 75%, 16%, and 42% gram-positive organisms, respectively. Four of the top five swine facility dust genera were assignable (Clostridium, Lactobacillus, Ruminococcus, and Eubacterium, ranging from 4% to 19% relative abundance. The relative abundances of these four genera were lower in dust from grain elevators or pet-free households. These analyses also highlighted the predominance in swine facility dust of Firmicutes (70% at the phylum level, Clostridia (44% at the Class level, and Clostridiales at the Order level (41%. In summary, shotgun pyrosequencing metagenomic analyses of agricultural dusts show that they differ qualitatively and quantitatively at the level of microbial taxa present, and that the

  19. High frequency of phylogenetically diverse reductive dehalogenase-homologous genes in deep subseafloor sedimentary metagenomes

    Mikihiko eKawai

    2014-03-01

    Full Text Available Marine subsurface sediments on the Pacific margin harbor diverse microbial communities even at depths of several hundreds meters below the seafloor (mbsf or more. Previous PCR-based molecular analysis showed the presence of diverse reductive dehalogenase gene (rdhA homologs in marine subsurface sediment, suggesting that anaerobic respiration of organohalides is one of the possible energy-yielding pathways in the organic-rich sedimentary habitat. However, primer-independent molecular characterization of rdhA has remained to be demonstrated. Here, we studied the diversity and frequency of rdhA homologs by metagenomic analysis of five different depth horizons (0.8, 5.1, 18.6, 48.5 and 107.0 mbsf at Site C9001 off the Shimokita Peninsula of Japan. From all metagenomic pools, remarkably diverse rdhA-homologous sequences, some of which are affiliated with novel clusters, were observed with high frequency. As a comparison, we also examined frequency of dissimilatory sulfite reductase genes (dsrAB, key functional genes for microbial sulfate reduction. The dsrAB were also widely observed in the metagenomic pools whereas the frequency of dsrAB genes was generally smaller than that of rdhA-homologous genes. The phylogenetic composition of rdhA-homologous genes was similar among the five depth horizons. Our metagenomic data revealed that subseafloor rdhA homologs are more diverse than previously identified from PCR-based molecular studies. Spatial distribution of similar rdhA homologs across wide depositional ages indicates that the heterotrophic metabolic processes mediated by the genes can be ecologically important, functioning in the organic-rich subseafloor sedimentary biosphere.

  20. Draft Genome Sequence of Uncultured SAR324 Bacterium lautmerah10, Binned from a Red Sea Metagenome

    Haroon, Mohamed

    2016-02-11

    A draft genome of SAR324 bacterium lautmerah10 was assembled from a metagenome of a surface water sample from the Red Sea, Saudi Arabia. The genome is more complete and has a higher G+C content than that of previously sequenced SAR324 representatives. Its genomic information shows a versatile metabolism that confers an advantage to SAR324, which is reflected in its distribution throughout different depths of the marine water column.

  1. Strain-Level Discrimination of Shiga Toxin-Producing Escherichia coli in Spinach Using Metagenomic Sequencing.

    Susan R Leonard

    Full Text Available Consumption of fresh bagged spinach contaminated with Shiga toxin-producing Escherichia coli (STEC has led to severe illness and death; however current culture-based methods to detect foodborne STEC are time consuming. Since not all STEC strains are considered pathogenic to humans, it is crucial to incorporate virulence characterization of STEC in the detection method. In this study, we assess the comprehensiveness of utilizing a shotgun metagenomics approach for detection and strain-level identification by spiking spinach with a variety of genomically disparate STEC strains at a low contamination level of 0.1 CFU/g. Molecular serotyping, virulence gene characterization, microbial community analysis, and E. coli core gene single nucleotide polymorphism (SNP analysis were performed on metagenomic sequence data from enriched samples. It was determined from bacterial community analysis that E. coli, which was classified at the phylogroup level, was a major component of the population in most samples. However, in over half the samples, molecular serotyping revealed the presence of indigenous E. coli which also contributed to the percent abundance of E. coli. Despite the presence of additional E. coli strains, the serotype and virulence genes of the spiked STEC, including correct Shiga toxin subtype, were detected in 94% of the samples with a total number of reads per sample averaging 2.4 million. Variation in STEC abundance and/or detection was observed in replicate spiked samples, indicating an effect from the indigenous microbiota during enrichment. SNP analysis of the metagenomic data correctly placed the spiked STEC in a phylogeny of related strains in cases where the indigenous E. coli did not predominate in the enriched sample. Also, for these samples, our analysis demonstrates that strain-level phylogenetic resolution is possible using shotgun metagenomic data for determining the genomic relatedness of a contaminating STEC strain to other

  2. Seasonal patterns in Arctic prasinophytes and inferred ecology of Bathycoccus unveiled in an Arctic winter metagenome.

    Joli, Nathalie; Monier, Adam; Logares, Ramiro; Lovejoy, Connie

    2017-06-01

    Prasinophytes occur in all oceans but rarely dominate phytoplankton populations. In contrast, a single ecotype of the prasinophyte Micromonas is frequently the most abundant photosynthetic taxon reported in the Arctic from summer through autumn. However, seasonal dynamics of prasinophytes outside of this period are little known. To address this, we analyzed high-throughput V4 18S rRNA amplicon data collected from November to July in the Amundsen Gulf Region, Beaufort Sea, Arctic. Surprisingly during polar sunset in November and December, we found a high proportion of reads from both DNA and RNA belonging to another prasinophyte, Bathycoccus. We then analyzed a metagenome from a December sample and the resulting Bathycoccus metagenome assembled genome (MAG) covered ~90% of the Bathycoccus Ban7 reference genome. In contrast, only ~20% of a reference Micromonas genome was found in the metagenome. Our phylogenetic analysis of marker genes placed the Arctic Bathycoccus in the B1 coastal clade. In addition, substitution rates of 129 coding DNA sequences were ~1.6% divergent between the Arctic MAG and coastal Chilean upwelling MAGs and 17.3% between it and a South East Atlantic open ocean MAG in the B2 Clade. The metagenomic analysis also revealed a winter viral community highly skewed toward viruses targeting Micromonas, with a much lower diversity of viruses targeting Bathycoccus. Overall a combination of Micromonas being relatively less able to maintain activity under dark winter conditions and viral suppression of Micromonas may have contributed to the success of Bathycoccus in the Amundsen Gulf during winter.

  3. An Improved Methodology to Overcome Key Issues in Human Fecal Metagenomic DNA Extraction

    Jitendra Kumar

    2016-12-01

    Full Text Available Microbes are ubiquitously distributed in nature, and recent culture-independent studies have highlighted the significance of gut microbiota in human health and disease. Fecal DNA is the primary source for the majority of human gut microbiome studies. However, further improvement is needed to obtain fecal metagenomic DNA with sufficient amount and good quality but low host genomic DNA contamination. In the current study, we demonstrate a quick, robust, unbiased, and cost-effective method for the isolation of high molecular weight (>23 kb metagenomic DNA (260/280 ratio >1.8 with a good yield (55.8 ± 3.8 ng/mg of feces. We also confirm that there is very low human genomic DNA contamination (eubacterial: human genomic DNA marker genes = 227.9:1 in the human feces. The newly-developed method robustly performs for fresh as well as stored fecal samples as demonstrated by 16S rRNA gene sequencing using 454 FLX+. Moreover, 16S rRNA gene analysis indicated that compared to other DNA extraction methods tested, the fecal metagenomic DNA isolated with current methodology retains species richness and does not show microbial diversity biases, which is further confirmed by qPCR with a known quantity of spike-in genomes. Overall, our data highlight a protocol with a balance between quality, amount, user-friendliness, and cost effectiveness for its suitability toward usage for culture-independent analysis of the human gut microbiome, which provides a robust solution to overcome key issues associated with fecal metagenomic DNA isolation in human gut microbiome studies.

  4. Comparison of normalization methods for the analysis of metagenomic gene abundance data.

    Pereira, Mariana Buongermino; Wallroth, Mikael; Jonsson, Viktor; Kristiansson, Erik

    2018-04-20

    In shotgun metagenomics, microbial communities are studied through direct sequencing of DNA without any prior cultivation. By comparing gene abundances estimated from the generated sequencing reads, functional differences between the communities can be identified. However, gene abundance data is affected by high levels of systematic variability, which can greatly reduce the statistical power and introduce false positives. Normalization, which is the process where systematic variability is identified and removed, is therefore a vital part of the data analysis. A wide range of normalization methods for high-dimensional count data has been proposed but their performance on the analysis of shotgun metagenomic data has not been evaluated. Here, we present a systematic evaluation of nine normalization methods for gene abundance data. The methods were evaluated through resampling of three comprehensive datasets, creating a realistic setting that preserved the unique characteristics of metagenomic data. Performance was measured in terms of the methods ability to identify differentially abundant genes (DAGs), correctly calculate unbiased p-values and control the false discovery rate (FDR). Our results showed that the choice of normalization method has a large impact on the end results. When the DAGs were asymmetrically present between the experimental conditions, many normalization methods had a reduced true positive rate (TPR) and a high false positive rate (FPR). The methods trimmed mean of M-values (TMM) and relative log expression (RLE) had the overall highest performance and are therefore recommended for the analysis of gene abundance data. For larger sample sizes, CSS also showed satisfactory performance. This study emphasizes the importance of selecting a suitable normalization methods in the analysis of data from shotgun metagenomics. Our results also demonstrate that improper methods may result in unacceptably high levels of false positives, which in turn may lead

  5. Statistical methods for detecting differentially abundant features in clinical metagenomic samples.

    James Robert White

    2009-04-01

    Full Text Available Numerous studies are currently underway to characterize the microbial communities inhabiting our world. These studies aim to dramatically expand our understanding of the microbial biosphere and, more importantly, hope to reveal the secrets of the complex symbiotic relationship between us and our commensal bacterial microflora. An important prerequisite for such discoveries are computational tools that are able to rapidly and accurately compare large datasets generated from complex bacterial communities to identify features that distinguish them.We present a statistical method for comparing clinical metagenomic samples from two treatment populations on the basis of count data (e.g. as obtained through sequencing to detect differentially abundant features. Our method, Metastats, employs the false discovery rate to improve specificity in high-complexity environments, and separately handles sparsely-sampled features using Fisher's exact test. Under a variety of simulations, we show that Metastats performs well compared to previously used methods, and significantly outperforms other methods for features with sparse counts. We demonstrate the utility of our method on several datasets including a 16S rRNA survey of obese and lean human gut microbiomes, COG functional profiles of infant and mature gut microbiomes, and bacterial and viral metabolic subsystem data inferred from random sequencing of 85 metagenomes. The application of our method to the obesity dataset reveals differences between obese and lean subjects not reported in the original study. For the COG and subsystem datasets, we provide the first statistically rigorous assessment of the differences between these populations. The methods described in this paper are the first to address clinical metagenomic datasets comprising samples from multiple subjects. Our methods are robust across datasets of varied complexity and sampling level. While designed for metagenomic applications, our software

  6. Comparing and Evaluating Metagenome Assembly Tools from a Microbiologist's Perspective - Not Only Size Matters!

    John Vollmers

    Full Text Available With the constant improvement in cost-efficiency and quality of Next Generation Sequencing technologies, shotgun-sequencing approaches -such as metagenomics- have nowadays become the methods of choice for studying and classifying microorganisms from various habitats. The production of data has dramatically increased over the past years and processing and analysis steps are becoming more and more of a bottleneck. Limiting factors are partly the availability of computational resources, but mainly the bioinformatics expertise in establishing and applying appropriate processing and analysis pipelines. Fortunately, a large diversity of specialized software tools is nowadays available. Nevertheless, choosing the most appropriate methods for answering specific biological questions can be rather challenging, especially for non-bioinformaticians. In order to provide a comprehensive overview and guide for the microbiological scientific community, we assessed the most common and freely available metagenome assembly tools with respect to their output statistics, their sensitivity for low abundant community members and variability in resulting community profiles as well as their ease-of-use. In contrast to the highly anticipated "Critical Assessment of Metagenomic Interpretation" (CAMI challenge, which uses general mock community-based assembler comparison we here tested assemblers on real Illumina metagenome sequencing data from natural communities of varying complexity sampled from forest soil and algal biofilms. Our observations clearly demonstrate that different assembly tools can prove optimal, depending on the sample type, available computational resources and, most importantly, the specific research goal. In addition, we present detailed descriptions of the underlying principles and pitfalls of publically available assembly tools from a microbiologist's perspective, and provide guidance regarding the user-friendliness, sensitivity and reliability of

  7. A combined meta-barcoding and shotgun metagenomic analysis of spontaneous wine fermentation.

    Sternes, Peter R; Lee, Danna; Kutyna, Dariusz R; Borneman, Anthony R

    2017-07-01

    Wine is a complex beverage, comprising hundreds of metabolites produced through the action of yeasts and bacteria in fermenting grape must. Commercially, there is now a growing trend away from using wine yeast (Saccharomyces) starter cultures, toward the historic practice of uninoculated or "wild" fermentation, where the yeasts and bacteria associated with the grapes and/or winery perform the fermentation. It is the varied metabolic contributions of these numerous non-Saccharomyces species that are thought to impart complexity and desirable taste and aroma attributes to wild ferments in comparison to their inoculated counterparts. To map the microflora of spontaneous fermentation, metagenomic techniques were employed to characterize and monitor the progression of fungal species in 5 different wild fermentations. Both amplicon-based ribosomal DNA internal transcribed spacer (ITS) phylotyping and shotgun metagenomics were used to assess community structure across different stages of fermentation. While providing a sensitive and highly accurate means of characterizing the wine microbiome, the shotgun metagenomic data also uncovered a significant overabundance bias in the ITS phylotyping abundance estimations for the common non-Saccharomyces wine yeast genus Metschnikowia. By identifying biases such as that observed for Metschnikowia, abundance measurements from future ITS phylotyping datasets can be corrected to provide more accurate species representation. Ultimately, as more shotgun metagenomic and single-strain de novo assemblies for key wine species become available, the accuracy of both ITS-amplicon and shotgun studies will greatly increase, providing a powerful methodology for deciphering the influence of the microbial community on the wine flavor and aroma. © The Authors 2017. Published by Oxford University Press.

  8. Exploring nucleo-cytoplasmic large DNA viruses in Tara Oceans microbial metagenomes.

    Hingamp, Pascal; Grimsley, Nigel; Acinas, Silvia G; Clerissi, Camille; Subirana, Lucie; Poulain, Julie; Ferrera, Isabel; Sarmento, Hugo; Villar, Emilie; Lima-Mendez, Gipsi; Faust, Karoline; Sunagawa, Shinichi; Claverie, Jean-Michel; Moreau, Hervé; Desdevises, Yves; Bork, Peer; Raes, Jeroen; de Vargas, Colomban; Karsenti, Eric; Kandels-Lewis, Stefanie; Jaillon, Olivier; Not, Fabrice; Pesant, Stéphane; Wincker, Patrick; Ogata, Hiroyuki

    2013-09-01

    Nucleo-cytoplasmic large DNA viruses (NCLDVs) constitute a group of eukaryotic viruses that can have crucial ecological roles in the sea by accelerating the turnover of their unicellular hosts or by causing diseases in animals. To better characterize the diversity, abundance and biogeography of marine NCLDVs, we analyzed 17 metagenomes derived from microbial samples (0.2-1.6 μm size range) collected during the Tara Oceans Expedition. The sample set includes ecosystems under-represented in previous studies, such as the Arabian Sea oxygen minimum zone (OMZ) and Indian Ocean lagoons. By combining computationally derived relative abundance and direct prokaryote cell counts, the abundance of NCLDVs was found to be in the order of 10(4)-10(5) genomes ml(-1) for the samples from the photic zone and 10(2)-10(3) genomes ml(-1) for the OMZ. The Megaviridae and Phycodnaviridae dominated the NCLDV populations in the metagenomes, although most of the reads classified in these families showed large divergence from known viral genomes. Our taxon co-occurrence analysis revealed a potential association between viruses of the Megaviridae family and eukaryotes related to oomycetes. In support of this predicted association, we identified six cases of lateral gene transfer between Megaviridae and oomycetes. Our results suggest that marine NCLDVs probably outnumber eukaryotic organisms in the photic layer (per given water mass) and that metagenomic sequence analyses promise to shed new light on the biodiversity of marine viruses and their interactions with potential hosts.

  9. Genomic and metagenomic challenges and opportunities for bioleaching: a mini-review.

    Cárdenas, Juan Pablo; Quatrini, Raquel; Holmes, David S

    2016-09-01

    High-throughput genomic technologies are accelerating progress in understanding the diversity of microbial life in many environments. Here we highlight advances in genomics and metagenomics of microorganisms from bioleaching heaps and related acidic mining environments. Bioleaching heaps used for copper recovery provide significant opportunities to study the processes and mechanisms underlying microbial successions and the influence of community composition on ecosystem functioning. Obtaining quantitative and process-level knowledge of these dynamics is pivotal for understanding how microorganisms contribute to the solubilization of copper for industrial recovery. Advances in DNA sequencing technology provide unprecedented opportunities to obtain information about the genomes of bioleaching microorganisms, allowing predictive models of metabolic potential and ecosystem-level interactions to be constructed. These approaches are enabling predictive phenotyping of organisms many of which are recalcitrant to genetic approaches or are unculturable. This mini-review describes current bioleaching genomic and metagenomic projects and addresses the use of genome information to: (i) build metabolic models; (ii) predict microbial interactions; (iii) estimate genetic diversity; and (iv) study microbial evolution. Key challenges and perspectives of bioleaching genomics/metagenomics are addressed. Copyright © 2016 The Author(s). Published by Elsevier Masson SAS.. All rights reserved.

  10. HORSE SPECIES SYMPOSIUM: Canine intestinal microbiology and metagenomics: From phylogeny to function.

    Guard, B C; Suchodolski, J S

    2016-06-01

    Recent molecular studies have revealed a complex microbiota in the dog intestine. Convincing evidence has been reported linking changes in microbial communities to acute and chronic gastrointestinal inflammation, especially in canine inflammatory bowel disease (IBD). The most common microbial changes observed in intestinal inflammation are decreases in the bacterial phyla Firmicutes (i.e., Lachnospiraceae, Ruminococcaceae, and ) and Bacteroidetes, with concurrent increases in Proteobacteria (i.e., ). Due to the important role of microbial-derived metabolites for host health, it is important to elucidate the metabolic consequences of gastrointestinal dysbiosis and physiological pathways implicated in specific disease phenotypes. Metagenomic studies have used shotgun sequencing of DNA as well as phylogenetic investigation of communities by reconstruction of unobserved states (PICRUSt) to characterize functional changes in the bacterial metagenome in gastrointestinal disease. Furthermore, wide-scale and untargeted measurements of metabolic products derived by the host and the microbiota in intestinal samples allow a better understanding of the functional alterations that occur in gastrointestinal disease. For example, changes in bile acid metabolism and tryptophan catabolism recently have been reported in humans and dogs. Also, metabolites associated with the pentose phosphate pathway were significantly altered in chronic gastrointestinal inflammation and indicate the presence of oxidative stress in dogs with IBD. This review focuses on the advancements made in canine metagenomics and metabolomics and their implications in understanding gastrointestinal disease as well as the development of better treatment approaches.

  11. Metagenomic analysis reveals that modern microbialites and polar microbial mats have similar taxonomic and functional potential

    Richard Allen White III

    2015-09-01

    Full Text Available Within the subarctic climate of Clinton Creek, Yukon, Canada, lies an abandoned and flooded open-pit asbestos mine that harbors rapidly growing microbialites. To understand their formation we completed a metagenomic community profile of the microbialites and their surrounding sediments. Assembled metagenomic data revealed that bacteria within the phylum Proteobacteria numerically dominated this system, although the relative abundances of taxa within the phylum varied among environments. Bacteria belonging to Alphaproteobacteria and Gammaproteobacteria were dominant in the microbialites and sediments, respectively. The microbialites were also home to many other groups associated with microbialite formation including filamentous cyanobacteria and dissimilatory sulfate-reducing Deltaproteobacteria, consistent with the idea of a shared global microbialite microbiome. Other members were present that are typically not associated with microbialites including Gemmatimonadetes and iron-oxidizing Betaproteobacteria, which participate in carbon metabolism and iron cycling. Compared to the sediments, the microbialite microbiome has significantly more genes associated with photosynthetic processes (e.g., photosystem II reaction centers, carotenoid and chlorophyll biosynthesis and carbon fixation (e.g., CO dehydrogenase. The Clinton Creek microbialite communities had strikingly similar functional potentials to non-lithifying microbial mats from the Canadian High Arctic and Antarctica, but are functionally distinct, from non-lithifying mats or biofilms from Yellowstone. Clinton Creek microbialites also share metabolic genes (R2 0.900. These metagenomic profiles from an anthropogenic microbialite-forming ecosystem provide context to microbialite formation on a human-relevant timescale.

  12. Meta4: a web application for sharing and annotating metagenomic gene predictions using web services.

    Richardson, Emily J; Escalettes, Franck; Fotheringham, Ian; Wallace, Robert J; Watson, Mick

    2013-01-01

    Whole-genome shotgun metagenomics experiments produce DNA sequence data from entire ecosystems, and provide a huge amount of novel information. Gene discovery projects require up-to-date information about sequence homology and domain structure for millions of predicted proteins to be presented in a simple, easy-to-use system. There is a lack of simple, open, flexible tools that allow the rapid sharing of metagenomics datasets with collaborators in a format they can easily interrogate. We present Meta4, a flexible and extensible web application that can be used to share and annotate metagenomic gene predictions. Proteins and predicted domains are stored in a simple relational database, with a dynamic front-end which displays the results in an internet browser. Web services are used to provide up-to-date information about the proteins from homology searches against public databases. Information about Meta4 can be found on the project website, code is available on Github, a cloud image is available, and an example implementation can be seen at.

  13. Aerially transmitted human fungal pathogens: what can we learn from metagenomics and comparative genomics?

    Aliouat-Denis, Cécile-Marie; Chabé, Magali; Delhaes, Laurence; Dei-Cas, Eduardo

    2014-01-01

    In the last few decades, aerially transmitted human fungal pathogens have been increasingly recognized to impact the clinical course of chronic pulmonary diseases, such as asthma, cystic fibrosis or chronic obstructive pulmonary disease. Thanks to recent development of culture-free high-throughput sequencing methods, the metagenomic approaches are now appropriate to detect, identify and even quantify prokaryotic or eukaryotic microorganism communities inhabiting human respiratory tract and to access the complexity of even low-burden microbe communities that are likely to play a role in chronic pulmonary diseases. In this review, we explore how metagenomics and comparative genomics studies can alleviate fungal culture bottlenecks, improve our knowledge about fungal biology, lift the veil on cross-talks between host lung and fungal microbiota, and gain insights into the pathogenic impact of these aerially transmitted fungi that affect human beings. We reviewed metagenomic studies and comparative genomic analyses of carefully chosen microorganisms, and confirmed the usefulness of such approaches to better delineate biology and pathogenesis of aerially transmitted human fungal pathogens. Efforts to generate and efficiently analyze the enormous amount of data produced by such novel approaches have to be pursued, and will potentially provide the patients suffering from chronic pulmonary diseases with a better management. This manuscript is part of the series of works presented at the "V International Workshop: Molecular genetic approaches to the study of human pathogenic fungi" (Oaxaca, Mexico, 2012). Copyright © 2013 Revista Iberoamericana de Micología. Published by Elsevier Espana. All rights reserved.

  14. The YNP Metagenome Project: Environmental Parameters Responsible for Microbial Distribution in the Yellowstone Geothermal Ecosystem

    William P. Inskeep

    2013-05-01

    Full Text Available The Yellowstone geothermal complex contains over 10,000 diverse geothermal features that host numerous phylogenetically deeply-rooted and poorly understood archaea, bacteria and viruses. Microbial communities in high-temperature environments are generally less diverse than soil, marine, sediment or lake habitats and therefore offer a tremendous opportunity for studying the structure and function of different model microbial communities using environmental metagenomics. One of the broader goals of this study was to establish linkages among microbial distribution, metabolic potential and environmental variables. Twenty geochemically distinct geothermal ecosystems representing a broad spectrum of Yellowstone hot-spring environments were used for metagenomic and geochemical analysis and included approximately equal numbers of: (1 phototrophic mats, (2 ‘filamentous streamer’ communities, and (3 archaeal-dominated sediments. The metagenomes were analyzed using a suite of complementary and integrative bioinformatic tools, including phylogenetic and functional analysis of both individual sequence reads and assemblies of predominant phylotypes. This volume identifies major environmental determinants of a large number of thermophilic microbial lineages, many of which have not been fully described in the literature nor previously cultivated to enable functional and genomic analyses. Moreover, protein family abundance comparisons and in-depth analyses of specific genes and metabolic pathways relevant to these hot-spring environments reveal hallmark signatures of metabolic capabilities that parallel the distribution of phylotypes across specific types of geochemical environments.

  15. The Metagenome of Utricularia gibba's Traps: Into the Microbial Input to a Carnivorous Plant

    Alcaraz, Luis David; Martínez-Sánchez, Shamayim; Torres, Ignacio; Ibarra-Laclette, Enrique; Herrera-Estrella, Luis

    2016-01-01

    The genome and transcriptome sequences of the aquatic, rootless, and carnivorous plant Utricularia gibba L. (Lentibulariaceae), were recently determined. Traps are necessary for U. gibba because they help the plant to survive in nutrient-deprived environments. The U. gibba's traps (Ugt) are specialized structures that have been proposed to selectively filter microbial inhabitants. To determine whether the traps indeed have a microbiome that differs, in composition or abundance, from the microbiome in the surrounding environment, we used whole-genome shotgun (WGS) metagenomics to describe both the taxonomic and functional diversity of the Ugt microbiome. We collected U. gibba plants from their natural habitat and directly sequenced the metagenome of the Ugt microbiome and its surrounding water. The total predicted number of species in the Ugt was more than 1,100. Using pan-genome fragment recruitment analysis, we were able to identify to the species level of some key Ugt players, such as Pseudomonas monteilii. Functional analysis of the Ugt metagenome suggests that the trap microbiome plays an important role in nutrient scavenging and assimilation while complementing the hydrolytic functions of the plant. PMID:26859489

  16. Novel resistance functions uncovered using functional metagenomic investigations of resistance reservoirs

    Erica C. Pehrsson

    2013-06-01

    Full Text Available Rates of infection with antibiotic-resistant bacteria have increased precipitously over the past several decades, with far-reaching healthcare and societal costs. Recent evidence has established a link between antibiotic resistance genes in human pathogens and those found in non-pathogenic, commensal, and environmental organisms, prompting deeper investigation of natural and human-associated reservoirs of antibiotic resistance. Functional metagenomic selections, in which shotgun-cloned DNA fragments are selected for their ability to confer survival to an indicator host, have been increasingly applied to the characterization of many antibiotic resistance reservoirs. These experiments have demonstrated that antibiotic resistance genes are highly diverse and widely distributed, many times bearing little to no similarity to known sequences. Through unbiased selections for survival to antibiotic exposure, functional metagenomics can improve annotations by reducing the discovery of false-positive resistance and by allowing for the identification of previously unrecognizable resistance genes. In this review, we summarize the novel resistance functions uncovered using functional metagenomic investigations of natural and human-impacted resistance reservoirs. Examples of novel antibiotic resistance genes include those highly divergent from known sequences, those for which sequence is entirely unable to predict resistance function, bifunctional resistance genes, and those with unconventional, atypical resistance mechanisms. Overcoming antibiotic resistance in the clinic will require a better understanding of existing resistance reservoirs and the dissemination networks that govern horizontal gene exchange, informing best practices to limit the spread of resistance-conferring genes to human pathogens.

  17. A robust and accurate binning algorithm for metagenomic sequences with arbitrary species abundance ratio.

    Leung, Henry C M; Yiu, S M; Yang, Bin; Peng, Yu; Wang, Yi; Liu, Zhihua; Chen, Jingchi; Qin, Junjie; Li, Ruiqiang; Chin, Francis Y L

    2011-06-01

    With the rapid development of next-generation sequencing techniques, metagenomics, also known as environmental genomics, has emerged as an exciting research area that enables us to analyze the microbial environment in which we live. An important step for metagenomic data analysis is the identification and taxonomic characterization of DNA fragments (reads or contigs) resulting from sequencing a sample of mixed species. This step is referred to as 'binning'. Binning algorithms that are based on sequence similarity and sequence composition markers rely heavily on the reference genomes of known microorganisms or phylogenetic markers. Due to the limited availability of reference genomes and the bias and low availability of markers, these algorithms may not be applicable in all cases. Unsupervised binning algorithms which can handle fragments from unknown species provide an alternative approach. However, existing unsupervised binning algorithms only work on datasets either with balanced species abundance ratios or rather different abundance ratios, but not both. In this article, we present MetaCluster 3.0, an integrated binning method based on the unsupervised top--down separation and bottom--up merging strategy, which can bin metagenomic fragments of species with very balanced abundance ratios (say 1:1) to very different abundance ratios (e.g. 1:24) with consistently higher accuracy than existing methods. MetaCluster 3.0 can be downloaded at http://i.cs.hku.hk/~alse/MetaCluster/.

  18. Metagenomic potential for and diversity of N-cycle driving microorganisms in the Bothnian Sea sediment.

    Rasigraf, Olivia; Schmitt, Julia; Jetten, Mike S M; Lüke, Claudia

    2017-08-01

    The biological nitrogen cycle is driven by a plethora of reactions transforming nitrogen compounds between various redox states. Here, we investigated the metagenomic potential for nitrogen cycle of the in situ microbial community in an oligotrophic, brackish environment of the Bothnian Sea sediment. Total DNA from three sediment depths was isolated and sequenced. The characterization of the total community was performed based on 16S rRNA gene inventory using SILVA database as reference. The diversity of diagnostic functional genes coding for nitrate reductases (napA;narG), nitrite:nitrate oxidoreductase (nxrA), nitrite reductases (nirK;nirS;nrfA), nitric oxide reductase (nor), nitrous oxide reductase (nosZ), hydrazine synthase (hzsA), ammonia monooxygenase (amoA), hydroxylamine oxidoreductase (hao), and nitrogenase (nifH) was analyzed by blastx against curated reference databases. In addition, Polymerase chain reaction (PCR)-based amplification was performed on the hzsA gene of anammox bacteria. Our results reveal high genomic potential for full denitrification to N 2 , but minor importance of anaerobic ammonium oxidation and dissimilatory nitrite reduction to ammonium. Genomic potential for aerobic ammonia oxidation was dominated by Thaumarchaeota. A higher diversity of anammox bacteria was detected in metagenomes than with PCR-based technique. The results reveal the importance of various N-cycle driving processes and highlight the advantage of metagenomics in detection of novel microbial key players. © 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.

  19. A Novel Cold Active Esterase from a Deep Sea Sponge Stelletta normani Metagenomic Library

    Erik Borchert

    2017-09-01

    Full Text Available Esterases catalyze the hydrolysis of ester bonds in fatty acid esters with short-chain acyl groups. Due to the widespread applications of lipolytic enzymes in various industrial applications, there continues to be an interest in novel esterases with unique properties. Marine ecosystems have long been acknowledged as a significant reservoir of microbial biodiversity and in particular of bacterial enzymes with desirable characteristics for industrial use, such as for example cold adaptation and activity in the alkaline pH range. We employed a functional metagenomic approach to exploit the enzymatic potential of one particular marine ecosystem, namely the microbiome of the deep sea sponge Stelletta normani. Screening of a metagenomics library from this sponge resulted in the identification of a number of lipolytic active clones. One of these encoded a highly, cold-active esterase 7N9, and the recombinant esterase was subsequently heterologously expressed in Escherichia coli. The esterase was classified as a type IV lipolytic enzyme, belonging to the GDSAG subfamily of hormone sensitive lipases. Furthermore, the recombinant 7N9 esterase was biochemically characterized and was found to be most active at alkaline pH (8.0 and displays salt tolerance over a wide range of concentrations. In silico docking studies confirmed the enzyme's activity toward short-chain fatty acids while also highlighting the specificity toward certain inhibitors. Furthermore, structural differences to a closely related mesophilic E40 esterase isolated from a marine sediment metagenomics library are discussed.

  20. Metagenomics of Bacterial Diversity in Villa Luz Caves with Sulfur Water Springs

    Giuseppe D’Auria

    2018-01-01

    Full Text Available New biotechnology applications require in-depth preliminary studies of biodiversity. The methods of massive sequencing using metagenomics and bioinformatics tools offer us sufficient and reliable knowledge to understand environmental diversity, to know new microorganisms, and to take advantage of their functional genes. Villa Luz caves, in the southern Mexican state of Tabasco, are fed by at least 26 groundwater inlets, containing 300–500 mg L-1 H2S and <0.1 mg L-1 O2. We extracted environmental DNA for metagenomic analysis of collected samples in five selected Villa Luz caves sites, with pH values from 2.5 to 7. Foreign organisms found in this underground ecosystem can oxidize H2S to H2SO4. These include: biovermiculites, a bacterial association that can grow on the rock walls; snottites, that are whitish, viscous biofilms hanging from the rock walls, and sacks or bags of phlegm, which live within the aquatic environment of the springs. Through the emergency food assistance program (TEFAP pyrosequencing, a total of 20,901 readings of amplification products from hypervariable regions V1 and V3 of 16S rRNA bacterial gene in whole and pure metagenomic DNA samples were generated. Seven bacterial phyla were identified. As a result, Proteobacteria was more frequent than Acidobacteria. Finally, acidophilic Proteobacteria was detected in UJAT5 sample

  1. Metagenomic exploration reveals a marked change in the river resistome and mobilome after treated wastewater discharges.

    Lekunberri, Itziar; Balcázar, José Luis; Borrego, Carles M

    2018-03-01

    Mobile genetic elements (MGEs) are key agents in the spread of antibiotic resistance genes (ARGs) across environments. Here we used metagenomics to compare the river resistome (collection of all ARGs) and mobilome (e.g., integrases, transposases, integron integrases and insertion sequence common region "ISCR" elements) between samples collected upstream (n = 6) and downstream (n = 6) of an urban wastewater treatment plant (UWWTP). In comparison to upstream metagenomes, downstream metagenomes showed a drastic increase in the abundance of ARGs, as well as markers of MGEs, particularly integron integrases and ISCR elements. These changes were accompanied by a concomitant prevalence of 16S rRNA gene signatures of bacteria affiliated to families encompassing well-known human and animal pathogens. Our results confirm that chronic discharges of treated wastewater severely impact the river resistome affecting not only the abundance and diversity of ARGs but also their potential spread by enriching the river mobilome in a wide variety of MGEs. Copyright © 2017 Elsevier Ltd. All rights reserved.

  2. Microbial Diversity and Biochemical Potential Encoded by Thermal Spring Metagenomes Derived from the Kamchatka Peninsula

    Bernd Wemheuer

    2013-01-01

    Full Text Available Volcanic regions contain a variety of environments suitable for extremophiles. This study was focused on assessing and exploiting the prokaryotic diversity of two microbial communities derived from different Kamchatkian thermal springs by metagenomic approaches. Samples were taken from a thermoacidophilic spring near the Mutnovsky Volcano and from a thermophilic spring in the Uzon Caldera. Environmental DNA for metagenomic analysis was isolated from collected sediment samples by direct cell lysis. The prokaryotic community composition was examined by analysis of archaeal and bacterial 16S rRNA genes. A total number of 1235 16S rRNA gene sequences were obtained and used for taxonomic classification. Most abundant in the samples were members of Thaumarchaeota, Thermotogae, and Proteobacteria. The Mutnovsky hot spring was dominated by the Terrestrial Hot Spring Group, Kosmotoga, and Acidithiobacillus. The Uzon Caldera was dominated by uncultured members of the Miscellaneous Crenarchaeotic Group and Enterobacteriaceae. The remaining 16S rRNA gene sequences belonged to the Aquificae, Dictyoglomi, Euryarchaeota, Korarchaeota, Thermodesulfobacteria, Firmicutes, and some potential new phyla. In addition, the recovered DNA was used for generation of metagenomic libraries, which were subsequently mined for genes encoding lipolytic and proteolytic enzymes. Three novel genes conferring lipolytic and one gene conferring proteolytic activity were identified.

  3. Evaluation of FTA ® paper for storage of oral meta-genomic DNA.

    Foitzik, Magdalena; Stumpp, Sascha N; Grischke, Jasmin; Eberhard, Jörg; Stiesch, Meike

    2014-10-01

    The purpose of the present study was to evaluate the short-term storage of meta-genomic DNA from native oral biofilms on FTA(®) paper. Thirteen volunteers of both sexes received an acrylic splint for intraoral biofilm formation over a period of 48 hours. The biofilms were collected, resuspended in phosphate-buffered saline, and either stored on FTA(®) paper or directly processed by standard laboratory DNA extraction. The nucleic acid extraction efficiencies were evaluated by 16S rDNA targeted SSCP fingerprinting. The acquired banding pattern of FTA-derived meta-genomic DNA was compared to a standard DNA preparation protocol. Sensitivity and positive predictive values were calculated. The volunteers showed inter-individual differences in their bacterial species composition. A total of 200 bands were found for both methods and 85% of the banding patterns were equal, representing a sensitivity of 0.941 and a false-negative predictive value of 0.059. Meta-genomic DNA sampling, extraction, and adhesion using FTA(®) paper is a reliable method for storage of microbial DNA for a short period of time.

  4. Fast and accurate taxonomic assignments of metagenomic sequences using MetaBin.

    Vineet K Sharma

    Full Text Available Taxonomic assignment of sequence reads is a challenging task in metagenomic data analysis, for which the present methods mainly use either composition- or homology-based approaches. Though the homology-based methods are more sensitive and accurate, they suffer primarily due to the time needed to generate the Blast alignments. We developed the MetaBin program and web server for better homology-based taxonomic assignments using an ORF-based approach. By implementing Blat as the faster alignment method in place of Blastx, the analysis time has been reduced by severalfold. It is benchmarked using both simulated and real metagenomic datasets, and can be used for both single and paired-end sequence reads of varying lengths (≥45 bp. To our knowledge, MetaBin is the only available program that can be used for the taxonomic binning of short reads (<100 bp with high accuracy and high sensitivity using a homology-based approach. The MetaBin web server can be used to carry out the taxonomic analysis, by either submitting reads or Blastx output. It provides several options including construction of taxonomic trees, creation of a composition chart, functional analysis using COGs, and comparative analysis of multiple metagenomic datasets. MetaBin web server and a standalone version for high-throughput analysis are available freely at http://metabin.riken.jp/.

  5. Challenges of metabolomics in human gut microbiota research.

    Smirnov, Kirill S; Maier, Tanja V; Walker, Alesia; Heinzmann, Silke S; Forcisi, Sara; Martinez, Inés; Walter, Jens; Schmitt-Kopplin, Philippe

    2016-08-01

    The review highlights the role of metabolomics in studying human gut microbial metabolism. Microbial communities in our gut exert a multitude of functions with huge impact on human health and disease. Within the meta-omics discipline, gut microbiome is studied by (meta)genomics, (meta)transcriptomics, (meta)proteomics and metabolomics. The goal of metabolomics research applied to fecal samples is to perform their metabolic profiling, to quantify compounds and classes of interest, to characterize small molecules produced by gut microbes. Nuclear magnetic resonance spectroscopy and mass spectrometry are main technologies that are applied in fecal metabolomics. Metabolomics studies have been increasingly used in gut microbiota related research regarding health and disease with main focus on understanding inflammatory bowel diseases. The elucidated metabolites in this field are summarized in this review. We also addressed the main challenges of metabolomics in current and future gut microbiota research. The first challenge reflects the need of adequate analytical tools and pipelines, including sample handling, selection of appropriate equipment, and statistical evaluation to enable meaningful biological interpretation. The second challenge is related to the choice of the right animal model for studies on gut microbiota. We exemplified this using NMR spectroscopy for the investigation of cross-species comparison of fecal metabolite profiles. Finally, we present the problem of variability of human gut microbiota and metabolome that has important consequences on the concepts of personalized nutrition and medicine. Copyright © 2016 Elsevier GmbH. All rights reserved.

  6. The human gut microbiome and its dysfunctions through the meta-omics prism.

    Mondot, Stanislas; Lepage, Patricia

    2016-05-01

    The microorganisms inhabiting the human gut are abundant (10(14) cells) and diverse (approximately 500 species per individual). It is now acknowledged that the microbiota has coevolved with its host to achieve a symbiotic relationship, leading to physiological homeostasis. The gut microbiota ensures vital functions, such as food digestibility, maturation of the host immune system, and protection against pathogens. Over the last few decades, the gut microbiota has also been associated with numerous diseases, such as inflammatory bowel disease, irritable bowel syndrome, obesity, and metabolic diseases. In most of these pathologies, a microbial dysbiosis has been found, indicating shifts in the taxonomic composition of the gut microbiota and changes in its functionality. Our understanding of the influence of the gut microbiota on human health is still growing. Working with microorganisms residing in the gut is challenging since most of them are anaerobic and a vast majority (approximately 75%) are uncultivable to date. Recently, a wide range of new approaches (meta-omics) has been developed to bypass the uncultivability and reveal the intricate mechanisms that sustain gut microbial homeostasis. After a brief description of these approaches (metagenomics, metatranscriptomics, metaproteomics, and metabolomics), this review will discuss the importance of considering the gut microbiome as a structured ecosystem and the use of meta-omics to decipher dysfunctions of the gut microbiome in diseases. © 2016 New York Academy of Sciences.

  7. Predicting Biological Information Flow in a Model Oxygen Minimum Zone

    Louca, S.; Hawley, A. K.; Katsev, S.; Beltran, M. T.; Bhatia, M. P.; Michiels, C.; Capelle, D.; Lavik, G.; Doebeli, M.; Crowe, S.; Hallam, S. J.

    2016-02-01

    Microbial activity drives marine biochemical fluxes and nutrient cycling at global scales. Geochemical measurements as well as molecular techniques such as metagenomics, metatranscriptomics and metaproteomics provide great insight into microbial activity. However, an integration of molecular and geochemical data into mechanistic biogeochemical models is still lacking. Recent work suggests that microbial metabolic pathways are, at the ecosystem level, strongly shaped by stoichiometric and energetic constraints. Hence, models rooted in fluxes of matter and energy may yield a holistic understanding of biogeochemistry. Furthermore, such pathway-centric models would allow a direct consolidation with meta'omic data. Here we present a pathway-centric biogeochemical model for the seasonal oxygen minimum zone in Saanich Inlet, a fjord off the coast of Vancouver Island. The model considers key dissimilatory nitrogen and sulfur fluxes, as well as the population dynamics of the genes that mediate them. By assuming a direct translation of biocatalyzed energy fluxes to biosynthesis rates, we make predictions about the distribution and activity of the corresponding genes. A comparison of the model to molecular measurements indicates that the model explains observed DNA, RNA, protein and cell depth profiles. This suggests that microbial activity in marine ecosystems such as oxygen minimum zones is well described by DNA abundance, which, in conjunction with geochemical constraints, determines pathway expression and process rates. Our work further demonstrates how meta'omic data can be mechanistically linked to environmental redox conditions and biogeochemical processes.

  8. The Impact of Global Warming on the Carbon Cycle of Arctic Permafrost: An Experimental and Field Based Study

    Onstott, Tullis C [Princeton University; Pffifner, Susan M; Chourey, Karuna [Oak Ridge National Laboratory

    2014-11-07

    Our results to date indicate that CO2 and CH4 fluxes from organic poor, Arctic cryosols on Axel Heiberg Island are net CH4 sinks and CO2 emitters in contrast to organic-rich peat deposits at sub-Arctic latitudes. This is based upon field observations and a 1.5 year long thawing experiment performed upon one meter long intact cores. The results of the core thawing experiments are in good agreement with field measurements. Metagenomic, metatranscriptomic and metaproteomic analyses indicate that high affinity aerobic methanotrophs belong to the uncultivated USCalpha are present in <1% abundance in these cryosols are are active in the field during the summer and in the core thawing experiments. The methanotrophs are 100 times more abundant than the methanogens. As a result mineral cryosols, which comprise 87% of Arctic tundra, are net methane sinks. Their presence and activity may account for the discrepancies observed between the atmospheric methane concentrations observed in the Arctic predicted by climate models and the observed seasonal fluctuations and decadal trends. This has not been done yet.

  9. Biomarkers for monitoring intestinal health in poultry: present status and future perspectives.

    Ducatelle, Richard; Goossens, Evy; De Meyer, Fien; Eeckhaut, Venessa; Antonissen, Gunther; Haesebrouck, Freddy; Van Immerseel, Filip

    2018-05-08

    Intestinal health is determined by host (immunity, mucosal barrier), nutritional, microbial and environmental factors. Deficiencies in intestinal health are associated with shifts in the composition of the intestinal microbiome (dysbiosis), leakage of the mucosal barrier and/or inflammation. Since the ban on growth promoting antimicrobials in animal feed, these dysbiosis-related problems have become a major issue, especially in intensive animal farming. The economical and animal welfare consequences are considerable. Consequently, there is a need for continuous monitoring of the intestinal health status, particularly in intensively reared animals, where the intestinal function is often pushed to the limit. In the current review, the recent advances in the field of intestinal health biomarkers, both in human and veterinary medicine are discussed, trying to identify present and future markers of intestinal health in poultry. The most promising new biomarkers will be stable molecules ending up in the feces and litter that can be quantified, preferably using rapid and simple pen-side tests. It is unlikely, however, that a single biomarker will be sufficient to follow up all aspects of intestinal health. Combinations of multiple biomarkers and/or metabarcoding, metagenomic, metatranscriptomic, metaproteomic and metabolomic approaches will be the way to go in the future. Candidate biomarkers currently are being investigated by many research groups, but the validation will be a major challenge, due to the complexity of intestinal health in the field.

  10. Bacterial tag encoded FLX titanium amplicon pyrosequencing (bTEFAP based assessment of prokaryotic diversity in metagenome of Lonar soda lake, India

    Pravin Dudhagara

    2015-06-01

    Full Text Available Bacterial diversity and archaeal diversity in metagenome of the Lonar soda lake sediment were assessed by bacterial tag-encoded FLX amplicon pyrosequencing (bTEFAP. Metagenome comprised 5093 sequences with 2,531,282 bp and 53 ± 2% G + C content. Metagenome sequence data are available at NCBI under the Bioproject database with accession no. PRJNA218849. Metagenome sequence represented the presence of 83.1% bacterial and 10.5% archaeal origin. A total of 14 different bacteria demonstrating 57 species were recorded with dominating species like Coxiella burnetii (17%, Fibrobacter intestinalis (12% and Candidatus Cloacamonas acidaminovorans (11%. Occurrence of two archaeal phyla representing 24 species, among them Methanosaeta harundinacea (35%, Methanoculleus chikugoensis (12% and Methanolinea tarda (11% were dominating species. Significant presence of 11% sequences as an unclassified indicated the possibilities for unknown novel prokaryotes from the metagenome.

  11. The Genomes OnLine Database (GOLD) v.4: status of genomic and metagenomic projects and their associated metadata

    Pagani, Ioanna; Liolios, Konstantinos; Jansson, Jakob; Chen, I-Min A.; Smirnova, Tatyana; Nosrat, Bahador; Markowitz, Victor M.; Kyrpides, Nikos C.

    2012-01-01

    The Genomes OnLine Database (GOLD, http://www.genomesonline.org/) is a comprehensive resource for centralized monitoring of genome and metagenome projects worldwide. Both complete and ongoing projects, along with their associated metadata, can be accessed in GOLD through precomputed tables and a search page. As of September 2011, GOLD, now on version 4.0, contains information for 11 472 sequencing projects, of which 2907 have been completed and their sequence data has been deposited in a public repository. Out of these complete projects, 1918 are finished and 989 are permanent drafts. Moreover, GOLD contains information for 340 metagenome studies associated with 1927 metagenome samples. GOLD continues to expand, moving toward the goal of providing the most comprehensive repository of metadata information related to the projects and their organisms/environments in accordance with the Minimum Information about any (x) Sequence specification and beyond. PMID:22135293

  12. High definition for systems biology of microbial communities: metagenomics gets genome-centric and strain-resolved.

    Turaev, Dmitrij; Rattei, Thomas

    2016-06-01

    The systems biology of microbial communities, organismal communities inhabiting all ecological niches on earth, has in recent years been strongly facilitated by the rapid development of experimental, sequencing and data analysis methods. Novel experimental approaches and binning methods in metagenomics render the semi-automatic reconstructions of near-complete genomes of uncultivable bacteria possible, while advances in high-resolution amplicon analysis allow for efficient and less biased taxonomic community characterization. This will also facilitate predictive modeling approaches, hitherto limited by the low resolution of metagenomic data. In this review, we pinpoint the most promising current developments in metagenomics. They facilitate microbial systems biology towards a systemic understanding of mechanisms in microbial communities with scopes of application in many areas of our daily life. Copyright © 2016 Elsevier Ltd. All rights reserved.

  13. The GAAS metagenomic tool and its estimations of viral and microbial average genome size in four major biomes.

    Angly, Florent E; Willner, Dana; Prieto-Davó, Alejandra; Edwards, Robert A; Schmieder, Robert; Vega-Thurber, Rebecca; Antonopoulos, Dionysios A; Barott, Katie; Cottrell, Matthew T; Desnues, Christelle; Dinsdale, Elizabeth A; Furlan, Mike; Haynes, Matthew; Henn, Matthew R; Hu, Yongfei; Kirchman, David L; McDole, Tracey; McPherson, John D; Meyer, Folker; Miller, R Michael; Mundt, Egbert; Naviaux, Robert K; Rodriguez-Mueller, Beltran; Stevens, Rick; Wegley, Linda; Zhang, Lixin; Zhu, Baoli; Rohwer, Forest

    2009-12-01

    Metagenomic studies characterize both the composition and diversity of uncultured viral and microbial communities. BLAST-based comparisons have typically been used for such analyses; however, sampling biases, high percentages of unknown sequences, and the use of arbitrary thresholds to find significant similarities can decrease the accuracy and validity of estimates. Here, we present Genome relative Abundance and Average Size (GAAS), a complete software package that provides improved estimates of community composition and average genome length for metagenomes in both textual and graphical formats. GAAS implements a novel methodology to control for sampling bias via length normalization, to adjust for multiple BLAST similarities by similarity weighting, and to select significant similarities using relative alignment lengths. In benchmark tests, the GAAS method was robust to both high percentages of unknown sequences and to variations in metagenomic sequence read lengths. Re-analysis of the Sargasso Sea virome using GAAS indicated that standard methodologies for metagenomic analysis may dramatically underestimate the abundance and importance of organisms with small genomes in environmental systems. Using GAAS, we conducted a meta-analysis of microbial and viral average genome lengths in over 150 metagenomes from four biomes to determine whether genome lengths vary consistently between and within biomes, and between microbial and viral communities from the same environment. Significant differences between biomes and within aquatic sub-biomes (oceans, hypersaline systems, freshwater, and microbialites) suggested that average genome length is a fundamental property of environments driven by factors at the sub-biome level. The behavior of paired viral and microbial metagenomes from the same environment indicated that microbial and viral average genome sizes are independent of each other, but indicative of community responses to stressors and environmental conditions.

  14. The GAAS metagenomic tool and its estimations of viral and microbial average genome size in four major biomes.

    Florent E Angly

    2009-12-01

    Full Text Available Metagenomic studies characterize both the composition and diversity of uncultured viral and microbial communities. BLAST-based comparisons have typically been used for such analyses; however, sampling biases, high percentages of unknown sequences, and the use of arbitrary thresholds to find significant similarities can decrease the accuracy and validity of estimates. Here, we present Genome relative Abundance and Average Size (GAAS, a complete software package that provides improved estimates of community composition and average genome length for metagenomes in both textual and graphical formats. GAAS implements a novel methodology to control for sampling bias via length normalization, to adjust for multiple BLAST similarities by similarity weighting, and to select significant similarities using relative alignment lengths. In benchmark tests, the GAAS method was robust to both high percentages of unknown sequences and to variations in metagenomic sequence read lengths. Re-analysis of the Sargasso Sea virome using GAAS indicated that standard methodologies for metagenomic analysis may dramatically underestimate the abundance and importance of organisms with small genomes in environmental systems. Using GAAS, we conducted a meta-analysis of microbial and viral average genome lengths in over 150 metagenomes from four biomes to determine whether genome lengths vary consistently between and within biomes, and between microbial and viral communities from the same environment. Significant differences between biomes and within aquatic sub-biomes (oceans, hypersaline systems, freshwater, and microbialites suggested that average genome length is a fundamental property of environments driven by factors at the sub-biome level. The behavior of paired viral and microbial metagenomes from the same environment indicated that microbial and viral average genome sizes are independent of each other, but indicative of community responses to stressors and

  15. Comparative metagenomic, phylogenetic and physiological analyses of soil microbial communities across nitrogen gradients.

    Fierer, Noah; Lauber, Christian L; Ramirez, Kelly S; Zaneveld, Jesse; Bradford, Mark A; Knight, Rob

    2012-05-01

    Terrestrial ecosystems are receiving elevated inputs of nitrogen (N) from anthropogenic sources and understanding how these increases in N availability affect soil microbial communities is critical for predicting the associated effects on belowground ecosystems. We used a suite of approaches to analyze the structure and functional characteristics of soil microbial communities from replicated plots in two long-term N fertilization experiments located in contrasting systems. Pyrosequencing-based analyses of 16S rRNA genes revealed no significant effects of N fertilization on bacterial diversity, but significant effects on community composition at both sites; copiotrophic taxa (including members of the Proteobacteria and Bacteroidetes phyla) typically increased in relative abundance in the high N plots, with oligotrophic taxa (mainly Acidobacteria) exhibiting the opposite pattern. Consistent with the phylogenetic shifts under N fertilization, shotgun metagenomic sequencing revealed increases in the relative abundances of genes associated with DNA/RNA replication, electron transport and protein metabolism, increases that could be resolved even with the shallow shotgun metagenomic sequencing conducted here (average of 75 000 reads per sample). We also observed shifts in the catabolic capabilities of the communities across the N gradients that were significantly correlated with the phylogenetic and metagenomic responses, indicating possible linkages between the structure and functioning of soil microbial communities. Overall, our results suggest that N fertilization may, directly or indirectly, induce a shift in the predominant microbial life-history strategies, favoring a more active, copiotrophic microbial community, a pattern that parallels the often observed replacement of K-selected with r-selected plant species with elevated N.

  16. Metagenomic Characterization of the Human Intestinal Microbiota in Fecal Samples from STEC-Infected Patients

    Federica Gigliucci

    2018-02-01

    Full Text Available The human intestinal microbiota is a homeostatic ecosystem with a remarkable impact on human health and the disruption of this equilibrium leads to an increased susceptibility to infection by numerous pathogens. In this study, we used shotgun metagenomic sequencing and two different bioinformatic approaches, based on mapping of the reads onto databases and on the reconstruction of putative draft genomes, to investigate possible changes in the composition of the intestinal microbiota in samples from patients with Shiga Toxin-producing E. coli (STEC infection compared to healthy and healed controls, collected during an outbreak caused by a STEC O26:H11 infection. Both the bioinformatic procedures used, produced similar result with a good resolution of the taxonomic profiles of the specimens. The stool samples collected from the STEC infected patients showed a lower abundance of the members of Bifidobacteriales and Clostridiales orders in comparison to controls where those microorganisms predominated. These differences seemed to correlate with the STEC infection although a flexion in the relative abundance of the Bifidobacterium genus, part of the Bifidobacteriales order, was observed also in samples from Crohn's disease patients, displaying a STEC-unrelated dysbiosis. The metagenomics also allowed to identify in the STEC positive samples, all the virulence traits present in the genomes of the STEC O26 that caused the outbreak as assessed through isolation of the epidemic strain and whole genome sequencing. The results shown represent a first evidence of the changes occurring in the intestinal microbiota of children in the course of STEC infection and indicate that metagenomics may be a promising tool for the culture-independent clinical diagnosis of the infection.

  17. Metagenomic Characterization of the Human Intestinal Microbiota in Fecal Samples from STEC-Infected Patients

    Gigliucci, Federica; von Meijenfeldt, F. A. Bastiaan; Knijn, Arnold; Michelacci, Valeria; Scavia, Gaia; Minelli, Fabio; Dutilh, Bas E.; Ahmad, Hamideh M.; Raangs, Gerwin C.; Friedrich, Alex W.; Rossen, John W. A.; Morabito, Stefano

    2018-01-01

    The human intestinal microbiota is a homeostatic ecosystem with a remarkable impact on human health and the disruption of this equilibrium leads to an increased susceptibility to infection by numerous pathogens. In this study, we used shotgun metagenomic sequencing and two different bioinformatic approaches, based on mapping of the reads onto databases and on the reconstruction of putative draft genomes, to investigate possible changes in the composition of the intestinal microbiota in samples from patients with Shiga Toxin-producing E. coli (STEC) infection compared to healthy and healed controls, collected during an outbreak caused by a STEC O26:H11 infection. Both the bioinformatic procedures used, produced similar result with a good resolution of the taxonomic profiles of the specimens. The stool samples collected from the STEC infected patients showed a lower abundance of the members of Bifidobacteriales and Clostridiales orders in comparison to controls where those microorganisms predominated. These differences seemed to correlate with the STEC infection although a flexion in the relative abundance of the Bifidobacterium genus, part of the Bifidobacteriales order, was observed also in samples from Crohn's disease patients, displaying a STEC-unrelated dysbiosis. The metagenomics also allowed to identify in the STEC positive samples, all the virulence traits present in the genomes of the STEC O26 that caused the outbreak as assessed through isolation of the epidemic strain and whole genome sequencing. The results shown represent a first evidence of the changes occurring in the intestinal microbiota of children in the course of STEC infection and indicate that metagenomics may be a promising tool for the culture-independent clinical diagnosis of the infection. PMID:29468143

  18. Filthy lucre: A metagenomic pilot study of microbes found on circulating currency in New York City.

    Julia M Maritz

    Full Text Available Paper currency by its very nature is frequently transferred from one person to another and represents an important medium for human contact with-and potential exchange of-microbes. In this pilot study, we swabbed circulating $1 bills obtained from a New York City bank in February (Winter and June (Summer 2013 and used shotgun metagenomic sequencing to profile the communities found on their surface. Using basic culture conditions, we also tested whether viable microbes could be recovered from bills.Shotgun metagenomics identified eukaryotes as the most abundant sequences on money, followed by bacteria, viruses and archaea. Eukaryotic assemblages were dominated by human, other metazoan and fungal taxa. The currency investigated harbored a diverse microbial population that was dominated by human skin and oral commensals, including Propionibacterium acnes, Staphylococcus epidermidis and Micrococcus luteus. Other taxa detected not associated with humans included Lactococcus lactis and Streptococcus thermophilus, microbes typically associated with dairy production and fermentation. Culturing results indicated that viable microbes can be isolated from paper currency.We conducted the first metagenomic characterization of the surface of paper money in the United States, establishing a baseline for microbes found on $1 bills circulating in New York City. Our results suggest that money amalgamates DNA from sources inhabiting the human microbiome, food, and other environmental inputs, some of which can be recovered as viable organisms. These monetary communities may be maintained through contact with human skin, and DNA obtained from money may provide a record of human behavior and health. Understanding these microbial profiles is especially relevant to public health as money could potentially mediate interpersonal transfer of microbes.

  19. Metagenomic Sequencing of Marine Periphyton: Taxonomic and Functional Insights into Biofilm Communities

    Kemal eSanli

    2015-10-01

    Full Text Available Periphyton communities are complex phototrophic, multispecies biofilms that develop on surfaces in aquatic environments. These communities harbor a large diversity of organisms comprising viruses, bacteria, algae, fungi, protozoans and metazoans. However, thus far the total biodiversity of periphyton has not been described. In this study, we use metagenomics to characterize periphyton communities from the marine environment of the Swedish west coast. Although we found approximately ten times more eukaryotic rRNA marker gene sequences compared to prokaryotic, the whole metagenome-based similarity searches showed that bacteria constitute the most abundant phyla in these biofilms. We show that marine periphyton encompass a range of heterotrophic and phototrophic organisms. Heterotrophic bacteria, including the majority of proteobacterial clades and Bacteroidetes, and eukaryotic macro-invertebrates were found to dominate periphyton. The phototrophic groups comprise Cyanobacteria and the alpha-proteobacterial genus Roseobacter, followed by different micro- and macro-algae. We also assess the metabolic pathways that predispose these communities to an attached lifestyle. Functional indicators of the biofilm form of life in periphyton involve genes coding for enzymes that catalyze the production and degradation of extracellular polymeric substances, mainly in the form of complex sugars such as starch and glycogen-like meshes together with chitin. Genes for 278 different transporter proteins were detected in the metagenome, constituting the most abundant protein complexes. Finally, genes encoding enzymes that participate in anaerobic pathways, such as denitrification and methanogenesis, were detected suggesting the presence of anaerobic or low-oxygen micro-zones within the biofilms.

  20. Genometa--a fast and accurate classifier for short metagenomic shotgun reads.

    Davenport, Colin F; Neugebauer, Jens; Beckmann, Nils; Friedrich, Benedikt; Kameri, Burim; Kokott, Svea; Paetow, Malte; Siekmann, Björn; Wieding-Drewes, Matthias; Wienhöfer, Markus; Wolf, Stefan; Tümmler, Burkhard; Ahlers, Volker; Sprengel, Frauke

    2012-01-01

    Metagenomic studies use high-throughput sequence data to investigate microbial communities in situ. However, considerable challenges remain in the analysis of these data, particularly with regard to speed and reliable analysis of microbial species as opposed to higher level taxa such as phyla. We here present Genometa, a computationally undemanding graphical user interface program that enables identification of bacterial species and gene content from datasets generated by inexpensive high-throughput short read sequencing technologies. Our approach was first verified on two simulated metagenomic short read datasets, detecting 100% and 94% of the bacterial species included with few false positives or false negatives. Subsequent comparative benchmarking analysis against three popular metagenomic algorithms on an Illumina human gut dataset revealed Genometa to attribute the most reads to bacteria at species level (i.e. including all strains of that species) and demonstrate similar or better accuracy than the other programs. Lastly, speed was demonstrated to be many times that of BLAST due to the use of modern short read aligners. Our method is highly accurate if bacteria in the sample are represented by genomes in the reference sequence but cannot find species absent from the reference. This method is one of the most user-friendly and resource efficient approaches and is thus feasible for rapidly analysing millions of short reads on a personal computer. The Genometa program, a step by step tutorial and Java source code are freely available from http://genomics1.mh-hannover.de/genometa/ and on http://code.google.com/p/genometa/. This program has been tested on Ubuntu Linux and Windows XP/7.