Kuo, Alan; Grigoriev, Igor
Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentous ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~;;10k genes. ~;;12percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~;;9percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also,>90percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. Weconclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.
Noort, V. van
This thesis focuses on developing comparative genomics methods in eukaryotes, with an emphasis on applications for gene function prediction and regulatory element detection. In the past, methods have been developed to predict functional associations between gene pairs in prokaryotes. The challenge
Phospholipid biosynthetic enzymes produce diverse molecular structures and are often present in multiple forms encoded by different genes. This work utilizes comparative genomics and phylogenetics for exploring the distribution, structure and evolution of phospholipid biosynthetic genes and pathways in 26 eukaryotic genomes. Although the basic structure of the pathways was formed early in eukaryotic evolution, the emerging picture indicates that individual enzyme families followed unique evolutionary courses. For example, choline and ethanolamine kinases and cytidylyltransferases emerged in ancestral eukaryotes, whereas, multiple forms of the corresponding phosphatidyltransferases evolved mainly in a lineage specific manner. Furthermore, several unicellular eukaryotes maintain bacterial-type enzymes and reactions for the synthesis of phosphatidylglycerol and cardiolipin. Also, base-exchange phosphatidylserine synthases are widespread and ancestral enzymes. The multiplicity of phospholipid biosynthetic enzymes has been largely generated by gene expansion in a lineage specific manner. Thus, these observations suggest that phospholipid biosynthesis has been an actively evolving system. Finally, comparative genomic analysis indicates the existence of novel phosphatidyltransferases and provides a candidate for the uncharacterized eukaryotic phosphatidylglycerol phosphate phosphatase.
Moore, Michael J; Dhingra, Amit; Soltis, Pamela S; Shaw, Regina; Farmerie, William G; Folta, Kevin M; Soltis, Douglas E
Background Plastid genome sequence information is vital to several disciplines in plant biology, including phylogenetics and molecular biology. The past five years have witnessed a dramatic increase in the number of completely sequenced plastid genomes, fuelled largely by advances in conventional Sanger sequencing technology. Here we report a further significant reduction in time and cost for plastid genome sequencing through the successful use of a newly available pyrosequencing platform, the Genome Sequencer 20 (GS 20) System (454 Life Sciences Corporation), to rapidly and accurately sequence the whole plastid genomes of the basal eudicot angiosperms Nandina domestica (Berberidaceae) and Platanus occidentalis (Platanaceae). Results More than 99.75% of each plastid genome was simultaneously obtained during two GS 20 sequence runs, to an average depth of coverage of 24.6× in Nandina and 17.3× in Platanus. The Nandina and Platanus plastid genomes shared essentially identical gene complements and possessed the typical angiosperm plastid structure and gene arrangement. To assess the accuracy of the GS 20 sequence, over 45 kilobases of sequence were generated for each genome using conventional sequencing. Overall error rates of 0.043% and 0.031% were observed in GS 20 sequence for Nandina and Platanus, respectively. More than 97% of all observed errors were associated with homopolymer runs, with ~60% of all errors associated with homopolymer runs of 5 or more nucleotides and ~50% of all errors associated with regions of extensive homopolymer runs. No substitution errors were present in either genome. Error rates were generally higher in the single-copy and noncoding regions of both plastid genomes relative to the inverted repeat and coding regions. Conclusion Highly accurate and essentially complete sequence information was obtained for the Nandina and Platanus plastid genomes using the GS 20 System. More importantly, the high accuracy observed in the GS 20 plastid
Farmerie William G
Full Text Available Abstract Background Plastid genome sequence information is vital to several disciplines in plant biology, including phylogenetics and molecular biology. The past five years have witnessed a dramatic increase in the number of completely sequenced plastid genomes, fuelled largely by advances in conventional Sanger sequencing technology. Here we report a further significant reduction in time and cost for plastid genome sequencing through the successful use of a newly available pyrosequencing platform, the Genome Sequencer 20 (GS 20 System (454 Life Sciences Corporation, to rapidly and accurately sequence the whole plastid genomes of the basal eudicot angiosperms Nandina domestica (Berberidaceae and Platanus occidentalis (Platanaceae. Results More than 99.75% of each plastid genome was simultaneously obtained during two GS 20 sequence runs, to an average depth of coverage of 24.6× in Nandina and 17.3× in Platanus. The Nandina and Platanus plastid genomes shared essentially identical gene complements and possessed the typical angiosperm plastid structure and gene arrangement. To assess the accuracy of the GS 20 sequence, over 45 kilobases of sequence were generated for each genome using conventional sequencing. Overall error rates of 0.043% and 0.031% were observed in GS 20 sequence for Nandina and Platanus, respectively. More than 97% of all observed errors were associated with homopolymer runs, with ~60% of all errors associated with homopolymer runs of 5 or more nucleotides and ~50% of all errors associated with regions of extensive homopolymer runs. No substitution errors were present in either genome. Error rates were generally higher in the single-copy and noncoding regions of both plastid genomes relative to the inverted repeat and coding regions. Conclusion Highly accurate and essentially complete sequence information was obtained for the Nandina and Platanus plastid genomes using the GS 20 System. More importantly, the high accuracy
Arkhipova, Irina R; Batzer, Mark A; Brosius, Juergen; Feschotte, Cédric; Moran, John V; Schmitz, Jürgen; Jurka, Jerzy
The third international conference on the genomic impact of eukaryotic transposable elements (TEs) was held 24 to 28 February 2012 at the Asilomar Conference Center, Pacific Grove, CA, USA. Sponsored in part by the National Institutes of Health grant 5 P41 LM006252, the goal of the conference was to bring together researchers from around the world who study the impact and mechanisms of TEs using multiple computational and experimental approaches. The meeting drew close to 170 attendees and included invited floor presentations on the biology of TEs and their genomic impact, as well as numerous talks contributed by young scientists. The workshop talks were devoted to computational analysis of TEs with additional time for discussion of unresolved issues. Also, there was ample opportunity for poster presentations and informal evening discussions. The success of the meeting reflects the important role of Repbase in comparative genomic studies, and emphasizes the need for close interactions between experimental and computational biologists in the years to come.
Arkhipova Irina R
Full Text Available Abstract The third international conference on the genomic impact of eukaryotic transposable elements (TEs was held 24 to 28 February 2012 at the Asilomar Conference Center, Pacific Grove, CA, USA. Sponsored in part by the National Institutes of Health grant 5 P41 LM006252, the goal of the conference was to bring together researchers from around the world who study the impact and mechanisms of TEs using multiple computational and experimental approaches. The meeting drew close to 170 attendees and included invited floor presentations on the biology of TEs and their genomic impact, as well as numerous talks contributed by young scientists. The workshop talks were devoted to computational analysis of TEs with additional time for discussion of unresolved issues. Also, there was ample opportunity for poster presentations and informal evening discussions. The success of the meeting reflects the important role of Repbase in comparative genomic studies, and emphasizes the need for close interactions between experimental and computational biologists in the years to come.
Fritz-Laylin, Lillian K.; Prochnik, Simon E.; Ginger, Michael L.; Dacks, Joel; Carpenter, Meredith L.; Field, Mark C.; Kuo, Alan; Paredez, Alex; Chapman, Jarrod; Pham, Jonathan; Shu, Shengqiang; Neupane, Rochak; Cipriano, Michael; Mancuso, Joel; Tu, Hank; Salamov, Asaf; Lindquist, Erika; Shapiro, Harris; Lucas, Susan; Grigoriev, Igor V.; Cande, W. Zacheus; Fulton, Chandler; Rokhsar, Daniel S.; Dawson, Scott C.
Genome sequences of diverse free-living protists are essential for understanding eukaryotic evolution and molecular and cell biology. The free-living amoeboflagellate Naegleria gruberi belongs to a varied and ubiquitous protist clade (Heterolobosea) that diverged from other eukaryotic lineages over a billion years ago. Analysis of the 15,727 protein-coding genes encoded by Naegleria's 41 Mb nuclear genome indicates a capacity for both aerobic respiration and anaerobic metabolism with concomitant hydrogen production, with fundamental implications for the evolution of organelle metabolism. The Naegleria genome facilitates substantially broader phylogenomic comparisons of free-living eukaryotes than previously possible, allowing us to identify thousands of genes likely present in the pan-eukaryotic ancestor, with 40% likely eukaryotic inventions. Moreover, we construct a comprehensive catalog of amoeboid-motility genes. The Naegleria genome, analyzed in the context of other protists, reveals a remarkably complex ancestral eukaryote with a rich repertoire of cytoskeletal, sexual, signaling, and metabolic modules.
Costantini, Maria; Alvarez-Valin, Fernando; Costantini, Susan; Cammarano, Rosalia; Bernardi, Giorgio
The genomes of multicellular eukaryotes are compartmentalized in mosaics of isochores, large and fairly homogeneous stretches of DNA that belong to a small number of families characterized by different average GC levels, by different gene concentration (that increase with GC), different chromatin structures, different replication timing in the cell cycle, and other different properties. A question raised by these basic results concerns how far back in evolution the compartmentalized organization of the eukaryotic genomes arose. In the present work we approached this problem by studying the compositional organization of the genomes from the unicellular eukaryotes for which full sequences are available, the sample used being representative. The average GC levels of the genomes from unicellular eukaryotes cover an extremely wide range (19%-60% GC) and the compositional patterns of individual genomes are extremely different but all genomes tested show a compositional compartmentalization. The average GC range of the genomes of unicellular eukaryotes is very broad (as broad as that of prokaryotes) and individual compositional patterns cover a very broad range from very narrow to very complex. Both features are not surprising for organisms that are very far from each other both in terms of phylogenetic distances and of environmental life conditions. Most importantly, all genomes tested, a representative sample of all supergroups of unicellular eukaryotes, are compositionally compartmentalized, a major difference with prokaryotes.
West, Patrick T; Probst, Alexander J; Grigoriev, Igor V; Thomas, Brian C; Banfield, Jillian F
Microbial eukaryotes are integral components of natural microbial communities, and their inclusion is critical for many ecosystem studies, yet the majority of published metagenome analyses ignore eukaryotes. In order to include eukaryotes in environmental studies, we propose a method to recover eukaryotic genomes from complex metagenomic samples. A key step for genome recovery is separation of eukaryotic and prokaryotic fragments. We developed a k -mer-based strategy, EukRep, for eukaryotic sequence identification and applied it to environmental samples to show that it enables genome recovery, genome completeness evaluation, and prediction of metabolic potential. We used this approach to test the effect of addition of organic carbon on a geyser-associated microbial community and detected a substantial change of the community metabolism, with selection against almost all candidate phyla bacteria and archaea and for eukaryotes. Near complete genomes were reconstructed for three fungi placed within the Eurotiomycetes and an arthropod. While carbon fixation and sulfur oxidation were important functions in the geyser community prior to carbon addition, the organic carbon-impacted community showed enrichment for secreted proteases, secreted lipases, cellulose targeting CAZymes, and methanol oxidation. We demonstrate the broader utility of EukRep by reconstructing and evaluating relatively high-quality fungal, protist, and rotifer genomes from complex environmental samples. This approach opens the way for cultivation-independent analyses of whole microbial communities. © 2018 West et al.; Published by Cold Spring Harbor Laboratory Press.
Ciobanu, Doina; Clum, Alicia; Singh, Vasanth; Salamov, Asaf; Han, James; Copeland, Alex; Grigoriev, Igor; James, Timothy; Singer, Steven; Woyke, Tanja; Malmstrom, Rex; Cheng, Jan-Fang
Despite their small size, unicellular eukaryotes have complex genomes with a high degree of plasticity that allow them to adapt quickly to environmental changes. Unicellular eukaryotes live with prokaryotes and higher eukaryotes, frequently in symbiotic or parasitic niches. To this day their contribution to the dynamics of the environmental communities remains to be understood. Unfortunately, the vast majority of eukaryotic microorganisms are either uncultured or unculturable, making genome sequencing impossible using traditional approaches. We have developed an approach to isolate unicellular eukaryotes of interest from environmental samples, and to sequence and analyze their genomes and transcriptomes. We have tested our methods with six species: an uncharacterized protist from cellulose-enriched compost identified as Platyophrya, a close relative of P. vorax; the fungus Metschnikowia bicuspidate, a parasite of water flea Daphnia; the mycoparasitic fungi Piptocephalis cylindrospora, a parasite of Cokeromyces and Mucor; Caulochytrium protosteloides, a parasite of Sordaria; Rozella allomycis, a parasite of the water mold Allomyces; and the microalgae Chlamydomonas reinhardtii. Here, we present the four components of our approach: pre-sequencing methods, sequence analysis for single cell genome assembly, sequence analysis of single cell transcriptomes, and genome annotation. This technology has the potential to uncover the complexity of single cell eukaryotes and their role in the environmental samples.
Bricheux, Geneviève; Morin, Loïc; Le Moal, Gwenaël; Coffe, Gérard; Balestrino, Damien; Charbonnel, Nicolas; Bohatier, Jacques; Forestier, Christiane
Despite the recent and significant increase in the study of aquatic microbial communities, little is known about the microbial diversity of complex ecosystems such as running waters. This study investigated the biodiversity of biofilm communities formed in a river with 454 Sequencing™. This river has the particularity of integrating both organic and microbiological pollution, as receiver of agricultural pollution in its upstream catchment area and urban pollution through discharges of the wastewater treatment plant of the town of Billom. Different regions of the small subunit (SSU) ribosomal RNA gene were targeted using nine pairs of primers, either universal or specific for bacteria, eukarya, or archaea. Our aim was to characterize the widest range of rDNA sequences using different sets of polymerase chain reaction (PCR) primers. A first look at reads abundance revealed that a large majority (47-48%) were rare sequences (<5 copies). Prokaryotic phyla represented the species richness, and eukaryotic phyla accounted for a small part. Among the prokaryotic phyla, Proteobacteria (beta and alpha) predominated, followed by Bacteroidetes together with a large number of nonaffiliated bacterial sequences. Bacillariophyta plastids were abundant. The remaining bacterial phyla, Verrucomicrobia and Cyanobacteria, made up the rest of the bulk biodiversity. The most abundant eukaryotic phyla were annelid worms, followed by Diatoms, and Chlorophytes. These latter phyla attest to the abundance of plastids and the importance of photosynthetic activity for the biofilm. These findings highlight the existence and plasticity of multiple trophic levels within these complex biological systems. © 2013 The Authors. Microbiology Open published by John Wiley & Sons Ltd.
Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G
The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.
Full Text Available The ability to predict gene content is highly desirable for characterization of not-yet sequenced genomes like those of dinoflagellates. Using data from completely sequenced and annotated genomes from phylogenetically diverse lineages, we investigated the relationship between gene content and genome size using regression analyses. Distinct relationships between log(10-transformed protein-coding gene number (Y' versus log(10-transformed genome size (X', genome size in kbp were found for eukaryotes and non-eukaryotes. Eukaryotes best fit a logarithmic model, Y' = ln(-46.200+22.678X', whereas non-eukaryotes a linear model, Y' = 0.045+0.977X', both with high significance (p0.91. Total gene number shows similar trends in both groups to their respective protein coding regressions. The distinct correlations reflect lower and decreasing gene-coding percentages as genome size increases in eukaryotes (82%-1% compared to higher and relatively stable percentages in prokaryotes and viruses (97%-47%. The eukaryotic regression models project that the smallest dinoflagellate genome (3x10(6 kbp contains 38,188 protein-coding (40,086 total genes and the largest (245x10(6 kbp 87,688 protein-coding (92,013 total genes, corresponding to 1.8% and 0.05% gene-coding percentages. These estimates do not likely represent extraordinarily high functional diversity of the encoded proteome but rather highly redundant genomes as evidenced by high gene copy numbers documented for various dinoflagellate species.
Full Text Available Abstract Background The number of completed eukaryotic genome sequences and cDNA projects has increased exponentially in the past few years although most of them have not been published yet. In addition, many microarray analyses yielded thousands of sequenced EST and cDNA clones. For the researcher interested in single gene analyses (from a phylogenetic, a structural biology or other perspective it is therefore important to have up-to-date knowledge about the various resources providing primary data. Description The database is built around 3 central tables: species, sequencing projects and publications. The species table contains commonly and alternatively used scientific names, common names and the complete taxonomic information. For projects the sequence type and links to species project web-sites and species homepages are stored. All publications are linked to projects. The web-interface provides comprehensive search modules with detailed options and three different views of the selected data. We have especially focused on developing an elaborate taxonomic tree search tool that allows the user to instantaneously identify e.g. the closest relative to the organism of interest. Conclusion We have developed a database, called diArk, to store, organize, and present the most relevant information about completed genome projects and EST/cDNA data from eukaryotes. Currently, diArk provides information about 415 eukaryotes, 823 sequencing projects, and 248 publications.
Felder, Marius; Romualdi, Alessandro; Petzold, Andreas; Platzer, Matthias; Sühnel, Jürgen; Glöckner, Gernot
Many sequence data repositories can give a quick and easily accessible overview on genomes and their annotations. Less widespread is the possibility to compare related genomes with each other in a common database environment. We have previously described the GenColors database system (http://gencolors.fli-leibniz.de) and its applications to a number of bacterial genomes such as Borrelia, Legionella, Leptospira and Treponema. This system has an emphasis on genome comparison. It combines data from related genomes and provides the user with an extensive set of visualization and analysis tools. Eukaryote genomes are normally larger than prokaryote genomes and thus pose additional challenges for such a system. We have, therefore, adapted GenColors to also handle larger datasets of small eukaryotic genomes and to display eukaryotic gene structures. Further recent developments include whole genome views, genome list options and, for bacterial genome browsers, the display of horizontal gene transfer predictions. Two new GenColors-based databases for two fungal species (http://fgb.fli-leibniz.de) and for four social amoebas (http://sacgb.fli-leibniz.de) were set up. Both new resources open up a single entry point for related genomes for the amoebozoa and fungal research communities and other interested users. Comparative genomics approaches are greatly facilitated by these resources.
Kramerov, D A; Vassetzky, N S
Short interspersed elements (SINEs) are one of the two most prolific mobile genomic elements in most of the higher eukaryotes. Although their biology is still not thoroughly understood, unusual life cycle of these simple elements amplified as genomic parasites makes their evolution unique in many ways. In contrast to most genetic elements including other transposons, SINEs emerged de novo many times in evolution from available molecules (for example, tRNA). The involvement of reverse transcription in their amplification cycle, huge number of genomic copies and modular structure allow variation mechanisms in SINEs uncommon or rare in other genetic elements (module exchange between SINE families, dimerization, and so on.). Overall, SINE evolution includes their emergence, progressive optimization and counteraction to the cell's defense against mobile genetic elements.
Bonnet, Amandine; Grosso, Ana R; Elkaoutari, Abdessamad; Coleno, Emeline; Presle, Adrien; Sridhara, Sreerama C; Janbon, Guilhem; Géli, Vincent; de Almeida, Sérgio F; Palancade, Benoit
Transcription is a source of genetic instability that can notably result from the formation of genotoxic DNA:RNA hybrids, or R-loops, between the nascent mRNA and its template. Here we report an unexpected function for introns in counteracting R-loop accumulation in eukaryotic genomes. Deletion of endogenous introns increases R-loop formation, while insertion of an intron into an intronless gene suppresses R-loop accumulation and its deleterious impact on transcription and recombination in yeast. Recruitment of the spliceosome onto the mRNA, but not splicing per se, is shown to be critical to attenuate R-loop formation and transcription-associated genetic instability. Genome-wide analyses in a number of distant species differing in their intron content, including human, further revealed that intron-containing genes and the intron-richest genomes are best protected against R-loop accumulation and subsequent genetic instability. Our results thereby provide a possible rationale for the conservation of introns throughout the eukaryotic lineage. Copyright © 2017 Elsevier Inc. All rights reserved.
Hu, Zhiqiang; Sun, Chen; Lu, Kuang-Chen; Chu, Xixia; Zhao, Yue; Lu, Jinyuan; Shi, Jianxin; Wei, Chaochun
Pan-genome analyses are routinely carried out for bacteria to interpret the within-species gene presence/absence variations (PAVs). However, pan-genome analyses are rare for eukaryotes due to the large sizes and higher complexities of their genomes. Here we proposed EUPAN, a eukaryotic pan-genome analysis toolkit, enabling automatic large-scale eukaryotic pan-genome analyses and detection of gene PAVs at a relatively low sequencing depth. In the previous studies, we demonstrated the effectiveness and high accuracy of EUPAN in the pan-genome analysis of 453 rice genomes, in which we also revealed widespread gene PAVs among individual rice genomes. Moreover, EUPAN can be directly applied to the current re-sequencing projects primarily focusing on single nucleotide polymorphisms. EUPAN is implemented in Perl, R and C ++. It is supported under Linux and preferred for a computer cluster with LSF and SLURM job scheduling system. EUPAN together with its standard operating procedure (SOP) is freely available for non-commercial use (CC BY-NC 4.0) at http://cgm.sjtu.edu.cn/eupan/index.html . email@example.com or firstname.lastname@example.org. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: email@example.com
Full Text Available Next generation sequencing of ribosomal DNA is increasingly used to assess the diversity and structure of microbial communities. Here we test the ability of 454 pyrosequencing to detect the number of species present, and assess the relative abundance in terms of cell numbers and biomass of protists in the phylum Haptophyta. We used a mock community consisting of equal number of cells of 11 haptophyte species and compared targeting DNA and RNA/cDNA, and two different V4 SSU rDNA haptophyte-biased primer pairs. Further, we tested four different bioinformatic filtering methods to reduce errors in the resulting sequence dataset. With sequencing depth of 11000-20000 reads and targeting cDNA with Haptophyta specific primers Hap454 we detected all 11 species. A rarefaction analysis of expected number of species recovered as a function of sampling depth suggested that minimum 1400 reads were required here to recover all species in the mock community. Relative read abundance did not correlate to relative cell numbers. Although the species represented with the largest biomass was also proportionally most abundant among the reads, there was generally a weak correlation between proportional read abundance and proportional biomass of the different species, both with DNA and cDNA as template. The 454 sequencing generated considerable spurious diversity, and more with cDNA than DNA as template. With initial filtering based only on match with barcode and primer we observed 100-fold more operational taxonomic units (OTUs at 99% similarity than the number of species present in the mock community. Filtering based on quality scores, or denoising with PyroNoise resulted in ten times more OTU99% than the number of species. Denoising with AmpliconNoise reduced the number of OTU99% to match the number of species present in the mock community. Based on our analyses, we propose a strategy to more accurately depict haptophyte diversity using 454 pyrosequencing.
Full Text Available To address whether there are differences of variation among repeat motif types and among taxonomic groups, we present here an analysis of variation and correlation of dinucleotide microsatellite repeats in eukaryotic genomes. Ten taxonomic groups were compared, those being primates, mammalia (excluding primates and rodentia, rodentia, birds, fish, amphibians and reptiles, insects, molluscs, plants and fungi, respectively. The data used in the analysis is from the literature published in the Journal of Molecular Ecology Notes. Analysis of variation reveals that there are no significant differences between AC and AG repeat motif types. Moreover, the number of alleles correlates positively with the copy number in both AG and AC repeats. Similar conclusions can be obtained from each taxonomic group. These results strongly suggest that the increase of SSR variation is almost linear with the increase of the copy number of each repeat motif. As well, the results suggest that the variability of SSR in the genomes of low-ranking species seem to be more than that of high-ranking species, excluding primates and fungi.
Sheveleva, Anna; Kudryavtseva, Anna; Speranskaya, Anna; Belenikin, Maxim; Melnikova, Natalia; Chirkov, Sergei
The near-complete (99.7 %) genome sequence of a novel Russian Plum pox virus (PPV) isolate Pk, belonging to the strain Winona (W), has been determined by 454 pyrosequencing with the exception of the thirty-one 5'-terminal nucleotides. This region was amplified using 5'RACE kit and sequenced by the Sanger method. Genomic RNA released from immunocaptured PPV particles was employed for generation of cDNA library using TransPlex Whole transcriptome amplification kit (WTA2, Sigma-Aldrich). The entire Pk genome has identity level of 92.8-94.5 % when compared to the complete nucleotide sequences of other PPV-W isolates (W3174, LV-141pl, LV-145bt, and UKR 44189), confirming a high degree of variability within the PPV-W strain. The isolates Pk and LV-141pl are most closely related. The Pk has been found in a wild plum (Prunus domestica) in a new region of Russia indicating widespread dissemination of the PPV-W strain in the European part of the former USSR.
Galagan James E
Full Text Available Abstract Background Natural selection has traditionally been understood as a force responsible for pushing genes to states of higher translational efficiency, whereas lower translational efficiency has been explained by neutral mutation and genetic drift. We looked for evidence of directional selection resulting in increased unpreferred codon usage (and presumably reduced translational efficiency in three divergent clusters of eukaryotic genomes using a simple optimal-codon-based metric (Kp/Ku. Results Here we show that for some genes natural selection is indeed responsible for causing accelerated unpreferred codon substitution, and document the scope of this selection. In Cryptococcus and to a lesser extent Drosophila, we find many genes showing a statistically significant signal of selection for unpreferred codon usage in one or more lineages. We did not find evidence for this type of selection in Saccharomyces. The signal of positive selection observed from unpreferred synonymous codon substitutions is coincident in Cryptococcus and Drosophila with the distribution of upstream open reading frames (uORFs, another genic feature known to reduce translational efficiency. Functional enrichment analysis of genes exhibiting low Kp/Ku ratios reveals that genes in regulatory roles are particularly subject to this type of selection. Conclusion Through genome-wide scans, we find recent selection for unpreferred codon usage at approximately 1% of genetic loci in a Cryptococcus and several genes in Drosophila. Unpreferred codons can impede translation efficiency, and we find that genes with translation-impeding uORFs are enriched for this selection signal. We find that regulatory genes are particularly likely to be subject to selection for unpreferred codon usage. Given that expression noise can propagate through regulatory cascades, and that low translational efficiency can reduce expression noise, this finding supports the hypothesis that translational
Lukeš, Julius; López-García, P.; Louis, E.; Boekhout, T.
Roč. 167, č. 3 (2016), s. 217-221 ISSN 1434-4610 Institutional support: RVO:60077344 Keywords : protist * eukaryotic microorganisms * genomics Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 2.794, year: 2016
Zhu, Shijia; Beaulaurier, John; Deikus, Gintaras; Wu, Tao; Strahl, Maya; Hao, Ziyang; Luo, Guanzheng; Gregory, James A; Chess, Andrew; He, Chuan; Xiao, Andrew; Sebra, Robert; Schadt, Eric E; Fang, Gang
N6-methyladenine (m6dA) has been discovered as a novel form of DNA methylation prevalent in eukaryotes, however, methods for high resolution mapping of m6dA events are still lacking. Single-molecule real-time (SMRT) sequencing has enabled the detection of m6dA events at single-nucleotide resolution in prokaryotic genomes, but its application to detecting m6dA in eukaryotic genomes has not been rigorously examined. Herein, we identified unique characteristics of eukaryotic m6dA methylomes that fundamentally differ from those of prokaryotes. Based on these differences, we describe the first approach for mapping m6dA events using SMRT sequencing specifically designed for the study of eukaryotic genomes, and provide appropriate strategies for designing experiments and carrying out sequencing in future studies. We apply the novel approach to study two eukaryotic genomes. For green algae, we construct the first complete genome-wide map of m6dA at single nucleotide and single molecule resolution. For human lymphoblastoid cells (hLCLs), joint analyses of SMRT sequencing and independent sequencing data suggest that putative m6dA events are enriched in the promoters of young, full length LINE-1 elements (L1s). These analyses demonstrate a general method for rigorous mapping and characterization of m6dA events in eukaryotic genomes. Published by Cold Spring Harbor Laboratory Press.
Qiu, Guo-Hua; Yang, Xiaoyan; Zheng, Xintian; Huang, Cuiqin
Traditionally, the genome has been described as the 'book of life'. However, the metaphor of a book may not reflect the dynamic nature of the structure and function of the genome. In the eukaryotic genome, the number of centrally located protein-coding sequences is relatively constant across species, but the amount of noncoding DNA increases considerably with the increase of organismal evolutional complexity. Therefore, it has been hypothesized that the abundant peripheral noncoding DNA protects the genome and the central protein-coding sequences in the eukaryotic genome. Upon comparison with the habitation, sociality and defense mechanisms of a social insect colony, it is found that the genome is similar to a social insect colony in various aspects. A social insect colony may thus be a better metaphor than a book to describe the spatial organization and physical functions of the genome. The potential implications of the metaphor are also discussed.
Full Text Available Abstract Background Microsatellites have immense utility as molecular markers in different fields like genome characterization and mapping, phylogeny and evolutionary biology. Existing microsatellite databases are of limited utility for experimental and computational biologists with regard to their content and information output. EuMicroSatdb (Eukaryotic MicroSatellite database http://ipu.ac.in/usbt/EuMicroSatdb.htm is a web based relational database for easy and efficient positional mining of microsatellites from sequenced eukaryotic genomes. Description A user friendly web interface has been developed for microsatellite data retrieval using Active Server Pages (ASP. The backend database codes for data extraction and assembly have been written using Perl based scripts and C++. Precise need based microsatellites data retrieval is possible using different input parameters like microsatellite type (simple perfect or compound perfect, repeat unit length (mono- to hexa-nucleotide, repeat number, microsatellite length and chromosomal location in the genome. Furthermore, information about clustering of different microsatellites in the genome can also be retrieved. Finally, to facilitate primer designing for PCR amplification of any desired microsatellite locus, 200 bp upstream and downstream sequences are provided. Conclusion The database allows easy systematic retrieval of comprehensive information about simple and compound microsatellites, microsatellite clusters and their locus coordinates in 31 sequenced eukaryotic genomes. The information content of the database is useful in different areas of research like gene tagging, genome mapping, population genetics, germplasm characterization and in understanding microsatellite dynamics in eukaryotic genomes.
Vandepoele, Klaas; Van Bel, Michiel; Richard, Guilhem; Van Landeghem, Sofie; Verhelst, Bram; Moreau, Hervé; Van de Peer, Yves; Grimsley, Nigel; Piganeau, Gwenael
With the advent of next generation genome sequencing, the number of sequenced algal genomes and transcriptomes is rapidly growing. Although a few genome portals exist to browse individual genome sequences, exploring complete genome information from multiple species for the analysis of user-defined sequences or gene lists remains a major challenge. pico-PLAZA is a web-based resource (http://bioinformatics.psb.ugent.be/pico-plaza/) for algal genomics that combines different data types with intuitive tools to explore genomic diversity, perform integrative evolutionary sequence analysis and study gene functions. Apart from homologous gene families, multiple sequence alignments, phylogenetic trees, Gene Ontology, InterPro and text-mining functional annotations, different interactive viewers are available to study genome organization using gene collinearity and synteny information. Different search functions, documentation pages, export functions and an extensive glossary are available to guide non-expert scientists. To illustrate the versatility of the platform, different case studies are presented demonstrating how pico-PLAZA can be used to functionally characterize large-scale EST/RNA-Seq data sets and to perform environmental genomics. Functional enrichments analysis of 16 Phaeodactylum tricornutum transcriptome libraries offers a molecular view on diatom adaptation to different environments of ecological relevance. Furthermore, we show how complementary genomic data sources can easily be combined to identify marker genes to study the diversity and distribution of algal species, for example in metagenomes, or to quantify intraspecific diversity from environmental strains. © 2013 John Wiley & Sons Ltd and Society for Applied Microbiology.
Permal, Emmanuelle; Flutre, Timothée; Quesneville, Hadi
Current high-throughput techniques have made it feasible to sequence even the genomes of non-model organisms. However, the annotation process now represents a bottleneck to genome analysis, especially when dealing with transposable elements (TE). Combined approaches, using both de novo and knowledge-based methods to detect TEs, are likely to produce reasonably comprehensive and sensitive results. This chapter provides a roadmap for researchers involved in genome projects to address this issue. At each step of the TE annotation process, from the identification of TE families to the annotation of TE copies, we outline the tools and good practices to be used.
Pedersen, Anders Gorm; Nielsen, Henrik
Translation in eukaryotes does not always start at the first AUG in an mRNA, implying that context information also plays a role.This makes prediction of translation initiation sites a non-trivial task, especially when analysing EST and genome data where the entire mature mRNA sequence is not known...
Hameister, Jörn; Helm, Werner E.; Hütt, Marc-Thorsten; Dehnert, Manuel
Attempts to identify a species on the basis of its DNA sequence on purely statistical grounds have been formulated for more than a decade. Solving this problem could have a huge impact on understanding processes of genome evolution and on the design of classification schemes for DNA sequences.
Das, Mitali; Singh, Sunita; Pradhan, Satyajit; Narayan, Gopeshwar
As a crucial component of DNA replication licensing system, minichromosome maintenance (MCM) 2-7 complex acts as the eukaryotic DNA replicative helicase. The six related MCM proteins form a heterohexamer and bind with ORC, CDC6, and Cdt1 to form the prereplication complex. Although the MCMs are well known as replicative helicases, their overabundance and distribution patterns on chromatin present a paradox called the "MCM paradox." Several approaches had been taken to solve the MCM paradox and describe the purpose of excess MCMs distributed beyond the replication origins. Alternative functions of these MCMs rather than a helicase had also been proposed. This review focuses on several models and concepts generated to solve the MCM paradox coinciding with their helicase function and provides insight into the concept that excess MCMs are meant for licensing dormant origins as a backup during replication stress. Finally, we extend our view towards the effect of alteration of MCM level. Though an excess MCM constituent is needed for normal cells to withstand stress, there must be a delineation of the threshold level in normal and malignant cells. This review also outlooks the future prospects to better understand the MCM biology.
Zhabinskaya, Dina; Benham, Craig; Madden, Sally
We analyze the competitive transitions to alternate secondary DNA structures in a negatively supercoiled DNA molecule of kilobase length and specified base sequence. We use statistical mechanics to calculate the competition among all regions within the sequence that are susceptible to transitions to alternate structures. We use an approximate numerical method since the calculation of an exact partition function is numerically cumbersome for DNA molecules of lengths longer than hundreds of base pairs. This method yields accurate results in reasonable computational times. We implement algorithms that calculate the competition between transitions to denatured states and to Z-form DNA. We analyze these transitions near the transcription start sites (TSS) of a set of eukaryotic genes. We find an enhancement of Z-forming regions upstream of the TSS and a depletion of denatured regions around the start sites. We confirm that these finding are statistically significant by comparing our results to a set of randomized genes with preserved base composition at each position relative to the gene start sites. When we study the correlation of these transitions in orthologous mouse and human genes we find a clear evolutionary conservation of both types of transitions around the TSS.
Kaplan, Noam; Moore, Irene K; Fondufe-Mittendorf, Yvonne; Gossett, Andrea J; Tillo, Desiree; Field, Yair; LeProust, Emily M; Hughes, Timothy R; Lieb, Jason D; Widom, Jonathan; Segal, Eran
Nucleosome organization is critical for gene regulation. In living cells this organization is determined by multiple factors, including the action of chromatin remodellers, competition with site-specific DNA-binding proteins, and the DNA sequence preferences of the nucleosomes themselves. However, it has been difficult to estimate the relative importance of each of these mechanisms in vivo, because in vivo nucleosome maps reflect the combined action of all influencing factors. Here we determine the importance of nucleosome DNA sequence preferences experimentally by measuring the genome-wide occupancy of nucleosomes assembled on purified yeast genomic DNA. The resulting map, in which nucleosome occupancy is governed only by the intrinsic sequence preferences of nucleosomes, is similar to in vivo nucleosome maps generated in three different growth conditions. In vitro, nucleosome depletion is evident at many transcription factor binding sites and around gene start and end sites, indicating that nucleosome depletion at these sites in vivo is partly encoded in the genome. We confirm these results with a micrococcal nuclease-independent experiment that measures the relative affinity of nucleosomes for approximately 40,000 double-stranded 150-base-pair oligonucleotides. Using our in vitro data, we devise a computational model of nucleosome sequence preferences that is significantly correlated with in vivo nucleosome occupancy in Caenorhabditis elegans. Our results indicate that the intrinsic DNA sequence preferences of nucleosomes have a central role in determining the organization of nucleosomes in vivo.
Marques, Catarina A; Dickens, Nicholas J; Paape, Daniel; Campbell, Samantha J; McCulloch, Richard
DNA replication initiates on defined genome sites, termed origins. Origin usage appears to follow common rules in the eukaryotic organisms examined to date: all chromosomes are replicated from multiple origins, which display variations in firing efficiency and are selected from a larger pool of potential origins. To ask if these features of DNA replication are true of all eukaryotes, we describe genome-wide origin mapping in the parasite Leishmania. Origin mapping in Leishmania suggests a striking divergence in origin usage relative to characterized eukaryotes, since each chromosome appears to be replicated from a single origin. By comparing two species of Leishmania, we find evidence that such origin singularity is maintained in the face of chromosome fusion or fission events during evolution. Mapping Leishmania origins suggests that all origins fire with equal efficiency, and that the genomic sites occupied by origins differ from related non-origins sites. Finally, we provide evidence that origin location in Leishmania displays striking conservation with Trypanosoma brucei, despite the latter parasite replicating its chromosomes from multiple, variable strength origins. The demonstration of chromosome replication for a single origin in Leishmania, a microbial eukaryote, has implications for the evolution of origin multiplicity and associated controls, and may explain the pervasive aneuploidy that characterizes Leishmania chromosome architecture.
Deng, Feilong; Chen, Shi-Yi; Wu, Zhou-Lin; Hu, Yongsong; Jia, Xianbo; Lai, Song-Jia
Owing to wide application of RNA sequencing (RNA-seq) technology, more and more eukaryotic genomes have been extensively annotated, such as the gene structure, alternative splicing, and noncoding loci. Annotation information of genome is prevalently stored as plain text in General Feature Format (GFF), which could be hundreds or thousands Mb in size. Therefore, it is a challenge for manipulating GFF file for biologists who have no bioinformatic skill. In this study, we provide a web server (GFFview) for parsing the annotation information of eukaryotic genome and then generating statistical description of six indices for visualization. GFFview is very useful for investigating quality and difference of the de novo assembled transcriptome in RNA-seq studies.
Currently, the best scenario for earliest forms of life is based on RNA molecules as they have the proven ability to catalyze enzymatic reactions and harbor genetic information. Evolutionary principles valid today become apparent in such models already. Furthermore, many features of eukaryotic genome architecture might have their origins in an RNA or RNA/protein (RNP) world, including the onset of a further transition, when DNA replaced RNA as the genetic bookkeeper of the cell. Chromosome maintenance, splicing, and regulatory function via RNA may be deeply rooted in the RNA/RNP worlds. Mostly in eukaryotes, conversion from RNA to DNA is still ongoing, which greatly impacts the plasticity of extant genomes. Raw material for novel genes encoding protein or RNA, or parts of genes including regulatory elements that selection can act on, continues to enter the evolutionary lottery. PMID:25081515
Bonten Marc JM
Full Text Available Abstract Background The Gram-positive bacterium Enterococcus faecium is an important cause of nosocomial infections in immunocompromized patients. Results We present a pyrosequencing-based comparative genome analysis of seven E. faecium strains that were isolated from various sources. In the genomes of clinical isolates several antibiotic resistance genes were identified, including the vanA transposon that confers resistance to vancomycin in two strains. A functional comparison between E. faecium and the related opportunistic pathogen E. faecalis based on differences in the presence of protein families, revealed divergence in plant carbohydrate metabolic pathways and oxidative stress defense mechanisms. The E. faecium pan-genome was estimated to be essentially unlimited in size, indicating that E. faecium can efficiently acquire and incorporate exogenous DNA in its gene pool. One of the most prominent sources of genomic diversity consists of bacteriophages that have integrated in the genome. The CRISPR-Cas system, which contributes to immunity against bacteriophage infection in prokaryotes, is not present in the sequenced strains. Three sequenced isolates carry the esp gene, which is involved in urinary tract infections and biofilm formation. The esp gene is located on a large pathogenicity island (PAI, which is between 64 and 104 kb in size. Conjugation experiments showed that the entire esp PAI can be transferred horizontally and inserts in a site-specific manner. Conclusions Genes involved in environmental persistence, colonization and virulence can easily be aquired by E. faecium. This will make the development of successful treatment strategies targeted against this organism a challenge for years to come.
Blanc, Guillaume; Agarkova, Irina; Grimwood, Jane; Kuo, Alan; Brueggeman, Andrew; Dunigan, David D.; Gurnon, James; Ladunga, Istvan; Lindquist, Erika; Lucas, Susan; Pangilinan, Jasmyn; Proschold, Thomas; Salamov, Asaf; Schmutz, Jeremy; Weeks, Donald; Tamada, Takashi; Lomsadze, Alexandre; Borodovsky, Mark; Claverie, Jean-Michel; Grigoriev, Igor V.; Van Etten, James L.
Background Little is known about the mechanisms of adaptation of life to the extreme environmental conditions encountered in polar regions. Here we present the genome sequence of a unicellular green alga from the division chlorophyta, Coccomyxa subellipsoidea C-169, which we will hereafter refer to as C-169. This is the first eukaryotic microorganism from a polar environment to have its genome sequenced. Results The 48.8 Mb genome contained in 20 chromosomes exhibits significant synteny conservation with the chromosomes of its relatives Chlorella variabilis and Chlamydomonas reinhardtii. The order of the genes is highly reshuffled within synteny blocks, suggesting that intra-chromosomal rearrangements were more prevalent than inter-chromosomal rearrangements. Remarkably, Zepp retrotransposons occur in clusters of nested elements with strictly one cluster per chromosome probably residing at the centromere. Several protein families overrepresented in C. subellipsoidae include proteins involved in lipid metabolism, transporters, cellulose synthases and short alcohol dehydrogenases. Conversely, C-169 lacks proteins that exist in all other sequenced chlorophytes, including components of the glycosyl phosphatidyl inositol anchoring system, pyruvate phosphate dikinase and the photosystem 1 reaction center subunit N (PsaN). Conclusions We suggest that some of these gene losses and gains could have contributed to adaptation to low temperatures. Comparison of these genomic features with the adaptive strategies of psychrophilic microbes suggests that prokaryotes and eukaryotes followed comparable evolutionary routes to adapt to cold environments.
Polychronopoulos, Dimitris; Tsiagkas, Giannis; Athanasopoulou, Labrini; Sellis, Diamantis; Almirantis, Yannis
The late Professor J.S. Nicolis always emphasized, both in his writings and in presentations and discussions with students and friends, the relevance of a dynamical systems approach to biology. In particular, viewing the genome as a "biological text" captures the dynamical character of both the evolution and function of the organisms in the form of correlations indicating the presence of a long-range order. This genomic structure can be expressed in forms reminiscent of natural languages and several temporal and spatial traces l by the functioning of dynamical systems: Zipf laws, self-similarity and fractality. Here we review several works of our group and recent unpublished results, focusing on the chromosomal distribution of biologically active genomic components: Genes and protein-coding segments, CpG islands, transposable elements belonging to all major classes and several types of conserved non-coding genomic elements. We report the systematic appearance of power-laws in the size distribution of the distances between elements belonging to each of these types of functional genomic elements. Moreover, fractality is also found in several cases, using box-counting and entropic scaling.We present here, for the first time in a unified way, an aggregative model of the genomic dynamics which can explain the observed patterns on the grounds of known phenomena accompanying genome evolution. Our results comply with recent findings about a "fractal globule" geometry of chromatin in the eukaryotic nucleus.
Mikkelsen, Martin; Frank-Hansen, Rune; Hansen, Anders Johannes
RESULTS: of sequencing of whole mitochondrial genome, HV1 and HV2 DNA with the second generation system (SGS) Roche 454 GS Junior were compared with results of Sanger sequencing and SNP typing with SNaPshot single base extension detected with MALDI-TOF and capillary electrophoresis. We investigated...
Kevin J Liu
Full Text Available One outcome of interspecific hybridization and subsequent effects of evolutionary forces is introgression, which is the integration of genetic material from one species into the genome of an individual in another species. The evolution of several groups of eukaryotic species has involved hybridization, and cases of adaptation through introgression have been already established. In this work, we report on PhyloNet-HMM-a new comparative genomic framework for detecting introgression in genomes. PhyloNet-HMM combines phylogenetic networks with hidden Markov models (HMMs to simultaneously capture the (potentially reticulate evolutionary history of the genomes and dependencies within genomes. A novel aspect of our work is that it also accounts for incomplete lineage sorting and dependence across loci. Application of our model to variation data from chromosome 7 in the mouse (Mus musculus domesticus genome detected a recently reported adaptive introgression event involving the rodent poison resistance gene Vkorc1, in addition to other newly detected introgressed genomic regions. Based on our analysis, it is estimated that about 9% of all sites within chromosome 7 are of introgressive origin (these cover about 13 Mbp of chromosome 7, and over 300 genes. Further, our model detected no introgression in a negative control data set. We also found that our model accurately detected introgression and other evolutionary processes from synthetic data sets simulated under the coalescent model with recombination, isolation, and migration. Our work provides a powerful framework for systematic analysis of introgression while simultaneously accounting for dependence across sites, point mutations, recombination, and ancestral polymorphism.
Full Text Available Abstract Background Plant inducible immunity includes the accumulation of a set of defense proteins during infection called pathogenesis-related (PR proteins, which are grouped into families termed PR-1 to PR-17. The PR-5 family is composed of thaumatin-like proteins (TLPs, which are responsive to biotic and abiotic stress and are widely studied in plants. TLPs were also recently discovered in fungi and animals. In the poplar genome, TLPs are over-represented compared with annual species and their transcripts strongly accumulate during stress conditions. Results Our analysis of the poplar TLP family suggests that the expansion of this gene family was followed by diversification, as differences in expression patterns and predicted properties correlate with phylogeny. In particular, we identified a clade of poplar TLPs that cluster to a single 350 kb locus of chromosome I and that are up-regulated by poplar leaf rust infection. A wider phylogenetic analysis of eukaryote TLPs - including plant, animal and fungi sequences - shows that TLP gene content and diversity increased markedly during land plant evolution. Mapping the reported functions of characterized TLPs to the eukaryote phylogenetic tree showed that antifungal or glycan-lytic properties are widespread across eukaryote phylogeny, suggesting that these properties are shared by most TLPs and are likely associated with the presence of a conserved acidic cleft in their 3D structure. Also, we established an exhaustive catalog of TLPs with atypical architectures such as small-TLPs, TLP-kinases and small-TLP-kinases, which have potentially developed alternative functions (such as putative receptor kinases for pathogen sensing and signaling. Conclusion Our study, based on the most recent plant genome sequences, provides evidence for TLP gene family diversification during land plant evolution. We have shown that the diverse functions described for TLPs are not restricted to specific clades but seem
Full Text Available Abstract Background Plant leucine-rich repeat receptor-like kinases (LRR-RLKs are receptor kinases that contain LRRs in their extracellular domain. In the last 15 years, many research groups have demonstrated major roles played by LRR-RLKs in plants during almost all developmental processes throughout the life of the plant and in defense/resistance against a large range of pathogens. Recently, a breakthrough has been made in this field that challenges the dogma of the specificity of plant LRR-RLKs. Results We analyzed ~1000 complete genomes and show that LRR-RK genes have now been identified in 8 non-plant genomes. We performed an exhaustive phylogenetic analysis of all of these receptors, revealing that all of the LRR-containing receptor subfamilies form lineage-specific clades. Our results suggest that the association of LRRs with RKs appeared independently at least four times in eukaryotic evolutionary history. Moreover, the molecular evolutionary history of the LRR-RKs found in oomycetes is reminiscent of the pattern observed in plants: expansion with amplification/deletion and evolution of the domain organization leading to the functional diversification of members of the gene family. Finally, the expression data suggest that oomycete LRR-RKs may play a role in several stages of the oomycete life cycle. Conclusions In view of the key roles that LRR-RLKs play throughout the entire lifetime of plants and plant-environment interactions, the emergence and expansion of this type of receptor in several phyla along the evolution of eukaryotes, and particularly in oomycete genomes, questions their intrinsic functions in mimicry and/or in the coevolution of receptors between hosts and pathogens.
Abe, Takashi; Hamano, Yuta; Ikemura, Toshimichi
A strategy of evolutionary studies that can compare vast numbers of genome sequences is becoming increasingly important with the remarkable progress of high-throughput DNA sequencing methods. We previously established a sequence alignment-free clustering method "BLSOM" for di-, tri-, and tetranucleotide compositions in genome sequences, which can characterize sequence characteristics (genome signatures) of a wide range of species. In the present study, we generated BLSOMs for tetra- and pentanucleotide compositions in approximately one million sequence fragments derived from 101 eukaryotes, for which almost complete genome sequences were available. BLSOM recognized phylotype-specific characteristics (e.g., key combinations of oligonucleotide frequencies) in the genome sequences, permitting phylotype-specific clustering of the sequences without any information regarding the species. In our detailed examination of 12 Drosophila species, the correlation between their phylogenetic classification and the classification on the BLSOMs was observed to visualize oligonucleotides diagnostic for species-specific clustering.
Harb, Omar S; Roos, David S
Over the past 20 years, advances in high-throughput biological techniques and the availability of computational resources including fast Internet access have resulted in an explosion of large genome-scale data sets "big data." While such data are readily available for download and personal use and analysis from a variety of repositories, often such analysis requires access to seldom-available computational skills. As a result a number of databases have emerged to provide scientists with online tools enabling the interrogation of data without the need for sophisticated computational skills beyond basic knowledge of Internet browser utility. This chapter focuses on the Eukaryotic Pathogen Databases (EuPathDB: http://eupathdb.org) Bioinformatic Resource Center (BRC) and illustrates some of the available tools and methods.
Stuart Gary W
Full Text Available Abstract Background Eukaryotic whole genome sequences are accumulating at an impressive rate. Effective methods for comparing multiple whole eukaryotic genomes on a large scale are needed. Most attempted solutions involve the production of large scale alignments, and many of these require a high stringency pre-screen for putative orthologs in order to reduce the effective size of the dataset and provide a reasonably high but unknown fraction of correctly aligned homologous sites for comparison. As an alternative, highly efficient methods that do not require the pre-alignment of operationally defined orthologs are also being explored. Results A non-alignment method based on the Singular Value Decomposition (SVD was used to compare the predicted protein complement of nine whole eukaryotic genomes ranging from yeast to man. This analysis resulted in the simultaneous identification and definition of a large number of well conserved motifs and gene families, and produced a species tree supporting one of two conflicting hypotheses of metazoan relationships. Conclusions Our SVD-based analysis of the entire protein complement of nine whole eukaryotic genomes suggests that highly conserved motifs and gene families can be identified and effectively compared in a single coherent definition space for the easy extraction of gene and species trees. While this occurs without the explicit definition of orthologs or homologous sites, the analysis can provide a basis for these definitions.
Brawley, Susan H; Blouin, Nicolas A; Ficko-Blean, Elizabeth; Wheeler, Glen L; Lohr, Martin; Goodson, Holly V; Jenkins, Jerry W; Blaby-Haas, Crysten E; Helliwell, Katherine E; Chan, Cheong Xin; Marriage, Tara N; Bhattacharya, Debashish; Klein, Anita S; Badis, Yacine; Brodie, Juliet; Cao, Yuanyu; Collén, Jonas; Dittami, Simon M; Gachon, Claire M M; Green, Beverley R; Karpowicz, Steven J; Kim, Jay W; Kudahl, Ulrich Johan; Lin, Senjie; Michel, Gurvan; Mittag, Maria; Olson, Bradley J S C; Pangilinan, Jasmyn L; Peng, Yi; Qiu, Huan; Shu, Shengqiang; Singer, John T; Smith, Alison G; Sprecher, Brittany N; Wagner, Volker; Wang, Wenfei; Wang, Zhi-Yong; Yan, Juying; Yarish, Charles; Zäuner-Riek, Simone; Zhuang, Yunyun; Zou, Yong; Lindquist, Erika A; Grimwood, Jane; Barry, Kerrie W; Rokhsar, Daniel S; Schmutz, Jeremy; Stiller, John W; Grossman, Arthur R; Prochnik, Simon E
Porphyra umbilicalis (laver) belongs to an ancient group of red algae (Bangiophyceae), is harvested for human food, and thrives in the harsh conditions of the upper intertidal zone. Here we present the 87.7-Mbp haploid Porphyra genome (65.8% G + C content, 13,125 gene loci) and elucidate traits that inform our understanding of the biology of red algae as one of the few multicellular eukaryotic lineages. Novel features of the Porphyra genome shared by other red algae relate to the cytoskeleton, calcium signaling, the cell cycle, and stress-tolerance mechanisms including photoprotection. Cytoskeletal motor proteins in Porphyra are restricted to a small set of kinesins that appear to be the only universal cytoskeletal motors within the red algae. Dynein motors are absent, and most red algae, including Porphyra , lack myosin. This surprisingly minimal cytoskeleton offers a potential explanation for why red algal cells and multicellular structures are more limited in size than in most multicellular lineages. Additional discoveries further relating to the stress tolerance of bangiophytes include ancestral enzymes for sulfation of the hydrophilic galactan-rich cell wall, evidence for mannan synthesis that originated before the divergence of green and red algae, and a high capacity for nutrient uptake. Our analyses provide a comprehensive understanding of the red algae, which are both commercially important and have played a major role in the evolution of other algal groups through secondary endosymbioses.
Feschotte, Cédric; Keswani, Umeshkumar; Ranganathan, Nirmal; Guibotsy, Marcel L; Levine, David
Eukaryotic genomes contain large amount of repetitive DNA, most of which is derived from transposable elements (TEs). Progress has been made to develop computational tools for ab initio identification of repeat families, but there is an urgent need to develop tools to automate the annotation of TEs in genome sequences. Here we introduce REPCLASS, a tool that automates the classification of TE sequences. Using control repeat libraries, we show that the program can classify accurately virtually any known TE types. Combining REPCLASS to ab initio repeat finding in the genomes of Caenorhabditis elegans and Drosophila melanogaster allowed us to recover the contrasting TE landscape characteristic of these species. Unexpectedly, REPCLASS also uncovered several novel TE families in both genomes, augmenting the TE repertoire of these model species. When applied to the genomes of distant Caenorhabditis and Drosophila species, the approach revealed a remarkable conservation of TE composition profile within each genus, despite substantial interspecific covariations in genome size and in the number of TEs and TE families. Lastly, we applied REPCLASS to analyze 10 fungal genomes from a wide taxonomic range, most of which have not been analyzed for TE content previously. The results showed that TE diversity varies widely across the fungi "kingdom" and appears to positively correlate with genome size, in particular for DNA transposons. Together, these data validate REPCLASS as a powerful tool to explore the repetitive DNA landscapes of eukaryotes and to shed light onto the evolutionary forces shaping TE diversity and genome architecture.
Clasen, Frederick Johannes; Pierneef, Rian Ewald; Slippers, Bernard; Reva, Oleg
Genomic islands (GIs) are inserts of foreign DNA that have potentially arisen through horizontal gene transfer (HGT). There are evidences that GIs can contribute significantly to the evolution of prokaryotes. The acquisition of GIs through HGT in eukaryotes has, however, been largely unexplored. In this study, the previously developed GI prediction tool, SeqWord Gene Island Sniffer (SWGIS), is modified to predict GIs in eukaryotic chromosomes. Artificial simulations are used to estimate ratios of predicting false positive and false negative GIs by inserting GIs into different test chromosomes and performing the SWGIS v2.0 algorithm. Using SWGIS v2.0, GIs are then identified in 36 fungal, 22 protozoan and 8 invertebrate genomes. SWGIS v2.0 predicts GIs in large eukaryotic chromosomes based on the atypical nucleotide composition of these regions. Averages for predicting false negative and false positive GIs were 20.1% and 11.01% respectively. A total of 10,550 GIs were identified in 66 eukaryotic species with 5299 of these GIs coding for at least one functional protein. The EuGI web-resource, freely accessible at http://eugi.bi.up.ac.za , was developed that allows browsing the database created from identified GIs and genes within GIs through an interactive and visual interface. SWGIS v2.0 along with the EuGI database, which houses GIs identified in 66 different eukaryotic species, and the EuGI web-resource, provide the first comprehensive resource for studying HGT in eukaryotes.
Shima, Naoko; Pederson, Kayla D
DNA replication is a prerequisite for cell proliferation, yet it can be increasingly challenging for a eukaryotic cell to faithfully duplicate its genome as its size and complexity expands. Dormant origins now emerge as a key component for cells to successfully accomplish such a demanding but essential task. In this perspective, we will first provide an overview of the fundamental processes eukaryotic cells have developed to regulate origin licensing and firing. With a special focus on mammalian systems, we will then highlight the role of dormant origins in preventing replication-associated genome instability and their functional interplay with proteins involved in the DNA damage repair response for tumor suppression. Lastly, deficiencies in the origin licensing machinery will be discussed in relation to their influence on stem cell maintenance and human diseases. Copyright © 2017 Elsevier B.V. All rights reserved.
Møller, Henrik D.; Bojsen, Rasmus Kenneth; Tachibana, Chris
Extrachromosomal circular DNAs (eccDNAs) are common genetic elements in Saccharomyces cerevisiae and are reported in other eukaryotes as well. EccDNAs contribute to genetic variation among somatic cells in multicellular organisms and to evolution of unicellular eukaryotes. Sensitive methods for d...
Møller, Henrik D.; Bojsen, Rasmus Kenneth; Tachibana, Chris
Extrachromosomal circular DNAs (eccDNAs) are common genetic elements in Saccharomyces cerevisiae and are reported in other eukaryotes as well. EccDNAs contribute to genetic variation among somatic cells in multicellular organisms and to evolution of unicellular eukaryotes. Sensitive methods...
Full Text Available The highly modular nature of protein kinases generates diverse functional roles mediated by evolutionary events such as domain recombination, insertion and deletion of domains. Usually domain architecture of a kinase is related to the subfamily to which the kinase catalytic domain belongs. However outlier kinases with unusual domain architectures serve in the expansion of the functional space of the protein kinase family. For example, Src kinases are made-up of SH2 and SH3 domains in addition to the kinase catalytic domain. A kinase which lacks these two domains but retains sequence characteristics within the kinase catalytic domain is an outlier that is likely to have modes of regulation different from classical src kinases. This study defines two types of outlier kinases: hybrids and rogues depending on the nature of domain recombination. Hybrid kinases are those where the catalytic kinase domain belongs to a kinase subfamily but the domain architecture is typical of another kinase subfamily. Rogue kinases are those with kinase catalytic domain characteristic of a kinase subfamily but the domain architecture is typical of neither that subfamily nor any other kinase subfamily. This report provides a consolidated set of such hybrid and rogue kinases gleaned from six eukaryotic genomes-S.cerevisiae, D. melanogaster, C.elegans, M.musculus, T.rubripes and H.sapiens-and discusses their functions. The presence of such kinases necessitates a revisiting of the classification scheme of the protein kinase family using full length sequences apart from classical classification using solely the sequences of kinase catalytic domains. The study of these kinases provides a good insight in engineering signalling pathways for a desired output. Lastly, identification of hybrids and rogues in pathogenic protozoa such as P.falciparum sheds light on possible strategies in host-pathogen interactions.
Marsh, T. L.; Reich, C. I.; Whitelock, R. B.; Olsen, G. J.; Woese, C. R. (Principal Investigator)
The first step in transcription initiation in eukaryotes is mediated by the TATA-binding protein, a subunit of the transcription factor IID complex. We have cloned and sequenced the gene for a presumptive homolog of this eukaryotic protein from Thermococcus celer, a member of the Archaea (formerly archaebacteria). The protein encoded by the archaeal gene is a tandem repeat of a conserved domain, corresponding to the repeated domain in its eukaryotic counterparts. Molecular phylogenetic analyses of the two halves of the repeat are consistent with the duplication occurring before the divergence of the archael and eukaryotic domains. In conjunction with previous observations of similarity in RNA polymerase subunit composition and sequences and the finding of a transcription factor IIB-like sequence in Pyrococcus woesei (a relative of T. celer) it appears that major features of the eukaryotic transcription apparatus were well-established before the origin of eukaryotic cellular organization. The divergence between the two halves of the archael protein is less than that between the halves of the individual eukaryotic sequences, indicating that the average rate of sequence change in the archael protein has been less than in its eukaryotic counterparts. To the extent that this lower rate applies to the genome as a whole, a clearer picture of the early genes (and gene families) that gave rise to present-day genomes is more apt to emerge from the study of sequences from the Archaea than from the corresponding sequences from eukaryotes.
Full Text Available Although best known for its ability to cause severe pneumonia in people whose immune defenses are weakened, Legionella pneumophila and Legionella longbeachae are two species of a large genus of bacteria that are ubiquitous in nature, where they parasitize protozoa. Adaptation to the host environment and exploitation of host cell functions are critical for the success of these intracellular pathogens. The establishment and publication of the complete genome sequences of L. pneumophila and L. longbeachae isolates paved the way for major breakthroughs in understanding the biology of these organisms. In this review we present the knowledge gained from the analyses and comparison of the complete genome sequences of different L. pneumophila and L. longbeachae strains. Emphasis is given on putative virulence and Legionella life cycle related functions, such as the identification of an extended array of eukaryotic-like proteins, many of which have been shown to modulate host cell functions to the pathogen's advantage. Surprisingly, many of the eukaryotic domain proteins identified in L. pneumophila as well as many substrates of the Dot/Icm type IV secretion system essential for intracellular replication are different between these two species, although they cause the same disease. Finally, evolutionary aspects regarding the eukaryotic like proteins in Legionella are discussed.
Iacono, M.; Villa, L.; Fortini, D.
The whole-genome sequence of an epidemic, multidrug-resistant Acinetobacter baumannii strain (strain ACICU) belonging to the European clone II group and carrying the plasmid-mediated bla(OXA-58) carbapenem resistance gene was determined. The A. baumannii ACICU genome was compared with the genomes...
Pretorius, I S; Boeke, J D
Historians of the future may well describe 2018 as the year that the world's first functional synthetic eukaryotic genome became a reality. Without the benefit of hindsight, it might be hard to completely grasp the long-term significance of a breakthrough moment in the history of science like this. The role of synthetic biology in the imminent birth of a budding Saccharomyces cerevisiae yeast cell carrying 16 man-made chromosomes causes the world of science to teeter on the threshold of a future-defining scientific frontier. The genome-engineering tools and technologies currently being developed to produce the ultimate yeast genome will irreversibly connect the dots between our improved understanding of the fundamentals of a complex cell containing its DNA in a specialised nucleus and the application of bioengineered eukaryotes designed for advanced biomanufacturing of beneficial products. By joining up the dots between the findings and learnings from the international Synthetic Yeast Genome project (known as the Yeast 2.0 or Sc2.0 project) and concurrent advancements in biodesign tools and smart data-intensive technologies, a future world powered by a thriving bioeconomy seems realistic. This global project demonstrates how a collaborative network of dot connectors-driven by a tinkerer's indomitable curiosity to understand how things work inside a eukaryotic cell-are using cutting-edge biodesign concepts and synthetic biology tools to advance science and to positively frame human futures (i.e. improved quality of life) in a planetary context (i.e. a sustainable environment). Explorations such as this have a rich history of resulting in unexpected discoveries and unanticipated applications for the benefit of people and planet. However, we must learn from past explorations into controversial futuristic sciences and ensure that researchers at the forefront of an emerging science such as synthetic biology remain connected to all stakeholders' concerns about the
Lin, Yu; Hu, Fei; Tang, Jijun; Moret, Bernard M E
The rapid accumulation of whole-genome data has renewed interest in the study of the evolution of genomic architecture, under such events as rearrangements, duplications, losses. Comparative genomics, evolutionary biology, and cancer research all require tools to elucidate the mechanisms, history, and consequences of those evolutionary events, while phylogenetics could use whole-genome data to enhance its picture of the Tree of Life. Current approaches in the area of phylogenetic analysis are limited to very small collections of closely related genomes using low-resolution data (typically a few hundred syntenic blocks); moreover, these approaches typically do not include duplication and loss events. We describe a maximum likelihood (ML) approach for phylogenetic analysis that takes into account genome rearrangements as well as duplications, insertions, and losses. Our approach can handle high-resolution genomes (with 40,000 or more markers) and can use in the same analysis genomes with very different numbers of markers. Because our approach uses a standard ML reconstruction program (RAxML), it scales up to large trees. We present the results of extensive testing on both simulated and real data showing that our approach returns very accurate results very quickly. In particular, we analyze a dataset of 68 high-resolution eukaryotic genomes, with from 3,000 to 42,000 genes, from the eGOB database; the analysis, including bootstrapping, takes just 3 hours on a desktop system and returns a tree in agreement with all well supported branches, while also suggesting resolutions for some disputed placements.
Zhang, Yan-Qiong; Chen, Dong-Liang; Tian, Hai-Feng; Zhang, Bao-Hong; Wen, Jian-Fan
Using a combined computational program, we identified 50 potential microRNAs (miRNAs) in Giardia lamblia, one of the most primitive unicellular eukaryotes. These miRNAs are unique to G. lamblia and no homologues have been found in other organisms; miRNAs, currently known in other species, were not found in G. lamblia. This suggests that miRNA biogenesis and miRNA-mediated gene regulation pathway may evolve independently, especially in evolutionarily distant lineages. A majority (43) of the predicted miRNAs are located at one single locus; however, some miRNAs have two or more copies in the genome. Among the 58 miRNA genes, 28 are located in the intergenic regions whereas 30 are present in the anti-sense strands of the protein-coding sequences. Five predicted miRNAs are expressed in G. lamblia trophozoite cells evidenced by expressed sequence tags or RT-PCR. Thirty-seven identified miRNAs may target 50 protein-coding genes, including seven variant-specific surface proteins (VSPs). Our findings provide a clue that miRNA-mediated gene regulation may exist in the early stage of eukaryotic evolution, suggesting that it is an important regulation system ubiquitous in eukaryotes.
in honey bee colony collapse disorder. Science 318: 283–287. 39. Towner JS, Sealy TK, Khristova ML, Albarino CG, Conlan S, et al. (2008) Newly...utilize known, organism-specific proteins or genomic DNA signatures respectively. Hence, these assays lack the ability to detect novel natural variations...detection assays utilize known, organism-specific proteins or genomic DNA signatures respectively. Hence, these assays lack the ability to detect novel
Hu, Lin-Yong; Cui, Chen-Chen; Song, Yu-Jie; Wang, Xiang-Guo; Jin, Ya-Ping; Wang, Ai-Hua; Zhang, Yong
cDNA is widely used in gene function elucidation and/or transgenics research but often suitable tissues or cells from which to isolate mRNA for reverse transcription are unavailable. Here, an alternative method for cDNA cloning is described and tested by cloning the cDNA of human LALBA (human alpha-lactalbumin) from genomic DNA. First, genomic DNA containing all of the coding exons was cloned from human peripheral blood and inserted into a eukaryotic expression vector. Next, by delivering the plasmids into either 293T or fibroblast cells, surrogate cells were constructed. Finally, the total RNA was extracted from the surrogate cells and cDNA was obtained by RT-PCR. The human LALBA cDNA that was obtained was compared with the corresponding mRNA published in GenBank. The comparison showed that the two sequences were identical. The novel method for cDNA cloning from surrogate eukaryotic cells described here uses well-established techniques that are feasible and simple to use. We anticipate that this alternative method will have widespread applications.
Cutler Sean R
Full Text Available Abstract Background The carboxy termini of proteins are a frequent site of activity for a variety of biologically important functions, ranging from post-translational modification to protein targeting. Several short peptide motifs involved in protein sorting roles and dependent upon their proximity to the C-terminus for proper function have already been characterized. As a limited number of such motifs have been identified, the potential exists for genome-wide statistical analysis and comparative genomics to reveal novel peptide signatures functioning in a C-terminal dependent manner. We have applied a novel methodology to the prediction of C-terminal-anchored peptide motifs involving a simple z-statistic and several techniques for improving the signal-to-noise ratio. Results We examined the statistical over-representation of position-specific C-terminal tripeptides in 7 eukaryotic proteomes. Sequence randomization models and simple-sequence masking were applied to the successful reduction of background noise. Similarly, as C-terminal homology among members of large protein families may artificially inflate tripeptide counts in an irrelevant and obfuscating manner, gene-family clustering was performed prior to the analysis in order to assess tripeptide over-representation across protein families as opposed to across all proteins. Finally, comparative genomics was used to identify tripeptides significantly occurring in multiple species. This approach has been able to predict, to our knowledge, all C-terminally anchored targeting motifs present in the literature. These include the PTS1 peroxisomal targeting signal (SKL*, the ER-retention signal (K/HDEL*, the ER-retrieval signal for membrane bound proteins (KKxx*, the prenylation signal (CC* and the CaaX box prenylation motif. In addition to a high statistical over-representation of these known motifs, a collection of significant tripeptides with a high propensity for biological function exists
Núñez-Acuña, Gustavo; Aguilar-Espinoza, Andrea; Gallardo-Escárate, Cristian
Despite the great relevance of mitochondrial genome analysis in evolutionary studies, there is scarce information on how the transcripts associated with the mitogenome are expressed and their role in the genetic structuring of populations. This work reports the complete mitochondrial genome of the marine gastropod Concholepas concholepas, obtained by 454 pryosequencing, and an analysis of mitochondrial transcripts of two populations 1000 km apart along the Chilean coast. The mitochondrion of C. concholepas is 15,495 base pairs (bp) in size and contains the 37 subunits characteristic of metazoans, as well as a non-coding region of 330 bp. In silico analysis of mitochondrial gene variability showed significant differences among populations. In terms of levels of relative abundance of transcripts associated with mitochondrion in the two populations (assessed by qPCR), the genes associated with complexes III and IV of the mitochondrial genome had the highest levels of expression in the northern population while transcripts associated with the ATP synthase complex had the highest levels of expression in the southern population. Moreover, fifteen polymorphic SNPs were identified in silico between the mitogenomes of the two populations. Four of these markers implied different amino acid substitutions (non-synonymous SNPs). This work contributes novel information regarding the mitochondrial genome structure and mRNA expression levels of C. concholepas. Copyright © 2012 Elsevier Inc. All rights reserved.
Romano, Stefano; Fernàndez-Guerra, Antonio; Reen, F Jerry; Glöckner, Frank O; Crowley, Susan P; O'Sullivan, Orla; Cotter, Paul D; Adams, Claire; Dobson, Alan D W; O'Gara, Fergal
Strains of the Pseudovibrio genus have been detected worldwide, mainly as part of bacterial communities associated with marine invertebrates, particularly sponges. This recurrent association has been considered as an indication of a symbiotic relationship between these microbes and their host. Until recently, the availability of only two genomes, belonging to closely related strains, has limited the knowledge on the genomic and physiological features of the genus to a single phylogenetic lineage. Here we present 10 newly sequenced genomes of Pseudovibrio strains isolated from marine sponges from the west coast of Ireland, and including the other two publicly available genomes we performed an extensive comparative genomic analysis. Homogeneity was apparent in terms of both the orthologous genes and the metabolic features shared amongst the 12 strains. At the genomic level, a key physiological difference observed amongst the isolates was the presence only in strain P. axinellae AD2 of genes encoding proteins involved in assimilatory nitrate reduction, which was then proved experimentally. We then focused on studying those systems known to be involved in the interactions with eukaryotic and prokaryotic cells. This analysis revealed that the genus harbors a large diversity of toxin-like proteins, secretion systems and their potential effectors. Their distribution in the genus was not always consistent with the phylogenetic relationship of the strains. Finally, our analyses identified new genomic islands encoding potential toxin-immunity systems, previously unknown in the genus. Our analyses shed new light on the Pseudovibrio genus, indicating a large diversity of both metabolic features and systems for interacting with the host. The diversity in both distribution and abundance of these systems amongst the strains underlines how metabolically and phylogenetically similar bacteria may use different strategies to interact with the host and find a niche within its
Georgiev, G.P.; Kramerov, D.A.; Ryskov, A.P.; Skryabin, K.G.; Lukanidin, E.M.
In this paper is described the properties of a novel mouse mdg-like element, the A2 sequence, which is the most abundant repetitive sequence. We also characterized an ubiquitous B2 sequence that represents, after B1, the dominant family among the short interspersed repeats of the mouse genome. The existence of some putative transposition intermediates was shown for repeats of both A and B types of the mouse genome. These are closed circular DNA of the A type and small polyadenylated B + RNAs. The fundamental question that arises is whether these sequences are simply selfish DNA capable of transpositions or do they fulfill some useful biological functions within the genome. 66 references, 11 figures, 1 table
Duvick, Jon; Standage, Daniel S; Merchant, Nirav; Brendel, Volker P
Genome-wide annotation of gene structure requires the integration of numerous computational steps. Currently, annotation is arguably best accomplished through collaboration of bioinformatics and domain experts, with broad community involvement. However, such a collaborative approach is not scalable at today's pace of sequence generation. To address this problem, we developed the xGDBvm software, which uses an intuitive graphical user interface to access a number of common genome analysis and gene structure tools, preconfigured in a self-contained virtual machine image. Once their virtual machine instance is deployed through iPlant's Atmosphere cloud services, users access the xGDBvm workflow via a unified Web interface to manage inputs, set program parameters, configure links to high-performance computing (HPC) resources, view and manage output, apply analysis and editing tools, or access contextual help. The xGDBvm workflow will mask the genome, compute spliced alignments from transcript and/or protein inputs (locally or on a remote HPC cluster), predict gene structures and gene structure quality, and display output in a public or private genome browser complete with accessory tools. Problematic gene predictions are flagged and can be reannotated using the integrated yrGATE annotation tool. xGDBvm can also be configured to append or replace existing data or load precomputed data. Multiple genomes can be annotated and displayed, and outputs can be archived for sharing or backup. xGDBvm can be adapted to a variety of use cases including de novo genome annotation, reannotation, comparison of different annotations, and training or teaching. © 2016 American Society of Plant Biologists. All rights reserved.
Jonathan A Eisen
Full Text Available The ciliate Tetrahymena thermophila is a model organism for molecular and cellular biology. Like other ciliates, this species has separate germline and soma functions that are embodied by distinct nuclei within a single cell. The germline-like micronucleus (MIC has its genome held in reserve for sexual reproduction. The soma-like macronucleus (MAC, which possesses a genome processed from that of the MIC, is the center of gene expression and does not directly contribute DNA to sexual progeny. We report here the shotgun sequencing, assembly, and analysis of the MAC genome of T. thermophila, which is approximately 104 Mb in length and composed of approximately 225 chromosomes. Overall, the gene set is robust, with more than 27,000 predicted protein-coding genes, 15,000 of which have strong matches to genes in other organisms. The functional diversity encoded by these genes is substantial and reflects the complexity of processes required for a free-living, predatory, single-celled organism. This is highlighted by the abundance of lineage-specific duplications of genes with predicted roles in sensing and responding to environmental conditions (e.g., kinases, using diverse resources (e.g., proteases and transporters, and generating structural complexity (e.g., kinesins and dyneins. In contrast to the other lineages of alveolates (apicomplexans and dinoflagellates, no compelling evidence could be found for plastid-derived genes in the genome. UGA, the only T. thermophila stop codon, is used in some genes to encode selenocysteine, thus making this organism the first known with the potential to translate all 64 codons in nuclear genes into amino acids. We present genomic evidence supporting the hypothesis that the excision of DNA from the MIC to generate the MAC specifically targets foreign DNA as a form of genome self-defense. The combination of the genome sequence, the functional diversity encoded therein, and the presence of some pathways missing from
Muhire, Brejnev Muhizi; Golden, Michael; Murrell, Ben; Lefeuvre, Pierre; Lett, Jean-Michel; Gray, Alistair; Poon, Art Y F; Ngandu, Nobubelo Kwanele; Semegni, Yves; Tanov, Emil Pavlov; Monjane, Adérito Luis; Harkins, Gordon William; Varsani, Arvind; Shepherd, Dionne Natalie; Martin, Darren Patrick
Single-stranded DNA (ssDNA) viruses have genomes that are potentially capable of forming complex secondary structures through Watson-Crick base pairing between their constituent nucleotides. A few of the structural elements formed by such base pairings are, in fact, known to have important functions during the replication of many ssDNA viruses. Unknown, however, are (i) whether numerous additional ssDNA virus genomic structural elements predicted to exist by computational DNA folding methods actually exist and (ii) whether those structures that do exist have any biological relevance. We therefore computationally inferred lists of the most evolutionarily conserved structures within a diverse selection of animal- and plant-infecting ssDNA viruses drawn from the families Circoviridae, Anelloviridae, Parvoviridae, Nanoviridae, and Geminiviridae and analyzed these for evidence of natural selection favoring the maintenance of these structures. While we find evidence that is consistent with purifying selection being stronger at nucleotide sites that are predicted to be base paired than at sites predicted to be unpaired, we also find strong associations between sites that are predicted to pair with one another and site pairs that are apparently coevolving in a complementary fashion. Collectively, these results indicate that natural selection actively preserves much of the pervasive secondary structure that is evident within eukaryote-infecting ssDNA virus genomes and, therefore, that much of this structure is biologically functional. Lastly, we provide examples of various highly conserved but completely uncharacterized structural elements that likely have important functions within some of the ssDNA virus genomes analyzed here.
David S. Shin
Full Text Available As the third domain of life, archaea, like the eukarya and bacteria, must have robust DNA replication and repair complexes to ensure genome fidelity. Archaea moreover display a breadth of unique habitats and characteristics, and structural biologists increasingly appreciate these features. As archaea include extremophiles that can withstand diverse environmental stresses, they provide fundamental systems for understanding enzymes and pathways critical to genome integrity and stress responses. Such archaeal extremophiles provide critical data on the periodic table for life as well as on the biochemical, geochemical, and physical limitations to adaptive strategies allowing organisms to thrive under environmental stress relevant to determining the boundaries for life as we know it. Specifically, archaeal enzyme structures have informed the architecture and mechanisms of key DNA repair proteins and complexes. With added abilities to temperature-trap flexible complexes and reveal core domains of transient and dynamic complexes, these structures provide insights into mechanisms of maintaining genome integrity despite extreme environmental stress. The DNA damage response protein structures noted in this review therefore inform the basis for genome integrity in the face of environmental stress, with implications for all domains of life as well as for biomanufacturing, astrobiology, and medicine.
Christen M Klinger
Full Text Available Apicomplexa are obligate intracellular parasites that cause tremendous disease burden world-wide. They utilize a set of specialized secretory organelles in their invasive process that require delivery of components for their biogenesis and function, yet the precise mechanisms underpinning such processes remain unclear. One set of potentially important components is the multi-subunit tethering complexes (MTCs, factors increasingly implicated in all aspects of vesicle-target interactions. Prompted by the results of previous studies indicating a loss of membrane trafficking factors in Apicomplexa, we undertook a bioinformatic analysis of MTC conservation. Building on knowledge of the ancient presence of most MTC proteins, we demonstrate the near complete retention of MTCs in the newly available genomes for Guillardiatheta and Bigelowiellanatans. The latter is a key taxonomic sampling point as a basal sister taxa to the group including Apicomplexa. We also demonstrate an ancient origin of the CORVET complex subunits Vps8 and Vps3, as well as the TRAPPII subunit Tca17. Having established that the lineage leading to Apicomplexa did at one point possess the complete eukaryotic complement of MTC components, we undertook a deeper taxonomic investigation in twelve apicomplexan genomes. We observed excellent conservation of the VpsC core of the HOPS and CORVET complexes, as well as the core TRAPP subunits, but sparse conservation of TRAPPII, COG, Dsl1, and HOPS/CORVET-specific subunits. However, those subunits that we did identify appear to be expressed with similar patterns to the fully conserved MTC proteins, suggesting that they may function as minimal complexes or with analogous partners. Strikingly, we failed to identify any subunits of the exocyst complex in all twelve apicomplexan genomes, as well as the dinoflagellate Perkinsus marinus. Overall, we demonstrate reduction of MTCs in Apicomplexa and their ancestors, consistent with modification during
Full Text Available The candidate phylum Poribacteria is one of the most dominant and widespread members of the microbial communities residing within marine sponges. Cell compartmentalization had been postulated along with their discovery about a decade ago and their phylogenetic association to the Planctomycetes, Verrucomicrobia, Chlamydiae superphylum was proposed soon thereafter. In the present study we revised these features based on genomic data obtained from six poribacterial single cells. We propose that Poribacteria form a distinct monophyletic phylum contiguous to the PVC superphylum together with other candidate phyla. Our genomic analyses supported the possibility of cell compartmentalization in form of bacterial microcompartments. Further analyses of eukaryote-like protein domains stressed the importance of such proteins with features including tetratricopeptide repeats, leucin rich repeats as well as low density lipoproteins receptor repeats, the latter of which are reported here for the first time from a sponge symbiont. Finally, examining the most abundant protein domain family on poribacterial genomes revealed diverse phyH family proteins, some of which may be related to dissolved organic posphorus uptake.
Nowrousian, Minou; Stajich, Jason E; Chu, Meiling; Engh, Ines; Espagne, Eric; Halliday, Karen; Kamerewerd, Jens; Kempken, Frank; Knab, Birgit; Kuo, Hsiao-Che; Osiewacz, Heinz D; Pöggeler, Stefanie; Read, Nick D; Seiler, Stephan; Smith, Kristina M; Zickler, Denise; Kück, Ulrich; Freitag, Michael
Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30-90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in approximately 4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for
Full Text Available Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30-90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in approximately 4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data
Der Haupthistokompatibilitätskomplex ist durch seine biologische Funktion eine für die Diagnostik und Forschung äußerst wichtige Region im humanen Genom. Die Untersuchung von HLA-Genorten stellt ein wichtiges Instrument in der molekulargenetischen Praxis dar. Die Pyrosequencing-Technik ist gut geeignet, um kurze DNA-Abschnitte mit weitgehend bekannter Sequenz schnell und effizient zu untersuchen. Ziel dieser Arbeit war die Entwicklung von Pyrosequencing-basierten Methoden zur HLA-Typisierung....
Rybalko, S.; Larionov, S.; Poptsova, M.; Loskutov, A.
Large-scale dynamical properties of complete chromosome DNA sequences of eukaryotes are considered. Using the proposed deterministic models with intermittency and symbolic dynamics we describe a wide spectrum of large-scale patterns inherent in these sequences, such as segmental duplications, tandem repeats, and other complex sequence structures. It is shown that the recently discovered gene number balance on the strands is not of a random nature, and certain subsystems of a complete chromosome DNA sequence exhibit the properties of deterministic chaos.
Full Text Available The eukaryotic translation factor, eIF5A has been recently reported as a sequence-specific elongation factor that facilitates peptide bond formation at consecutive prolines in Saccharomyces cerevisiae, as its ortholog elongation factor P (EF-P does in bacteria. We have searched the genome databases of 35 representative organisms from six kingdoms of life for PPP (Pro-Pro-Pro and/or PPG (Pro-Pro-Gly-encoding genes whose expression is expected to depend on eIF5A. We have made detailed analyses of proteome data of 5 selected species, Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster, Mus musculus and Homo sapiens. The PPP and PPG motifs are low in the prokaryotic proteomes. However, their frequencies markedly increase with the biological complexity of eukaryotic organisms, and are higher in newly derived proteins than in those orthologous proteins commonly shared in all species. Ontology classifications of S. cerevisiae and human genes encoding the highest level of polyprolines reveal their strong association with several specific biological processes, including actin/cytoskeletal associated functions, RNA splicing/turnover, DNA binding/transcription and cell signaling. Previously reported phenotypic defects in actin polarity and mRNA decay of eIF5A mutant strains are consistent with the proposed role for eIF5A in the translation of the polyproline-containing proteins. Of all the amino acid tandem repeats (≥3 amino acids, only the proline repeat frequency correlates with functional complexity of the five organisms examined. Taken together, these findings suggest the importance of proline repeat-rich proteins and a potential role for eIF5A and its hypusine modification pathway in the course of eukaryotic evolution.
Davenport Russell J
Full Text Available Abstract Background In many environmental genomics applications a homologous region of DNA from a diverse sample is first amplified by PCR and then sequenced. The next generation sequencing technology, 454 pyrosequencing, has allowed much larger read numbers from PCR amplicons than ever before. This has revolutionised the study of microbial diversity as it is now possible to sequence a substantial fraction of the 16S rRNA genes in a community. However, there is a growing realisation that because of the large read numbers and the lack of consensus sequences it is vital to distinguish noise from true sequence diversity in this data. Otherwise this leads to inflated estimates of the number of types or operational taxonomic units (OTUs present. Three sources of error are important: sequencing error, PCR single base substitutions and PCR chimeras. We present AmpliconNoise, a development of the PyroNoise algorithm that is capable of separately removing 454 sequencing errors and PCR single base errors. We also introduce a novel chimera removal program, Perseus, that exploits the sequence abundances associated with pyrosequencing data. We use data sets where samples of known diversity have been amplified and sequenced to quantify the effect of each of the sources of error on OTU inflation and to validate these algorithms. Results AmpliconNoise outperforms alternative algorithms substantially reducing per base error rates for both the GS FLX and latest Titanium protocol. All three sources of error lead to inflation of diversity estimates. In particular, chimera formation has a hitherto unrealised importance which varies according to amplification protocol. We show that AmpliconNoise allows accurate estimates of OTU number. Just as importantly AmpliconNoise generates the right OTUs even at low sequence differences. We demonstrate that Perseus has very high sensitivity, able to find 99% of chimeras, which is critical when these are present at high
Santini, Sebastien; Jeudy, Sandra; Bartoli, Julia; Poirot, Olivier; Lescot, Magali; Abergel, Chantal; Barbe, Valérie; Wommack, K. Eric; Noordeloos, Anna A. M.; Brussaard, Corina P. D.; Claverie, Jean-Michel
Large dsDNA viruses are involved in the population control of many globally distributed species of eukaryotic phytoplankton and have a prominent role in bloom termination. The genus Phaeocystis (Haptophyta, Prymnesiophyceae) includes several high-biomass-forming phytoplankton species, such as Phaeocystis globosa, the blooms of which occur mostly in the coastal zone of the North Atlantic and the North Sea. Here, we report the 459,984-bp-long genome sequence of P. globosa virus strain PgV-16T, encoding 434 proteins and eight tRNAs and, thus, the largest fully sequenced genome to date among viruses infecting algae. Surprisingly, PgV-16T exhibits no phylogenetic affinity with other viruses infecting microalgae (e.g., phycodnaviruses), including those infecting Emiliania huxleyi, another ubiquitous bloom-forming haptophyte. Rather, PgV-16T belongs to an emerging clade (the Megaviridae) clustering the viruses endowed with the largest known genomes, including Megavirus, Mimivirus (both infecting acanthamoeba), and a virus infecting the marine microflagellate grazer Cafeteria roenbergensis. Seventy-five percent of the best matches of PgV-16T–predicted proteins correspond to two viruses [Organic Lake phycodnavirus (OLPV)1 and OLPV2] from a hypersaline lake in Antarctica (Organic Lake), the hosts of which are unknown. As for OLPVs and other Megaviridae, the PgV-16T sequence data revealed the presence of a virophage-like genome. However, no virophage particle was detected in infected P. globosa cultures. The presence of many genes found only in Megaviridae in its genome and the presence of an associated virophage strongly suggest that PgV-16T shares a common ancestry with the largest known dsDNA viruses, the host range of which already encompasses the earliest diverging branches of domain Eukarya. PMID:23754393
Genome-wide annotation of gene structure requires the integration of numerous computational steps. Currently, annotation is arguably best accomplished through collaboration of bioinformatics and domain experts, with broad community involvement. However, such a collaborative approach is not scalable at today’s pace of sequence generation. To address this problem, we developed the xGDBvm software, which uses an intuitive graphical user interface to access a number of common genome analysis and gene structure tools, preconfigured in a self-contained virtual machine image. Once their virtual machine instance is deployed through iPlant’s Atmosphere cloud services, users access the xGDBvm workflow via a unified Web interface to manage inputs, set program parameters, configure links to high-performance computing (HPC) resources, view and manage output, apply analysis and editing tools, or access contextual help. The xGDBvm workflow will mask the genome, compute spliced alignments from transcript and/or protein inputs (locally or on a remote HPC cluster), predict gene structures and gene structure quality, and display output in a public or private genome browser complete with accessory tools. Problematic gene predictions are flagged and can be reannotated using the integrated yrGATE annotation tool. xGDBvm can also be configured to append or replace existing data or load precomputed data. Multiple genomes can be annotated and displayed, and outputs can be archived for sharing or backup. xGDBvm can be adapted to a variety of use cases including de novo genome annotation, reannotation, comparison of different annotations, and training or teaching. PMID:27020957
Amlacher, Stefan; Sarges, Phillip; Flemming, Dirk
is composed of two large Nups, Nup192 and Nup170, which are flexibly bridged by short linear motifs made up of linker Nups, Nic96 and Nup53. This assembly illustrates how Nup interactions can generate structural plasticity within the NPC scaffold. Our findings therefore demonstrate the utility of the genome...
Full Text Available BACKGROUND: The complexity of phosphoinositide signaling in higher eukaryotes is partly due to expansion of specific families and types of phosphoinositide kinases (PIKs that can generate all phosphoinositides via multiple routes. This is particularly evident in the PI3Ks and PIPKs, and it is considered an evolutionary trait associated with metazoan diversification. Yet, there are limited comprehensive studies on the PIK repertoire of free living unicellular organisms. METHODOLOGY/PRINCIPAL FINDINGS: We undertook a genome-wide analysis of putative PIK genes in two free living ciliated cells, Tetrahymena and Paramecium. The Tetrahymena thermophila and Paramecium tetraurelia genomes were probed with representative kinases from all families and types. Putative homologs were verified by EST, microarray and deep RNA sequencing database searches and further characterized for domain structure, catalytic efficiency, expression patterns and phylogenetic relationships. In total, we identified and characterized 22 genes in the Tetrahymena thermophila genome and 62 highly homologues genes in Paramecium tetraurelia suggesting a tight evolutionary conservation in the ciliate lineage. Comparison to the kinome of fungi reveals a significant expansion of PIK genes in ciliates. CONCLUSIONS/SIGNIFICANCE: Our study highlights four important aspects concerning ciliate and other unicellular PIKs. First, ciliate-specific expansion of PI4KIII-like genes. Second, presence of class I PI3Ks which, at least in Tetrahymena, are associated with a metazoan-type machinery for PIP3 signaling. Third, expansion of divergent PIPK enzymes such as the recently described type IV transmembrane PIPKs. Fourth, presence of possible type II PIPKs and presumably inactive PIKs (hence, pseudo-PIKs not previously described. Taken together, our results provide a solid framework for future investigation of the roles of PIKs in ciliates and indicate that novel functions and novel regulatory
Alsmark, Cecilia; Foster, Peter G; Sicheritz-Pontén, Thomas
BACKGROUND: The influence of lateral gene transfer on gene origins and biology in eukaryotes is poorly understood compared with those of prokaryotes. A number of independent investigations focusing on specific genes, individual genomes, or specific functional categories from various eukaryotes have...... approach to systematically investigate lateral gene transfer affecting the proteomes of thirteen, mainly parasitic, microbial eukaryotes, representing four of the six eukaryotic super-groups. All of the genomes investigated have been significantly affected by prokaryote-to-eukaryote lateral gene transfers...... indicated that lateral gene transfer does indeed affect eukaryotic genomes. However, the lack of common methodology and criteria in these studies makes it difficult to assess the general importance and influence of lateral gene transfer on eukaryotic genome evolution. RESULTS: We used a phylogenomic...
Sverdlov Alexander V
Full Text Available Abstract Background The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. Results We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens, one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe, and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or ~54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of ~20% of the KOG set. This conserved portion of the
Till now, in biological sciences, the term, transcription, mainly refers to DNA to RNA transcription. But our recently published experimental findings obtained from Plasmodium falciparum strongly suggest the existence of DNA to DNA transcription in the genome of eukaryotic cells, which could shed some light on the functions of certain noncoding DNA in the human and other eukaryotic genomes.
Nedelcu, Aurora M
Programmed cell death (PCD) represents a significant component of normal growth and development in multicellular organisms. Recently, PCD-like processes have been reported in single-celled eukaryotes, implying that some components of the PCD machinery existed early in eukaryotic evolution. This study provides a comparative analysis of PCD-related sequences across more than 50 unicellular genera from four eukaryotic supergroups: Unikonts, Excavata, Chromalveolata, and Plantae. A complex set of PCD-related sequences that correspond to domains or proteins associated with all main functional classes--from ligands and receptors to executors of PCD--was found in many unicellular lineages. Several PCD domains and proteins previously thought to be restricted to animals or land plants are also present in unicellular species. Noteworthy, the yeast, Saccharomyces cerevisiae--used as an experimental model system for PCD research, has a rather reduced set of PCD-related sequences relative to other unicellular species. The phylogenetic distribution of the PCD-related sequences identified in unicellular lineages suggests that the genetic basis for the evolution of the complex PCD machinery present in extant multicellular lineages has been established early in the evolution of eukaryotes. The shaping of the PCD machinery in multicellular lineages involved the duplication, co-option, recruitment, and shuffling of domains already present in their unicellular ancestors.
Doolittle, W Ford
Systematics balances uneasily between realism and nominalism, uncommitted as to whether biological taxa are discoveries or inventions. If the former, they might be taken as natural kinds. I briefly review some philosophers' concepts of natural kinds and then argue that several of these apply well enough to "eukaryote." Although there are some sticky issues around genomic chimerism and when eukaryotes first appeared, if we allow for degrees in the naturalness of kinds, existing eukaryotes rank highly, higher than prokaryotes. Most biologists feel this intuitively: All I attempt to do here is provide some conceptual justification. Copyright © 2014 Cold Spring Harbor Laboratory Press; all rights reserved.
Balzer, Susanne; Malde, Ketil; Lanzén, Anders; Sharma, Animesh; Jonassen, Inge
The commercial launch of 454 pyrosequencing in 2005 was a milestone in genome sequencing in terms of performance and cost. Throughout the three available releases, average read lengths have increased to approximately 500 base pairs and are thus approaching read lengths obtained from traditional Sanger sequencing. Study design of sequencing projects would benefit from being able to simulate experiments. We explore 454 raw data to investigate its characteristics and derive empirical distributions for the flow values generated by pyrosequencing. Based on our findings, we implement Flowsim, a simulator that generates realistic pyrosequencing data files of arbitrary size from a given set of input DNA sequences. We finally use our simulator to examine the impact of sequence lengths on the results of concrete whole-genome assemblies, and we suggest its use in planning of sequencing projects, benchmarking of assembly methods and other fields. Flowsim is freely available under the General Public License from http://blog.malde.org/index.php/flowsim/.
He, Liming; Liu, Fang; Karuppiah, Valliappan; Ren, Yi; Li, Zhiyong
To date, the knowledge of eukaryotic communities associated with sponges remains limited compared with prokaryotic communities. In a manner similar to prokaryotes, it could be hypothesized that sponge holobionts have phylogenetically diverse eukaryotic symbionts, and the eukaryotic community structures in different sponge holobionts were probably different. In order to test this hypothesis, the communities of eukaryota associated with 11 species of South China Sea sponges were compared with the V4 region of 18S ribosomal ribonucleic acid gene using 454 pyrosequencing. Consequently, 135 and 721 unique operational taxonomic units (OTUs) of fungi and protists were obtained at 97 % sequence similarity, respectively. These sequences were assigned to 2 phyla of fungi (Ascomycota and Basidiomycota) and 9 phyla of protists including 5 algal phyla (Chlorophyta, Haptophyta, Streptophyta, Rhodophyta, and Stramenopiles) and 4 protozoal phyla (Alveolata, Cercozoa, Haplosporidia, and Radiolaria) including 47 orders (12 fungi, 35 protists). Entorrhizales of fungi and 18 orders of protists were detected in marine sponges for the first time. Particularly, Tilletiales of fungi and Chlorocystidales of protists were detected for the first time in marine habitats. Though Ascomycota, Alveolata, and Radiolaria were detected in all the 11 sponge species, sponge holobionts have different fungi and protistan communities according to OTU comparison and principal component analysis at the order level. This study provided the first insights into the fungal and protistan communities associated with different marine sponge holobionts using pyrosequencing, thus further extending the knowledge on sponge-associated eukaryotic diversity.
Full Text Available Historically, the members of the Agrobacterium genus have been considered the only bacterial species naturally able to transfer and integrate DNA into the genomes of their eukaryotic hosts. Yet, increasing evidence suggests that this ability to genetically transform eukaryotic host cells might be more widespread in the bacterial world. Indeed, analyses of accumulating genomic data reveal cases of horizontal gene transfer from bacteria to eukaryotes and suggest that it represents a significant force in adaptive evolution of eukaryotic species. Specifically, recent reports indicate that bacteria other than Agrobacterium, such as Bartonella henselae (a zoonotic pathogen, Rhizobium etli (a plant-symbiotic bacterium related to Agrobacterium, or even Escherichia coli, have the ability to genetically transform their host cells under laboratory conditions. This DNA transfer relies on type IV secretion systems (T4SSs, the molecular machines that transport macromolecules during conjugative plasmid transfer and also during transport of proteins and/or DNA to the eukaryotic recipient cells. In this review article, we explore the extent of possible transfer of genetic information from bacteria to eukaryotic cells as well as the evolutionary implications and potential applications of this transfer.
The yeast Saccharomyces cerevisiae is one of the champions of basic biomedical research due to its compact eukaryotic genome and ease of experimental manipulation. Despite these immense strengths, its impact on understanding the genetic basis of natural phenotypic variation has been limited by strai...
National Aeronautics and Space Administration — The whole-genome sequences of eight fungal strains that were selected for exposure to microgravity at the International Space Station are presented here. These...
We sequenced the genome of Theileria orientalis, a tick-borne apicomplexan protozoan parasite of cattle. The focus of this study was a comparative genome analysis of T. orientalis relative to other highly pathogenic Theileria species, T. parva and T. annulata. T. parva and T. annulata induce transformation of infected cells of lymphocyte or macrophage/monocyte lineages; in contrast, T. orientalis does not induce uncontrolled proliferation of infected leukocytes and multiplies predominantly within infected erythrocytes. While synteny across homologous chromosomes of the three Theileria species was found to be well conserved overall, subtelomeric structures were found to differ substantially, as T. orientalis lacks the large tandemly arrayed subtelomere-encoded variable secreted protein-encoding gene family. Moreover, expansion of particular gene families by gene duplication was found in the genomes of the two transforming Theileria species, most notably, the TashAT/TpHN and Tar/Tpr gene families. Gene families that are present only in T. parva and T. annulata and not in T. orientalis, Babesia bovis, or Plasmo-dium were also identified. Identification of differences between the genome sequences of Theileria species with different abilities to transform and immortalize bovine leukocytes will provide insight into proteins and mechanisms that have evolved to induce and regulate this process. The T. orientalis genome database is available at http://totdb.czc.hokudai.ac.jp/. 2012 Hayashida et al. T.
Cells need a constant supply of precursors to enable the production of macromolecules to sustain growth and survival. Unlike metazoans, unicellular eukaryotes depend exclusively on the extracellular medium for this supply. When environmental nutrients become depleted, existing cytoplasmic components
Acquisti, Claudia; Allegrini, Paolo E-mail: firstname.lastname@example.org; Bogani, Patrizia; Buiatti, Marcello; Catanese, Elena; Fronzoni, Leone; Grigolini, Paolo; Mersi, Giuseppe; Palatella, Luigi
We investigate on a possible way to connect the presence of low-complexity sequences (LCS) in DNA genomes and the non-stationary properties of base correlations. Under the hypothesis that these variations signal a change in the DNA function, we use a new technique, called non-stationarity entropic index (NSEI) method, and we prove that this technique is an efficient way to detect functional changes with respect to a random baseline. The remarkable aspect is that NSEI does not imply any training data or fitting parameter, the only arbitrarity being the choice of a marker in the sequence. We make this choice on the basis of biological information about LCS distributions in genomes. We show that there exists a correlation between changing the amount in LCS and the ratio of long- to short-range correlation.
Acquisti, Claudia; Allegrini, Paolo; Bogani, Patrizia; Buiatti, Marcello; Catanese, Elena; Fronzoni, Leone; Grigolini, Paolo; Mersi, Giuseppe; Palatella, Luigi
We investigate on a possible way to connect the presence of Low-Complexity Sequences (LCS) in DNA genomes and the nonstationary properties of base correlations. Under the hypothesis that these variations signal a change in the DNA function, we use a new technique, called Non-Stationarity Entropic Index (NSEI) method, and we prove that this technique is an efficient way to detect functional changes with respect to a random baseline. The remarkable aspect is that NSEI does not imply any training data or fitting parameter, the only arbitrarity being the choice of a marker in the sequence. We make this choice on the basis of biological information about LCS distributions in genomes. We show that there exists a correlation between changing the amount in LCS and the ratio of long- to short-range correlation.
Acquisti, Claudia; Allegrini, Paolo; Bogani, Patrizia; Buiatti, Marcello; Catanese, Elena; Fronzoni, Leone; Grigolini, Paolo; Mersi, Giuseppe; Palatella, Luigi
We investigate on a possible way to connect the presence of low-complexity sequences (LCS) in DNA genomes and the non-stationary properties of base correlations. Under the hypothesis that these variations signal a change in the DNA function, we use a new technique, called non-stationarity entropic index (NSEI) method, and we prove that this technique is an efficient way to detect functional changes with respect to a random baseline. The remarkable aspect is that NSEI does not imply any training data or fitting parameter, the only arbitrarity being the choice of a marker in the sequence. We make this choice on the basis of biological information about LCS distributions in genomes. We show that there exists a correlation between changing the amount in LCS and the ratio of long- to short-range correlation
Russo Luis MS
Full Text Available Abstract Background Over the past few years, new massively parallel DNA sequencing technologies have emerged. These platforms generate massive amounts of data per run, greatly reducing the cost of DNA sequencing. However, these techniques also raise important computational difficulties mostly due to the huge volume of data produced, but also because of some of their specific characteristics such as read length and sequencing errors. Among the most critical problems is that of efficiently and accurately mapping reads to a reference genome in the context of re-sequencing projects. Results We present an efficient method for the local alignment of pyrosequencing reads produced by the GS FLX (454 system against a reference sequence. Our approach explores the characteristics of the data in these re-sequencing applications and uses state of the art indexing techniques combined with a flexible seed-based approach, leading to a fast and accurate algorithm which needs very little user parameterization. An evaluation performed using real and simulated data shows that our proposed method outperforms a number of mainstream tools on the quantity and quality of successful alignments, as well as on the execution time. Conclusions The proposed methodology was implemented in a software tool called TAPyR--Tool for the Alignment of Pyrosequencing Reads--which is publicly available from http://www.tapyr.net.
Breton, Sophie; Stewart, Donald T
Mitochondrial DNA (mtDNA) is predominantly maternally inherited in eukaryotes. Diverse molecular mechanisms underlying the phenomenon of strict maternal inheritance (SMI) of mtDNA have been described, but the evolutionary forces responsible for its predominance in eukaryotes remain to be elucidated. Exceptions to SMI have been reported in diverse eukaryotic taxa, leading to the prediction that several distinct molecular mechanisms controlling mtDNA transmission are present among the eukaryotes. We propose that these mechanisms will be better understood by studying the deviations from the predominating pattern of SMI. This minireview summarizes studies on eukaryote species with unusual or rare mitochondrial inheritance patterns, i.e., other than the predominant SMI pattern, such as maternal inheritance of stable heteroplasmy, paternal leakage of mtDNA, biparental and strictly paternal inheritance, and doubly uniparental inheritance of mtDNA. The potential genes and mechanisms involved in controlling mitochondrial inheritance in these organisms are discussed. The linkage between mitochondrial inheritance and sex determination is also discussed, given that the atypical systems of mtDNA inheritance examined in this minireview are frequently found in organisms with uncommon sexual systems such as gynodioecy, monoecy, or andromonoecy. The potential of deviations from SMI for facilitating a better understanding of a number of fundamental questions in biology, such as the evolution of mtDNA inheritance, the coevolution of nuclear and mitochondrial genomes, and, perhaps, the role of mitochondria in sex determination, is considerable.
Goodsell, David S.
Diverse biological data may be used to create illustrations of molecules in their cellular context. This report describes the scientific results that support an illustration of a eukaryotic cell, enlarged by one million times to show the distribution and arrangement of macromolecules. The panoramic cross section includes eight panels that extend…
Bae, Albert; Westendorf, Christian; Erlenkamper, Christoph; Galland, Edouard; Franck, Carl; Bodenschatz, Eberhard; Beta, Carsten
Eukaryotic cell flattening is valuable for improving microscopic observations, ranging from bright field to total internal reflection fluorescence microscopy. In this talk, we will discuss traditional overlay techniques, and more modern, microfluidic based flattening, which provides a greater level of control. We demonstrate these techniques on the social amoebae Dictyostelium discoideum, comparing the advantages and disadvantages of each method.
Chung, Liliane; Bailey, Dalan; Leen, Eoin N; Emmott, Edward P; Chaudhry, Yasmin; Roberts, Lisa O; Curry, Stephen; Locker, Nicolas; Goodfellow, Ian G
Viruses have evolved a variety of mechanisms to usurp the host cell translation machinery to enable translation of the viral genome in the presence of high levels of cellular mRNAs. Noroviruses, a major cause of gastroenteritis in man, have evolved a mechanism that relies on the interaction of translation initiation factors with the virus-encoded VPg protein covalently linked to the 5' end of the viral RNA. To further characterize this novel mechanism of translation initiation, we have used proteomics to identify the components of the norovirus translation initiation factor complex. This approach revealed that VPg binds directly to the eIF4F complex, with a high affinity interaction occurring between VPg and eIF4G. Mutational analyses indicated that the C-terminal region of VPg is important for the VPg-eIF4G interaction; viruses with mutations that alter or disrupt this interaction are debilitated or non-viable. Our results shed new light on the unusual mechanisms of protein-directed translation initiation. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.
Sun, Wei; Zhang, Fengli; He, Liming; Li, Zhiyong
Diverse sessile organisms inhabit the coral reef ecosystems, including corals, sponges, and sea anemones. In the past decades, scleractinian corals (Cnidaria, Anthozoa, Scleractinia) and their associated microorganisms have attracted much attention. Zoanthids (Cnidaria, Anthozoa, Zoanthidea) are commonly found in coral reefs. However, little is known about the community structure of zoanthid-associated microbiota. In this study, the microbial community associated with the zoanthid Palythoa australiae in the South China Sea was investigated by 454 pyrosequencing. As a result, 2,353 bacterial, 583 archaeal, and 36 eukaryotic microbial ribotypes were detected, respectively. A total of 22 bacterial phyla (16 formally described phyla and six candidate phyla) were recovered. Proteobacteria was the most abundant group, followed by Chloroflexi and Actinobacteria. High-abundance Rhizobiales and diverse Chloroflexi were observed in the bacterial community. The archaeal population was composed of Crenarchaeota and Euryarchaeota, with Marine Group I as the dominant lineage. In particular, Candidatus Nitrosopumilus dominated the archaeal community. Besides bacteria and archaea, the zoanthid harbored eukaryotic microorganisms including fungi and algae though their diversity was very low. This study provided the first insights into the microbial community associated with P. australiae by 454 pyrosequencing, consequently laid a basis for the understanding of the association of P. australiae-microbes symbioses.
Atteia, Ariane; van Lis, Robert; Tielens, Aloysius G M; Martin, William F
Anaerobic metabolic pathways allow unicellular organisms to tolerate or colonize anoxic environments. Over the past ten years, genome sequencing projects have brought a new light on the extent of anaerobic metabolism in eukaryotes. A surprising development has been that free-living unicellular algae capable of photoautotrophic lifestyle are, in terms of their enzymatic repertoire, among the best equipped eukaryotes known when it comes to anaerobic energy metabolism. Some of these algae are marine organisms, common in the oceans, others are more typically soil inhabitants. All these species are important from the ecological (O(2)/CO(2) budget), biotechnological, and evolutionary perspectives. In the unicellular algae surveyed here, mixed-acid type fermentations are widespread while anaerobic respiration, which is more typical of eukaryotic heterotrophs, appears to be rare. The presence of a core anaerobic metabolism among the algae provides insights into its evolutionary origin, which traces to the eukaryote common ancestor. The predicted fermentative enzymes often exhibit an amino acid extension at the N-terminus, suggesting that these proteins might be compartmentalized in the cell, likely in the chloroplast or the mitochondrion. The green algae Chlamydomonas reinhardtii and Chlorella NC64 have the most extended set of fermentative enzymes reported so far. Among the eukaryotes with secondary plastids, the diatom Thalassiosira pseudonana has the most pronounced anaerobic capabilities as yet. From the standpoints of genomic, transcriptomic, and biochemical studies, anaerobic energy metabolism in C. reinhardtii remains the best characterized among photosynthetic protists. This article is part of a Special Issue entitled: The evolutionary aspects of bioenergetic systems. Copyright © 2012 Elsevier B.V. All rights reserved.
van Noort Vera
Full Text Available Abstract Background Proteomes of thermophilic prokaryotes have been instrumental in structural biology and successfully exploited in biotechnology, however many proteins required for eukaryotic cell function are absent from bacteria or archaea. With Chaetomium thermophilum, Thielavia terrestris and Thielavia heterothallica three genome sequences of thermophilic eukaryotes have been published. Results Studying the genomes and proteomes of these thermophilic fungi, we found common strategies of thermal adaptation across the different kingdoms of Life, including amino acid biases and a reduced genome size. A phylogenetics-guided comparison of thermophilic proteomes with those of other, mesophilic Sordariomycetes revealed consistent amino acid substitutions associated to thermophily that were also present in an independent lineage of thermophilic fungi. The most consistent pattern is the substitution of lysine by arginine, which we could find in almost all lineages but has not been extensively used in protein stability engineering. By exploiting mutational paths towards the thermophiles, we could predict particular amino acid residues in individual proteins that contribute to thermostability and validated some of them experimentally. By determining the three-dimensional structure of an exemplar protein from C. thermophilum (Arx1, we could also characterise the molecular consequences of some of these mutations. Conclusions The comparative analysis of these three genomes not only enhances our understanding of the evolution of thermophily, but also provides new ways to engineer protein stability.
Harder, Christoffer Bugge
carefully selected waterworks (Article IV), where the bacterial metabolic diversity and its important for water purification was described. Building on this, the most important part of the thesis consists of two pyrosequencing analyses of protozoa with newly developed 18S primers. One specifically targets...... Cercozoa, a particularly abundant phylum of protozoa (Article III), on heath land that had been subjected to prolonged artificially induced drought in a Danish free-air climate-manipulation experiment (CLIMAITE). Article III showed that the testate cercozoan forms responded negatively to prolonged drought...
Koonin, Eugene V.; Dolja, Valerian V.; Krupovic, Mart
Viruses and other selfish genetic elements are dominant entities in the biosphere, with respect to both physical abundance and genetic diversity. Various selfish elements parasitize on all cellular life forms. The relative abundances of different classes of viruses are dramatically different between prokaryotes and eukaryotes. In prokaryotes, the great majority of viruses possess double-stranded (ds) DNA genomes, with a substantial minority of single-stranded (ss) DNA viruses and only limited presence of RNA viruses. In contrast, in eukaryotes, RNA viruses account for the majority of the virome diversity although ssDNA and dsDNA viruses are common as well. Phylogenomic analysis yields tangible clues for the origins of major classes of eukaryotic viruses and in particular their likely roots in prokaryotes. Specifically, the ancestral genome of positive-strand RNA viruses of eukaryotes might have been assembled de novo from genes derived from prokaryotic retroelements and bacteria although a primordial origin of this class of viruses cannot be ruled out. Different groups of double-stranded RNA viruses derive either from dsRNA bacteriophages or from positive-strand RNA viruses. The eukaryotic ssDNA viruses apparently evolved via a fusion of genes from prokaryotic rolling circle-replicating plasmids and positive-strand RNA viruses. Different families of eukaryotic dsDNA viruses appear to have originated from specific groups of bacteriophages on at least two independent occasions. Polintons, the largest known eukaryotic transposons, predicted to also form virus particles, most likely, were the evolutionary intermediates between bacterial tectiviruses and several groups of eukaryotic dsDNA viruses including the proposed order “Megavirales” that unites diverse families of large and giant viruses. Strikingly, evolution of all classes of eukaryotic viruses appears to have involved fusion between structural and replicative gene modules derived from different sources
Koonin, Eugene V., E-mail: email@example.com [National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894 (United States); Dolja, Valerian V., E-mail: firstname.lastname@example.org [Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331 (United States); Krupovic, Mart, E-mail: email@example.com [Institut Pasteur, Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Department of Microbiology, Paris 75015 (France)
Viruses and other selfish genetic elements are dominant entities in the biosphere, with respect to both physical abundance and genetic diversity. Various selfish elements parasitize on all cellular life forms. The relative abundances of different classes of viruses are dramatically different between prokaryotes and eukaryotes. In prokaryotes, the great majority of viruses possess double-stranded (ds) DNA genomes, with a substantial minority of single-stranded (ss) DNA viruses and only limited presence of RNA viruses. In contrast, in eukaryotes, RNA viruses account for the majority of the virome diversity although ssDNA and dsDNA viruses are common as well. Phylogenomic analysis yields tangible clues for the origins of major classes of eukaryotic viruses and in particular their likely roots in prokaryotes. Specifically, the ancestral genome of positive-strand RNA viruses of eukaryotes might have been assembled de novo from genes derived from prokaryotic retroelements and bacteria although a primordial origin of this class of viruses cannot be ruled out. Different groups of double-stranded RNA viruses derive either from dsRNA bacteriophages or from positive-strand RNA viruses. The eukaryotic ssDNA viruses apparently evolved via a fusion of genes from prokaryotic rolling circle-replicating plasmids and positive-strand RNA viruses. Different families of eukaryotic dsDNA viruses appear to have originated from specific groups of bacteriophages on at least two independent occasions. Polintons, the largest known eukaryotic transposons, predicted to also form virus particles, most likely, were the evolutionary intermediates between bacterial tectiviruses and several groups of eukaryotic dsDNA viruses including the proposed order “Megavirales” that unites diverse families of large and giant viruses. Strikingly, evolution of all classes of eukaryotic viruses appears to have involved fusion between structural and replicative gene modules derived from different sources
Burgers, Peter M J; Kunkel, Thomas A
This review focuses on the biogenesis and composition of the eukaryotic DNA replication fork, with an emphasis on the enzymes that synthesize DNA and repair discontinuities on the lagging strand of the replication fork. Physical and genetic methodologies aimed at understanding these processes are discussed. The preponderance of evidence supports a model in which DNA polymerase ε (Pol ε) carries out the bulk of leading strand DNA synthesis at an undisturbed replication fork. DNA polymerases α and δ carry out the initiation of Okazaki fragment synthesis and its elongation and maturation, respectively. This review also discusses alternative proposals, including cellular processes during which alternative forks may be utilized, and new biochemical studies with purified proteins that are aimed at reconstituting leading and lagging strand DNA synthesis separately and as an integrated replication fork.
Liu, Yun; Song, Shuqun; Chen, Tiantian; Li, Caiwen
Pyrosequencing of the 18S rRNA gene has been widely adopted to study the eukaryotic diversity in various types of environments, and has an advantage over traditional morphology methods in exploring unknown microbial communities. To comprehensively assess the diversity and community composition of marine protists in the coastal waters of China, we applied both morphological observations and high-throughput sequencing of the V2 and V3 regions of 18S rDNA simultaneously to analyze samples collected from the surface layer of the Yellow and East China Seas. Dinoflagellates, diatoms and ciliates were the three dominant protistan groups as revealed by the two methods. Diatoms were the first dominant protistan group in the microscopic observations, with Skeletonema mainly distributed in the nearshore eutrophic waters and Chaetoceros in higher temperature and higher pH waters. The mixotrophic dinoflagellates, Gymnodinium and Gyrodinium, were more competitive in the oligotrophic waters. The pyrosequencing method revealed an extensive diversity of dinoflagellates. Chaetoceros was the only dominant diatom group in the pyrosequencing dataset. Gyrodinium represented the most abundant reads and dominated the offshore oligotrophic protistan community as they were in the microscopic observations. The dominance of parasitic dinoflagellates in the pyrosequencing dataset, which were overlooked in the morphological observations, indicates more attention should be paid to explore the potential role of this group. Both methods provide coherent clustering of samples. Nutrient levels, salinity and pH were the main factors influencing the distribution of protists. This study demonstrates that different primer pairs used in the pyrosequencing will indicate different protistan community structures. A suitable marker may reveal more comprehensive composition of protists and provide valuable information on environmental drivers.
The eukaryotic cytoskeleton evolved from prokaryotic cytomotive filaments. Prokaryotic filament systems show bewildering structural and dynamic complexity and, in many aspects, prefigure the self-organizing properties of the eukaryotic cytoskeleton. Here, the dynamic properties of the prokaryotic and eukaryotic cytoskeleton are compared, and how these relate to function and evolution of organellar networks is discussed. The evolution of new aspects of filament dynamics in eukaryotes, including severing and branching, and the advent of molecular motors converted the eukaryotic cytoskeleton into a self-organizing "active gel," the dynamics of which can only be described with computational models. Advances in modeling and comparative genomics hold promise of a better understanding of the evolution of the self-organizing cytoskeleton in early eukaryotes, and its role in the evolution of novel eukaryotic functions, such as amoeboid motility, mitosis, and ciliary swimming. Copyright © 2014 Cold Spring Harbor Laboratory Press; all rights reserved.
Zaher, Manal S.; Oke, Muse; Hamdan, Samir
The current model of the eukaryotic DNA replication fork includes three replicative DNA polymerases, polymerase α/primase complex (Pol α), polymerase δ (Pol δ), and polymerase ε (Pol ε). The primase synthesizes 8–12 nucleotide RNA primers that are extended by the DNA polymerization activity of Pol α into 30–35 nucleotide RNA-DNA primers. Replication factor C (RFC) opens the polymerase clamp-like processivity factor, proliferating cell nuclear antigen (PCNA), and loads it onto the primer-template. Pol δ utilizes PCNA to mediate highly processive DNA synthesis, while Pol ε has intrinsic high processivity that is modestly stimulated by PCNA. Pol ε replicates the leading strand and Pol δ replicates the lagging strand in a division of labor that is not strict. The three polymerases are comprised of multiple subunits and share unifying features in their large catalytic and B subunits. The remaining subunits are evolutionarily not related and perform diverse functions. The catalytic subunits are members of family B, which are distinguished by their larger sizes due to inserts in their N- and C-terminal regions. The sizes of these inserts vary among the three polymerases, and their functions remain largely unknown. Strikingly, the quaternary structures of Pol α, Pol δ, and Pol ε are arranged similarly. The catalytic subunits adopt a globular structure that is linked via its conserved C-terminal region to the B subunit. The remaining subunits are linked to the catalytic and B subunits in a highly flexible manner.
Zaher, Manal S.
The current model of the eukaryotic DNA replication fork includes three replicative DNA polymerases, polymerase α/primase complex (Pol α), polymerase δ (Pol δ), and polymerase ε (Pol ε). The primase synthesizes 8–12 nucleotide RNA primers that are extended by the DNA polymerization activity of Pol α into 30–35 nucleotide RNA-DNA primers. Replication factor C (RFC) opens the polymerase clamp-like processivity factor, proliferating cell nuclear antigen (PCNA), and loads it onto the primer-template. Pol δ utilizes PCNA to mediate highly processive DNA synthesis, while Pol ε has intrinsic high processivity that is modestly stimulated by PCNA. Pol ε replicates the leading strand and Pol δ replicates the lagging strand in a division of labor that is not strict. The three polymerases are comprised of multiple subunits and share unifying features in their large catalytic and B subunits. The remaining subunits are evolutionarily not related and perform diverse functions. The catalytic subunits are members of family B, which are distinguished by their larger sizes due to inserts in their N- and C-terminal regions. The sizes of these inserts vary among the three polymerases, and their functions remain largely unknown. Strikingly, the quaternary structures of Pol α, Pol δ, and Pol ε are arranged similarly. The catalytic subunits adopt a globular structure that is linked via its conserved C-terminal region to the B subunit. The remaining subunits are linked to the catalytic and B subunits in a highly flexible manner.
species but also from accidental contamination from the genome of eukaryotic host cells. The latter scenario generally occurs in the case of host-associated metagenomes, e.g. microbes living in human gut. In such cases, one needs to identify and remove contaminating host DNA sequences, since the latter sequences will ...
Yardimci, Hasan; Loveland, Anna B.; Habuchi, Satoshi; van Oijen, Antoine M.; Walter, Johannes C.
The duplication of eukaryotic genomes involves the replication of DNA from multiple origins of replication. In S phase, two sister replisomes assemble at each active origin, and they replicate DNA in opposite directions. Little is known about the functional relationship between sister replisomes.
Shcherbik, Svetlana V; Pearce, Nicholas C; Levine, Marnie L; Klimov, Alexander I; Villanueva, Julie M; Bousse, Tatiana L
Live attenuated influenza vaccine viruses (LAIVs) can be generated by classical reassortment of gene segments between a cold adapted, temperature sensitive and attenuated Master Donor Virus (MDV) and a seasonal wild-type (wt) virus. The vaccine candidates contain hemagglutinin (HA) and neuraminidase (NA) genes derived from the circulating wt viruses and the remaining six genes derived from the MDV strains. Rapid, efficient selection of the viruses with 6∶2 genome compositions from the large number of genetically different viruses generated during reassortment is essential for the biannual production schedule of vaccine viruses. This manuscript describes a new approach for the genotypic analysis of LAIV reassortant virus clones based on pyrosequencing. LAIV candidate viruses were created by classical reassortment of seasonal influenza A (H3N2) (A/Victoria/361/2011, A/Ohio/02/2012, A/Texas/50/2012) or influenza A (H7N9) (A/Anhui/1/2013) wt viruses with the MDV A/Leningrad/134/17/57(H2N2). Using strain-specific pyrosequencing assays, mixed gene variations were detected in the allantoic progenies during the cloning procedure. The pyrosequencing analysis also allowed for estimation of the relative abundance of segment variants in mixed populations. This semi-quantitative approach was used for selecting specific clones for the subsequent cloning procedures. The present study demonstrates that pyrosequencing analysis is a useful technique for rapid and reliable genotyping of reassortants and intermediate clones during the preparation of LAIV candidates, and can expedite the selection of vaccine virus candidates.
Svetlana V Shcherbik
Full Text Available BACKGROUND: Live attenuated influenza vaccine viruses (LAIVs can be generated by classical reassortment of gene segments between a cold adapted, temperature sensitive and attenuated Master Donor Virus (MDV and a seasonal wild-type (wt virus. The vaccine candidates contain hemagglutinin (HA and neuraminidase (NA genes derived from the circulating wt viruses and the remaining six genes derived from the MDV strains. Rapid, efficient selection of the viruses with 6∶2 genome compositions from the large number of genetically different viruses generated during reassortment is essential for the biannual production schedule of vaccine viruses. METHODOLOGY/PRINCIPAL FINDINGS: This manuscript describes a new approach for the genotypic analysis of LAIV reassortant virus clones based on pyrosequencing. LAIV candidate viruses were created by classical reassortment of seasonal influenza A (H3N2 (A/Victoria/361/2011, A/Ohio/02/2012, A/Texas/50/2012 or influenza A (H7N9 (A/Anhui/1/2013 wt viruses with the MDV A/Leningrad/134/17/57(H2N2. Using strain-specific pyrosequencing assays, mixed gene variations were detected in the allantoic progenies during the cloning procedure. The pyrosequencing analysis also allowed for estimation of the relative abundance of segment variants in mixed populations. This semi-quantitative approach was used for selecting specific clones for the subsequent cloning procedures. CONCLUSIONS/SIGNIFICANCE: The present study demonstrates that pyrosequencing analysis is a useful technique for rapid and reliable genotyping of reassortants and intermediate clones during the preparation of LAIV candidates, and can expedite the selection of vaccine virus candidates.
Walker, Giselle; Dorrell, Richard G; Schlacht, Alexander; Dacks, Joel B
Single-celled parasites like Entamoeba, Trypanosoma, Phytophthora and Plasmodium wreak untold havoc on human habitat and health. Understanding the position of the various protistan pathogens in the larger context of eukaryotic diversity informs our study of how these parasites operate on a cellular level, as well as how they have evolved. Here, we review the literature that has brought our understanding of eukaryotic relationships from an idea of parasites as primitive cells to a crystallized view of diversity that encompasses 6 major divisions, or supergroups, of eukaryotes. We provide an updated taxonomic scheme (for 2011), based on extensive genomic, ultrastructural and phylogenetic evidence, with three differing levels of taxonomic detail for ease of referencing and accessibility (see supplementary material at Cambridge Journals On-line). Two of the most pressing issues in cellular evolution, the root of the eukaryotic tree and the evolution of photosynthesis in complex algae, are also discussed along with ideas about what the new generation of genome sequencing technologies may contribute to the field of eukaryotic systematics. We hope that, armed with this user's guide, cell biologists and parasitologists will be encouraged about taking an increasingly evolutionary point of view in the battle against parasites representing real dangers to our livelihoods and lives.
Full Text Available Diversity is one of the most remarkable features of living organisms. Current assessments of eukaryote biodiversity reaches 1.5 million species, but the true figure could be several times that number. Diversity is ingrained in all stages and echelons of life, namely, the occupancy of ecological niches, behavioral patterns, body plans and organismal complexity, as well as metabolic needs and genetics. In this review, we will discuss that diversity also exists in a key biochemical process, translation, across eukaryotes. Translation is a fundamental process for all forms of life, and the basic components and mechanisms of translation in eukaryotes have been largely established upon the study of traditional, so-called model organisms. By using modern genome-wide, high-throughput technologies, recent studies of many nonmodel eukaryotes have unveiled a surprising diversity in the configuration of the translation apparatus across eukaryotes, showing that this apparatus is far from being evolutionarily static. For some of the components of this machinery, functional differences between different species have also been found. The recent research reviewed in this article highlights the molecular and functional diversification the translational machinery has undergone during eukaryotic evolution. A better understanding of all aspects of organismal diversity is key to a more profound knowledge of life.
Full Text Available Abstract Background Calpains are Ca2+-dependent cysteine proteases that participate in a range of crucial cellular processes. Dysfunction of these enzymes may cause, for instance, life-threatening diseases in humans, the loss of sex determination in nematodes and embryo lethality in plants. Although the calpain family is well characterized in animal and plant model organisms, there is a great lack of knowledge about these genes in unicellular eukaryote species (i.e. protists. Here, we study the distribution and evolution of calpain genes in a wide range of eukaryote genomes from major branches in the tree of life. Results Our investigations reveal 24 types of protein domains that are combined with the calpain-specific catalytic domain CysPc. In total we identify 41 different calpain domain architectures, 28 of these domain combinations have not been previously described. Based on our phylogenetic inferences, we propose that at least four calpain variants were established in the early evolution of eukaryotes, most likely before the radiation of all the major supergroups of eukaryotes. Many domains associated with eukaryotic calpain genes can be found among eubacteria or archaebacteria but never in combination with the CysPc domain. Conclusions The analyses presented here show that ancient modules present in prokaryotes, and a few de novo eukaryote domains, have been assembled into many novel domain combinations along the evolutionary history of eukaryotes. Some of the new calpain genes show a narrow distribution in a few branches in the tree of life, likely representing lineage-specific innovations. Hence, the functionally important classical calpain genes found among humans and vertebrates make up only a tiny fraction of the calpain family. In fact, a massive expansion of the calpain family occurred by domain shuffling among unicellular eukaryotes and contributed to a wealth of functionally different genes.
Zhao, Sen; Liang, Zhe; Demko, Viktor; Wilson, Robert; Johansen, Wenche; Olsen, Odd-Arne; Shalchian-Tabrizi, Kamran
Calpains are Ca2+-dependent cysteine proteases that participate in a range of crucial cellular processes. Dysfunction of these enzymes may cause, for instance, life-threatening diseases in humans, the loss of sex determination in nematodes and embryo lethality in plants. Although the calpain family is well characterized in animal and plant model organisms, there is a great lack of knowledge about these genes in unicellular eukaryote species (i.e. protists). Here, we study the distribution and evolution of calpain genes in a wide range of eukaryote genomes from major branches in the tree of life. Our investigations reveal 24 types of protein domains that are combined with the calpain-specific catalytic domain CysPc. In total we identify 41 different calpain domain architectures, 28 of these domain combinations have not been previously described. Based on our phylogenetic inferences, we propose that at least four calpain variants were established in the early evolution of eukaryotes, most likely before the radiation of all the major supergroups of eukaryotes. Many domains associated with eukaryotic calpain genes can be found among eubacteria or archaebacteria but never in combination with the CysPc domain. The analyses presented here show that ancient modules present in prokaryotes, and a few de novo eukaryote domains, have been assembled into many novel domain combinations along the evolutionary history of eukaryotes. Some of the new calpain genes show a narrow distribution in a few branches in the tree of life, likely representing lineage-specific innovations. Hence, the functionally important classical calpain genes found among humans and vertebrates make up only a tiny fraction of the calpain family. In fact, a massive expansion of the calpain family occurred by domain shuffling among unicellular eukaryotes and contributed to a wealth of functionally different genes.
Moriyama, Takashi; Sato, Naoki
Plastids and mitochondria possess their own genomes. Although the replication mechanisms of these organellar genomes remain unclear in photosynthetic eukaryotes, several organelle-localized enzymes related to genome replication, including DNA polymerase, DNA primase, DNA helicase, DNA topoisomerase, single-stranded DNA maintenance protein, DNA ligase, primer removal enzyme, and several DNA recombination-related enzymes, have been identified. In the reference Eudicot plant Arabidopsis thaliana, the replication-related enzymes of plastids and mitochondria are similar because many of them are dual targeted to both organelles, whereas in the red alga Cyanidioschyzon merolae, plastids and mitochondria contain different replication machinery components. The enzymes involved in organellar genome replication in green plants and red algae were derived from different origins, including proteobacterial, cyanobacterial, and eukaryotic lineages. In the present review, we summarize the available data for enzymes related to organellar genome replication in green plants and red algae. In addition, based on the type and distribution of replication enzymes in photosynthetic eukaryotes, we discuss the transitional history of replication enzymes in the organelles of plants.
Full Text Available BACKGROUND: Massive phytoplankton blooms, like the recurrent Phaeocystis proliferation observed every year in the Eastern English Channel (EEC, have a significant influence on the overall planktonic community structure and their food web dynamics. As well as being an important area for local fisheries, the EEC is an ideal ecosystem for work on microbial diversity. This is because, although its environmental context is relatively complex, it is reasonably well understood due to several years of monitoring and morphological observations of its planktonic organisms. The objective of our study was to better understand the under-explored microbial eukaryotic diversity relative to the Phaeocystis bloom. METHODOLOGY AND PRINCIPAL FINDINGS: The community structure of microplankton (diatoms, haptophytes, ciliates and dinoflagellates was studied through morphological observations and tag pyrosequencing. During the annual Phaeocystis spring bloom, the phytoplankton biomass increased by 34-fold, while the microzooplankton biomass showed a 4-fold increase, representing on average about 4.6% of the biomass of their phytoplankton prey. Tag pyrosequencing unveiled an extensive diversity of Gymnodiniaceae, with G. spirale and G. fusiformis representing the most abundant reads. An extended diversity of Phaeocystales, with partial 18S rDNA genes sequence identity as low as 85% was found, with taxa corresponding to P. globosa, but also to unknown Phaeocystaceae. CONCLUSIONS: Morphological analyses and pyrosequencing were generally in accordance with capturing frequency shifts of abundant taxa. Tag pyrosequencing allowed highlighting the maintenance of microplankton diversity during the Phaeocystis bloom and the increase of the taxa presenting low number of reads (minor taxa along with the dominant ones in response to biotic and/or abiotic changing conditions. Although molecular approaches have enhanced our perception on diversity, it has come to light that the
Pauchet, Y; Wilkinson, P; Vogel, H; Nelson, D R; Reynolds, S E; Heckel, D G; ffrench-Constant, R H
The tobacco hornworm Manduca sexta is an important model for insect physiology but genomic and transcriptomic data are currently lacking. Following a recent pyrosequencing study generating immune related expressed sequence tags (ESTs), here we use this new technology to define the M. sexta larval midgut transcriptome. We generated over 387,000 midgut ESTs, using a combination of Sanger and 454 sequencing, and classified predicted proteins into those involved in digestion, detoxification and immunity. In many cases the depth of 454 pyrosequencing coverage allowed us to define the entire cDNA sequence of a particular gene. Many new M. sexta genes are described including up to 36 new cytochrome P450s, some of which have been implicated in the metabolism of host plant-derived nicotine. New lepidopteran gene families such as the beta-fructofuranosidases, previously thought to be restricted to Bombyx mori, are also described. An unexpectedly high number of ESTs were involved in immunity, for example 39 contigs encoding serpins, and the increasingly appreciated role of the midgut in insect immunity is discussed. Similar studies of other tissues will allow for a tissue by tissue description of the M. sexta transcriptome and will form an essential complimentary step on the road to genome sequencing and annotation.
Okamura, Masumi; Inose, Haruko; Masuda, Seiji
In eukaryotic cells, RNAs are transcribed in the nucleus and exported to the cytoplasm through the nuclear pore complex. The RNA molecules that are exported from the nucleus into the cytoplasm include messenger RNAs (mRNAs), ribosomal RNAs (rRNAs), transfer RNAs (tRNAs), small nuclear RNAs (snRNAs), micro RNAs (miRNAs), and viral mRNAs. Each RNA is transported by a specific nuclear export receptor. It is believed that most of the mRNAs are exported by Nxf1 (Mex67 in yeast), whereas rRNAs, snRNAs, and a certain subset of mRNAs are exported in a Crm1/Xpo1-dependent manner. tRNAs and miRNAs are exported by Xpot and Xpo5. However, multiple export receptors are involved in the export of some RNAs, such as 60S ribosomal subunit. In addition to these export receptors, some adapter proteins are required to export RNAs. The RNA export system of eukaryotic cells is also used by several types of RNA virus that depend on the machineries of the host cell in the nucleus for replication of their genome, therefore this review describes the RNA export system of two representative viruses. We also discuss the NPC anchoring-dependent mRNA export factors that directly recruit specific genes to the NPC.
Williams, Tom A.; Embley, T. Martin
Current hypotheses about the history of cellular life are mainly based on analyses of cultivated organisms, but these represent only a small fraction of extant biodiversity. The sequencing of new environmental lineages therefore provides an opportunity to test, revise, or reject existing ideas about the tree of life and the origin of eukaryotes. According to the textbook three domains hypothesis, the eukaryotes emerge as the sister group to a monophyletic Archaea. However, recent analyses incorporating better phylogenetic models and an improved sampling of the archaeal domain have generally supported the competing eocyte hypothesis, in which core genes of eukaryotic cells originated from within the Archaea, with important implications for eukaryogenesis. Given this trend, it was surprising that a recent analysis incorporating new genomes from uncultivated Archaea recovered a strongly supported three domains tree. Here, we show that this result was due in part to the use of a poorly fitting phylogenetic model and also to the inclusion by an automated pipeline of genes of putative bacterial origin rather than nucleocytosolic versions for some of the eukaryotes analyzed. When these issues were resolved, analyses including the new archaeal lineages placed core eukaryotic genes within the Archaea. These results are consistent with a number of recent studies in which improved archaeal sampling and better phylogenetic models agree in supporting the eocyte tree over the three domains hypothesis. PMID:24532674
Brown, T. A. (Terence A.)
... of genome expression and replication processes, and transcriptomics and proteomics. This text is richly illustrated with clear, easy-to-follow, full color diagrams, which are downloadable from the book's website...
Dieci, Giorgio; Preti, Milena; Montanini, Barbara
Small nucleolar RNAs (snoRNAs) are one of the most ancient and numerous families of non-protein-coding RNAs (ncRNAs). The main function of snoRNAs - to guide site-specific rRNA modification - is the same in Archaea and all eukaryotic lineages. In contrast, as revealed by recent genomic and RNomic studies, their genomic organization and expression strategies are the most varied. Seemingly snoRNA coding units have adopted, in the course of evolution, all the possible ways of being transcribed, thus providing a unique paradigm of gene expression flexibility. By focusing on representative fungal, plant and animal genomes, we review here all the documented types of snoRNA gene organization and expression, and we provide a comprehensive account of snoRNA expressional freedom by precisely estimating the frequency, in each genome, of each type of genomic organization. We finally discuss the relevance of snoRNA genomic studies for our general understanding of ncRNA family evolution and expression in eukaryotes.
Lacroix, Benoît; Citovsky, Vitaly
Besides the massive gene transfer from organelles to the nuclear genomes, which occurred during the early evolution of eukaryote lineages, the importance of horizontal gene transfer (HGT) in eukaryotes remains controversial. Yet, increasing amounts of genomic data reveal many cases of bacterium-to-eukaryote HGT that likely represent a significant force in adaptive evolution of eukaryotic species. However, DNA transfer involved in genetic transformation of plants by Agrobacterium species has traditionally been considered as the unique example of natural DNA transfer and integration into eukaryotic genomes. Recent discoveries indicate that the repertoire of donor bacterial species and of recipient eukaryotic hosts potentially are much wider than previously thought, including donor bacterial species, such as plant symbiotic nitrogen-fixing bacteria (e.g., Rhizobium etli) and animal bacterial pathogens (e.g., Bartonella henselae, Helicobacter pylori), and recipient species from virtually all eukaryotic clades. Here, we review the molecular pathways and potential mechanisms of these trans-kingdom HGT events and discuss their utilization in biotechnology and research.
Ros, V.I.D.; Hurst, G.D.D.
The expansion of genome sequencing projects has produced accumulating evidence for lateral transfer of genes between prokaryotic and eukaryotic genomes. However, it remains controversial whether these genes are of functional importance in their recipient host. Nikoh and Nakabachi, in a recent paper
James, J.F.; Lammel, C.J.; Draper, D.L.; Brown, D.A.; Sweet, R.L.; Brooks, G.F.
The attachment of Neisseria gonorrhoeae to eukaryotic cells grown in tissue culture was analyzed by use of light and electron microscopy and by labeling of the bacteria with [ 3 H]- and [ 14 C]adenine. Isogenic piliated and nonpiliated N. gonorrhoeae from opaque and transparent colonies were studied. The results of light microscopy studies showed that the gonococci attached to cells of human origin, including Flow 2000, HeLa 229, and HEp 2. Studies using radiolabeled gonococci gave comparable results. Piliated N. gonorrhoeae usually attached in larger numbers than nonpiliated organisms, and those from opaque colonies attached more often than isogenic variants from transparent colonies. Day-to-day variation in rate of attachment was observed. Scanning electron microscopy studies showed the gonococcal attachment to be specific for microvilli of the host cells. It is concluded that more N. gonorrhoeae from opaque colonies, as compared with isogenic variants from transparent colonies, attach to eukaryotic cells grown in tissue culture
Dey, Gautam; Thattai, Mukund; Baum, Buzz
If eukaryotes arose through a merger between archaea and bacteria, what did the first true eukaryotic cell look like? A major step toward an answer came with the discovery of Lokiarchaeum, an archaeon whose genome encodes small GTPases related to those used by eukaryotes to regulate membrane traffic. Although 'Loki' cells have yet to be seen, their existence has prompted the suggestion that the archaeal ancestor of eukaryotes engulfed the future mitochondrion by phagocytosis. We propose instead that the archaeal ancestor was a relatively simple cell, and that eukaryotic cellular organization arose as the result of a gradual transfer of bacterial genes and membranes driven by an ever-closer symbiotic partnership between a bacterium and an archaeon. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Mark, Kristiina; Cornejo, Carolina; Keller, Christine; Flück, Daniela; Scheidegger, Christoph
Although lichens (lichen-forming fungi) play an important role in the ecological integrity of many vulnerable landscapes, only a minority of lichen-forming fungi have been barcoded out of the currently accepted ∼18 000 species. Regular Sanger sequencing can be problematic when analyzing lichens since saprophytic, endophytic, and parasitic fungi live intimately admixed, resulting in low-quality sequencing reads. Here, high-throughput, long-read 454 pyrosequencing in a GS FLX+ System was tested to barcode the fungal partner of 100 epiphytic lichen species from Switzerland using fungal-specific primers when amplifying the full internal transcribed spacer region (ITS). The present study shows the potential of DNA barcoding using pyrosequencing, in that the expected lichen fungus was successfully sequenced for all samples except one. Alignment solutions such as BLAST were found to be largely adequate for the generated long reads. In addition, the NCBI nucleotide database-currently the most complete database for lichen-forming fungi-can be used as a reference database when identifying common species, since the majority of analyzed lichens were identified correctly to the species or at least to the genus level. However, several issues were encountered, including a high sequencing error rate, multiple ITS versions in a genome (incomplete concerted evolution), and in some samples the presence of mixed lichen-forming fungi (possible lichen chimeras).
Patrícia M. Silva
Full Text Available Over the last years, antimicrobial peptides (AMPs have been the focus of intense research towards the finding of a viable alternative to current antifungal drugs. Defensins are one of the major families of AMPs and the most represented among all eukaryotic groups, providing an important first line of host defense against pathogenic microorganisms. Several of these cysteine-stabilized peptides present a relevant effect against fungi. Defensins are the AMPs with the broader distribution across all eukaryotic kingdoms, namely, Fungi, Plantæ and Animalia, and were recently shown to have an ancestor in a bacterial organism. As a part of the host defense, defensins act as an important vehicle of information between innate and adaptive immune system and have a role in immunomodulation. This multidimensionality represents a powerful host shield, hard for microorganisms to overcome using single approach resistance strategies. Pathogenic fungi resistance to conventional antimycotic drugs is becoming a major problem. Defensins, as other AMPs, have shown to be an effective alternative to the current antimycotic therapies, demonstrating potential as novel therapeutic agents or drug leads. In this review, we summarize the current knowledge on some eukaryotic defensins with antifungal action. An overview of the main targets in the fungal cell and the mechanism of action of these AMPs (namely, the selectivity for some fungal membrane components are presented. Additionally, recent works on antifungal defensins structure, activity and citotoxicity are also reviewed.
Full Text Available The deep marine subsurface is a vast habitat for microbial life where cells may live on geologic timescales. Because DNA in sediments may be preserved on long timescales, ribosomal RNA (rRNA is suggested to be a proxy for the active fraction of a microbial community in the subsurface. During an investigation of eukaryotic 18S rRNA by amplicon pyrosequencing, unique profiles of Fungi were found across a range of marine subsurface provinces including ridge flanks, continental margins, and abyssal plains. Subseafloor fungal populations exhibit statistically significant correlations with total organic carbon (TOC, nitrate, sulfide, and dissolved inorganic carbon (DIC. These correlations are supported by terminal restriction length polymorphism (TRFLP analyses of fungal rRNA. Geochemical correlations with fungal pyrosequencing and TRFLP data from this geographically broad sample set suggests environmental selection of active Fungi in the marine subsurface. Within the same dataset, ancient rRNA signatures were recovered from plants and diatoms in marine sediments ranging from 0.03 to 2.7 million years old, suggesting that rRNA from some eukaryotic taxa may be much more stable than previously considered in the marine subsurface.
Møller, Annette; Asp, Torben; Holm, Preben Bach
prokaryotic genome. Based on a protein alignment we could group the P5 ATPases into two subfamilies, P5A and P5B that, based on the number of negative charges in conserved trans-membrane segment 4, are likely to have different ion specificities. P5A ATPases are present in all eukaryotic genomes sequenced so......Eukaryotes encompass a remarkable variety of organisms and unresolved lineages. Different phylogenetic analyses have lead to conflicting conclusions as to the origin and associations between lineages and species. In this work, we investigated evolutionary relationship of a family of cation pumps...... exclusive for the secretory pathway of eukaryotes by combining the identification of lineage-specific genes with phylogenetic evolution of common genes. Sequences of P5 ATPases, which are regarded to be cation pumps in the endoplasmic reticulum (ER), were identified in all eukaryotic lineages but not in any...
Wolf Yuri I
Full Text Available Abstract Background: The presence of introns in protein-coding genes is a universal feature of eukaryotic genome organization, and the genes of multicellular eukaryotes, typically, contain multiple introns, a substantial fraction of which share position in distant taxa, such as plants and animals. Depending on the methods and data sets used, researchers have reached opposite conclusions on the causes of the high fraction of shared introns in orthologous genes from distant eukaryotes. Some studies conclude that shared intron positions reflect, almost entirely, a remarkable evolutionary conservation, whereas others attribute it to parallel gain of introns. To resolve these contradictions, it is crucial to analyze the evolution of introns by using a model that minimally relies on arbitrary assumptions. Results: We developed a probabilistic model of evolution that allows for variability of intron gain and loss rates over branches of the phylogenetic tree, individual genes, and individual sites. Applying this model to an extended set of conserved eukaryotic genes, we find that parallel gain, on average, accounts for only ~8% of the shared intron positions. However, the distribution of parallel gains over the phylogenetic tree of eukaryotes is highly non-uniform. There are, practically, no parallel gains in closely related lineages, whereas for distant lineages, such as animals and plants, parallel gains appear to contribute up to 20% of the shared intron positions. In accord with these findings, we estimated that ancestral introns have a high probability to be retained in extant genomes, and conversely, that a substantial fraction of extant introns have retained their positions since the early stages of eukaryotic evolution. In addition, the density of sites that are available for intron insertion is estimated to be, approximately, one in seven basepairs. Conclusion: We obtained robust estimates of the contribution of parallel gain to the observed
Haas, B J; Salzberg, S L; Zhu, W; Pertea, M; Allen, J E; Orvis, J; White, O; Buell, C R; Wortman, J R
EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.
Full Text Available Abstract Background The teleost Zoarces viviparus (eelpout lives along the coasts of Northern Europe and has long been an established model organism for marine ecology and environmental monitoring. The scarce information about this species genome has however restrained the use of efficient molecular-level assays, such as gene expression microarrays. Results In the present study we present the first comprehensive characterization of the Zoarces viviparus liver transcriptome. From 400,000 reads generated by massively parallel pyrosequencing, more than 50,000 pieces of putative transcripts were assembled, annotated and functionally classified. The data was estimated to cover roughly 40% of the total transcriptome and homologues for about half of the genes of Gasterosteus aculeatus (stickleback were identified. The sequence data was consequently used to design an oligonucleotide microarray for large-scale gene expression analysis. Conclusion Our results show that one run using a Genome Sequencer FLX from 454 Life Science/Roche generates enough genomic information for adequate de novo assembly of a large number of genes in a higher vertebrate. The generated sequence data, including the validated microarray probes, are publicly available to promote genome-wide research in Zoarces viviparus.
Woodhams, Michael D; Stadler, Peter F; Penny, David; Collins, Lesley J
Within eukaryotes there is a complex cascade of RNA-based macromolecules that process other RNA molecules, especially mRNA, tRNA and rRNA. An example is RNase MRP processing ribosomal RNA (rRNA) in ribosome biogenesis. One hypothesis is that this complexity was present early in eukaryotic evolution; an alternative is that an initial simpler network later gained complexity by gene duplication in lineages that led to animals, fungi and plants. Recently there has been a rapid increase in support for the complexity-early theory because the vast majority of these RNA-processing reactions are found throughout eukaryotes, and thus were likely to be present in the last common ancestor of living eukaryotes, herein called the Eukaryotic Ancestor. We present an overview of the RNA processing cascade in the Eukaryotic Ancestor and investigate in particular, RNase MRP which was previously thought to have evolved later in eukaryotes due to its apparent limited distribution in fungi and animals and plants. Recent publications, as well as our own genomic searches, find previously unknown RNase MRP RNAs, indicating that RNase MRP has a wide distribution in eukaryotes. Combining secondary structure and promoter region analysis of RNAs for RNase MRP, along with analysis of the target substrate (rRNA), allows us to discuss this distribution in the light of eukaryotic evolution. We conclude that RNase MRP can now be placed in the RNA-processing cascade of the Eukaryotic Ancestor, highlighting the complexity of RNA-processing in early eukaryotes. Promoter analyses of MRP-RNA suggest that regulation of the critical processes of rRNA cleavage can vary, showing that even these key cellular processes (for which we expect high conservation) show some species-specific variability. We present our consensus MRP-RNA secondary structure as a useful model for further searches.
How-Kit, Alexandre; Daunay, Antoine; Mazaleyrat, Nicolas; Busato, Florence; Daviaud, Christian; Teyssier, Emeline; Deleuze, Jean-François; Gallusci, Philippe; Tost, Jörg
Pyrosequencing permits accurate quantification of DNA methylation of specific regions where the proportions of the C/T polymorphism induced by sodium bisulfite treatment of DNA reflects the DNA methylation level. The commercially available high-throughput locus-specific pyrosequencing instruments allow for the simultaneous analysis of 96 samples, but restrict the DNA methylation analysis to CpG dinucleotide sites, which can be limiting in many biological systems. In contrast to mammals where DNA methylation occurs nearly exclusively on CpG dinucleotides, plants genomes harbor DNA methylation also in other sequence contexts including CHG and CHH motives, which cannot be evaluated by these pyrosequencing instruments due to software limitations. Here, we present a complete pipeline for accurate CpG and non-CpG cytosine methylation analysis at single base-resolution using high-throughput locus-specific pyrosequencing. The devised approach includes the design and validation of PCR amplification on bisulfite-treated DNA and pyrosequencing assays as well as the quantification of the methylation level at every cytosine from the raw peak intensities of the Pyrograms by two newly developed Visual Basic Applications. Our method presents accurate and reproducible results as exemplified by the cytosine methylation analysis of the promoter regions of two Tomato genes (NOR and CNR) encoding transcription regulators of fruit ripening during different stages of fruit development. Our results confirmed a significant and temporally coordinated loss of DNA methylation on specific cytosines during the early stages of fruit development in both promoters as previously shown by WGBS. The manuscript describes thus the first high-throughput locus-specific DNA methylation analysis in plants using pyrosequencing.
Yeeles, Joseph T P; Deegan, Tom D; Janska, Agnieszka; Early, Anne; Diffley, John F X
Eukaryotic cells initiate DNA replication from multiple origins, which must be tightly regulated to promote precise genome duplication in every cell cycle. To accomplish this, initiation is partitioned into two temporally discrete steps: a double hexameric minichromosome maintenance (MCM) complex is first loaded at replication origins during G1 phase, and then converted to the active CMG (Cdc45-MCM-GINS) helicase during S phase. Here we describe the reconstitution of budding yeast DNA replication initiation with 16 purified replication factors, made from 42 polypeptides. Origin-dependent initiation recapitulates regulation seen in vivo. Cyclin-dependent kinase (CDK) inhibits MCM loading by phosphorylating the origin recognition complex (ORC) and promotes CMG formation by phosphorylating Sld2 and Sld3. Dbf4-dependent kinase (DDK) promotes replication by phosphorylating MCM, and can act either before or after CDK. These experiments define the minimum complement of proteins, protein kinase substrates and co-factors required for regulated eukaryotic DNA replication.
Keeling Patrick J
Full Text Available Abstract Background Horizontal or lateral transfer of genetic material between distantly related prokaryotes has been shown to play a major role in the evolution of bacterial and archaeal genomes, but exchange of genes between prokaryotes and eukaryotes is not as well understood. In particular, gene flow from eukaryotes to prokaryotes is rarely documented with strong support, which is unusual since prokaryotic genomes appear to readily accept foreign genes. Results Here, we show that abundant marine cyanobacteria in the related genera Synechococcus and Prochlorococcus acquired a key Calvin cycle/glycolytic enzyme from a eukaryote. Two non-homologous forms of fructose bisphosphate aldolase (FBA are characteristic of eukaryotes and prokaryotes respectively. However, a eukaryotic gene has been inserted immediately upstream of the ancestral prokaryotic gene in several strains (ecotypes of Synechococcus and Prochlorococcus. In one lineage this new gene has replaced the ancestral gene altogether. The eukaryotic gene is most closely related to the plastid-targeted FBA from red algae. This eukaryotic-type FBA once replaced the plastid/cyanobacterial type in photosynthetic eukaryotes, hinting at a possible functional advantage in Calvin cycle reactions. The strains that now possess this eukaryotic FBA are scattered across the tree of Synechococcus and Prochlorococcus, perhaps because the gene has been transferred multiple times among cyanobacteria, or more likely because it has been selectively retained only in certain lineages. Conclusion A gene for plastid-targeted FBA has been transferred from red algae to cyanobacteria, where it has inserted itself beside its non-homologous, functional analogue. Its current distribution in Prochlorococcus and Synechococcus is punctate, suggesting a complex history since its introduction to this group.
Terkelsen, Kasper Munch; Krogh, A.
and quality of reliable gene annotation grows. Results We present a procedure, Agene, that automatically generates a species-specific gene predictor from a set of reliable mRNA sequences and a genome. We apply a Hidden Markov model (HMM) that implements explicit length distribution modelling for all gene......Background The number of sequenced eukaryotic genomes is rapidly increasing. This means that over time it will be hard to keep supplying customised gene finders for each genome. This calls for procedures to automatically generate species-specific gene finders and to re-train them as the quantity...... structure blocks using acyclic discrete phase type distributions. The state structure of the each HMM is generated dynamically from an array of sub-models to include only gene features represented in the training set. Conclusion Acyclic discrete phase type distributions are well suited to model sequence...
Bomberg, M.; Nyyssoenen, M.; Itaevaara, M. [VTT Technical Research Centre of Finland, Espoo (Finland)
Recent advancement in sequencing technologies, 'Next Generation Sequencing', such as FLX 454 pyrosequencing has made it possible to obtain large amounts of sequence data where previously only few sequences could be obtained. This technique is especially useful for the study of community composition of uncultured microbial populations in environmental samples. In this project, the FLX 454 pyrosequencing technique was used to obtain up to 20 000 16S rRNA sequences or 10 000 mRNA sequences from each sample for identification of the microbial species composition as well as for comparison of the microbial communities between different samples. This project focused on the characterization of active microbial communities in the groundwater at the final disposal site of high radioactive wastes in Olkiluoto by FLX 454 pyrosequencing of the bacterial and archaeal ribosomal RNA as well as of the mRNA transcripts of the dsrB gene and mcrA gene of sulphate reducing bacteria and methanogenic archaea, respectively. Specific emphasis was put on studying the relationship of active and latent sulphate reducers and methanogens by qPCR due to their important roles in deep geobiochemical processes connected to copper corrosion. Seven packered boreholes were sampled anaerobically in Olkiluoto during 2009-2010. Groundwater was pumped from specific depths and the microbial cells werecollected by filtration on a membrane. Active microbial communities were studied based on RNA extracted from the membranes and translated to copy DNA, followed by sequencing by 454 Tag pyrosequencing. A total of 27 different bacterial and 17 archaeal taxonomic groups were detected.
Kristensen, Lasse Sommer; Johansen, Jens Vilstrup; Grønbæk, Kirsten
DNA methylation is an epigenetic modification that plays important roles in healthy as well as diseased cells, by influencing the transcription of genes. In spite the fact that human somatic cells are diploid, most of the currently available methods for the study of DNA methylation do not provide......-effective protocol for allele-specific DNA methylation detection based on Pyrosequencing(®) of methylation-specific PCR (MSP) products including a single nucleotide polymorphism (SNP) within the amplicon....
Bomberg, M; Nyyssoenen, M; Itaevaara, M [VTT Technical Research Centre of Finland, Espoo (Finland)
Recent advancement in sequencing technologies, 'Next Generation Sequencing', such as FLX 454 pyrosequencing has made it possible to obtain large amounts of sequence data where previously only few sequences could be obtained. This technique is especially useful for the study of community composition of uncultured microbial populations in environmental samples. In this project, the FLX 454 pyrosequencing technique was used to obtain up to 20 000 16S rRNA sequences or 10 000 mRNA sequences from each sample for identification of the microbial species composition as well as for comparison of the microbial communities between different samples. This project focused on the characterization of active microbial communities in the groundwater at the final disposal site of high radioactive wastes in Olkiluoto by FLX 454 pyrosequencing of the bacterial and archaeal ribosomal RNA as well as of the mRNA transcripts of the dsrB gene and mcrA gene of sulphate reducing bacteria and methanogenic archaea, respectively. Specific emphasis was put on studying the relationship of active and latent sulphate reducers and methanogens by qPCR due to their important roles in deep geobiochemical processes connected to copper corrosion. Seven packered boreholes were sampled anaerobically in Olkiluoto during 2009-2010. Groundwater was pumped from specific depths and the microbial cells werecollected by filtration on a membrane. Active microbial communities were studied based on RNA extracted from the membranes and translated to copy DNA, followed by sequencing by 454 Tag pyrosequencing. A total of 27 different bacterial and 17 archaeal taxonomic groups were detected.
Bomberg, M.; Nyyssoenen, M.; Itaevaara, M.
Recent advancement in sequencing technologies, 'Next Generation Sequencing', such as FLX 454 pyrosequencing has made it possible to obtain large amounts of sequence data where previously only few sequences could be obtained. This technique is especially useful for the study of community composition of uncultured microbial populations in environmental samples. In this project, the FLX 454 pyrosequencing technique was used to obtain up to 20 000 16S rRNA sequences or 10 000 mRNA sequences from each sample for identification of the microbial species composition as well as for comparison of the microbial communities between different samples. This project focused on the characterization of active microbial communities in the groundwater at the final disposal site of high radioactive wastes in Olkiluoto by FLX 454 pyrosequencing of the bacterial and archaeal ribosomal RNA as well as of the mRNA transcripts of the dsrB gene and mcrA gene of sulphate reducing bacteria and methanogenic archaea, respectively. Specific emphasis was put on studying the relationship of active and latent sulphate reducers and methanogens by qPCR due to their important roles in deep geobiochemical processes connected to copper corrosion. Seven packered boreholes were sampled anaerobically in Olkiluoto during 2009-2010. Groundwater was pumped from specific depths and the microbial cells werecollected by filtration on a membrane. Active microbial communities were studied based on RNA extracted from the membranes and translated to copy DNA, followed by sequencing by 454 Tag pyrosequencing. A total of 27 different bacterial and 17 archaeal taxonomic groups were detected
Greganova, Eva; Steinmann, Michael; Mäser, Pascal; Fankhauser, Niklaus
Ion transporters are fundamental to life. Due to their ancient origin and conservation in sequence, ion transporters are also particularly well suited for comparative genomics of distantly related species. Here, we perform genome-wide ion transporter profiling as a basis for comparative genomics of eukaryotes. From a given predicted proteome, we identify all bona fide ion channels, ion porters, and ion pumps. Concentrating on unicellular eukaryotes (n = 37), we demonstrate that clustering of species according to their repertoire of ion transporters segregates obligate endoparasites (n = 23) on the one hand, from free-living species and facultative parasites (n = 14) on the other hand. This surprising finding indicates strong convergent evolution of the parasites regarding the acquisition and homeostasis of inorganic ions. Random forest classification identifies transporters of ammonia, plus transporters of iron and other transition metals, as the most informative for distinguishing the obligate parasites. Thus, in silico ionomics further underscores the importance of iron in infection biology and suggests access to host sources of nitrogen and transition metals to be selective forces in the evolution of parasitism. This finding is in agreement with the phenomenon of iron withholding as a primordial antimicrobial strategy of infected mammals.
Pernice, Massimo C; Giner, Caterina R; Logares, Ramiro; Perera-Bel, Júlia; Acinas, Silvia G; Duarte, Carlos M; Gasol, Josep M; Massana, Ramon
In this work, we study the diversity of bathypelagic microbial eukaryotes (0.8-20 μm) in the global ocean. Seawater samples from 3000 to 4000 m depth from 27 stations in the Atlantic, Pacific and Indian Oceans were analyzed by pyrosequencing the V4 region of the 18S ribosomal DNA. The relative abundance of the most abundant operational taxonomic units agreed with the results of a parallel metagenomic analysis, suggesting limited PCR biases in the tag approach. Although rarefaction curves for single stations were seldom saturated, the global analysis of all sequences together suggested an adequate recovery of bathypelagic diversity. Community composition presented a large variability among samples, which was poorly explained by linear geographic distance. In fact, the similarity between communities was better explained by water mass composition (26% of the variability) and the ratio in cell abundance between prokaryotes and microbial eukaryotes (21%). Deep diversity appeared dominated by four taxonomic groups (Collodaria, Chrysophytes, Basidiomycota and MALV-II) appearing in different proportions in each sample. Novel diversity amounted to 1% of the pyrotags and was lower than expected. Our study represents an essential step in the investigation of bathypelagic microbial eukaryotes, indicating dominating taxonomic groups and suggesting idiosyncratic assemblages in distinct oceanic regions.
Pernice, Massimo C.
In this work, we study the diversity of bathypelagic microbial eukaryotes (0.8–20 μm) in the global ocean. Seawater samples from 3000 to 4000 m depth from 27 stations in the Atlantic, Pacific and Indian Oceans were analyzed by pyrosequencing the V4 region of the 18S ribosomal DNA. The relative abundance of the most abundant operational taxonomic units agreed with the results of a parallel metagenomic analysis, suggesting limited PCR biases in the tag approach. Although rarefaction curves for single stations were seldom saturated, the global analysis of all sequences together suggested an adequate recovery of bathypelagic diversity. Community composition presented a large variability among samples, which was poorly explained by linear geographic distance. In fact, the similarity between communities was better explained by water mass composition (26% of the variability) and the ratio in cell abundance between prokaryotes and microbial eukaryotes (21%). Deep diversity appeared dominated by four taxonomic groups (Collodaria, Chrysophytes, Basidiomycota and MALV-II) appearing in different proportions in each sample. Novel diversity amounted to 1% of the pyrotags and was lower than expected. Our study represents an essential step in the investigation of bathypelagic microbial eukaryotes, indicating dominating taxonomic groups and suggesting idiosyncratic assemblages in distinct oceanic regions.The ISME Journal advance online publication, 9 October 2015; doi:10.1038/ismej.2015.170
Uldahl, Kristine Buch; Wu, Linping; Hall, Arnaldur
followed viral uptake, intracellular trafficking and cell viability in human endothelial cells of brain (hCMEC/D3 cells) and umbilical vein (HUVEC) origin. Whereas SMV1 is efficiently internalized into both types of human cells, SSV2 differentiates between HUVECs and hCMEC/D3 cells, thus opening a path......Viruses from the third domain of life, Archaea, exhibit unusual features including extreme stability that allow their survival in harsh environments. In addition, these species have never been reported to integrate into human or any other eukaryotic genomes, and could thus serve for exploration...
Genome sequencing efforts in the past decade were aimed at generating draft sequences of many prokaryotic and eukaryotic model organisms. Successful completion of unicellular eukaryotes, worm, fly and human genome have opened up the new field of molecular biology and function...
Full Text Available Abstract Background Common bean (Phaseolus vulgaris is the most important food legume in the world. Although this crop is very important to both the developed and developing world as a means of dietary protein supply, resources available in common bean are limited. Global transcriptome analysis is important to better understand gene expression, genetic variation, and gene structure annotation in addition to other important features. However, the number and description of common bean sequences are very limited, which greatly inhibits genome and transcriptome research. Here we used 454 pyrosequencing to obtain a substantial transcriptome dataset for common bean. Results We obtained 1,692,972 reads with an average read length of 207 nucleotides (nt. These reads were assembled into 59,295 unigenes including 39,572 contigs and 19,723 singletons, in addition to 35,328 singletons less than 100 bp. Comparing the unigenes to common bean ESTs deposited in GenBank, we found that 53.40% or 31,664 of these unigenes had no matches to this dataset and can be considered as new common bean transcripts. Functional annotation of the unigenes carried out by Gene Ontology assignments from hits to Arabidopsis and soybean indicated coverage of a broad range of GO categories. The common bean unigenes were also compared to the bean bacterial artificial chromosome (BAC end sequences, and a total of 21% of the unigenes (12,724 including 9,199 contigs and 3,256 singletons match to the 8,823 BAC-end sequences. In addition, a large number of simple sequence repeats (SSRs and transcription factors were also identified in this study. Conclusions This work provides the first large scale identification of the common bean transcriptome derived by 454 pyrosequencing. This research has resulted in a 150% increase in the number of Phaseolus vulgaris ESTs. The dataset obtained through this analysis will provide a platform for functional genomics in common bean and related legumes and
Ruggles, Kelly V.; Garbarino, Jeanne; Liu, Ying; Moon, James; Schneider, Kerry; Henneberry, Annette; Billheimer, Jeff; Millar, John S.; Marchadier, Dawn; Valasek, Mark A.; Joblin-Mills, Aidan; Gulati, Sonia; Munkacsi, Andrew B.; Repa, Joyce J.; Rader, Dan; Sturley, Stephen L.
The toxic subcellular accumulation of lipids predisposes several human metabolic syndromes, including obesity, type 2 diabetes, and some forms of neurodegeneration. To identify pathways that prevent lipid-induced cell death, we performed a genome-wide fatty acid sensitivity screen in Saccharomyces cerevisiae. We identified 167 yeast mutants as sensitive to 0.5 mm palmitoleate, 45% of which define pathways that were conserved in humans. 63 lesions also impacted the status of the lipid droplet; however, this was not correlated to the degree of fatty acid sensitivity. The most liposensitive yeast strain arose due to deletion of the “ARE2 required for viability” (ARV1) gene, encoding an evolutionarily conserved, potential lipid transporter that localizes to the endoplasmic reticulum membrane. Down-regulation of mammalian ARV1 in MIN6 pancreatic β-cells or HEK293 cells resulted in decreased neutral lipid synthesis, increased fatty acid sensitivity, and lipoapoptosis. Conversely, elevated expression of human ARV1 in HEK293 cells or mouse liver significantly increased triglyceride mass and lipid droplet number. The ARV1-induced hepatic triglyceride accumulation was accompanied by up-regulation of DGAT1, a triglyceride synthesis gene, and the fatty acid transporter, CD36. Furthermore, ARV1 was identified as a transcriptional of the protein peroxisome proliferator-activated receptor α (PPARα), a key regulator of lipid homeostasis whose transcriptional targets include DGAT1 and CD36. These results implicate ARV1 as a protective factor in lipotoxic diseases due to modulation of fatty acid metabolism. In conclusion, a lipotoxicity-based genetic screen in a model microorganism has identified 75 human genes that may play key roles in neutral lipid metabolism and disease. PMID:24273168
Ruggles, Kelly V; Garbarino, Jeanne; Liu, Ying; Moon, James; Schneider, Kerry; Henneberry, Annette; Billheimer, Jeff; Millar, John S; Marchadier, Dawn; Valasek, Mark A; Joblin-Mills, Aidan; Gulati, Sonia; Munkacsi, Andrew B; Repa, Joyce J; Rader, Dan; Sturley, Stephen L
The toxic subcellular accumulation of lipids predisposes several human metabolic syndromes, including obesity, type 2 diabetes, and some forms of neurodegeneration. To identify pathways that prevent lipid-induced cell death, we performed a genome-wide fatty acid sensitivity screen in Saccharomyces cerevisiae. We identified 167 yeast mutants as sensitive to 0.5 mm palmitoleate, 45% of which define pathways that were conserved in humans. 63 lesions also impacted the status of the lipid droplet; however, this was not correlated to the degree of fatty acid sensitivity. The most liposensitive yeast strain arose due to deletion of the "ARE2 required for viability" (ARV1) gene, encoding an evolutionarily conserved, potential lipid transporter that localizes to the endoplasmic reticulum membrane. Down-regulation of mammalian ARV1 in MIN6 pancreatic β-cells or HEK293 cells resulted in decreased neutral lipid synthesis, increased fatty acid sensitivity, and lipoapoptosis. Conversely, elevated expression of human ARV1 in HEK293 cells or mouse liver significantly increased triglyceride mass and lipid droplet number. The ARV1-induced hepatic triglyceride accumulation was accompanied by up-regulation of DGAT1, a triglyceride synthesis gene, and the fatty acid transporter, CD36. Furthermore, ARV1 was identified as a transcriptional of the protein peroxisome proliferator-activated receptor α (PPARα), a key regulator of lipid homeostasis whose transcriptional targets include DGAT1 and CD36. These results implicate ARV1 as a protective factor in lipotoxic diseases due to modulation of fatty acid metabolism. In conclusion, a lipotoxicity-based genetic screen in a model microorganism has identified 75 human genes that may play key roles in neutral lipid metabolism and disease.
Ginger, Michael L; Fritz-Laylin, Lillian K; Fulton, Chandler; Cande, W Zacheus; Dawson, Scott C
Protists account for the bulk of eukaryotic diversity. Through studies of gene and especially genome sequences the molecular basis for this diversity can be determined. Evident from genome sequencing are examples of versatile metabolism that go far beyond the canonical pathways described for eukaryotes in textbooks. In the last 2-3 years, genome sequencing and transcript profiling has unveiled several examples of heterotrophic and phototrophic protists that are unexpectedly well-equipped for ATP production using a facultative anaerobic metabolism, including some protists that can (Chlamydomonas reinhardtii) or are predicted (Naegleria gruberi, Acanthamoeba castellanii, Amoebidium parasiticum) to produce H(2) in their metabolism. It is possible that some enzymes of anaerobic metabolism were acquired and distributed among eukaryotes by lateral transfer, but it is also likely that the common ancestor of eukaryotes already had far more metabolic versatility than was widely thought a few years ago. The discussion of core energy metabolism in unicellular eukaryotes is the subject of this review. Since genomic sequencing has so far only touched the surface of protist diversity, it is anticipated that sequences of additional protists may reveal an even wider range of metabolic capabilities, while simultaneously enriching our understanding of the early evolution of eukaryotes. Copyright © 2010 Elsevier GmbH. All rights reserved.
Nielsen, Henrik; Engelbrecht, Jacob; Brunak, Søren
We have developed a new method for the identification of signal peptides and their cleavage based on neural networks trained on separate sets of prokaryotic and eukaryotic sequence. The method performs significantly better than previous prediction schemes and can easily be applied on genome...
The complete and faithful transmission of eukaryotic genome to daughter cells involves the timely duplication of mother cell's DNA. DNA replication starts at multiple chromosomal positions called replication origin. From each activated replication origin two replication forks progress in opposite direction and duplicate the mother cell's DNA. While it is widely accepted that in eukaryotic organisms replication origins are activated in a stochastic manner, little is known on the sources of the observed stochasticity. It is often associated to the population variability to enter S phase. We extract from a growing Saccharomyces cerevisiae population the average rate of origin activation in a single cell by combining single molecule measurements and a numerical deconvolution technique. We show that the temporal profile of the rate of origin activation in a single cell is similar to the one extracted from a replicating cell population. Taking into account this observation we exclude the population variability as the origin of observed stochasticity in origin activation. We confirm that the rate of origin activation increases in the early stage of S phase and decreases at the latter stage. The population average activation rate extracted from single molecule analysis is in prefect accordance with the activation rate extracted from published micro-array data, confirming therefore the homogeneity and genome scale invariance of dynamic of replication process. All these observations point toward a possible role of replication fork to control the rate of origin activation.
Thanchomnang, Tongjit; Tantrawatpan, Chairat; Intapan, Pewpan M.; Sanpool, Oranuch; Janwan, Penchom; Lulitanond, Viraphong; Tourtip, Somjintana; Yamasaki, Hiroshi; Maleewong, Wanchai
Taenia saginata, T. solium, and T. asiatica are causative agents of taeniasis in humans. The difficulty of morphological identification of human taeniids can lead to misdiagnosis or confusion. To overcome this problem, several molecular methods have been developed, but use of these tends to be time-consuming. Here, a rapid and high-throughput pyrosequencing approach was developed for the identification of three human taeniids originating from various countries. Primers targeting the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene of the three Taenia species were designed. Variations in a 26-nucleotide target region were used for identification. The reproducibility and accuracy of the pyrosequencing technology was confirmed by Sanger sequencing. This technique will be a valuable tool to distinguish between sympatric human taeniids that occur in Thailand, Asia and Pacific countries. This method could potentially be used for the molecular identification of the taeniid species that might be associated with suspicious cysts and lesions, or cyst residues in humans or livestock at the slaughterhouse. PMID:24945530
Full Text Available Taenia saginata, T. solium, and T. asiatica are causative agents of taeniasis in humans. The difficulty of morphological identification of human taeniids can lead to misdiagnosis or confusion. To overcome this problem, several molecular methods have been developed, but use of these tends to be time-consuming. Here, a rapid and high-throughput pyrosequencing approach was developed for the identification of three human taeniids originating from various countries. Primers targeting the mitochondrial cytochrome c oxidase subunit 1 (cox1 gene of the three Taenia species were designed. Variations in a 26-nucleotide target region were used for identification. The reproducibility and accuracy of the pyrosequencing technology was confirmed by Sanger sequencing. This technique will be a valuable tool to distinguish between sympatric human taeniids that occur in Thailand, Asia and Pacific countries. This method could potentially be used for the molecular identification of the taeniid species that might be associated with suspicious cysts and lesions, or cyst residues in humans or livestock at the slaughterhouse.
Roger Andrew J
Full Text Available Abstract Background Lateral gene transfer can introduce genes with novel functions into genomes or replace genes with functionally similar orthologs or paralogs. Here we present a study of the occurrence of the latter gene replacement phenomenon in the four gene families encoding different classes of glutamate dehydrogenase (GDH, to evaluate and compare the patterns and rates of lateral gene transfer (LGT in prokaryotes and eukaryotes. Results We extend the taxon sampling of gdh genes with nine new eukaryotic sequences and examine the phylogenetic distribution pattern of the various GDH classes in combination with maximum likelihood phylogenetic analyses. The distribution pattern analyses indicate that LGT has played a significant role in the evolution of the four gdh gene families. Indeed, a number of gene transfer events are identified by phylogenetic analyses, including numerous prokaryotic intra-domain transfers, some prokaryotic inter-domain transfers and several inter-domain transfers between prokaryotes and microbial eukaryotes (protists. Conclusion LGT has apparently affected eukaryotes and prokaryotes to a similar extent within the gdh gene families. In the absence of indications that the evolution of the gdh gene families is radically different from other families, these results suggest that gene transfer might be an important evolutionary mechanism in microbial eukaryote genome evolution.
Jagus, Rosemary; Bachvaroff, Tsvetan R; Joshi, Bhavesh; Place, Allen R
The greatest diversity of eukaryotic species is within the microbial eukaryotes, the protists, with plants and fungi/metazoa representing just two of the estimated seventy five lineages of eukaryotes. Protists are a diverse group characterized by unusual genome features and a wide range of genome sizes from 8.2 Mb in the apicomplexan parasite Babesia bovis to 112,000-220,050 Mb in the dinoflagellate Prorocentrum micans. Protists possess numerous cellular, molecular and biochemical traits not observed in "text-book" model organisms. These features challenge some of the concepts and assumptions about the regulation of gene expression in eukaryotes. Like multicellular eukaryotes, many protists encode multiple eIF4Es, but few functional studies have been undertaken except in parasitic species. An earlier phylogenetic analysis of protist eIF4Es indicated that they cannot be grouped within the three classes that describe eIF4E family members from multicellular organisms. Many more protist sequences are now available from which three clades can be recognized that are distinct from the plant/fungi/metazoan classes. Understanding of the protist eIF4Es will be facilitated as more sequences become available particularly for the under-represented opisthokonts and amoebozoa. Similarly, a better understanding of eIF4Es within each clade will develop as more functional studies of protist eIF4Es are completed.
Hervé, Cécile; Siméon, Amandine; Jam, Murielle
Arabinogalactan proteins (AGPs) are highly glycosylated, hydroxyproline-rich proteins found at the cell surface of plants, where they play key roles in developmental processes. Brown algae are marine, multicellular, photosynthetic eukaryotes. They belong to the phylum Stramenopiles, which...
Wolf Yuri I; Novichkov Pavel S; Sorokin Alexander V; Makarova Kira S; Koonin Eugene V
Abstract Background An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs ...
Sun, Shumei; Zhou, Hao; Zhou, Bin; Hu, Ziyou; Hou, Jinlin; Sun, Jian
To evaluate the sensitivity and specificity of nested PCR combined with pyrosequencing in the detection of HBV drug-resistance gene. RtM204I (ATT) mutant and rtM204 (ATG) nonmutant plasmids mixed at different ratios were detected for mutations using nested-PCR combined with pyrosequencing, and the results were compared with those by conventional PCR pyrosequencing to analyze the linearity and consistency of the two methods. Clinical specimens with different viral loads were examined for drug-resistant mutations using nested PCR pyrosequencing and nested PCR combined with dideoxy sequencing (Sanger) for comparison of the detection sensitivity and specificity. The fitting curves demonstrated good linearity of both conventional PCR pyrosequencing and nested PCR pyrosequencing (R(2)>0.99, PNested PCR showed a better consistency with the predicted value than conventional PCR, and was superior to conventional PCR for detection of samples containing 90% mutant plasmid. In the detection of clinical specimens, Sanger sequencing had a significantly lower sensitivity than nested PCR pyrosequencing (92% vs 100%, Pnested PCR and Sanger sequencing method, nested PCR pyrosequencing has a higher sensitivity especially in clinical specimens with low viral copies, which can be important for early detection of HBV mutant strains and hence more effective clinical management.
Liu, Bernard A
Today there exists a rapidly expanding number of sequenced genomes. Cataloging protein interaction domains such as the Src Homology 2 (SH2) domain across these various genomes can be accomplished with ease due to existing algorithms and predictions models. An evolutionary analysis of SH2 domains provides a step towards understanding how SH2 proteins integrated with existing signaling networks to position phosphotyrosine signaling as a crucial driver of robust cellular communication networks in metazoans. However organizing and tracing SH2 domain across organisms and understanding their evolutionary trajectory remains a challenge. This chapter describes several methodologies towards analyzing the evolutionary trajectory of SH2 domains including a global SH2 domain classification system, which facilitates annotation of new SH2 sequences essential for tracing the lineage of SH2 domains throughout eukaryote evolution. This classification utilizes a combination of sequence homology, protein domain architecture and the boundary positions between introns and exons within the SH2 domain or genes encoding these domains. Discrete SH2 families can then be traced across various genomes to provide insight into its origins. Furthermore, additional methods for examining potential mechanisms for divergence of SH2 domains from structural changes to alterations in the protein domain content and genome duplication will be discussed. Therefore a better understanding of SH2 domain evolution may enhance our insight into the emergence of phosphotyrosine signaling and the expansion of protein interaction domains.
One of the fundamental problems of modern molecular cellular radiobiology is to reveal general and peculiar processes of the formation of gene mutations and chromosome aberrations in each stage of their formation in the irradiated genome of the higher eukaryots. The solution of the problems depends on the development of research within the framework of comparative radiobiology of genetic loci of the higher eukaryots that makes it possible to study quantitative regularities in the formation of gene (point) mutations and chromosome aberrations in one object and in the same experiment
Nielsen, Olaf; Løbner-Olesen, Anders
DNA replication is an extremely accurate process and cells have evolved intricate control mechanisms to ensure that each region of their genome is replicated only once during S phase. Here, we compare what is known about the processes that prevent re-replication in prokaryotic and eukaryotic cells...... prokaryotes and eukaryotes are inactivated until the next cell cycle. Furthermore, in both systems the beta-clamp of the replicative polymerase associates with enzymatic activities that contribute to the inactivation of the helicase loaders. Finally, recent studies suggest that the control mechanism...
Full Text Available BACKGROUND: Bemisia tabaci (Gennadius is a phloem-feeding insect poised to become one of the major insect pests in open field and greenhouse production systems throughout the world. The high level of resistance to insecticides is a main factor that hinders continued use of insecticides for suppression of B. tabaci. Despite its prevalence, little is known about B. tabaci at the genome level. To fill this gap, an invasive B. tabaci B biotype was subjected to pyrosequencing-based transcriptome analysis to identify genes and gene networks putatively involved in various physiological and toxicological processes. METHODOLOGY AND PRINCIPAL FINDINGS: Using Roche 454 pyrosequencing, 857,205 reads containing approximately 340 megabases were obtained from the B. tabaci transcriptome. De novo assembly generated 178,669 unigenes including 30,980 from insects, 17,881 from bacteria, and 129,808 from the nohit. A total of 50,835 (28.45% unigenes showed similarity to the non-redundant database in GenBank with a cut-off E-value of 10-5. Among them, 40,611 unigenes were assigned to one or more GO terms and 6,917 unigenes were assigned to 288 known pathways. De novo metatranscriptome analysis revealed highly diverse bacterial symbionts in B. tabaci, and demonstrated the host-symbiont cooperation in amino acid production. In-depth transcriptome analysis indentified putative molecular markers, and genes potentially involved in insecticide resistance and nutrient digestion. The utility of this transcriptome was validated by a thiamethoxam resistance study, in which annotated cytochrome P450 genes were significantly overexpressed in the resistant B. tabaci in comparison to its susceptible counterparts. CONCLUSIONS: This transcriptome/metatranscriptome analysis sheds light on the molecular understanding of symbiosis and insecticide resistance in an agriculturally important phloem-feeding insect pest, and lays the foundation for future functional genomics research of the
Sugiyama, Akifumi; Ueda, Yoshikatsu; Takase, Hisabumi; Yazaki, Kazufumi
Soil fungal communities play essential roles in soil ecosystems, affecting plant growth and health. Rhizosphere bacterial communities have been shown to undergo dynamic changes during plant growth. This study utilized 454 pyrosequencing to analyze rhizosphere fungal communities during soybean growth. Members of the Ascomycota and Basiodiomycota dominated in all soils. There were no statistically significant changes at the phylum level among growth stages or between bulk and rhizosphere soils. In contrast, the relative abundance of small numbers of operational taxonomic units, 4 during growth and 28 between bulk and rhizosphere soils, differed significantly. Clustering analysis revealed that rhizosphere fungal communities were different from bulk fungal communities during growth stages of soybeans. Taken together, these results suggest that in contrast to rhizosphere bacterial communities, most constituents of rhizosphere fungal communities remained stable during soybean growth.
Wang, H.; Perrault, A.R.; Takeda, Y.; Iliakis, G.
Cells of higher eukaryotes process within minutes double strand breaks (DSBs) in their genome using a NHEJ apparatus that engages DNA-PKcs, Ku, DNA ligase IV, XRCC4, and other as of yet unidentified factors. Although chemical inhibition, or mutation, in any of these factors delays processing, cells ultimately remove the majority of DNA DSBs using an alternative pathway operating with slower kinetics. This alternative pathway is active in mutants deficient in genes of the RAD52 epistasis group. We proposed, therefore, that it reflects an alternative form of NHEJ that operates as a backup (B-NHEJ) to the DNA-PK- dependent (D-NHEJ) pathway, rather than homology directed repair of DSBs. We studied the role of Ku and DNA-PKcs in the coordination of these pathways using as a model end joining of restriction endonuclease linearized plasmid DNA in whole cell extracts. Efficient error-free endjoining observed in such in-vitro reactions is strongly inhibited by anti-Ku antibodies. The inhibition requires DNA-PKcs, despite that fact that Ku efficiently binds DNA ends in the presence of antibodies, or in the absence of DNA-PKcs. Strong inhibition of DNA endjoining is also mediated by wortmannin, an inhibitor of DNA-PKcs, in the presence but not in the absence of Ku, and this inhibition can be rescued by pre-incubating the reaction with double stranded oligonucleotides. The results are compatible with a role of Ku in directing endjoining to a DNA-PK dependent pathway, mediated by efficient end binding and productive interactions with DNA-PKcs. On the other hand, efficient end joining is observed in extracts of cells lacking DNA-PKcs, as well as in Ku-depleted extracts sugggesting the operation of alternative pathways. Extracts depleted of Ku and DNA-PKcs rejoin blunt ends, as well as homologous ends with 3' or 5' protruding single strands with similar efficiency, but addition of Ku suppresses joining of blunt ends and homologous ends with 3' overhangs. We propose that the
El-Sayed Yasser Y
Full Text Available Abstract Background The metagenomic analysis of microbial communities holds the potential to improve our understanding of the role of microbes in clinical conditions. Recent, dramatic improvements in DNA sequencing throughput and cost will enable such analyses on individuals. However, such advances in throughput generally come at the cost of shorter read-lengths, limiting the discriminatory power of each read. In particular, classifying the microbial content of samples by sequencing the Results We describe a method for identifying the phylogenetic content of bacterial samples using high-throughput Pyrosequencing targeted at the 16S rRNA gene. Our analysis is adapted to the shorter read-lengths of such technology and uses a database of 16S rDNA to determine the most specific phylogenetic classification for reads, resulting in a weighted phylogenetic tree characterizing the content of the sample. We present results for six samples obtained from the human vagina during pregnancy that corroborates previous studies using conventional techniques. Next, we analyze the power of our method to classify reads at each level of the phylogeny using simulation experiments. We assess the impacts of read-length and database completeness on our method, and predict how we do as technology improves and more bacteria are sequenced. Finally, we study the utility of targeting specific 16S variable regions and show that such an approach considerably improves results for certain types of microbial samples. Using simulation, our method can be used to determine the most informative variable region. Conclusion This study provides positive validation of the effectiveness of targeting 16S metagenomes using short-read sequencing technology. Our methodology allows us to infer the most specific assignment of the sequence reads within the phylogeny, and to identify the most discriminative variable region to target. The analysis of high-throughput Pyrosequencing on human flora
Juan M. Arroyo
Full Text Available Premise of the study: Microsatellite primers were developed for the parasitic mistletoe Phoradendron californicum to investigate to what extent population genetic structure depends on host tree distribution within a highly fragmented landscape. Methods and Results: Fourteen unlinked polymorphic and four monomorphic nuclear microsatellite markers were developed using a genomic shotgun pyrosequencing method. A total of 187 alleles plus four monomorphic loci alleles were found in 98 individuals sampled in three populations from the Sonoran Desert in the Baja California peninsula (Mexico. Loci averaged 13.3 alleles per locus (range 4–28, and observed and expected heterozygosities within populations varied from 0.167–0.879 and 0.364–0.932, respectively. Conclusions: Levels of polymorphism of the reported markers are adequate for studies of diversity and fragmentation in natural populations of this parasitic plant. Cross-species amplifications in P. juniperinum and P. diguetianum only showed four markers that could be useful in P. diguetianum.
Fujimaki, Kotaro; Fujisawa, Tomochika; Yazawa, Shigenobu; Nishimura, Osamu; Sota, Teiji
Ground beetles of the subgenus Ohomopterus (genus Carabus) show marked divergence in species-specific male and female genital morphologies, which contributes to reproductive isolation among species. Characterizing the genetic basis of species-specific genital morphology is essential for understanding their diversification, but genomic information on Ohomopterus is not yet available. We analyzed mRNA extracted from abdominal sections of the last instar larvae and pupae of two sister species, Carabus (Ohomopterus) iwawakianus and C. (O.) uenoi, which show marked differences in genital morphology, to compare transcriptomic profiles using Roche 454 pyrosequencing. We obtained 1,608,572 high-quality reads and assembled them into 176,278 unique sequences, of which 66,049 sequences were combined into 12,662 clusters. Differential expression analyses for sexed pupae suggested that four and five clusters were differentially expressed between species for males and females, respectively. We also identified orthologous sequences of genes involved in genital development in Drosophila, which potentially affect genital development and species-specific genital morphology in Ohomopterus. This study provides the first large transcriptomic data set for a morphologically diversified beetle group, which can facilitate future studies on the genetic basis of species-specific genitalia.
We assume that the functional relations that a protein engages in, influences it’s evolutionary dynamics. An extreme case is a functional module in which all components are strongly dependent on each other, for example a protein complex or metabolic pathway. If a protein’s function is only relevant
Erives, Albert J
While the genomes of eukaryotes and Archaea both encode the histone-fold domain, only eukaryotes encode the core histone paralogs H2A, H2B, H3, and H4. With DNA, these core histones assemble into the nucleosomal octamer underlying eukaryotic chromatin. Importantly, core histones for H2A and H3 are maintained as neofunctionalized paralogs adapted for general bulk chromatin (canonical H2 and H3) or specialized chromatin (H2A.Z enriched at gene promoters and cenH3s enriched at centromeres). In this context, the identification of core histone-like "doublets" in the cytoplasmic replication factories of the Marseilleviridae (MV) is a novel finding with possible relevance to understanding the origin of eukaryotic chromatin. Here, we analyze and compare the core histone doublet genes from all known MV genomes as well as other MV genes relevant to the origin of the eukaryotic replisome. Using different phylogenetic approaches, we show that MV histone domains encode obligate H2B-H2A and H4-H3 dimers of possible proto-eukaryotic origin. MV core histone moieties form sister clades to each of the four eukaryotic clades of canonical and variant core histones. This suggests that MV core histone moieties diverged prior to eukaryotic neofunctionalizations associated with paired linear chromosomes and variant histone octamer assembly. We also show that MV genomes encode a proto-eukaryotic DNA topoisomerase II enzyme that forms a sister clade to eukaryotes. This is a relevant finding given that DNA topo II influences histone deposition and chromatin compaction and is the second most abundant nuclear protein after histones. The combined domain architecture and phylogenomic analyses presented here suggest that a primitive origin for MV histone genes is a more parsimonious explanation than horizontal gene transfers + gene fusions + sufficient divergence to eliminate relatedness to eukaryotic neofunctionalizations within the H2A and H3 clades without loss of relatedness to each of
Bruck, Irina; Perez-Arnaiz, Patricia; Colbert, Max K; Kaplan, Daniel L
The initiation of DNA replication is a highly regulated event in eukaryotic cells to ensure that the entire genome is copied once and only once during S phase. The primary target of cellular regulation of eukaryotic DNA replication initiation is the assembly and activation of the replication fork helicase, the 11-subunit assembly that unwinds DNA at a replication fork. The replication fork helicase, called CMG for Cdc45-Mcm2-7, and GINS, assembles in S phase from the constituent Cdc45, Mcm2-7, and GINS proteins. The assembly and activation of the CMG replication fork helicase during S phase is governed by 2 S-phase specific kinases, CDK and DDK. CDK stimulates the interaction between Sld2, Sld3, and Dpb11, 3 initiation factors that are each required for the initiation of DNA replication. DDK, on the other hand, phosphorylates the Mcm2, Mcm4, and Mcm6 subunits of the Mcm2-7 complex. Sld3 recruits Cdc45 to Mcm2-7 in a manner that depends on DDK, and recent work suggests that Sld3 binds directly to Mcm2-7 and also to single-stranded DNA. Furthermore, recent work demonstrates that Sld3 and its human homolog Treslin substantially stimulate DDK phosphorylation of Mcm2. These data suggest that the initiation factor Sld3/Treslin coordinates the assembly and activation of the eukaryotic replication fork helicase by recruiting Cdc45 to Mcm2-7, stimulating DDK phosphorylation of Mcm2, and binding directly to single-stranded DNA as the origin is melted.
Wuhe, Jike; Zefeng, Wu; Sanhong, Fan; Xuguang, Xi
The FANCJ-like protein family is a class of ATP-dependent helicases that can catalytically unwind duplex DNA along the 5'-3' direction. It is involved in the processes of DNA damage repair, homologous recombination and G-quadruplex DNA unwinding, and plays a critical role in maintaining genome integrity. In this study, we systemically analyzed FNACJ-like proteins from 47 eukaryotic species and discussed their sequences diversity, origin and evolution, motif organization patterns and spatial structure differences. Four members of FNACJ-like proteins, including XPD, CHL1, RTEL1 and FANCJ, were found in eukaryotes, but some of them were seriously deficient in most fungi and some insects. For example, the Zygomycota fungi lost RTEL1, Basidiomycota and Ascomycota fungi lost RTEL1 and FANCJ, and Diptera insect lost FANCJ. FANCJ-like proteins contain canonical motor domains HD1 and HD2, and the HD1 domain further integrates with three unique domains Fe-S, Arch and Extra-D. Fe-S and Arch domains are relatively conservative in all members of the family, but the Extra-D domain is lost in XPD and differs from one another in rest members. There are 7, 10 and 2 specific motifs found from the three unique domains respectively, while 5 and 12 specific motifs are found from HD1 and HD2 domains except the conserved motifs reported previously. By analyzing the arrangement pattern of these specific motifs, we found that RTEL1 and FANCJ are more closer and share two specific motifs Vb2 and Vc in HD2 domain, which are likely related with their G-quadruplex DNA unwinding activity. The evidence of evolution showed that FACNJ-like proteins were originated from a helicase, which has a HD1 domain inserted by extra Fe-S domain and Arch domain. By three continuous gene duplication events and followed specialization, eukaryotes finally possessed the current four members of FANCJ-like proteins.
Andersen, Gorm; Bjornberg, Olof; Polakova, Silvia
Pyrimidine bases are the central precursors for RNA and DNA, and their intracellular pools are determined by de novo, salvage and catabolic pathways. In eukaryotes, degradation of uracil has been believed to proceed only via the reduction to dihydrouracil. Using a yeast model, Saccharomyces kluyv...... of the eukaryotic or prokaryotic genes involved in pyrimidine degradation described to date.......Pyrimidine bases are the central precursors for RNA and DNA, and their intracellular pools are determined by de novo, salvage and catabolic pathways. In eukaryotes, degradation of uracil has been believed to proceed only via the reduction to dihydrouracil. Using a yeast model, Saccharomyces......, respectively. The gene products of URC1 and URC4 are highly conserved proteins with so far unknown functions and they are present in a variety of prokaryotes and fungi. In bacteria and in some fungi, URC1 and URC4 are linked on the genome together with the gene for uracil phosphoribosyltransferase (URC6). Urc1...
Parfrey, Laura Wegener; Walters, William A.; Lauber, Christian L.; Clemente, Jose C.; Berg-Lyons, Donna; Teiling, Clotilde; Kodira, Chinnappa; Mohiuddin, Mohammed; Brunelle, Julie; Driscoll, Mark; Fierer, Noah; Gilbert, Jack A.; Knight, Rob
Eukaryotic microbes (protists) residing in the vertebrate gut influence host health and disease, but their diversity and distribution in healthy hosts is poorly understood. Protists found in the gut are typically considered parasites, but many are commensal and some are beneficial. Further, the hygiene hypothesis predicts that association with our co-evolved microbial symbionts may be important to overall health. It is therefore imperative that we understand the normal diversity of our eukaryotic gut microbiota to test for such effects and avoid eliminating commensal organisms. We assembled a dataset of healthy individuals from two populations, one with traditional, agrarian lifestyles and a second with modern, westernized lifestyles, and characterized the human eukaryotic microbiota via high-throughput sequencing. To place the human gut microbiota within a broader context our dataset also includes gut samples from diverse mammals and samples from other aquatic and terrestrial environments. We curated the SILVA ribosomal database to reflect current knowledge of eukaryotic taxonomy and employ it as a phylogenetic framework to compare eukaryotic diversity across environment. We show that adults from the non-western population harbor a diverse community of protists, and diversity in the human gut is comparable to that in other mammals. However, the eukaryotic microbiota of the western population appears depauperate. The distribution of symbionts found in mammals reflects both host phylogeny and diet. Eukaryotic microbiota in the gut are less diverse and more patchily distributed than bacteria. More broadly, we show that eukaryotic communities in the gut are less diverse than in aquatic and terrestrial habitats, and few taxa are shared across habitat types, and diversity patterns of eukaryotes are correlated with those observed for bacteria. These results outline the distribution and diversity of microbial eukaryotic communities in the mammalian gut and across
Bracchi-Ricard, V; Nguyen, K T; Zhou, Y; Rajagopalan, P T; Chakrabarti, D; Pei, D
Ribosomal protein synthesis in eubacteria and eukaryotic organelles initiates with an N-formylmethionyl-tRNA(i), resulting in N-terminal formylation of all nascent polypeptides. Peptide deformylase (PDF) catalyzes the subsequent removal of the N-terminal formyl group from the majority of bacterial proteins. Until recently, PDF has been thought as an enzyme unique to the bacterial kingdom. Searches of the genomic DNA databases identified several genes that encode proteins of high sequence homology to bacterial PDF from eukaryotic organisms. The cDNA encoding Plasmodium falciparum PDF (PfPDF) has been cloned and overexpressed in Escherichia coli. The recombinant protein is catalytically active in deformylating N-formylated peptides, shares many of the properties of bacterial PDF, and is inhibited by specific PDF inhibitors. Western blot analysis indicated expression of mature PfPDF in trophozoite, schizont, and segmenter stages of intraerythrocytic development. These results provide strong evidence that a functional PDF is present in P. falciparum. In addition, PDF inhibitors inhibited the growth of P. falciparum in the intraerythrocytic culture. (c)2001 Elsevier Science.
Mariscal, Carlos; Doolittle, W Ford
In the half century since the formulation of the prokaryote : eukaryote dichotomy, many authors have proposed that the former evolved from something resembling the latter, in defiance of common (and possibly common sense) views. In such 'eukaryotes first' (EF) scenarios, the last universal common ancestor is imagined to have possessed significantly many of the complex characteristics of contemporary eukaryotes, as relics of an earlier 'progenotic' period or RNA world. Bacteria and Archaea thus must have lost these complex features secondarily, through 'streamlining'. If the canonical three-domain tree in which Archaea and Eukarya are sisters is accepted, EF entails that Bacteria and Archaea are convergently prokaryotic. We ask what this means and how it might be tested. © 2015 The Author(s).
Pagenkopp Lohan, K M; Fleischer, R C; Carney, K J; Holzer, K K; Ruiz, G M
Ships' ballast water (BW) commonly moves macroorganisms and microorganisms across the world's oceans and along coasts; however, the majority of these microbial transfers have gone undetected. We applied high-throughput sequencing methods to identify microbial eukaryotes, specifically emphasizing the protistan parasites, in ships' BW collected from vessels calling to the Chesapeake Bay (Virginia and Maryland, USA) from European and Eastern Canadian ports. We utilized tagged-amplicon 454 pyrosequencing with two general primer sets, amplifying either the V4 or V9 domain of the small subunit (SSU) of the ribosomal RNA (rRNA) gene complex, from total DNA extracted from water samples collected from the ballast tanks of bulk cargo vessels. We detected a diverse group of protistan taxa, with some known to contain important parasites in marine systems, including Apicomplexa (unidentified apicomplexans, unidentified gregarines, Cryptosporidium spp.), Dinophyta (Blastodinium spp., Euduboscquella sp., unidentified syndinids, Karlodinium spp., Syndinium spp.), Perkinsea (Parvilucifera sp.), Opisthokonta (Ichthyosporea sp., Pseudoperkinsidae, unidentified ichthyosporeans), and Stramenopiles (Labyrinthulomycetes). Further characterization of groups with parasitic taxa, consisting of phylogenetic analyses for four taxa (Cryptosporidium spp., Parvilucifera spp., Labyrinthulomycetes, and Ichthyosporea), revealed that sequences were obtained from both known and novel lineages. This study demonstrates that high-throughput sequencing is a viable and sensitive method for detecting parasitic protists when present and transported in the ballast water of ships. These data also underscore the potential importance of human-aided dispersal in the biogeography of these microbes and emerging diseases in the world's oceans.
This paper develops a conceptual framework for addressing questions about reproduction, individuality, and the units of selection in symbiotic associations, with special attention to the origin of the eukaryotic cell. Three kinds of reproduction are distinguished, and a possible evolutionary sequence giving rise to a mitochondrion-containing eukaryotic cell from an endosymbiotic partnership is analyzed as a series of transitions between each of the three forms of reproduction. The sequence of changes seen in this “egalitarian” evolutionary transition is compared with those that apply in “fraternal” transitions, such as the evolution of multicellularity in animals. PMID:26286983
Wallau, Gabriel Luz; Vieira, Cristina; Loreto, Élgion Lúcio Silva
All living species contain genetic information that was once shared by their common ancestor. DNA is being inherited through generations by vertical transmission (VT) from parents to offspring and from ancestor to descendant species. This process was considered the sole pathway by which biological entities exchange inheritable information. However, Horizontal Transfer (HT), the exchange of genetic information by other means than parents to offspring, was discovered in prokaryotes along with strong evidence showing that it is a very important process by which prokaryotes acquire new genes. For some time now, it has been a scientific consensus that HT events were rare and non-relevant for evolution of eukaryotic species, but there is growing evidence supporting that HT is an important and frequent phenomenon in eukaryotes as well. Here, we will discuss the latest findings regarding HT among eukaryotes, mainly HT of transposons (HTT), establishing HTT once and for all as an important phenomenon that should be taken into consideration to fully understand eukaryotes genome evolution. In addition, we will discuss the latest development methods to detect such events in a broader scale and highlight the new approaches which should be pursued by researchers to fill the knowledge gaps regarding HTT among eukaryotes.
The endosymbiotic hypothesis for the origin of the eukaryotic cell has been applied to the origin of the mitochondria and chloroplasts. However as has been pointed out by Mereschowsky in 1905, it should also be applied to the nucleus as well. If the nucleus, mitochondria and chloroplasts are endosymbionts, then it is likely that the organism that did the engulfing was not a DNA-based organism. In fact, it is useful to postulate that this organism was a primitive RNA-based organism. This hypothesis would explain the preponderance of RNA viruses found in eukaryotic cells. The centriole and basal body do not have a double membrane or DNA. Like all MTOCs (microtubule organising centres), they have a structural or morphic RNA implicated in their formation. This would argue for their origin in the early RNA-based organism rather than in an endosymbiotic event involving bacteria. Finally, the eukaryotic cell uses RNA in ways quite unlike bacteria, thus pointing to a greater emphasis of RNA in both control and structure in the cell. The origin of the eukaryotic cell may tell us why it rather than its prokaryotic relative evolved into the metazoans who are reading this paper.
Hand, Daniel; Wallis, Corrin; Colyer, Alison; Penn, Charles W.
Mammalian intestinal microbiota remain poorly understood despite decades of interest and investigation by culture-based and other long-established methodologies. Using high-throughput sequencing technology we now report a detailed analysis of canine faecal microbiota. The study group of animals comprised eleven healthy adult miniature Schnauzer dogs of mixed sex and age, some closely related and all housed in kennel and pen accommodation on the same premises with similar feeding and exercise regimes. DNA was extracted from faecal specimens and subjected to PCR amplification of 16S rDNA, followed by sequencing of the 5′ region that included variable regions V1 and V2. Barcoded amplicons were sequenced by Roche-454 FLX high-throughput pyrosequencing. Sequences were assigned to taxa using the Ribosomal Database Project Bayesian classifier and revealed dominance of Fusobacterium and Bacteroidetes phyla. Differences between animals in the proportions of different taxa, among 10,000 reads per animal, were clear and not supportive of the concept of a “core microbiota”. Despite this variability in prominent genera, littermates were shown to have a more similar faecal microbial composition than unrelated dogs. Diversity of the microbiota was also assessed by assignment of sequence reads into operational taxonomic units (OTUs) at the level of 97% sequence identity. The OTU data were then subjected to rarefaction analysis and determination of Chao1 richness estimates. The data indicated that faecal microbiota comprised possibly as many as 500 to 1500 OTUs. PMID:23382835
Full Text Available Mammalian intestinal microbiota remain poorly understood despite decades of interest and investigation by culture-based and other long-established methodologies. Using high-throughput sequencing technology we now report a detailed analysis of canine faecal microbiota. The study group of animals comprised eleven healthy adult miniature Schnauzer dogs of mixed sex and age, some closely related and all housed in kennel and pen accommodation on the same premises with similar feeding and exercise regimes. DNA was extracted from faecal specimens and subjected to PCR amplification of 16S rDNA, followed by sequencing of the 5' region that included variable regions V1 and V2. Barcoded amplicons were sequenced by Roche-454 FLX high-throughput pyrosequencing. Sequences were assigned to taxa using the Ribosomal Database Project Bayesian classifier and revealed dominance of Fusobacterium and Bacteroidetes phyla. Differences between animals in the proportions of different taxa, among 10,000 reads per animal, were clear and not supportive of the concept of a "core microbiota". Despite this variability in prominent genera, littermates were shown to have a more similar faecal microbial composition than unrelated dogs. Diversity of the microbiota was also assessed by assignment of sequence reads into operational taxonomic units (OTUs at the level of 97% sequence identity. The OTU data were then subjected to rarefaction analysis and determination of Chao1 richness estimates. The data indicated that faecal microbiota comprised possibly as many as 500 to 1500 OTUs.
Park, O-J; Yi, H; Jeon, J H; Kang, S-S; Koo, K-T; Kum, K-Y; Chun, J; Yun, C-H; Han, S H
Subgingival microorganisms are potentially associated with periodontal diseases. However, changes in the subgingival microbiota during the progress of periodontal diseases are poorly understood. In this study, we analyzed bacterial communities in the subgingival paper point samples from 32 Korean individuals with no sign of disease, gingivitis, or periodontitis using 454 FLX Titanium pyrosequencing. A total of 256,113 reads representing 26 phyla, 433 genera, and 1,016 species were detected. Bacteroidetes, Fusobacteria, Synergistetes, and Spirochaetes were the abundant phyla in periodontitis subjects, whereas Firmicutes and Proteobacteria were identified as the dominant phyla in the gingivitis and healthy subjects, respectively. Although high levels of Porphyromonas, Fusobacterium, Fretibacterium, Rothia, Filifactor, and Treponema genera were observed in the periodontitis subjects, Streptococcus, Capnocytophaga, Leptotrichia, and Haemophilus genera were found at high frequency in the gingivitis subjects. Species including Porphyromonas gingivalis, Fusobacterium nucleatum, and Fretibacterium fastidiosum were significantly increased in periodontitis subjects. On the other hand, Streptococcus pseudopneumoniae, Haemophilus parainfluenzae, and Leptotrichia hongkongensis were preferentially observed in the gingivitis subjects. Intriguingly, the halophile Halomonas hamiltonii was revealed as a predominant species in the healthy subjects. Based on Fast UniFrac analysis, distinctive bacterial clusters were classified for the healthy, gingivitis, and periodontitis state. The current findings might be useful for understanding the pathogenesis, diagnosis, and treatment of periodontal diseases. © International & American Associations for Dental Research 2015.
Park, J. E.; Lee, B. T.; Son, A.
Heavy metals leached from exhausted mines have been causing severe environmental problems in nearby soils and groundwater. Environmental mitigation was performed based on the heavy metal stabilization using Calcite and steel slag in Korea. Since the soil stabilization only temporarily immobilizes the contaminants to soil matrix, the potential risk of re-leaching heavy metal still exists. Therefore the follow-up management of stabilized soils and the corresponding evaluation methods are required to avoid the consequent contamination from the stabilized soils. In this study, microbial community analysis using pyrosequencing was performed for assessing the potential leaching of the stabilized soils. As a result of rarefaction curve and Chao1 and Shannon indices, the stabilized soil has shown lower richness and diversity as compared to non-contaminated negative control. At the phyla level, as the degree of contamination increases, most of phyla decreased with only exception of increased proteobacteria. Among proteobacteria, gamma-proteobacteria increased against the heavy metal contamination. At the species level, Methylobacter tundripaludum of gamma-proteobacteria showed the highest relative portion of microbial community, indicating that methanotrophs may play an important role in either solubilization or immobilization of heavy metals in stabilized soils.
Full Text Available Fungi are important soil components as both decomposers and plant symbionts and play a major role in ecological and biogeochemical processes. However, little is known about the richness and structure of fungal communities. DNA sequencing technologies allow for the direct estimation of microbial community diversity, avoiding culture-based biases. We therefore used 454 pyrosequencing to investigate the fungal communities in the rhizosphere of Xinjiang jujube. We obtained no less than 40,488 internal transcribed spacer (ITS rDNA reads, the number of each sample was 6943, 6647, 6584, 6550, 6860, and 6904, and we used bioinformatics and multivariate statistics to analyze the results. The index of diversity showed greater richness in the rhizosphere fungal community of a 3-year-old jujube than in that of an 8-year-old jujube. Most operational taxonomic units belonged to Ascomycota, and taxonomic analyses identified Hypocreales as the dominant fungal order. Our results demonstrated that the fungal orders are present in different proportions in different sampling areas. Redundancy analysis (RDA revealed a significant correlation between soil properties and the abundance of fungal phyla. Our results indicated lower fungal diversity in the rhizosphere of Xinjiang jujube than that reported in other studies, and we hope our findings provide a reference for future research.
David J. Rooks
Full Text Available Viruses, the most abundant biological entities on the planet, are capable of infecting organisms from all three branches of life, although the majority infect bacteria where the greatest degree of cellular diversity lies. However, the characterization and assessment of viral diversity in natural environments is only beginning to become a possibility. Through the development of a novel technique for the harvest of viral DNA and the application of 454 pyrosequencing, a snapshot of the diversity of the DNA viruses harvested from a standing pond on a cattle farm has been obtained. A high abundance of viral genotypes (785 were present within the virome. The absolute numbers of lambdoid and Shiga toxin (Stx encoding phages detected suggested that the depth of sequencing had enabled recovery of only ca. 8% of the total virus population, numbers that agreed within less than an order of magnitude with predictions made by rarefaction analysis. The most abundant viral genotypes in the pond were bacteriophages (93.7%. The predominant viral genotypes infecting higher life forms found in association with the farm were pathogens that cause disease in cattle and humans, e.g. members of the Herpesviridae. The techniques and analysis described here provide a fresh approach to the monitoring of viral populations in the aquatic environment, with the potential to become integral to the development of risk analysis tools for monitoring the dissemination of viral agents of animal, plant and human diseases.
Lee, Onon; Wang, Yong; Yang, Jiangke; Lafi, Feras Fawzi; Al-Suwailem, Abdulaziz M.; Qian, Peiyuan
Marine sponges are associated with a remarkable array of microorganisms. Using a tag pyrosequencing technology, this study was the first to investigate in depth the microbial communities associated with three Red Sea sponges, Hyrtios erectus
Kostovčík, Martin; Bateman, C.C.; Kolařík, Miroslav; Stelinski, L.L.; Jordal, B.H.; Hulcr, J.
Roč. 9, č. 1 (2015), s. 126-138 ISSN 1751-7362 Institutional support: RVO:61388971 Keywords : ambrosia symbiosis * pyrosequencing Subject RIV: EE - Microbiology, Virology Impact factor: 9.328, year: 2015
Kuo, Alan; Grigoriev, Igor
Algae play profound roles in aquatic food chains and the carbon cycle, can impose health and economic costs through toxic blooms, provide models for the study of symbiosis, photosynthesis, and eukaryotic evolution, and are candidate sources for bio-fuels; all of these research areas are part of the mission of DOE's Joint Genome Institute (JGI). To date JGI has sequenced, assembled, annotated, and released to the public the genomes of 18 species and strains of algae, sampling almost all of the major clades of photosynthetic eukaryotes. With more algal genomes currently undergoing analysis, JGI continues its commitment to driving forward basic and applied algal science. Among these ongoing projects are the pan-genome of the dominant coccolithophore Emiliania huxleyi, the interrelationships between the 4 genomes in the nucleomorph-containing Bigelowiella natans and Guillardia theta, and the search for symbiosis genes of lichens.
Nielsen, Henrik; Engelbrecht, Jacob; Brunak, Søren
We have developed a new method for the identication of signal peptides and their cleavage sites based on neural networks trained on separate sets of prokaryotic and eukaryotic sequences. The method performs signicantly better than previous prediction schemes, and can easily be applied to genome...
Revuelta, M.V.; Kan, van J.A.L.; Kay, J.; Have, ten A.
The A1 family of eukaryotic aspartic proteinases (APs) forms one of the 16 AP families. Although one of the best characterized families, the recent increase in genome sequence data has revealed many fungal AP homologs with novel sequence characteristics. This study was performed to explore the
Sergeeva, Ekaterina M; Shcherban, Andrey B; Adonina, Irina G; Nesterov, Michail A; Beletsky, Alexey V; Rakitin, Andrey L; Mardanov, Andrey V; Ravin, Nikolai V; Salina, Elena A
The multigene family encoding the 5S rRNA, one of the most important structurally-functional part of the large ribosomal subunit, is an obligate component of all eukaryotic genomes. 5S rDNA has long been a favored target for cytological and phylogenetic studies due to the inherent peculiarities of its structural organization, such as the tandem arrays of repetitive units and their high interspecific divergence. The complex polyploid nature of the genome of bread wheat, Triticum aestivum, and the technically difficult task of sequencing clusters of tandem repeats mean that the detailed organization of extended genomic regions containing 5S rRNA genes remains unclear. This is despite the recent progress made in wheat genomic sequencing. Using pyrosequencing of BAC clones, in this work we studied the organization of two distinct 5S rDNA-tagged regions of the 5BS chromosome of bread wheat. Three BAC-clones containing 5S rDNA were identified in the 5BS chromosome-specific BAC-library of Triticum aestivum. Using the results of pyrosequencing and assembling, we obtained six 5S rDNA- containing contigs with a total length of 140,417 bp, and two sets (pools) of individual 5S rDNA sequences belonging to separate, but closely located genomic regions on the 5BS chromosome. Both regions are characterized by the presence of approximately 70-80 copies of 5S rDNA, however, they are completely different in their structural organization. The first region contained highly diverged short-type 5S rDNA units that were disrupted by multiple insertions of transposable elements. The second region contained the more conserved long-type 5S rDNA, organized as a single tandem array. FISH using probes specific to both 5S rDNA unit types showed differences in the distribution and intensity of signals on the chromosomes of polyploid wheat species and their diploid progenitors. A detailed structural organization of two closely located 5S rDNA-tagged genomic regions on the 5BS chromosome of bread
Ambroise, Jérôme; Butoescu, Valentina; Robert, Annie; Tombal, Bertrand; Gala, Jean-Luc
Single Nucleotide Polymorphisms (SNPs) identified in Genome Wide Association Studies (GWAS) have generally moderate association with related complex diseases. Accordingly, Multilocus Genetic Risk Scores (MGRSs) have been computed in previous studies in order to assess the cumulative association of multiple SNPs. When several SNPs have to be genotyped for each patient, using successive uniplex pyrosequencing reactions increases analytical reagent expenses and Turnaround Time (TAT). While a set of several pyrosequencing primers could theoretically be used to analyze multiplex amplicons, this would generate overlapping primer-specific pyro-signals that are visually uninterpretable. In the current study, two multiplex assays were developed consisting of a quadruplex (n=4) and a quintuplex (n=5) polymerase chain reaction (PCR) each followed by multiplex pyrosequencing analysis. The aim was to reliably but rapidly genotype a set of prostate cancer-related SNPs (n=9). The nucleotide dispensation order was selected using SENATOR software. Multiplex pyro-signals were analyzed using the new AdvISER-MH-PYRO software based on a sparse representation of the signal. Using uniplex assays as gold standard, the concordance between multiplex and uniplex assays was assessed on DNA extracted from patient blood samples (n = 10). All genotypes (n=90) generated with the quadruplex and the quintuplex pyroquencing assays were perfectly (100 %) concordant with uniplex pyrosequencing. Using multiplex genotyping approach for analyzing a set of 90 patients allowed reducing TAT by approximately 75 % (i.e., from 2025 to 470 min) while reducing reagent consumption and cost by approximately 70 % (i.e., from ~229 US$ /patient to ~64 US$ /patient). This combination of quadruplex and quintuplex pyrosequencing and PCR assays enabled to reduce the amount of DNA required for multi-SNP analysis, and to lower the global TAT and costs of SNP genotyping while providing results as reliable as uniplex
Hahn Daniel A
Full Text Available Abstract Background Flesh flies in the genus Sarcophaga are important models for investigating endocrinology, diapause, cold hardiness, reproduction, and immunity. Despite the prominence of Sarcophaga flesh flies as models for insect physiology and biochemistry, and in forensic studies, little genomic or transcriptomic data are available for members of this genus. We used massively parallel pyrosequencing on the Roche 454-FLX platform to produce a substantial EST dataset for the flesh fly Sarcophaga crassipalpis. To maximize sequence diversity, we pooled RNA extracted from whole bodies of all life stages and normalized the cDNA pool after reverse transcription. Results We obtained 207,110 ESTs with an average read length of 241 bp. These reads assembled into 20,995 contigs and 31,056 singletons. Using BLAST searches of the NR and NT databases we were able to identify 11,757 unique gene elements (ES. crassipalpis unigenes among GO Biological Process functional groups with that of the Drosophila melanogaster transcriptome suggests that our ESTs are broadly representative of the flesh fly transcriptome. Insertion and deletion errors in 454 sequencing present a serious hurdle to comparative transcriptome analysis. Aided by a new approach to correcting for these errors, we performed a comparative analysis of genetic divergence across GO categories among S. crassipalpis, D. melanogaster, and Anopheles gambiae. The results suggest that non-synonymous substitutions occur at similar rates across categories, although genes related to response to stimuli may evolve slightly faster. In addition, we identified over 500 potential microsatellite loci and more than 12,000 SNPs among our ESTs. Conclusion Our data provides the first large-scale EST-project for flesh flies, a much-needed resource for exploring this model species. In addition, we identified a large number of potential microsatellite and SNP markers that could be used in population and systematic
Hindle, Matthew M; Martin, Sarah F; Noordally, Zeenat B; van Ooijen, Gerben; Barrios-Llerena, Martin E; Simpson, T Ian; Le Bihan, Thierry; Millar, Andrew J
The current knowledge of eukaryote signalling originates from phenotypically diverse organisms. There is a pressing need to identify conserved signalling components among eukaryotes, which will lead to the transfer of knowledge across kingdoms. Two useful properties of a eukaryote model for signalling are (1) reduced signalling complexity, and (2) conservation of signalling components. The alga Ostreococcus tauri is described as the smallest free-living eukaryote. With less than 8,000 genes, it represents a highly constrained genomic palette. Our survey revealed 133 protein kinases and 34 protein phosphatases (1.7% and 0.4% of the proteome). We conducted phosphoproteomic experiments and constructed domain structures and phylogenies for the catalytic protein-kinases. For each of the major kinases families we review the completeness and divergence of O. tauri representatives in comparison to the well-studied kinomes of the laboratory models Arabidopsis thaliana and Saccharomyces cerevisiae, and of Homo sapiens. Many kinase clades in O. tauri were reduced to a single member, in preference to the loss of family diversity, whereas TKL and ABC1 clades were expanded. We also identified kinases that have been lost in A. thaliana but retained in O. tauri. For three, contrasting eukaryotic pathways - TOR, MAPK, and the circadian clock - we established the subset of conserved components and demonstrate conserved sites of substrate phosphorylation and kinase motifs. We conclude that O. tauri satisfies our two central requirements. Several of its kinases are more closely related to H. sapiens orthologs than S. cerevisiae is to H. sapiens. The greatly reduced kinome of O. tauri is therefore a suitable model for signalling in free-living eukaryotes.
Margulis, L.; Hinkle, G.
Ongoing work to test the hypothesis of the origin of eukaryotic cell organelles by microbial symbioses is discussed. Because of the widespread acceptance of the serial endosymbiotic theory (SET) of the origin of plastids and mitochondria, the idea of the symbiotic origin of the centrioles and axonemes for spirochete bacteria motility symbiosis was tested. Intracellular microtubular systems are purported to derive from symbiotic associations between ancestral eukaryotic cells and motile bacteria. Four lines of approach to this problem are being pursued: (1) cloning the gene of a tubulin-like protein discovered in Spirocheata bajacaliforniesis; (2) seeking axoneme proteins in spirochets by antibody cross-reaction; (3) attempting to cultivate larger, free-living spirochetes; and (4) studying in detail spirochetes (e.g., Cristispira) symbiotic with marine animals. Other aspects of the investigation are presented.
Liu, J.; Bolstad, D; Bolstad, E; Wright, D; Anderson, A
Trimethoprim, an antifolate commonly prescribed in combination with sulfamethoxazole, potently inhibits several prokaryotic species of dihydrofolate reductase (DHFR). However, several eukaryotic pathogenic organisms are resistant to trimethoprim, preventing its effective use as a therapeutic for those infections. We have been building a program to reengineer trimethoprim to more potently and selectively inhibit eukaryotic species of DHFR as a viable strategy for new drug discovery targeting several opportunistic pathogens. We have developed a series of compounds that exhibit potent and selective inhibition of DHFR from the parasitic protozoa Cryptosporidium and Toxoplasma as well as the fungus Candida glabrata. A comparison of the structures of DHFR from the fungal species Candida glabrata and Pneumocystis suggests that the compounds may also potently inhibit Pneumocystis DHFR.
Dick, Claudia F; Dos-Santos, André L A; Meyer-Fernandes, José R
Inorganic phosphate (Pi) is an essential nutrient for all organisms. The route of Pi utilization begins with Pi transport across the plasma membrane. Here, we analyzed the gene sequences and compared the biochemical profiles, including kinetic and modulator parameters, of Pi transporters in unicellular eukaryotes. The objective of this review is to evaluate the recent findings regarding Pi uptake mechanisms in microorganisms, such as the fungi Neurospora crassa and Saccharomyces cerevisiae and the parasite protozoans Trypanosoma cruzi, Trypanosoma rangeli, Leishmania infantum and Plasmodium falciparum. Pi uptake is the key step of Pi homeostasis and in the subsequent signaling event in eukaryotic microorganisms. Biochemical and structural studies are important for clarifying mechanisms of Pi homeostasis, as well as Pi sensor and downstream pathways, and raise possibilities for future studies in this field. Copyright © 2014 Elsevier B.V. All rights reserved.
Full Text Available A fundamental stage in viral infection is the internalization of viral genomes in host cells. Although extensively studied, the mechanisms and factors responsible for the genome internalization process remain poorly understood. Here we report our observations, derived from diverse imaging methods on genome internalization of the large dsDNA Paramecium bursaria chlorella virus-1 (PBCV-1. Our studies reveal that early infection stages of this eukaryotic-infecting virus occurs by a bacteriophage-like pathway, whereby PBCV-1 generates a hole in the host cell wall and ejects its dsDNA genome in a linear, base-pair-by-base-pair process, through a membrane tunnel generated by the fusion of the virus internal membrane with the host membrane. Furthermore, our results imply that PBCV-1 DNA condensation that occurs shortly after infection probably plays a role in genome internalization, as hypothesized for the infection of some bacteriophages. The subsequent perforation of the host photosynthetic membranes presumably enables trafficking of viral genomes towards host nuclei. Previous studies established that at late infection stages PBCV-1 generates cytoplasmic organelles, termed viral factories, where viral assembly takes place, a feature characteristic of many large dsDNA viruses that infect eukaryotic organisms. PBCV-1 thus appears to combine a bacteriophage-like mechanism during early infection stages with a eukaryotic-like infection pathway in its late replication cycle.
Full Text Available The industrial production of fine chemicals, feed and food ingredients, pharmaceuticals, agrochemicals and their respective intermediates relies on an increasing application of biocatalysis, i.e. on enzyme or whole-cell catalyzed conversions of molecules. Simple procedures for discovery, cloning and over-expression as well as fast growth favour fungi, yeasts and especially bacteria as sources of biocatalysts. Higher eukaryotes also harbour an almost unlimited number of potential biocatalysts, although to date the limited supply of enzymes, the high heterogeneity of enzyme preparations and the hazard of infectious contaminants keep some interesting candidates out of reach for industrial bioprocesses. In the past only a few animal and plant enzymes from agricultural waste materials were employed in food processing. The use of bacterial expression strains or non-conventional yeasts for the heterologous production of efficient eukaryotic enzymes can overcome the bottleneck in enzyme supply and provide sufficient amounts of homogenous enzyme preparations for reliable and economically feasible applications at large scale. Ideal enzymatic processes represent an environmentally friendly, »near-to-completion« conversion of (mostly non-natural substrates to pure products. Recent developments demonstrate the commercial feasibility of large-scale biocatalytic processes employing enzymes from higher eukaryotes (e.g. plants, animals and also their usefulness in some small-scale industrial applications.
Full Text Available Arsenic and antimony are toxic metalloids, naturally present in the environment and all organisms have developed pathways for their detoxification. The most effective metalloid tolerance systems in eukaryotes include downregulation of metalloid uptake, efflux out of the cell, and complexation with phytochelatin or glutathione followed by sequestration into the vacuole. Understanding of arsenic and antimony transport system is of high importance due to the increasing usage of arsenic-based drugs in the treatment of certain types of cancer and diseases caused by protozoan parasites as well as for the development of bio- and phytoremediation strategies for metalloid polluted areas. However, in contrast to prokaryotes, the knowledge about specific transporters of arsenic and antimony and the mechanisms of metalloid transport in eukaryotes has been very limited for a long time. Here, we review the recent advances in understanding of arsenic and antimony transport pathways in eukaryotes, including a dual role of aquaglyceroporins in uptake and efflux of metalloids, elucidation of arsenic transport mechanism by the yeast Acr3 transporter and its role in arsenic hyperaccumulation in ferns, identification of vacuolar transporters of arsenic-phytochelatin complexes in plants and forms of arsenic substrates recognized by mammalian ABC transporters.
Maciaszczyk-Dziubinska, Ewa; Wawrzycka, Donata; Wysocki, Robert
Arsenic and antimony are toxic metalloids, naturally present in the environment and all organisms have developed pathways for their detoxification. The most effective metalloid tolerance systems in eukaryotes include downregulation of metalloid uptake, efflux out of the cell, and complexation with phytochelatin or glutathione followed by sequestration into the vacuole. Understanding of arsenic and antimony transport system is of high importance due to the increasing usage of arsenic-based drugs in the treatment of certain types of cancer and diseases caused by protozoan parasites as well as for the development of bio- and phytoremediation strategies for metalloid polluted areas. However, in contrast to prokaryotes, the knowledge about specific transporters of arsenic and antimony and the mechanisms of metalloid transport in eukaryotes has been very limited for a long time. Here, we review the recent advances in understanding of arsenic and antimony transport pathways in eukaryotes, including a dual role of aquaglyceroporins in uptake and efflux of metalloids, elucidation of arsenic transport mechanism by the yeast Acr3 transporter and its role in arsenic hyperaccumulation in ferns, identification of vacuolar transporters of arsenic-phytochelatin complexes in plants and forms of arsenic substrates recognized by mammalian ABC transporters. PMID:22489166
Zagrobelny, Mika; Scheibye-Alsing, Karsten; Jensen, Niels Bjerg; Møller, Birger Lindberg; Gorodkin, Jan; Bak, Søren
An essential driving component in the co-evolution of plants and insects is the ability to produce and handle bioactive compounds. Plants produce bioactive natural products for defense, but some insects detoxify and/or sequester the compounds, opening up for new niches with fewer competitors. To study the molecular mechanism behind the co-adaption in plant-insect interactions, we have investigated the interactions between Lotus corniculatus and Zygaena filipendulae. They both contain cyanogenic glucosides which liberate toxic hydrogen cyanide upon breakdown. Moths belonging to the Zygaena family are the only insects known, able to carry out both de novo biosynthesis and sequestration of the same cyanogenic glucosides as those from their feed plants. The biosynthetic pathway for cyanogenic glucoside biosynthesis in Z. filipendulae proceeds using the same intermediates as in the well known pathway from plants, but none of the enzymes responsible have been identified. A genomics strategy founded on 454 pyrosequencing of the Z. filipendulae transcriptome was undertaken to identify some of these enzymes in Z. filipendulae. Comparisons of the Z. filipendulae transcriptome with the sequenced genomes of Bombyx mori, Drosophila melanogaster, Tribolium castaneum, Apis mellifera and Anopheles gambiae indicate a high coverage of the Z. filipendulae transcriptome. 11% of the Z. filipendulae transcriptome sequences were assigned to Gene Ontology categories. Candidate genes for enzymes functioning in the biosynthesis of cyanogenic glucosides (cytochrome P450 and family 1 glycosyltransferases) were identified based on sequence length, number of copies and presence/absence of close homologs in D. melanogaster, B. mori and the cyanogenic butterfly Heliconius. Examination of biased codon usage, GC content and selection on gene candidates support the notion of cyanogenesis as an "old" trait within Ditrysia, as well as its origins being convergent between plants and insects
Full Text Available Abstract Background An essential driving component in the co-evolution of plants and insects is the ability to produce and handle bioactive compounds. Plants produce bioactive natural products for defense, but some insects detoxify and/or sequester the compounds, opening up for new niches with fewer competitors. To study the molecular mechanism behind the co-adaption in plant-insect interactions, we have investigated the interactions between Lotus corniculatus and Zygaena filipendulae. They both contain cyanogenic glucosides which liberate toxic hydrogen cyanide upon breakdown. Moths belonging to the Zygaena family are the only insects known, able to carry out both de novo biosynthesis and sequestration of the same cyanogenic glucosides as those from their feed plants. The biosynthetic pathway for cyanogenic glucoside biosynthesis in Z. filipendulae proceeds using the same intermediates as in the well known pathway from plants, but none of the enzymes responsible have been identified. A genomics strategy founded on 454 pyrosequencing of the Z. filipendulae transcriptome was undertaken to identify some of these enzymes in Z. filipendulae. Results Comparisons of the Z. filipendulae transcriptome with the sequenced genomes of Bombyx mori, Drosophila melanogaster, Tribolium castaneum, Apis mellifera and Anopheles gambiae indicate a high coverage of the Z. filipendulae transcriptome. 11% of the Z. filipendulae transcriptome sequences were assigned to Gene Ontology categories. Candidate genes for enzymes functioning in the biosynthesis of cyanogenic glucosides (cytochrome P450 and family 1 glycosyltransferases were identified based on sequence length, number of copies and presence/absence of close homologs in D. melanogaster, B. mori and the cyanogenic butterfly Heliconius. Examination of biased codon usage, GC content and selection on gene candidates support the notion of cyanogenesis as an "old" trait within Ditrysia, as well as its origins being
Welander, P. V.; Banta, A. B.; Lee, A. K.; Wei, J. H.
Lipid biomarkers are organic molecules preserved in sediments and sedimentary rocks that can function as geological proxies for certain microbial taxa or for specific environmental conditions. These molecular fossils provide a link between organisms and their environments in both modern and ancient settings and have afforded significant insight into ancient climatic events, mass extinctions, and various evolutionary transitions throughout Earth's history. However, the proper interpretation of lipid biomarkers is dependent on a broad understanding of their diagenetic precursors in modern systems. This includes understanding the taphonomic transformations that these molecules undergo, their biosynthetic pathways, and the ecological conditions that affect their cellular production. In this study, we focus on one group of lipid biomarkers - the sterols. These are polycyclic isoprenoidal lipids that have a high preservation potential and play a critical role in the physiology of most eukaryotes. However, the synthesis and function of these lipids in the bacterial domain has not been fully explored. Here we utilize a combination of bioinformatics, microbial genetics, and biochemistry to demonstrate that bacterial sterol producers are more prevalent in environmental metagenomic samples than in the genomic databases of cultured organisms and to identify novel proteins required to synthesize and modify sterols in bacteria. These proteins represent a distinct pathway for sterol synthesis exclusive to bacteria and indicate that sterol synthesis in bacteria may have evolved independently of eukaryotic sterol biosynthesis. Taken together, these results demonstrate how studies in extant bacteria can provide insight into the biological sources and the biosynthetic pathways of specific lipid biomarkers and in turn may allow for more robust interpretation of biomarker signatures.
Ahola, Virpi; Wahlberg, Niklas; Frilander, Mikko J.
The first lepidopteran genome (Bombyx mori) was published in 2004. Ten years later the genome of Melitaea cinxia came out as the third butterfly genome published, and the first eukaryotic genome sequenced in Finland. Owing to Ilkka Hanski, the M. cinxia system in the angstrom land Islands has become a famous model for metapopulation biology. More than 20 years of research on this system provides a strong ecological basis upon which a genetic framework could be built. Genetic knowledge is an e...
Barlow, Peter W; Baldelli, E; Baluška, Frantisek
The Energide concept, as well as the endosymbiotic theory of eukaryotic cell organization and evolution, proposes that present-day cells of eukaryotic organisms are mosaics of specialized and cooperating units, or organelles. Some of these units were originally free-living prokaryotes, which were engulfed during evolutionary time. Mitochondria represent one of these types of previously independent organisms, the Energide, is another type. This new perspective on the organization of the cell has been further expanded to reveal the concept of a public milieu, the cytosol, in which Energides and mitochondria live, each with their own private internal milieu. The present paper discusses how the endosymbiotic theory implicates a new hypothesis about the hierarchical and communicational organization of the integrated prokaryotic components of the eukaryotic cell and provides a new angle from which to consider the theory of evolution and its bearing upon cellular complexity. Thus, it is proposed that the “selfish gene” hypothesis of Dawkins1 is not the only possible perspective for comprehending genomic and cellular evolution. Our proposal is that maternal mitochondria are the selfish “master” entities of the eukaryotic cell with respect not only to their propagation from cell-to-cell and from generation-to-generation but also to their regulation of all other cellular functions. However, it should be recognized that the concept of “master” and “servant” cell components is a metaphor; in present-day living organisms their organellar components are considered to be interdependent and inseparable. PMID:19513277
Prigigallo, Maria I; Abdelfattah, Ahmed; Cacciola, Santa O; Faedda, Roberto; Sanzani, Simona M; Cooke, David E L; Schena, L
A metabarcoding method based on genus-specific primers and 454 pyrosequencing was utilized to investigate the genetic diversity of Phytophthora spp. in soil and root samples of potted plants, from eight nurseries. Pyrosequencing enabled the detection of 25 Phytophthora phylotypes distributed in seven different clades and provided a much higher resolution than a corresponding cloning/Sanger sequencing approach. Eleven of these phylotypes, including P. cactorum, P. citricola s.str., P. palmivora, P. palmivora-like, P. megasperma or P. gonapodyides, P. ramorum, and five putative new Phytophthora species phylogenetically related to clades 1, 2, 4, 6, and 7 were detected only with the 454 pyrosequencing approach. We also found an additional 18 novel records of a phylotype in a particular nursery that were not detected with cloning/Sanger sequencing. Several aspects confirmed the reliability of the method: (i) many identical sequence types were identified independently in different nurseries, (ii) most sequence types identified with 454 pyrosequencing were identical to those from the cloning/Sanger sequencing approach and/or perfectly matched GenBank deposited sequences, and (iii) the divergence noted between sequence types of putative new Phytophthora species and all other detected sequences was sufficient to rule out sequencing errors. The proposed method represents a powerful tool to study Phytophthora diversity providing that particular attention is paid to the analysis of 454 pyrosequencing raw read sequences and to the identification of sequence types.
Bravo, Lulette Tricia C.; Tuohy, Marion J.; Ang, Concepcion; Destura, Raul V.; Mendoza, Myrna; Procop, Gary W.; Gordon, Steven M.; Hall, Geraldine S.; Shrestha, Nabin K.
After isoniazid and rifampin (rifampicin), the next pivotal drug class in Mycobacterium tuberculosis treatment is the fluoroquinolone class. Mutations in resistance-determining regions (RDR) of the rpoB, katG, and gyrA genes occur with frequencies of 97%, 50%, and 85% among M. tuberculosis isolates resistant to rifampin, isoniazid, and fluoroquinolones, respectively. Sequences are highly conserved, and certain mutations correlate well with phenotypic resistance. We developed a pyrosequencing assay to determine M. tuberculosis genotypic resistance to rifampin, isoniazid, and fluoroquinolones. We characterized 102 M. tuberculosis clinical isolates from the Philippines for susceptibility to rifampin, isoniazid, and ofloxacin by using the conventional submerged-disk proportion method and validated our pyrosequencing assay using these isolates. DNA was extracted and amplified by using PCR primers directed toward the RDR of the rpoB, katG, and gyrA genes, and pyrosequencing was performed on the extracts. The M. tuberculosis H37Rv strain (ATCC 25618) was used as the reference strain. The sensitivities and specificities of pyrosequencing were 96.7% and 97.3%, 63.8% and 100%, and 70.0% and 100% for the detection of resistance to rifampin, isoniazid, and ofloxacin, respectively. Pyrosequencing is thus a rapid and accurate method for detecting M. tuberculosis resistance to these three drugs. PMID:19846642
Permutation Multivariate Analysis of Variance ( PerMANOVA ). We used PerMANOVA to test the null-hypothesis of no... permutation -based version of the multivariate analysis of variance (MANOVA). PerMANOVA uses the distances between samples to partition variance and...coli. Antibiotics, bacteria, community analysis , diabetes, pyrosequencing, wound, wound therapy, 16S rRNA gene Genomic Analysis of Complex
Full Text Available Microorganisms represent the largest component of biodiversity in our world. For millions of years, prokaryotic microorganisms have functioned as a major selective force shaping eukaryotic evolution. Microbes that live inside and on animals outnumber the animals’ actual somatic and germ cells by an estimated 10-fold. Collectively, the intestinal microbiome represents a ‘forgotten organ’, functioning as an organ inside another that can execute many physiological responsibilities. The nature of primitive eukaryotes was drastically changed due to the association with symbiotic prokaryotes facilitating mutual coevolution of host and microbe. Phytophagous insects have long been used to test theories of evolutionary diversification; moreover, the diversification of a number of phytophagous insect lineages has been linked to mutualisms with microbes. From termites and honey bees to ruminants and mammals, depending on novel biochemistries provided by the prokaryotic microbiome, the association helps to metabolize several nutrients that the host cannot digest and converting these into useful end products (such as short chain fatty acids, a process which has huge impact on the biology and homeostasis of metazoans. More importantly, in a direct and/or indirect way, the intestinal microbiota influences the assembly of gut-associated lymphoid tissue, helps to educate immune system, affects the integrity of the intestinal mucosal barrier, modulates proliferation and differentiation of its epithelial lineages, regulates angiogenesis, and modifies the activity of enteric as well as the central nervous system,. Despite these important effects, the mechanisms by which the gut microbial community influences the host’s biology remains almost entirely unknown. Our aim here is to encourage empirical inquiry into the relationship between mutualism and evolutionary diversification between prokaryotes and eukaryotes which encourage us to postulate: Who is
Jarmuszkiewicz, Wieslawa; Woyda-Ploszczyca, Andrzej; Antos-Krzeminska, Nina; Sluse, Francis E
Uncoupling proteins (UCPs) are members of the mitochondrial anion carrier protein family that are present in the mitochondrial inner membrane and mediate free fatty acid (FFA)-activated, purine nucleotide (PN)-inhibited proton conductance. Since 1999, the presence of UCPs has been demonstrated in some non-photosynthesising unicellular eukaryotes, including amoeboid and parasite protists, as well as in non-fermentative yeast and filamentous fungi. In the mitochondria of these organisms, UCP activity is revealed upon FFA-induced, PN-inhibited stimulation of resting respiration and a decrease in membrane potential, which are accompanied by a decrease in membranous ubiquinone (Q) reduction level. UCPs in unicellular eukaryotes are able to divert energy from oxidative phosphorylation and thus compete for a proton electrochemical gradient with ATP synthase. Our recent work indicates that membranous Q is a metabolic sensor that might utilise its redox state to release the PN inhibition of UCP-mediated mitochondrial uncoupling under conditions of phosphorylation and resting respiration. The action of reduced Q (QH2) could allow higher or complete activation of UCP. As this regulatory feature was demonstrated for microorganism UCPs (A. castellanii UCP), plant and mammalian UCP1 analogues, and UCP1 in brown adipose tissue, the process could involve all UCPs. Here, we discuss the functional connection and physiological role of UCP and alternative oxidase, two main energy-dissipating systems in the plant-type mitochondrial respiratory chain of unicellular eukaryotes, including the control of cellular energy balance as well as preventive action against the production of reactive oxygen species. Copyright © 2009 Elsevier B.V. All rights reserved.
Johnston, Iain G; Williams, Ben P
Since their endosymbiotic origin, mitochondria have lost most of their genes. Although many selective mechanisms underlying the evolution of mitochondrial genomes have been proposed, a data-driven exploration of these hypotheses is lacking, and a quantitatively supported consensus remains absent. We developed HyperTraPS, a methodology coupling stochastic modeling with Bayesian inference, to identify the ordering of evolutionary events and suggest their causes. Using 2015 complete mitochondrial genomes, we inferred evolutionary trajectories of mtDNA gene loss across the eukaryotic tree of life. We find that proteins comprising the structural cores of the electron transport chain are preferentially encoded within mitochondrial genomes across eukaryotes. A combination of high GC content and high protein hydrophobicity is required to explain patterns of mtDNA gene retention; a model that accounts for these selective pressures can also predict the success of artificial gene transfer experiments in vivo. This work provides a general method for data-driven inference of the ordering of evolutionary and progressive events, here identifying the distinct features shaping mitochondrial genomes of present-day species. Copyright © 2016 Elsevier Inc. All rights reserved.
Procopio, Luciano; Alvarez, Vanessa M.; Jurelevicius, Diogo A.; Hansen, Lars; Sorensen, Soren J.; Cardoso, Janine S.; Padula, Marcelo; Leitao, Alvaro C.; Seldin, Lucy; van Elsas, Jan Dirk
The draft genome of Dietzia cinnamea strain P4 was determined using pyrosequencing. In total, 428 supercontigs were obtained and analyzed. We here describe and interpret the main features of the draft genome. The genome contained a total of 3,555,295 bp, arranged in a single replicon with an average
Grimwood, Jane; Schmutz, Jeremy J.; Myers, Richard M.
Since the completion of the sequencing of the human genome, the Joint Genome Institute (JGI) has rapidly expanded its scientific goals in several DOE mission-relevant areas. At the JGI-HAGSC, we have kept pace with this rapid expansion of projects with our focus on assessing, assembling, improving and finishing eukaryotic whole genome shotgun (WGS) projects for which the shotgun sequence is generated at the Production Genomic Facility (JGI-PGF). We follow this by combining the draft WGS with genomic resources generated at JGI-HAGSC or in collaborator laboratories (including BAC end sequences, genetic maps and FLcDNA sequences) to produce an improved draft sequence. For eukaryotic genomes important to the DOE mission, we then add further information from directed experiments to produce reference genomic sequences that are publicly available for any scientific researcher. Also, we have continued our program for producing BAC-based finished sequence, both for adding information to JGI genome projects and for small BAC-based sequencing projects proposed through any of the JGI sequencing programs. We have now built our computational expertise in WGS assembly and analysis and have moved eukaryotic genome assembly from the JGI-PGF to JGI-HAGSC. We have concentrated our assembly development work on large plant genomes and complex fungal and algal genomes.
Christaki, Urania; Kormas, Konstantinos A; Genitsaris, Savvas; Georges, Clément; Sime-Ngando, Télesphore; Viscogliosi, Eric; Monchy, Sébastien
The objective of this study was to explore the succession of planktonic unicellular eukaryotes by means of 18S rRNA gene tag pyrosequencing in the eastern English Channel (EEC) during the winter to summer transition. The 59 most representative (>0.1%, representing altogether 95% of total reads), unique operational taxonomic units (OTUs) from all samples belonged to 18 known high-level taxonomic groups and 1 unaffiliated clade. The five most abundant OTUs (69.2% of total reads) belonged to Dinophyceae, Cercozoa, Haptophyceae, marine alveolate group I, and Fungi. Cluster and network analysis between samples distinguished the winter, the pre-bloom, the Phaeocystis globosa bloom and the post-bloom early summer conditions. The OTUs-based network revealed that P. globosa showed a relatively low number of connections-most of them negative-with all other OTUs. Fungi were linked to all major taxonomic groups, except Dinophyceae. Cercozoa mostly co-occurred with the Fungi, the Bacillariophyceae and several of the miscellaneous OTUs. This study provided a more detailed exploration into the planktonic succession pattern of the EEC due to its increased depth of taxonomic sampling over previous efforts based on classical monitoring observations. Data analysis implied that the food web concept in a coastal system based on predator-prey (e.g. grazer-phytoplankton) relationships is just a part of the ecological picture; and those organisms exploiting a variety of strategies, such as saprotrophy and parasitism, are persistent and abundant members of the community.
Ben-David, Smadar; Yaakov, Beery; Kashkush, Khalil
Short interspersed nuclear elements (SINEs) are non-autonomous non-LTR retroelements that are present in most eukaryotic species. While SINEs have been intensively investigated in humans and other animal systems, they are poorly studied in plants, especially in wheat (Triticum aestivum). We used quantitative PCR of various wheat species to determine the copy number of a wheat SINE family, termed Au SINE, combined with computer-assisted analyses of the publicly available 454 pyrosequencing database of T. aestivum. In addition, we utilized site-specific PCR on 57 Au SINE insertions, transposon methylation display and transposon display on newly formed wheat polyploids to assess retrotranspositional activity, epigenetic status and genetic rearrangements in Au SINE, respectively. We retrieved 3706 different insertions of Au SINE from the 454 pyrosequencing database of T. aestivum, and found that most of the elements are inserted in A/T-rich regions, while approximately 38% of the insertions are associated with transcribed regions, including known wheat genes. We observed typical retrotransposition of Au SINE in the second generation of a newly formed wheat allohexaploid, and massive hypermethylation in CCGG sites surrounding Au SINE in the third generation. Finally, we observed huge differences in the copy numbers in diploid Triticum and Aegilops species, and a significant increase in the copy numbers in natural wheat polyploids, but no significant increase in the copy number of Au SINE in the first four generations for two of three newly formed allopolyploid species used in this study. Our data indicate that SINEs may play a prominent role in the genomic evolution of wheat through stress-induced activation. © 2013 Ben-Gurion University The Plant Journal © 2013 John Wiley & Sons Ltd.
Koonin, Eugene V; Wolf, Yuri I; Nagasaki, Keizo; Dolja, Valerian V
The recent discovery of RNA viruses in diverse unicellular eukaryotes and developments in evolutionary genomics have provided the means for addressing the origin of eukaryotic RNA viruses. The phylogenetic analyses of RNA polymerases and helicases presented in this Analysis article reveal close evolutionary relationships between RNA viruses infecting hosts from the Chromalveolate and Excavate supergroups and distinct families of picorna-like viruses of plants and animals. Thus, diversification of picorna-like viruses probably occurred in a 'Big Bang' concomitant with key events of eukaryogenesis. The origins of the conserved genes of picorna-like viruses are traced to likely ancestors including bacterial group II retroelements, the family of HtrA proteases and DNA bacteriophages.
Full Text Available BACKGROUND: The objective of this work was to investigate the hypothesis that eukaryotic Internal Ribosome Entry Sites (IRES lack secondary structure and to examine the generality of the hypothesis. METHODOLOGY/PRINCIPAL FINDINGS: IRESs of the yeast and the fruit fly are located in the 5'UTR immediately upstream of the initiation codon. The minimum folding energy (MFE of 60 nt RNA segments immediately upstream of the initiation codons was calculated as a proxy of secondary structure stability. MFE of the reverse complements of these 60 nt segments was also calculated. The relationship between MFE and empirically determined IRES activity was investigated to test the hypothesis that strong IRES activity is associated with weak secondary structure. We show that IRES activity in the yeast and the fruit fly correlates strongly with the structural stability, with highest IRES activity found in RNA segments that exhibit the weakest secondary structure. CONCLUSIONS: We found that a subset of eukaryotic IRESs exhibits very low secondary structure in the 5'-UTR sequences immediately upstream of the initiation codon. The consistency in results between the yeast and the fruit fly suggests a possible shared mechanism of cap-independent translation initiation that relies on an unstructured RNA segment.
O?Brien, Emmet A.; Zhang, Yue; Wang, Eric; Marie, Veronique; Badejoko, Wole; Lang, B. Franz; Burger, Gertraud
The organelle genome database GOBASE, now in its 21st release (June 2008), contains all published mitochondrion-encoded sequences (?913 000) and chloroplast-encoded sequences (?250 000) from a wide range of eukaryotic taxa. For all sequences, information on related genes, exons, introns, gene products and taxonomy is available, as well as selected genome maps and RNA secondary structures. Recent major enhancements to database functionality include: (i) addition of an interface for RNA editing...
Bauer, Nicholas C; Corbett, Anita H; Doetsch, Paul W
DNA damage is a natural hazard of life. The most common DNA lesions are base, sugar, and single-strand break damage resulting from oxidation, alkylation, deamination, and spontaneous hydrolysis. If left unrepaired, such lesions can become fixed in the genome as permanent mutations. Thus, evolution has led to the creation of several highly conserved, partially redundant pathways to repair or mitigate the effects of DNA base damage. The biochemical mechanisms of these pathways have been well characterized and the impact of this work was recently highlighted by the selection of Tomas Lindahl, Aziz Sancar and Paul Modrich as the recipients of the 2015 Nobel Prize in Chemistry for their seminal work in defining DNA repair pathways. However, how these repair pathways are regulated and interconnected is still being elucidated. This review focuses on the classical base excision repair and strand incision pathways in eukaryotes, considering both Saccharomyces cerevisiae and humans, and extends to some important questions and challenges facing the field of DNA base damage repair. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Full Text Available Progress in uncovering the protein interaction networks of several species has led to questions of what underlying principles might govern their organization. Few studies have tried to determine the impact of protein interaction network evolution on the observed physiological differences between species. Using comparative genomics and structural information, we show here that eukaryotic species have rewired their interactomes at a fast rate of approximately 10(-5 interactions changed per protein pair, per million years of divergence. For Homo sapiens this corresponds to 10(3 interactions changed per million years. Additionally we find that the specificity of binding strongly determines the interaction turnover and that different biological processes show significantly different link dynamics. In particular, human proteins involved in immune response, transport, and establishment of localization show signs of positive selection for change of interactions. Our analysis suggests that a small degree of molecular divergence can give rise to important changes at the network level. We propose that the power law distribution observed in protein interaction networks could be partly explained by the cell's requirement for different degrees of protein binding specificity.
Full Text Available Abstract Background The ectoparasitic mite Varroa destructor has emerged as the primary pest of domestic honey bees (Apis mellifera. Here we present an initial survey of the V. destructor genome carried out to advance our understanding of Varroa biology and to identify new avenues for mite control. This sequence survey provides immediate resources for molecular and population-genetic analyses of Varroa-Apis interactions and defines the challenges ahead for a comprehensive Varroa genome project. Results The genome size was estimated by flow cytometry to be 565 Mbp, larger than most sequenced insects but modest relative to some other Acari. Genomic DNA pooled from ~1,000 mites was sequenced to 4.3× coverage with 454 pyrosequencing. The 2.4 Gbp of sequencing reads were assembled into 184,094 contigs with an N50 of 2,262 bp, totaling 294 Mbp of sequence after filtering. Genic sequences with homology to other eukaryotic genomes were identified on 13,031 of these contigs, totaling 31.3 Mbp. Alignment of protein sequence blocks conserved among V. destructor and four other arthropod genomes indicated a higher level of sequence divergence within this mite lineage relative to the tick Ixodes scapularis. A number of microbes potentially associated with V. destructor were identified in the sequence survey, including ~300 Kbp of sequence deriving from one or more bacterial species of the Actinomycetales. The presence of this bacterium was confirmed in individual mites by PCR assay, but varied significantly by age and sex of mites. Fragments of a novel virus related to the Baculoviridae were also identified in the survey. The rate of single nucleotide polymorphisms (SNPs in the pooled mites was estimated to be 6.2 × 10-5per bp, a low rate consistent with the historical demography and life history of the species. Conclusions This survey has provided general tools for the research community and novel directions for investigating the biology and control of
Heyer, Wolf-Dietrich; Ehmsen, Kirk T.; Liu, Jie
Homologous recombination is required for accurate chromosome segregation during the first meiotic division and constitutes a key repair and tolerance pathway for complex DNA damage including DNA double-stranded breaks, interstrand crosslinks, and DNA gaps. In addition, recombination and replication are inextricably linked, as recombination recovers stalled and broken replication forks enabling the evolution of larger genomes/replicons. Defects in recombination lead to genomic instability and ...
Greshake, Bastian; Zehr, Simonida; Dal Grande, Francesco; Meiser, Anjuli; Schmitt, Imke; Ebersberger, Ingo
Whole-genome shotgun sequencing of multispecies communities using only a single library layout is commonly used to assess taxonomic and functional diversity of microbial assemblages. Here, we investigate to what extent such metagenome skimming approaches are applicable for in-depth genomic characterizations of eukaryotic communities, for example lichens. We address how to best assemble a particular eukaryotic metagenome skimming data, what pitfalls can occur, and what genome quality can be expected from these data. To facilitate a project-specific benchmarking, we introduce the concept of twin sets, simulated data resembling the outcome of a particular metagenome sequencing study. We show that the quality of genome reconstructions depends essentially on assembler choice. Individual tools, including the metagenome assemblers Omega and MetaVelvet, are surprisingly sensitive to low and uneven coverages. In combination with the routine of assembly parameter choice to optimize the assembly N50 size, these tools can preclude an entire genome from the assembly. In contrast, MIRA, an all-purpose overlap assembler, and SPAdes, a multisized de Bruijn graph assembler, facilitate a comprehensive view on the individual genomes across a wide range of coverage ratios. Testing assemblers on a real-world metagenome skimming data from the lichen Lasallia pustulata demonstrates the applicability of twin sets for guiding method selection. Furthermore, it reveals that the assembly outcome for the photobiont Trebouxia sp. falls behind the a priori expectation given the simulations. Although the underlying reasons remain still unclear, this highlights that further studies on this organism require special attention during sequence data generation and downstream analysis. © 2015 John Wiley & Sons Ltd.
Full Text Available Abstract Background A long term research goal of venomics, of applied importance for improving current antivenom therapy, but also for drug discovery, is to understand the pharmacological potential of venoms. Individually or combined, proteomic and transcriptomic studies have demonstrated their feasibility to explore in depth the molecular diversity of venoms. In the absence of genome sequence, transcriptomes represent also valuable searchable databases for proteomic projects. Results The venom gland transcriptomes of 8 Costa Rican taxa from 5 genera (Crotalus, Bothrops, Atropoides, Cerrophidion, and Bothriechis of pitvipers were investigated using high-throughput 454 pyrosequencing. 100,394 out of 330,010 masked reads produced significant hits in the available databases. 5.165,220 nucleotides (8.27% were masked by RepeatMasker, the vast majority of which corresponding to class I (retroelements and class II (DNA transposons mobile elements. BLAST hits included 79,991 matches to entries of the taxonomic suborder Serpentes, of which 62,433 displayed similarity to documented venom proteins. Strong discrepancies between the transcriptome-computed and the proteome-gathered toxin compositions were obvious at first sight. Although the reasons underlaying this discrepancy are elusive, since no clear trend within or between species is apparent, the data indicate that individual mRNA species may be translationally controlled in a species-dependent manner. The minimum number of genes from each toxin family transcribed into the venom gland transcriptome of each species was calculated from multiple alignments of reads matched to a full-length reference sequence of each toxin family. Reads encoding ORF regions of Kazal-type inhibitor-like proteins were uniquely found in Bothriechis schlegelii and B. lateralis transcriptomes, suggesting a genus-specific recruitment event during the early-Middle Miocene. A transcriptome-based cladogram supports the large
Gupta, Akshari; Kitagawa, Daiju
Several decades of centriole research have revealed the beautiful symmetry present in these microtubule-based organelles, which are required to form centrosomes, cilia, and flagella in many eukaryotes. Centriole architecture is largely conserved across most organisms, however, individual centriolar features such as the central cartwheel or microtubule walls exhibit considerable variability when examined with finer resolution. Here, we review the ultrastructural characteristics of centrioles in commonly studied organisms, highlighting the subtle and not-so-subtle differences between specific structural components of these centrioles. Additionally, we survey some non-canonical centriole structures that have been discovered in various species, from the coaxial bicentrioles of protists and lower land plants to the giant irregular centrioles of the fungus gnat Sciara. Finally, we speculate on the functional significance of these differences between centrioles, and the contribution of individual structural elements such as the cartwheel or microtubules towards the stability of centrioles.Centriole structure, cartwheel, triplet microtubules, SAS-6, centrosome.
Starokadomskyy P. L.
Full Text Available Inteins, or protein introns, are parts of protein sequences that are post-translationally excised, their flanking regions (exteins being spliced together. This process was called protein splicing. Originally inteins were found in prokaryotic or unicellular eukaryotic organisms. But the general principles of post-translation protein rearrangement are evolving yielding different post-translation modification of proteins in multicellular organisms. For clarity, these non-intein mediated events call either protein rearrangements or protein editing. The most intriguing example of protein editing is proteasome-mediated splicing of antigens in vertebrates that may play important role in antigen presentation. Other examples of protein rearrangements are maturation of Hg-proteins (critical receptors in embryogenesis as well as maturation of several metabolic enzymes. Despite a lack of experimental data we try to analyze some intriguing examples of protein splicing evolution.
López-Mirabal, H Reynaldo; Winther, Jakob R
The eukaryotic cytoplasm has long been regarded as a cellular compartment in which the reduced state of protein cysteines is largely favored. Under normal conditions, the cytosolic low-molecular weight redox buffer, comprising primarily of glutathione, is highly reducing and reactive oxygen species...... (ROS) and glutathionylated proteins are maintained at very low levels. In the present review, recent progress in the understanding of the cytosolic thiol-disulfide redox metabolism and novel analytical approaches to studying cytosolic redox properties are discussed. We will focus on the yeast model...... organism, Saccharomyces cerevisiae, where the combination of genetic and biochemical approaches has brought us furthest in understanding the mechanisms underlying cellular redox regulation. It has been shown in yeast that, in addition to the enzyme glutathione reductase, other mechanisms may exist...
Derelle, R.; Torruella, G.; Klimeš, V.; Brinkmann, H.; Kim, E.; Vlček, Čestmír; Lang, B.F.; Eliáš, M.
Roč. 112, č. 7 (2015), E693-E699 ISSN 0027-8424 R&D Projects: GA ČR GA13-24983S Grant - others:GA MŠk(CZ) ED2.1.00/03.0100; Howard Hughes Medical Institute International Early Career Scientist Program(US) 55007424; Spanish Ministry of Economy and Competitiveness, European Molecular Biology Organization Young Investigator Program(ES) BFU2012-31329; Spanish Ministry of Economy and Competitiveness, "Centro de Excelencia Severo Ochoa" - European Regional Development Fund(ES) Sev-2012-0208, BES-2013-064004 Institutional support: RVO:68378050 Keywords : eukaryote phylogeny * phylogenomics * Opimoda * Diphoda * LECA Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 9.423, year: 2015
Chai, Zhao Yang; He, Zhi Li; Deng, Yun Yan; Yang, Yu Feng; Tang, Ying Zhong
Plankton diversity reflects the quality and health of waters and should be monitored as a critical feature of marine ecosystems. This study applied a pair of 28S rRNA gene-specific primers and pyrosequencing to assess the effects of large-scale cultivation of the seaweed Gracilaria lemaneiformis on the biodiversity of eukaryotic plankton community in the coastal water of Guangdong, China. With 1 million sequences (2,221 operational taxonomic units [OTUs]) obtained from 51 samples, we found that the biodiversity of eukaryotic plankton community was significantly higher in the seaweed cultivation area than that in the nearby control area as reflected in OTU richness, evenness (Shannon-Wiener index) and dominance (Simpson index) for total plankton community and its four subcategories when Gracilaria biomass reached the maximum, while no such a significant difference was observed before seaweed inoculation. Our laboratory experiment using an artificial phytoplankton community of nine species observed the same effects of Gracilaria exposure. Principal component analysis and principal coordinates analysis showed the plankton community structure in cultivation area markedly differed from the control area when Gracilaria biomass reached its maximum. Redundancy analysis showed that G. lemaneiformis was the critical factor in controlling the dynamics of eukaryotic plankton communities in the studied coastal ecosystem. Our results explicitly demonstrated G. lemaneiformis cultivation could enhance biodiversity of plankton community via allelopathy, which prevents one or several plankton species from blooming and consequently maintains a relatively higher biodiversity. Our study provided further support for using large-scale G. lemaneiformis cultivation as an effective approach for improving costal ecosystem health. © 2018 John Wiley & Sons Ltd.
Vannini, Andrea; Bruni, Natalia; Tomassini, Alessia; Franceschini, Selma; Vettraino, Anna Maria
Pyrosequencing analysis was performed on soils from Italian chestnut groves to evaluate the diversity of the resident Phytophthora community. Sequences analysed with a custom database discriminated 15 pathogenic Phytophthoras including species common to chestnut soils, while a total of nine species were detected with baiting. The two sites studied differed in Phytophthora diversity and the presence of specific taxa responded to specific ecological traits of the sites. Furthermore, some species not previously recorded were represented by a discrete number of reads; among these species, Phytophthora ramorum was detected at both sites. Pyrosequencing was demonstrated to be a very sensitive technique to describe the Phytophthora community in soil and was able to detect species not easy to be isolated from soil with standard baiting techniques. In particular, pyrosequencing is an highly efficient tool for investigating the colonization of new environments by alien species, and for ecological and adaptive studies coupled with biological detection methods. This study represents the first application of pyrosequencing for describing Phytophthoras in environmental soil samples. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Harrison, C.; Musgrave-Brown, E.; Bender, K.
Analysing minute amounts of DNA is a routine challenge in forensics in part due to the poor sensitivity of an instrument and its inability to detect results from forensic samples. In this study, the sensitivity of the Pyrosequencing method is investigated using varying concentrations of DNA and f...
Piskur, Jure; Langkjær, Rikke Breinhold
For decades, unicellular yeasts have been general models to help understand the eukaryotic cell and also our own biology. Recently, over a dozen yeast genomes have been sequenced, providing the basis to resolve several complex biological questions. Analysis of the novel sequence data has shown...... of closely related species helps in gene annotation and to answer how many genes there really are within the genomes. Analysis of non-coding regions among closely related species has provided an example of how to determine novel gene regulatory sequences, which were previously difficult to analyse because...... they are short and degenerate and occupy different positions. Comparative genomics helps to understand the origin of yeasts and points out crucial molecular events in yeast evolutionary history, such as whole-genome duplication and horizontal gene transfer(s). In addition, the accumulating sequence data provide...
Full Text Available BACKGROUND: Because many picoplanktonic eukaryotic species cannot currently be maintained in culture, direct sequencing of PCR-amplified 18S ribosomal gene DNA fragments from filtered sea-water has been successfully used to investigate the astounding diversity of these organisms. The recognition of many novel planktonic organisms is thus based solely on their 18S rDNA sequence. However, a species delimited by its 18S rDNA sequence might contain many cryptic species, which are highly differentiated in their protein coding sequences. PRINCIPAL FINDINGS: Here, we investigate the issue of species identification from one gene to the whole genome sequence. Using 52 whole genome DNA sequences, we estimated the global genetic divergence in protein coding genes between organisms from different lineages and compared this to their ribosomal gene sequence divergences. We show that this relationship between proteome divergence and 18S divergence is lineage dependent. Unicellular lineages have especially low 18S divergences relative to their protein sequence divergences, suggesting that 18S ribosomal genes are too conservative to assess planktonic eukaryotic diversity. We provide an explanation for this lineage dependency, which suggests that most species with large effective population sizes will show far less divergence in 18S than protein coding sequences. CONCLUSIONS: There is therefore a trade-off between using genes that are easy to amplify in all species, but which by their nature are highly conserved and underestimate the true number of species, and using genes that give a better description of the number of species, but which are more difficult to amplify. We have shown that this trade-off differs between unicellular and multicellular organisms as a likely consequence of differences in effective population sizes. We anticipate that biodiversity of microbial eukaryotic species is underestimated and that numerous "cryptic species" will become
Full Text Available Abstract Background The ferlin gene family possesses a rare and identifying feature consisting of multiple tandem C2 domains and a C-terminal transmembrane domain. Much currently remains unknown about the fundamental function of this gene family, however, mutations in its two most well-characterised members, dysferlin and otoferlin, have been implicated in human disease. The availability of genome sequences from a wide range of species makes it possible to explore the evolution of the ferlin family, providing contextual insight into characteristic features that define the ferlin gene family in its present form in humans. Results Ferlin genes were detected from all species of representative phyla, with two ferlin subgroups partitioned within the ferlin phylogenetic tree based on the presence or absence of a DysF domain. Invertebrates generally possessed two ferlin genes (one with DysF and one without, with six ferlin genes in most vertebrates (three DysF, three non-DysF. Expansion of the ferlin gene family is evident between the divergence of lamprey (jawless vertebrates and shark (cartilaginous fish. Common to almost all ferlins is an N-terminal C2-FerI-C2 sandwich, a FerB motif, and two C-terminal C2 domains (C2E and C2F adjacent to the transmembrane domain. Preservation of these structural elements throughout eukaryotic evolution suggests a fundamental role of these motifs for ferlin function. In contrast, DysF, C2DE, and FerA are optional, giving rise to subtle differences in domain topologies of ferlin genes. Despite conservation of multiple C2 domains in all ferlins, the C-terminal C2 domains (C2E and C2F displayed higher sequence conservation and greater conservation of putative calcium binding residues across paralogs and orthologs. Interestingly, the two most studied non-mammalian ferlins (Fer-1 and Misfire in model organisms C. elegans and D. melanogaster, present as outgroups in the phylogenetic analysis, with results suggesting
Simon, Martin; Plattner, Helmut
Unicellular eukaryotes have been appreciated as model systems for the analysis of crucial questions in cell and molecular biology. This includes Dictyostelium (chemotaxis, amoeboid movement, phagocytosis), Tetrahymena (telomere structure, telomerase function), Paramecium (variant surface antigens, exocytosis, phagocytosis cycle) or both ciliates (ciliary beat regulation, surface pattern formation), Chlamydomonas (flagellar biogenesis and beat), and yeast (S. cerevisiae) for innumerable aspects. Nowadays many problems may be tackled with "higher" eukaryotic/metazoan cells for which full genomic information as well as domain databases, etc., were available long before protozoa. Established molecular tools, commercial antibodies, and established pharmacology are additional advantages available for higher eukaryotic cells. Moreover, an increasing number of inherited genetic disturbances in humans have become elucidated and can serve as new models. Among lower eukaryotes, yeast will remain a standard model because of its peculiarities, including its reduced genome and availability in the haploid form. But do protists still have a future as models? This touches not only the basic understanding of biology but also practical aspects of research, such as fund raising. As we try to scrutinize, due to specific advantages some protozoa should and will remain favorable models for analyzing novel genes or specific aspects of cell structure and function. Outstanding examples are epigenetic phenomena-a field of rising interest. © 2014 Elsevier Inc. All rights reserved.
Full Text Available The banana weevil Cosmopolites sordidus is an important and serious insect pest in most banana and plantain-growing areas of the world. In spite of the economic importance of this insect pest very little genomic and transcriptomic information exists for this species. In the present study, we characterized the midgut transcriptome of C. sordidus using massive 454-pyrosequencing. We generated over 590,000 sequencing reads that assembled into 30,840 contigs with more than 400 bp, representing a significant expansion of existing sequences available for this insect pest. Among them, 16,427 contigs contained one or more GO terms. In addition, 15,263 contigs were assigned an EC number. In-depth transcriptome analysis identified genes potentially involved in insecticide resistance, peritrophic membrane biosynthesis, immunity-related function and defense against pathogens, and Bacillus thuringiensis toxins binding proteins as well as multiple enzymes involved with protein digestion. This transcriptome will provide a valuable resource for understanding larval physiology and for identifying novel target sites and management approaches for this important insect pest.
Valencia, Arnubio; Wang, Haichuan; Soto, Alberto; Aristizabal, Manuel; Arboleda, Jorge W; Eyun, Seong-Il; Noriega, Daniel D; Siegfried, Blair
The banana weevil Cosmopolites sordidus is an important and serious insect pest in most banana and plantain-growing areas of the world. In spite of the economic importance of this insect pest very little genomic and transcriptomic information exists for this species. In the present study, we characterized the midgut transcriptome of C. sordidus using massive 454-pyrosequencing. We generated over 590,000 sequencing reads that assembled into 30,840 contigs with more than 400 bp, representing a significant expansion of existing sequences available for this insect pest. Among them, 16,427 contigs contained one or more GO terms. In addition, 15,263 contigs were assigned an EC number. In-depth transcriptome analysis identified genes potentially involved in insecticide resistance, peritrophic membrane biosynthesis, immunity-related function and defense against pathogens, and Bacillus thuringiensis toxins binding proteins as well as multiple enzymes involved with protein digestion. This transcriptome will provide a valuable resource for understanding larval physiology and for identifying novel target sites and management approaches for this important insect pest.
A hypoxic/suboxic brine pool at a depth of about 850 m was discovered near the Thuwal cold seeps in the Red Sea. Filled with high concentrations of hydrogen sulfide and ammonia, such a brine pool might limit the spread of eukaryotic organisms. Here, we compared the communities of the eukaryotic microbes in a microbial mat, sediments and water samples distributed in 7 sites within and adjacent to the brine pool. Taxonomic classification of the pyrosequenced 18S rRNA amplicon reads showed that fungi highly similar to the species identified along the Arabic coast were almost ubiquitous in the water and sediment samples, supporting their wide distribution in various environments. The microbial mat displayed the highest species diversity and contained grazers and a considerable percentage of unclassified species. Phylogeny-based methods revealed novel lineages representing a majority of the reads from the interface between the sea water and brine pool. Phylogenetic relationships with more reference sequences suggest that the lineages were affiliated with novel Alveolata and Euglenozoa inhabiting the interface where chemosynthetic prokaryotes are highly proliferative due to the strong chemocline and halocline. The brine sediments harbored abundant species highly similar to invertebrate gregarine parasites identified in different oxygen-depleted sediments. Therefore, the present findings support the uniqueness of some microbial eukaryotic groups in this cold seep brine system. 2014 Wang, Zhang, Cao, Shek, Tian, Wong, Batang, Al-suwailem and Qian.
Analysis of RET promoter CpG island methylation using methylation-specific PCR (MSP), pyrosequencing, and methylation-sensitive high-resolution melting (MS-HRM): impact on stage II colon cancer patient outcome.
Draht, Muriel X G; Smits, Kim M; Jooste, Valérie; Tournier, Benjamin; Vervoort, Martijn; Ramaekers, Chantal; Chapusot, Caroline; Weijenberg, Matty P; van Engeland, Manon; Melotte, Veerle
Already since the 1990s, promoter CpG island methylation markers have been considered promising diagnostic, prognostic, and predictive cancer biomarkers. However, so far, only a limited number of DNA methylation markers have been introduced into clinical practice. One reason why the vast majority of methylation markers do not translate into clinical applications is lack of independent validation of methylation markers, often caused by differences in methylation analysis techniques. We recently described RET promoter CpG island methylation as a potential prognostic marker in stage II colorectal cancer (CRC) patients of two independent series. In the current study, we analyzed the RET promoter CpG island methylation of 241 stage II colon cancer patients by direct methylation-specific PCR (MSP), nested-MSP, pyrosequencing, and methylation-sensitive high-resolution melting (MS-HRM). All primers were designed as close as possible to the same genomic region. In order to investigate the effect of different DNA methylation assays on patient outcome, we assessed the clinical sensitivity and specificity as well as the association of RET methylation with overall survival for three and five years of follow-up. Using direct-MSP and nested-MSP, 12.0 % (25/209) and 29.6 % (71/240) of the patients showed RET promoter CpG island methylation. Methylation frequencies detected by pyrosequencing were related to the threshold for positivity that defined RET methylation. Methylation frequencies obtained by pyrosequencing (threshold for positivity at 20 %) and MS-HRM were 13.3 % (32/240) and 13.8 % (33/239), respectively. The pyrosequencing threshold for positivity of 20 % showed the best correlation with MS-HRM and direct-MSP results. Nested-MSP detected RET promoter CpG island methylation in deceased patients with a higher sensitivity (33.1 %) compared to direct-MSP (10.7 %), pyrosequencing (14.4 %), and MS-HRM (15.4 %). While RET methylation frequencies detected by nested
Portune, Kevin J; Pérez, M Carmen; Álvarez-Hornos, F Javier; Gabaldón, Carmen
Microbial biofilms are essential components in the elimination of pollutants within biofilters, yet still little is known regarding the complex relationships between microbial community structure and biodegradation function within these engineered ecosystems. To further explore this relationship, 16S rDNA tag pyrosequencing was applied to samples taken at four time points from a styrene-degrading biofilter undergoing variable operating conditions. Changes in microbial structure were observed between different stages of biofilter operation, and the level of styrene concentration was revealed to be a critical factor affecting these changes. Bacterial genera Azoarcus and Pseudomonas were among the dominant classified genera in the biofilter. Canonical correspondence analysis (CCA) and correlation analysis revealed that the genera Brevundimonas, Hydrogenophaga, and Achromobacter may play important roles in styrene degradation under increasing styrene concentrations. No significant correlations (P > 0.05) could be detected between biofilter operational/functional parameters and biodiversity measurements, although biological heterogeneity within biofilms and/or technical variability within pyrosequencing may have considerably affected these results. Percentages of selected bacterial taxonomic groups detected by fluorescence in situ hybridization (FISH) were compared to results from pyrosequencing in order to assess the effectiveness and limitations of each method for identifying each microbial taxon. Comparison of results revealed discrepancies between the two methods in the detected percentages of numerous taxonomic groups. Biases and technical limitations of both FISH and pyrosequencing, such as the binding of FISH probes to non-target microbial groups and lack of classification of sequences for defined taxonomic groups from pyrosequencing, may partially explain some differences between the two methods.
Chang, Dong Kyung; Metzgar, David; Wills, Christopher; Boland, C. Richard
All "minor" components of the human DNA mismatch repair (MMR) system-MSH3, MSH6, PMS2, and the recently discovered MLH3-contain mononucleotide microsatellites in their coding sequences. This intriguing finding contrasts with the situation found in the major components of the DNA MMR system-MSH2 and MLH1-and, in fact, most human genes. Although eukaryotic genomes are rich in microsatellites, non-triplet microsatellites are rare in coding regions. The recurring presence of exonal mononucleotide repeat sequences within a single family of human genes would therefore be considered exceptional.
Kuo, Alan; Grigoriev, Igor
Algae, defined as photosynthetic eukaryotes other than plants, constitute a major component of fundamental eukaryotic diversity. Acquisition of the ability to conduct oxygenic photosynthesis through endosymbiotic events has been a principal driver of eukaryotic evolution, and today algae continue to underpin aquatic food chains as primary producers. Algae play profound roles in the carbon cycle, can impose health and economic costs through toxic blooms, and are candidate sources for bio-fuels; all of these research areas are part of the mission of DOE?s Joint Genome Institute (JGI). A collection of algal projects ongoing at JGI contributes to each of these areas and illustrates analyses employed in their genome exploration.
Trémeaux, P; Caporossi, A; Ramière, C; Santoni, E; Tarbouriech, N; Thélu, M-A; Fusillier, K; Geneletti, L; François, O; Leroy, V; Burmeister, W P; André, P; Morand, P; Larrat, S
Directly acting antiviral drugs have contributed considerable progress to hepatitis C virus (HCV) treatment, but they show variable activity depending on virus genotypes and subtypes. Therefore, accurate genotyping including recombinant form detection is still of major importance, as is the detection of resistance-associated mutations in case of therapeutic failure. To meet these goals, an approach to amplify the HCV near-complete genome with a single long-range PCR and sequence it with Roche GS Junior was developed. After optimization, the overall amplification success rate was 73% for usual genotypes (i.e. HCV 1a, 1b, 3a and 4a, 16/22) and 45% for recombinant forms RF_2k/1b (5/11). After pyrosequencing and subsequent de novo assembly, a near-full-length genomic consensus sequence was obtained for 19 of 21 samples. The genotype and subtype were confirmed by phylogenetic analysis for every sample, including the suspected recombinant forms. Resistance-associated mutations were detected in seven of 13 samples at baseline, in the NS3 (n = 3) or NS5A (n = 4) region. Of these samples, the treatment of one patient included daclatasvir, and that patient experienced a relapse. Virus sequences from pre- and posttreatment samples of four patients who experienced relapse after sofosbuvir-based therapy were compared: the selected variants seem too far from the NS5B catalytic site to be held responsible. Although tested on a limited set of samples and with technical improvements still necessary, this assay has proven to be successful for both genotyping and resistance-associated variant detection on several HCV types. Copyright © 2016 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
O'Toole, Nicholas; Barboza, Perry S.; Ungerfeld, Emilio; Leigh, Mary Beth; Selinger, L. Brent; Butler, Greg; Tsang, Adrian; McAllister, Tim A.; Forster, Robert J.
Background Herbivores rely on digestive tract lignocellulolytic microorganisms, including bacteria, fungi and protozoa, to derive energy and carbon from plant cell wall polysaccharides. Culture independent metagenomic studies have been used to reveal the genetic content of the bacterial species within gut microbiomes. However, the nature of the genes encoded by eukaryotic protozoa and fungi within these environments has not been explored using metagenomic or metatranscriptomic approaches. Methodology/Principal Findings In this study, a metatranscriptomic approach was used to investigate the functional diversity of the eukaryotic microorganisms within the rumen of muskoxen (Ovibos moschatus), with a focus on plant cell wall degrading enzymes. Polyadenylated RNA (mRNA) was sequenced on the Illumina Genome Analyzer II system and 2.8 gigabases of sequences were obtained and 59129 contigs assembled. Plant cell wall degrading enzyme modules including glycoside hydrolases, carbohydrate esterases and polysaccharide lyases were identified from over 2500 contigs. These included a number of glycoside hydrolase family 6 (GH6), GH48 and swollenin modules, which have rarely been described in previous gut metagenomic studies. Conclusions/Significance The muskoxen rumen metatranscriptome demonstrates a much higher percentage of cellulase enzyme discovery and an 8.7x higher rate of total carbohydrate active enzyme discovery per gigabase of sequence than previous rumen metagenomes. This study provides a snapshot of eukaryotic gene expression in the muskoxen rumen, and identifies a number of candidate genes coding for potentially valuable lignocellulolytic enzymes. PMID:21655220
Mouilleron, Hélène; Delcourt, Vivian; Roucou, Xavier
mRNAs carry the genetic information that is translated by ribosomes. The traditional view of a mature eukaryotic mRNA is a molecule with three main regions, the 5' UTR, the protein coding open reading frame (ORF) or coding sequence (CDS), and the 3' UTR. This concept assumes that ribosomes translate one ORF only, generally the longest one, and produce one protein. As a result, in the early days of genomics and bioinformatics, one CDS was associated with each protein-coding gene. This fundamental concept of a single CDS is being challenged by increasing experimental evidence indicating that annotated proteins are not the only proteins translated from mRNAs. In particular, mass spectrometry (MS)-based proteomics and ribosome profiling have detected productive translation of alternative open reading frames. In several cases, the alternative and annotated proteins interact. Thus, the expression of two or more proteins translated from the same mRNA may offer a mechanism to ensure the co-expression of proteins which have functional interactions. Translational mechanisms already described in eukaryotic cells indicate that the cellular machinery is able to translate different CDSs from a single viral or cellular mRNA. In addition to summarizing data showing that the protein coding potential of eukaryotic mRNAs has been underestimated, this review aims to challenge the single translated CDS dogma. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Full Text Available BACKGROUND: Herbivores rely on digestive tract lignocellulolytic microorganisms, including bacteria, fungi and protozoa, to derive energy and carbon from plant cell wall polysaccharides. Culture independent metagenomic studies have been used to reveal the genetic content of the bacterial species within gut microbiomes. However, the nature of the genes encoded by eukaryotic protozoa and fungi within these environments has not been explored using metagenomic or metatranscriptomic approaches. METHODOLOGY/PRINCIPAL FINDINGS: In this study, a metatranscriptomic approach was used to investigate the functional diversity of the eukaryotic microorganisms within the rumen of muskoxen (Ovibos moschatus, with a focus on plant cell wall degrading enzymes. Polyadenylated RNA (mRNA was sequenced on the Illumina Genome Analyzer II system and 2.8 gigabases of sequences were obtained and 59129 contigs assembled. Plant cell wall degrading enzyme modules including glycoside hydrolases, carbohydrate esterases and polysaccharide lyases were identified from over 2500 contigs. These included a number of glycoside hydrolase family 6 (GH6, GH48 and swollenin modules, which have rarely been described in previous gut metagenomic studies. CONCLUSIONS/SIGNIFICANCE: The muskoxen rumen metatranscriptome demonstrates a much higher percentage of cellulase enzyme discovery and an 8.7x higher rate of total carbohydrate active enzyme discovery per gigabase of sequence than previous rumen metagenomes. This study provides a snapshot of eukaryotic gene expression in the muskoxen rumen, and identifies a number of candidate genes coding for potentially valuable lignocellulolytic enzymes.
Chew Chieng Yeo
Full Text Available Toxin-antitoxin (TA systems are found in nearly all prokaryotic genomes and usually consist of a pair of co-transcribed genes, one of which encodes a stable toxin and the other, its cognate labile antitoxin. Certain environmental and physiological cues trigger the degradation of the antitoxin, causing activation of the toxin, leading either to the death or stasis of the host cell. TA systems have a variety of functions in the bacterial cell, including acting as mediators of programmed cell death, the induction of a dormant state known as persistence and the stable maintenance of plasmids and other mobile genetic elements. Some bacterial TA systems are functional when expressed in eukaryotic cells and this has led to several innovative applications, which are the subject of this review. Here, we look at how bacterial TA systems have been utilized for the genetic manipulation of yeasts and other eukaryotes, for the containment of genetically modified organisms, and for the engineering of high expression eukaryotic cell lines. We also examine how TA systems have been adopted as an important tool in developmental biology research for the ablation of specific cells and the potential for utility of TA systems in antiviral and anticancer gene therapies.
Yeo, Chew Chieng; Abu Bakar, Fauziah; Chan, Wai Ting; Espinosa, Manuel; Harikrishna, Jennifer Ann
Toxin-antitoxin (TA) systems are found in nearly all prokaryotic genomes and usually consist of a pair of co-transcribed genes, one of which encodes a stable toxin and the other, its cognate labile antitoxin. Certain environmental and physiological cues trigger the degradation of the antitoxin, causing activation of the toxin, leading either to the death or stasis of the host cell. TA systems have a variety of functions in the bacterial cell, including acting as mediators of programmed cell death, the induction of a dormant state known as persistence and the stable maintenance of plasmids and other mobile genetic elements. Some bacterial TA systems are functional when expressed in eukaryotic cells and this has led to several innovative applications, which are the subject of this review. Here, we look at how bacterial TA systems have been utilized for the genetic manipulation of yeasts and other eukaryotes, for the containment of genetically modified organisms, and for the engineering of high expression eukaryotic cell lines. We also examine how TA systems have been adopted as an important tool in developmental biology research for the ablation of specific cells and the potential for utility of TA systems in antiviral and anticancer gene therapies. PMID:26907343
Javaux, E J; Knoll, A H; Walter, M R
Molecular phylogeny and biogeochemistry indicate that eukaryotes differentiated early in Earth history. Sequence comparisons of small-subunit ribosomal RNA genes suggest a deep evolutionary divergence of Eukarya and Archaea; C27-C29 steranes (derived from sterols synthesized by eukaryotes) and strong depletion of 13C (a biogeochemical signature of methanogenic Archaea) in 2,700 Myr old kerogens independently place a minimum age on this split. Steranes, large spheroidal microfossils, and rare macrofossils of possible eukaryotic origin occur in Palaeoproterozoic rocks. Until now, however, evidence for morphological and taxonomic diversification within the domain has generally been restricted to very late Mesoproterozoic and Neoproterozoic successions. Here we show that the cytoskeletal and ecological prerequisites for eukaryotic diversification were already established in eukaryotic microorganisms fossilized nearly 1,500 Myr ago in shales of the early Mesoproterozoic Roper Group in northern Australia.
Hörandl, Elvira; Speijer, Dave
9years ago. The large amount of ROS coming from a bacterial endosymbiont gave rise to DNA damage and vast increases in host genome mutation rates. Eukaryogenesis and chromosome evolution represent adaptations to oxidative stress. The host, an archaeon, most probably already had repair mechanisms
Full Text Available Abstract Background Sweetpotato (Ipomoea batatas (L. Lam., a hexaploid outcrossing crop, is an important staple and food security crop in developing countries in Africa and Asia. The availability of genomic resources for sweetpotato is in striking contrast to its importance for human nutrition. Previously existing sequence data were restricted to around 22,000 expressed sequence tag (EST sequences and ~ 1,500 GenBank sequences. We have used 454 pyrosequencing to augment the available gene sequence information to enhance functional genomics and marker design for this plant species. Results Two quarter 454 pyrosequencing runs used two normalized cDNA collections from stems and leaves from drought-stressed sweetpotato clone Tanzania and yielded 524,209 reads, which were assembled together with 22,094 publically available expressed sequence tags into 31,685 sets of overlapping DNA segments and 34,733 unassembled sequences. Blastx comparisons with the UniRef100 database allowed annotation of 23,957 contigs and 15,342 singletons resulting in 24,657 putatively unique genes. Further, 27,119 sequences had no match to protein sequences of UniRef100database. On the basis of this gene index, we have identified 1,661 gene-based microsatellite sequences, of which 223 were selected for testing and 195 were successfully amplified in a test panel of 6 hexaploid (I. batatas and 2 diploid (I. trifida accessions. Conclusions The sweetpotato gene index is a useful source for functionally annotated sweetpotato gene sequences that contains three times more gene sequence information for sweetpotato than previous EST assemblies. A searchable version of the gene index, including a blastn function, is available at http://www.cipotato.org/sweetpotato_gene_index.
Pearman, John K.
Coral reefs are considered among the most diverse ecosystems on Earth, yet little is known about the diversity of plankton in the surrounding water column. Moreover, few studies have utilized genomic methods to investigate zooplankton diversity in any habitat. This study investigated the diversity of taxa by sampling 45 stations around three reef systems in the central/southern Red Sea. The diversity of metazoan plankton was investigated by targeting the 18S rRNA gene and clustering OTUs at 97% sequence similarity. A total of 754 and 854 metazoan OTUs were observed in the data set for the 1380F and 1389F primer sets respectively. The phylum Arthropoda dominated both primer sets accounting for ~60% of reads followed by Cnidaria (~20%). Only about 20% of OTUs were shared between all three reef systems and the relation between geographic distance and Jaccard Similarity measures was not significant. Cluster analysis showed that there was no distinct split between reefs and stations from different reefs clustered together both for metazoans as a whole and for the phyla Arthropoda, Cnidaria and Chordata separately. This suggests that distance may not be a determining factor in the taxonomic composition of stations.
Lio` , P [Cambridge, Univ. (United Kingdom). Genetics Dept.; Ruffo, S [Florence, Univ. (Italy). Fac. di Ingegneria. Dipt. di Energetica ` S. Stecco`
The authors have analyzed general properties of very long DNA sequences belonging to simple and complex organisms, by using different correlation methods. They have distinguished those base compositional rules that concern the entire genome which they call `genomic constraints` from the rules that depend on the `external natural selection` acting on single genes, i. e. protein-centered constraints. They show that G + C content, purine / pyrimidine distributions and biological complexity of the organism are the most important factors which determine base compositional rules and genome complexity. Three main facts are here reported: bacteria with high G + C content have more restrictions on base composition than those with low G + C content; at constant G + C content more complex organisms, ranging from prokaryotes to higher eukaryotes (e.g. human) display an increase of repeats 10-20 nucleotides long, which are also partly responsible for long-range correlations; work selection of length 3 to 10 is stronger in human and in bacteria for two distinct reasons. With respect to previous studies, they have also compared the genomic sequence of the archeon Methanococcus jannaschii with those of bacteria and eukaryotes: it shows sometimes an intermediate statistical behaviour.
Lio', P.; Ruffo, S.
The authors have analyzed general properties of very long DNA sequences belonging to simple and complex organisms, by using different correlation methods. They have distinguished those base compositional rules that concern the entire genome which they call 'genomic constraints' from the rules that depend on the 'external natural selection' acting on single genes, i. e. protein-centered constraints. They show that G + C content, purine / pyrimidine distributions and biological complexity of the organism are the most important factors which determine base compositional rules and genome complexity. Three main facts are here reported: bacteria with high G + C content have more restrictions on base composition than those with low G + C content; at constant G + C content more complex organisms, ranging from prokaryotes to higher eukaryotes (e.g. human) display an increase of repeats 10-20 nucleotides long, which are also partly responsible for long-range correlations; work selection of length 3 to 10 is stronger in human and in bacteria for two distinct reasons. With respect to previous studies, they have also compared the genomic sequence of the archeon Methanococcus jannaschii with those of bacteria and eukaryotes: it shows sometimes an intermediate statistical behaviour
Furse, Samuel; Shearman, Gemma C
Successful passage through the cell cycle presents a number of structural challenges to the cell. Inceptive studies carried out in the last five years have produced clear evidence of modulations in the lipid profile (sometimes referred to as the lipidome) of eukaryotes as a function of the cell cycle. This mounting body of evidence indicates that lipids play key roles in the structural transformations seen across the cycle. The accumulation of this evidence coincides with a revolution in our understanding of how lipid composition regulates a plethora of biological processes ranging from protein activity through to cellular signalling and membrane compartmentalisation. In this review, we discuss evidence from biological, chemical and physical studies of the lipid fraction across the cell cycle that demonstrate that lipids are well-developed cellular components at the heart of the biological machinery responsible for managing progress through the cell cycle. Furthermore, we discuss the mechanisms by which this careful control is exercised. Crown Copyright © 2017. Published by Elsevier B.V. All rights reserved.
Findeisen, Peggy; Mühlhausen, Stefanie; Dempewolf, Silke; Hertzog, Jonny; Zietlow, Alexander; Carlomagno, Teresa; Kollmar, Martin
Tubulins belong to the most abundant proteins in eukaryotes providing the backbone for many cellular substructures like the mitotic and meiotic spindles, the intracellular cytoskeletal network, and the axonemes of cilia and flagella. Homologs have even been reported for archaea and bacteria. However, a taxonomically broad and whole-genome-based analysis of the tubulin protein family has never been performed, and thus, the number of subfamilies, their taxonomic distribution, and the exact grouping of the supposed archaeal and bacterial homologs are unknown. Here, we present the analysis of 3,524 tubulins from 504 species. The tubulins formed six major subfamilies, α to ζ. Species of all major kingdoms of the eukaryotes encode members of these subfamilies implying that they must have already been present in the last common eukaryotic ancestor. The proposed archaeal homologs grouped together with the bacterial TubZ proteins as sister clade to the FtsZ proteins indicating that tubulins are unique to eukaryotes. Most species contained α- and/or β-tubulin gene duplicates resulting from recent branch- and species-specific duplication events. This shows that tubulins cannot be used for constructing species phylogenies without resolving their ortholog-paralog relationships. The many gene duplicates and also the independent loss of the δ-, ε-, or ζ-tubulins, which have been shown to be part of the triplet microtubules in basal bodies, suggest that tubulins can functionally substitute each other. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Eukaryotic microorganisms are naturally present in many water resources and can enter, grow and colonize water treatment and transport systems, including reservoirs, pipes and premise plumbing. In this study, we explored the eukaryotic microbial community structure in water during the (i) production of drinking water in a seawater desalination plant and (ii) transport of the drinking water in the distribution network. The desalination plant treatment involved pre-treatment (e.g. spruce filters), reverse osmosis (RO) membrane filtration and post-treatment steps (e.g. remineralization). 454 pyrosequencing analysis of the 18S rRNA gene revealed a highly diverse (35 phyla) and spatially variable eukaryotic community during water treatment and distribution. The desalination plant feed water contained a typical marine picoeukaryotic community dominated by Stramenopiles, Alveolates and Porifera. In the desalination plant Ascomycota was the most dominant phylum (15.5% relative abundance), followed by Alveolata (11.9%), unclassified fungi clade (10.9%) and Porifera (10.7%). In the drinking water distribution network, an uncultured fungi phylum was the major group (44.0%), followed by Chordata (17.0%), Ascomycota (11.0%) and Arthropoda (8.0%). Fungi constituted 40% of the total eukaryotic community in the treatment plant and the distribution network and their taxonomic composition was dominated by an uncultured fungi clade (55%). Comparing the plant effluent to the network samples, 84 OTUs (2.1%) formed the core eukaryotic community while 35 (8.4%) and 299 (71.5%) constituted unique OTUs in the produced water at the plant and combined tap water samples from the network, respectively. RO membrane filtration treatment significantly changed the water eukaryotic community composition and structure, highlighting the fact that (i) RO produced water is not sterile and (ii) the microbial community in the final tap water is influenced by the downstream distribution system. The study
Wijffels, René H; Kruse, Olaf; Hellingwerf, Klaas J
Both cyanobacteria and eukaryotic microalgae are promising organisms for sustainable production of bulk products such as food, feed, materials, chemicals and fuels. In this review we will summarize the potential and current biotechnological developments. Cyanobacteria are promising host organisms for the production of small molecules that can be secreted such as ethanol, butanol, fatty acids and other organic acids. Eukaryotic microalgae are interesting for products for which cellular storage is important such as proteins, lipids, starch and alkanes. For the development of new and promising lines of production, strains of both cyanobacteria and eukaryotic microalgae have to be improved. Transformation systems have been much better developed in cyanobacteria. However, several products would be preferably produced with eukaryotic microalgae. In the case of cyanobacteria a synthetic-systems biology approach has a great potential to exploit cyanobacteria as cell factories. For eukaryotic microalgae transformation systems need to be further developed. A promising strategy is transformation of heterologous (prokaryotic and eukaryotic) genes in established eukaryotic hosts such as Chlamydomonas reinhardtii. Experimental outdoor pilots under containment for the production of genetically modified cyanobacteria and microalgae are in progress. For full scale production risks of release of genetically modified organisms need to be assessed. Copyright © 2013. Published by Elsevier Ltd.
Feulner, Philine G. D.; Johnston, Susan E.; Santure, Anna W.; Smadja, Carole M.
Recombination, the exchange of DNA between maternal and paternal chromosomes during meiosis, is an essential feature of sexual reproduction in nearly all multicellular organisms. While the role of recombination in the evolution of sex has received theoretical and empirical attention, less is known about how recombination rate itself evolves and what influence this has on evolutionary processes within sexually reproducing organisms. Here, we explore the patterns of, and processes governing recombination in eukaryotes. We summarize patterns of variation, integrating current knowledge with an analysis of linkage map data in 353 organisms. We then discuss proximate and ultimate processes governing recombination rate variation and consider how these influence evolutionary processes. Genome-wide recombination rates (cM/Mb) can vary more than tenfold across eukaryotes, and there is large variation in the distribution of recombination events across closely related taxa, populations and individuals. We discuss how variation in rate and distribution relates to genome architecture, genetic and epigenetic mechanisms, sex, environmental perturbations and variable selective pressures. There has been great progress in determining the molecular mechanisms governing recombination, and with the continued development of new modelling and empirical approaches, there is now also great opportunity to further our understanding of how and why recombination rate varies. This article is part of the themed issue ‘Evolutionary causes and consequences of recombination rate variation in sexual organisms’. PMID:29109219
Full Text Available The MOB family includes a group of cell cycle-associated proteins highly conserved throughout eukaryotes, whose founding members are implicated in mitotic exit and co-ordination of cell cycle progression with cell polarity and morphogenesis. Here we report the characterization and evolution of the MOB domain-containing proteins as inferred from the 43 eukaryotic genomes so far sequenced. We show that genes for Mob-like proteins are present in at least 41 of these genomes, confi rming the universal distribution of this protein family and suggesting its prominent biological function. The phylogenetic analysis reveals fi ve distinct MOB domain classes, showing a progressive expansion of this family from unicellular to multicellular organisms, reaching the highest number in mammals. Plant Mob genes appear to have evolved from a single ancestor, most likely after the loss of one or more genes during the early stage of Viridiplantae evolutionary history. Three of the Mob classes are widespread among most of the analyzed organisms. The possible biological and molecular function of Mob proteins and their role in conserved signaling pathways related to cell proliferation, cell death and cell polarity are also presented and critically discussed.
Rask, Thomas Salhøj; Petersen, Bent; Chen, Donald S.
. The new basecalling method described here, named Multipass, implements a probabilistic framework for working with the raw flowgrams obtained by pyrosequencing. For each sequence variant Multipass calculates the likelihood and nucleotide sequence of several most likely sequences given the flowgram data....... This probabilistic approach enables integration of basecalling into a larger model where other parameters can be incorporated, such as the likelihood for observing a full-length open reading frame at the targeted region. We apply the method to 454 amplicon pyrosequencing data obtained from a malaria virulence gene...... family, where Multipass generates 20 % more error-free sequences than current state of the art methods, and provides sequence characteristics that allow generation of a set of high confidence error-free sequences. This novel method can be used to increase accuracy of existing and future amplicon...
Song, Qinxin; Wei, Guijiang; Zhou, Guohua
A portable bioluminescence analyser for detecting the DNA sequence of genetically modified organisms (GMOs) was developed by using a photodiode (PD) array. Pyrosequencing on eight genes (zSSIIb, Bt11 and Bt176 gene of genetically modified maize; Lectin, 35S-CTP4, CP4EPSPS, CaMV35S promoter and NOS terminator of the genetically modified Roundup ready soya) was successfully detected with this instrument. The corresponding limit of detection (LOD) was 0.01% with 35 PCR cycles. The maize and soya available from three different provenances in China were detected. The results indicate that pyrosequencing using the small size of the detector is a simple, inexpensive, and reliable way in a farm/field test of GMO analysis. Copyright © 2014 Elsevier Ltd. All rights reserved.
We have found that Rad51 and RecA Proteins form strikingly similar structures together with dsDNA and ATP. Their right handed helical nucleoprotein filaments extend the B-form DNA double helixes to 1.5 times in length and wind the helix. The similarity and uniqueness of their structures must reflect functional homologies between these proteins. Therefore, it is highly probable that similar recombination proteins are present in various organisms of different evolutional states. We have succeeded to clone RAD51 genes from human, mouse, chicken and fission yeast genes, and found that the homologues are widely distributed in eukaryotes. The HsRad51 and MmRad51 or ChRad51 proteins consist of 339 amino acids differing only by 4 or 12 amino acids, respectively, and highly homologous to both yeast proteins, but less so to Dmcl. All of these proteins are homologous to the region from residues 33 to 240 of RecA which was named ''homologous core. The homologous core is likely to be responsible for functions common for all of them, such as the formation of helical nucleoprotein filament that is considered to be involved in homologous pairing in the recombination reaction. The mouse gene is transcribed at a high level in thymus, spleen, testis, and ovary, at lower level in brain and at a further lower level in some other tissues. It is transcribed efficiently in recombination active tissues. A clear functional difference of Rad51 homologues from RecA was suggested by the failure of heterologous genes to complement the deficiency of Scrad51 mutants. This failure seems to reflect the absence of a compatible partner, such as ScRad52 protein in the case of ScRad51 protein, between different species. Thus, these discoveries play a role of the starting point to understand the fundamental gene targeting in mammalian cells and in gene therapy. (J.P.N.)
Full Text Available Eukaryotic translation is a complex process composed of three main steps: initiation, elongation, and termination. During infections by RNA- and DNA-viruses, the eukaryotic translation machinery is used to assure optimal viral protein synthesis. Human immunodeficiency virus type I (HIV-1 uses several non-canonical pathways to translate its own proteins, such as leaky scanning, frameshifting, shunt, and cap-independent mechanisms. Moreover, HIV-1 modulates the host translation machinery by targeting key translation factors and overcomes different cellular obstacles that affect protein translation. In this review, we describe how HIV-1 proteins target several components of the eukaryotic translation machinery, which consequently improves viral translation and replication.
Sherman, F; McKnight, G; Stewart, J W
An analysis of mutants of the yeast Saccharomyces cerevisiae indicates that AUG is the sole codon capable of initiating translation of iso-1-cytochrome c. This result with yeast and the sequence results of numerous eukaryotic genes indicate that AUG is the only initiation codon in eukaryotes; in contrast, results with Escherichia colia and bacteriophages indicate that both AUG and GUG are initiation codons in prokaryotes. The difference can be explained by the lack of the t/sup 6/ A hypermodified nucleoside (N-(9-(..beta..-D-ribofuranosyl)purin-6-ylcarbamoyl)threonine) in prokaryotic initiator tRNA and its presence in eukaryotic initiator tRNA.
Ho, Louis K; Nodwell, Justin R
Environmental microbes produce biologically active small molecules that have been mined extensively as antibiotics and a smaller number of drugs that act on eukaryotic cells. It is known that there are additional bioactives to be discovered from this source. While the discovery of new antibiotics is challenged by the frequent discovery of known compounds, we contend that the eukaryote-active compounds may be less saturated. Indeed, despite there being far fewer eukaryotic-active natural products these molecules interact with a far richer diversity of molecular and cellular targets.
Umen, James G.; Olson, Bradley J.S.C.
Volvocine algae are a group of chlorophytes that together comprise a unique model for evolutionary and developmental biology. The species Chlamydomonas reinhardtii and Volvox carteri represent extremes in morphological diversity within the Volvocine clade. Chlamydomonas is unicellular and reflects the ancestral state of the group, while Volvox is multicellular and has evolved numerous innovations including germ-soma differentiation, sexual dimorphism, and complex morphogenetic patterning. The Chlamydomonas genome sequence has shed light on several areas of eukaryotic cell biology, metabolism and evolution, while the Volvox genome sequence has enabled a comparison with Chlamydomonas that reveals some of the underlying changes that enabled its transition to multicellularity, but also underscores the subtlety of this transition. Many of the tools and resources are in place to further develop Volvocine algae as a model for evolutionary genomics. PMID:25883411
Carda Diéguez, Miguel; Mira, Alex; Fouz Rodríguez, Belén
The routine use of chemotherapy to control bacterial diseases in aquatic populations has resulted in the development and spread of antibiotic resistance. The inclusion of immunostimulants in fish diets (functional diets) is one of the main strategies to solve this threat. This study aimed to analyse the intestinal microbiota of cultured European sea bass (Dicentrarchus labrax) fed two functional diets applying pyrosequencing of PCR-amplified 16S rRNA gene. Quality-filtered reads were assigned...
Razin, S V
This issue of Biochemistry (Moscow) is devoted to the cell nucleus and mechanisms of transcription regulation. Over the years, biochemical processes in the cell nucleus have been studied in isolation, outside the context of their spatial organization. Now it is clear that segregation of functional processes within a compartmentalized cell nucleus is very important for the implementation of basic genetic processes. The functional compartmentalization of the cell nucleus is closely related to the spatial organization of the genome, which in turn plays a key role in the operation of epigenetic mechanisms. In this issue of Biochemistry (Moscow), we present a selection of review articles covering the functional architecture of the eukaryotic cell nucleus, the mechanisms of genome folding, the role of stochastic processes in establishing 3D architecture of the genome, and the impact of genome spatial organization on transcription regulation.
Fournier, D.; Hanus, Robert; Roisin, Y.
Roč. 7, č. 2 (2015), s. 521-524 ISSN 1877-7252 Institutional support: RVO:61388963 Keywords : Cavitermes tuberosus * termite * microsatellite * pyrosequencing * population genetics Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 0.446, year: 2015
Lee, O. O.; Yang, J.; Bougouffa, S.; Wang, Y.; Batang, Zenon B.; Tian, R.; Al-Suwailem, A.; Qian, P.-Y.
-pyrosequencing technique to investigate the bacterial communities associated with three stony Scleractinea and two soft Octocorallia corals from three locations in the Red Sea. Our results revealed highly diverse bacterial communities in the Red Sea corals, with more than
A Mark Ibekwe
Full Text Available Current microbial source tracking (MST methods for water depend on testing for fecal indicator bacterial counts or specific marker gene sequences to identify fecal contamination where potential human pathogenic bacteria could be present. In this study, we applied 454 high-throughput pyrosequencing to identify bacterial pathogen DNA sequences, including those not traditionally monitored by MST and correlated their abundances to specific sources of contamination such as urban runoff and agricultural runoff from concentrated animal feeding operations (CAFOs, recreation park area, waste-water treatment plants, and natural sites with little or no human activities. Samples for pyrosequencing were surface water, and sediment collected from 19 sites. A total of 12,959 16S rRNA gene sequences with average length of ≤400 bp were obtained, and were assigned to corresponding taxonomic ranks using ribosomal database project (RDP, Classifier and Greengenes databases. The percent of total potential pathogens were highest in urban runoff water (7.94%, agricultural runoff sediment (6.52%, and Prado Park sediment (6.00%, respectively. Although the numbers of DNA sequence tags from pyrosequencing were very high for the natural site, corresponding percent potential pathogens were very low (3.78-4.08%. Most of the potential pathogenic bacterial sequences identified were from three major phyla, namely, Proteobacteria, Bacteroidetes, and Firmicutes. The use of deep sequencing may provide improved and faster methods for the identification of pathogen sources in most watersheds so that better risk assessment methods may be developed to enhance public health.
Hou, X-L; Cao, Q-Y; Jia, H-Y; Chen, Z
Pathogens causing acute diarrhea include a large variety of species from Enterobacteriaceae and Vibrionaceae. A method based on pyrosequencing was used here to differentiate bacteria commonly associated with diarrhea in China; the method is targeted to a partial amplicon of the gyrB gene, which encodes the B subunit of DNA gyrase. Twenty-eight specific polymorphic positions were identified from sequence alignment of a large sequence dataset and targeted using 17 sequencing primers. Of 95 isolates tested, belonging to 13 species within 7 genera, most could be identified to the species level; O157 type could be differentiated from other E. coli types; Salmonella enterica subsp. enterica could be identified at the serotype level; the genus Shigella, except for S. boydii and S. dysenteriae, could also be identified. All these isolates were also subjected to conventional sequencing of a relatively long ( approximately1.2 kb) region of gyrB DNA; these results confirmed those with pyrosequencing. Twenty-two fecal samples were surveyed, the results of which were concordant with culture-based bacterial identification, and the pathogen detection limit with simulated stool specimens was 10(4) CFU/ml. DNA from different pathogens was also mixed to simulate a case of multibacterial infection, and the generated signals correlated well with the mix ratio. In summary, the gyrB-based pyrosequencing approach proved to have significant reliability and discriminatory power for enteropathogenic bacterial identification and provided a fast and effective method for clinical diagnosis.
Keskin, Cangül; Demiryürek, Ebru Özsezer; Onuk, Ertan Emek
This study aimed to characterize the microbial communities of primary and secondary/persistent endodontic infections using high-throughput pyrosequencing from the pulverized samples. The roots of 20 extracted human teeth with primary endodontic infection and 20 teeth with secondary/persistent endodontic infection were collected. The outer surfaces of the roots were disinfected, and whole roots were cryopulverized. 16S amplicon pyrosequencing data from the DNA extracted from the pulverized root powders were obtained, and microorganism abundance and diversity were calculated. Data were analyzed using statistical and bioinformatic methods. Pyrosequencing analysis resulted a total of 2,606,128 sequences from 40 samples. A total of 15 phyla, 160 genera, and 368 species were detected. No significant difference between primary and secondary/persistent endodontic infections was found regarding the diversity and richness of operational taxonomic units at the phyla, genera, and species levels (P > .005). The present study revealed that the microbial diversity of secondary/persistent endodontic infections did not differ than those of primary endodontic infections. A new archaeal species, Candidatus Nitrosoarchaeum limnia, was detected in root canals of 1 patient with primary endodontic infection for the first time. Copyright © 2017 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
Kunin, Victor; Engelbrektson, Anna; Ochman, Howard; Hugenholtz, Philip
Massively parallel pyrosequencing of the small subunit (16S) ribosomal RNA gene has revealed that the extent of rare microbial populations in several environments, the 'rare biosphere', is orders of magnitude higher than previously thought. One important caveat with this method is that sequencing error could artificially inflate diversity estimates. Although the per-base error of 16S rDNA amplicon pyrosequencing has been shown to be as good as or lower than Sanger sequencing, no direct assessments of pyrosequencing errors on diversity estimates have been reported. Using only Escherichia coli MG1655 as a reference template, we find that 16S rDNA diversity is grossly overestimated unless relatively stringent read quality filtering and low clustering thresholds are applied. In particular, the common practice of removing reads with unresolved bases and anomalous read lengths is insufficient to ensure accurate estimates of microbial diversity. Furthermore, common and reproducible homopolymer length errors can result in relatively abundant spurious phylotypes further confounding data interpretation. We suggest that stringent quality-based trimming of 16S pyrotags and clustering thresholds no greater than 97% identity should be used to avoid overestimates of the rare biosphere.
Hong, Bo-Young; Lee, Tae-Kwon; Lim, Sang-Min; Chang, Seok Woo; Park, Joonhong; Han, Seung Hyun; Zhu, Qiang; Safavi, Kamran E; Fouad, Ashraf F; Kum, Kee Yeon
The aim of this study was to investigate the bacterial community profile of intracanal microbiota in primary and persistent endodontic infections associated with asymptomatic chronic apical periodontitis by using GS-FLX Titanium pyrosequencing. The null hypothesis was that there is no difference in diversity of overall bacterial community profiles between primary and persistent infections. Pyrosequencing analysis from 10 untreated and 8 root-filled samples was conducted. Analysis from 18 samples yielded total of 124,767 16S rRNA gene sequences (with a mean of 6932 reads per sample) that were taxonomically assigned into 803 operational taxonomic units (3% distinction), 148 genera, and 10 phyla including unclassified. Bacteroidetes was the most abundant phylum in both primary and persistent infections. There were no significant differences in bacterial diversity between the 2 infection groups (P > .05). The bacterial community profile that was based on dendrogram showed that bacterial population in both infections was not significantly different in their structure and composition (P > .05). The present pyrosequencing study demonstrates that persistent infections have as diverse bacterial community as primary infections. Copyright © 2013 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
Karnkowska, A.; Vacek, V.; Zubáčová, Z.; Treitli, S.C.; Petrzelkova, R.; Eme, L.; Novák, L.; Žárský, V.; Barlow, L.D.; Herman, E.K.; Soukal, P.; Hroudová, Miluše; Doležal, P.; Stairs, C.W.; Roger, A. J.; Eliaš, M.; Dacks, J.B.; Vlček, Čestmír; Hampl, V.
Roč. 26, č. 10 (2016), s. 1274-1284 ISSN 0960-9822 R&D Projects: GA MŠk(CZ) LQ1604; GA MŠk(CZ) ED1.1.00/02.0109; GA ČR GAP506/12/1010 Institutional support: RVO:68378050 Keywords : arginine dihydrolase pathway * tail-anchored proteins * fe-s cluster * trichomonas-vaginalis * entamoeba-histolytica * giardia-intestinalis * tritrichomonas-fetus * genome annotation * energy-metabolism * gene-transfer Subject RIV: EB - Genetics ; Molecular Biology OBOR OECD: Developmental biology Impact factor: 8.851, year: 2016
Wijffels, R.H.; Kruse, O.; Hellingwerf, K.J.
Both cyanobacteria and eukaryotic microalgae are promising organisms for sustainable production of bulk products such as food, feed, materials, chemicals and fuels. In this review we will summarize the potential and current biotechnological developments. Cyanobacteria are promising host organisms
Comparisons among a variety of eukaryotes have revealed considerable variability in the structures and processes involved in their meiosis. Nevertheless, conventional forms of meiosis occur in all major groups of eukaryotes, including early-branching protists. This finding confirms that meiosis originated in the common ancestor of all eukaryotes and suggests that primordial meiosis may have had many characteristics in common with conventional extant meiosis. However, it is possible that the synaptonemal complex and the delicate crossover control related to its presence were later acquisitions. Later still, modifications to meiotic processes occurred within different groups of eukaryotes. Better knowledge on the spectrum of derived and uncommon forms of meiosis will improve our understanding of many still mysterious aspects of the meiotic process and help to explain the evolutionary basis of functional adaptations to the meiotic program.
Liu, Dekang; Keijzers, Guido; Rasmussen, Lene Juel
in the clinic, and as a biomarker of cancer susceptibility in animal model systems. Prokaryotic MMR is well-characterized at the molecular and mechanistic level; however, MMR is considerably more complex in eukaryotic cells than in prokaryotic cells, and in recent years, it has become evident that MMR plays...... novel roles in eukaryotic cells, several of which are not yet well-defined or understood. Many MMR-deficient human cancer cells lack mutations in known human MMR genes, which strongly suggests that essential eukaryotic MMR components/cofactors remain unidentified and uncharacterized. Furthermore......, the mechanism by which the eukaryotic MMR machinery discriminates between the parental (template) and the daughter (nascent) DNA strand is incompletely understood and how cells choose between the EXO1-dependent and the EXO1–independent subpathways of MMR is not known. This review summarizes recent literature...
Kamp, Anja; Høgslund, Signe; Risgaard-Petersen, Nils
The microbial nitrogen cycle is one of the most complex and environmentally important element cycles on Earth and has long been thought to be mediated exclusively by prokaryotic microbes. Rather recently, it was discovered that certain eukaryotic microbes are able to store nitrate intracellularly......, suggesting that eukaryotes may rival prokaryotes in terms of dissimilatory nitrate reduction. Finally, this review article sketches some evolutionary perspectives of eukaryotic nitrate metabolism and identifies open questions that need to be addressed in future investigations....... and use it for dissimilatory nitrate reduction in the absence of oxygen. The paradigm shift that this entailed is ecologically significant because the eukaryotes in question comprise global players like diatoms, foraminifers, and fungi. This review article provides an unprecedented overview of nitrate...
Dahmane, Narimane; Gadelle, Danièle; Delmas, Stéphane; Criscuolo, Alexis; Eberhard, Stephan; Desnoues, Nicole; Collin, Sylvie; Zhang, Hongliang; Pommier, Yves; Forterre, Patrick; Sezonov, Guennadi
Type IB DNA topoisomerases can eliminate torsional stresses produced during replication and transcription. These enzymes are found in all eukaryotes and a short version is present in some bacteria and viruses. Among prokaryotes, the long eukaryotic version is only observed in archaea of the phylum Thaumarchaeota. However, the activities and the roles of these topoisomerases have remained an open question. Here, we demonstrate that all available thaumarchaeal genomes contain a topoisomerase IB gene that defines a monophyletic group closely related to the eukaryotic enzymes. We show that the topIB gene is expressed in the model thaumarchaeon Nitrososphaera viennensis and we purified the recombinant enzyme from the uncultivated thaumarchaeon Candidatus Caldiarchaeum subterraneum. This enzyme is active in vitro at high temperature, making it the first thermophilic topoisomerase IB characterized so far. We have compared this archaeal type IB enzyme to its human mitochondrial and nuclear counterparts. The archaeal enzyme relaxes both negatively and positively supercoiled DNA like the eukaryotic enzymes. However, its pattern of DNA cleavage specificity is different and it is resistant to camptothecins (CPTs) and non-CPT Top1 inhibitors, LMP744 and lamellarin D. This newly described thermostable topoisomerases IB should be a promising new model for evolutionary, mechanistic and structural studies. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Full Text Available Abstract Background Alternative splicing has been reported in various eukaryotic groups including plants, apicomplexans, diatoms, amoebae, animals and fungi. However, whether widespread alternative splicing has evolved independently in the different eukaryotic groups or was inherited from their last common ancestor, and may therefore predate multicellularity, is still unknown. To better understand the origin and evolution of alternative splicing and its usage in diverse organisms, we studied alternative splicing in 12 eukaryotic species, comparing rates of alternative splicing across genes of different functional classes, cellular locations, intron/exon structures and evolutionary origins. Results For each species, we find that genes from most functional categories are alternatively spliced. Ancient genes (shared between animals, fungi and plants show high levels of alternative splicing. Genes with products expressed in the nucleus or plasma membrane are generally more alternatively spliced while those expressed in extracellular location show less alternative splicing. We find a clear correspondence between incidence of alternative splicing and intron number per gene both within and between genomes. In general, we find several similarities in patterns of alternative splicing across these diverse eukaryotes. Conclusion Along with previous studies indicating intron-rich genes with weak intron boundary consensus and complex spliceosomes in ancestral organisms, our results suggest that at least a simple form of alternative splicing may already have been present in the unicellular ancestor of plants, fungi and animals. A role for alternative splicing in the evolution of multicellularity then would largely have arisen by co-opting the preexisting process.
Huerta, Carlos; Borek, Dominika; Machius, Mischa; Grishin, Nick V.; Zhang, Hong
Flavin mononucleotide adenylyltransferase (FMNAT) catalyzes the formation of the essential flavocoenzyme FAD and plays an important role in flavocoenzyme homeostasis regulation. By sequence comparison, bacterial and eukaryotic FMNAT enzymes belong to two different protein superfamilies and apparently utilize different set of active site residues to accomplish the same chemistry. Here we report the first structural characterization of a eukaryotic FMNAT from a pathogenic yeast Candida glabrata...
Neil W. Blackstone
Full Text Available Two major obstacles hinder the application of evolutionary theory to the origin of eukaryotes. The first is more apparent than real—the endosymbiosis that led to the mitochondrion is often described as “non-Darwinian” because it deviates from the incremental evolution championed by the modern synthesis. Nevertheless, endosymbiosis can be accommodated by a multi-level generalization of evolutionary theory, which Darwin himself pioneered. The second obstacle is more serious—all of the major features of eukaryotes were likely present in the last eukaryotic common ancestor thus rendering comparative methods ineffective. In addition to a multi-level theory, the development of rigorous, sequence-based phylogenetic and comparative methods represents the greatest achievement of modern evolutionary theory. Nevertheless, the rapid evolution of major features in the eukaryotic stem group requires the consideration of an alternative framework. Such a framework, based on the contingent nature of these evolutionary events, is developed and illustrated with three examples: the putative intron proliferation leading to the nucleus and the cell cycle; conflict and cooperation in the origin of eukaryotic bioenergetics; and the inter-relationship between aerobic metabolism, sterol synthesis, membranes, and sex. The modern synthesis thus provides sufficient scope to develop an evolutionary framework to understand the origin of eukaryotes.
Full Text Available Dental caries has a polymicrobial etiology within the complex oral microbial ecosystem. However, the overall diversity and structure of supragingival plaque microbiota in adult dental health and caries are not well understood. Here, 160 supragingival plaque samples from patients with dental health and different severities of dental caries were collected for bacterial genomic DNA extraction, pyrosequencing by amplification of the 16S rDNA V1–V3 hypervariable regions, and bioinformatic analysis. High-quality sequences (2,261,700 clustered into 10,365 operational taxonomic units (OTUs; 97% identity, representing 453 independent species belonging to 122 genera, 66 families, 34 orders, 21 classes, and 12 phyla. All groups shared 7522 OTUs, indicating the presence of a core plaque microbiome. Smooth rarefaction curves were suggestive of plaque microbial diversity. α diversity analysis showed that healthy plaque microbial diversity exceeded that of dental caries, with the diversity decreasing gradually with the severity of caries. The dominant phyla of plaque microbiota included Bacteroidetes, Actinobacteria, Proteobacteria, Firmicutes, Fusobacteria, and TM7. The dominant genera included Capnocytophaga, Prevotella, Actinomyces, Corynebacterium, Neisseria, Streptococcus, Rothia, and Leptotrichia. β diversity analysis showed that the plaque microbial community structure was similar in all groups and that group members were relatively constant, only showing differences in abundance. Analysis of composition differences identified 10 health-related and 21 caries-related genera. Key genera (27 that potentially contributed to plaque microbiota distributions between groups were identified. Finally, co-occurrence network analysis and function prediction were performed. Treatment strategies directed toward modulating microbial interactions and their functional output should be further developed.
Chen, Lifeng; Liu, Hongfang; Friedman, Carol
With more and more scientific literature published online, the effective management and reuse of this knowledge has become problematic. Natural language processing (NLP) may be a potential solution by extracting, structuring and organizing biomedical information in online literature in a timely manner. One essential task is to recognize and identify genomic entities in text. 'Recognition' can be accomplished using pattern matching and machine learning. But for 'identification' these techniques are not adequate. In order to identify genomic entities, NLP needs a comprehensive resource that specifies and classifies genomic entities as they occur in text and that associates them with normalized terms and also unique identifiers so that the extracted entities are well defined. Online organism databases are an excellent resource to create such a lexical resource. However, gene name ambiguity is a serious problem because it affects the appropriate identification of gene entities. In this paper, we explore the extent of the problem and suggest ways to address it. We obtained gene information from 21 organisms and quantified naming ambiguities within species, across species, with English words and with medical terms. When the case (of letters) was retained, official symbols displayed negligible intra-species ambiguity (0.02%) and modest ambiguities with general English words (0.57%) and medical terms (1.01%). In contrast, the across-species ambiguity was high (14.20%). The inclusion of gene synonyms increased intra-species ambiguity substantially and full names contributed greatly to gene-medical-term ambiguity. A comprehensive lexical resource that covers gene information for the 21 organisms was then created and used to identify gene names by using a straightforward string matching program to process 45,000 abstracts associated with the mouse model organism while ignoring case and gene names that were also English words. We found that 85.1% of correctly retrieved mouse
Intensive care unit environmental surfaces are contaminated by multidrug-resistant bacteria in biofilms: combined results of conventional culture, pyrosequencing, scanning electron microscopy, and confocal laser microscopy.
Hu, H; Johani, K; Gosbell, I B; Jacombs, A S W; Almatroudi, A; Whiteley, G S; Deva, A K; Jensen, S; Vickery, K
Hospital-associated infections cause considerable morbidity and mortality, and are expensive to treat. Organisms causing these infections can be sourced from the inanimate environment around a patient. Could the difficulty in eradicating these organisms from the environment be because they reside in dry surface biofilms? The intensive care unit (ICU) of a tertiary referral hospital was decommissioned and the opportunity to destructively sample clinical surfaces was taken in order to investigate whether multidrug-resistant organisms (MDROs) had survived the decommissioning process and whether they were present in biofilms. The ICU had two 'terminal cleans' with 500 ppm free chlorine solution; items from bedding, surrounds, and furnishings were then sampled with cutting implements. Sections were sonicated in tryptone soya broth and inoculated on to chromogenic plates to demonstrate MDROs, which were confirmed with the Vitek2 system. Genomic DNA was extracted directly from ICU samples, and subjected to polymerase chain reaction (PCR) for femA to detect Staphylococcus aureus and the microbiome by bacterial tag-encoded FLX amplicon pyrosequencing. Confocal laser scanning microscopy (CLSM) and scanning electron microscopy (SEM) were performed on environmental samples. Multidrug-resistant bacteria were cultured from 52% (23/44) of samples cultured. S. aureus PCR was positive in 50%. Biofilm was demonstrated in 93% (41/44) of samples by CLSM and/or SEM. Pyrosequencing demonstrated that the biofilms were polymicrobial and contained species that had multidrug-resistant strains. Dry surface biofilms containing MDROs are found on ICU surfaces despite terminal cleaning with chlorine solution. How these arise and how they might be removed requires further study. Copyright © 2015 The Healthcare Infection Society. Published by Elsevier Ltd. All rights reserved.
Trubitsyna, Maryia; Michlewski, Gracjan; Finnegan, David J; Elfick, Alistair; Rosser, Susan J; Richardson, Julia M; French, Christopher E
Delivery of DNA to cells and its subsequent integration into the host genome is a fundamental task in molecular biology, biotechnology and gene therapy. Here we describe an IP-free one-step method that enables stable genome integration into either prokaryotic or eukaryotic cells. A synthetic mariner transposon is generated by flanking a DNA sequence with short inverted repeats. When purified recombinant Mos1 or Mboumar-9 transposase is co-transfected with transposon-containing plasmid DNA, it penetrates prokaryotic or eukaryotic cells and integrates the target DNA into the genome. In vivo integrations by purified transposase can be achieved by electroporation, chemical transfection or Lipofection of the transposase:DNA mixture, in contrast to other published transposon-based protocols which require electroporation or microinjection. As in other transposome systems, no helper plasmids are required since transposases are not expressed inside the host cells, thus leading to generation of stable cell lines. Since it does not require electroporation or microinjection, this tool has the potential to be applied for automated high-throughput creation of libraries of random integrants for purposes including gene knock-out libraries, screening for optimal integration positions or safe genome locations in different organisms, selection of the highest production of valuable compounds for biotechnology, and sequencing. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Shemi, Adva; Ben-Dor, Shifra; Vardi, Assaf
Aquatic photosynthetic eukaryotes represent highly diverse groups (green, red, and chromalveolate algae) derived from multiple endosymbiosis events, covering a wide spectrum of the tree of life. They are responsible for about 50% of the global photosynthesis and serve as the foundation for oceanic and fresh water food webs. Although the ecophysiology and molecular ecology of some algal species are extensively studied, some basic aspects of algal cell biology are still underexplored. The recent wealth of genomic resources from algae has opened new frontiers to decipher the role of cell signaling pathways and their function in an ecological and biotechnological context. Here, we took a bioinformatic approach to explore the distribution and conservation of TOR and autophagy-related (ATG) proteins (Atg in yeast) in diverse algal groups. Our genomic analysis demonstrates conservation of TOR and ATG proteins in green algae. In contrast, in all 5 available red algal genomes, we could not detect the sequences that encode for any of the 17 core ATG proteins examined, albeit TOR and its interacting proteins are conserved. This intriguing data suggests that the autophagy pathway is not conserved in red algae as it is in the entire eukaryote domain. In contrast, chromalveolates, despite being derived from the red-plastid lineage, retain and express ATG genes, which raises a fundamental question regarding the acquisition of ATG genes during algal evolution. Among chromalveolates, Emiliania huxleyi (Haptophyta), a bloom-forming coccolithophore, possesses the most complete set of ATG genes, and may serve as a model organism to study autophagy in marine protists with great ecological significance. PMID:25915714
Zhixin Zhao; Cheng Guo; Sreeskandarajan Sutharzan; Pei Li; Craig Echt; Jie Zhang; Chun Liang
Tandem repeats (TRs) extensively exist in the genomes of prokaryotes and eukaryotes. Based on the sequenced genomes and gene annotations of 31 plant and algal species in Phytozome version 8.0 (http://www.phytozome.net/), we examined TRs in a genome-wide scale, characterized their distributions and motif features, and explored their putative biological functions. Among...
Modern biological analyses are often assisted by recent technologies making the sequencing of complex genomes both technically possible and feasible. We recently sequenced the tomato genome that, like many eukaryotic genomes, is large and complex. Current sequencing technologies allow the developmen...
Foulongne-Oriol, M.; Murat, C.; Castanera, R.; Ramírez, L.; Sonnenberg, A.S.M.
Repetitive DNA elements are ubiquitous constituents of eukaryotic genomes. The biological roles of these repetitive elements, supposed to impact genome organization and evolution, are not completely elucidated yet. The availability of whole genome sequence offers the opportunity to draw a picture of
Ling, Zongxin; Kong, Jianming; Jia, Peng; Wei, Chaochun; Wang, Yuezhu; Pan, Zhiwen; Huang, Wujing; Li, Lanjuan; Chen, Hui; Xiang, Charlie
Oral microbiota plays a vital role in maintaining the homeostasis of oral cavity. Dental caries are among the most common oral diseases in children and pathogenic bacteria contribute to the development of the disease. However, the overall structure of bacterial communities in the oral cavity from children with dental caries has not been explored deeply heretofore. We used high-throughput barcoded pyrosequencing and PCR-denaturing gradient gel electrophoresis (DGGE) to examine bacterial diversity of oral microbiota in saliva and supragingival plaques from 60 children aged 3 to 6 years old with and without dental caries from China. The multiplex barcoded pyrosequencing was performed in a single run, with multiple samples tagged uniquely by multiplex identifiers. As PCR-DGGE analysis is a conventional molecular ecological approach, this analysis was also performed on the same samples and the results of both approaches were compared. A total of 186,787 high-quality sequences were obtained for evaluating bacterial diversity and 41,905 unique sequences represented all phylotypes. We found that the oral microbiota in children was far more diverse than previous studies reported, and more than 200 genera belonging to ten phyla were found in the oral cavity. The phylotypes in saliva and supragingival plaques were significantly different and could be divided into two distinct clusters (p oral microbiome analyzed by PCR-DGGE and barcoded pyrosequencing was employed to cross validate the data sets. The genera of Streptococcus, Veillonella, Actinomyces, Granulicatella, Leptotrichia, and Thiomonas in plaques were significantly associated with dental caries (p oral microbiota allowed for a better understanding of oral microecosystem, and these pathogenic populations in plaque provide new insights into the etiology of dental caries and suggest new targets for interventions of the disease.
Boissy, Robert J; Romberger, Debra J; Roughead, William A; Weissenburger-Moser, Lisa; Poole, Jill A; LeVan, Tricia D
Inhalation of agricultural dusts causes inflammatory reactions and symptoms such as headache, fever, and malaise, which can progress to chronic airway inflammation and associated diseases, e.g. asthma, chronic bronchitis, chronic obstructive pulmonary disease, and hypersensitivity pneumonitis. Although in many agricultural environments feed particles are the major constituent of these dusts, the inflammatory responses that they provoke are likely attributable to particle-associated bacteria, archaebacteria, fungi, and viruses. In this study, we performed shotgun pyrosequencing metagenomic analyses of DNA from dusts from swine confinement facilities or grain elevators, with comparisons to dusts from pet-free households. DNA sequence alignment showed that 19% or 62% of shotgun pyrosequencing metagenomic DNA sequence reads from swine facility or household dusts, respectively, were of swine or human origin, respectively. In contrast only 2% of such reads from grain elevator dust were of mammalian origin. These metagenomic shotgun reads of mammalian origin were excluded from our analyses of agricultural dust microbiota. The ten most prevalent bacterial taxa identified in swine facility compared to grain elevator or household dust were comprised of 75%, 16%, and 42% gram-positive organisms, respectively. Four of the top five swine facility dust genera were assignable (Clostridium, Lactobacillus, Ruminococcus, and Eubacterium, ranging from 4% to 19% relative abundance). The relative abundances of these four genera were lower in dust from grain elevators or pet-free households. These analyses also highlighted the predominance in swine facility dust of Firmicutes (70%) at the phylum level, Clostridia (44%) at the Class level, and Clostridiales at the Order level (41%). In summary, shotgun pyrosequencing metagenomic analyses of agricultural dusts show that they differ qualitatively and quantitatively at the level of microbial taxa present, and that the bioinformatic analyses
Robert J Boissy
Full Text Available Inhalation of agricultural dusts causes inflammatory reactions and symptoms such as headache, fever, and malaise, which can progress to chronic airway inflammation and associated diseases, e.g. asthma, chronic bronchitis, chronic obstructive pulmonary disease, and hypersensitivity pneumonitis. Although in many agricultural environments feed particles are the major constituent of these dusts, the inflammatory responses that they provoke are likely attributable to particle-associated bacteria, archaebacteria, fungi, and viruses. In this study, we performed shotgun pyrosequencing metagenomic analyses of DNA from dusts from swine confinement facilities or grain elevators, with comparisons to dusts from pet-free households. DNA sequence alignment showed that 19% or 62% of shotgun pyrosequencing metagenomic DNA sequence reads from swine facility or household dusts, respectively, were of swine or human origin, respectively. In contrast only 2% of such reads from grain elevator dust were of mammalian origin. These metagenomic shotgun reads of mammalian origin were excluded from our analyses of agricultural dust microbiota. The ten most prevalent bacterial taxa identified in swine facility compared to grain elevator or household dust were comprised of 75%, 16%, and 42% gram-positive organisms, respectively. Four of the top five swine facility dust genera were assignable (Clostridium, Lactobacillus, Ruminococcus, and Eubacterium, ranging from 4% to 19% relative abundance. The relative abundances of these four genera were lower in dust from grain elevators or pet-free households. These analyses also highlighted the predominance in swine facility dust of Firmicutes (70% at the phylum level, Clostridia (44% at the Class level, and Clostridiales at the Order level (41%. In summary, shotgun pyrosequencing metagenomic analyses of agricultural dusts show that they differ qualitatively and quantitatively at the level of microbial taxa present, and that the
Amoako, Kingsley K; Janzen, Timothy W; Shields, Michael J; Hahn, Kristen R; Thomas, Matthew C; Goji, Noriko
The development of advanced methodologies for the detection of Bacillus anthracis has been evolving rapidly since the release of the anthrax spores in the mail in 2001. Recent advances in detection and identification techniques could prove to be an essential component in the defense against biological attacks. Sequence based such as pyrosequencing, which has the capability to determine short DNA stretches in real-time using biotinylated PCR amplicons, has potential biodefense applications. Using markers from the virulence plasmids (pXO1 and pXO2) and chromosomal regions, we have demonstrated the power of this technology in the rapid, specific and sensitive detection of B. anthracis spores in food matrices including milk, juice, bottled water, and processed meat. The combined use of immunomagnetic separation and pyrosequencing showed positive detection when liquid foods (bottled water, milk, juice), and processed meat were experimentally inoculated with 6CFU/mL and 6CFU/g, respectively, without an enrichment step. Pyrosequencing is completed in about 60min (following PCR amplification) and yields accurate and reliable results with an added layer of confidence. The entire assay (from sample preparation to sequencing information) can be completed in about 7.5h. A typical run on food samples yielded 67-80bp reads with 94-100% identity to the expected sequence. This sequence based approach is a novel application for the detection of anthrax spores in food with potential application in foodborne bioterrorism response and biodefense involving the use of anthrax spores. Crown Copyright © 2013. Published by Elsevier B.V. All rights reserved.
Mougiakos, Ioannis; Mohanraju, Prarthana; Bosma, Elleke F.; Vrouwe, Valentijn; Finger Bou, Max; Naduthodi, Mihris I.S.; Gussak, Alex; Brinkman, Rudolf B.L.; Kranenburg, Van Richard; Oost, Van Der John
CRISPR-Cas9-based genome engineering tools have revolutionized fundamental research and biotechnological exploitation of both eukaryotes and prokaryotes. However, the mesophilic nature of the established Cas9 systems does not allow for applications that require enhanced stability, including
Zhang, Guojie; Wang, Hongsheng; Shi, Junjie
melanogaster, Anopheles gambiae, Bombyx mori, Tribolium castaneum, and Apis mellifera were compared to the complete genomes of three non-insect eukaryotes (opisthokonts) Homo sapiens, Caenorhabditis elegans and Saccharomyces cerevisiae. This operation yielded 154 groups of orthologous proteins in Drosophila...
Mougiakos, Ioannis; Mohanraju, Prarthana; Bosma, Elleke Fenna
CRISPR-Cas9-based genome engineering tools have revolutionized fundamental research and biotechnological exploitation of both eukaryotes and prokaryotes. However, the mesophilic nature of the established Cas9 systems does not allow for applications that require enhanced stability, including...
Full Text Available DNA replication origins strikingly differ between eukaryotic species and cell types. Origins are localized and can be highly efficient in budding yeast, are randomly located in early fly and frog embryos, which do not transcribe their genomes, and are clustered in broad (10-100 kb non-transcribed zones, frequently abutting transcribed genes, in mammalian cells. Nonetheless, in all cases, origins are established during the G1-phase of the cell cycle by the loading of double hexamers of the Mcm 2-7 proteins (MCM DHs, the core of the replicative helicase. MCM DH activation in S-phase leads to origin unwinding, polymerase recruitment, and initiation of bidirectional DNA synthesis. Although MCM DHs are initially loaded at sites defined by the binding of the origin recognition complex (ORC, they ultimately bind chromatin in much greater numbers than ORC and only a fraction are activated in any one S-phase. Data suggest that the multiplicity and functional redundancy of MCM DHs provide robustness to the replication process and affect replication time and that MCM DHs can slide along the DNA and spread over large distances around the ORC. Recent studies further show that MCM DHs are displaced along the DNA by collision with transcription complexes but remain functional for initiation after displacement. Therefore, eukaryotic DNA replication relies on intrinsically mobile and flexible origins, a strategy fundamentally different from bacteria but conserved from yeast to human. These properties of MCM DHs likely contribute to the establishment of broad, intergenic replication initiation zones in higher eukaryotes.
DNA replication origins strikingly differ between eukaryotic species and cell types. Origins are localized and can be highly efficient in budding yeast, are randomly located in early fly and frog embryos, which do not transcribe their genomes, and are clustered in broad (10-100 kb) non-transcribed zones, frequently abutting transcribed genes, in mammalian cells. Nonetheless, in all cases, origins are established during the G1-phase of the cell cycle by the loading of double hexamers of the Mcm 2-7 proteins (MCM DHs), the core of the replicative helicase. MCM DH activation in S-phase leads to origin unwinding, polymerase recruitment, and initiation of bidirectional DNA synthesis. Although MCM DHs are initially loaded at sites defined by the binding of the origin recognition complex (ORC), they ultimately bind chromatin in much greater numbers than ORC and only a fraction are activated in any one S-phase. Data suggest that the multiplicity and functional redundancy of MCM DHs provide robustness to the replication process and affect replication time and that MCM DHs can slide along the DNA and spread over large distances around the ORC. Recent studies further show that MCM DHs are displaced along the DNA by collision with transcription complexes but remain functional for initiation after displacement. Therefore, eukaryotic DNA replication relies on intrinsically mobile and flexible origins, a strategy fundamentally different from bacteria but conserved from yeast to human. These properties of MCM DHs likely contribute to the establishment of broad, intergenic replication initiation zones in higher eukaryotes.
Full Text Available BACKGROUND: The replication of DNA in Archaea and eukaryotes requires several ancillary complexes, including proliferating cell nuclear antigen (PCNA, replication factor C (RFC, and the minichromosome maintenance (MCM complex. Bacterial DNA replication utilizes comparable proteins, but these are distantly related phylogenetically to their archaeal and eukaryotic counterparts at best. METHODOLOGY/PRINCIPAL FINDINGS: While the structures of each of the complexes do not differ significantly between the archaeal and eukaryotic versions thereof, the evolutionary dynamic in the two cases does. The number of subunits in each complex is constant across all taxa. However, they vary subtly with regard to composition. In some taxa the subunits are all identical in sequence, while in others some are homologous rather than identical. In the case of eukaryotes, there is no phylogenetic variation in the makeup of each complex-all appear to derive from a common eukaryotic ancestor. This is not the case in Archaea, where the relationship between the subunits within each complex varies taxon-to-taxon. We have performed a detailed phylogenetic analysis of these relationships in order to better understand the gene duplications and divergences that gave rise to the homologous subunits in Archaea. CONCLUSION/SIGNIFICANCE: This domain level difference in evolution suggests that different forces have driven the evolution of DNA replication proteins in each of these two domains. In addition, the phylogenies of all three gene families support the distinctiveness of the proposed archaeal phylum Thaumarchaeota.
Domínguez-Contreras, J. F.; Munguía-Vega, A.; Ceballos-Vázquez, B. P.; Arellano-Martínez, M.; Culver, Melanie
We characterized 22 novel microsatellite loci in the two-spotted octopus Octopus bimaculatus using 454 pyrosequencing reads. All loci were polymorphic and will be used in studies of marine connectivity aimed at increasing sustainability of the resource. The mean number alleles per locus was 13.09 (range 7–19) and observed heterozygosities ranged from 0.50 to 1.00. Four loci pairs were linked and three deviated from Hardy–Weinberg equilibrium. Eighteen and 12 loci were polymorphic in Octopus bimaculoides and Octopus hubbsorum, respectively.
Huang, Bi; Bao, Lang; Zhong, Qi; Shang, Zheng-ling; Zhang, Hui-dong; Zhang, Ying
To construct the eukaryotic experssion vector of LipL32 gene from Leptospira serovar Lai and express the recombinant plasmid in COS-7 cell. The LipL32 gene was amplified from Leptospira strain 017 genomic DNA by PCR and cloned into pcDNA3.1, through restriction nuclease enzyme digestion. Then the recombinant plasmid was transformed into E.coli DH5alpha. After identified by nuclease digestion, PCR and sequencing analysis, the recombinant vector was transfected into COS-7 cell with lipsome. The expression of the target gene was detected by RT-PCR and Western blot. The eukaryotic experssion vector pcDNA3.1-LipL32 was successfully constructed and stably expressed in COS-7 cell. The eukaryotic recombinant vector of outer membrane protein LipL32 gene from Leptospira serovar Lai can be expressed in mammalian cell, which provides an experimental basis for the application of the Leptospira DNA vaccine.
Beck, Rose C; Kohn, Debra J; Tuohy, Marion J; Prayson, Richard A; Yen-Lieberman, Belinda; Procop, Gary W
We evaluated 2 methods, a LightCycler PCR assay and pyrosequencing for the detection of the JC polyoma virus (JCV) in fixed brain tissue of 10 patients with and 3 control patients without progressive multifocal leukoencephalopathy (PML). Nucleic acid extraction was performed after deparaffinization and proteinase K digestion. The LightCycler assay differentiates the BK virus (BKV), JCV, and SV40 using melt curve analysis. Conventional PCR was used with the same primers to generate products for pyrosequencing. Two sequencing primers were used that differentiate the polyoma viruses. Seven of 11 biopsies (1 patient had 2 biopsies) with PML were positive for JCV by real-time PCR and/or PCR/pyrosequencing. Three of 4 remaining biopsies were positive by real-time PCR but had melting points between JCV and SV40. The 4 specimens that were negative or atypical by LightCycler PCR were positive by traditional PCR, but 1 had an amplicon of lower molecular weight by gel electrophoresis. These were shown to represent JCV by at least 1 of the 2 pyrosequencing primers. The biopsies from patients without PML were PCR negative. Both the LightCycler and pyrosequencing assays are useful for confirming JCV in brain biopsies from patients with PML, but variant JCVs may require supplementary methods to confirm JCV infection.
Frenkel, Svetlana; Kirzhner, Valery; Korol, Abraham
Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.
Full Text Available Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.
Mitra, Sanga; Das, Pijush; Samadder, Arpa; Das, Smarajit; Betai, Rupal; Chakrabarti, Jayprokas
During translation, aminoacyl-tRNA synthetases recognize the identities of the tRNAs to charge them with their respective amino acids. The conserved identities of 58,244 eukaryotic tRNAs of 24 invertebrates and 45 vertebrates in genomic tRNA database were analyzed and their novel features extracted. The internal promoter sequences, namely, A-Box and B-Box, were investigated and evidence gathered that the intervention of optional nucleotides at 17a and 17b correlated with the optimal length of the A-Box. The presence of canonical transcription terminator sequences at the immediate vicinity of tRNA genes was ventured. Even though non-canonical introns had been reported in red alga, green alga, and nucleomorph so far, fairly motivating evidence of their existence emerged in tRNA genes of other eukaryotes. Non-canonical introns were seen to interfere with the internal promoters in two cases, questioning their transcription fidelity. In a first of its kind, phylogenetic constructs based on tRNA molecules delineated and built the trees of the vast and diverse invertebrates and vertebrates. Finally, two tRNA models representing the invertebrates and the vertebrates were drawn, by isolating the dominant consensus in the positional fluctuations of nucleotide compositions.
Smith, Alias J; Chudnovsky, Lorissa; Simoes-Barbosa, Augusto; Delgadillo-Correa, Maria G; Jonsson, Zophonias O; Wohlschlegel, James A; Johnson, Patricia J
A highly conserved DNA initiator (Inr) element has been the only core promoter element described in the divergent unicellular eukaryote Trichomonas vaginalis, although genome analyses reveal that only ∼75% of protein-coding genes appear to contain an Inr. In search of another core promoter element(s), a nonredundant database containing 5' untranslated regions of expressed T. vaginalis genes was searched for overrepresented DNA motifs and known eukaryotic core promoter elements. In addition to identifying the Inr, two elements that lack sequence similarity to the known protein-coding gene core promoter, motif 3 (M3) and motif 5 (M5), were identified. Mutational and functional analyses demonstrate that both are novel core promoter elements. M3 [(A/G/T)(A/G)C(G/C)G(T/C)T(T/A/G)] resembles a Myb recognition element (MRE) and is bound specifically by a unique protein with a Myb-like DNA binding domain. The M5 element (CCTTT) overlaps the transcription start site and replaces the Inr as an alternative, gene-specific initiator element. Transcription specifically initiates at the second cytosine within M5, in contrast to characteristic initiation by RNA polymerase II at an adenosine. In promoters that combine M3 with either M5 or Inr, transcription initiation is regulated by the M3 motif.
Smith, Alias J.; Chudnovsky, Lorissa; Simoes-Barbosa, Augusto; Delgadillo-Correa, Maria G.; Jonsson, Zophonias O.; Wohlschlegel, James A.; Johnson, Patricia J.
A highly conserved DNA initiator (Inr) element has been the only core promoter element described in the divergent unicellular eukaryote Trichomonas vaginalis, although genome analyses reveal that only ∼75% of protein-coding genes appear to contain an Inr. In search of another core promoter element(s), a nonredundant database containing 5′ untranslated regions of expressed T. vaginalis genes was searched for overrepresented DNA motifs and known eukaryotic core promoter elements. In addition to identifying the Inr, two elements that lack sequence similarity to the known protein-coding gene core promoter, motif 3 (M3) and motif 5 (M5), were identified. Mutational and functional analyses demonstrate that both are novel core promoter elements. M3 [(A/G/T)(A/G)C(G/C)G(T/C)T(T/A/G)] resembles a Myb recognition element (MRE) and is bound specifically by a unique protein with a Myb-like DNA binding domain. The M5 element (CCTTT) overlaps the transcription start site and replaces the Inr as an alternative, gene-specific initiator element. Transcription specifically initiates at the second cytosine within M5, in contrast to characteristic initiation by RNA polymerase II at an adenosine. In promoters that combine M3 with either M5 or Inr, transcription initiation is regulated by the M3 motif. PMID:21245378
Larson, Eric T.; Kim, Jessica E.; Napuli, Alberto J.; Verlinde, Christophe L. M. J.; Fan, Erkang; Zucker, Frank H.; Van Voorhis, Wesley C.; Buckner, Frederick S.; Hol, Wim G. J.; Merritt, Ethan A., E-mail: firstname.lastname@example.org [Medical Structural Genomics of Pathogenic Protozoa, (United States); University of Washington, Seattle, WA 98195 (United States)
The structure of Giardia prolyl-tRNA synthetase cocrystallized with proline and ATP shows evidence for half-of-the-sites activity, leading to a corresponding mixture of reaction substrates and product (prolyl-AMP) in the two active sites of the dimer. The genome of the human intestinal parasite Giardia lamblia contains only a single aminoacyl-tRNA synthetase gene for each amino acid. The Giardia prolyl-tRNA synthetase gene product was originally misidentified as a dual-specificity Pro/Cys enzyme, in part owing to its unexpectedly high off-target activation of cysteine, but is now believed to be a normal representative of the class of archaeal/eukaryotic prolyl-tRNA synthetases. The 2.2 Å resolution crystal structure of the G. lamblia enzyme presented here is thus the first structure determination of a prolyl-tRNA synthetase from a eukaryote. The relative occupancies of substrate (proline) and product (prolyl-AMP) in the active site are consistent with half-of-the-sites reactivity, as is the observed biphasic thermal denaturation curve for the protein in the presence of proline and MgATP. However, no corresponding induced asymmetry is evident in the structure of the protein. No thermal stabilization is observed in the presence of cysteine and ATP. The implied low affinity for the off-target activation product cysteinyl-AMP suggests that translational fidelity in Giardia is aided by the rapid release of misactivated cysteine.
Full Text Available This paper is a review of currently available data concerning interactions of tRNAs with the eukaryotic ribosome at various stages of translation. These data include the results obtained by means of cryo-electron microscopy and X-ray crystallography applied to various model ribosomal complexes, site-directed cross-linking with the use of tRNA derivatives bearing chemically or photochemically reactive groups in the CCA-terminal fragment and chemical probing of 28S rRNA in the region of the peptidyl transferase center. Similarities and differences in the interactions of tRNAs with prokaryotic and eukaryotic ribosomes are discussed with concomitant consideration of the extent of resemblance between molecular mechanisms of translation in eukaryotes and bacteria.
He, Mingyue; Edwards, Bryan M; Kastelic, Damjana; Taussig, Michael J
Ribosome display is a cell-free display technology for in vitro selection and optimisation of proteins from large diversified libraries. It operates through the formation of stable protein-ribosome-mRNA (PRM) complexes and selection of ligand-binding proteins, followed by DNA recovery from the selected genetic information. Both prokaryotic and eukaryotic ribosome display systems have been developed. In this chapter, we describe the eukaryotic rabbit reticulocyte method in which a distinct in situ single-primer RT-PCR procedure is used to recover DNA from the selected PRM complexes without the need for prior disruption of the ribosome.
Pedersen, Anders Gorm; Baldi, Pierre; Brunak, Søren
In this paper we utilize hidden Markov models (HMMs) and information theory to analyze prokaryotic and eukaryotic promoters. We perform this analysis with special emphasis on the fact that promoters are divided into a number of different classes, depending on which polymerase-associated factors...... that bind to them. We find that HMMs trained on such subclasses of Escherichia coli promoters (specifically, the so-called sigma-70 and sigma-54 classes) give an excellent classification of unknown promoters with respect to sigma-class. HMMs trained on eukaryotic sequences from human genes also model nicely...
Pedersen, Anders Gorm; Baldi, P.; Chauvin, Y.
In this paper we utilize hidden Markov models (HMMs) and information theory to analyze prokaryotic and eukaryotic promoters. We perform this analysis with special emphasis on the fact that promoters are divided into a number of different classes, depending on which polymerase-associated factors...... that bind to them. We find that HMMs trained on such subclasses of Escherichia coli promoters (specifically, the so-called sigma 70 and sigma 54 classes) give an excellent classification of unknown promoters with respect to sigma-class. HMMs trained on eukaryotic sequences from human genes also model nicely...
Nicholas J Hudson
Full Text Available Codon bias in the genome of an organism influences its phenome by changing the speed and efficiency of mRNA translation and hence protein abundance. We hypothesized that differences in codon bias, either between-species differences in orthologous genes, or within-species differences between genes, may play an evolutionary role. To explore this hypothesis, we compared the genome-wide codon bias in six species that occupy vital positions in the Eukaryotic Tree of Life. We acquired the entire protein coding sequences for these organisms, computed the codon bias for all genes in each organism and explored the output for relationships between codon bias and protein function, both within- and between-lineages. We discovered five notable coordinated patterns, with extreme codon bias most pronounced in traits considered highly characteristic of a given lineage. Firstly, the Homo sapiens genome had stronger codon bias for DNA-binding transcription factors than the Saccharomyces cerevisiae genome, whereas the opposite was true for ribosomal proteins--perhaps underscoring transcriptional regulation in the origin of complexity. Secondly, both mammalian species examined possessed extreme codon bias in genes relating to hair--a tissue unique to mammals. Thirdly, Arabidopsis thaliana showed extreme codon bias in genes implicated in cell wall formation and chloroplast function--which are unique to plants. Fourthly, Gallus gallus possessed strong codon bias in a subset of genes encoding mitochondrial proteins--perhaps reflecting the enhanced bioenergetic efficiency in birds that co-evolved with flight. And lastly, the G. gallus genome had extreme codon bias for the Ciliary Neurotrophic Factor--which may help to explain their spontaneous recovery from deafness. We propose that extreme codon bias in groups of genes that encode functionally related proteins has a pathway-level energetic explanation.
Full Text Available Abstract Background Free-living amoebae serve as a natural reservoir for some bacteria that have evolved into «amoeba-resistant» bacteria. Among these, some are strictly intra-amoebal, such as Candidatus "Protochlamydia amoebophila" (Candidatus "P. amoebophila", whose genomic sequence is available. We sequenced the genome of Legionella drancourtii (L. drancourtii, another recently described intra-amoebal bacterium. By comparing these two genomes with those of their closely related species, we were able to study the genetic characteristics specific to their amoebal lifestyle. Findings We identified a sterol delta-7 reductase-encoding gene common to these two bacteria and absent in their relatives. This gene encodes an enzyme which catalyses the last step of cholesterol biosynthesis in eukaryotes, and is probably functional within L. drancourtii since it is transcribed. The phylogenetic analysis of this protein suggests that it was acquired horizontally by a few bacteria from viridiplantae. This gene was also found in the Acanthamoeba polyphaga Mimivirus genome, a virus that grows in amoebae and possesses the largest viral genome known to date. Conclusion L. drancourtii acquired a sterol delta-7 reductase-encoding gene of viridiplantae origin. The most parsimonious hypothesis is that this gene was initially acquired by a Chlamydiales ancestor parasite of plants. Subsequently, its descendents transmitted this gene in amoebae to other intra-amoebal microorganisms, including L. drancourtii and Coxiella burnetii. The role of the sterol delta-7 reductase in prokaryotes is as yet unknown but we speculate that it is involved in host cholesterol parasitism.
Zellweger, Ralph; Lopes, Massimo
The DNA replication process can be heavily perturbed by several different conditions of genotoxic stress, particularly relevant for cancer onset and therapy. The combination of psoralen crosslinking and electron microscopy has proven instrumental to reveal the fine architecture of in vivo DNA replication intermediates and to uncover their remodeling upon specific conditions of genotoxic stress. The replication structures are stabilized in vivo (by psoralen crosslinking) prior to extraction and enrichment procedures, allowing their visualization at the transmission electron microscope. This chapter outlines the procedures required to visualize and interpret in vivo replication intermediates of eukaryotic genomic DNA, and includes an improved method for enrichment of replication intermediates, compared to previously used BND-cellulose columns.
Sliva, Anna; Yang, Huanming; Boeke, Jef D.; Mathews, Debra J. H.
First introduced in 2011, the Synthetic Yeast Genome (Sc2.0) Project is a large international synthetic genomics project that will culminate in the first eukaryotic cell (Saccharomyces cerevisiae) with a fully synthetic genome. With collaborators from across the globe and from a range of institutions spanning from do-it-yourself biology (DIYbio) to commercial enterprises, it is important that all scientists working on this project are cognizant of the ethical and policy issues associated with...
Brown, David S; Jarman, Simon N; Symondson, William O C
Little quantitative ecological information exists on the diets of most invertebrate feeding reptiles, particularly nocturnal or elusive species that are difficult to observe. In the UK and elsewhere, reptiles are legally required to be relocated before land development can proceed, but without knowledge of their dietary requirements, the suitability of receptor sites cannot be known. Here, we tested the ability of non-invasive DNA-based molecular diagnostics (454 pyrosequencing) to analyse reptile diets, with the specific aims of determining which earthworm species are exploited by slow worms (the legless lizard Anguis fragilis) and whether they feed on the deeper-living earthworm species that only come to the surface at night. Slow worm faecal samples from four different habitats were analysed using earthworm-specific PCR primers. We found that 86% of slow worms (N=80) had eaten earthworms. In lowland heath and marshy/acid grassland, Lumbricus rubellus, a surface-dwelling epigeic species, dominated slow worm diet. In two other habitats, riverside pasture and calciferous coarse grassland, diet was dominated by deeper-living anecic and endogeic species. We conclude that all species of earthworm are exploited by these reptiles and lack of specialization allows slow worms to thrive in a wide variety of habitats. Pyrosequencing of prey DNA in faeces showed promise as a practical, rapid and relatively inexpensive means of obtaining detailed and valuable ecological information on the diets of reptiles. © 2011 Blackwell Publishing Ltd.
Ye, Lin; Zhang, Tong
This study applied 454 high-throughput pyrosequencing to analyze potentially pathogenic bacteria in activated sludge from 14 municipal wastewater treatment plants (WWTPs) across four countries (China, U.S., Canada, and Singapore), plus the influent and effluent of one of the 14 WWTPs. A total of 370,870 16S rRNA gene sequences with average length of 207 bps were obtained and all of them were assigned to corresponding taxonomic ranks by using RDP classifier and MEGAN. It was found that the most abundant potentially pathogenic bacteria in the WWTPs were affiliated with the genera of Aeromonas and Clostridium. Aeromonas veronii, Aeromonas hydrophila, and Clostridium perfringens were species most similar to the potentially pathogenic bacteria found in this study. Some sequences highly similar (>99%) to Corynebacterium diphtheriae were found in the influent and activated sludge samples from a saline WWTP. Overall, the percentage of the sequences closely related (>99%) to known pathogenic bacteria sequences was about 0.16% of the total sequences. Additionally, a platform-independent Java application (BAND) was developed for graphical visualization of the data of microbial abundance generated by high-throughput pyrosequencing. The approach demonstrated in this study could examine most of the potentially pathogenic bacteria simultaneously instead of one-by-one detection by other methods.
Full Text Available BACKGROUND: Given the continuing co-circulation of the 2009 H1N1 pandemic influenza A viruses with seasonal H3N2 viruses, rapid and reliable detection of newly emerging influenza reassortant viruses is important to enhance our influenza surveillance. METHODOLOGY/PRINCIPAL FINDINGS: A novel pyrosequencing assay was developed for the rapid identification and subtyping of potential human influenza A virus reassortants based on all eight gene segments of the virus. Except for HA and NA genes, one universal set of primers was used to amplify and subtype each of the six internal genes. With this method, all eight gene segments of 57 laboratory isolates and 17 original specimens of seasonal H1N1, H3N2 and 2009 H1N1 pandemic viruses were correctly matched with their corresponding subtypes. In addition, this method was shown to be capable of detecting reassortant viruses by correctly identifying the source of all 8 gene segments from three vaccine production reassortant viruses and three H1N2 viruses. CONCLUSIONS/SIGNIFICANCE: In summary, this pyrosequencing assay is a sensitive and specific procedure for screening large numbers of viruses for reassortment events amongst the commonly circulating human influenza A viruses, which is more rapid and cheaper than using conventional sequencing approaches.
Deng, Yi-Mo; Caldwell, Natalie; Barr, Ian G
Given the continuing co-circulation of the 2009 H1N1 pandemic influenza A viruses with seasonal H3N2 viruses, rapid and reliable detection of newly emerging influenza reassortant viruses is important to enhance our influenza surveillance. A novel pyrosequencing assay was developed for the rapid identification and subtyping of potential human influenza A virus reassortants based on all eight gene segments of the virus. Except for HA and NA genes, one universal set of primers was used to amplify and subtype each of the six internal genes. With this method, all eight gene segments of 57 laboratory isolates and 17 original specimens of seasonal H1N1, H3N2 and 2009 H1N1 pandemic viruses were correctly matched with their corresponding subtypes. In addition, this method was shown to be capable of detecting reassortant viruses by correctly identifying the source of all 8 gene segments from three vaccine production reassortant viruses and three H1N2 viruses. In summary, this pyrosequencing assay is a sensitive and specific procedure for screening large numbers of viruses for reassortment events amongst the commonly circulating human influenza A viruses, which is more rapid and cheaper than using conventional sequencing approaches.
Sundberg, Carina; Al-Soud, Waleed A; Larsson, Madeleine; Alm, Erik; Yekta, Sepehr S; Svensson, Bo H; Sørensen, Søren J; Karlsson, Anna
The microbial community of 21 full-scale biogas reactors was examined using 454 pyrosequencing of 16S rRNA gene sequences. These reactors included seven (six mesophilic and one thermophilic) digesting sewage sludge (SS) and 14 (ten mesophilic and four thermophilic) codigesting (CD) various combinations of wastes from slaughterhouses, restaurants, households, etc. The pyrosequencing generated more than 160,000 sequences representing 11 phyla, 23 classes, and 95 genera of Bacteria and Archaea. The bacterial community was always both more abundant and more diverse than the archaeal community. At the phylum level, the foremost populations in the SS reactors included Actinobacteria, Proteobacteria, Chloroflexi, Spirochetes, and Euryarchaeota, while Firmicutes was the most prevalent in the CD reactors. The main bacterial class in all reactors was Clostridia. Acetoclastic methanogens were detected in the SS, but not in the CD reactors. Their absence suggests that methane formation from acetate takes place mainly via syntrophic acetate oxidation in the CD reactors. A principal component analysis of the communities at genus level revealed three clusters: SS reactors, mesophilic CD reactors (including one thermophilic CD and one SS), and thermophilic CD reactors. Thus, the microbial composition was mainly governed by the substrate differences and the process temperature. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Barker, Christopher J; Gillett, Amber; Polkinghorne, Adam; Timms, Peter
As a dietary source, the foliage of Eucalyptus spp. is low in available protein and carbohydrate while containing polyphenolic compounds that interfere with enzymatic digestion. To overcome this, the koala (Phascolarctos cinereus) has evolved a range of anatomical and physiological adaptations to assist with digestion and absorption of nutrients from this food source. Microbial fermentation of partially digested eucalyptus leaves is thought to be critical in this process, however, little is known about the composition and diversity of microorganisms that are associated with digestive health in this native species. In this study, we performed 16S rRNA gene pyrosequencing of caecum, colon and faecal pellet samples from two wild, free ranging, Queensland koalas. Our results reveal a highly complex and diverse ecosystem with considerable intra-individual variation. Although samples were dominated by sequences from the Bacteroidetes and Firmicutes phyla there was considerable variation at the genus level. This study is the first non-culture based microbiota analysis, using 454-amplicon pyrosequencing, and provides preliminary data to expand our understanding of the koala hindgut. Copyright © 2013 Elsevier B.V. All rights reserved.
Rogozin, Igor B; Wolf, Yuri I; Sorokin, Alexander V; Mirkin, Boris G; Koonin, Eugene V
Sequencing of eukaryotic genomes allows one to address major evolutionary problems, such as the evolution of gene structure. We compared the intron positions in 684 orthologous gene sets from 8 complete genomes of animals, plants, fungi, and protists and constructed parsimonious scenarios of evolution of the exon-intron structure for the respective genes. Approximately one-third of the introns in the malaria parasite Plasmodium falciparum are shared with at least one crown group eukaryote; this number indicates that these introns have been conserved through >1.5 billion years of evolution that separate Plasmodium from the crown group. Paradoxically, humans share many more introns with the plant Arabidopsis thaliana than with the fly or nematode. The inferred evolutionary scenario holds that the common ancestor of Plasmodium and the crown group and, especially, the common ancestor of animals, plants, and fungi had numerous introns. Most of these ancestral introns, which are retained in the genomes of vertebrates and plants, have been lost in fungi, nematodes, arthropods, and probably Plasmodium. In addition, numerous introns have been inserted into vertebrate and plant genes, whereas, in other lineages, intron gain was much less prominent.
The transition from endosymbiont to organelle in eukaryotic cells involves the transfer of significant numbers of genes to the host genomes, a process known as endosymbiotic gene transfer (EGT). In the case of plastid organelles, EGTs have been shown to leave a footprint in the nuclear genome that can be indicative of ancient photosynthetic activity in present-day plastid-lacking organisms, or even hint at the existence of cryptic plastids. Here,we evaluated the impact of EGTon eukaryote genomes by reanalyzing the recently published EST dataset for Chromera velia, an interesting test case of a photosynthetic alga closely related to apicomplexan parasites. Previously, 513 genes were reported to originate from red and green algae in a 1:1 ratio. In contrast, by manually inspecting newly generated trees indicating putative algal ancestry, we recovered only 51 genes congruent with EGT, of which 23 and 9 were of red and green algal origin, respectively,whereas 19 were ambiguous regarding the algal provenance.Our approach also uncovered 109 genes that branched within a monocot angiosperm clade, most likely representing a contamination. We emphasize the lack of congruence and the subjectivity resulting from independent phylogenomic screens for EGT, which appear to call for extreme caution when drawing conclusions for major evolutionary events. 2012 The Author(s).
Gallagher Margaret L
Full Text Available Abstract Background Identification of CYP2A6 alleles associated with reduced enzyme activity is important in the study of inter-individual differences in drug metabolism. CYP2A6*12 is a hybrid allele that results from unequal crossover between CYP2A6 and CYP2A7 genes. The 5' regulatory region and exons 1–2 are derived from CYP2A7, and exons 3–9 are derived from CYP2A6. Conventional methods for detection of CYP2A6*12 consist of two-step PCR protocols that are laborious and unsuitable for high-throughput genotyping. We developed a rapid and accurate method to detect the CYP2A6*12 allele by Pyrosequencing technology. Methods A single set of PCR primers was designed to specifically amplify both the CYP2A6*1 wild-type allele and the CYP2A6*12 hybrid allele. An internal Pyrosequencing primer was used to generate allele-specific sequence information, which detected homozygous wild-type, heterozygous hybrid, and homozygous hybrid alleles. We first validated the assay on 104 DNA samples that were also genotyped by conventional two-step PCR and by cycle sequencing. CYP2A6*12 allele frequencies were then determined using the Pyrosequencing assay on 181 multi-ethnic DNA samples from subjects of African American, European Caucasian, Pacific Rim, and Hispanic descent. Finally, we streamlined the Pyrosequencing assay by integrating liquid handling robotics into the workflow. Results Pyrosequencing results demonstrated 100% concordance with conventional two-step PCR and cycle sequencing methods. Allele frequency data showed slightly higher prevalence of the CYP2A6*12 allele in European Caucasians and Hispanics. Conclusion This Pyrosequencing assay proved to be a simple, rapid, and accurate alternative to conventional methods, which can be easily adapted to the needs of higher-throughput studies.
Prest, Emmanuelle I E C
The combination of flow cytometry (FCM) and 16S rRNA gene pyrosequencing data was investigated for the purpose of monitoring and characterizing microbial changes in drinking water distribution systems. High frequency sampling (5min intervals for 1h) was performed at the outlet of a treatment plant and at one location in the full-scale distribution network. In total, 52 bulk water samples were analysed with FCM, pyrosequencing and conventional methods (adenosine-triphosphate, ATP; heterotrophic plate count, HPC). FCM and pyrosequencing results individually showed that changes in the microbial community occurred in the water distribution system, which was not detected with conventional monitoring. FCM data showed an increase in the total bacterial cell concentrations (from 345±15×103 to 425±35×103cellsmL-1) and in the percentage of intact bacterial cells (from 39±3.5% to 53±4.4%) during water distribution. This shift was also observed in the FCM fluorescence fingerprints, which are characteristic of each water sample. A similar shift was detected in the microbial community composition as characterized with pyrosequencing, showing that FCM and genetic fingerprints are congruent. FCM and pyrosequencing data were subsequently combined for the calculation of cell concentration changes for each bacterial phylum. The results revealed an increase in cell concentrations of specific bacterial phyla (e.g., Proteobacteria), along with a decrease in other phyla (e.g., Actinobacteria), which could not be concluded from the two methods individually. The combination of FCM and pyrosequencing methods is a promising approach for future drinking water quality monitoring and for advanced studies on drinking water distribution pipeline ecology. © 2014 Elsevier Ltd.
Prest, E I; El-Chakhtoura, J; Hammes, F; Saikaly, P E; van Loosdrecht, M C M; Vrouwenvelder, J S
The combination of flow cytometry (FCM) and 16S rRNA gene pyrosequencing data was investigated for the purpose of monitoring and characterizing microbial changes in drinking water distribution systems. High frequency sampling (5 min intervals for 1 h) was performed at the outlet of a treatment plant and at one location in the full-scale distribution network. In total, 52 bulk water samples were analysed with FCM, pyrosequencing and conventional methods (adenosine-triphosphate, ATP; heterotrophic plate count, HPC). FCM and pyrosequencing results individually showed that changes in the microbial community occurred in the water distribution system, which was not detected with conventional monitoring. FCM data showed an increase in the total bacterial cell concentrations (from 345 ± 15 × 10(3) to 425 ± 35 × 10(3) cells mL(-1)) and in the percentage of intact bacterial cells (from 39 ± 3.5% to 53 ± 4.4%) during water distribution. This shift was also observed in the FCM fluorescence fingerprints, which are characteristic of each water sample. A similar shift was detected in the microbial community composition as characterized with pyrosequencing, showing that FCM and genetic fingerprints are congruent. FCM and pyrosequencing data were subsequently combined for the calculation of cell concentration changes for each bacterial phylum. The results revealed an increase in cell concentrations of specific bacterial phyla (e.g., Proteobacteria), along with a decrease in other phyla (e.g., Actinobacteria), which could not be concluded from the two methods individually. The combination of FCM and pyrosequencing methods is a promising approach for future drinking water quality monitoring and for advanced studies on drinking water distribution pipeline ecology. Copyright © 2014 Elsevier Ltd. All rights reserved.
Periwal, Vinita; Scaria, Vinod
Structural variations (SVs) are genomic rearrangements that affect fairly large fragments of DNA. Most of the SVs such as inversions, deletions and translocations have been largely studied in context of genetic diseases in eukaryotes. However, recent studies demonstrate that genome rearrangements can also have profound impact on prokaryotic genomes, leading to altered cell phenotype. In contrast to single-nucleotide variations, SVs provide a much deeper insight into organization of bacterial genomes at a much better resolution. SVs can confer change in gene copy number, creation of new genes, altered gene expression and many other functional consequences. High-throughput technologies have now made it possible to explore SVs at a much refined resolution in bacterial genomes. Through this review, we aim to highlight the importance of the less explored field of SVs in prokaryotic genomes and their impact. We also discuss its potential applicability in the emerging fields of synthetic biology and genome engineering where targeted SVs could serve to create sophisticated and accurate genome editing. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: email@example.com.
Matheus Sanitá Lima
Full Text Available Organelle genomes are among the most sequenced kinds of chromosome. This is largely because they are small and widely used in molecular studies, but also because next-generation sequencing technologies made sequencing easier, faster, and cheaper. However, studies of organelle RNA have not kept pace with those of DNA, despite huge amounts of freely available eukaryotic RNA-sequencing (RNA-seq data. Little is known about organelle transcription in nonmodel species, and most of the available eukaryotic RNA-seq data have not been mined for organelle transcripts. Here, we use publicly available RNA-seq experiments to investigate organelle transcription in 30 diverse plastid-bearing protists with varying organelle genomic architectures. Mapping RNA-seq data to organelle genomes revealed pervasive, genome-wide transcription, regardless of the taxonomic grouping, gene organization, or noncoding content. For every species analyzed, transcripts covered ≥85% of the mitochondrial and/or plastid genomes (all of which were ≤105 kb, indicating that most of the organelle DNA—coding and noncoding—is transcriptionally active. These results follow earlier studies of model species showing that organellar transcription is coupled and ubiquitous across the genome, requiring significant downstream processing of polycistronic transcripts. Our findings suggest that noncoding organelle DNA can be transcriptionally active, raising questions about the underlying function of these transcripts and underscoring the utility of publicly available RNA-seq data for recovering complete genome sequences. If pervasive transcription is also found in bigger organelle genomes (>105 kb and across a broader range of eukaryotes, this could indicate that noncoding organelle RNAs are regulating fundamental processes within eukaryotic cells.
Sanitá Lima, Matheus; Smith, David Roy
Organelle genomes are among the most sequenced kinds of chromosome. This is largely because they are small and widely used in molecular studies, but also because next-generation sequencing technologies made sequencing easier, faster, and cheaper. However, studies of organelle RNA have not kept pace with those of DNA, despite huge amounts of freely available eukaryotic RNA-sequencing (RNA-seq) data. Little is known about organelle transcription in nonmodel species, and most of the available eukaryotic RNA-seq data have not been mined for organelle transcripts. Here, we use publicly available RNA-seq experiments to investigate organelle transcription in 30 diverse plastid-bearing protists with varying organelle genomic architectures. Mapping RNA-seq data to organelle genomes revealed pervasive, genome-wide transcription, regardless of the taxonomic grouping, gene organization, or noncoding content. For every species analyzed, transcripts covered ≥85% of the mitochondrial and/or plastid genomes (all of which were ≤105 kb), indicating that most of the organelle DNA-coding and noncoding-is transcriptionally active. These results follow earlier studies of model species showing that organellar transcription is coupled and ubiquitous across the genome, requiring significant downstream processing of polycistronic transcripts. Our findings suggest that noncoding organelle DNA can be transcriptionally active, raising questions about the underlying function of these transcripts and underscoring the utility of publicly available RNA-seq data for recovering complete genome sequences. If pervasive transcription is also found in bigger organelle genomes (>105 kb) and across a broader range of eukaryotes, this could indicate that noncoding organelle RNAs are regulating fundamental processes within eukaryotic cells. Copyright © 2017 Sanitá Lima and Smith.
Boore, Jeffrey L.
Mitochondria and plastids (collectively called organelles)descended from prokaryotes that adopted an intracellular, endosymbioticlifestyle within early eukaryotes. Comparisons of their remnant genomesaddress a wide variety of biological questions, especially when includingthe genomes of their prokaryotic relatives and the many genes transferredto the eukaryotic nucleus during the transitions from endosymbiont toorganelle. The pace of producing complete organellar genome sequences nowmakes it unfeasible to do broad comparisons using the primary literatureand, even if it were feasible, it is now becoming uncommon for journalsto accept detailed descriptions of genome-level features. Unfortunatelyno database is currently useful for this task, since they have littlestandardization and are riddled with error. Here I outline what iscurrently wrong and what must be done to make this data useful to thescientific community.
Wijffels, R.H.; Kruse, O.; Hellingwerf, K.J.
Both cyanobacteria and eukaryotic microalgae are promising organisms for sustainable production of bulk products such as food, feed, materials, chemicals and fuels. In this review we will summarize the potential and current biotechnological developments.Cyanobacteria are promising host organisms for
Khalil, Ahmad S.; Lu, Timothy K.; Bashor, Caleb J.; Ramirez, Cherie L.; Pyenson, Nora C.; Joung, J. Keith; Collins, James J.
SUMMARY Eukaryotic transcription factors (TFs) perform complex and combinatorial functions within transcriptional networks. Here, we present a synthetic framework for systematically constructing eukaryotic transcription functions using artificial zinc fingers, modular DNA-binding domains found within many eukaryotic TFs. Utilizing this platform, we construct a library of orthogonal synthetic transcription factors (sTFs) and use these to wire synthetic transcriptional circuits in yeast. We engineer complex functions, such as tunable output strength and transcriptional cooperativity, by rationally adjusting a decomposed set of key component properties, e.g., DNA specificity, affinity, promoter design, protein-protein interactions. We show that subtle perturbations to these properties can transform an individual sTF between distinct roles (activator, cooperative factor, inhibitory factor) within a transcriptional complex, thus drastically altering the signal processing behavior of multi-input systems. This platform provides new genetic components for synthetic biology and enables bottom-up approaches to understanding the design principles of eukaryotic transcriptional complexes and networks. PMID:22863014
Melixetian, Marina; Helin, Kristian
Eukaryotes have evolved multiple mechanisms to restrict DNA replication to once per cell cycle. These mechanisms prevent relicensing of origins of replication after initiation of DNA replication in S phase until the end of mitosis. Most of our knowledge of mechanisms controlling prereplication...
Supplementary figure 1. Plots depicting the classification accuracy of Eu-Detect with various combinations of. 'cumulative sequence count' (40K, 50K, 60K, 70K, 80K) and 'coverage threshold' (20%, 30%, 40%, 50%, 60%, 70%,. 80%). While blue bars represent Eu-Detect's average classification accuracy with eukaryotic ...
Gross, Jeferson; Bhattacharya, Debashish
Theories about eukaryote origins (eukaryogenesis) need to provide unified explanations for the emergence of diverse complex features that define this lineage. Models that propose a prokaryote-to-eukaryote transition are gridlocked between the opposing "phagocytosis first" and "mitochondria as seed" paradigms, neither of which fully explain the origins of eukaryote cell complexity. Sex (outcrossing with meiosis) is an example of an elaborate trait not yet satisfactorily addressed in theories about eukaryogenesis. The ancestral nature of meiosis and its dependence on eukaryote cell biology suggest that the emergence of sex and eukaryogenesis were simultaneous and synergic and may be explained by a common selective pressure. We propose that a local rise in oxygen levels, due to cyanobacterial photosynthesis in ancient Archean microenvironments, was highly toxic to the surrounding biota. This selective pressure drove the transformation of an archaeal (archaebacterial) lineage into the first eukaryotes. Key is that oxygen might have acted in synergy with environmental stresses such as ultraviolet (UV) radiation and/or desiccation that resulted in the accumulation of reactive oxygen species (ROS). The emergence of eukaryote features such as the endomembrane system and acquisition of the mitochondrion are posited as strategies to cope with a metabolic crisis in the cell plasma membrane and the accumulation of ROS, respectively. Selective pressure for efficient repair of ROS/UV-damaged DNA drove the evolution of sex, which required cell-cell fusions, cytoskeleton-mediated chromosome movement, and emergence of the nuclear envelope. Our model implies that evolution of sex and eukaryogenesis were inseparable processes. Several types of data can be used to test our hypothesis. These include paleontological predictions, simulation of ancient oxygenic microenvironments, and cell biological experiments with Archaea exposed to ROS and UV stresses. Studies of archaeal conjugation
Masoud, Wafa Mahmoud Hasan; Takamiya, Monica K Wik; Vogensen, Finn Kvist
ripening. Other bacteria like Corynebacterium, Halomonas, Pediococcus, Micrococcus and Staphylococcus, which were encountered in some cheese samples at low percentages compared with the total bacterial populations, were only detected by pyrosequencing. 16S rRNA gene pyrosequencing is an efficient method...
Keskin, Fahriye; Ciftci, Sevgi; Akyuz, Filiz; Abaci, Neslihan; Cakiris, Aris; Akyuz, Umit; Demir, Kadir; Besisik, Fatih; Ustek, Duran; Kaymakoglu, Sabahattin
HCV (Hepatitis C Virus) is genetically more diverse than HBV and HIV (Human Immunodeficiency Virus) and exists as quasispecies within infected individuals. This is due to the lack of efficient proofreading of the viral RNA-dependent RNA polymerase. Consequently, quasispecies emerge depending on the mutation rate of the viral polymerase, which may display a high level of genetic variability in a population. In infected individuals, HCV replicates and circulates as quasispecies composed of a complex mixture of different but closely related genomes that undergoes continuous change due to competitive selection and cooperation between arising mutants. The aim of this study is to investigate mutations in the NS5A region as a whole, including ISDR, PKRBD, IRRDR, and V3 of HCV genotype 1b cirrhosis patients being naive and nonresponders, treated with IFN (interferon) + ribavirin (RBN) by using an ultra-deep pyrosequencing method (UDPS). During the study, five patients (four females, and one male, mean age 59.8 ± 11 years) with HCV related cirrhosis were analyzed. Three patients received IFN + RBN for six months, but two patients did not receive any therapy. HCV-RNA concentrations in patients' sera were determined using a COBAS AMPLICOR HCV MONITOR Test, Version 2.0. Genotyping was performed by using a commercial reverse hybridization method, Line Probe Assay. The quasispecies for the NS5A region were investigated using UDPS. All five patients were HCV genotype 1b (Mean Child-Pugh score 7.2 ± 1.9, 2 pts Child A, 2 pts Child B, and one pt Child C) but only one patient had hepatocellular carcinoma (HCC). A total of 19 different mutations were detected in each of the five patients (ranging from 3 to 6 mutations per patient). In all five patients, several mutations in the ISDR and PKR-BD regions were detected. On the other hand, mutations in the V3 and IRRDR regions were only detected in one patient. UDPS is a new sequencing technology and a very sensitive method in detection
Bennetzen, Jeffrey L; Park, Minkyu
Most annotations of large eukaryotic genomes initially find transposable elements (TEs) and other repeats, then mask them so that subsequent efforts can be concentrated on the annotation and study of non-TE genes. However, TEs often contribute to host biology, and their community biologies are of intrinsic interest. This review discusses the challenges, rationale and technologies for comprehensive TE annotation in the commonly giant genomes of animals and plants. Complete discovery of the TEs in a fully sequenced genome is laborious, but feasible, with current strategies in the hands of a careful researcher. These deep TE studies have begun to provide important perspectives on how genomes evolve and the degree to which genome changes do and do not affect eukaryotic biology. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Full Text Available Transposons are discrete segments of DNA that have the distinctive ability to move and replicate within genomes across the tree of life. 'Cut and paste' DNA transposition involves excision from a donor locus and reintegration into a new locus in the genome. We studied molecular events following the excision steps of two eukaryotic DNA transposons, Sleeping Beauty (SB and piggyBac (PB that are widely used for genome manipulation in vertebrate species. SB originates from fish and PB from insects; thus, by introducing these transposons to human cells we aimed to monitor the process of establishing a transposon-host relationship in a naïve cellular environment. Similarly to retroviruses, neither SB nor PB is capable of self-avoidance because a significant portion of the excised transposons integrated back into its own genome in a suicidal process called autointegration. Barrier-to-autointegration factor (BANF1, a cellular co-factor of certain retroviruses, inhibited transposon autointegration, and was detected in higher-order protein complexes containing the SB transposase. Increasing size sensitized transposition for autointegration, consistent with elevated vulnerability of larger transposons. Both SB and PB were affected similarly by the size of the transposon in three different assays: excision, autointegration and productive transposition. Prior to reintegration, SB is completely separated from the donor molecule and followed an unbiased autointegration pattern, not associated with local hopping. Self-disruptive autointegration occurred at similar frequency for both transposons, while aberrant, pseudo-transposition events were more frequently observed for PB.
Kisand, Veljo; Lettieri, Teresa
De novo genome sequencing of previously uncharacterized microorganisms has the potential to open up new frontiers in microbial genomics by providing insight into both functional capabilities and biodiversity. Until recently, Roche 454 pyrosequencing was the NGS method of choice for de novo assembly because it generates hundreds of thousands of long reads (tools for processing NGS data are increasingly free and open source and are often adopted for both their high quality and role in promoting academic freedom. The error rate of pyrosequencing the Alcanivorax borkumensis genome was such that thousands of insertions and deletions were artificially introduced into the finished genome. Despite a high coverage (~30 fold), it did not allow the reference genome to be fully mapped. Reads from regions with errors had low quality, low coverage, or were missing. The main defect of the reference mapping was the introduction of artificial indels into contigs through lower than 100% consensus and distracting gene calling due to artificial stop codons. No assembler was able to perform de novo assembly comparable to reference mapping. Automated annotation tools performed similarly on reference mapped and de novo draft genomes, and annotated most CDSs in the de novo assembled draft genomes. Free and open source software (FOSS) tools for assembly and annotation of NGS data are being developed rapidly to provide accurate results with less computational effort. Usability is not high priority and these tools currently do not allow the data to be processed without manual intervention. Despite this, genome assemblers now readily assemble medium short reads into long contigs (>97-98% genome coverage). A notable gap in pyrosequencing technology is the quality of base pair calling and conflicting base pairs between single reads at the same nucleotide position. Regardless, using draft whole genomes that are not finished and remain fragmented into tens of contigs allows one to characterize
Pyrosequencing of the samples resulted in a total of 18520 sequences (3000 per sample, approximately with an average length of 325 bp (base pairs. The taxonomic assignment of sequences allowed the identification to the genus level, being observed a large bacterial diversity with over 110 genera. The most prevalent genera in the samples were Hydrotalea, Acinetobacter, Delftia, Sediminibacter and Pseudomonas, among others. Differences in the microbial communities were observed among the samples, and the PCoA analysis allowed their separation by means on their gender and if they proceed from sampling before or after the spawning. Nevertheless, the rarefaction curves obtained for each sample failed to reach a saturation phase, indicating that more sequencing effort would be necessary.
Díaz, S; Echeverría, M G; It, V; Posik, D M; Rogberg-Muñoz, A; Pena, N L; Peral-García, P; Vega-Pla, J L; Giovambattista, G
The polymorphism of equine lymphocyte antigen (ELA) class II DRA gene had been detected by polymerase chain reaction-single-strand conformational polymorphism (PCR-SSCP) and reference strand-mediated conformation analysis. These methodologies allowed to identify 11 ELA-DRA exon 2 sequences, three of which are widely distributed among domestic horse breeds. Herein, we describe the development of a pyrosequencing-based method applicable to ELA-DRA typing, by screening samples from eight different horse breeds previously typed by PCR-SSCP. This sequence-based method would be useful in high-throughput genotyping of major histocompatibility complex genes in horses and other animal species, making this system interesting as a rapid screening method for animal genotyping of immune-related genes.
Větrovský, Tomáš; Baldrian, Petr
Roč. 49, č. 8 (2013), s. 1027-1037 ISSN 0178-2762 R&D Projects: GA MŠk LD12050; GA MŠk LD12048; GA ČR GAP504/12/0709 Institutional support: RVO:61388971 Keywords : Fungal community * Internal transcribed spacer * Pyrosequencing pipeline Subject RIV: EE - Microbiology, Virology Impact factor: 3.396, year: 2013
Pu, Jian; Kazama, Shinobu; Miura, Takayuki; Azraini, Nabila Dhyan; Konta, Yoshimitsu; Ito, Hiroaki; Ueki, You; Cahyaningrum, Ermaya Eka; Omura, Tatsuo; Watanabe, Toru
Norovirus GII.3, GII.4, and GII.17 were detected using pyrosequencing in sewage and oysters in January and February 2015, in Japan. The strains in sewage and oyster samples were genetically identical or similar, predominant strains belonging to GII.17 Kawasaki 2014 lineage. This is the first report of GII.17 Kawasaki 2014 in oysters.
Migheli, Francesca; Stoccoro, Andrea; Coppedè, Fabio; Wan Omar, Wan Adnan; Failli, Alessandra; Consolini, Rita; Seccia, Massimo; Spisni, Roberto; Miccoli, Paolo; Mathers, John C.; Migliore, Lucia
There is increasing interest in the development of cost-effective techniques for the quantification of DNA methylation biomarkers. We analyzed 90 samples of surgically resected colorectal cancer tissues for APC and CDKN2A promoter methylation using methylation sensitive-high resolution melting (MS-HRM) and pyrosequencing. MS-HRM is a less expensive technique compared with pyrosequencing but is usually more limited because it gives a range of methylation estimates rather than a single value. Here, we developed a method for deriving single estimates, rather than a range, of methylation using MS-HRM and compared the values obtained in this way with those obtained using the gold standard quantitative method of pyrosequencing. We derived an interpolation curve using standards of known methylated/unmethylated ratio (0%, 12.5%, 25%, 50%, 75%, and 100% of methylation) to obtain the best estimate of the extent of methylation for each of our samples. We observed similar profiles of methylation and a high correlation coefficient between the two techniques. Overall, our new approach allows MS-HRM to be used as a quantitative assay which provides results which are comparable with those obtained by pyrosequencing. PMID:23326336
Vettraino, A M; Bonants, P; Tomassini, A; Bruni, N; Vannini, A
To evaluate the accuracy of pyrosequencing for the description of Phytophthora communities in terms of taxa identification and risk of assignment for false Molecular Operational Taxonomic Units (MOTUs). Pyrosequencing of Internal Transcribed Spacer 1 (ITS1) amplicons was used to describe the structure of a DNA mixture comprising eight Phytophthora spp. and Pythium vexans. Pyrosequencing resulted in 16 965 reads, detecting all species in the template DNA mixture. Reducing the ITS1 sequence identity threshold resulted in a decrease in numbers of unmatched reads but a concomitant increase in the numbers of false MOTUs. The total error rate was 0·63% and comprised mainly mismatches (0·25%) Pyrosequencing of ITS1 region is an efficient and accurate technique for the detection and identification of Phytophthora spp. in environmental samples. However, the risk of allocating false MOTUs, even when demonstrated to be low, may require additional validation with alternative detection methods. Phytophthora spp. are considered among the most destructive groups of invasive plant pathogens, affecting thousands of cultivated and wild plants worldwide. Simultaneous early detection of Phytophthora complexes in environmental samples offers an unique opportunity for the interception of known and unknown species along pathways of introduction, along with the identification of these organisms in invaded environments. © 2012 The Authors Letters in Applied Microbiology © 2012 The Society for Applied Microbiology.
Reyes, Carlen; Dellwig, Olaf; Dähnke, K.
To gain insight into the bacterial communities involved in iron-(Fe) cycling under marine conditions, we analysed sediments with Fe-contents (0.5-1.5 wt %) from the suboxic zone at a marine site in the Skagerrak (SK) and a brackish site in the Bothnian Bay (BB) using 16S rRNA gene pyrosequencing....
Jeanne, Nicolas; Saliou, Adrien; Carcenac, Romain; Lefebvre, Caroline; Dubois, Martine; Cazabat, Michelle; Nicot, Florence; Loiseau, Claire; Raymond, Stéphanie; Izopet, Jacques; Delobel, Pierre
HIV-1 coreceptor usage must be accurately determined before starting CCR5 antagonist-based treatment as the presence of undetected minor CXCR4-using variants can cause subsequent virological failure. Ultra-deep pyrosequencing of HIV-1 V3 env allows to detect low levels of CXCR4-using variants that current genotypic approaches miss. However, the computation of the mass of sequence data and the need to identify true minor variants while excluding artifactual sequences generated during amplification and ultra-deep pyrosequencing is rate-limiting. Arbitrary fixed cut-offs below which minor variants are discarded are currently used but the errors generated during ultra-deep pyrosequencing are sequence-dependant rather than random. We have developed an automated processing of HIV-1 V3 env ultra-deep pyrosequencing data that uses biological filters to discard artifactual or non-functional V3 sequences followed by statistical filters to determine position-specific sensitivity thresholds, rather than arbitrary fixed cut-offs. It allows to retain authentic sequences with point mutations at V3 positions of interest and discard artifactual ones with accurate sensitivity thresholds.
Man Kit Cheung
Full Text Available BACKGROUND: Tuberculosis (TB remains a global threat in the 21st century. Traditional studies of the disease are focused on the single pathogen Mycobacterium tuberculosis. Recent studies have revealed associations of some diseases with an imbalance in the microbial community. Characterization of the TB microbiota could allow a better understanding of the disease. METHODOLOGY/PRINCIPAL FINDINGS: Here, the sputum microbiota in TB infection was examined by using 16S rRNA pyrosequencing. A total of 829,873 high-quality sequencing reads were generated from 22 TB and 14 control sputum samples. Firmicutes, Proteobacteria, Bacteroidetes, Actinobacteria, and Fusobacteria were the five major bacterial phyla recovered, which together composed over 98% of the microbial community. Proteobacteria and Bacteroidetes were more represented in the TB samples and Firmicutes was more predominant in the controls. Sixteen major bacterial genera were recovered. Streptococcus, Neisseria and Prevotella were the most predominant genera, which were dominated by several operational taxonomic units grouped at a 97% similarity level. Actinomyces, Fusobacterium, Leptotrichia, Prevotella, Streptococcus, and Veillonella were found in all TB samples, possibly representing the core genera in TB sputum microbiota. The less represented genera Mogibacterium, Moryella and Oribacterium were enriched statistically in the TB samples, while a genus belonging to the unclassified Lactobacillales was enriched in the controls. The diversity of microbiota was similar in the TB and control samples. CONCLUSIONS/SIGNIFICANCE: The composition and diversity of sputum microbiota in TB infection was characterized for the first time by using high-throughput pyrosequencing. It lays the framework for examination of potential roles played by the diverse microbiota in TB pathogenesis and progression, and could ultimately facilitate advances in TB treatment.
Full Text Available The current literature reveals that the intrathalline coexistence of multiple microalgal taxa in lichens is more common than previously thought, and additional complexity is supported by the coexistence of bacteria and basidiomycete yeasts in lichen thalli. This replaces the old paradigm that lichen symbiosis occurs between a fungus and a single photobiont. The lichen Ramalina farinacea has proven to be a suitable model to study the multiplicity of microalgae in lichen thalli due to the constant coexistence of Trebouxia sp. TR9 and T. jamesii in long-distance populations. To date, studies involving phycobiont diversity within entire thalli are based on Sanger sequencing, but this method seems to underestimate the diversity. Here, we aim to analyze both the microalgal diversity and its community structure in a single thallus of the lichen R. farinacea by applying a 454 pyrosequencing approach coupled with a careful ad hoc-performed protocol for lichen sample processing prior to DNA extraction. To ascertain the reliability of the pyrosequencing results and the applied bioinformatics pipeline results, the thalli were divided into three sections (apical, middle and basal zones, and a mock community sample was used. The developed methodology allowed 40448 filtered algal reads to be obtained from a single lichen thallus, which encompassed 31 OTUs representative of different microalgae genera. In addition to corroborating the coexistence of the two Trebouxia sp. TR9 and T. jamesii taxa in the same thallus, this study showed a much higher microalgal diversity associated with the lichen. Along the thallus ramifications, we also detected variations in phycobiont distribution that might correlate with different microenvironmental conditions. These results highlight R. farinacea as a suitable material for studying microalgal diversity and further strengthen the concept of lichens as multispecies microecosystems. Future analyses will be relevant to
Tekin, Saban; Dowd, Scot E; Davinic, Marko; Bursali, Ahmet; Keskin, Adem
Ticks continue to be a threat to human and animal health in Turkey, as they are considered important vectors of human and animal diseases. The objectives of this investigation are to characterize the microbial communities of two tick species, Rhipicephalus annulatus and Dermacenter marginatus, analyze patterns of co-occurrence among microbial taxa, identify and compare pathogens contributing human diseases, and determine whether avirulent symbionts could exclude human pathogens from tick communities. Furthermore, this study explores a microbiome of the R. annulatus and D. marginatus via the bacterial 16S tag-encoded FLX-titanium amplicon pyrosequencing (bTEFAP) technique to describe their bacterial diversity. Pyrosequencing was performed on adult males and females isolated from humans from two high-risk Turkish provinces, Sivas and Amasya, during tick outbreaks in 2009. A total of 36,253 sequences were utilized for analyses of the 8 tick samples. Several pathogenic genera such as Francisella, Coxiella, Rickettsia, and Shigella were detected in the ticks tested. The most distinguishable difference between the two species of ticks was the lack of known human pathogen Rickettsia in R. annulatus and in samples 9 and 10 of D. marginatus. These samples had higher relative abundance of Flavobacterium sp., Curvibacter sp., Acidovorax sp., and Bacteroidaceae genera mostly representing symbionts which form a large component of normal tick microbiota. The outcome of this study is consistent with the predictions of the community ecological theory that diversity-rich bacteriomes are more resistant to bacterial invasion (and consequent pathogen dissemination) than diversity-deprived ones.
Tuohy, Marion J; Hall, Gerri S; Sholtis, Mary; Procop, Gary W
Pyrosequencing technology, sequencing by addition, was evaluated for categorization of mycobacterial isolates. One hundred and eighty-nine isolates, including 18 ATCC and Trudeau Mycobacterial Culture Collection (TMC) strains, were studied. There were 38 Mycobacterium tuberculosis complex, 27 M. kansasii, 27 MAI complex, 21 M. marinum, 14 M. gordonae, 20 M. chelonae-abscessus group, 10 M. fortuitum, 5 M. xenopi, 3 M. celatum, 2 M. terrae complex, 20 M. mucogenicum, and 2 M. scrofulaceum. Nucleic acid extracts were prepared from solid media or MGIT broth. Traditional PCR was performed with one of the primers biotinylated; the assay targeted a portion of the 16S rRNA gene that contains a hypervariable region, which has been previously shown to be useful for the identification of mycobacteria. The PSQ Sample Preparation Kit was used, and the biotinylated PCR product was processed to a single-stranded DNA template. The sequencing primer was hybridized to the DNA template in a PSQ96 plate. Incorporation of the complementary nucleotides resulted in light generation peaks, forming a pyrogram, which was evaluated by the instrument software. Thirty basepairs were used for isolate categorization. Manual interpretation of the sequences was performed if the quality of the 30-bp sequence was in doubt or if more than 4 bp homopolymers were recognized. Sequences with more than 5 bp of bad quality were deemed unacceptable. When blasted against GenBank, 179 of 189 sequences (94.7%) assigned isolates to the correct molecular genus or group. Ten M. gordonae isolates had more than 5 bp of bad quality sequence and were not accepted. Pyrosequencing of this hypervariable region afforded rapid and acceptable characterization of common, routinely isolated clinical Mycobacterium sp. Algorithms are recommended for further differentiation with an additional sequencing primer or additional biochemicals.
Kobayashi, Naomi; Bauer, Thomas W; Tuohy, Marion J; Lieberman, Isador H; Krebs, Viktor; Togawa, Daisuke; Fujishiro, Takaaki; Procop, Gary W
We have developed a combined real-time PCR and pyrosequencing assay that successfully differentiated the vast majority of gram-positive and gram-negative bacteria when bacterial isolates were tested. The purpose of this study was to evaluate this assay on clinical specimens obtained from orthopedic surgeries, and to prospectively compare the results of "molecular Gram stain" with culture and conventional direct Gram stain. Forty-five surgical specimens were obtained from patients who underwent orthopedic surgery procedures. The DNA was extracted and a set of broad-range PCR primers that targeted a part of the 16S rDNA gene was used for pan-bacterial PCR. The amplicons were submitted for pyrosequencing and the resulting molecular Gram stain characteristics were recorded. Culture and direct Gram staining were performed using standard methods for all cases. Surgical specimens were reviewed histologically for all cases that had a discrepancy between culture and molecular results. There was an 86.7% (39/45) agreement between the traditional and molecular methods. In 12/14 (85.7%) culture-proven cases of bacterial infection, molecular Gram stain characteristics were in agreement with the culture results, while the conventional Gram stain result was in agreement only for five cases (35.7%). In the 31 culture negative cases, 27 cases were also PCR negative, whereas 4 were PCR positive. Three of these were characterized as gram negative and one as gram positive by this molecular method. Molecular determination of the Gram stain characteristics of bacteria that cause orthopedic infections may be achieved, in most instances, by this method. Further studies are necessary to understand the clinical importance of PCR-positive/culture-negative results.
Full Text Available Objective The gastrointestinal tract of sheep contain complex microbial communities that influence numerous aspects of the sheep’s health and development. The objective of this study was to analyze the composition and diversity of the microbiota in the gastrointestinal tract sections (rumen, reticulum, omasum, abomasum, duodenum, jejunum, ileum, cecum, colon, and rectum of sheep. Methods This analysis was performed by 454 pyrosequencing using the V3-V6 region of the 16S rRNA genes. Samples were collected from five healthy, small tailed Han sheep aged 10 months, obtained at market. The bacterial composition of sheep gastrointestinal microbiota was investigated at the phylum, class, order, family, genus, and species levels. Results The dominant bacterial phyla in the entire gastrointestinal sections were Firmicutes, Bacteroidetes, and Proteobacteria. In the stomach, the three most dominant genera in the sheep were Prevotella, unclassified Lachnospiraceae, and Butyrivibrio. In the small intestine, the three most dominant genera in the sheep were Escherichia, unclassified Lachnospiraceae, and Ruminococcus. In the large intestine, the three most dominant genera in the sheep were Ruminococcus, unclassified Ruminococcaceae, and Prevotella. R. flavefaciens, B. fibrisolvens, and S. ruminantium were three most dominant species in the sheep gastrointestinal tract. Principal Coordinates Analysis showed that the microbial communities from each gastrointestinal section could be separated into three groups according to similarity of community composition: stomach (rumen, reticulum, omasum, and abomasum, small intestine (duodenum, jejunum, and ileum, and large intestine (cecum, colon, and rectum. Conclusion This is the first study to characterize the entire gastrointestinal microbiota in sheep by use of 16S rRNA gene amplicon pyrosequencing, expanding our knowledge of the gastrointestinal bacterial community of sheep.
Platre, Matthieu Pierre; Jaillais, Yvon
A wide range of signaling processes occurs at the cell surface through the reversible association of proteins from the cytosol to the plasma membrane. Some low abundant lipids are enriched at the membrane of specific compartments and thereby contribute to the identity of cell organelles by acting as biochemical landmarks. Lipids also influence membrane biophysical properties, which emerge as an important feature in specifying cellular territories. Such parameters are crucial for signal transduction and include lipid packing, membrane curvature and electrostatics. In particular, membrane electrostatics specifies the identity of the plasma membrane inner leaflet. Membrane surface charges are carried by anionic phospholipids, however the exact nature of the lipid(s) that powers the plasma membrane electrostatic field varies among eukaryotes and has been hotly debated during the last decade. Herein, we discuss the role of anionic lipids in setting up plasma membrane electrostatics and we compare similarities and differences that were found in different eukaryotic cells.
Abid Ali, Ferdos; Costa, Alessandro
The MCM motor of the CMG helicase powers ahead of the eukaryotic replication machinery to unwind DNA, in a process that requires ATP hydrolysis. The reconstitution of DNA replication in vitro has established the succession of events that lead to replication origin activation by the MCM and recent studies have started to elucidate the structural basis of duplex DNA unwinding. Despite the exciting progress, how the MCM translocates on DNA remains a matter of debate. Copyright © 2016. Published by Elsevier Ltd.
Oleg Mosin; Ignat Ignatov
Biologic influence of deuterium (D) on cells of various taxonomic groups of prokaryotic and eukaryotic microorganisms realizing methylotrophic, chemoheterotrophic, photo-organotrophic, and photosynthetic ways of assimilation of carbon substrates are investigated at growth on media with heavy water (D2О). The method of step by step adaptation technique of cells to D2О was developed, consisting in plating of cells on 2 % agarose nutrient media containing increasing gradient of concentration of ...
Neil W. Blackstone
Two major obstacles hinder the application of evolutionary theory to the origin of eukaryotes. The first is more apparent than real?the endosymbiosis that led to the mitochondrion is often described as ?non-Darwinian? because it deviates from the incremental evolution championed by the modern synthesis. Nevertheless, endosymbiosis can be accommodated by a multi-level generalization of evolutionary theory, which Darwin himself pioneered. The second obstacle is more serious?all of the major fea...
Full Text Available Ample evidence exists showing that eukaryotic signal molecules synthesized and released by the host can activate the virulence of opportunistic pathogens. The sensitivity of prokaryotes to host signal molecules requires the presence of bacterial sensors. These prokaryotic sensors, or receptors, have a double function: stereospecific recognition in a complex environment and transduction of the message in order to initiate bacterial physiological modifications. As messengers are generally unable to freely cross the bacterial membrane, they require either the presence of sensors anchored in the membrane or transporters allowing direct recognition inside the bacterial cytoplasm. Since the discovery of quorum sensing, it was established that the production of virulence factors by bacteria is tightly growth-phase regulated. It is now obvious that expression of bacterial virulence is also controlled by detection of the eukaryotic messengers released in the micro-environment as endocrine or neuro-endocrine modulators. In the presence of host physiological stress many eukaryotic factors are released and detected by Gram-negative bacteria which in return rapidly adapt their physiology. For instance, Pseudomonas aeruginosa can bind elements of the host immune system such as interferon-γ and dynorphin and then through quorum sensing circuitry enhance its virulence. Escherichia coli sensitivity to the neurohormones of the catecholamines family appears relayed by a recently identified bacterial adrenergic receptor. In the present review, we will describe the mechanisms by which various eukaryotic signal molecules produced by host may activate Gram-negative bacteria virulence. Particular attention will be paid to Pseudomonas, a genus whose representative species, P. aeruginosa, is a common opportunistic pathogen. The discussion will be particularly focused on the pivotal role played by these new types of pathogen sensors from the sensing to the transduction
Morrow, John F.; Cohen, Stanley N.; Chang, Annie C. Y.; Boyer, Herbert W.; Goodman, Howard M.; Helling, Robert B.
Fragments of amplified Xenopus laevis DNA, coding for 18S and 28S ribosomal RNA and generated by EcoRI restriction endonuclease, have been linked in vitro to the bacterial plasmid pSC101; and the recombinant molecular species have been introduced into E. coli by transformation. These recombinant plasmids, containing both eukaryotic and prokaryotic DNA, replicate stably in E. coli. RNA isolated from E. coli minicells harboring the plasmids hybridizes to amplified X. laevis rDNA. Images PMID:4600264
Flegontova, Olga; Flegontov, Pavel; Malviya, S.; Audic, S.; Wincker, P.; de Vargas, C.; Bowler, C.; Lukeš, Julius; Horák, Aleš
Roč. 26, č. 22 (2016), s. 3060-3065 ISSN 0960-9822 R&D Projects: GA ČR GPP506/12/P931; GA ČR(CZ) GA14-23986S Institutional support: RVO:60077344 Keywords : virus-sized particles * microbial eukaryotes * sea-floor * phytoplankton * communities * euglenozoa * dispersal * ecosystem Subject RIV: EG - Zoology Impact factor: 8.851, year: 2016
Sun, Chen; Hu, Zhiqiang; Zheng, Tianqing; Lu, Kuangchen; Zhao, Yue; Wang, Wensheng; Shi, Jianxin; Wang, Chunchao; Lu, Jinyuan; Zhang, Dabing; Li, Zhikang; Wei, Chaochun
A pan-genome is the union of the gene sets of all the individuals of a clade or a species and it provides a new dimension of genome complexity with the presence/absence variations (PAVs) of genes among these genomes. With the progress of sequencing technologies, pan-genome study is becoming affordable for eukaryotes with large-sized genomes. The Asian cultivated rice, Oryza sativa L., is one of the major food sources for the world and a model organism in plant biology. Recently, the 3000 Rice Genome Project (3K RGP) sequenced more than 3000 rice genomes with a mean sequencing depth of 14.3×, which provided a tremendous resource for rice research. In this paper, we present a genome browser, Rice Pan-genome Browser (RPAN), as a tool to search and visualize the rice pan-genome derived from 3K RGP. RPAN contains a database of the basic information of 3010 rice accessions, including genomic sequences, gene annotations, PAV information and gene expression data of the rice pan-genome. At least 12 000 novel genes absent in the reference genome were included. RPAN also provides multiple search and visualization functions. RPAN can be a rich resource for rice biology and rice breeding. It is available at http://cgm.sjtu.edu.cn/3kricedb/ or http://www.rmbreeding.cn/pan3k. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Qin, Jie; Lehr, Corinne R; Yuan, Chungang; Le, X Chris; McDermott, Timothy R; Rosen, Barry P
Arsenic is the most common toxic substance in the environment, ranking first on the Superfund list of hazardous substances. It is introduced primarily from geochemical sources and is acted on biologically, creating an arsenic biogeocycle. Geothermal environments are known for their elevated arsenic content and thus provide an excellent setting in which to study microbial redox transformations of arsenic. To date, most studies of microbial communities in geothermal environments have focused on Bacteria and Archaea, with little attention to eukaryotic microorganisms. Here, we show the potential of an extremophilic eukaryotic alga of the order Cyanidiales to influence arsenic cycling at elevated temperatures. Cyanidioschyzon sp. isolate 5508 oxidized arsenite [As(III)] to arsenate [As(V)], reduced As(V) to As(III), and methylated As(III) to form trimethylarsine oxide (TMAO) and dimethylarsenate [DMAs(V)]. Two arsenic methyltransferase genes, CmarsM7 and CmarsM8, were cloned from this organism and demonstrated to confer resistance to As(III) in an arsenite hypersensitive strain of Escherichia coli. The 2 recombinant CmArsMs were purified and shown to transform As(III) into monomethylarsenite, DMAs(V), TMAO, and trimethylarsine gas, with a T(opt) of 60-70 degrees C. These studies illustrate the importance of eukaryotic microorganisms to the biogeochemical cycling of arsenic in geothermal systems, offer a molecular explanation for how these algae tolerate arsenic in their environment, and provide the characterization of algal methyltransferases.
Niño, Carlos A; Chaparro, Jenny; Soffientini, Paolo; Polo, Simona; Wasserman, Moises
Ubiquitination is a highly dynamic and versatile posttranslational modification that regulates protein function, stability, and interactions. To investigate the roles of ubiquitination in a primitive eukaryotic lineage, we utilized the early-branching eukaryote Giardia intestinalis. Using a combination of biochemical, immunofluorescence-based, and proteomics approaches, we assessed the ubiquitination status during the process of differentiation in Giardia. We observed that different types of ubiquitin modifications present specific cellular and temporal distribution throughout the Giardia life cycle from trophozoites to cyst maturation. Ubiquitin signal was detected in the wall of mature cysts, and enzymes implicated in cyst wall biogenesis were identified as substrates for ubiquitination. Interestingly, inhibition of proteasome activity did not affect trophozoite replication and differentiation, while it caused a decrease in cyst viability, arguing for proteasome involvement in cyst wall maturation. Using a proteomics approach, we identified around 200 high-confidence ubiquitinated candidates that vary their ubiquitination status during differentiation. Our results indicate that ubiquitination is critical for several cellular processes in this primitive eukaryote. PMID:23613346
Gene transfer into eukaryotic cells plays an important role in cell biology. Over the last 30 years a number of transfection methods have been developed to mediate gene transfer into eukaryotic cells. Classical methods include co-precipitation of DNA with calcium phosphate, charge-dependent precipitation of DNA with DEAE-dextran, electroporation of nucleic acids, and formation of transfection complexes between DNA and cationic liposomes. Gene transfer technologies based on activated PAMAM-dendrimers provide another class of transfection reagents. PAMAM-dendrimers are highly branched, spherical molecules. Activation of newly synthesized dendrimers involves hydrolytic removal of some of the branches, and results in a molecule with a higher degree of flexibility. Activated dendrimers assemble DNA into compact structures via charge interactions. Activated dendrimer - DNA complexes bind to the cell membrane of eukaryotic cells, and are transported into the cell by non-specific endocytosis. A structural model of the activated dendrimer - DNA complex and a potential mechanism for its uptake into cells will be discussed.
Parker, Matthew W; Botchan, Michael R; Berger, James M
Cellular DNA replication is initiated through the action of multiprotein complexes that recognize replication start sites in the chromosome (termed origins) and facilitate duplex DNA melting within these regions. In a typical cell cycle, initiation occurs only once per origin and each round of replication is tightly coupled to cell division. To avoid aberrant origin firing and re-replication, eukaryotes tightly regulate two events in the initiation process: loading of the replicative helicase, MCM2-7, onto chromatin by the origin recognition complex (ORC), and subsequent activation of the helicase by its incorporation into a complex known as the CMG. Recent work has begun to reveal the details of an orchestrated and sequential exchange of initiation factors on DNA that give rise to a replication-competent complex, the replisome. Here, we review the molecular mechanisms that underpin eukaryotic DNA replication initiation - from selecting replication start sites to replicative helicase loading and activation - and describe how these events are often distinctly regulated across different eukaryotic model organisms.
John A Capra
Full Text Available The evolutionary history of a protein reflects the functional history of its ancestors. Recent phylogenetic studies identified distinct evolutionary signatures that characterize proteins involved in cancer, Mendelian disease, and different ontogenic stages. Despite the potential to yield insight into the cellular functions and interactions of proteins, such comparative phylogenetic analyses are rarely performed, because they require custom algorithms. We developed ProteinHistorian to make tools for performing analyses of protein origins widely available. Given a list of proteins of interest, ProteinHistorian estimates the phylogenetic age of each protein, quantifies enrichment for proteins of specific ages, and compares variation in protein age with other protein attributes. ProteinHistorian allows flexibility in the definition of protein age by including several algorithms for estimating ages from different databases of evolutionary relationships. We illustrate the use of ProteinHistorian with three example analyses. First, we demonstrate that proteins with high expression in human, compared to chimpanzee and rhesus macaque, are significantly younger than those with human-specific low expression. Next, we show that human proteins with annotated regulatory functions are significantly younger than proteins with catalytic functions. Finally, we compare protein length and age in many eukaryotic species and, as expected from previous studies, find a positive, though often weak, correlation between protein age and length. ProteinHistorian is available through a web server with an intuitive interface and as a set of command line tools; this allows biologists and bioinformaticians alike to integrate these approaches into their analysis pipelines. ProteinHistorian's modular, extensible design facilitates the integration of new datasets and algorithms. The ProteinHistorian web server, source code, and pre-computed ages for 32 eukaryotic genomes are
Driscoll, Timothy P; Verhoeve, Victoria I; Guillotte, Mark L; Lehman, Stephanie S; Rennoll, Sherri A; Beier-Sexton, Magda; Rahman, M Sayeedur; Azad, Abdu F; Gillespie, Joseph J
Reductive genome evolution has purged many metabolic pathways from obligate intracellular Rickettsia ( Alphaproteobacteria ; Rickettsiaceae ). While some aspects of host-dependent rickettsial metabolism have been characterized, the array of host-acquired metabolites and their cognate transporters remains unknown. This dearth of information has thwarted efforts to obtain an axenic Rickettsia culture, a major impediment to conventional genetic approaches. Using phylogenomics and computational pathway analysis, we reconstructed the Rickettsia metabolic and transport network, identifying 51 host-acquired metabolites (only 21 previously characterized) needed to compensate for degraded biosynthesis pathways. In the absence of glycolysis and the pentose phosphate pathway, cell envelope glycoconjugates are synthesized from three imported host sugars, with a range of additional host-acquired metabolites fueling the tricarboxylic acid cycle. Fatty acid and glycerophospholipid pathways also initiate from host precursors, and import of both isoprenes and terpenoids is required for the synthesis of ubiquinone and the lipid carrier of lipid I and O-antigen. Unlike metabolite-provisioning bacterial symbionts of arthropods, rickettsiae cannot synthesize B vitamins or most other cofactors, accentuating their parasitic nature. Six biosynthesis pathways contain holes (missing enzymes); similar patterns in taxonomically diverse bacteria suggest alternative enzymes that await discovery. A paucity of characterized and predicted transporters emphasizes the knowledge gap concerning how rickettsiae import host metabolites, some of which are large and not known to be transported by bacteria. Collectively, our reconstructed metabolic network offers clues to how rickettsiae hijack host metabolic pathways. This blueprint for growth determinants is an important step toward the design of axenic media to rescue rickettsiae from the eukaryotic cell. IMPORTANCE A hallmark of obligate intracellular
Swain, Martin T; Tsai, Isheng J; Assefa, Samual A; Newbold, Chris; Berriman, Matthew; Otto, Thomas D
Genome projects now produce draft assemblies within weeks owing to advanced high-throughput sequencing technologies. For milestone projects such as Escherichia coli or Homo sapiens, teams of scientists were employed to manually curate and finish these genomes to a high standard. Nowadays, this is not feasible for most projects, and the quality of genomes is generally of a much lower standard. This protocol describes software (PAGIT) that is used to improve the quality of draft genomes. It offers flexible functionality to close gaps in scaffolds, correct base errors in the consensus sequence and exploit reference genomes (if available) in order to improve scaffolding and generating annotations. The protocol is most accessible for bacterial and small eukaryotic genomes (up to 300 Mb), such as pathogenic bacteria, malaria and parasitic worms. Applying PAGIT to an E. coli assembly takes ∼24 h: it doubles the average contig size and annotates over 4,300 gene models.
Redrejo-Rodríguez, Modesto; Salas, Margarita
Protein-primed replication constitutes a generalized mechanism to initiate DNA or RNA synthesis in linear genomes, including viruses, gram-positive bacteria, linear plasmids and mobile elements. By this mechanism a specific amino acid primes replication and becomes covalently linked to the genome ends. Despite the fact that TPs lack sequence homology, they share a similar structural arrangement, with the priming residue in the C-terminal half of the protein and an accumulation of positively charged residues at the N-terminal end. In addition, various bacteriophage TPs have been shown to have DNA-binding capacity that targets TPs and their attached genomes to the host nucleoid. Furthermore, a number of bacteriophage TPs from different viral families and with diverse hosts also contain putative nuclear localization signals and localize in the eukaryotic nucleus, which could lead to the transport of the attached DNA. This suggests a possible role of bacteriophage TPs in prokaryote-to-eukaryote horizontal gene transfer. - Highlights: • Protein-primed genome replication constitutes a strategy to initiate DNA or RNA synthesis in linear genomes. • Bacteriophage terminal proteins (TPs) are covalently attached to viral genomes by their primary function priming DNA replication. • TPs are also DNA-binding proteins and target phage genomes to the host nucleoid. • TPs can also localize in the eukaryotic nucleus and may have a role in phage-mediated interkingdom gene transfer
Redrejo-Rodríguez, Modesto; Salas, Margarita, E-mail: firstname.lastname@example.org
Protein-primed replication constitutes a generalized mechanism to initiate DNA or RNA synthesis in linear genomes, including viruses, gram-positive bacteria, linear plasmids and mobile elements. By this mechanism a specific amino acid primes replication and becomes covalently linked to the genome ends. Despite the fact that TPs lack sequence homology, they share a similar structural arrangement, with the priming residue in the C-terminal half of the protein and an accumulation of positively charged residues at the N-terminal end. In addition, various bacteriophage TPs have been shown to have DNA-binding capacity that targets TPs and their attached genomes to the host nucleoid. Furthermore, a number of bacteriophage TPs from different viral families and with diverse hosts also contain putative nuclear localization signals and localize in the eukaryotic nucleus, which could lead to the transport of the attached DNA. This suggests a possible role of bacteriophage TPs in prokaryote-to-eukaryote horizontal gene transfer. - Highlights: • Protein-primed genome replication constitutes a strategy to initiate DNA or RNA synthesis in linear genomes. • Bacteriophage terminal proteins (TPs) are covalently attached to viral genomes by their primary function priming DNA replication. • TPs are also DNA-binding proteins and target phage genomes to the host nucleoid. • TPs can also localize in the eukaryotic nucleus and may have a role in phage-mediated interkingdom gene transfer.
Full Text Available Abstract Background The origin of eukaryotic cells was one of the most dramatic evolutionary transitions in the history of life. It is generally assumed that eukaryotes evolved later then prokaryotes by the transformation or fusion of prokaryotic lineages. However, as yet there is no consensus regarding the nature of the prokaryotic group(s ancestral to eukaryotes. Regardless of this, a hardly debatable fundamental novel characteristic of the last eukaryotic common ancestor was the ability to exploit prokaryotic biomass by the ingestion of entire cells, i.e. phagocytosis. The recent advances in our understanding of the social life of prokaryotes may help to explain the origin of this form of total exploitation. Presentation of the hypothesis Here I propose that eukaryotic cells originated in a social environment, a differentiated microbial mat or biofilm that was maintained by the cooperative action of its members. Cooperation was costly (e.g. the production of developmental signals or an extracellular matrix but yielded benefits that increased the overall fitness of the social group. I propose that eukaryotes originated as selfish cheaters that enjoyed the benefits of social aggregation but did not contribute to it themselves. The cheaters later evolved into predators that lysed other cells and eventually became professional phagotrophs. During several cycles of social aggregation and dispersal the number of cheaters was contained by a chicken game situation, i.e. reproductive success of cheaters was high when they were in low abundance but was reduced when they were over-represented. Radical changes in cell structure, including the loss of the rigid prokaryotic cell wall and the development of endomembranes, allowed the protoeukaryotes to avoid cheater control and to exploit nutrients more efficiently. Cellular changes were buffered by both the social benefits and the protective physico-chemical milieu of the interior of biofilms. Symbiosis
Radiosensitivity of cells (D 0 ) is considered with regard to the structural organization of the genome. The following terms are introduced: ''karyotaxon'', organisms with identical structural organization of the genome, and ''specific genome stability'' K=D 0 C, where C is the quantity of DNA in the cell nucleus; K is the amount of energy (eV) the sorption of which in DNA is necessary and sufficient for one elementary damage to occur. It was shown that Ksub(i)=const. within every karyotaxon ''i''. K 1 =100 eV for viruses, and K 4 =61000 eV for the highest level of genome organization (diploid eukaryotes including man). Potential mechanisms of increasing Ksub(i) with increasing level of genome organization and the role of this factor in evolution are discussed [ru
Backofen, Rolf; Gorodkin, Jan; Hofacker, Ivo L.
Over the last two decades it has become clear that RNA is much more than just a boring intermediate in protein expression. Ancient RNAs still appear in the core information metabolism and comprise a surprisingly large component in bacterial gene regulation. A common theme with these types of mostly...... small RNAs is their reliance of conserved secondary structures. Large scale sequencing projects, on the other hand, have profoundly changed our understanding of eukaryotic genomes. Pervasively transcribed, they give rise to a plethora of large and evolutionarily extremely flexible noncoding RNAs...... that exert a vastly diverse array of molecule functions. In this chapter we provide a—necessarily incomplete—overview of the current state of comparative analysis of noncoding RNAs, emphasizing computational approaches as a means to gain a global picture of the modern RNA world....
Nygaard, Sanne; Braunstein, Alexander; Malsen, Gareth
Plasmodium parasites, the causal agents of malaria, result in more than 1 million deaths annually. Plasmodium are unicellular eukaryotes with small ~23 Mb genomes encoding ~5200 protein-coding genes. The protein-coding genes comprise about half of these genomes. Although evolutionary processes ha...
Bendele Kylie G
Full Text Available Abstract Background Ticks are regarded as the most relevant vectors of disease-causing pathogens in domestic and wild animals. The cattle tick, Rhipicephalus (Boophilus microplus, hinders livestock production in tropical and subtropical parts of the world where it is endemic. Tick microbiomes remain largely unexplored. The objective of this study was to explore the R. microplus microbiome by applying the bacterial 16S tag-encoded FLX-titanium amplicon pyrosequencing (bTEFAP technique to characterize its bacterial diversity. Pyrosequencing was performed on adult males and females, eggs, and gut and ovary tissues from adult females derived from samples of R. microplus collected during outbreaks in southern Texas. Results Raw data from bTEFAP were screened and trimmed based upon quality scores and binned into individual sample collections. Bacteria identified to the species level include Staphylococcus aureus, Staphylococcus chromogenes, Streptococcus dysgalactiae, Staphylococcus sciuri, Serratia marcescens, Corynebacterium glutamicum, and Finegoldia magna. One hundred twenty-one bacterial genera were detected in all the life stages and tissues sampled. The total number of genera identified by tick sample comprised: 53 in adult males, 61 in adult females, 11 in gut tissue, 7 in ovarian tissue, and 54 in the eggs. Notable genera detected in the cattle tick include Wolbachia, Coxiella, and Borrelia. The molecular approach applied in this study allowed us to assess the relative abundance of the microbiota associated with R. microplus. Conclusions This report represents the first survey of the bacteriome in the cattle tick using non-culture based molecular approaches. Comparisons of our results with previous bacterial surveys provide an indication of geographic variation in the assemblages of bacteria associated with R. microplus. Additional reports on the identification of new bacterial species maintained in nature by R. microplus that may be
Background Ticks are regarded as the most relevant vectors of disease-causing pathogens in domestic and wild animals. The cattle tick, Rhipicephalus (Boophilus) microplus, hinders livestock production in tropical and subtropical parts of the world where it is endemic. Tick microbiomes remain largely unexplored. The objective of this study was to explore the R. microplus microbiome by applying the bacterial 16S tag-encoded FLX-titanium amplicon pyrosequencing (bTEFAP) technique to characterize its bacterial diversity. Pyrosequencing was performed on adult males and females, eggs, and gut and ovary tissues from adult females derived from samples of R. microplus collected during outbreaks in southern Texas. Results Raw data from bTEFAP were screened and trimmed based upon quality scores and binned into individual sample collections. Bacteria identified to the species level include Staphylococcus aureus, Staphylococcus chromogenes, Streptococcus dysgalactiae, Staphylococcus sciuri, Serratia marcescens, Corynebacterium glutamicum, and Finegoldia magna. One hundred twenty-one bacterial genera were detected in all the life stages and tissues sampled. The total number of genera identified by tick sample comprised: 53 in adult males, 61 in adult females, 11 in gut tissue, 7 in ovarian tissue, and 54 in the eggs. Notable genera detected in the cattle tick include Wolbachia, Coxiella, and Borrelia. The molecular approach applied in this study allowed us to assess the relative abundance of the microbiota associated with R. microplus. Conclusions This report represents the first survey of the bacteriome in the cattle tick using non-culture based molecular approaches. Comparisons of our results with previous bacterial surveys provide an indication of geographic variation in the assemblages of bacteria associated with R. microplus. Additional reports on the identification of new bacterial species maintained in nature by R. microplus that may be pathogenic to its vertebrate hosts
Morrison, Hilary G; McArthur, Andrew G; Gillin, Frances D; Aley, Stephen B; Adam, Rodney D; Olsen, Gary J; Best, Aaron A; Cande, W Zacheus; Chen, Feng; Cipriano, Michael J; Davids, Barbara J; Dawson, Scott C; Elmendorf, Heidi G; Hehl, Adrian B; Holder, Michael E; Huse, Susan M; Kim, Ulandt U; Lasek-Nesselquist, Erica; Manning, Gerard; Nigam, Anuranjini; Nixon, Julie E J; Palm, Daniel; Passamaneck, Nora E; Prabhu, Anjali; Reich, Claudia I; Reiner, David S; Samuelson, John; Svard, Staffan G; Sogin, Mitchell L
The genome of the eukaryotic protist Giardia lamblia, an important human intestinal parasite, is compact in structure and content, contains few introns or mitochondrial relics, and has simplified machinery for DNA replication, transcription, RNA processing, and most metabolic pathways. Protein kinases comprise the single largest protein class and reflect Giardia's requirement for a complex signal transduction network for coordinating differentiation. Lateral gene transfer from bacterial and archaeal donors has shaped Giardia's genome, and previously unknown gene families, for example, cysteine-rich structural proteins, have been discovered. Unexpectedly, the genome shows little evidence of heterozygosity, supporting recent speculations that this organism is sexual. This genome sequence will not only be valuable for investigating the evolution of eukaryotes, but will also be applied to the search for new therapeutics for this parasite.
Full Text Available The species in family Planctomycetaceae are ideal groups for investigating the origin of eukaryotes. Their cells are divided by a lipidic intracytoplasmic membrane and they share a number of eukaryote-like molecular characteristics. However, their genomic structures, potential abilities, and evolutionary status are still unknown. In this study, we searched for common protein families and a core genome/pan genome based on 11 sequenced species in family Planctomycetaceae. Then, we constructed phylogenetic tree based on their 832 common protein families. We also annotated the 11 genomes using the Clusters of Orthologous Groups database. Moreover, we predicted and reconstructed their core/pan metabolic pathways using the KEGG (Kyoto Encyclopedia of Genes and Genomes orthology system. Subsequently, we identified genomic islands (GIs and structural variations (SVs among the five complete genomes and we specifically investigated the integration of two Planctomycetaceae plasmids in all 11 genomes. The results indicate that Planctomycetaceae species share diverse genomic variations and unique genomic characteristics, as well as have huge potential for human applications.
Kersey, Paul Julian; Allen, James E; Christensen, Mikkel; Davis, Paul; Falin, Lee J; Grabmueller, Christoph; Hughes, Daniel Seth Toney; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Langridge, Nicholas; McDowall, Mark D; Maheswari, Uma; Maslen, Gareth; Nuhn, Michael; Ong, Chuang Kee; Paulini, Michael; Pedro, Helder; Toneva, Iliana; Tuli, Mary Ann; Walts, Brandon; Williams, Gareth; Wilson, Derek; Youens-Clark, Ken; Monaco, Marcela K; Stein, Joshua; Wei, Xuehong; Ware, Doreen; Bolser, Daniel M; Howe, Kevin Lee; Kulesha, Eugene; Lawson, Daniel; Staines, Daniel Michael
Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species. The project exploits and extends technologies for genome annotation, analysis and dissemination, developed in the context of the vertebrate-focused Ensembl project, and provides a complementary set of resources for non-vertebrate species through a consistent set of programmatic and interactive interfaces. These provide access to data including reference sequence, gene models, transcriptional data, polymorphisms and comparative analysis. This article provides an update to the previous publications about the resource, with a focus on recent developments. These include the addition of important new genomes (and related data sets) including crop plants, vectors of human disease and eukaryotic pathogens. In addition, the resource has scaled up its representation of bacterial genomes, and now includes the genomes of over 9000 bacteria. Specific extensions to the web and programmatic interfaces have been developed to support users in navigating these large data sets. Looking forward, analytic tools to allow targeted selection of data for visualization and download are likely to become increasingly important in future as the number of available genomes increases within all domains of life, and some of the challenges faced in representing bacterial data are likely to become commonplace for eukaryotes in future.
Full Text Available Abstract Background The concept of orthology is key to decoding evolutionary relationships among genes across different species using comparative genomics. QuartetS is a recently reported algorithm for large-scale orthology detection. Based on the well-established evolutionary principle that gene duplication events discriminate paralogous from orthologous genes, QuartetS has been shown to improve orthology detection accuracy while maintaining computational efficiency. Description QuartetS-DB is a new orthology database constructed using the QuartetS algorithm. The database provides orthology predictions among 1621 complete genomes (1365 bacterial, 92 archaeal, and 164 eukaryotic, covering more than seven million proteins and four million pairwise orthologs. It is a major source of orthologous groups, containing more than 300,000 groups of orthologous proteins and 236,000 corresponding gene trees. The database also provides over 500,000 groups of inparalogs. In addition to its size, a distinguishing feature of QuartetS-DB is the ability to allow users to select a cutoff value that modulates the balance between prediction accuracy and coverage of the retrieved pairwise orthologs. The database is accessible at https://applications.bioanalysis.org/quartetsdb. Conclusions QuartetS-DB is one of the largest orthology resources available to date. Because its orthology predictions are underpinned by evolutionary evidence obtained from sequenced genomes, we expect its accuracy to continue to increase in future releases as the genomes of additional species are sequenced.
Redondo-Nieto, Miguel; Barret, Matthieu; Morrisey, John P.; Germaine, Kieran; Martínez-Granero, Francisco; Barahona, Emma; Navazo, Ana; Sánchez-Contreras, María; Moynihan, Jennifer A.; Giddens, Stephen R.; Coppoolse, Eric R.; Muriel, Candela; Stiekema, Willem J.; Rainey, Paul B.; Dowling, David; O'Gara, Fergal; Martín, Marta
Pseudomonas fluorescens F113 is a plant growth-promoting rhizobacterium (PGPR) that has biocontrol activity against fungal plant pathogens and is a model for rhizosphere colonization. Here, we present its complete genome sequence, which shows that besides a core genome very similar to those of other strains sequenced within this species, F113 possesses a wide array of genes encoding specialized functions for thriving in the rhizosphere and interacting with eukaryotic organisms. PMID:22328765
Espagne, Eric; Lespinet, Olivier; Malagnac, Fabienne; Da Silva, Corinne; Jaillon, Olivier; Porcel, Betina M; Couloux, Arnaud; Aury, Jean-Marc; Ségurens, Béatrice; Poulain, Julie; Anthouard, Véronique; Grossetete, Sandrine; Khalili, Hamid; Coppin, Evelyne; Déquard-Chablat, Michelle; Picard, Marguerite; Contamine, Véronique; Arnaise, Sylvie; Bourdais, Anne; Berteaux-Lecellier, Véronique; Gautheret, Daniel; de Vries, Ronald P; Battaglia, Evy; Coutinho, Pedro M; Danchin, Etienne Gj; Henrissat, Bernard; Khoury, Riyad El; Sainsard-Chanet, Annie; Boivin, Antoine; Pinan-Lucarré, Bérangère; Sellem, Carole H; Debuchy, Robert; Wincker, Patrick; Weissenbach, Jean; Silar, Philippe
BACKGROUND: The dung-inhabiting ascomycete fungus Podospora anserina is a model used to study various aspects of eukaryotic and fungal biology, such as ageing, prions and sexual development. RESULTS: We present a 10X draft sequence of P. anserina genome, linked to the sequences of a large expressed
RPS16 of eukaryote is a component of the 40S small ribosomal subunit encoded by RPS16 gene and is also a homolog of prokaryotic RPS9. The cDNA and genomic sequence of RPS16 was cloned successfully for the first time from the Giant Panda (Ailuropoda melanoleuca) using reverse transcription-polymerase chain ...
Huynen, M.A.; Snel, B.; Bork, P.
Comparisons of the gene order in closely related genomes reveal a major role for inversions in the genome shuffling process. In contrast to prokaryotes, where the inversions are predominantly large, half of the inversions between Saccharomyces cerevisiae and Candida albicans appear to be small,
Peng, Qian; Yang, Yanping; Guo, Yanyun; Han, Ye
The vinegar pei harbors complex bacterial communities. Prior studies revealing the bacterial diversity involved were mainly conducted by culture-dependent methods and PCR-DGGE. In this study, 454 pyrosequencing was used to investigate the bacterial communities in vinegar pei during the acetic acid fermentation (AAF) of Tianjin Duliu aged vinegar (TDAV). The results showed that there were 7 phyla and 24 families existing in the vinegar pei, with 2 phyla (Firmicutes, Protebacteria) and 4 families (Lactobacillaceae, Acetobacteracae, Enterobacteriaceae, Chloroplast) predominating. The genus-level identification revealed that 9 genera were the relatively stable, consistent components in different stages of AAF, including the most abundant genus Lactobacillus followed by Acetobacter and Serratia. Additionally, the bacterial community in the early fermentation stage was more complex than those in the later stages, indicating that the accumulation of organic acids provided an appropriate environment to filter unwanted bacteria and to accelerate the growth of required ones. This study provided basic information of bacterial patterns in vinegar pei and relevant changes during AAF of TDAV, and could be used as references in the following study on the implementation of starter culture as well as the improvement of AAF process.
Dai, Mulan; Hamel, Chantal; St Arnaud, Marc; He, Yong; Grant, Cynthia; Lupwayi, Newton; Janzen, Henry; Malhi, Sukhdev S; Yang, Xiaohong; Zhou, Zhiqin
The arbuscular mycorrhizal (AM) fungal resources present in wheat fields of the Canadian Prairie were explored using 454 pyrosequencing. Of the 33 dominant AM fungal operational taxonomic units (OTUs) found in the 76 wheat fields surveyed at anthesis in 2009, 14 clustered as Funneliformis - Rhizophagus, 16 as Claroideoglomus, and 3 as Diversisporales. An OTU of Funneliformis mosseae and one OTU of Diversisporales each accounted for approximately 16% of all AM fungal OTUs. The former was ubiquitous, and the latter was mainly restricted to the Black and Dark Brown Chernozems. AM fungal OTU community composition was better explained by the Chernozem great groups (P = 0.044) than by measured soil properties. Fifty-two percent of the AM fungal OTUs were unrelated to measured soil properties. Black Chernozems hosted the largest AM fungal OTU diversity and almost twice the number of AM fungal sequences seen in Dark Brown Chernozems, the great group ranking second for AM fungal sequence abundance. Brown Chernozems hosted the lowest AM fungal abundance and an AM fungal diversity as low as that seen in Gray soils. We concluded that Black Chernozems are most conducive to AM fungal proliferation. AM fungi are generally distributed according to Chernozem great groups in the Canadian Prairie, although some taxa are evenly distributed in all soil groups.
Jiang, Wen; Zhang, Jie; Chen, Hui
Severe early childhood caries are a prevalent public health problem among preschool children throughout the world. However, little is known about the microbiota found in association with severe early childhood caries. Our study aimed to explore the bacterial microbiota of dental plaques to study the etiology of severe early childhood caries through pyrosequencing analysis based on 16S rRNA gene V1-V3 hypervariable regions. Forty participants were enrolled in the study, and we obtained twenty samples of supragingival plaque from caries-free subjects and twenty samples from subjects with severe early childhood caries. A total of 175,918 reads met the quality control standards, and the bacteria found belonged to fourteen phyla and sixty-three genera. Our results show the overall structure and microbial composition of oral bacterial communities, and they suggest that these bacteria may present a core microbiome in the dental plaque microbiota. Three genera, Streptococcus, Granulicatella, and Actinomyces, were increased significantly in children with severe dental cavities. These data may facilitate improvements in the prevention and treatment of severe early childhood caries.
Gomar-Vercher, S; Cabrera-Rubio, R; Mira, A; Montiel-Company, J M; Almerich-Silla, J M
Different dental caries status could be related with alterations in oral microbiota. Previous studies have collected saliva as a representative medium of the oral ecosystem. The purpose of this study was to assess the composition of oral microbiota and its relation to the presence of dental caries at different degrees of severity. One hundred ten saliva samples from 12-year-old children were taken and divided into six groups defined in strict accordance with their dental caries prevalence according to the International Caries Detection and Assessment System II criteria. These samples were studied by pyrosequencing PCR products of the 16S ribosomal RNA gene. The results showed statistically significant intergroup differences at the class and genus taxonomic levels. Streptococcus is the most frequent genus in all groups; although it did not show intergroup statistical differences. In patients with cavities, Porphyromonas and Prevotella showed an increasing percentage compared to healthy individuals. Bacterial diversity diminished as the severity of the disease increased, so those patients with more advanced stages of caries presented less bacterial diversity than healthy subjects. Although microbial composition tended to be different, the intragroup variation is large, as evidenced by the lack of clear intragroup clustering in principal component analyses. Thus, no clear differences were found, indicating that using bacterial composition as the sole source of biomarkers for dental caries may not be reliable in the unstimulated saliva samples used in the current study.
Smith, Drake M; Snow, David E; Rees, Eric; Zischkau, Ann M; Hanson, J Delton; Wolcott, Randall D; Sun, Yan; White, Jennifer; Kumar, Shashi; Dowd, Scot E
Decubitus ulcers, also known as bedsores or pressure ulcers, affect millions of hospitalized patients each year. The microflora of chronic wounds such as ulcers most commonly exist in the biofilm phenotype and have been known to significantly impair normal healing trajectories. Bacterial tag-encoded FLX amplicon pyrosequencing (bTEFAP), a universal bacterial identification method, was used to identify bacterial populations in 49 decubitus ulcers. Diversity estimators were utilized and wound community compositions analyzed in relation to metadata such as Age, race, gender, and comorbidities. Decubitus ulcers are shown to be polymicrobial in nature with no single bacterium exclusively colonizing the wounds. The microbial community among such ulcers is highly variable. While there are between 3 and 10 primary populations in each wound there can be hundreds of different species present many of which are in trace amounts. There is no clearly significant differences in the microbial ecology of decubitus ulcer in relation to metadata except when considering diabetes. The microbial populations and composition in the decubitus ulcers of diabetics may be significantly different from the communities in non-diabetics. Based upon the continued elucidation of chronic wound bioburdens as polymicrobial infections, it is recommended that, in addition to traditional biofilm-based wound care strategies, an antimicrobial/antibiofilm treatment program can be tailored to each patient's respective wound microflora.
Wolcott Randall D
Full Text Available Abstract Background Decubitus ulcers, also known as bedsores or pressure ulcers, affect millions of hospitalized patients each year. The microflora of chronic wounds such as ulcers most commonly exist in the biofilm phenotype and have been known to significantly impair normal healing trajectories. Methods Bacterial tag-encoded FLX amplicon pyrosequencing (bTEFAP, a universal bacterial identification method, was used to identify bacterial populations in 49 decubitus ulcers. Diversity estimators were utilized and wound community compositions analyzed in relation to metadata such as Age, race, gender, and comorbidities. Results Decubitus ulcers are shown to be polymicrobial in nature with no single bacterium exclusively colonizing the wounds. The microbial community among such ulcers is highly variable. While there are between 3 and 10 primary populations in each wound there can be hundreds of different species present many of which are in trace amounts. There is no clearly significant differences in the microbial ecology of decubitus ulcer in relation to metadata except when considering diabetes. The microbial populations and composition in the decubitus ulcers of diabetics may be significantly different from the communities in non-diabetics. Conclusions Based upon the continued elucidation of chronic wound bioburdens as polymicrobial infections, it is recommended that, in addition to traditional biofilm-based wound care strategies, an antimicrobial/antibiofilm treatment program can be tailored to each patient's respective wound microflora.
Eiko E Kuramae
Full Text Available We assessed soil fungal diversity and community structure at two sampling times (t1 = 47 days and t2 = 104 days of plant age in pots associated with four maize cultivars, including two genetically modified (GM cultivars by high-throughput pyrosequencing of the 18S rRNA gene using DNA and RNA templates. We detected no significant differences in soil fungal diversity and community structure associated with different plant cultivars. However, DNA-based analyses yielded lower fungal OTU richness as compared to RNA-based analyses. Clear differences in fungal community structure were also observed in relation to sampling time and the nucleic acid pool targeted (DNA versus RNA. The most abundant soil fungi, as recovered by DNA-based methods, did not necessary represent the most "active" fungi (as recovered via RNA. Interestingly, RNA-derived community compositions at t1 were highly similar to DNA-derived communities at t2, based on presence/absence measures of OTUs. We recovered large proportions of fungal sequences belonging to arbuscular mycorrhizal fungi and Basidiomycota, especially at the RNA level, suggesting that these important and potentially beneficial fungi are not affected by the plant cultivars nor by GM traits (Bt toxin production. Our results suggest that even though DNA- and RNA-derived soil fungal communities can be very different at a given time, RNA composition may have a predictive power of fungal community development through time.
Yousaf, Sohail; Bulgari, Daniela; Bergna, Alessandro; Pancher, Michael; Quaglino, Fabio; Casati, Paola; Campisano, Andrea
Generally, plants are not considered as hosts for human and animal pathogens (HAP). The recent produce-associated outbreaks of food-borne diseases have drawn attention toward significant deficiencies in our understanding of the ecology of HAP, and their potential for interkingdom transfer. To examine the association of microorganisms classified as HAP with plants, we surveyed the presence and distribution of HAP bacterial taxa (henceforth HAPT, for brevity's sake) in the endosphere of grapevine (Vitis vinifera L.) both in the plant stems and leaves. An enrichment protocol was used on leaves to detect taxa with very low abundance in undisturbed tissues. We used pyrosequencing and phylogenetic analyses of the 16S rDNA gene. We identified several HAPT, and focused on four genera (Propionibacterium, Staphylococcus, Clostridium, and Burkholderia). The majority of the bacterial sequences in the genus Propionibacterium, from grapevine leaf and stem, were identified as P. acnes. Clostridia were detected in leaves and stems, but their number was much higher in leaves after enrichment. HAPT were indentified both in leaves and wood of grapevines. This depicts the ability of these taxa to be internalized within plant tissues and maintain their population levels in a variety of environments. Our analysis highlighted the presence of HAPT in the grapevine endosphere and unexpected occurrence of these bacterial taxa in this atypical environment.
Full Text Available Despite recent successes in the control of dental caries, the mechanism of caries development remains unclear. To investigate the causes of dental decay, especially in early childhood caries, the supragingival microflora composition of 20 twins with discordant caries phenotypes were analyzed using high-throughput pyrosequencing. In addition, the parents completed a lifestyle questionnaire. A total of 228,789 sequencing reads revealed 10 phyla, 84 genera, and 155 species of microflora, the relative abundances of these strains varied dramatically among the children, Comparative analysis between groups revealed that Veillonella, Corynebacterium and Actinomyces were presumed to be caries-related genera, Fusobacterium, Kingella and Leptotrichia were presumed to be healthy-related genus, yet this six genera were not statistically significant (P>0.05. Moreover, a cluster analysis revealed that the microbial composition of samples in the same group was often dissimilar but that the microbial composition observed in twins was usually similar. Although the genetic and environmental factors that strongly influence the microbial composition of dental caries remains unknown, we speculate that genetic factors primarily influence the individual's susceptibility to dental caries and that environmental factors primarily regulate the microbial composition of the dental plaque and the progression to caries. By using improved twins models and increased sample sizes, our study can be extended to analyze the specific genetic and environmental factors that affect the development of caries.
Jing, H.; Rocke, E.; Kong, L.; Xia, X.; Liu, H.; Landry, M. R.
Marine planktonic protists, including microalgae and protistan grazers, are an important contributor to global primary production and carbon and mineral cycles, however, little is known about their population shifts along the oxic-anoxic gradient in the water column. We used 454 pyrosequencing of the 18S rRNA gene and gene transcripts to study the community composition of whole and active protists throughout a water column in the Costa Rica Dome, where a stable oxygen minimum zone (OMZ) exists at a depth of 400~700 m. A clear shift of protist composition from photosynthetic Dinoflagellates in the surface to potential parasitic Dinoflagellates and Ciliates in the deeper water was revealed along the vertical profile at both rRNA and rDNA levels. Those protist groups recovered only at the rDNA level represent either lysed aggregates sinking from the upper waters or potential hosts for parasitic groups. UPGMA clustering demonstrated that total and active protists in the anoxic core of OMZ (550 m) were distinct from those in other water depths. The reduced community diversity and presence of a parasitic/symbiotic trophic lifestyle in the OMZ, especially the anoxic core, suggests that OMZs can exert a selective pressure on protist communities. Such changes in community structure and a shift in trophic lifestyle could result in a modulation of the microbial loop and associated biogeochemical cycling.
Garcia, A; Polonio, J C; Polli, A D; Santos, C M; Rhoden, S A; Quecine, M C; Azevedo, J L; Pamphile, J A
Sapindus saponaria L. of Sapindaceae family is popularly known as soldier soap and is found in Central and South America. A study of such medicinal plants might reveal a more complex diversity of microorganisms as compared to non-medicinal plants, considering their metabolic potential and the chemical communication between their natural microbiota. Rhizosphere is a highly diverse microbial habitat with respect to both the diversity of species and the size of the community. Rhizosphere bacteriome associated with medicinal plant S. saponaria is still poorly known. The objective of this study was to assess the rhizosphere microbiome of the medicinal plant S. saponaria using pyrosequencing, a culture-independent approach that is increasingly being used to estimate the number of bacterial species present in different environments. In their rhizosphere microbiome, 26 phyla were identified from 5089 sequences of 16S rRNA gene, with a predominance of Actinobacteria (33.54%), Acidobacteria (22.62%), and Proteobacteria (24.72%). The rarefaction curve showed a linear increase, with 2660 operational taxonomic units at 3% distance sequence dissimilarity, indicating that the rhizosphere microbiome associated with S. saponaria was highly diverse with groups of bacteria important for soil management, which could be further exploited for agricultural and biotechnological purposes.
Jing, Hongmei; Xia, Xiaomin; Liu, Hongbin; Zhou, Zhi; Wu, Chen; Nagarajan, Sanjay
Diazotrophs in the mangrove rhizosphere play a major role in providing new nitrogen to the mangrove ecosystem and their composition and activity are strongly influenced by anthropogenic activity and ecological conditions. In this study, the diversity of the diazotroph communities in the rhizosphere sediment of five tropical mangrove sites with different levels of pollution along the north and south coastline of Singapore were studied by pyrosequencing of the nifH gene. Bioinformatics analysis revealed that in all the studied locations, the diazotroph communities comprised mainly of members of the diazotrophic cluster I and cluster III. The detected cluster III diazotrophs, which were composed entirely of sulfate-reducing bacteria, were more abundant in the less polluted locations. The metabolic capacities of these diazotrophs indicate the potential for bioremediation and resiliency of the ecosystem to anthropogenic impact. In heavily polluted locations, the diazotrophic community structures were markedly different and the diversity of species was significantly reduced when compared with those in a pristine location. This, together with the increased abundance of Marinobacterium, which is a bioindicator of pollution, suggests that anthropogenic activity has a negative impact on the genetic diversity of diazotrophs in the mangrove rhizosphere.
Full Text Available Twenty macropods from five locations in Queensland, Australia, grazing on a variety of native pastures were surveyed and the bacterial community of the foregut was examined using 454-amplicon pyrosequencing. Specifically, the V3/V4 region of 16S rRNA gene was examined. A total of 5040 OTUs were identified in the data set (post filtering. Thirty-two OTUs were identified as 'shared' OTUS (i.e. present in all samples belonging to either Firmicutes or Bacteroidetes (Clostridiales/Bacteroidales. These phyla predominated the general microbial community in all macropods. Genera represented within the shared OTUs included: unclassified Ruminococcaceae, unclassified Lachnospiraceae, unclassified Clostridiales, Peptococcus sp. Coprococcus spp., Streptococcus spp., Blautia sp., Ruminoccocus sp., Eubacterium sp., Dorea sp., Oscillospira sp. and Butyrivibrio sp. The composition of the bacterial community of the foregut samples of each the host species (Macropus rufus, Macropus giganteus and Macropus robustus was significantly different allowing differentiation between the host species based on alpha and beta diversity measures. Specifically, eleven dominant OTUs that separated the three host species were identified and classified as: unclassified Ruminococcaceae, unclassified Bacteroidales, Prevotella spp. and a Syntrophococcus sucromutans. Putative reductive acetogens and fibrolytic bacteria were also identified in samples. Future work will investigate the presence and role of fibrolytics and acetogens in these ecosystems. Ideally, the isolation and characterization of these organisms will be used for enhanced feed efficiency in cattle, methane mitigation and potentially for other industries such as the biofuel industry.
Harris, Stephen; Croft, Julie; O'Flynn, Ciaran; Deusch, Oliver; Colyer, Alison; Allsopp, Judi; Milella, Lisa; Davis, Ian J
Periodontitis is the most frequently diagnosed health problem in cats yet little is known about the bacterial species important for the disease. The objective of this study was to identify bacterial species associated with health, gingivitis or mild periodontitis (gingivitis or mild periodontitis. Pyrosequencing of the V1-V3 region of the 16S rDNA from these plaque samples generated more than one million reads and identified a total of 267 operational taxonomic units after bioinformatic and statistical analysis. Porphyromonas was the most abundant genus in all gingival health categories, particularly in health along with Moraxella and Fusobacteria. The Peptostreptococcaceae were the most abundant family in gingivitis and mild periodontitis. Logistic regression analysis identified species from various genera that were significantly associated with health, gingivitis or mild periodontitis. The species identified were very similar to those observed in canine plaque in the corresponding health and disease states. Such similarities were not observed between cat and human at the bacterial species level but with disease progression similarities did emerge at the phylum level. This suggests that interventions targeted at human pathogenic species will not be effective for use in cats but there is more potential for commonalities in interventions for cats and dogs.
Dunaeva, E A; Mironov, K O; Dribnokhodova, T E; Subbotina, E E; Bashmakova; Ol'hovskiĭ, I A; Shipulin, G A
The somatic mutation V617F in gen JAK2 is a frequent cause of chronic myeloprolific diseases not conditioned by BCR/ABL mutation. The quantitative testing of relative percentage of mutant allele can be used in establishing severity of disease and its prognosis and in prescription of remedy inhibiting activity of JAK2. To quantitatively test mutation the pyrosequencing technique was applied. The developed technique permits detecting and quantitatively, testing percentage of mutation fraction since 7%. The "gray zone" is presented by samples with percentage of mutant allele from 4% to 7%. The dependence of expected percentage of mutant fraction in analyzed sample from observed value of signal is described by equation of line with regression coefficients y = - 0.97, x = -1.32 and at that measurement uncertainty consists ± 0.7. The developed technique is approved officially on clinical material from 192 patients with main forms of myeloprolific diseases not conditioned by BCR/ABL mutation. It was detected 64 samples with mautant fraction percentage from 13% to 91%. The developed technique permits implementing monitoring of therapy of myeloprolific diseases and facilitates to optimize tactics of treatment.
Frank, Renee; Baloch, Zubair W; Gentile, Caren; Watt, Christopher D; LiVolsi, Virginia A
Multifocal fibrosing thyroiditis (MFT) is characterized by numerous foci of fibrosis in a stellate configuration with fibroelastotic and fibroblastic centers entrapping epithelial structures. MFT has been proposed as a risk factor for papillary thyroid carcinoma (PTC) development. We attempted to identify whether MFT showed such molecular changes and could possibly be related to PTC. We identified seven cases of PTC with MFT in our institutional pathology database and personal consult service of one of the authors (VAL) for the years 1999 to 2012. Areas of PTC, MFT, and normal tissue were selected for BRAF analysis. Macro-dissection, DNA extraction and PCR amplification, and pyrosequencing were performed to detect BRAF mutations in codon 600. All of the MFT lesions and normal thyroid tissue were negative for BRAF mutations. Of the seven PTCs analyzed, five (71 %) were negative for BRAF mutations, while two cases were positive. In our study, none of the MFT lesions harbored BRAF mutations, whereas 29 % (two of seven) PTCs in the same gland were positive. Hence, in this small study, we found no evidence that the MFT lesion is a direct precursor to PTC. It is likely an incidental bystander in the process and a reflection of the background thyroiditis.
Full Text Available O6-methylguanine-DNA-methyltransferase (MGMT has emerged as a relevant predictor of therapeutic response and good prognosis in patients with glioblastoma (GBM. Transcriptionally active MGMT rapidly removes the alkyl adducts, preventing the formation of cross-links and thereby causing resistance to alkylating drugs. Studies with pyrosequencing (PSQ showed that this technique has a higher reproducibility and sensitivity than other techniques. However, the definition of a prognostically relevant threshold for the percentage of MGMT methylation remains one of the most critical issues in the use of PSQ analysis. The aim of this study was to define the cut-off value correlated with good favourable prognostic outcomes. We retrospectively analyzed 51 patients (33 males, 18 females with GBM who underwent surgery or biopsy. The Receiver Operating Characteristics analysis showed that the best possible criteria for PSQ-detected percentage of MGMT methylation that predicted progression-free survival (PFS and overall survival (OS were 19% and 13%, respectively. Patients with ≤19% of PSQ-detected MGMT had a shorter PFS (HR: 0.24, p<0.01; those ones with ≤13% had a shorter OS (HR: 0.33, p<0.05. Our study reinforces the importance of MGMT in the management of GBM patients, but future studies with larger sample sizes are warranted to confirm our findings.
Gulino, Lisa-Maree; Ouwerkerk, Diane; Kang, Alicia Y H; Maguire, Anita J; Kienzle, Marco; Klieve, Athol V
Twenty macropods from five locations in Queensland, Australia, grazing on a variety of native pastures were surveyed and the bacterial community of the foregut was examined using 454-amplicon pyrosequencing. Specifically, the V3/V4 region of 16S rRNA gene was examined. A total of 5040 OTUs were identified in the data set (post filtering). Thirty-two OTUs were identified as 'shared' OTUS (i.e. present in all samples) belonging to either Firmicutes or Bacteroidetes (Clostridiales/Bacteroidales). These phyla predominated the general microbial community in all macropods. Genera represented within the shared OTUs included: unclassified Ruminococcaceae, unclassified Lachnospiraceae, unclassified Clostridiales, Peptococcus sp. Coprococcus spp., Streptococcus spp., Blautia sp., Ruminoccocus sp., Eubacterium sp., Dorea sp., Oscillospira sp. and Butyrivibrio sp. The composition of the bacterial community of the foregut samples of each the host species (Macropus rufus, Macropus giganteus and Macropus robustus) was significantly different allowing differentiation between the host species based on alpha and beta diversity measures. Specifically, eleven dominant OTUs that separated the three host species were identified and classified as: unclassified Ruminococcaceae, unclassified Bacteroidales, Prevotella spp. and a Syntrophococcus sucromutans. Putative reductive acetogens and fibrolytic bacteria were also identified in samples. Future work will investigate the presence and role of fibrolytics and acetogens in these ecosystems. Ideally, the isolation and characterization of these organisms will be used for enhanced feed efficiency in cattle, methane mitigation and potentially for other industries such as the biofuel industry.
Castro-Carrera, T; Toral, P G; Frutos, P; McEwan, N R; Hervás, G; Abecia, L; Pinloche, E; Girdwood, S E; Belenguer, A
Developing novel strategies to increase the content of bioactive unsaturated fatty acids (FA) in ruminant-derived products requires a deeper understanding of rumen biohydrogenation and bacteria involved in this process. Although high-throughput pyrosequencing may allow for a great coverage of bacterial diversity, it has hardly been used to investigate the microbiology of ruminal FA metabolism. In this experiment, 454 pyrosequencing and a molecular fingerprinting technique (terminal restriction fragment length polymorphism; T-RFLP) were used concurrently to assess the effect of diet supplementation with marine algae (MA) on the rumen bacterial community of dairy sheep. Eleven lactating ewes were divided in 2 lots and offered a total mixed ration based on alfalfa hay and concentrate (40:60), supplemented with 0 (control) or 8 (MA) g of MA/kg of dry matter. After 54 d on treatments, animals were slaughtered and samples of rumen content and fluid were collected separately for microbial analysis. Pyrosequencing yielded a greater coverage of bacterial diversity than T-RFLP and allowed the identification of low abundant populations. Conversely, both molecular approaches pointed to similar conclusions and showed that relevant changes due to MA addition were observed within the major ruminal phyla, namely Bacteroidetes, Firmicutes, and Proteobacteria. Decreases in the abundance of unclassified Bacteroidales, Porphyromonadaceae, and Ruminococcaceae and increases in as-yet uncultured species of the family Succinivibrionaceae, might be related to a potential role of these groups in different pathways of rumen FA metabolism. Diet supplementation with MA, however, had no effect on the relative abundance of Butyrivibrio and Pseudobutyrivibrio genera. In addition, results from both 454 pyrosequencing and T-RFLP indicate that the effect of MA was rather consistent in rumen content or fluid samples, despite inherent differences between these fractions in their bacterial composition
Switzeny, Olivier J; Christmann, Markus; Renovanz, Mirjam; Giese, Alf; Sommer, Clemens; Kaina, Bernd
The DNA repair protein O(6)-methylguanine-DNA methyltransferase (MGMT) causes resistance of cancer cells to alkylating agents and, therefore, is a well-established predictive marker for high-grade gliomas that are routinely treated with alkylating drugs. Since MGMT is highly epigenetically regulated, the MGMT promoter methylation status is taken as an indicator of MGMT silencing, predicting the outcome of glioma therapy. MGMT promoter methylation is usually determined by methylation specific PCR (MSP), which is a labor intensive and error-prone method often used semi-quantitatively. Searching for alternatives, we used closed-tube high resolution melt (HRM) analysis, which is a quantitative method, and compared it with MSP and pyrosequencing regarding its predictive value. We analyzed glioblastoma cell lines with known MGMT activity and formalin-fixed samples from IDH1 wild-type high-grade glioma patients (WHO grade III/IV) treated with radiation and temozolomide by HRM, MSP, and pyrosequencing. The data were compared as to progression-free survival (PFS) and overall survival (OS) of patients exhibiting the methylated and unmethylated MGMT status. A promoter methylation cut-off level relevant for PFS and OS was determined. In a multivariate Cox regression model, methylation of MGMT promoter of high-grade gliomas analyzed by HRM, but not MSP, was found to be an independent predictive marker for OS. Univariate Kaplan-Meier analyses revealed for PFS and OS a significant and better discrimination between methylated and unmethylated tumors when quantitative HRM was used instead of MSP. Compared to MSP and pyrosequencing, the HRM method is simple, cost effective, highly accurate and fast. HRM is at least equivalent to pyrosequencing in quantifying the methylation level. It is superior in predicting PFS and OS of high-grade glioma patients compared to MSP and, therefore, can be recommended being used routinely for determination of the MGMT status of gliomas.
Oikonomou, Georgios; Machado, Vinicius Silva; Santisteban, Carlos; Schukken, Ynte Hein; Bicalho, Rodrigo Carvalho
Dairy cow mastitis is an important disease in the dairy industry. Different microbial species have been identified as causative agents in mastitis, and are traditionally diagnosed by bacterial culture. The objective of this study was to use metagenomic pyrosequencing of bacterial 16S rRNA genes to investigate bacterial DNA diversity in milk samples of mastitic and healthy dairy cows and compare the results with those obtained by classical bacterial culture. One hundred and thirty-six milk sam...
OKI, Kaihei; DUGERSUREN, Jamyan; DEMBEREL, Shirchin; WATANABE, Koichi
Here, we used pyrosequencing to obtain a detailed analysis of the microbial diversities of traditional fermented dairy products of Mongolia. From 22 Airag (fermented mare’s milk), 5 Khoormog (fermented camel’s milk) and 26 Tarag (fermented milk of cows, goats and yaks) samples collected in the Mongolian provinces of Arhangai, Bulgan, Dundgobi, Tov, Uburhangai and Umnugobi, we obtained a total of 81 operational taxonomic units, which were assigned to 15 families, 21 genera and 41 species in 3 ...
Full Text Available Abstract Background The main function of gene promoters appears to be the integration of different gene products in their biological pathways in order to maintain homeostasis. Generally, promoters have been classified in two major classes, namely TATA and CpG. Nevertheless, many genes using the same combinatorial formation of transcription factors have different gene expression patterns. Accordingly, we tried to ask ourselves some fundamental questions: Why certain genes have an overall predisposition for higher gene expression levels than others? What causes such a predisposition? Is there a structural relationship of these sequences in different tissues? Is there a strong phylogenetic relationship between promoters of closely related species? Results In order to gain valuable insights into different promoter regions, we obtained a series of image-based patterns which allowed us to identify 10 generic classes of promoters. A comprehensive analysis was undertaken for promoter sequences from Arabidopsis thaliana, Drosophila melanogaster, Homo sapiens and Oryza sativa, and a more extensive analysis of tissue-specific promoters in humans. We observed a clear preference for these species to use certain classes of promoters for specific biological processes. Moreover, in humans, we found that different tissues use distinct classes of promoters, reflecting an emerging promoter network. Depending on the tissue type, comparisons made between these classes of promoters reveal a complementarity between their patterns whereas some other classes of promoters have been observed to occur in competition. Furthermore, we also noticed the existence of some transitional states between these classes of promoters that may explain certain evolutionary mechanisms, which suggest a possible predisposition for specific levels of gene expression and perhaps for a different number of factors responsible for triggering gene expression. Our conclusions are based on comprehensive data from three different databases and a new computer model whose core is using Kappa index of coincidence. Conclusions To fully understand the connections between gene promoters and gene expression, we analyzed thousands of promoter sequences using our Kappa Index of Coincidence method and a specialized Optical Character Recognition (OCR neural network. Under our criteria, 10 classes of promoters were detected. In addition, the existence of “transitional” promoters suggests that there is an evolutionary weighted continuum between classes, depending perhaps upon changes in their gene products.
Jaron, K. S.; Moravec, J. C.; Martínková, Natália
Roč. 30, č. 8 (2014), s. 1081-1086 ISSN 1367-4803 R&D Projects: GA ČR(CZ) GAP506/12/1064 Institutional support: RVO:68081766 Keywords : fungus Aspergillus fumigatus * Cryptosporidium parvum * sequence * evolution * identification * islands * ecology Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 4.981, year: 2014
Full Text Available The mammalian gut houses a complex microbial community which is believed to play a significant role in host physiology. In recent years, several microbial community analysis methods have been implemented to study the whole gut microbial environment, in contrast to classical microbiological methods focusing on bacteria which can be cultivated. One of these is automated ribosomal intergenic spacer analysis (ARISA, an inexpensive and popular way of analyzing bacterial diversity and community fingerprinting in ecological samples. ARISA uses the natural variability in length of the DNA fragment found between the 16S and 23S genes in different bacterial lineages to infer diversity. This method is now being supplanted by affordable next-generation sequencing technologies that can also simultaneously annotate operational taxonomic units for taxonomic identification. We compared ARISA and pyrosequencing of samples from the rumen microbiome of cows, previously sampled at different stages of development and varying in microbial complexity using several ecological parameters. We revealed close agreement between ARISA and pyrosequencing outputs, especially in their ability to discriminate samples from different ecological niches. In contrast, the ARISA method seemed to underestimate sample richness. The good performance of the relatively inexpensive ARISA makes it relevant for straightforward use in bacterial fingerprinting analysis as well as for quick cross-validation of pyrosequencing data.
Trama, Jason P; Adelson, Martin E; Mordechai, Eli
Laboratory diagnosis of molluscum contagiosum virus (MCV) is important as lesions can be confused with those caused by Cryptococcus neoformans, herpes simplex virus, human papillomavirus, and varicella-zoster virus. To develop a rapid method for identifying patients infected with MCV via swab sampling. Two dual-labeled probe real-time PCR assays, one homologous to the p43K gene and one to the MC080R gene, were designed. The p43K PCR was designed to be used in conjunction with Pyrosequencing for confirmation of PCR products and discrimination between MCV1 and MCV2. Both PCR assays were optimized with respect to reaction components, thermocycling parameters, and primer and probe concentrations. The specificities of both PCR assays were confirmed by non-amplification of 38 known human pathogens. Sensitivity assays demonstrated detection of as few as 10 copies per reaction. Testing 703 swabs, concordance between the two real-time PCR assays was 99.9%. Under the developed conditions, Pyrosequencing of the p43K PCR product was capable of providing enough nucleotide sequence to definitively differentiate MCV1 and MCV2. These real-time PCR assays can be used for the rapid, sensitive, and specific detection of MCV and, when combined with Pyrosequencing, can further discriminate between MCV1 and MCV2.
With the completion of the human genome and the growing number of diverse genomes being sequenced, a new age of evolutionary research is currently taking shape. The myriad of technological breakthroughs in biology that are leading to the unification of broad scientific fields such as molecular biology, biochemistry, physics, mathematics and computer science are now known as systems biology. Here I present an overview, with an emphasis on eukaryotes, of how the postgenomics era is adopting comparative approaches that go beyond comparisons among model organisms to shape the nascent field of evolutionary systems biology.
Full Text Available Many plants and animals are defended from predation or herbivory by inhibitory secondary metabolites, which in the marine environment are very common among sessile organisms. Among bacteria, where there is the greatest metabolic potential, little is known about chemical defenses against bacterivorous consumers. An emerging hypothesis is that sessile bacterial communities organized as biofilms serve as bacterial refuge from predation. By testing growth and survival of two common bacterivorous nanoflagellates, we find evidence that chemically mediated resistance against protozoan predators is common among biofilm populations in a diverse set of marine bacteria. Using bioassay-guided chemical and genetic analysis, we identified one of the most effective antiprotozoal compounds as violacein, an alkaloid that we demonstrate is produced predominately within biofilm cells. Nanomolar concentrations of violacein inhibit protozoan feeding by inducing a conserved eukaryotic cell death program. Such biofilm-specific chemical defenses could contribute to the successful persistence of biofilm bacteria in various environments and provide the ecological and evolutionary context for a number of eukaryote-targeting bacterial metabolites.
Rosario, Karyna; Duffy, Siobain; Breitbart, Mya
Single-stranded DNA (ssDNA) viruses with circular genomes are the smallest viruses known to infect eukaryotes. The present study identified 10 novel genomes similar to ssDNA circoviruses through data-mining of public viral metagenomes. The metagenomic libraries included samples from reclaimed water and three different marine environments (Chesapeake Bay, British Columbia coastal waters and Sargasso Sea). All the genomes have similarities to the replication (Rep) protein of circoviruses; however, only half have genomic features consistent with known circoviruses. Some of the genomes exhibit a mixture of genomic features associated with different families of ssDNA viruses (i.e. circoviruses, geminiviruses and parvoviruses). Unique genome architectures and phylogenetic analysis of the Rep protein suggest that these viruses belong to novel genera and/or families. Investigating the complex community of ssDNA viruses in the environment can lead to the discovery of divergent species and help elucidate evolutionary links between ssDNA viruses.
Decelle, Johan; Romac, Sarah; Stern, Rowena F; Bendif, El Mahdi; Zingone, Adriana; Audic, Stéphane; Guiry, Michael D; Guillou, Laure; Tessier, Désiré; Le Gall, Florence; Gourvil, Priscillia; Dos Santos, Adriana L; Probert, Ian; Vaulot, Daniel; de Vargas, Colomban; Christen, Richard
Photosynthetic eukaryotes have a critical role as the main producers in most ecosystems of the biosphere. The ongoing environmental metabarcoding revolution opens the perspective for holistic ecosystems biological studies of these organisms, in particular the unicellular microalgae that often lack distinctive morphological characters and have complex life cycles. To interpret environmental sequences, metabarcoding necessarily relies on taxonomically curated databases containing reference sequences of the targeted gene (or barcode) from identified organisms. To date, no such reference framework exists for photosynthetic eukaryotes. In this study, we built the PhytoREF database that contains 6490 plastidial 16S rDNA reference sequences that originate from a large diversity of eukaryotes representing all known major photosynthetic lineages. We compiled 3333 amplicon sequences available from public databases and 879 sequences extracted from plastidial genomes, and generated 411 novel sequences from cultured marine microalgal strains belonging to different eukaryotic lineages. A total of 1867 environmental Sanger 16S rDNA sequences were also included in the database. Stringent quality filtering and a phylogeny-based taxonomic classification were applied for each 16S rDNA sequence. The database mainly focuses on marine microalgae, but sequences from land plants (representing half of the PhytoREF sequences) and freshwater taxa were also included to broaden the applicability of PhytoREF to different aquatic and terrestrial habitats. PhytoREF, accessible via a web interface (http://phytoref.fr), is a new resource in molecular ecology to foster the discovery, assessment and monitoring of the diversity of photosynthetic eukaryotes using high-throughput sequencing. © 2015 John Wiley & Sons Ltd.
Johri, Parul; Krenek, Sascha; Marinov, Georgi K; Doak, Thomas G; Berendonk, Thomas U; Lynch, Michael
Population-genomic analyses are essential to understanding factors shaping genomic variation and lineage-specific sequence constraints. The dearth of such analyses for unicellular eukaryotes prompted us to assess genomic variation in Paramecium, one of the most well-studied ciliate genera. The Paramecium aurelia complex consists of ∼15 morphologically indistinguishable species that diverged subsequent to two rounds of whole-genome duplications (WGDs, as long as 320 MYA) and possess extremely streamlined genomes. We examine patterns of both nuclear and mitochondrial polymorphism, by sequencing whole genomes of 10-13 worldwide isolates of each of three species belonging to the P. aurelia complex: P. tetraurelia, P. biaurelia, P. sexaurelia, as well as two outgroup species that do not share the WGDs: P. caudatum and P. multimicronucleatum. An apparent absence of global geographic population structure suggests continuous or recent dispersal of Paramecium over long distances. Intergenic regions are highly constrained relative to coding sequences, especially in P. caudatum and P. multimicronucleatum that have shorter intergenic distances. Sequence diversity and divergence are reduced up to ∼100-150 bp both upstream and downstream of genes, suggesting strong constraints imposed by the presence of densely packed regulatory modules. In addition, comparison of sequence variation at non-synonymous and synonymous sites suggests similar recent selective pressures on paralogs within and orthologs across the deeply diverging species. This study presents the first genome-wide population-genomic analysis in ciliates and provides a valuable resource for future studies in evolutionary and functional genetics in Paramecium. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: email@example.com.
Hartmann, Fanny E; Croll, Daniel
Differences in gene content are a significant source of variability within species and have an impact on phenotypic traits. However, little is known about the mechanisms responsible for the most recent gene gains and losses. We screened the genomes of 123 worldwide isolates of the major pathogen of wheat Zymoseptoria tritici for robust evidence of gene copy number variation. Based on orthology relationships in three closely related fungi, we identified 599 gene gains and 1,024 gene losses that have not yet reached fixation within the focal species. Our analyses of gene gains and losses segregating in populations showed that gene copy number variation arose preferentially in subtelomeres and in proximity to transposable elements. Recently lost genes were enriched in virulence factors and secondary metabolite gene clusters. In contrast, recently gained genes encoded mostly secreted protein lacking a conserved domain. We analyzed the frequency spectrum at loci segregating a gene presence-absence polymorphism in four worldwide populations. Recent gene losses showed a significant excess in low-frequency variants compared with genome-wide single nucleotide polymorphism, which is indicative of strong negative selection against gene losses. Recent gene gains were either under weak negative selection or neutral. We found evidence for strong divergent selection among populations at individual loci segregating a gene presence-absence polymorphism. Hence, gene gains and losses likely contributed to local adaptation. Our study shows that microbial eukaryotes harbor extensive copy number variation within populations and that functional differences among recently gained and lost genes led to distinct evolutionary trajectories. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Parks, Matthew B; Wickett, Norman J; Alverson, Andrew J
Abstract Diatoms (Bacillariophyta) are a species-rich group of eukaryotic microbes diverse in morphology, ecology, and metabolism. Previous reconstructions of the diatom phylogeny based on one or a few genes have resulted in inconsistent resolution or low support for critical nodes. We applied phylogenetic paralog pruning techniques to a data set of 94 diatom genomes and transcriptomes to infer perennially difficult species relationships, using concatenation and summary-coalescent methods to reconstruct species trees from data sets spanning a wide range of thresholds for taxon and column occupancy in gene alignments. Conflicts between gene and species trees decreased with both increasing taxon occupancy and bootstrap cutoffs applied to gene trees. Concordance between gene and species trees was lowest for short internodes and increased logarithmically with increasing edge length, suggesting that incomplete lineage sorting disproportionately affects species tree inference at short internodes, which are a common feature of the diatom phylogeny. Although species tree topologies were largely consistent across many data treatments, concatenation methods appeared to outperform summary-coalescent methods for sparse alignments. Our results underscore that approaches to species-tree inference based on few loci are likely to be misled by unrepresentative sampling of gene histories, particularly in lineages that may have diversified rapidly. In addition, phylogenomic studies of diatoms, and potentially other hyperdiverse groups, should maximize the number of gene trees with high taxon occupancy, though there is clearly a limit to how many of these genes will be available. PMID:29040712
Kaur, Charanpreet; Sharma, Shweta; Hasan, Mohammad Rokebul; Pareek, Ashwani; Singla-Pareek, Sneh L; Sopory, Sudhir K
The glyoxalase system is the ubiquitous pathway for the detoxification of methylglyoxal (MG) in the biological systems. It comprises two enzymes, glyoxalase I (GLYI) and glyoxalase II (GLYII), which act sequentially to convert MG into d-lactate, thereby helping living systems get rid of this otherwise cytotoxic byproduct of metabolism. In addition, a glutathione-independent GLYIII enzyme activity also exists in the biological systems that can directly convert MG to d-lactate. Humans and Escherichia coli possess a single copy of GLYI (encoding either the Ni- or Zn-dependent form) and GLYII genes, which through MG detoxification provide protection against various pathological and disease conditions. By contrast, the plant genome possesses multiple GLYI and GLYII genes with a role in abiotic stress tolerance. Plants possess both Ni 2+ - and Zn 2+ -dependent forms of GLYI, and studies on plant glyoxalases reveal the various unique features of these enzymes distinguishing them from prokaryotic and other eukaryotic glyoxalases. Through this review, we provide an overview of the plant glyoxalase family along with a comparative analysis of glyoxalases across various species, highlighting similarities as well as differences in the biochemical, molecular, and physiological properties of these enzymes. We believe that the evolution of multiple glyoxalases isoforms in plants is an important component of their robust defense strategies.
R. Shyama Prasad Rao
Full Text Available Many proteins contain a large number of NXS/T sequences (where X is any amino acid except proline which are the potential sites of asparagine (N linked glycosylation. However, the patterns of occurrence of these N-glycosylation sequons in related proteins or groups of proteins and their underlying causes have largely been unexplored. We computed the actual and probabilistic occurrence of NXS/T sequons in ABC protein superfamilies from eight diverse eukaryotic organisms. The ABC proteins contained significantly higher NXS/T sequon numbers compared to respective genome-wide average, but the sequon density was significantly lower owing to the increase in protein size and decrease in sequon specific amino acids. However, mammalian ABC proteins have significantly higher sequon density, and both serine and threonine containing sequons (NXS and NXT have been positively selected—against the recent findings of only threonine specific Darwinian selection of sequons in proteins. The occurrence of sequons was positively correlated with the frequency of sequon specific amino acids and negatively correlated with proline and the NPS/T sequences. Further, the NPS/T sequences were significantly higher than expected in plant ABC proteins which have the lowest number of NXS/T sequons. Accord- ingly, compared to overall proteins, N-glycosylation sequons in ABC protein superfamilies have a distinct pattern of occurrence, and the results are discussed in an evolutionary perspective.
Full Text Available We report for the first time the genomics of a nuclear compartment of the eukaryotic cell. 454 sequencing and microarray analysis revealed the pattern of nucleolus-associated chromatin domains (NADs in the linear human genome and identified different gene families and certain satellite repeats as the major building blocks of NADs, which constitute about 4% of the genome. Bioinformatic evaluation showed that NAD-localized genes take part in specific biological processes, like the response to other organisms, odor perception, and tissue development. 3D FISH and immunofluorescence experiments illustrated the spatial distribution of NAD-specific chromatin within interphase nuclei and its alteration upon transcriptional changes. Altogether, our findings describe the nature of DNA sequences associated with the human nucleolus and provide insights into the function of the nucleolus in genome organization and establishment of nuclear architecture.
Németh, Attila; Conesa, Ana; Santoyo-Lopez, Javier; Medina, Ignacio; Montaner, David; Péterfia, Bálint; Solovei, Irina; Cremer, Thomas; Dopazo, Joaquin; Längst, Gernot
We report for the first time the genomics of a nuclear compartment of the eukaryotic cell. 454 sequencing and microarray analysis revealed the pattern of nucleolus-associated chromatin domains (NADs) in the linear human genome and identified different gene families and certain satellite repeats as the major building blocks of NADs, which constitute about 4% of the genome. Bioinformatic evaluation showed that NAD–localized genes take part in specific biological processes, like the response to other organisms, odor perception, and tissue development. 3D FISH and immunofluorescence experiments illustrated the spatial distribution of NAD–specific chromatin within interphase nuclei and its alteration upon transcriptional changes. Altogether, our findings describe the nature of DNA sequences associated with the human nucleolus and provide insights into the function of the nucleolus in genome organization and establishment of nuclear architecture. PMID:20361057
Hollister, Jesse D
Genomic evidence of ancestral whole genome duplication (WGD) and polyploidy is widespread among eukaryotic species, and especially among plants. WGD is thought to provide the raw material for adaptation in the form of duplicated genes, and polyploids are thought to benefit from both physiological and genetic buffering. Comparatively little attention has focused on the genomic challenge of polyploidy, however, although much evidence exists that polyploidy severely perturbs important cellular functions. Here, I review recent progress in the study of the re-establishment of stable meiosis in recently evolved polyploids, focusing on four plant species. This work has yielded an insight into the mechanisms underlying stabilization of genome transmission in polyploids, and is revealing remarkable parallels among diverse taxa. Importantly, these studies also provide a road map for investigating how polyploids respond to the challenge of WGD.
U.S. Department of Health & Human Services — EuPathDB Bioinformatics Resource Center for Biodefense and Emerging/Re-emerging Infectious Diseases is a portal for accessing genomic-scale datasets associated with...
Marcela S Montecchia
Full Text Available The Southern Andean Yungas in Northwest Argentina constitute one of the main biodiversity hotspots in the world. Considerable changes in land use have taken place in this ecoregion, predominantly related to forest conversion to croplands, inducing losses in above-ground biodiversity and with potential impact on soil microbial communities. In this study, we used high-throughput pyrosequencing of the 16S ribosomal RNA gene to assess whether land-use change and time under agriculture affect the composition and diversity of soil bacterial communities. We selected two areas dedicated to sugarcane and soybean production, comprising both short- and long-term agricultural sites, and used the adjacent native forest soils as a reference. Land-use change altered the composition of bacterial communities, with differences between productive areas despite the similarities between both forests. At the phylum level, only Verrucomicrobia and Firmicutes changed in abundance after deforestation for sugarcane and soybean cropping, respectively. In cultivated soils, Verrucomicrobia decreased sharply (~80%, while Firmicutes were more abundant. Despite the fact that local diversity was increased in sugarcane systems and was not altered by soybean cropping, phylogenetic beta diversity declined along both chronosequences, evidencing a homogenization of soil bacterial communities over time. In spite of the detected alteration in composition and diversity, we found a core microbiome resistant to the disturbances caused by the conversion of forests to cultivated lands and few or none exclusive OTUs for each land-use type. The overall changes in the relative abundance of copiotrophic and oligotrophic taxa may have an impact in soil ecosystem functionality. However, communities with many taxa in common may also share many functional attributes, allowing to maintain at least some soil ecosystem services after forest conversion to croplands.
Full Text Available The horn fly, Haematobia irritans, is one of the most economically important pests of cattle. Insecticides have been a major element of horn fly management programs. Growing concerns with insecticide resistance, insecticide residues on farm products, and non-availability of new generation insecticides, are serious issues for the livestock industry. Alternative horn fly control methods offer the promise to decrease the use of insecticides and reduce the amount of insecticide residues on livestock products and give an impetus to the organic livestock farming segment. The horn fly, an obligatory blood feeder, requires the help of microflora to supply additional nutrients and metabolize the blood meal. Recent advancements in DNA sequencing methodologies enable researchers to examine the microflora diversity independent of culture methods. We used the bacterial 16S tag-encoded FLX-titanium amplicon pyrosequencing (bTEFAP method to carry out the classification analysis of bacterial flora in adult female and male horn flies and horn fly eggs. The bTEFAP method identified 16S rDNA sequences in our samples which allowed the identification of various prokaryotic taxa associated with the life stage examined. This is the first comprehensive report of bacterial flora associated with the horn fly using a culture-independent method. Several rumen, environmental, symbiotic and pathogenic bacteria associated with the horn fly were identified and quantified. This is the first report of the presence of Wolbachia in horn flies of USA origin and is the first report of the presence of Rikenella in an obligatory blood feeding insect.
Li, Tongtong; Long, Meng; Gatesoupe, François-Joël; Zhang, Qianqian; Li, Aihua; Gong, Xiaoning
Gut microbiota is increasingly regarded as an integral component of the host, due to important roles in the modulation of the immune system, the proliferation of the intestinal epithelium and the regulation of the dietary energy intake. Understanding the factors that influence the composition of these microbial communities is essential to health management, and the application to aquatic animals still requires basic investigation. In this study, we compared the bacterial communities harboured in the intestines and in the rearing water of grass carp (Ctenopharyngodon idellus), crucian carp (Carassius cuvieri), and bighead carp (Hypophthalmichthys nobilis), by using 454-pyrosequencing with barcoded primers targeting the V4 to V5 regions of the bacterial 16S rRNA gene. The specimens of the three species were cohabiting in the same pond. Between 6,218 and 10,220 effective sequences were read from each sample, resulting in a total of 110,398 sequences for 13 samples from gut microbiota and pond water. In general, the microbial communities of the three carps were dominated by Fusobacteria, Firmicutes, Proteobacteria and Bacteroidetes, but the abundance of each phylum was significantly different between species. At the genus level, the overwhelming group was Cetobacterium (97.29 ± 0.46 %) in crucian carp, while its abundance averaged c. 40 and 60 % of the sequences read in the other two species. There was higher microbial diversity in the gut of filter-feeding bighead carp than the gut of the two other species, with grazing feeding habits. The composition of intestine microbiota of grass carp and crucian carp shared higher similarity when compared with bighead carp. The principal coordinates analysis (PCoA) with the weighted UniFrac distance and the heatmap analysis suggested that gut microbiota was not a simple reflection of the microbial community in the local habitat but resulted from species-specific selective pressures, possibly dependent on behavioural, immune
James R White
Full Text Available BACKGROUND: Marine sponge species are of significant interest to many scientific fields including marine ecology, conservation biology, genetics, host-microbe symbiosis and pharmacology. One of the most intriguing aspects of the sponge "holobiont" system is the unique physiology, interaction with microbes from the marine environment and the development of a complex commensal microbial community. However, intraspecific variability and temporal stability of sponge-associated bacterial symbionts remain relatively unknown. METHODOLOGY/PRINCIPAL FINDINGS: We have characterized the bacterial symbiont community biodiversity of seven different individuals of the Caribbean reef sponge Axinella corrugata, from two different Florida reef locations during variable seasons using multiplex 454 pyrosequencing of 16 S rRNA amplicons. Over 265,512 high-quality 16 S rRNA sequences were generated and analyzed. Utilizing versatile bioinformatics methods and analytical software such as the QIIME and CloVR packages, we have identified 9,444 distinct bacterial operational taxonomic units (OTUs. Approximately 65,550 rRNA sequences (24% could not be matched to bacteria at the class level, and may therefore represent novel taxa. Differentially abundant classes between seasonal Axinella communities included Gammaproteobacteria, Flavobacteria, Alphaproteobacteria, Cyanobacteria, Acidobacter and Nitrospira. Comparisons with a proximal outgroup sponge species (Amphimedon compressa, and the growing sponge symbiont literature, indicate that this study has identified approximately 330 A. corrugata-specific symbiotic OTUs, many of which are related to the sulfur-oxidizing Ectothiorhodospiraceae. This family appeared exclusively within A. corrugata, comprising >34.5% of all sequenced amplicons. Other A. corrugata symbionts such as Deltaproteobacteria, Bdellovibrio, and Thiocystis among many others are described. CONCLUSIONS/SIGNIFICANCE: Slight shifts in several bacterial taxa
Zhang, Bingchang; Li, Renhui; Xiao, Peng; Su, Yangui; Zhang, Yuanming
Cyanobacteria are the primary colonizers and form a dominant component of soil photosynthetic communities in biological soil crusts. They are crucial in improving soil environments, namely accumulating soil carbon and nitrogen. Many classical studies have examined cyanobacterial diversity in desert crusts, but relatively few comprehensive molecular surveys have been conducted. We used 454 pyrosequencing of 16S rRNA to investigate cyanobacterial composition and distribution on regional scales in the Gurbantunggut Desert. The relationship between cyanobacterial distribution and environmental factors was also explored. A total of 24,973 cyanobacteria partial 16S rRNA gene sequences were obtained, and 507OTUs were selected, as most OTUs had very few reads. Among these, 347 OTU sequences were of cyanobacteria origin, belonging to Oscillatoriales, Nostocales, Chroococcales, and uncultured cyanobacterium clone, respectively. Microcoleus vaginatus, Chroococcidiopsis spp. and M. steenstrupii were the dominant species in most areas of the Gurbantunggut Desert. Compared with other desert, the Gurbantunggut Desert differed in the prominence of Chroococcidiopsis spp. and lack of Pseudanabaenales. Species composition and abundance of cyanobacteria also showed distinct variations. Soil texture, precipitation, and nutrients and salt levels affected cyanobacterial distribution. Increased precipitation was helpful in improving cyanobacterial diversity. A higher content of coarse sand promoted the colonization and growth of Oscillatoriales and some phylotypes of Chroococcales. The fine-textured soil with higher nutrients and salts supported more varied populations of cyanobacteria, namely some heterocystous cyanobacteria. The results suggested that the Gurbantunggut Desert was rich in cyanobacteria and that precipitation was a primary regulating factor for cyanobacterial composition on a regional scale. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Li, Ou; Xiao, Rong; Sun, Lihua; Guan, Chenglin; Kong, Dedong; Hu, Xiufang
As an epiphyte orchid, Dendrobium catenatum relies on microorganisms for requisite nutrients. Metagenome pyrosequencing based on 16S rRNA and nifH genes was used to characterize the bacterial and diazotrophic communities associated with D. catenatum collected from 5 districts in China. Based on Meta-16S rRNA sequencing, 22 bacterial phyla and 699 genera were identified, distributed as 125 genera from 8 phyla and 319 genera from 10 phyla shared by all the planting bases and all the tissues, respectively. The predominant Proteobacteria varied from 71.81% (GZ) to 96.08% (YN), and Delftia (10.39-38.42%), Burkholderia (2.71-15.98%), Escherichia/Shigella (4.90-25.12%), Pseudomonas (2.68-30.72%) and Sphingomonas (1.83-2.05%) dominated in four planting bases. Pseudomonas (17.94-22.06%), Escherichia/Shigella (6.59-11.59%), Delftia (9.65-22.14%) and Burkholderia (3.12-11.05%) dominated in all the tissues. According to Meta-nifH sequencing, 4 phyla and 45 genera were identified, while 17 genera and 24 genera from 4 phyla were shared by all the planting bases and all the tissues, respectively. Burkholderia and Bradyrhizobium were the most popular in the planting bases, followed by Methylovirgula and Mesorhizobium. Mesorhizobium was the most popular in different tissues, followed by Beijerinckia, Xanthobacter, and Burkholderia. Among the genera, 39 were completely overlapped with the results based on the 16S rRNA gene. In conclusion, abundant bacteria and diazotrophs were identified in common in different tissues of D. catenatum from five planting bases, which might play a great role in the supply of nutrients such as nitrogen. The exact abundance of phylum and genus on the different tissues from different planting bases need deeper sequencing with more samples.
Full Text Available BACKGROUND: Plants selectively attract particular soil microorganisms, in particular consumers of root-excreted compounds. It is unclear to what extent cultivar type and/or growth stage affect this process. METHODOLOGY/PRINCIPAL FINDINGS: DNA-based pyrosequencing was used to characterize the structure of bacterial communities in a field cropped with potato. The rhizospheres of six cultivars denoted Aveka, Aventra, Karnico, Modena, Premiere and Desiree, at three growth stages (young, flowering and senescence were examined, in addition to corresponding bulk soils. Around 350,000 sequences were obtained (5,700 to 38,000 per sample. Across all samples, rank abundance distributions best fitted the power law model, which indicates a community composed of a few highly dominant species next to numerous rare species. Grouping of the sequences showed that members of the Actinobacteria, Alphaproteobacteria, next to as-yet-unclassified bacteria, dominated. Other groups that were consistently found, albeit at lower abundance, were Beta-, Gamma- and Deltaproteobacteria and Acidobacteria. Principal components analyses revealed that rhizosphere samples were significantly different from corresponding bulk soil in each growth stage. Furthermore, cultivar effects were found in the young plant stage, whereas these became insignificant in the flowering and senescence stages. Besides, an effect of time of season was observed for both rhizosphere and bulk soils. The analyzed rhizosphere samples of the potato cultivars were grouped into two groups, in accordance with the allocation of carbon to starch in their tubers, i.e. Aveka, Aventra and Karnico (high versus Premiere and Desiree (low and thus replicates per group were established. CONCLUSIONS: Across all potato cultivars, the young plant stages revealed cultivar-dependent bacterial community structures, which disappeared in the flowering and senescence stages. Furthermore, Pseudomonas, Beta-, Alpha- and
Wolcott Benjamin M
Full Text Available Abstract Background Chronic wound pathogenic biofilms are host-pathogen environments that colonize and exist as a cohabitation of many bacterial species. These bacterial populations cooperate to promote their own survival and the chronic nature of the infection. Few studies have performed extensive surveys of the bacterial populations that occur within different types of chronic wound biofilms. The use of 3 separate16S-based molecular amplifications followed by pyrosequencing, shotgun Sanger sequencing, and denaturing gradient gel electrophoresis were utilized to survey the major populations of bacteria that occur in the pathogenic biofilms of three types of chronic wound types: diabetic foot ulcers (D, venous leg ulcers (V, and pressure ulcers (P. Results There are specific major populations of bacteria that were evident in the biofilms of all chronic wound types, including Staphylococcus, Pseudomonas, Peptoniphilus, Enterobacter, Stenotrophomonas, Finegoldia, and Serratia spp. Each of the wound types reveals marked differences in bacterial populations, such as pressure ulcers in which 62% of the populations were identified as obligate anaerobes. There were also populations of bacteria that were identified but not recognized as wound pathogens, such as Abiotrophia para-adiacens and Rhodopseudomonas spp. Results of molecular analyses were also compared to those obtained using traditional culture-based diagnostics. Only in one wound type did culture methods correctly identify the primary bacterial population indicating the need for improved diagnostic methods. Conclusion If clinicians can gain a better understanding of the wound's microbiota, it will give them a greater understanding of the wound's ecology and will allow them to better manage healing of the wound improving the prognosis of patients. This research highlights the necessity to begin evaluating, studying, and treating chronic wound pathogenic biofilms as multi-species entities in
Full Text Available As an epiphyte orchid, Dendrobium catenatum relies on microorganisms for requisite nutrients. Metagenome pyrosequencing based on 16S rRNA and nifH genes was used to characterize the bacterial and diazotrophic communities associated with D. catenatum collected from 5 districts in China. Based on Meta-16S rRNA sequencing, 22 bacterial phyla and 699 genera were identified, distributed as 125 genera from 8 phyla and 319 genera from 10 phyla shared by all the planting bases and all the tissues, respectively. The predominant Proteobacteria varied from 71.81% (GZ to 96.08% (YN, and Delftia (10.39-38.42%, Burkholderia (2.71-15.98%, Escherichia/Shigella (4.90-25.12%, Pseudomonas (2.68-30.72% and Sphingomonas (1.83-2.05% dominated in four planting bases. Pseudomonas (17.94-22.06%, Escherichia/Shigella (6.59-11.59%, Delftia (9.65-22.14% and Burkholderia (3.12-11.05% dominated in all the tissues. According to Meta-nifH sequencing, 4 phyla and 45 genera were identified, while 17 genera and 24 genera from 4 phyla were shared by all the planting bases and all the tissues, respectively. Burkholderia and Bradyrhizobium were the most popular in the planting bases, followed by Methylovirgula and Mesorhizobium. Mesorhizobium was the most popular in different tissues, followed by Beijerinckia, Xanthobacter, and Burkholderia. Among the genera, 39 were completely overlapped with the results based on the 16S rRNA gene. In conclusion, abundant bacteria and diazotrophs were identified in common in different tissues of D. catenatum from five planting bases, which might play a great role in the supply of nutrients such as nitrogen. The exact abundance of phylum and genus on the different tissues from different planting bases need deeper sequencing with more samples.
Li, Ou; Sun, Lihua; Guan, Chenglin; Kong, Dedong
As an epiphyte orchid, Dendrobium catenatum relies on microorganisms for requisite nutrients. Metagenome pyrosequencing based on 16S rRNA and nifH genes was used to characterize the bacterial and diazotrophic communities associated with D. catenatum collected from 5 districts in China. Based on Meta-16S rRNA sequencing, 22 bacterial phyla and 699 genera were identified, distributed as 125 genera from 8 phyla and 319 genera from 10 phyla shared by all the planting bases and all the tissues, respectively. The predominant Proteobacteria varied from 71.81% (GZ) to 96.08% (YN), and Delftia (10.39–38.42%), Burkholderia (2.71–15.98%), Escherichia/Shigella (4.90–25.12%), Pseudomonas (2.68–30.72%) and Sphingomonas (1.83–2.05%) dominated in four planting bases. Pseudomonas (17.94–22.06%), Escherichia/Shigella (6.59–11.59%), Delftia (9.65–22.14%) and Burkholderia (3.12–11.05%) dominated in all the tissues. According to Meta-nifH sequencing, 4 phyla and 45 genera were identified, while 17 genera and 24 genera from 4 phyla were shared by all the planting bases and all the tissues, respectively. Burkholderia and Bradyrhizobium were the most popular in the planting bases, followed by Methylovirgula and Mesorhizobium. Mesorhizobium was the most popular in different tissues, followed by Beijerinckia, Xanthobacter, and Burkholderia. Among the genera, 39 were completely overlapped with the results based on the 16S rRNA gene. In conclusion, abundant bacteria and diazotrophs were identified in common in different tissues of D. catenatum from five planting bases, which might play a great role in the supply of nutrients such as nitrogen. The exact abundance of phylum and genus on the different tissues from different planting bases need deeper sequencing with more samples. PMID:28931073
Full Text Available The commercial availability of next generation sequencing (NGS technologies facilitated the assessment of functional groups of microorganisms in the environment with high coverage, resolution, and reproducibility. Soil methylotrophs were among the first microorganisms in the environment that were assessed with molecular tools, and nowadays, as well with NGS technologies. Studies in the past years re-attracted notice to the pivotal role of methylotrophs in global conversions of methanol, which mainly originates from plants, and is involved in oxidative reactions and ozone formation in the atmosphere. Aerobic methanol utilizers belong to Bacteria, yeasts, Ascomycota, and molds. Numerous bacterial methylotrophs are facultatively aerobic, and also contribute to anaerobic methanol oxidation in the environment, whereas strict anaerobic methanol utilizers belong to methanogens and acetogens. The diversity of enzymes catalyzing the initial oxidation of methanol is considerable, and comprises at least five different enzyme types in aerobes, and one in strict anaerobes. Only the gene of the large subunit of PQQ-dependent methanol dehydrogenase (mxaF has been analyzed by environmental pyrosequencing. To enable a comprehensive assessment of methanol utilizers in the environment, new primers targeting genes of the PQQ MDH in Methylibium (mdh2, of the NAD-dependent MDH (mdh, of the methanol oxidoreductase of Actinobacteria (mdo, of the fungal FAD-dependent alcohol oxidase (mod1, mod2, and homologues, and of the gene of the large subunit of the methanol:corrinoid methyltransferases (mtaC in methanogens and acetogens need to be developed. Combined stable isotope probing of nucleic acids or proteins with amplicon-based NGS are straightforward approaches to reveal insights into functions of certain methylotrophic taxa in the global methanol cycle.
Marine sponges are associated with a remarkable array of microorganisms. Using a tag pyrosequencing technology, this study was the first to investigate in depth the microbial communities associated with three Red Sea sponges, Hyrtios erectus, Stylissa carteri and Xestospongia testudinaria. We revealed highly diverse sponge-associated bacterial communities with up to 1000 microbial operational taxonomic units (OTUs) and richness estimates of up to 2000 species. Altogether, 26 bacterial phyla were detected from the Red Sea sponges, 11 of which were absent from the surrounding sea water and 4 were recorded in sponges for the first time. Up to 100 OTUs with richness estimates of up to 300 archaeal species were revealed from a single sponge species. This is by far the highest archaeal diversity ever recorded for sponges. A non-negligible proportion of unclassified reads was observed in sponges. Our results demonstrated that the sponge-associated microbial communities remained highly consistent in the same sponge species from different locations, although they varied at different degrees among different sponge species. A significant proportion of the tag sequences from the sponges could be assigned to one of the sponge-specific clusters previously defined. In addition, the sponge-associated microbial communities were consistently divergent from those present in the surrounding sea water. Our results suggest that the Red Sea sponges possess highly sponge-specific or even sponge-species-specific microbial communities that are resistant to environmental disturbance, and much of their microbial diversity remains to be explored. © 2011 International Society for Microbial Ecology All rights reserved.
Estelle Silvia Kilias
Full Text Available Protists in the central Arctic Ocean are adapted to the harsh environmental conditions of its various habitats. During the Polarstern cruise ARK-XXVI/3 in 2011, at one sea-ice station, large aggregates accumulated at the bottom of the melt ponds. In this study, the protist assemblages of the bottom layer of the sea-ice and melt-pond aggregate were investigated using flow cytometry and 454-pyrosequencing. The objective is to provide a first molecular overview of protist biodiversity in these habitats and to consider the overlaps and/or differences in the community compositions. Results of flow cytometry pointed to a cell size distribution that was dominated by 3–10 µm nanoflagellates. The phylogenetic classification of all sequences was conducted at a high taxonomic level, while a selection of abundant (≥1% of total reads sequences was further classified at a lower level. At a high taxonomic level, both habitats showed very similar community structures, dominated by chrysophytes and chlorophytes. At a lower taxonomic level, dissimilarities in the diversity of both groups were encountered in the abundant biosphere. While sea-ice chlorophytes and chrysophytes were dominated by Chlamydomonas/Chloromonas spp. and Ochromonas spp., the melt-pond aggregate was dominated by Carteria sp., Ochromonas spp. and Dinobryon faculiferum. We suppose that the similarities in richness and community structure are a consequence of melt-pond freshwater seeping through porous sea ice in late summer. Differences in the abundant biosphere nevertheless indicate that environmental conditions in both habitats vary enough to select for different dominant species.
Urban, J. M.; Foulk, M. S.; Casella, Cinzia
Origins of DNA replication (ORIs) occur at defined regions in the genome. Although DNA sequence defines the position of ORIs in budding yeast, the factors for ORI specification remain elusive in metazoa. Several methods have been used recently to map ORIs in metazoan genomes with the hope...... that features for ORI specification might emerge. These methods are reviewed here with analysis of their advantages and shortcomings. The various factors that may influence ORI selection for initiation of DNA replication are discussed....
Uldahl, Kristine Buch; Wu, Linping; Hall, Arnaldur; Papathanasiou, Pavlos; Peng, Xu; Moghimi, Seyed Moein
Viruses from the third domain of life, Archaea, exhibit unusual features including extreme stability that allow their survival in harsh environments. In addition, these species have never been reported to integrate into human or any other eukaryotic genomes, and could thus serve for exploration of novel medical nanoplatforms. Here, we selected two archaeal viruses Sulfolobus monocaudavirus 1 (SMV1) and Sulfolobus spindle shaped virus 2 (SSV2) owing to their unique spindle shape, hyperthermostable and acid-resistant nature and studied their interaction with mammalian cells. Accordingly, we followed viral uptake, intracellular trafficking and cell viability in human endothelial cells of brain (hCMEC/D3 cells) and umbilical vein (HUVEC) origin. Whereas SMV1 is efficiently internalized into both types of human cells, SSV2 differentiates between HUVECs and hCMEC/D3 cells, thus opening a path for selective cell targeting. On internalization, both viruses localize to the lysosomal compartments. Neither SMV1, nor SSV2 induced any detrimental effect on cell morphology, plasma membrane and mitochondrial functionality. This is the first study demonstrating recognition of archaeal viruses by eukaryotic cells which provides good basis for future exploration of archaeal viruses in bioengineering and development of multifunctional vectors. PMID:27892499
Klionsky, Daniel J.; Tomashek, John J.
Describes a cooperative, interactive problem-solving exercise for studying eukaryotic cell structure and function. Highlights the dynamic aspects of movement through the cell. Contains 15 references. (WRM)
Zhang, Yan-Qiong; Wen, Jian-Fan
microRNAs (miRNAs) in higher multicellular eukaryotes have been extensively studied in recent years. Great progresses have also been achieved for miRNAs in unicellular eukaryotes. All these studies not only enrich our knowledge about the complex expression regulation system in diverse organisms, but also have evolutionary significance for understanding the origin of this system. In this review, Authors summarize the recent advance in the studies of miRNA in unicellular eukaryotes, including that on the most primitive unicellular eukaryote--Giardia. The origin and evolution of miRNA system is also discussed.
Full Text Available Abstract Background Information on the occurrence of sequence features in genomes is crucial to comparative genomics, evolutionary analysis, the analyses of regulatory sequences and the quantitative evaluation of sequences. Computing the frequencies and the occurrences of a pattern in complete genomes is time-consuming. Results The proposed database provides information about sequence features generated by exhaustively computing the sequences of the complete genome. The repetitive elements in the eukaryotic genomes, such as LINEs, SINEs, Alu and LTR, are obtained from Repbase. The database supports various complete genomes including human, yeast, worm, and 128 microbial genomes. Conclusions This investigation presents and implements an efficiently computational approach to accumulate the occurrences of the oligonucleotides or patterns in complete genomes. A database is established to maintain the information of the sequence features, including the distributions of oligonucleotide, the gene distribution, the distribution of repetitive elements in genomes and the occurrences of the oligonucleotides. The database can provide more effective and efficient way to access the repetitive features in genomes.
DeLong, Edward F
The complete genome sequence of Thermoplasma acidophilum, an acid- and heat-loving archaeon, has recently been reported. Comparative genomic analysis of this 'extremophile' is providing new insights into the metabolic machinery, ecology and evolution of thermophilic archaea.
Teif, Vladimir B; Rippe, Karsten
A recent study of transcription regulation in Drosophila embryonic development revealed a complex non-monotonic dependence of gene expression on the distance between binding sites of repressor and activator proteins at the corresponding enhancer cis-regulatory modules (Fakhouri et al 2010 Mol. Syst. Biol. 6 341). The repressor efficiency was high at small separations, low around 30 bp, reached a maximum at 50–60 bp, and decreased at larger distances to the activator binding sites. Here, we propose a straightforward explanation for the distance dependence of repressor activity by considering the effect of the presence of a nucleosome. Using a method that considers partial unwrapping of nucleosomal DNA from the histone octamer core, we calculated the dependence of activator binding on the repressor–activator distance and found a quantitative agreement with the distance dependence reported for the Drosophila enhancer element. In addition, the proposed model offers explanations for other distance-dependent effects at eukaryotic enhancers. (communication)
Pedersen, Anders Gorm; Baldi, Pierre; Chauvin, Yves
between functional promoters has been estimated to be in the range of 30-40 kilobases. Although it is conceivable that some of these predicted promoters correspond to cryptic initiation sites that are used in vivo, it is likely that most are false positives. This suggests that it is important to carefully......Computational prediction of eukaryotic promoters from the nucleotide sequence is one of the most attractive problems in sequence analysis today, but it is also a very difficult one. Thus, current methods predict in the order of one promoter per kilobase in human DNA, while the average distance...... reconsider the biological data that forms the basis of current algorithms, and we here present a review of data that may be useful in this regard. The review covers the following topics: (1) basal transcription and core promoters, (2) activated transcription and transcription factor binding sites, (3) Cp...
Feng, Justin; Jester, Benjamin W; Tinberg, Christine E; Mandell, Daniel J; Antunes, Mauricio S; Chari, Raj; Morey, Kevin J; Rios, Xavier; Medford, June I; Church, George M; Fields, Stanley; Baker, David
Biosensors for small molecules can be used in applications that range from metabolic engineering to orthogonal control of transcription. Here, we produce biosensors based on a ligand-binding domain (LBD) by using a method that, in principle, can be applied to any target molecule. The LBD is fused to either a fluorescent protein or a transcriptional activator and is destabilized by mutation such that the fusion accumulates only in cells containing the target ligand. We illustrate the power of this method by developing biosensors for digoxin and progesterone. Addition of ligand to yeast, mammalian, or plant cells expressing a biosensor activates transcription with a dynamic range of up to ~100-fold. We use the biosensors to improve the biotransformation of pregnenolone to progesterone in yeast and to regulate CRISPR activity in mammalian cells. This work provides a general methodology to develop biosensors for a broad range of molecules in eukaryotes.
Koslover, Elena; Mogre, Saurabh; Chan, Caleb; Theriot, Julie
The cytoplasm of a living cell is an active environment through which intracellular components move and mix. We explore, using theoretical modeling coupled with microrheological measurements, the efficiency of particle dispersion via different modes of transport within this active environment. In particular, we focus on the role of cytoplasmic flow over different scales in contributing to organelle transport within two different cell types. In motile neutrophil cells, we show that bulk fluid flow associated with rapid cell deformation enhances particle transport to and from the cell periphery. In narrow fungal hyphae, localized flows due to hydrodynamic entrainment are shown to contribute to optimally efficient organelle dispersion. Our results highlight the importance of non-traditional modes of transport associated with flow of the cytoplasmic fluid in the distribution of organelles throughout eukaryotic cells.
Azua-Bustos, A.; Gonzalez-Silva, C.; Mancilla, R.A.; Salas, L.; Palma, R.E.; Wynne, J.J.; McKay, C.P.; Vicuna, R.
Caves offer a stable and protected environment from harsh and changing outside prevailing conditions. Hence, they represent an interesting habitat for studying life in extreme environments. Here, we report the presence of a member of the ancient eukaryote red algae Cyanidium group in a coastal cave of the hyperarid Atacama Desert. This microorganism was found to form a seemingly monospecific biofilm growing under extremely low photon flux levels. Our work suggests that this species, Cyanidium sp. Atacama, is a new member of a recently proposed novel monophyletic lineage of mesophilic "cave" Cyanidium sp., distinct from the remaining three other lineages which are all thermo-acidophilic. The cave described in this work may represent an evolutionary island for life in the midst of the Atacama Desert. ?? Springer Science + Business Media, LLC 2009.
Full Text Available In eukaryotes, short RNAs guide a variety of enzymatic activities that range from RNA editing to translation repression. It is hypothesized that pre-existing proteins evolved to bind and use guide RNA during evolution. However, the capacity of modern proteins to adopt new RNA guides has never been demonstrated. Here we show that Rnt1p, the yeast orthologue of the bacterial dsRNA-specific RNase III, can bind short RNA transcripts and use them as guides for sequence-specific cleavage. Target cleavage occurred at a constant distance from the Rnt1p binding site, leaving the guide RNA intact for subsequent cleavage. Our results indicate that RNase III may trigger sequence-specific RNA degradation independent of the RNAi machinery, and they open the road for a new generation of precise RNA silencing tools that do not trigger a dsRNA-mediated immune response.
Bennetzen, Jeffrey L.; SanMiguel, Phillip; Chen, Mingsheng; Tikhonov, Alexander; Francki, Michael; Avramova, Zoya
For the most part, studies of grass genome structure have been limited to the generation of whole-genome genetic maps or the fine structure and sequence analysis of single genes or gene clusters. We have investigated large contiguous segments of the genomes of maize, sorghum, and rice, primarily focusing on intergenic spaces. Our data indicate that much (>50%) of the maize genome is composed of interspersed repetitive DNAs, primarily nested retrotransposons that in...
Matthew R Henn
Full Text Available BACKGROUND: Bacterial viruses (phages play a critical role in shaping microbial populations as they influence both host mortality and horizontal gene transfer. As such, they have a significant impact on local and global ecosystem function and human health. Despite their importance, little is known about the genomic diversity harbored in phages, as methods to capture complete phage genomes have been hampered by the lack of knowledge about the target genomes, and difficulties in generating sufficient quantities of genomic DNA for sequencing. Of the approximately 550 phage genomes currently available in the public domain, fewer than 5% are marine phage. METHODOLOGY/PRINCIPAL FINDINGS: To advance the study of phage biology through comparative genomic approaches we used marine cyanophage as a model system. We compared DNA preparation methodologies (DNA extraction directly from either phage lysates or CsCl purified phage particles, and sequencing strategies that utilize either Sanger sequencing of a linker amplification shotgun library (LASL or of a whole genome shotgun library (WGSL, or 454 pyrosequencing methods. We demonstrate that genomic DNA sample preparation directly from a phage lysate, combined with 454 pyrosequencing, is best suited for phage genome sequencing at scale, as this method is capable of capturing complete continuous genomes with high accuracy. In addition, we describe an automated annotation informatics pipeline that delivers high-quality annotation and yields few false positives and negatives in ORF calling. CONCLUSIONS/SIGNIFICANCE: These DNA preparation, sequencing and annotation strategies enable a high-throughput approach to the burgeoning field of phage genomics.
Full Text Available BACKGROUND: Pyrosequencing technology has the potential to rapidly sequence HIV-1 viral quasispecies without requiring the traditional approach of cloning. In this study, we investigated the utility of ultra-deep pyrosequencing to characterize genetic diversity of the HIV-1 gag quasispecies and assessed the possible contribution of pyrosequencing technology in studying HIV-1 biology and evolution. METHODOLOGY/PRINCIPAL FINDINGS: HIV-1 gag gene was amplified from 96 patients using nested PCR. The PCR products were cloned and sequenced using capillary based Sanger fluorescent dideoxy termination sequencing. The same PCR products were also directly sequenced using the 454 pyrosequencing technology. The two sequencing methods were evaluated for their ability to characterize quasispecies variation, and to reveal sites under host immune pressure for their putative functional significance. A total of 14,034 variations were identified by 454 pyrosequencing versus 3,632 variations by Sanger clone-based (SCB sequencing. 11,050 of these variations were detected only by pyrosequencing. These undetected variations were located in the HIV-1 Gag region which is known to contain putative cytotoxic T lymphocyte (CTL and neutralizing antibody epitopes, and sites related to virus assembly and packaging. Analysis of the positively selected sites derived by the two sequencing methods identified several differences. All of them were located within the CTL epitope regions. CONCLUSIONS/SIGNIFICANCE: Ultra-deep pyrosequencing has proven to be a powerful tool for characterization of HIV-1 genetic diversity with enhanced sensitivity, efficiency, and accuracy. It also improved reliability of downstream evolutionary and functional analysis of HIV-1 quasispecies.
Norrild, Bodil; Guldberg, Per; Ralfkiær, Elisabeth Methner
Almost all cells in the human body contain a complete copy of the genome with an estimated number of 25,000 genes. The sequences of these genes make up about three percent of the genome and comprise the inherited set of genetic information. The genome also contains information that determines whe...
Full Text Available Hemiascomycete yeasts cover an evolutionary span comparable to that of the entire phylum of chordates. Since this group currently contains the largest number of complete genome sequences it presents unique opportunities to understand the evolution of genome organization in eukaryotes. We inferred rates of genome instability on all branches of a phylogenetic tree for 11 species and calculated species-specific rates of genome rearrangements. We characterized all inversion events that occurred within synteny blocks between six representatives of the different lineages. We show that the rates of macro- and microrearrangements of gene order are correlated within individual lineages but are highly variable across different lineages. The most unstable genomes correspond to the pathogenic yeasts Candida albicans and Candida glabrata. Chromosomal maps have been intensively shuffled by numerous interchromosomal rearrangements, even between species that have retained a very high physical fraction of their genomes within small synteny blocks. Despite this intensive reshuffling of gene positions, essential genes, which cluster in low recombination regions in the genome of Saccharomyces cerevisiae, tend to remain syntenic during evolution. This work reveals that the high plasticity of eukaryotic genomes results from rearrangement rates that vary between lineages but also at different evolutionary times of a given lineage.
Cerveau, Nicolas; Jackson, Daniel J
Next-generation sequencing (NGS) technologies are arguably the most revolutionary technical development to join the list of tools available to molecular biologists since PCR. For researchers working with nonconventional model organisms one major problem with the currently dominant NGS platform (Illumina) stems from the obligatory fragmentation of nucleic acid material that occurs prior to sequencing during library preparation. This step creates a significant bioinformatic challenge for accurate de novo assembly of novel transcriptome data. This challenge becomes apparent when a variety of modern assembly tools (of which there is no shortage) are applied to the same raw NGS dataset. With the same assembly parameters these tools can generate markedly different assembly outputs. In this study we present an approach that generates an optimized consensus de novo assembly of eukaryotic coding transcriptomes. This approach does not represent a new assembler, rather it combines the outputs of a variety of established assembly packages, and removes redundancy via a series of clustering steps. We test and validate our approach using Illumina datasets from six phylogenetically diverse eukaryotes (three metazoans, two plants and a yeast) and two simulated datasets derived from metazoan reference genome annotations. All of these datasets were assembled using three currently popular assembly packages (CLC, Trinity and IDBA-tran). In addition, we experimentally demonstrate that transcripts unique to one particular assembly package are likely to be bioinformatic artefacts. For all eight datasets our pipeline generates more concise transcriptomes that in fact possess more unique annotatable protein domains than any of the three individual assemblers we employed. Another measure of assembly completeness (using the purpose built BUSCO databases) also confirmed that our approach yields more information. Our approach yields coding transcriptome assemblies that are more likely to be
Metaxa: a software tool for automated detection and discrimination among ribosomal small subunit (12S/16S/18S) sequences of archaea, bacteria, eukaryotes, mitochondria, and chloroplasts in metagenomes and environmental sequencing datasets.
Bengtsson, Johan; Eriksson, K Martin; Hartmann, Martin; Wang, Zheng; Shenoy, Belle Damodara; Grelet, Gwen-Aëlle; Abarenkov, Kessy; Petri, Anna; Rosenblad, Magnus Alm; Nilsson, R Henrik
The ribosomal small subunit (SSU) rRNA gene has emerged as an important genetic marker for taxonomic identification in environmental sequencing datasets. In addition to being present in the nucleus of eukaryotes and the core genome of prokaryotes, the gene is also found in the mitochondria of eukaryotes and in the chloroplasts of photosynthetic eukaryotes. These three sets of genes are conceptually paralogous and should in most situations not be aligned and analyzed jointly. To identify the origin of SSU sequences in complex sequence datasets has hitherto been a time-consuming and largely manual undertaking. However, the present study introduces Metaxa ( http://microbiology.se/software/metaxa/ ), an automated software tool to extract full-length and partial SSU sequences from larger sequence datasets and assign them to an archaeal, bacterial, nuclear eukaryote, mitochondrial, or chloroplast origin. Using data from reference databases and from full-length organelle and organism genomes, we show that Metaxa detects and scores SSU sequences for origin with very low proportions of false positives and negatives. We believe that this tool will be useful in microbial and evolutionary ecology as well as in metagenomics.
Qiu, Huan; Price, Dana C; Yang, Eun Chan; Yoon, Hwan Su; Bhattacharya, Debashish
Red algae (Rhodophyta) comprise a monophyletic eukaryotic lineage of ~6,500 species with a fossil record that extends back 1.2 billion years. A surprising aspect of red algal evolution is that sequenced genomes encode a relatively limited gene inventory (~5-10 thousand genes) when compared with other free-living algae or to other eukaryotes. This suggests that the common ancestor of red algae may have undergone extensive genome reduction, which can result from lineage specialization to a symbiotic or parasitic lifestyle or adaptation to an extreme or oligotrophic environment. We gathered genome and transcriptome data from a total of 14 red algal genera that represent the major branches of this phylum to study genome evolution in Rhodophyta. Analysis of orthologous gene gains and losses identifies two putative major phases of genome reduction: (i) in the stem lineage leading to all red algae resulting in the loss of major functions such as flagellae and basal bodies, the glycosyl-phosphatidylinositol anchor biosynthesis pathway, and the autophagy regulation pathway; and (ii) in the common ancestor of the extremophilic Cyanidiophytina. Red algal genomes are also characterized by the recruitment of hundreds of bacterial genes through horizontal gene transfer that have taken on multiple functions in shared pathways and have replaced eukaryotic gene homologs. Our results suggest that Rhodophyta may trace their origin to a gene depauperate ancestor. Unlike plants, it appears that a limited gene inventory is sufficient to support the diversification of a major eukaryote lineage that possesses sophisticated multicellular reproductive structures and an elaborate triphasic sexual cycle. © 2015 Phycological Society of America.
Xiomara V Thomas
Full Text Available BACKGROUND & AIMS: Telaprevir, a hepatitis C virus NS3/4A protease inhibitor has significantly improved sustained viral response rates when given in combination with pegylated interferon alfa-2a and ribavirin, compared with current standard of care in hepatitis C virus genotype 1 infected patients. In patients with a failed sustained response, the emergence of drug-resistant variants during treatment has been reported. It is unclear to what extent these variants persist in untreated patients. The aim of this study was to assess using ultra-deep pyrosequencing, whether after 4 years follow-up, the frequency of resistant variants is increased compared to pre-treatment frequencies following 14 days of telaprevir treatment. METHODS: Fifteen patients from 2 previous telaprevir phase 1 clinical studies (VX04-950-101 and VX05-950-103 were included. These patients all received telaprevir monotherapy for 14 days, and 2 patients subsequently received standard of care. Variants at previously well-characterized NS3 protease positions V36, T54, R155 and A156 were assessed at baseline and after a follow-up of 4±1.2 years by ultra-deep pyrosequencing. The prevalence of resistant variants at follow-up was compared to baseline. RESULTS: Resistance associated mutations were detectable at low frequency at baseline. In general, prevalence of resistance mutations at follow-up was not increased compared to baseline. Only one patient had a small, but statistically significant, increase in the number of V36M and T54S variants 4 years after telaprevir-dosing. CONCLUSION: In patients treated for 14 days with telaprevir monotherapy, ultra-deep pyrosequencing indicates that long-term persistence of resistant variants is rare.
Kobayashi, Naomi; Bauer, Thomas W; Togawa, Daisuke; Lieberman, Isador H; Sakai, Hiroshige; Fujishiro, Takaaki; Tuohy, Marion J; Procop, Gary W
The bacteria associated with orthopaedic infections are usually common gram-positive and gram-negative bacteria. This fundamental grouping of bacteria is a necessary first step in the selection of appropriate antibiotics. Since polymerase chain reaction (PCR) is more rapid and may be more sensitive than culture, we developed a postamplification pyrosequencing method to subcategorize bacteria based on a few nucleotide polymorphisms in the 16S rRNA gene. We validated this method using well-characterized strains of bacteria and applied it to specimens from spinal surgery cases with suspected infections. Lysates of 114 bacteria including 75 species were created following standard cultivation to obtain DNA. The DNA was amplified by a broad-range real-time PCR. The amplicons were evaluated by pyrosequencing and were classified as gram-positive, gram-negative, or acid-fast bacilli based on the first three to five nucleotides sequenced. In addition, clinical cases of suspected infection were obtained from spinal surgery. The results of the "molecular Gram stain" were compared with the results of traditional Gram stain and culture. The lysates of 107 (93.9%) of the bacteria extracts tested were appropriately categorized as gram-positive and gram-negative or as acid-fast bacilli on the basis of this assay. The sensitivity and specificity of this assay were 100% and 97.4% for gram-positive and 88.3% and 100% for gram-negative isolates. All of the five clinical samples were appropriately categorized as containing gram-positive or gram-negative bacteria with this assay. This study demonstrates that high sensitivity and specificity of a molecular gram stain may be achieved using broad-range real-time PCR and pyrosequencing.
Ingmar S Middelbos
Full Text Available BACKGROUND: Dogs suffer from many of the same maladies as humans that may be affected by the gut microbiome, but knowledge of the canine microbiome is incomplete. This work aimed to use 16S rDNA tag pyrosequencing to phylogenetically characterize hindgut microbiome in dogs and determine how consumption of dietary fiber affects community structure. PRINCIPAL FINDINGS: Six healthy adult dogs were used in a crossover design. A control diet without supplemental fiber and a beet pulp-supplemented (7.5% diet were fed. Fecal DNA was extracted and the V3 hypervariable region of the microbial 16S rDNA gene amplified using primers suitable for 454-pyrosequencing. Microbial diversity was assessed on random 2000-sequence subsamples of individual and pooled DNA samples by diet. Our dataset comprised 77,771 reads with an average length of 141 nt. Individual samples contained approximately 129 OTU, with Fusobacteria (23-40% of reads, Firmicutes (14-28% of reads and Bacteroidetes (31-34% of reads being co-dominant phyla. Feeding dietary fiber generally decreased Fusobacteria and increased Firmicutes, but these changes were not equally apparent in all dogs. UniFrac analysis revealed that structure of the gut microbiome was affected by diet and Firmicutes appeared to play a strong role in by-diet clustering. CONCLUSIONS: Our data suggest three co-dominant bacterial phyla in the canine hindgut. Furthermore, a relatively small amount of dietary fiber changed the structure of the gut microbiome detectably. Our data are among the first to characterize the healthy canine gut microbiome using pyrosequencing and provide a basis for studies focused on devising dietary interventions for microbiome-associated diseases.
Full Text Available Gut microbes are known to play various roles in insects such as digestion of inaccessible nutrients, synthesis of deficient amino acids, and interaction with ecological environments, including host plants. Here, we analyzed the gut microbiome in Hessian fly, a serious pest of wheat. A total of 3,654 high quality sequences of the V3 hypervariable region of the 16S rRNA gene were obtained through 454-pyrosequencing. From these sequences, 311 operational taxonomic units (OTUs were obtained at the >97% similarity cutoff. In the gut of 1st instar, otu01, a member of Pseudomonas, was predominant, representing 90.2% of total sequences. otu13, an unidentified genus in the Pseudomonadaceae family, represented 1.9% of total sequences. The remaining OTUs were each less than 1%. In the gut of the 2nd instar, otu01 and otu13 decreased to 85.5% and 1.5%, respectively. otu04, a member of Buttiauxella, represented 9.7% of total sequences. The remaining OTUs were each less than 1%. In the gut of the 3rd instar, otu01 and otu13 further decreased to 29.0% and 0%, respectively. otu06, otu08, and otu16, also three members of the Pseudomonadaceae family were 13.2%, 8.6%, and 2.3%, respectively. In addition, otu04 and otu14, two members of the Enterobacteriaceae family, were 4.7% and 2.5%; otu18 and otu20, two members of the Xanthomonadaceae family, were 1.3% and 1.2%, respectively; otu12, a member of Achromobacter, was 4.2%; otu19, a member of Undibacterium, was 1.4%; and otu9, otu10, and otu15, members of various families, were 6.1%, 6.3%, and 1.9%, respectively. The investigation into dynamics of Pseudomonas, the most abundant genera, revealed that its population level was at peak in freshly hatched or 1 day larvae as well as in later developmental stages, thus suggesting a prominent role for this bacterium in Hessian fly development and in its interaction with host plants. This study is the first comprehensive survey on bacteria associated with the gut of a gall
Full Text Available We have investigated the reliability and reproducibility of HCV viral quasispecies quantification by ultra-deep pyrosequencing (UDPS methods. Our study has been divided in two parts. First of all, by UDPS sequencing of clone mixes samples we have established the global noise level of UDPS and fine tuned a data treatment workflow previously optimized for HBV sequence analysis. Secondly, we have studied the reproducibility of the methodology by comparing 5 amplicons from two patient samples on three massive sequencing platforms (FLX+, FLX and Junior after applying the error filters developed from the clonal/control study. After noise filtering the UDPS results, the three replicates showed the same 12 polymorphic sites above 0.7%, with a mean CV of 4.86%. Two polymorphic sites below 0.6% were identified by two replicates and one replicate respectively. A total of 25, 23 and 26 haplotypes were detected by GS-Junior, GS-FLX and GS-FLX+. The observed CVs for the normalized Shannon entropy (Sn, the mutation frequency (Mf, and the nucleotidic diversity (Pi were 1.46%, 3.96% and 3.78%. The mean absolute difference in the two patients (5 amplicons each, in the GS-FLX and GS-FLX+, were 1.46%, 3.96% and 3.78% for Sn, Mf and Pi. No false polymorphic site was observed above 0.5%. Our results indicate that UDPS is an optimal alternative to molecular cloning for quantitative study of HCV viral quasispecies populations, both in complexity and composition. We propose an UDPS data treatment workflow for amplicons from the RNA viral quasispecies which, at a sequencing depth of at least 10,000 reads per strand, enables to obtain sequences and frequencies of consensus haplotypes above 0.5% abundance with no erroneous mutations, with high confidence, resistant mutants as minor variants at the level of 1%, with high confidence that variants are not missed, and highly confident measures of quasispecies complexity.
Full Text Available Maize (Zea mays L. is the staple diet of the native peasants in the Quechua region of the Peruvian Andes who continue growing it in small plots called chacras following ancestral traditions. The abundance and structure of bacterial communities associated with the roots of amilaceous maize has not been studied in Andean chacras. Accordingly, the main objective of this study was to describe the rhizospheric bacterial diversity of amilaceous maize grown either in the presence or the absence of bur clover cultivated in soils from the Quechua maize belt. Three 16S rRNA gene libraries, one corresponding to sequences of bacteria from bulk soil of a chacra maintained under fallow conditions, the second from the rhizosphere of maize-cultivated soils, and the third prepared from rhizospheric soil of maize cultivated in intercropping with bur clover were examined using pyrosequencing tags spanning the V4 and V5 hypervariable regions of the gene. A total of 26031 sequences were found that grouped into 5955 distinct operational taxonomic units which distributed in 309 genera. The numbers of OTUs in the libraries from the maize-cultivated soils were significantly higher than those found in the libraries from bulk soil. One hundred ninety seven genera were found in the bulk soil library and 234 and 203 were in those from the maize and maize/bur clover-cultivated soils. Sixteen out of the 309 genera had a relative abundance higher than 0.5% and the were (in decreasing order of abundance Gp4, Gp6, Flavobacterium, Subdivision3 genera incertae sedis of the Verrucomicrobia phylum, Gemmatimonas, Dechloromonas, Ohtaekwangia, Rhodoferax, Gaiella, Opitutus, Gp7, Spartobacteria genera incertae sedis, Terrimonas, Gp5, Steroidobacter and Parcubacteria genera incertae sedis. Genera Gp4 and Gp6 of the Acidobacteria, Gemmatimonas and Rhodoferax were the most abundant in bulk soil, whereas Flavobacterium, Dechloromonas and Ohtaekwangia were the main genera in the rhizosphere
Rong, Yan; Wang, Yi; Guan, Yina; Ma, Jiangtao; Cai, Zhiqiang; Yang, Guanghua; Zhao, Xiyue
Graphene (GN) and graphene oxides (GOs) are novel carbon nanomaterial; they have been attracting much attention because of their excellent properties and are widely applied in many areas, including energy, electronics, biomedicine, environmental science, etc. With industrial production and consumption of GN/GO, they will inevitably enter the soil and water environments. GN/GO may directly cause certain harm to microorganisms and lead to ecological and environmental risks. GOs are GN derivatives with abundant oxygen-containing functional groups in their graphitic backbone. The structure and chemistry of GN show obvious differences compared to those of GO, which lead to the different environmental behaviors. In this study, four different types of soil (S1-S4) were employed to investigate the effect of GN and GO on soil enzymatic activity, microbial population, and bacterial community through pyrosequencing of 16S rRNA gene amplicons. The results showed that soil enzyme activity (invertase, protease, catalase, and urease) and microbial population (bacteria, actinomycetes, and fungi) changed after GN/GO release into soils. Soil microbial community species are more rich, and the diversity also increases after GO/GN application. The phylum of Proteobacteria increased at 90 days after treatment (DAT) after GN/GO application. The phylum of Chloroflexi occurred after GN application at 90 DAT in S1 soil and reached 4.6%. Proteobacteria was the most abundant phylum in S2, S3, and S4 soils; it ranged from 43.6 to 71.4% in S2 soil, from 45.6 to 73.7% in S3 soil, and from 38.1 to 56.7% in S4 soil. The most abundant genera were Bacillus (37.5-47.0%) and Lactococcus (28.0-39.0%) in S1 soil, Lysobacter and Flavobacterium in S2 soil, Pedobacter in S3 soil, and Massilia in S4 soil. The effect of GN and GO on the soil microbial community is time-dependent, and there are no significant differences between the samples at 10 and 90 DAT.
Liu, Guozheng; Cao, Dandan; Li, Shuangshuang; Su, Aiguo; Geng, Jianing; Grover, Corrinne E; Hu, Songnian; Hua, Jinping
Mitochondria are the main manufacturers of cellular ATP in eukaryotes. The plant mitochondrial genome contains large number of foreign DNA and repeated sequences undergone frequently intramolecular recombination. Upland Cotton (Gossypium hirsutum L.) is one of the main natural fiber crops and also an important oil-producing plant in the world. Sequencing of the cotton mitochondrial (mt) genome could be helpful for the evolution research of plant mt genomes. We utilized 454 technology for sequencing and combined with Fosmid library of the Gossypium hirsutum mt genome screening and positive clones sequencing and conducted a series of evolutionary analysis on Cycas taitungensis and 24 angiosperms mt genomes. After data assembling and contigs joining, the complete mitochondrial genome sequence of G. hirsutum was obtained. The completed G.hirsutum mt genome is 621,884 bp in length, and contained 68 genes, including 35 protein genes, four rRNA genes and 29 tRNA genes. Five gene clusters are found conserved in all plant mt genomes; one and four clusters are specifically conserved in monocots and dicots, respectively. Homologous sequences are distributed along the plant mt genomes and species closely related share the most homologous sequences. For species that have both mt and chloroplast genome sequences available, we checked the location of cp-like migration and found several fragments closely linked with mitochondrial genes. The G. hirsutum mt genome possesses most of the common characters of higher plant mt genomes. The existence of syntenic gene clusters, as well as the conservation of some intergenic sequences and genic content among the plant mt genomes suggest that evolution of mt genomes is consistent with plant taxonomy but independent among different species.
Full Text Available In contrast to most DNA viruses, poxviruses replicate their genomes in the cytoplasm without host involvement. We find that vaccinia virus induces cytoplasmic activation of ATR early during infection, before genome uncoating, which is unexpected because ATR plays a fundamental nuclear role in maintaining host genome integrity. ATR, RPA, INTS7, and Chk1 are recruited to cytoplasmic DNA viral factories, suggesting canonical ATR pathway activation. Consistent with this, pharmacological and RNAi-mediated inhibition of canonical ATR signaling suppresses genome replication. RPA and the sliding clamp PCNA interact with the viral polymerase E9 and are required for DNA replication. Moreover, the ATR activator TOPBP1 promotes genome replication and associates with the viral replisome component H5. Our study suggests that, in contrast to long-held beliefs, vaccinia recruits conserved components of the eukaryote DNA replication and repair machinery to amplify its genome in the host cytoplasm.
Radzvilavicius, Arunas L; Kokko, Hanna; Christie, Joshua R
Mitochondria are ATP-producing organelles of bacterial ancestry that played a key role in the origin and early evolution of complex eukaryotic cells. Most modern eukaryotes transmit mitochondrial genes uniparentally, often without recombination among genetically divergent organelles. While this asymmetric inheritance maintains the efficacy of purifying selection at the level of the cell, the absence of recombination could also make the genome susceptible to Muller's ratchet. How mitochondria escape this irreversible defect accumulation is a fundamental unsolved question. Occasional paternal leakage could in principle promote recombination, but it would also compromise the purifying selection benefits of uniparental inheritance. We assess this tradeoff using a stochastic population-genetic model. In the absence of recombination, uniparental inheritance of freely-segregating genomes mitigates mutational erosion, while paternal leakage exacerbates the ratchet effect. Mitochondrial fusion-fission cycles ensure independent genome segregation, improving purifying selection. Paternal leakage provides opportunity for recombination to slow down the mutation accumulation, but always at a cost of increased steady-state mutation load. Our findings indicate that random segregation of mitochondrial genomes under uniparental inheritance can effectively combat the mutational meltdown, and that homologous recombination under paternal leakage might not be needed. Copyright © 2017 by the Genetics Society of America.
Full Text Available Abstract Background Recent studies of 16S rRNA genes in the mammalian gut microbiota distinguished a higher Firmicutes/Bacteroidetes ratio in obese individuals compared to lean individuals. This ratio was estimated using a clonal Sanger sequencing approach which is time-consuming and requires laborious data analysis. In contrast, new high-throughput pyrosequencing technology offers an inexpensive alternative to clonal Sanger sequencing and would significantly advance our understanding of obesity via the development of a clinical diagnostic method. Here we present a cost-effective method that combines 16S rRNA pyrosequencing and DNA barcodes of the Firmicutes and Bacteroidetes 16S rRNA genes to determine the Firmicutes/Bacteroidetes ratio in the gut microbiota of obese humans. Results The main result was the identification of DNA barcodes targeting the Firmicutes and Bacteroidetes phyla. These barcodes were validated using previously published 16S rRNA gut microbiota clone libraries. In addition, an accurate F/B ratio was found when the DNA barcodes were applied to short