Full Text Available Abstract Background The concept of orthology is key to decoding evolutionary relationships among genes across different species using comparative genomics. QuartetS is a recently reported algorithm for large-scale orthology detection. Based on the well-established evolutionary principle that gene duplication events discriminate paralogous from orthologous genes, QuartetS has been shown to improve orthology detection accuracy while maintaining computational efficiency. Description QuartetS-DB is a new orthology database constructed using the QuartetS algorithm. The database provides orthology predictions among 1621 complete genomes (1365 bacterial, 92 archaeal, and 164 eukaryotic, covering more than seven million proteins and four million pairwise orthologs. It is a major source of orthologous groups, containing more than 300,000 groups of orthologous proteins and 236,000 corresponding gene trees. The database also provides over 500,000 groups of inparalogs. In addition to its size, a distinguishing feature of QuartetS-DB is the ability to allow users to select a cutoff value that modulates the balance between prediction accuracy and coverage of the retrieved pairwise orthologs. The database is accessible at https://applications.bioanalysis.org/quartetsdb. Conclusions QuartetS-DB is one of the largest orthology resources available to date. Because its orthology predictions are underpinned by evolutionary evidence obtained from sequenced genomes, we expect its accuracy to continue to increase in future releases as the genomes of additional species are sequenced.
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .
Huerta-Cepas, J.; Szklarczyk, D.; Forslund, K.; Cook, H.; Heller, D.; Walter, M.C.; Rattei, T.; Mende, D.R.; Sunagawa, S.; Kuhn, M.; Jensen, L.J.; von Mering, C.; Bork, P.
eggNOG is a public resource that provides Orthologous Groups (OGs) of proteins at different taxonomic levels, each with integrated and summarized functional annotations. Developments since the latest public release include changes to the algorithm for creating OGs across taxonomic levels, making nested groups hierarchically consistent. This allows for a better propagation of functional terms across nested OGs and led to the novel annotation of 95 890 previously uncharacterized OGs, increasing...
Huynen Martijn A
Full Text Available Abstract Background Orthology is one of the cornerstones of gene function prediction. Dividing the phylogenetic relations between genes into either orthologs or paralogs is however an oversimplification. Already in two-species gene-phylogenies, the complicated, non-transitive nature of phylogenetic relations results in inparalogs and outparalogs. For situations with more than two species we lack semantics to specifically describe the phylogenetic relations, let alone to exploit them. Published procedures to extract orthologous groups from phylogenetic trees do not allow identification of orthology at various levels of resolution, nor do they document the relations between the orthologous groups. Results We introduce "levels of orthology" to describe the multi-level nature of gene relations. This is implemented in a program LOFT (Levels of Orthology From Trees that assigns hierarchical orthology numbers to genes based on a phylogenetic tree. To decide upon speciation and gene duplication events in a tree LOFT can be instructed either to perform classical species-tree reconciliation or to use the species overlap between partitions in the tree. The hierarchical orthology numbers assigned by LOFT effectively summarize the phylogenetic relations between genes. The resulting high-resolution orthologous groups are depicted in colour, facilitating visual inspection of (large trees. A benchmark for orthology prediction, that takes into account the varying levels of orthology between genes, shows that the phylogeny-based high-resolution orthology assignments made by LOFT are reliable. Conclusion The "levels of orthology" concept offers high resolution, reliable orthology, while preserving the relations between orthologous groups. A Windows as well as a preliminary Java version of LOFT is available from the LOFT website http://www.cmbi.ru.nl/LOFT.
Full Text Available Abstract Background The ferlin gene family possesses a rare and identifying feature consisting of multiple tandem C2 domains and a C-terminal transmembrane domain. Much currently remains unknown about the fundamental function of this gene family, however, mutations in its two most well-characterised members, dysferlin and otoferlin, have been implicated in human disease. The availability of genome sequences from a wide range of species makes it possible to explore the evolution of the ferlin family, providing contextual insight into characteristic features that define the ferlin gene family in its present form in humans. Results Ferlin genes were detected from all species of representative phyla, with two ferlin subgroups partitioned within the ferlin phylogenetic tree based on the presence or absence of a DysF domain. Invertebrates generally possessed two ferlin genes (one with DysF and one without, with six ferlin genes in most vertebrates (three DysF, three non-DysF. Expansion of the ferlin gene family is evident between the divergence of lamprey (jawless vertebrates and shark (cartilaginous fish. Common to almost all ferlins is an N-terminal C2-FerI-C2 sandwich, a FerB motif, and two C-terminal C2 domains (C2E and C2F adjacent to the transmembrane domain. Preservation of these structural elements throughout eukaryotic evolution suggests a fundamental role of these motifs for ferlin function. In contrast, DysF, C2DE, and FerA are optional, giving rise to subtle differences in domain topologies of ferlin genes. Despite conservation of multiple C2 domains in all ferlins, the C-terminal C2 domains (C2E and C2F displayed higher sequence conservation and greater conservation of putative calcium binding residues across paralogs and orthologs. Interestingly, the two most studied non-mammalian ferlins (Fer-1 and Misfire in model organisms C. elegans and D. melanogaster, present as outgroups in the phylogenetic analysis, with results suggesting
Heijden, R.T.J.M. van der; Snel, B.; Noort, V. van; Huynen, M.A.
BACKGROUND: Orthology is one of the cornerstones of gene function prediction. Dividing the phylogenetic relations between genes into either orthologs or paralogs is however an oversimplification. Already in two-species gene-phylogenies, the complicated, non-transitive nature of phylogenetic
Chiba, Hirokazu; Nishide, Hiroyo; Uchiyama, Ikuo
Recently, various types of biological data, including genomic sequences, have been rapidly accumulating. To discover biological knowledge from such growing heterogeneous data, a flexible framework for data integration is necessary. Ortholog information is a central resource for interlinking corresponding genes among different organisms, and the Semantic Web provides a key technology for the flexible integration of heterogeneous data. We have constructed an ortholog database using the Semantic Web technology, aiming at the integration of numerous genomic data and various types of biological information. To formalize the structure of the ortholog information in the Semantic Web, we have constructed the Ortholog Ontology (OrthO). While the OrthO is a compact ontology for general use, it is designed to be extended to the description of database-specific concepts. On the basis of OrthO, we described the ortholog information from our Microbial Genome Database for Comparative Analysis (MBGD) in the form of Resource Description Framework (RDF) and made it available through the SPARQL endpoint, which accepts arbitrary queries specified by users. In this framework based on the OrthO, the biological data of different organisms can be integrated using the ortholog information as a hub. Besides, the ortholog information from different data sources can be compared with each other using the OrthO as a shared ontology. Here we show some examples demonstrating that the ortholog information described in RDF can be used to link various biological data such as taxonomy information and Gene Ontology. Thus, the ortholog database using the Semantic Web technology can contribute to biological knowledge discovery through integrative data analysis.
Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G
The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.
Singh Rama S
Full Text Available Abstract Background Much of the morphological diversity in eukaryotes results from differential regulation of gene expression in which transcription factors (TFs play a central role. The nematode Caenorhabditis elegans is an established model organism for the study of the roles of TFs in controlling the spatiotemporal pattern of gene expression. Using the fully sequenced genomes of three Caenorhabditid nematode species as well as genome information from additional more distantly related organisms (fruit fly, mouse, and human we sought to identify orthologous TFs and characterized their patterns of evolution. Results We identified 988 TF genes in C. elegans, and inferred corresponding sets in C. briggsae and C. remanei, containing 995 and 1093 TF genes, respectively. Analysis of the three gene sets revealed 652 3-way reciprocal 'best hit' orthologs (nematode TF set, approximately half of which are zinc finger (ZF-C2H2 and ZF-C4/NHR types and HOX family members. Examination of the TF genes in C. elegans and C. briggsae identified the presence of significant tandem clustering on chromosome V, the majority of which belong to ZF-C4/NHR family. We also found evidence for lineage-specific duplications and rapid evolution of many of the TF genes in the two species. A search of the TFs conserved among nematodes in Drosophila melanogaster, Mus musculus and Homo sapiens revealed 150 reciprocal orthologs, many of which are associated with important biological processes and human diseases. Finally, a comparison of the sequence, gene interactions and function indicates that nematode TFs conserved across phyla exhibit significantly more interactions and are enriched in genes with annotated mutant phenotypes compared to those that lack orthologs in other species. Conclusion Our study represents the first comprehensive genome-wide analysis of TFs across three nematode species and other organisms. The findings indicate substantial conservation of transcription
Lechner, Marcus; Findeiss, Sven; Steiner, Lydia; Marz, Manja; Stadler, Peter F; Prohaska, Sonja J
Orthology analysis is an important part of data analysis in many areas of bioinformatics such as comparative genomics and molecular phylogenetics. The ever-increasing flood of sequence data, and hence the rapidly increasing number of genomes that can be compared simultaneously, calls for efficient software tools as brute-force approaches with quadratic memory requirements become infeasible in practise. The rapid pace at which new data become available, furthermore, makes it desirable to compute genome-wide orthology relations for a given dataset rather than relying on relations listed in databases. The program Proteinortho described here is a stand-alone tool that is geared towards large datasets and makes use of distributed computing techniques when run on multi-core hardware. It implements an extended version of the reciprocal best alignment heuristic. We apply Proteinortho to compute orthologous proteins in the complete set of all 717 eubacterial genomes available at NCBI at the beginning of 2009. We identified thirty proteins present in 99% of all bacterial proteomes. Proteinortho significantly reduces the required amount of memory for orthology analysis compared to existing tools, allowing such computations to be performed on off-the-shelf hardware.
Full Text Available Abstract Background Orthology analysis is an important part of data analysis in many areas of bioinformatics such as comparative genomics and molecular phylogenetics. The ever-increasing flood of sequence data, and hence the rapidly increasing number of genomes that can be compared simultaneously, calls for efficient software tools as brute-force approaches with quadratic memory requirements become infeasible in practise. The rapid pace at which new data become available, furthermore, makes it desirable to compute genome-wide orthology relations for a given dataset rather than relying on relations listed in databases. Results The program Proteinortho described here is a stand-alone tool that is geared towards large datasets and makes use of distributed computing techniques when run on multi-core hardware. It implements an extended version of the reciprocal best alignment heuristic. We apply Proteinortho to compute orthologous proteins in the complete set of all 717 eubacterial genomes available at NCBI at the beginning of 2009. We identified thirty proteins present in 99% of all bacterial proteomes. Conclusions Proteinortho significantly reduces the required amount of memory for orthology analysis compared to existing tools, allowing such computations to be performed on off-the-shelf hardware.
Nichio, Bruno T L; Marchaukoski, Jeroniza Nunes; Raittz, Roberto Tadeu
Nowadays defying homology relationships among sequences is essential for biological research. Within homology the analysis of orthologs sequences is of great importance for computational biology, annotation of genomes and for phylogenetic inference. Since 2007, with the increase in the number of new sequences being deposited in large biological databases, researchers have begun to analyse computerized methodologies and tools aimed at selecting the most promising ones in the prediction of orthologous groups. Literature in this field of research describes the problems that the majority of available tools show, such as those encountered in accuracy, time required for analysis (especially in light of the increasing volume of data being submitted, which require faster techniques) and the automatization of the process without requiring manual intervention. Conducting our search through BMC, Google Scholar, NCBI PubMed, and Expasy, we examined more than 600 articles pursuing the most recent techniques and tools developed to solve most the problems still existing in orthology detection. We listed the main computational tools created and developed between 2011 and 2017, taking into consideration the differences in the type of orthology analysis, outlining the main features of each tool and pointing to the problems that each one tries to address. We also observed that several tools still use as their main algorithm the BLAST "all-against-all" methodology, which entails some limitations, such as limited number of queries, computational cost, and high processing time to complete the analysis. However, new promising tools are being developed, like OrthoVenn (which uses the Venn diagram to show the relationship of ortholog groups generated by its algorithm); or proteinOrtho (which improves the accuracy of ortholog groups); or ReMark (tackling the integration of the pipeline to turn the entry process automatic); or OrthAgogue (using algorithms developed to minimize processing
Bruno T. L. Nichio
Full Text Available Nowadays defying homology relationships among sequences is essential for biological research. Within homology the analysis of orthologs sequences is of great importance for computational biology, annotation of genomes and for phylogenetic inference. Since 2007, with the increase in the number of new sequences being deposited in large biological databases, researchers have begun to analyse computerized methodologies and tools aimed at selecting the most promising ones in the prediction of orthologous groups. Literature in this field of research describes the problems that the majority of available tools show, such as those encountered in accuracy, time required for analysis (especially in light of the increasing volume of data being submitted, which require faster techniques and the automatization of the process without requiring manual intervention. Conducting our search through BMC, Google Scholar, NCBI PubMed, and Expasy, we examined more than 600 articles pursuing the most recent techniques and tools developed to solve most the problems still existing in orthology detection. We listed the main computational tools created and developed between 2011 and 2017, taking into consideration the differences in the type of orthology analysis, outlining the main features of each tool and pointing to the problems that each one tries to address. We also observed that several tools still use as their main algorithm the BLAST “all-against-all” methodology, which entails some limitations, such as limited number of queries, computational cost, and high processing time to complete the analysis. However, new promising tools are being developed, like OrthoVenn (which uses the Venn diagram to show the relationship of ortholog groups generated by its algorithm; or proteinOrtho (which improves the accuracy of ortholog groups; or ReMark (tackling the integration of the pipeline to turn the entry process automatic; or OrthAgogue (using algorithms developed to
Lechner, Marcus; Findeiß, Sven; Steiner, Lydia; Marz, Manja; Stadler, Peter F; Prohaska, Sonja J
Abstract Background Orthology analysis is an important part of data analysis in many areas of bioinformatics such as comparative genomics and molecular phylogenetics. The ever-increasing flood of sequence data, and hence the rapidly increasing number of genomes that can be compared simultaneously, calls for efficient software tools as brute-force approaches with quadratic memory requirements become infeasible in practise. The rapid pace at which new data become available, furthermore, makes i...
Bruno T. L. Nichio; Jeroniza Nunes Marchaukoski; Roberto Tadeu Raittz
Nowadays defying homology relationships among sequences is essential for biological research. Within homology the analysis of orthologs sequences is of great importance for computational biology, annotation of genomes and for phylogenetic inference. Since 2007, with the increase in the number of new sequences being deposited in large biological databases, researchers have begun to analyse computerized methodologies and tools aimed at selecting the most promising ones in the prediction of orth...
Uchiyama, Ikuo; Mihara, Motohiro; Nishide, Hiroyo; Chiba, Hirokazu
The microbial genome database for comparative analysis (MBGD) (available at http://mbgd.genome.ad.jp/) is a comprehensive ortholog database for flexible comparative analysis of microbial genomes, where the users are allowed to create an ortholog table among any specified set of organisms. Because of the rapid increase in microbial genome data owing to the next-generation sequencing technology, it becomes increasingly challenging to maintain high-quality orthology relationships while allowing the users to incorporate the latest genomic data available into an analysis. Because many of the recently accumulating genomic data are draft genome sequences for which some complete genome sequences of the same or closely related species are available, MBGD now stores draft genome data and allows the users to incorporate them into a user-specific ortholog database using the MyMBGD functionality. In this function, draft genome data are incorporated into an existing ortholog table created only from the complete genome data in an incremental manner to prevent low-quality draft data from affecting clustering results. In addition, to provide high-quality orthology relationships, the standard ortholog table containing all the representative genomes, which is first created by the rapid classification program DomClust, is now refined using DomRefine, a recently developed program for improving domain-level clustering using multiple sequence alignment information. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Full Text Available BOG (Bacterium and virus analysis of Orthologous Groups is a package for identifying groups of differentially regulated genes in the light of gene functions for various virus and bacteria genomes. It is designed to identify Clusters of Orthologous Groups (COGs that are enriched among genes that have gone through significant changes under different conditions. This would contribute to the detection of pathogens, an important scientific research area of relevance in uncovering bioterrorism, among others. Particular statistical analyses include hypergeometric, Mann–Whitney rank sum, and gene set enrichment. Results from the analyses are organized and presented in tabular and graphical forms for ease of understanding and dissemination of results. BOG is implemented as an R-package, which is available from CRAN or can be downloaded from http://www.stat.osu.edu/~statgen/SOFTWARE/BOG/.
Sven Heinicke; Michael S Livstone; Charles Lu; Rose Oughtred; Fan Kang; Samuel V Angiuoli; Owen White; David Botstein; Kara Dolinski
Many biological databases that provide comparative genomics information and tools are now available on the internet. While certainly quite useful, to our knowledge none of the existing databases combine results from multiple comparative genomics methods with manually curated information from the literature. Here we describe the Princeton Protein Orthology Database (P-POD, http://ortholog.princeton.edu), a user-friendly database system that allows users to find and visualize the phylogenetic r...
Goh, K.-I.; Kahng, B.; Kim, D.
Owing to the recent progress in high-throughput experimental techniques, the datasets of large-scale protein interactions of prototypical multicellular species, the nematode worm Caenorhabditis elegans and the fruit fly Drosophila melanogaster, have been assayed. The datasets are obtained mainly by using the yeast hybrid method, which contains false-positive and false-negative simultaneously. Accordingly, while it is desirable to test such datasets through further wet experiments, here we invoke recent developed network theory to test such high-throughput datasets in a simple way. Based on the fact that the key biological processes indispensable to maintaining life are conserved across eukaryotic species, and the comparison of structural properties of the protein interaction networks (PINs) of the two species with those of the yeast PIN, we find that while the worm and yeast PIN datasets exhibit similar structural properties, the current fly dataset, though most comprehensively screened ever, does not reflect generic structural properties correctly as it is. The modularity is suppressed and the connectivity correlation is lacking. Addition of interologs to the current fly dataset increases the modularity and enhances the occurrence of triangular motifs as well. The connectivity correlation function of the fly, however, remains distinct under such interolog additions, for which we present a possible scenario through an in silico modeling.
Background As orthologous proteins are expected to retain function more often than other homologs, they are often used for functional annotation transfer between species. However, ortholog identification methods do not take into account changes in domain architecture, which are likely to modify a protein's function. By domain architecture we refer to the sequential arrangement of domains along a protein sequence. To assess the level of domain architecture conservation among orthologs, we carried out a large-scale study of such events between human and 40 other species spanning the entire evolutionary range. We designed a score to measure domain architecture similarity and used it to analyze differences in domain architecture conservation between orthologs and paralogs relative to the conservation of primary sequence. We also statistically characterized the extents of different types of domain swapping events across pairs of orthologs and paralogs. Results The analysis shows that orthologs exhibit greater domain architecture conservation than paralogous homologs, even when differences in average sequence divergence are compensated for, for homologs that have diverged beyond a certain threshold. We interpret this as an indication of a stronger selective pressure on orthologs than paralogs to retain the domain architecture required for the proteins to perform a specific function. In general, orthologs as well as the closest paralogous homologs have very similar domain architectures, even at large evolutionary separation. The most common domain architecture changes observed in both ortholog and paralog pairs involved insertion/deletion of new domains, while domain shuffling and segment duplication/deletion were very infrequent. Conclusions On the whole, our results support the hypothesis that function conservation between orthologs demands higher domain architecture conservation than other types of homologs, relative to primary sequence conservation. This supports the
Meiler, Arno; Klinger, Claudia; Kaufmann, Michael
The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC's NUCOCOG dataset as the largest one available for that purpose thus far. Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.
Full Text Available Abstract Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.
Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills. PMID:22958836
Rao, R Shyama Prasad; Møller, Ian Max
in proteins is currently lacking. We have therefore analyzed the occurrence and occupancy of phosphorylated sites (~ 100,281) in a large set of eukaryotic proteins (~ 22,995). Phosphorylation probability was found to be much higher in both the termini of protein sequences and this is much pronounced...... maximum randomness. An analysis of phosphorylation motifs indicated that just 40 motifs and a much lower number of associated kinases might account for nearly 50% of the known phosphorylations in eukaryotic proteins. Our results provide a broad picture of the phosphorylation sites in eukaryotic proteins.......Many recent high throughput technologies have enabled large-scale discoveries of new phosphorylation sites and phosphoproteins. Although they have provided a number of insights into protein phosphorylation and the related processes, an inclusive analysis on the nature of phosphorylated sites...
Keller, Lani C; Romijn, Edwin P; Zamora, Ivan; Yates, John R; Marshall, Wallace F
The centriole is one of the most enigmatic organelles in the cell. Centrioles are cylindrical, microtubule-based barrels found in the core of the centrosome. Centrioles also act as basal bodies during interphase to nucleate the assembly of cilia and flagella. There are currently only a handful of known centriole proteins. We used mass-spectrometry-based MudPIT (multidimensional protein identification technology) to identify the protein composition of basal bodies (centrioles) isolated from the green alga Chlamydomonas reinhardtii. This analysis detected the majority of known centriole proteins, including centrin, epsilon tubulin, and the cartwheel protein BLD10p. By combining proteomic data with information about gene expression and comparative genomics, we identified 45 cross-validated centriole candidate proteins in two classes. Members of the first class of proteins (BUG1-BUG27) are encoded by genes whose expression correlates with flagellar assembly and which therefore may play a role in ciliogenesis-related functions of basal bodies. Members of the second class (POC1-POC18) are implicated by comparative-genomics and -proteomics studies to be conserved components of the centriole. We confirmed centriolar localization for the human homologs of four candidate proteins. Three of the cross-validated centriole candidate proteins are encoded by orthologs of genes (OFD1, NPHP-4, and PACRG) implicated in mammalian ciliary function and disease, suggesting that oral-facial-digital syndrome and nephronophthisis may involve a dysfunction of centrioles and/or basal bodies. By analyzing isolated Chlamydomonas basal bodies, we have been able to obtain the first reported proteomic analysis of the centriole.
Bui, Khanh Huy; Pigino, Gaia; Ishikawa, Takashi
Based on the molecular architecture revealed by electron cryo-tomography, the mechanism of the bending motion of eukaryotic flagella/cilia is discussed. Electron cryo-tomography is a potential approach to analyzing the three-dimensional conformation of frozen hydrated biological macromolecules using electron microscopy. Since projections of each individual object illuminated from different orientations are merged, electron tomography is capable of structural analysis of such heterogeneous environments as in vivo or with polymorphism, although radiation damage and the missing wedge are severe problems. Here, recent results on the structure of eukaryotic flagella, which is an ATP-driven bending organelle, from green algae Chlamydomonas are presented. Tomographic analysis reveals asymmetric molecular arrangements, especially that of the dynein motor proteins, in flagella, giving insight into the mechanism of planar asymmetric bending motion. Methodological challenges to obtaining higher-resolution structures from this technique are also discussed
Identification of four families of yCCR4- and Mg2+-dependent endonuclease-related proteins in higher eukaryotes, and characterization of orthologs of yCCR4 with a conserved leucine-rich repeat essential for hCAF1/hPOP2 binding
Full Text Available Abstract Background The yeast yCCR4 factor belongs to the CCR4-NOT transcriptional regulatory complex, in which it interacts, through its leucine-rich repeat (LRR motif with yPOP2. Recently, yCCR4 was shown to be a component of the major cytoplasmic mRNA deadenylase complex, and to contain a fold related to the Mg2+-dependent endonuclease core. Results Here, we report the identification of nineteen yCCR4-related proteins in eukaryotes (including yeast, plants and animals, which all contain the yCCR4 endonuclease-like fold, with highly conserved CCR4-specific residues. Phylogenetic and genomic analyses show that they form four distinct families, one of which contains the yCCR4 orthologs. The orthologs in animals possess a leucine-rich repeat domain. We show, using two-hybrid and far-Western assays, that the human member binds to the human yPOP2 homologs, i.e. hCAF1 and hPOP2, in a LRR-dependent manner. Conclusions We have identified the mammalian orthologs of yCCR4 and have shown that the human member binds to the human yPOP2 homologs, thus strongly suggesting conservation of the CCR4-NOT complex from yeast to human. All members of the four identified yCCR4-related protein families show stricking conservation of the endonuclease-like catalytic motifs of the yCCR4 C-terminal domain and therefore constitute a new family of potential deadenylases in mammals.
Romano, Stefano; Fernàndez-Guerra, Antonio; Reen, F Jerry; Glöckner, Frank O; Crowley, Susan P; O'Sullivan, Orla; Cotter, Paul D; Adams, Claire; Dobson, Alan D W; O'Gara, Fergal
Strains of the Pseudovibrio genus have been detected worldwide, mainly as part of bacterial communities associated with marine invertebrates, particularly sponges. This recurrent association has been considered as an indication of a symbiotic relationship between these microbes and their host. Until recently, the availability of only two genomes, belonging to closely related strains, has limited the knowledge on the genomic and physiological features of the genus to a single phylogenetic lineage. Here we present 10 newly sequenced genomes of Pseudovibrio strains isolated from marine sponges from the west coast of Ireland, and including the other two publicly available genomes we performed an extensive comparative genomic analysis. Homogeneity was apparent in terms of both the orthologous genes and the metabolic features shared amongst the 12 strains. At the genomic level, a key physiological difference observed amongst the isolates was the presence only in strain P. axinellae AD2 of genes encoding proteins involved in assimilatory nitrate reduction, which was then proved experimentally. We then focused on studying those systems known to be involved in the interactions with eukaryotic and prokaryotic cells. This analysis revealed that the genus harbors a large diversity of toxin-like proteins, secretion systems and their potential effectors. Their distribution in the genus was not always consistent with the phylogenetic relationship of the strains. Finally, our analyses identified new genomic islands encoding potential toxin-immunity systems, previously unknown in the genus. Our analyses shed new light on the Pseudovibrio genus, indicating a large diversity of both metabolic features and systems for interacting with the host. The diversity in both distribution and abundance of these systems amongst the strains underlines how metabolically and phylogenetically similar bacteria may use different strategies to interact with the host and find a niche within its
Skipper, Martin; Johansen, Louise Buchholt; Pedersen, Kim B.
Studies have shown that several plant species posses AGAMOUS (AG) and SEEDSTICK (STK) orthologs. These genes are part of the so-called C- and D MADS-box gene lineages and play key roles in ovule development in Arabidopsis thaliana. We have cloned an AG- and STK ortholog in the orchid Dendrobium...
Full Text Available Abstract Background Plant inducible immunity includes the accumulation of a set of defense proteins during infection called pathogenesis-related (PR proteins, which are grouped into families termed PR-1 to PR-17. The PR-5 family is composed of thaumatin-like proteins (TLPs, which are responsive to biotic and abiotic stress and are widely studied in plants. TLPs were also recently discovered in fungi and animals. In the poplar genome, TLPs are over-represented compared with annual species and their transcripts strongly accumulate during stress conditions. Results Our analysis of the poplar TLP family suggests that the expansion of this gene family was followed by diversification, as differences in expression patterns and predicted properties correlate with phylogeny. In particular, we identified a clade of poplar TLPs that cluster to a single 350 kb locus of chromosome I and that are up-regulated by poplar leaf rust infection. A wider phylogenetic analysis of eukaryote TLPs - including plant, animal and fungi sequences - shows that TLP gene content and diversity increased markedly during land plant evolution. Mapping the reported functions of characterized TLPs to the eukaryote phylogenetic tree showed that antifungal or glycan-lytic properties are widespread across eukaryote phylogeny, suggesting that these properties are shared by most TLPs and are likely associated with the presence of a conserved acidic cleft in their 3D structure. Also, we established an exhaustive catalog of TLPs with atypical architectures such as small-TLPs, TLP-kinases and small-TLP-kinases, which have potentially developed alternative functions (such as putative receptor kinases for pathogen sensing and signaling. Conclusion Our study, based on the most recent plant genome sequences, provides evidence for TLP gene family diversification during land plant evolution. We have shown that the diverse functions described for TLPs are not restricted to specific clades but seem
Huma, Tayyaba; Maryam, Arooma; Rehman, Shahid Ur; Qamar, Muhammad Tahir Ul; Shaheen, Tayyaba; Haque, Asma; Shaheen, Bushra
Alpha amylase family is generally defined as a group of enzymes that can hydrolyse and transglycosylase α-(1, 4) or α-(1, 6) glycosidic bonds along with the preservation of anomeric configuration. For the comparative analysis of alpha amylase family, nucleotide sequences of seven thermo stable organisms of Kingdom Archea i.e. Pyrococcus furiosus (100-105°C), Kingdom Prokaryotes i.e. Bacillus licheniformis (90-95°C), Geobacillus stearothermophilus (75°C), Bacillus amyloliquefaciens (72°C), Bacillus subtilis (70°C) and Bacillus KSM K38 (55°C) and Eukaryotes i.e. Aspergillus oryzae (60°C) were selected from NCBI. Primary structure composition analysis and Conserved sequence analysis were conducted through Bio Edit tools. Results from BioEdit shown only three conserved regions of base pairs and least similarity in MSA of the above mentioned alpha amylases. In Mega 5.1 Phylogeny of thermo stable alpha amylases of Kingdom Archea, Prokaryotes and Eukaryote was handled by Neighbor-Joining (NJ) algorithm. Mega 5.1 phylogenetic results suggested that alpha amylases of thermo stable organisms i.e. Pyrococcus furiosus (100-105°C), Bacillus licheniformis (90-95°C), Geobacillus stearothermophilus (75°C) and Bacillus amyloliquefaciens (72°C) are more distantly related as compared to less thermo stable organisms. By keeping in mind the characteristics of most thermo stable alpha amylases novel and improved features can be introduced in less thermo stable alpha amylases so that they become more thermo tolerant and productive for industry.
Stuart Gary W
Full Text Available Abstract Background Eukaryotic whole genome sequences are accumulating at an impressive rate. Effective methods for comparing multiple whole eukaryotic genomes on a large scale are needed. Most attempted solutions involve the production of large scale alignments, and many of these require a high stringency pre-screen for putative orthologs in order to reduce the effective size of the dataset and provide a reasonably high but unknown fraction of correctly aligned homologous sites for comparison. As an alternative, highly efficient methods that do not require the pre-alignment of operationally defined orthologs are also being explored. Results A non-alignment method based on the Singular Value Decomposition (SVD was used to compare the predicted protein complement of nine whole eukaryotic genomes ranging from yeast to man. This analysis resulted in the simultaneous identification and definition of a large number of well conserved motifs and gene families, and produced a species tree supporting one of two conflicting hypotheses of metazoan relationships. Conclusions Our SVD-based analysis of the entire protein complement of nine whole eukaryotic genomes suggests that highly conserved motifs and gene families can be identified and effectively compared in a single coherent definition space for the easy extraction of gene and species trees. While this occurs without the explicit definition of orthologs or homologous sites, the analysis can provide a basis for these definitions.
Lentendu, Guillaume; Hübschmann, Thomas; Müller, Susann; Dunker, Susanne; Buscot, François; Wilhelm, Christian
Eukaryotic unicellular organisms are an important part of the soil microbial community, but they are often neglected in soil functional microbial diversity analysis, principally due to the absence of specific investigation methods in the special soil environment. In this study we used a method based on high-density centrifugation to specifically isolate intact algal and yeast cells, with the aim to analyze them with flow cytometry and sort them for further molecular analysis such as deep sequencing. Recovery efficiency was tested at low abundance levels that fit those in natural environments (10(4) to 10(6) cells per g soil). Five algae and five yeast morphospecies isolated from soil were used for the testing. Recovery efficiency was between 1.5 to 43.16% and 2 to 30.2%, respectively, and was dependent on soil type for three of the algae. Control treatments without soil showed that the majority of cells were lost due to the method itself (58% and 55.8% respectively). However, the cell extraction technique did not much compromise cell vitality because a fluorescein di-acetate assay indicated high viability percentages (73.3% and 97.2% of cells, respectively). The low abundant algae and yeast morphospecies recovered from soil were cytometrically analyzed and sorted. Following, their DNA was isolated and amplified using specific primers. The developed workflow enables isolation and enrichment of intact autotrophic and heterotrophic soil unicellular eukaryotes from natural environments for subsequent application of deep sequencing technologies. Copyright © 2013 Elsevier B.V. All rights reserved.
Sverdlov Alexander V
Full Text Available Abstract Background The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. Results We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens, one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe, and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or ~54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of ~20% of the KOG set. This conserved portion of the
John A Capra
Full Text Available The evolutionary history of a protein reflects the functional history of its ancestors. Recent phylogenetic studies identified distinct evolutionary signatures that characterize proteins involved in cancer, Mendelian disease, and different ontogenic stages. Despite the potential to yield insight into the cellular functions and interactions of proteins, such comparative phylogenetic analyses are rarely performed, because they require custom algorithms. We developed ProteinHistorian to make tools for performing analyses of protein origins widely available. Given a list of proteins of interest, ProteinHistorian estimates the phylogenetic age of each protein, quantifies enrichment for proteins of specific ages, and compares variation in protein age with other protein attributes. ProteinHistorian allows flexibility in the definition of protein age by including several algorithms for estimating ages from different databases of evolutionary relationships. We illustrate the use of ProteinHistorian with three example analyses. First, we demonstrate that proteins with high expression in human, compared to chimpanzee and rhesus macaque, are significantly younger than those with human-specific low expression. Next, we show that human proteins with annotated regulatory functions are significantly younger than proteins with catalytic functions. Finally, we compare protein length and age in many eukaryotic species and, as expected from previous studies, find a positive, though often weak, correlation between protein age and length. ProteinHistorian is available through a web server with an intuitive interface and as a set of command line tools; this allows biologists and bioinformaticians alike to integrate these approaches into their analysis pipelines. ProteinHistorian's modular, extensible design facilitates the integration of new datasets and algorithms. The ProteinHistorian web server, source code, and pre-computed ages for 32 eukaryotic genomes are
Cenci, Albero; Guignon, Valentin; Roux, Nicolas; Rouard, Mathieu
Identifying the molecular mechanisms underlying tolerance to abiotic stresses is important in crop breeding. A comprehensive understanding of the gene families associated with drought tolerance is therefore highly relevant. NAC transcription factors form a large plant-specific gene family involved in the regulation of tissue development and responses to biotic and abiotic stresses. The main goal of this study was to set up a framework of orthologous groups determined by an expert sequence comparison of NAC genes from both monocots and dicots. In order to clarify the orthologous relationships among NAC genes of different species, we performed an in-depth comparative study of four divergent taxa, in dicots and monocots, whose genomes have already been completely sequenced: Arabidopsis thaliana, Vitis vinifera, Musa acuminata and Oryza sativa. Due to independent evolution, NAC copy number is highly variable in these plant genomes. Based on an expert NAC sequence comparison, we propose forty orthologous groups of NAC sequences that were probably derived from an ancestor gene present in the most recent common ancestor of dicots and monocots. These orthologous groups provide a curated resource for large-scale protein sequence annotation of NAC transcription factors. The established orthology relationships also provide a useful reference for NAC function studies in newly sequenced genomes such as M. acuminata and other plant species.
Wolf Yuri I; Novichkov Pavel S; Sorokin Alexander V; Makarova Kira S; Koonin Eugene V
Abstract Background An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs ...
Anita M Quintana
Full Text Available There are 8 different human syndromes caused by mutations in the cholesterol synthesis pathway. A subset of these disorders such as Smith-Lemli-Opitz disorder, are associated with facial dysmorphia. However, the molecular and cellular mechanisms underlying such facial deficits are not fully understood, primarily because of the diverse functions associated with the cholesterol synthesis pathway. Recent evidence has demonstrated that mutation of the zebrafish ortholog of HMGCR results in orofacial clefts. Here we sought to expand upon these data, by deciphering the cholesterol dependent functions of the cholesterol synthesis pathway from the cholesterol independent functions. Moreover, we utilized loss of function analysis and pharmacological inhibition to determine the extent of sonic hedgehog (Shh signaling in animals with aberrant cholesterol and/or isoprenoid synthesis. Our analysis confirmed that mutation of hmgcs1, which encodes the first enzyme in the cholesterol synthesis pathway, results in craniofacial abnormalities via defects in cranial neural crest cell differentiation. Furthermore targeted pharmacological inhibition of the cholesterol synthesis pathway revealed a novel function for isoprenoid synthesis during vertebrate craniofacial development. Mutation of hmgcs1 had no effect on Shh signaling at 2 and 3 days post fertilization (dpf, but did result in a decrease in the expression of gli1, a known Shh target gene, at 4 dpf, after morphological deficits in craniofacial development and chondrocyte differentiation were observed in hmgcs1 mutants. These data raise the possibility that deficiencies in cholesterol modulate chondrocyte differentiation by a combination of Shh independent and Shh dependent mechanisms. Moreover, our results describe a novel function for isoprenoids in facial development and collectively suggest that cholesterol regulates craniofacial development through versatile mechanisms.
Rieu, Ivo; Bots, Marc; Mariani, Celestina; Weterings, Koen A P
The Arabidopsis AINTEGUMENTA (ANT) protein is essential for proper ovule development, but functions in cell proliferation and organ growth throughout the plant. Here we report the isolation of a full-length cDNA clone from tobacco (Nicotiana tabacum L.) that encodes a protein with high similarity to ANT and is preferentially expressed in the pistil. In situ hybridization analysis on the tobacco ovary shows that the expression pattern of the corresponding gene is different from that of ANT in Arabidopsis.
Møller, Annette; Asp, Torben; Holm, Preben Bach
prokaryotic genome. Based on a protein alignment we could group the P5 ATPases into two subfamilies, P5A and P5B that, based on the number of negative charges in conserved trans-membrane segment 4, are likely to have different ion specificities. P5A ATPases are present in all eukaryotic genomes sequenced so......Eukaryotes encompass a remarkable variety of organisms and unresolved lineages. Different phylogenetic analyses have lead to conflicting conclusions as to the origin and associations between lineages and species. In this work, we investigated evolutionary relationship of a family of cation pumps...... exclusive for the secretory pathway of eukaryotes by combining the identification of lineage-specific genes with phylogenetic evolution of common genes. Sequences of P5 ATPases, which are regarded to be cation pumps in the endoplasmic reticulum (ER), were identified in all eukaryotic lineages but not in any...
Full Text Available The phytophagous mirid bugs of Apolygus lucorum, Lygus pratensis as well as three Adelphocoris spp., including Adelphocoris lineolatus, A. suturalis, and A. fasciaticollis are major pests of multiple agricultural crops in China, which have distinct geographical distribution and occurrence ranges. Like many insect species, these bugs heavily rely on olfactory cues to search preferred host plants, thereby investigation on functional co-evolution and divergence of olfactory genes seems to be necessary and is of great interest. In the odorant detection pathway, olfactory receptor co-receptor (Orco plays critical role in the perception of odors. In this study, we identified the full-length cDNA sequences encoding three putative Orcos (AsutOrco, AfasOrco, and LpraOrco in bug species of A. suturalis, A. fasciaticollis, and L. pratensis based on homology cloning method. Next, sequence alignment, membrane topology and gene structure analysis showed that these three Orco orthologs together with previously reported AlinOrco and AlucOrco shared high amino acid identities and similar topology structure, but had different gene structure especially at the length and insertion sites of introns. Furthermore, the evolutional estimation on the ratios of non-synonymous to synonymous (Ka/Ks revealed that Orco genes were under strong purifying selection, but the degrees of variation were significant different between genera. The results of quantitative real-time PCR experiments showed that these five Orco genes had a similar antennae-biased tissue expression pattern. Taking these data together, it is thought that Orco genes in the mirid species could share conserved olfaction roles but had different evolution rates. These findings would lay a foundation to further investigate the molecular mechanisms of evolutionary interactions between mirid bugs and their host plants, which might in turn contribute to the development of pest management strategy for mirid bugs.
Full Text Available Research on the relationship between changes in the gut microbiota and human disease, including AIDS, is a growing field. However, studies on the eukaryotic component of the intestinal microbiota have just begun and have not yet been conducted in HIV-infected patients. Moreover, eukaryotic community profiling is influenced by the use of different methodologies at each step of culture-independent techniques. Herein, initially, four DNA extraction protocols were compared to test the efficiency of each method in recovering eukaryotic DNA from fecal samples. Our results revealed that recovering eukaryotic components from fecal samples differs significantly among DNA extraction methods. Subsequently, the composition of the intestinal eukaryotic microbiota was evaluated in HIV-infected patients and healthy volunteers through clone sequencing, high-throughput sequencing of nuclear ribosomal internal transcribed spacers 1 (ITS1 and 2 (ITS2 amplicons and real-time PCRs. Our results revealed that not only richness (Chao-1 index and alpha diversity (Shannon diversity differ between HIV-infected patients and healthy volunteers, depending on the molecular strategy used, but also the global eukaryotic community composition, with little overlapping taxa found between techniques. Moreover, our results based on cloning libraries and ITS1/ITS2 metabarcoding sequencing showed significant differences in fungal composition between HIV-infected patients and healthy volunteers, but without distinct clusters separating the two groups. Malassezia restricta was significantly more prevalent in fecal samples of HIV-infected patients, according to cloning libraries, whereas operational taxonomic units (OTUs belonging to Candida albicans and Candida tropicalis were significantly more abundant in fecal samples of HIV-infected patients compared to healthy subjects in both ITS subregions. Finally, real-time PCR showed the presence of Microsporidia, Giardia lamblia, Blastocystis
George L Sutphin
Full Text Available The rapid advancement of technology in genomics and targeted genetic manipulation has made comparative biology an increasingly prominent strategy to model human disease processes. Predicting orthology relationships between species is a vital component of comparative biology. Dozens of strategies for predicting orthologs have been developed using combinations of gene and protein sequence, phylogenetic history, and functional interaction with progressively increasing accuracy. A relatively new class of orthology prediction strategies combines aspects of multiple methods into meta-tools, resulting in improved prediction performance. Here we present WORMHOLE, a novel ortholog prediction meta-tool that applies machine learning to integrate 17 distinct ortholog prediction algorithms to identify novel least diverged orthologs (LDOs between 6 eukaryotic species-humans, mice, zebrafish, fruit flies, nematodes, and budding yeast. Machine learning allows WORMHOLE to intelligently incorporate predictions from a wide-spectrum of strategies in order to form aggregate predictions of LDOs with high confidence. In this study we demonstrate the performance of WORMHOLE across each combination of query and target species. We show that WORMHOLE is particularly adept at improving LDO prediction performance between distantly related species, expanding the pool of LDOs while maintaining low evolutionary distance and a high level of functional relatedness between genes in LDO pairs. We present extensive validation, including cross-validated prediction of PANTHER LDOs and evaluation of evolutionary divergence and functional similarity, and discuss future applications of machine learning in ortholog prediction. A WORMHOLE web tool has been developed and is available at http://wormhole.jax.org/.
Sutphin, George L.; Mahoney, J. Matthew; Sheppard, Keith; Walton, David O.; Korstanje, Ron
The rapid advancement of technology in genomics and targeted genetic manipulation has made comparative biology an increasingly prominent strategy to model human disease processes. Predicting orthology relationships between species is a vital component of comparative biology. Dozens of strategies for predicting orthologs have been developed using combinations of gene and protein sequence, phylogenetic history, and functional interaction with progressively increasing accuracy. A relatively new class of orthology prediction strategies combines aspects of multiple methods into meta-tools, resulting in improved prediction performance. Here we present WORMHOLE, a novel ortholog prediction meta-tool that applies machine learning to integrate 17 distinct ortholog prediction algorithms to identify novel least diverged orthologs (LDOs) between 6 eukaryotic species—humans, mice, zebrafish, fruit flies, nematodes, and budding yeast. Machine learning allows WORMHOLE to intelligently incorporate predictions from a wide-spectrum of strategies in order to form aggregate predictions of LDOs with high confidence. In this study we demonstrate the performance of WORMHOLE across each combination of query and target species. We show that WORMHOLE is particularly adept at improving LDO prediction performance between distantly related species, expanding the pool of LDOs while maintaining low evolutionary distance and a high level of functional relatedness between genes in LDO pairs. We present extensive validation, including cross-validated prediction of PANTHER LDOs and evaluation of evolutionary divergence and functional similarity, and discuss future applications of machine learning in ortholog prediction. A WORMHOLE web tool has been developed and is available at http://wormhole.jax.org/. PMID:27812085
Full Text Available Abstract Background The outer membranes of mitochondria are thought to be homologous to the outer membranes of Gram negative bacteria, which contain 100's of distinct families of β-barrel membrane proteins (BOMPs often forming channels for transport of nutrients or drugs. However, only four families of mitochondrial BOMPs (MBOMPs have been confirmed to date. Although estimates as high as 100 have been made in the past, the number of yet undiscovered MBOMPs is an open question. Fortunately, the recent discovery of a membrane integration signal (the β-signal for MBOMPs gave us an opportunity to look for undiscovered MBOMPs. Results We present the results of a comprehensive survey of eukaryotic protein sequences intended to identify new MBOMPs. Our search employs recent results on β-signals as well as structural information and a novel BOMP predictor trained on both bacterial and mitochondrial BOMPs. Our principal finding is circumstantial evidence suggesting that few MBOMPs remain to be discovered, if one assumes that, like known MBOMPs, novel MBOMPs will be monomeric and β-signal dependent. In addition to this, our analysis of MBOMP homologs reveals some exceptions to the current model of the β-signal, but confirms its consistent presence in the C-terminal region of MBOMP proteins. We also report a β-signal independent search for MBOMPs against the yeast and Arabidopsis proteomes. We find no good candidates MBOMPs in yeast but the Arabidopsis results are less conclusive. Conclusions Our results suggest there are no remaining MBOMPs left to discover in yeast; and if one assumes all MBOMPs are β-signal dependent, few MBOMP families remain undiscovered in any sequenced organism.
Pedersen, Anders Gorm; Nielsen, Henrik
Translation in eukaryotes does not always start at the first AUG in an mRNA, implying that context information also plays a role.This makes prediction of translation initiation sites a non-trivial task, especially when analysing EST and genome data where the entire mature mRNA sequence is not known...
Germot, A; Philippe, H
Trichomonads, together with diplomonads and microsporidia, emerge at the base of the eukaryotic tree, on the basis of the small subunit rRNA phylogeny. However, phylogenies based on protein sequences such as tubulin are markedly different with these protists emerging much later. We have investigated 70 kDa heat-shock protein (HSP70), which could be a reliable phylogenetic marker. In eukaryotes, HSP70s are found in cytosol, endoplasmic reticulum, and organelles (mitochondria and chloroplasts). In Trichomonas vaginalis we identified nine different HSP70-encoding genes and sequenced three nearly complete cDNAs corresponding to cytosolic, endoplasmic reticulum, and mitochondrial-type HSP70. Phylogenies of eukaryotes were reconstructed using the classical methods while varying the number of species and characters considered. Almost all the undoubtedly monophyletic groups, defined by ultrastructural characters, were recovered. However, due to the long branch attraction phenomenon, the evolutionary rates were the main factor determining the position of species, even with the use of a close outgroup, which is an important advantage of HSP70 with respect to many other markers. Numerous variable sites are peculiar to Trichomonas and probably generated the artefactual placement of this species at the base of the eukaryotes or as the sister group of fast-evolving species. The inter-phyla relationships were not well supported and were sensitive to the reconstruction method, the number of species; and the quantity of information used. This lack of resolution could be explained by the very rapid diversification of eukaryotes, likely after the mitochondrial endosymbiosis.
Dessimoz, Christophe; Boeckmann, Brigitte; Roth, Alexander C J; Gonnet, Gaston H
Correct orthology assignment is a critical prerequisite of numerous comparative genomics procedures, such as function prediction, construction of phylogenetic species trees and genome rearrangement analysis. We present an algorithm for the detection of non-orthologs that arise by mistake in current orthology classification methods based on genome-specific best hits, such as the COGs database. The algorithm works with pairwise distance estimates, rather than computationally expensive and error-prone tree-building methods. The accuracy of the algorithm is evaluated through verification of the distribution of predicted cases, case-by-case phylogenetic analysis and comparisons with predictions from other projects using independent methods. Our results show that a very significant fraction of the COG groups include non-orthologs: using conservative parameters, the algorithm detects non-orthology in a third of all COG groups. Consequently, sequence analysis sensitive to correct orthology assignments will greatly benefit from these findings.
Zhu, Yanglong; Stribinskis, Vilius; Ramos, Kenneth S.; Li, Yong
RNase MRP is a eukaryote-specific endoribonuclease that generates RNA primers for mitochondrial DNA replication and processes precursor rRNA. RNase P is a ubiquitous endoribonuclease that cleaves precursor tRNA transcripts to produce their mature 5′ termini. We found extensive sequence homology of catalytic domains and specificity domains between their RNA subunits in many organisms. In Candida glabrata, the internal loop of helix P3 is 100% conserved between MRP and P RNAs. The helix P8 of MRP RNA from microsporidia Encephalitozoon cuniculi is identical to that of P RNA. Sequence homology can be widely spread over the whole molecule of MRP RNA and P RNA, such as those from Dictyostelium discoideum. These conserved nucleotides between the MRP and P RNAs strongly support the hypothesis that the MRP RNA is derived from the P RNA molecule in early eukaryote evolution. PMID:16540690
Makarova, Kira; Wolf, Yuri; Koonin, Eugene
With the continuously accelerating genome sequencing from diverse groups of archaea and bacteria, accurate identification of gene orthology and availability of readily expandable clusters of orthologous genes are essential for the functional annotation of new genomes. We report an update of the collection of archaeal Clusters of Orthologous Genes (arCOGs) to cover, on average, 91% of the protein-coding genes in 168 archaeal genomes. The new arCOGs were constructed using refined algorithms for...
Ashby, Jamie A.; Stevenson, Clare E. M.; Maule, Andrew J.; Lawson, David M.
Crystals of N-terminally truncated eIF4E from pea were obtained and X-ray data were recorded in-house to a resolution of 2.2 Å. Crystals of an N-terminally truncated 20 kDa fragment of Pisum sativum eIF4E (ΔN-eIF4E) were grown by vapour diffusion. X-ray data were recorded to a resolution of 2.2 Å from a single crystal in-house. Indexing was consistent with primitive monoclinic symmetry and solvent-content estimations suggested that between four and nine copies of the eIF4E fragment were possible per crystallographic asymmetric unit. eIF4E is an essential component of the eukaryotic translation machinery and recent studies have shown that point mutations of plant eIF4Es can confer resistance to potyvirus infection
Ariyarathna, H A Chandima K; Oldach, Klaus H; Francki, Michael G
Although the HKT transporter genes ascertain some of the key determinants of crop salt tolerance mechanisms, the diversity and functional role of group II HKT genes are not clearly understood in bread wheat. The advanced knowledge on rice HKT and whole genome sequence was, therefore, used in comparative gene analysis to identify orthologous wheat group II HKT genes and their role in trait variation under different saline environments. The four group II HKTs in rice identified two orthologous gene families from bread wheat, including the known TaHKT2;1 gene family and a new distinctly different gene family designated as TaHKT2;2. A single copy of TaHKT2;2 was found on each homeologous chromosome arm 7AL, 7BL and 7DL and each gene was expressed in leaf blade, sheath and root tissues under non-stressed and at 200 mM salt stressed conditions. The proteins encoded by genes of the TaHKT2;2 family revealed more than 93% amino acid sequence identity but ≤52% amino acid identity compared to the proteins encoded by TaHKT2;1 family. Specifically, variations in known critical domains predicted functional differences between the two protein families. Similar to orthologous rice genes on chromosome 6L, TaHKT2;1 and TaHKT2;2 genes were located approximately 3 kb apart on wheat chromosomes 7AL, 7BL and 7DL, forming a static syntenic block in the two species. The chromosomal region on 7AL containing TaHKT2;1 7AL-1 co-located with QTL for shoot Na(+) concentration and yield in some saline environments. The differences in copy number, genes sequences and encoded proteins between TaHKT2;2 homeologous genes and other group II HKT gene families within and across species likely reflect functional diversity for ion selectivity and transport in plants. Evidence indicated that neither TaHKT2;2 nor TaHKT2;1 were associated with primary root Na(+) uptake but TaHKT2;1 may be associated with trait variation for Na(+) exclusion and yield in some but not all saline environments.
Fagerlund, Robert D; Perederina, Anna; Berezin, Igor; Krasilnikov, Andrey S
Ribonuclease (RNase) P and RNase MRP are closely related catalytic ribonucleoproteins involved in the metabolism of a wide range of RNA molecules, including tRNA, rRNA, and some mRNAs. The catalytic RNA component of eukaryotic RNase P retains the core elements of the bacterial RNase P ribozyme; however, the peripheral RNA elements responsible for the stabilization of the global architecture are largely absent in the eukaryotic enzyme. At the same time, the protein makeup of eukaryotic RNase P is considerably more complex than that of the bacterial RNase P. RNase MRP, an essential and ubiquitous eukaryotic enzyme, has a structural organization resembling that of eukaryotic RNase P, and the two enzymes share most of their protein components. Here, we present the results of the analysis of interactions between the largest protein component of yeast RNases P/MRP, Pop1, and the RNA moieties of the enzymes, discuss structural implications of the results, and suggest that Pop1 plays the role of a scaffold for the stabilization of the global architecture of eukaryotic RNase P RNA, substituting for the network of RNA-RNA tertiary interactions that maintain the global RNA structure in bacterial RNase P. © 2015 Fagerlund et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Full Text Available AtDIR1 (Defective in Induced Resistance1 is an acidic lipid transfer protein essential for systemic acquired resistance (SAR in Arabidopsis thaliana. Upon SAR induction, DIR1 moves from locally infected to distant uninfected leaves to activate defense priming; however, a molecular function for DIR1 has not been elucidated. Bioinformatic analysis and in silico homology modeling identified putative AtDIR1 orthologs in crop species, revealing conserved protein motifs within and outside of DIR1’s central hydrophobic cavity. In vitro assays to compare the capacity of recombinant AtDIR1 and targeted AtDIR1-variant proteins to bind the lipophilic probe TNS (6,P-toluidinylnaphthalene-2-sulfonate provided evidence that conserved leucine 43 and aspartic acid 39 contribute to the size of the DIR1 hydrophobic cavity and possibly hydrophobic ligand binding. An Arabidopsis–cucumber SAR model was developed to investigate the conservation of DIR1 function in cucumber (Cucumis sativus, and we demonstrated that phloem exudates from SAR-induced cucumber rescued the SAR defect in the Arabidopsis dir1-1 mutant. Additionally, an AtDIR1 antibody detected a protein of the same size as AtDIR1 in SAR-induced cucumber phloem exudates, providing evidence that DIR1 function during SAR is conserved in Arabidopsis and cucumber. In vitro TNS displacement assays demonstrated that recombinant AtDIR1 did not bind the SAR signals azelaic acid (AzA, glycerol-3-phosphate or pipecolic acid. However, recombinant CsDIR1 and CsDIR2 interacted weakly with AzA and pipecolic acid. Bioinformatic and functional analyses using the Arabidopsis–cucumber SAR model provide evidence that DIR1 orthologs exist in tobacco, tomato, cucumber, and soybean, and that DIR1-mediated SAR signaling is conserved in Arabidopsis and cucumber.
Bitard-Feildel, Tristan; Kemena, Carsten; Greenwood, Jenny M; Bornberg-Bauer, Erich
Background Orthologous protein detection software mostly uses pairwise comparisons of amino-acid sequences to assert whether two proteins are orthologous or not. Accordingly, when the number of sequences for comparison increases, the number of comparisons to compute grows in a quadratic order. A current challenge of bioinformatic research, especially when taking into account the increasing number of sequenced organisms available, is to make this ever-growing number of comparisons computationa...
Erives, Albert J
While the genomes of eukaryotes and Archaea both encode the histone-fold domain, only eukaryotes encode the core histone paralogs H2A, H2B, H3, and H4. With DNA, these core histones assemble into the nucleosomal octamer underlying eukaryotic chromatin. Importantly, core histones for H2A and H3 are maintained as neofunctionalized paralogs adapted for general bulk chromatin (canonical H2 and H3) or specialized chromatin (H2A.Z enriched at gene promoters and cenH3s enriched at centromeres). In this context, the identification of core histone-like "doublets" in the cytoplasmic replication factories of the Marseilleviridae (MV) is a novel finding with possible relevance to understanding the origin of eukaryotic chromatin. Here, we analyze and compare the core histone doublet genes from all known MV genomes as well as other MV genes relevant to the origin of the eukaryotic replisome. Using different phylogenetic approaches, we show that MV histone domains encode obligate H2B-H2A and H4-H3 dimers of possible proto-eukaryotic origin. MV core histone moieties form sister clades to each of the four eukaryotic clades of canonical and variant core histones. This suggests that MV core histone moieties diverged prior to eukaryotic neofunctionalizations associated with paired linear chromosomes and variant histone octamer assembly. We also show that MV genomes encode a proto-eukaryotic DNA topoisomerase II enzyme that forms a sister clade to eukaryotes. This is a relevant finding given that DNA topo II influences histone deposition and chromatin compaction and is the second most abundant nuclear protein after histones. The combined domain architecture and phylogenomic analyses presented here suggest that a primitive origin for MV histone genes is a more parsimonious explanation than horizontal gene transfers + gene fusions + sufficient divergence to eliminate relatedness to eukaryotic neofunctionalizations within the H2A and H3 clades without loss of relatedness to each of
Wei, Jingli; Hu, Xiaorong; Yang, Jingjing; Yang, Wencai
The genus Physalis includes a number of commercially important edible and ornamental species. Its high nutritional value and potential medicinal properties leads to the increased commercial interest in the products of this genus worldwide. However, lack of molecular markers prevents the detailed study of genetics and phylogeny in Physalis, which limits the progress of breeding. In the present study, we compared the DNA sequences between Physalis and tomato, and attempted to analyze genetic diversity in Physalis using tomato markers. Blasting 23180 DNA sequences derived from Physalis against the International Tomato Annotation Group (ITAG) Release2.3 Predicted CDS (SL2.40) discovered 3356 single-copy orthologous genes between them. A total of 38 accessions from at least six species of Physalis were subjected to genetic diversity analysis using 97 tomato markers and 25 SSR markers derived from P. peruviana. Majority (73.2%) of tomato markers could amplify DNA fragments from at least one accession of Physalis. Diversity in Physalis at molecular level was also detected. The average Nei's genetic distance between accessions was 0.3806 with a range of 0.2865 to 0.7091. These results indicated Physalis and tomato had similarity at both molecular marker and DNA sequence levels. Therefore, the molecular markers developed in tomato can be used in genetic study in Physalis.
Full Text Available The genus Physalis includes a number of commercially important edible and ornamental species. Its high nutritional value and potential medicinal properties leads to the increased commercial interest in the products of this genus worldwide. However, lack of molecular markers prevents the detailed study of genetics and phylogeny in Physalis, which limits the progress of breeding. In the present study, we compared the DNA sequences between Physalis and tomato, and attempted to analyze genetic diversity in Physalis using tomato markers. Blasting 23180 DNA sequences derived from Physalis against the International Tomato Annotation Group (ITAG Release2.3 Predicted CDS (SL2.40 discovered 3356 single-copy orthologous genes between them. A total of 38 accessions from at least six species of Physalis were subjected to genetic diversity analysis using 97 tomato markers and 25 SSR markers derived from P. peruviana. Majority (73.2% of tomato markers could amplify DNA fragments from at least one accession of Physalis. Diversity in Physalis at molecular level was also detected. The average Nei's genetic distance between accessions was 0.3806 with a range of 0.2865 to 0.7091. These results indicated Physalis and tomato had similarity at both molecular marker and DNA sequence levels. Therefore, the molecular markers developed in tomato can be used in genetic study in Physalis.
Full Text Available The suppression of amber stop codons with non-canonical amino acids (ncAAs is used for the site-specific introduction of many unusual functions into proteins. Specific orthogonal aminoacyl-tRNA synthetase (o-aaRS/amber suppressor tRNA(CUA pairs (o-pairs for the incorporation of ncAAs in S. cerevisiae were previously selected from an E. coli tyrosyl-tRNA synthetase/tRNA(CUA mutant library. Incorporation fidelity relies on the specificity of the o-aaRSs for their ncAAs and the ability to effectively discriminate against their natural substrate Tyr or any other canonical amino acid.We used o-pairs previously developed for ncAAs carrying reactive alkyne-, azido-, or photocrosslinker side chains to suppress an amber mutant of human superoxide dismutase 1 in S. cerevisiae. We found worse incorporation efficiencies of the alkyne- and the photocrosslinker ncAAs than reported earlier. In our hands, amber suppression with the ncAA containing the azido group did not occur at all. In addition to the incorporation experiments in S. cerevisiae, we analyzed the catalytic properties of the o-aaRSs in vitro. Surprisingly, all o-aaRSs showed much higher preference for their natural substrate Tyr than for any of the tested ncAAs. While it is unclear why efficiently recognized Tyr is not inserted at amber codons, we speculate that metabolically inert ncAAs accumulate in the cell, and for this reason they are incorporated despite being weak substrates for the o-aaRSs.O-pairs have been developed for a whole plethora of ncAAs. However, a systematic and detailed analysis of their catalytic properties is still missing. Our study provides a comprehensive scrutiny of o-pairs developed for the site-specific incorporation of reactive ncAAs in S. cerevisiae. It suggests that future development of o-pairs as efficient biotechnological tools will greatly benefit from sound characterization in vivo and in vitro in parallel to monitoring intracellular ncAA levels.
Christen M Klinger
Full Text Available Apicomplexa are obligate intracellular parasites that cause tremendous disease burden world-wide. They utilize a set of specialized secretory organelles in their invasive process that require delivery of components for their biogenesis and function, yet the precise mechanisms underpinning such processes remain unclear. One set of potentially important components is the multi-subunit tethering complexes (MTCs, factors increasingly implicated in all aspects of vesicle-target interactions. Prompted by the results of previous studies indicating a loss of membrane trafficking factors in Apicomplexa, we undertook a bioinformatic analysis of MTC conservation. Building on knowledge of the ancient presence of most MTC proteins, we demonstrate the near complete retention of MTCs in the newly available genomes for Guillardiatheta and Bigelowiellanatans. The latter is a key taxonomic sampling point as a basal sister taxa to the group including Apicomplexa. We also demonstrate an ancient origin of the CORVET complex subunits Vps8 and Vps3, as well as the TRAPPII subunit Tca17. Having established that the lineage leading to Apicomplexa did at one point possess the complete eukaryotic complement of MTC components, we undertook a deeper taxonomic investigation in twelve apicomplexan genomes. We observed excellent conservation of the VpsC core of the HOPS and CORVET complexes, as well as the core TRAPP subunits, but sparse conservation of TRAPPII, COG, Dsl1, and HOPS/CORVET-specific subunits. However, those subunits that we did identify appear to be expressed with similar patterns to the fully conserved MTC proteins, suggesting that they may function as minimal complexes or with analogous partners. Strikingly, we failed to identify any subunits of the exocyst complex in all twelve apicomplexan genomes, as well as the dinoflagellate Perkinsus marinus. Overall, we demonstrate reduction of MTCs in Apicomplexa and their ancestors, consistent with modification during
Full Text Available Abstract Background Alternative splicing has been reported in various eukaryotic groups including plants, apicomplexans, diatoms, amoebae, animals and fungi. However, whether widespread alternative splicing has evolved independently in the different eukaryotic groups or was inherited from their last common ancestor, and may therefore predate multicellularity, is still unknown. To better understand the origin and evolution of alternative splicing and its usage in diverse organisms, we studied alternative splicing in 12 eukaryotic species, comparing rates of alternative splicing across genes of different functional classes, cellular locations, intron/exon structures and evolutionary origins. Results For each species, we find that genes from most functional categories are alternatively spliced. Ancient genes (shared between animals, fungi and plants show high levels of alternative splicing. Genes with products expressed in the nucleus or plasma membrane are generally more alternatively spliced while those expressed in extracellular location show less alternative splicing. We find a clear correspondence between incidence of alternative splicing and intron number per gene both within and between genomes. In general, we find several similarities in patterns of alternative splicing across these diverse eukaryotes. Conclusion Along with previous studies indicating intron-rich genes with weak intron boundary consensus and complex spliceosomes in ancestral organisms, our results suggest that at least a simple form of alternative splicing may already have been present in the unicellular ancestor of plants, fungi and animals. A role for alternative splicing in the evolution of multicellularity then would largely have arisen by co-opting the preexisting process.
Bitard-Feildel, Tristan; Kemena, Carsten; Greenwood, Jenny M; Bornberg-Bauer, Erich
Orthologous protein detection software mostly uses pairwise comparisons of amino-acid sequences to assert whether two proteins are orthologous or not. Accordingly, when the number of sequences for comparison increases, the number of comparisons to compute grows in a quadratic order. A current challenge of bioinformatic research, especially when taking into account the increasing number of sequenced organisms available, is to make this ever-growing number of comparisons computationally feasible in a reasonable amount of time. We propose to speed up the detection of orthologous proteins by using strings of domains to characterize the proteins. We present two new protein similarity measures, a cosine and a maximal weight matching score based on domain content similarity, and new software, named porthoDom. The qualities of the cosine and the maximal weight matching similarity measures are compared against curated datasets. The measures show that domain content similarities are able to correctly group proteins into their families. Accordingly, the cosine similarity measure is used inside porthoDom, the wrapper developed for proteinortho. porthoDom makes use of domain content similarity measures to group proteins together before searching for orthologs. By using domains instead of amino acid sequences, the reduction of the search space decreases the computational complexity of an all-against-all sequence comparison. We demonstrate that representing and comparing proteins as strings of discrete domains, i.e. as a concatenation of their unique identifiers, allows a drastic simplification of search space. porthoDom has the advantage of speeding up orthology detection while maintaining a degree of accuracy similar to proteinortho. The implementation of porthoDom is released using python and C++ languages and is available under the GNU GPL licence 3 at http://www.bornberglab.org/pages/porthoda .
Irimia, Manuel; Rukov, Jakob Lewin; Penny, David
, and may therefore predate multicellularity, is still unknown. To better understand the origin and evolution of alternative splicing and its usage in diverse organisms, we studied alternative splicing in 12 eukaryotic species, comparing rates of alternative splicing across genes of different functional......, we find several similarities in patterns of alternative splicing across these diverse eukaryotes. CONCLUSION: Along with previous studies indicating intron-rich genes with weak intron boundary consensus and complex spliceosomes in ancestral organisms, our results suggest that at least a simple form...... of alternative splicing may already have been present in the unicellular ancestor of plants, fungi and animals. A role for alternative splicing in the evolution of multicellularity then would largely have arisen by co-opting the preexisting process....
Cells need a constant supply of precursors to enable the production of macromolecules to sustain growth and survival. Unlike metazoans, unicellular eukaryotes depend exclusively on the extracellular medium for this supply. When environmental nutrients become depleted, existing cytoplasmic components
Szelecz, Ildikó; Lösch, Sandra; Seppey, Christophe V W; Lara, Enrique; Singer, David; Sorge, Franziska; Tschui, Joelle; Perotti, M Alejandra; Mitchell, Edward A D
Criminal investigations of suspected murder cases require estimating the post-mortem interval (PMI, or time after death) which is challenging for long PMIs. Here we present the case of human remains found in a Swiss forest. We have used a multidisciplinary approach involving the analysis of bones and soil samples collected beneath the remains of the head, upper and lower body and "control" samples taken a few meters away. We analysed soil chemical characteristics, mites and nematodes (by microscopy) and micro-eukaryotes (by Illumina high throughput sequencing). The PMI estimate on hair 14 C-data via bomb peak radiocarbon dating gave a time range of 1 to 3 years before the discovery of the remains. Cluster analyses for soil chemical constituents, nematodes, mites and micro-eukaryotes revealed two clusters 1) head and upper body and 2) lower body and controls. From mite evidence, we conclude that the body was probably brought to the site after death. However, chemical analyses, nematode community analyses and the analyses of micro-eukaryotes indicate that decomposition took place at least partly on site. This study illustrates the usefulness of combining several lines of evidence for the study of homicide cases to better calibrate PMI inference tools.
Findeisen, Peggy; Mühlhausen, Stefanie; Dempewolf, Silke; Hertzog, Jonny; Zietlow, Alexander; Carlomagno, Teresa; Kollmar, Martin
Tubulins belong to the most abundant proteins in eukaryotes providing the backbone for many cellular substructures like the mitotic and meiotic spindles, the intracellular cytoskeletal network, and the axonemes of cilia and flagella. Homologs have even been reported for archaea and bacteria. However, a taxonomically broad and whole-genome-based analysis of the tubulin protein family has never been performed, and thus, the number of subfamilies, their taxonomic distribution, and the exact grouping of the supposed archaeal and bacterial homologs are unknown. Here, we present the analysis of 3,524 tubulins from 504 species. The tubulins formed six major subfamilies, α to ζ. Species of all major kingdoms of the eukaryotes encode members of these subfamilies implying that they must have already been present in the last common eukaryotic ancestor. The proposed archaeal homologs grouped together with the bacterial TubZ proteins as sister clade to the FtsZ proteins indicating that tubulins are unique to eukaryotes. Most species contained α- and/or β-tubulin gene duplicates resulting from recent branch- and species-specific duplication events. This shows that tubulins cannot be used for constructing species phylogenies without resolving their ortholog-paralog relationships. The many gene duplicates and also the independent loss of the δ-, ε-, or ζ-tubulins, which have been shown to be part of the triplet microtubules in basal bodies, suggest that tubulins can functionally substitute each other. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Xu, Yuli; Gristwood, Tamzin; Hodgson, Ben; Trinidad, Jonathan C; Albers, Sonja-Verena; Bell, Stephen D
The regulated recruitment of Cdc45 and GINS is key to activating the eukaryotic MCM(2-7) replicative helicase. We demonstrate that the homohexameric archaeal MCM helicase associates with orthologs of GINS and Cdc45 in vivo and in vitro. Association of these factors with MCM robustly stimulates the MCM helicase activity. In contrast to the situation in eukaryotes, archaeal Cdc45 and GINS form an extremely stable complex before binding MCM. Further, the archaeal GINS•Cdc45 complex contains two copies of Cdc45. Our analyses give insight into the function and evolution of the conserved core of the archaeal/eukaryotic replisome.
Trachana, Kalliopi; Larsson, Tomas A; Powell, Sean; Chen, Wei-Hua; Doerks, Tobias; Muller, Jean; Bork, Peer
The increasing number of sequenced genomes has prompted the development of several automated orthology prediction methods. Tests to evaluate the accuracy of predictions and to explore biases caused by biological and technical factors are therefore required. We used 70 manually curated families to analyze the performance of five public methods in Metazoa. We analyzed the strengths and weaknesses of the methods and quantified the impact of biological and technical challenges. From the latter part of the analysis, genome annotation emerged as the largest single influencer, affecting up to 30% of the performance. Generally, most methods did well in assigning orthologous group but they failed to assign the exact number of genes for half of the groups. The publicly available benchmark set (http://eggnog.embl.de/orthobench/) should facilitate the improvement of current orthology assignment protocols, which is of utmost importance for many fields of biology and should be tackled by a broad scientific community. Copyright © 2011 WILEY Periodicals, Inc.
Ruttink, Tom; Sterck, Lieven; Rohde, Antje
to outbreeding crop species hamper De Bruijn Graph-based de novo assembly algorithms, causing transcript fragmentation and the redundant assembly of allelic contigs. If multiple genotypes are sequenced to study genetic diversity, primary de novo assembly is best performed per genotype to limit the level......Despite current advances in next-generation sequencing data analysis procedures, de novo assembly of a reference sequence required for SNP discovery and expression analysis is still a major challenge in genetically uncharacterized, highly heterozygous species. High levels of polymorphism inherent...... of polymorphism and avoid transcript fragmentation. Here, we propose an Orthology Guided Assembly procedure that first uses sequence similarity (tBLASTn) to proteins of a model species to select allelic and fragmented contigs from all genotypes and then performs CAP3 clustering on a gene-by-gene basis. Thus, we...
Goodsell, David S.
Diverse biological data may be used to create illustrations of molecules in their cellular context. This report describes the scientific results that support an illustration of a eukaryotic cell, enlarged by one million times to show the distribution and arrangement of macromolecules. The panoramic cross section includes eight panels that extend…
Bae, Albert; Westendorf, Christian; Erlenkamper, Christoph; Galland, Edouard; Franck, Carl; Bodenschatz, Eberhard; Beta, Carsten
Eukaryotic cell flattening is valuable for improving microscopic observations, ranging from bright field to total internal reflection fluorescence microscopy. In this talk, we will discuss traditional overlay techniques, and more modern, microfluidic based flattening, which provides a greater level of control. We demonstrate these techniques on the social amoebae Dictyostelium discoideum, comparing the advantages and disadvantages of each method.
Noort, V. van
This thesis focuses on developing comparative genomics methods in eukaryotes, with an emphasis on applications for gene function prediction and regulatory element detection. In the past, methods have been developed to predict functional associations between gene pairs in prokaryotes. The challenge
Full Text Available Volvox is a very interesting oogamous organism that exhibits various types of sexuality and/or sexual spheroids depending upon species or strains. However, molecular bases of such sexual reproduction characteristics have not been studied in this genus. In the model species V. carteri, an ortholog of the minus mating type-determining or minus dominance gene (MID of isogamous Chlamydomonas reinhardtii is male-specific and determines the sperm formation. Male and female genders are genetically determined (heterothallism in V. carteri, whereas in several other species of Volvox both male and female gametes (sperm and eggs are formed within the same clonal culture (homothallism. To resolve the molecular basis of the evolution of Volvox species with monoecious spheroids, we here describe a MID ortholog in the homothallic species V. africanus that produces both monoecious and male spheroids within a single clonal culture. Comparison of synonymous and nonsynonymous nucleotide substitutions in MID genes between V. africanus and heterothallic volvocacean species suggests that the MID gene of V. africanus evolved under the same degree of functional constraint as those of the heterothallic species. Based on semi quantitative reverse transcription polymerase chain reaction analyses using the asexual, male and monoecious spheroids isolated from a sexually induced V. africanus culture, the MID mRNA level was significantly upregulated in the male spheroids, but suppressed in the monoecious spheroids. These results suggest that the monoecious spheroid-specific down regulation of gene expression of the MID homolog correlates with the formation of both eggs and sperm in the same spheroid in V. africanus.
Altenhoff, Adrian M; Boeckmann, Brigitte; Capella-Gutierrez, Salvador
Achieving high accuracy in orthology inference is essential for many comparative, evolutionary and functional genomic analyses, yet the true evolutionary history of genes is generally unknown and orthologs are used for very different applications across phyla, requiring different precision-recall...
Porfírio-Sousa, Alfredo L; Ribeiro, Giulia M; Lahr, Daniel J G
Testate amoebae are eukaryotic microorganisms characterized by the presence of an external shell (test). The shell morphology is used as a diagnostic character, but discordance between morphological and molecular data has been demonstrated in groups of arcellinids (Amoebozoa), one of the principal groups of testate amoebae. Morphology of the test is supposed to differentiate genera and species and it is applied in ecological, monitoring and paleontological studies. However, if phenotype does not reflect genotype, conclusions in these types of studies become severely impaired. The objective of this work is to evaluate the morphometrical and morphological variation of the closely related and morphologically similar taxa Arcella intermedia laevis Tsyganov and Mazei, 2006 and Arcella intermedia (Deflandre 1928) Tsyganov and Mazei, 2006 in nature and in cultured individuals and see how these are correlated with molecular data. Our results demonstrate that phenotypic plasticity in Arcella intermedia make morphological distinctions impossible in both taxa. Arcella intermedia and Arcella intermedia laevis are molecularly identical for SSU rDNA and a mitochondrial molecular marker (NAD9/7). We conclude that morphological techniques alone cannot identify phenotypic plasticity from natural populations. More work is clearly needed to better understand the morphological, morphometric and molecular variability in these organisms. Copyright © 2016 Elsevier GmbH. All rights reserved.
Full Text Available The model of predominant clonal evolution (PCE proposed for micropathogens does not state that genetic exchange is totally absent, but rather, that it is too rare to break the prevalent PCE pattern. However, the actual impact of this "residual" genetic exchange should be evaluated. Multilocus Sequence Typing (MLST is an excellent tool to explore the problem. Here, we compared online available MLST datasets for seven eukaryotic microbial pathogens: Trypanosoma cruzi, the Fusarium solani complex, Aspergillus fumigatus, Blastocystis subtype 3, the Leishmania donovani complex, Candida albicans and Candida glabrata. We first analyzed phylogenetic relationships among genotypes within each dataset. Then, we examined different measures of branch support and incongruence among loci as signs of genetic structure and levels of past recombination. The analyses allow us to identify three types of genetic structure. The first was characterized by trees with well-supported branches and low levels of incongruence suggesting well-structured populations and PCE. This was the case for the T. cruzi and F. solani datasets. The second genetic structure, represented by Blastocystis spp., A. fumigatus and the L. donovani complex datasets, showed trees with weakly-supported branches but low levels of incongruence among loci, whereby genetic structuration was not clearly defined by MLST. Finally, trees showing weakly-supported branches and high levels of incongruence among loci were observed for Candida species, suggesting that genetic exchange has a higher evolutionary impact in these mainly clonal yeast species. Furthermore, simulations showed that MLST may fail to show right clustering in population datasets even in the absence of genetic exchange. In conclusion, these results make it possible to infer variable impacts of genetic exchange in populations of predominantly clonal micro-pathogens. Moreover, our results reveal different problems of MLST to determine the
Full Text Available Abstract Background Mapping of orthologous genes among species serves an important role in functional genomics by allowing researchers to develop hypotheses about gene function in one species based on what is known about the functions of orthologs in other species. Several tools for predicting orthologous gene relationships are available. However, these tools can give different results and identification of predicted orthologs is not always straightforward. Results We report a simple but effective tool, the Drosophila RNAi Screening Center Integrative Ortholog Prediction Tool (DIOPT; http://www.flyrnai.org/diopt, for rapid identification of orthologs. DIOPT integrates existing approaches, facilitating rapid identification of orthologs among human, mouse, zebrafish, C. elegans, Drosophila, and S. cerevisiae. As compared to individual tools, DIOPT shows increased sensitivity with only a modest decrease in specificity. Moreover, the flexibility built into the DIOPT graphical user interface allows researchers with different goals to appropriately 'cast a wide net' or limit results to highest confidence predictions. DIOPT also displays protein and domain alignments, including percent amino acid identity, for predicted ortholog pairs. This helps users identify the most appropriate matches among multiple possible orthologs. To facilitate using model organisms for functional analysis of human disease-associated genes, we used DIOPT to predict high-confidence orthologs of disease genes in Online Mendelian Inheritance in Man (OMIM and genes in genome-wide association study (GWAS data sets. The results are accessible through the DIOPT diseases and traits query tool (DIOPT-DIST; http://www.flyrnai.org/diopt-dist. Conclusions DIOPT and DIOPT-DIST are useful resources for researchers working with model organisms, especially those who are interested in exploiting model organisms such as Drosophila to study the functions of human disease genes.
Hu, Yanhui; Flockhart, Ian; Vinayagam, Arunachalam; Bergwitz, Clemens; Berger, Bonnie; Perrimon, Norbert; Mohr, Stephanie E
Mapping of orthologous genes among species serves an important role in functional genomics by allowing researchers to develop hypotheses about gene function in one species based on what is known about the functions of orthologs in other species. Several tools for predicting orthologous gene relationships are available. However, these tools can give different results and identification of predicted orthologs is not always straightforward. We report a simple but effective tool, the Drosophila RNAi Screening Center Integrative Ortholog Prediction Tool (DIOPT; http://www.flyrnai.org/diopt), for rapid identification of orthologs. DIOPT integrates existing approaches, facilitating rapid identification of orthologs among human, mouse, zebrafish, C. elegans, Drosophila, and S. cerevisiae. As compared to individual tools, DIOPT shows increased sensitivity with only a modest decrease in specificity. Moreover, the flexibility built into the DIOPT graphical user interface allows researchers with different goals to appropriately 'cast a wide net' or limit results to highest confidence predictions. DIOPT also displays protein and domain alignments, including percent amino acid identity, for predicted ortholog pairs. This helps users identify the most appropriate matches among multiple possible orthologs. To facilitate using model organisms for functional analysis of human disease-associated genes, we used DIOPT to predict high-confidence orthologs of disease genes in Online Mendelian Inheritance in Man (OMIM) and genes in genome-wide association study (GWAS) data sets. The results are accessible through the DIOPT diseases and traits query tool (DIOPT-DIST; http://www.flyrnai.org/diopt-dist). DIOPT and DIOPT-DIST are useful resources for researchers working with model organisms, especially those who are interested in exploiting model organisms such as Drosophila to study the functions of human disease genes.
Fritz-Laylin, Lillian K.; Prochnik, Simon E.; Ginger, Michael L.; Dacks, Joel; Carpenter, Meredith L.; Field, Mark C.; Kuo, Alan; Paredez, Alex; Chapman, Jarrod; Pham, Jonathan; Shu, Shengqiang; Neupane, Rochak; Cipriano, Michael; Mancuso, Joel; Tu, Hank; Salamov, Asaf; Lindquist, Erika; Shapiro, Harris; Lucas, Susan; Grigoriev, Igor V.; Cande, W. Zacheus; Fulton, Chandler; Rokhsar, Daniel S.; Dawson, Scott C.
Genome sequences of diverse free-living protists are essential for understanding eukaryotic evolution and molecular and cell biology. The free-living amoeboflagellate Naegleria gruberi belongs to a varied and ubiquitous protist clade (Heterolobosea) that diverged from other eukaryotic lineages over a billion years ago. Analysis of the 15,727 protein-coding genes encoded by Naegleria's 41 Mb nuclear genome indicates a capacity for both aerobic respiration and anaerobic metabolism with concomitant hydrogen production, with fundamental implications for the evolution of organelle metabolism. The Naegleria genome facilitates substantially broader phylogenomic comparisons of free-living eukaryotes than previously possible, allowing us to identify thousands of genes likely present in the pan-eukaryotic ancestor, with 40% likely eukaryotic inventions. Moreover, we construct a comprehensive catalog of amoeboid-motility genes. The Naegleria genome, analyzed in the context of other protists, reveals a remarkably complex ancestral eukaryote with a rich repertoire of cytoskeletal, sexual, signaling, and metabolic modules.
Phospholipid biosynthetic enzymes produce diverse molecular structures and are often present in multiple forms encoded by different genes. This work utilizes comparative genomics and phylogenetics for exploring the distribution, structure and evolution of phospholipid biosynthetic genes and pathways in 26 eukaryotic genomes. Although the basic structure of the pathways was formed early in eukaryotic evolution, the emerging picture indicates that individual enzyme families followed unique evolutionary courses. For example, choline and ethanolamine kinases and cytidylyltransferases emerged in ancestral eukaryotes, whereas, multiple forms of the corresponding phosphatidyltransferases evolved mainly in a lineage specific manner. Furthermore, several unicellular eukaryotes maintain bacterial-type enzymes and reactions for the synthesis of phosphatidylglycerol and cardiolipin. Also, base-exchange phosphatidylserine synthases are widespread and ancestral enzymes. The multiplicity of phospholipid biosynthetic enzymes has been largely generated by gene expansion in a lineage specific manner. Thus, these observations suggest that phospholipid biosynthesis has been an actively evolving system. Finally, comparative genomic analysis indicates the existence of novel phosphatidyltransferases and provides a candidate for the uncharacterized eukaryotic phosphatidylglycerol phosphate phosphatase.
Horner David S
Full Text Available Abstract Background Recent discoveries have highlighted the fact that alternative splicing and alternative transcripts are the rule, rather than the exception, in metazoan genes. Since multiple transcript and protein variants expressed by the same gene are, by definition, structurally distinct and need not to be functionally equivalent, the concept of gene orthology should be extended to the transcript level in order to describe evolutionary relationships between structurally similar transcript variants. In other words, the identification of true orthology relationships between gene products now should progress beyond primary sequence and "splicing orthology", consisting in ancestrally shared exon-intron structures, is required to define orthologous isoforms at transcript level. Results As a starting step in this direction, in this work we performed a large scale human- mouse gene comparison with a twofold goal: first, to assess if and to which extent traditional gene annotations such as RefSeq capture genuine splicing orthology; second, to provide a more detailed annotation and quantification of true human-mouse orthologous transcripts defined as transcripts of orthologous genes exhibiting the same splicing patterns. Conclusions We observed an identical exon/intron structure for 32% of human and mouse orthologous genes. This figure increases to 87% using less stringent criteria for gene structure similarity, thus implying that for about 13% of the human RefSeq annotated genes (and about 25% of the corresponding transcripts we could not identify any mouse transcript showing sufficient similarity to be confidently assigned as a splicing ortholog. Our data suggest that current gene and transcript data may still be rather incomplete - with several splicing variants still unknown. The observation that alternative splicing produces large numbers of alternative transcripts and proteins, some of them conserved across species and others truly species
Burgers, Peter M J; Kunkel, Thomas A
This review focuses on the biogenesis and composition of the eukaryotic DNA replication fork, with an emphasis on the enzymes that synthesize DNA and repair discontinuities on the lagging strand of the replication fork. Physical and genetic methodologies aimed at understanding these processes are discussed. The preponderance of evidence supports a model in which DNA polymerase ε (Pol ε) carries out the bulk of leading strand DNA synthesis at an undisturbed replication fork. DNA polymerases α and δ carry out the initiation of Okazaki fragment synthesis and its elongation and maturation, respectively. This review also discusses alternative proposals, including cellular processes during which alternative forks may be utilized, and new biochemical studies with purified proteins that are aimed at reconstituting leading and lagging strand DNA synthesis separately and as an integrated replication fork.
Full Text Available The glycosyltransferase family 43 (GT43 has been suggested to be involved in the synthesis of xylans in plant cell walls and proteoglycans in animals. Very recently GT43 family was also found in Charophycean green algae (CGA, the closest relatives of extant land plants. Here we present evidence that non-plant and non-animal early eukaryotes such as fungi, Haptophyceae, Choanoflagellida, Ichthyosporea and Haptophyceae also have GT43-like genes, which are phylogenetically close to animal GT43 genes. By mining RNA sequencing data (RNA-Seq of selected plants, we showed that CGA have evolved three major groups of GT43 genes, one orthologous to IRX14 (IRREGULAR XYLEM14, one orthologous to IRX9/IRX9L and the third one ancestral to all land plant GT43 genes. We confirmed that land plant GT43 has two major clades A and B, while in angiosperms, clade A further evolved into three subclades and the expression and motif pattern of A3 (containing IRX9 are fairly different from the other two clades likely due to rapid evolution. Our in-depth sequence analysis contributed to our overall understanding of the early evolution of GT43 family and could serve as an example for the study of other plant cell wall-related enzyme families.
Genome wide analysis of orthologous clusters is an important component of comparative genomics studies. Identifying the overlap among orthologous clusters can enable us to elucidate the function and evolution of proteins across multiple species. Here, we report a web platform named OrthoVenn that i...
Lafond, Manuel; El-Mabrouk, Nadia
A variety of methods based on sequence similarity, reconciliation, synteny or functional characteristics, can be used to infer orthology and paralogy relations between genes of a given gene family G. But is a given set C of orthology/paralogy constraints possible, i.e., can they simultaneously co-exist in an evolutionary history for G? While previous studies have focused on full sets of constraints, here we consider the general case where C does not necessarily involve a constraint for each pair of genes. The problem is subdivided in two parts: (1) Is C satisfiable, i.e. can we find an event-labeled gene tree G inducing C? (2) Is there such a G which is consistent, i.e., such that all displayed triplet phylogenies are included in a species tree? Previous results on the Graph sandwich problem can be used to answer to (1), and we provide polynomial-time algorithms for satisfiability and consistency with a given species tree. We also describe a new polynomial-time algorithm for the case of consistency with an unknown species tree and full knowledge of pairwise orthology/paralogy relationships, as well as a branch-and-bound algorithm in the case when unknown relations are present. We show that our algorithms can be used in combination with ProteinOrtho, a sequence similarity-based orthology detection tool, to extract a set of robust orthology/paralogy relationships.
Full Text Available Abstract Background With the rapid growth in the availability of genome sequence data, the automated identification of orthologous genes between species (orthologs is of fundamental importance to facilitate functional annotation and studies on comparative and evolutionary genomics. Genes with no apparent orthologs between the bovine and human genome may be responsible for major differences between the species, however, such genes are often neglected in functional genomics studies. Results A BLAST-based method was exploited to explore the current annotation and orthology predictions in Ensembl. Genes with no orthologs between the two genomes were classified into groups based on alignments, ontology, manual curation and publicly available information. Starting from a high quality and specific set of orthology predictions, as provided by Ensembl, hidden relationship between genes and genomes of different mammalian species were unveiled using a highly sensitive approach, based on sequence similarity and genomic comparison. Conclusions The analysis identified 3,801 bovine genes with no orthologs in human and 1010 human genes with no orthologs in cow, among which 411 and 43 genes, respectively, had no match at all in the other species. Most of the apparently non-orthologous genes may potentially have orthologs which were missed in the annotation process, despite having a high percentage of identity, because of differences in gene length and structure. The comparative analysis reported here identified gene variants, new genes and species-specific features and gave an overview of the other side of orthology which may help to improve the annotation of the bovine genome and the knowledge of structural differences between species.
Cole, Elisabeth B; Miller, David; Rometo, David; Greenberg, Robert M; Brömme, Dieter; Cataltepe, Sule; Pak, Stephen C; Mills, David R; Silverman, Gary A; Luke, Cliff J
Delineating the phylogenetic relationships among members of a protein family can provide a high degree of insight into the evolution of domain structure and function relationships. To identify an early metazoan member of the high molecular weight serine proteinase inhibitor (serpin) superfamily, we initiated a cDNA library screen of the cnidarian, Cyanea capillata. We identified one serpin cDNA encoding for a full-length serpin, jellypin. Phylogenetic analysis using the deduced amino acid sequence showed that jellypin was most similar to the platyhelminthe Echinococcus multiocularis serpin and the clade P serpins, suggesting that this serpin evolved approximately 1000 million years ago (MYA). Modeling of jellypin showed that it contained all the functional elements of an inhibitory serpin. In vitro biochemical analysis confirmed that jellypin was an inhibitor of the S1 clan SA family of serine proteinases. Analysis of the interactions between the human serine proteinases, chymotrypsin, cathepsin G, and elastase, showed that jellypin inhibited these enzymes in the classical serpin manner, forming a SDS stable enzyme/inhibitor complex. These data suggest that the coevolution of serpin structure and inhibitory function date back to at least early metazoan evolution, approximately 1000 MYA.
Zaher, Manal S.; Oke, Muse; Hamdan, Samir
The current model of the eukaryotic DNA replication fork includes three replicative DNA polymerases, polymerase α/primase complex (Pol α), polymerase δ (Pol δ), and polymerase ε (Pol ε). The primase synthesizes 8–12 nucleotide RNA primers that are extended by the DNA polymerization activity of Pol α into 30–35 nucleotide RNA-DNA primers. Replication factor C (RFC) opens the polymerase clamp-like processivity factor, proliferating cell nuclear antigen (PCNA), and loads it onto the primer-template. Pol δ utilizes PCNA to mediate highly processive DNA synthesis, while Pol ε has intrinsic high processivity that is modestly stimulated by PCNA. Pol ε replicates the leading strand and Pol δ replicates the lagging strand in a division of labor that is not strict. The three polymerases are comprised of multiple subunits and share unifying features in their large catalytic and B subunits. The remaining subunits are evolutionarily not related and perform diverse functions. The catalytic subunits are members of family B, which are distinguished by their larger sizes due to inserts in their N- and C-terminal regions. The sizes of these inserts vary among the three polymerases, and their functions remain largely unknown. Strikingly, the quaternary structures of Pol α, Pol δ, and Pol ε are arranged similarly. The catalytic subunits adopt a globular structure that is linked via its conserved C-terminal region to the B subunit. The remaining subunits are linked to the catalytic and B subunits in a highly flexible manner.
Zaher, Manal S.
The current model of the eukaryotic DNA replication fork includes three replicative DNA polymerases, polymerase α/primase complex (Pol α), polymerase δ (Pol δ), and polymerase ε (Pol ε). The primase synthesizes 8–12 nucleotide RNA primers that are extended by the DNA polymerization activity of Pol α into 30–35 nucleotide RNA-DNA primers. Replication factor C (RFC) opens the polymerase clamp-like processivity factor, proliferating cell nuclear antigen (PCNA), and loads it onto the primer-template. Pol δ utilizes PCNA to mediate highly processive DNA synthesis, while Pol ε has intrinsic high processivity that is modestly stimulated by PCNA. Pol ε replicates the leading strand and Pol δ replicates the lagging strand in a division of labor that is not strict. The three polymerases are comprised of multiple subunits and share unifying features in their large catalytic and B subunits. The remaining subunits are evolutionarily not related and perform diverse functions. The catalytic subunits are members of family B, which are distinguished by their larger sizes due to inserts in their N- and C-terminal regions. The sizes of these inserts vary among the three polymerases, and their functions remain largely unknown. Strikingly, the quaternary structures of Pol α, Pol δ, and Pol ε are arranged similarly. The catalytic subunits adopt a globular structure that is linked via its conserved C-terminal region to the B subunit. The remaining subunits are linked to the catalytic and B subunits in a highly flexible manner.
Lafond, Manuel; El-Mabrouk, Nadia
Background A variety of methods based on sequence similarity, reconciliation, synteny or functional characteristics, can be used to infer orthology and paralogy relations between genes of a given gene family G . But is a given set C of orthology/paralogy constraints possible, i.e., can they simultaneously co-exist in an evolutionary history for G ? While previous studies have focused on full sets of constraints, here we consider the general case where C does not necessarily involve a ...
Full Text Available BACKGROUND: The complexity of phosphoinositide signaling in higher eukaryotes is partly due to expansion of specific families and types of phosphoinositide kinases (PIKs that can generate all phosphoinositides via multiple routes. This is particularly evident in the PI3Ks and PIPKs, and it is considered an evolutionary trait associated with metazoan diversification. Yet, there are limited comprehensive studies on the PIK repertoire of free living unicellular organisms. METHODOLOGY/PRINCIPAL FINDINGS: We undertook a genome-wide analysis of putative PIK genes in two free living ciliated cells, Tetrahymena and Paramecium. The Tetrahymena thermophila and Paramecium tetraurelia genomes were probed with representative kinases from all families and types. Putative homologs were verified by EST, microarray and deep RNA sequencing database searches and further characterized for domain structure, catalytic efficiency, expression patterns and phylogenetic relationships. In total, we identified and characterized 22 genes in the Tetrahymena thermophila genome and 62 highly homologues genes in Paramecium tetraurelia suggesting a tight evolutionary conservation in the ciliate lineage. Comparison to the kinome of fungi reveals a significant expansion of PIK genes in ciliates. CONCLUSIONS/SIGNIFICANCE: Our study highlights four important aspects concerning ciliate and other unicellular PIKs. First, ciliate-specific expansion of PI4KIII-like genes. Second, presence of class I PI3Ks which, at least in Tetrahymena, are associated with a metazoan-type machinery for PIP3 signaling. Third, expansion of divergent PIPK enzymes such as the recently described type IV transmembrane PIPKs. Fourth, presence of possible type II PIPKs and presumably inactive PIKs (hence, pseudo-PIKs not previously described. Taken together, our results provide a solid framework for future investigation of the roles of PIKs in ciliates and indicate that novel functions and novel regulatory
Wolf Yuri I
Full Text Available Abstract Background: The presence of introns in protein-coding genes is a universal feature of eukaryotic genome organization, and the genes of multicellular eukaryotes, typically, contain multiple introns, a substantial fraction of which share position in distant taxa, such as plants and animals. Depending on the methods and data sets used, researchers have reached opposite conclusions on the causes of the high fraction of shared introns in orthologous genes from distant eukaryotes. Some studies conclude that shared intron positions reflect, almost entirely, a remarkable evolutionary conservation, whereas others attribute it to parallel gain of introns. To resolve these contradictions, it is crucial to analyze the evolution of introns by using a model that minimally relies on arbitrary assumptions. Results: We developed a probabilistic model of evolution that allows for variability of intron gain and loss rates over branches of the phylogenetic tree, individual genes, and individual sites. Applying this model to an extended set of conserved eukaryotic genes, we find that parallel gain, on average, accounts for only ~8% of the shared intron positions. However, the distribution of parallel gains over the phylogenetic tree of eukaryotes is highly non-uniform. There are, practically, no parallel gains in closely related lineages, whereas for distant lineages, such as animals and plants, parallel gains appear to contribute up to 20% of the shared intron positions. In accord with these findings, we estimated that ancestral introns have a high probability to be retained in extant genomes, and conversely, that a substantial fraction of extant introns have retained their positions since the early stages of eukaryotic evolution. In addition, the density of sites that are available for intron insertion is estimated to be, approximately, one in seven basepairs. Conclusion: We obtained robust estimates of the contribution of parallel gain to the observed
Analysis of genomic DNA of DcACS1, a 1-aminocyclopropane-1-carboxylate synthase gene, expressed in senescing petals of carnation (Dianthus caryophyllus) and its orthologous genes in D. superbus var. longicalycinus.
Harada, Taro; Murakoshi, Yuino; Torii, Yuka; Tanase, Koji; Onozaki, Takashi; Morita, Shigeto; Masumura, Takehiro; Satoh, Shigeru
Carnation (Dianthus caryophyllus) flowers exhibit climacteric ethylene production followed by petal wilting, a senescence symptom. DcACS1, which encodes 1-aminocyclopropane-1-carboxylate synthase (ACS), is a gene involved in this phenomenon. We determined the genomic DNA structure of DcACS1 by genomic PCR. In the genome of 'Light Pink Barbara', we found two distinct nucleotide sequences: one corresponding to the gene previously shown as DcACS1, designated here as DcACS1a, and the other novel one designated as DcACS1b. It was revealed that both DcACS1a and DcACS1b have five exons and four introns. These two genes had almost identical nucleotide sequences in exons, but not in some introns and 3'-UTR. Analysis of transcript accumulation revealed that DcACS1b is expressed in senescing petals as well as DcACS1a. Genomic PCR analysis of 32 carnation cultivars showed that most cultivars have only DcACS1a and some have both DcACS1a and DcACS1b. Moreover, we found two DcACS1 orthologous genes with different nucleotide sequences from D. superbus var. longicalycinus, and designated them as DsuACS1a and DsuACS1b. Petals of D. superbus var. longicalycinus produced ethylene in response to exogenous ethylene, accompanying accumulation of DsuACS1 transcripts. These data suggest that climacteric ethylene production in flowers was genetically established before the cultivation of carnation.
Bhat, Wajid Waheed; Razdan, Sumeer; Rana, Satiander; Dhar, Niha; Wani, Tariq Ahmad; Qazi, Parvaiz; Vishwakarma, Ram; Lattoo, Surrinder K
Picrorhiza kurrooa Royle ex Benth. is a highly reputed medicinal herb utilised in the preparation of a number of herbal drug formulations, principally due to the presence of novel monoterpene iridoid glycosides kenned as picrosides. Phenylalanine ammonia-lyase catalyses an important rate-limiting step in phenylpropanoid pathway and supplies precursors like cinnamic acid, vanillic acid, ferulic acid, etc., to a variety of secondary metabolites including picrosides. The imperilled status of P. kurrooa coupled with lack of information regarding biogenesis of picrosides necessitates deciphering the biosynthetic pathway for picrosides. In the present study, a PAL gene, designated PkPAL1 was isolated from P. kurrooa. The cDNA is 2312 bp in length, consisting of an ORF of 2142 bp encoding for a 713 amino acid protein having a predicted molecular weight of 77.66 kDa and an isoelectric point of pH 6.82. qRT-PCR analysis of various tissues of P. kurrooa showed that PkPAL1 transcript levels were highest in the leaves, consistent with picroside accumulation pattern. Using Genome walking, a 718 bp promoter region was also isolated resulting in identification of distinct cis-regulatory elements including TGA-element, TGACG-motif, CGTCA-motif, etc. qRT-PCR indicated up-regulation of PkPAL1 by methyl jasmonate, salicylic acid, 2,4-dicholorophenoxy acetic acid and UV-B elicitations that corroborated positively with the identified cis-elements within the promoter region. Moreover, altitude was found to have a positive effect on the PkPAL1 transcript levels, driving the expression of PkPAL1 abundantly. Based on docking analysis, we identified eight residues as potentially essential for substrate binding in PkPAL1. Copyright © 2014 Elsevier B.V. All rights reserved.
Ludolf, Fernanda; Bahia, Diana; Andrade, Luiza F; Cousin, Alexandre; Capron, Monique; Dissous, Colette; Pierce, Raymond J; Oliveira, Guilherme
A novel protein tyrosine kinase (PTK) was identified in Schistosoma mansoni and designated SmFes. SmFes exhibits the characteristic features of Fes/Fps/Fer (fes, feline sarcoma; fps, Fujinami poultry sarcoma; fer, fes related) PTKs, containing three coiled-coil regions, an SH2 (Src-homology-2) and a TK (tyrosine kinase catalytic) domain signature. SmFes is the first gene from the Fes/Fps/Fer family identified in S. mansoni, and is a single copy gene. Phylogenetic analyses revealed that SmFes is most closely related to its invertebrate orthologues. The assembly of the SmFes cDNA and genomic sequences indicated the presence of 18 introns in SmFes. Comparison of its genomic structure with those of human Fps/Fes and Drosophila Fps indicates that intron positions are conserved within the region encoding the kinase domain. Analysis of partial cDNA clones showed the presence of a 9 bp insertion at the 3' end of exon 10, producing two different cDNA populations, pointed as an alternative splicing event. In addition, an allele of SmFes containing a 15 bp insertion was observed in the genomic sequence. Quantitative RT-PCR indicated that the overall transcription level of SmFes is rather low in all parasite developmental stages. Moreover, SmFes mRNA levels decrease progressively after cercarial transformation, consistent with a role for the corresponding protein in the early stages of infection.
Sherman, F; McKnight, G; Stewart, J W
An analysis of mutants of the yeast Saccharomyces cerevisiae indicates that AUG is the sole codon capable of initiating translation of iso-1-cytochrome c. This result with yeast and the sequence results of numerous eukaryotic genes indicate that AUG is the only initiation codon in eukaryotes; in contrast, results with Escherichia colia and bacteriophages indicate that both AUG and GUG are initiation codons in prokaryotes. The difference can be explained by the lack of the t/sup 6/ A hypermodified nucleoside (N-(9-(..beta..-D-ribofuranosyl)purin-6-ylcarbamoyl)threonine) in prokaryotic initiator tRNA and its presence in eukaryotic initiator tRNA.
Koonin, Eugene V.; Dolja, Valerian V.; Krupovic, Mart
Viruses and other selfish genetic elements are dominant entities in the biosphere, with respect to both physical abundance and genetic diversity. Various selfish elements parasitize on all cellular life forms. The relative abundances of different classes of viruses are dramatically different between prokaryotes and eukaryotes. In prokaryotes, the great majority of viruses possess double-stranded (ds) DNA genomes, with a substantial minority of single-stranded (ss) DNA viruses and only limited presence of RNA viruses. In contrast, in eukaryotes, RNA viruses account for the majority of the virome diversity although ssDNA and dsDNA viruses are common as well. Phylogenomic analysis yields tangible clues for the origins of major classes of eukaryotic viruses and in particular their likely roots in prokaryotes. Specifically, the ancestral genome of positive-strand RNA viruses of eukaryotes might have been assembled de novo from genes derived from prokaryotic retroelements and bacteria although a primordial origin of this class of viruses cannot be ruled out. Different groups of double-stranded RNA viruses derive either from dsRNA bacteriophages or from positive-strand RNA viruses. The eukaryotic ssDNA viruses apparently evolved via a fusion of genes from prokaryotic rolling circle-replicating plasmids and positive-strand RNA viruses. Different families of eukaryotic dsDNA viruses appear to have originated from specific groups of bacteriophages on at least two independent occasions. Polintons, the largest known eukaryotic transposons, predicted to also form virus particles, most likely, were the evolutionary intermediates between bacterial tectiviruses and several groups of eukaryotic dsDNA viruses including the proposed order “Megavirales” that unites diverse families of large and giant viruses. Strikingly, evolution of all classes of eukaryotic viruses appears to have involved fusion between structural and replicative gene modules derived from different sources
Koonin, Eugene V., E-mail: firstname.lastname@example.org [National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894 (United States); Dolja, Valerian V., E-mail: email@example.com [Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331 (United States); Krupovic, Mart, E-mail: firstname.lastname@example.org [Institut Pasteur, Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Department of Microbiology, Paris 75015 (France)
Viruses and other selfish genetic elements are dominant entities in the biosphere, with respect to both physical abundance and genetic diversity. Various selfish elements parasitize on all cellular life forms. The relative abundances of different classes of viruses are dramatically different between prokaryotes and eukaryotes. In prokaryotes, the great majority of viruses possess double-stranded (ds) DNA genomes, with a substantial minority of single-stranded (ss) DNA viruses and only limited presence of RNA viruses. In contrast, in eukaryotes, RNA viruses account for the majority of the virome diversity although ssDNA and dsDNA viruses are common as well. Phylogenomic analysis yields tangible clues for the origins of major classes of eukaryotic viruses and in particular their likely roots in prokaryotes. Specifically, the ancestral genome of positive-strand RNA viruses of eukaryotes might have been assembled de novo from genes derived from prokaryotic retroelements and bacteria although a primordial origin of this class of viruses cannot be ruled out. Different groups of double-stranded RNA viruses derive either from dsRNA bacteriophages or from positive-strand RNA viruses. The eukaryotic ssDNA viruses apparently evolved via a fusion of genes from prokaryotic rolling circle-replicating plasmids and positive-strand RNA viruses. Different families of eukaryotic dsDNA viruses appear to have originated from specific groups of bacteriophages on at least two independent occasions. Polintons, the largest known eukaryotic transposons, predicted to also form virus particles, most likely, were the evolutionary intermediates between bacterial tectiviruses and several groups of eukaryotic dsDNA viruses including the proposed order “Megavirales” that unites diverse families of large and giant viruses. Strikingly, evolution of all classes of eukaryotic viruses appears to have involved fusion between structural and replicative gene modules derived from different sources
Wolf Yuri I
Full Text Available Abstract Background An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs. Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes. Results New Archaeal Clusters of Orthologous Genes (arCOGs were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover ~88% of the genes in a genome compared to a ~76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; ~40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genome consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively. It is inferred that LACA was a chemoautotrophic hyperthermophile
Hall Ross S
Full Text Available Abstract Background New drug targets are urgently needed for parasites of socio-economic importance. Genes that are essential for parasite survival are highly desirable targets, but information on these genes is lacking, as gene knockouts or knockdowns are difficult to perform in many species of parasites. We examined the applicability of large-scale essentiality information from four model eukaryotes, Caenorhabditis elegans, Drosophila melanogaster, Mus musculus and Saccharomyces cerevisiae, to discover essential genes in each of their genomes. Parasite genes that lack orthologues in their host are desirable as selective targets, so we also examined prediction of essential genes within this subset. Results Cross-species analyses showed that the evolutionary conservation of genes and the presence of essential orthologues are each strong predictors of essentiality in eukaryotes. Absence of paralogues was also found to be a general predictor of increased relative essentiality. By combining several orthology and essentiality criteria one can select gene sets with up to a five-fold enrichment in essential genes compared with a random selection. We show how quantitative application of such criteria can be used to predict a ranked list of potential drug targets from Ancylostoma caninum and Haemonchus contortus - two blood-feeding strongylid nematodes, for which there are presently limited sequence data but no functional genomic tools. Conclusions The present study demonstrates the utility of using orthology information from multiple, diverse eukaryotes to predict essential genes. The data also emphasize the challenge of identifying essential genes among those in a parasite that are absent from its host.
Full Text Available Abstract Background The Anaphase Promoting Complex or Cyclosome (APC/C is the largest member of the ubiquitin ligase [E3] family. It plays a crucial role in the control of the cell cycle and cell proliferation by mediating the proteolysis of key components by the proteasome. APC/C is made of a dozen subunits that assemble into a large complex of ~1.5 MDa, which interacts with various cofactors and targets. Results Using comparative genomic and phylogenetic approaches, we showed that 24 out of 37 known APC/C subunits, adaptors/co-activators and main targets, were already present in the Last Eukaryotic Common Ancestor (LECA and were well conserved to a few exceptions in all present-day eukaryotic lineages. The phylogenetic analysis of the 24 components inferred to be present in LECA showed that they contain a reliable phylogenetic signal to reconstruct the phylogeny of the domain Eucarya. Conclusions Taken together our analyses indicated that LECA had a complex and highly controlled modern-like cell cycle. Moreover, we showed that, despite what is generally assumed, proteins involved in housekeeping cellular functions may be a good complement to informational genes to study the phylogeny of eukaryotes.
Balaji, Jayashree; Crouch, Jonathan H; Petite, Prasad V N S; Hoisington, David A
A minimal requirement to initiate a comparative genomics study on plant responses to abiotic stresses is a dataset of orthologous sequences. The availability of a large amount of sequence information, including those derived from stress cDNA libraries allow for the identification of stress related genes and orthologs associated with the stress response. Orthologous sequences serve as tools to explore genes and their relationships across species. For this purpose, ESTs from stress cDNA libraries across 16 crop species including 6 important cereal crops and 10 dicots were systematically collated and subjected to bioinformatics analysis such as clustering, grouping of tentative orthologous sets, identification of protein motifs/patterns in the predicted protein sequence, and annotation with stress conditions, tissue/library source and putative function. All data are available to the scientific community at http://intranet.icrisat.org/gt1/tog/homepage.htm. We believe that the availability of annotated plant abiotic stress ortholog sets will be a valuable resource for researchers studying the biology of environmental stresses in plant systems, molecular evolution and genomics.
Satori, Ch. P.; Henderson, M. M.; Krautkramer, E. A.; Košťál, Vratislav; Distefano, M. M.; Arriaga, E. A.
Roč. 113, č. 4 (2013), s. 2733-2811 ISSN 0009-2665 R&D Projects: GA ČR(CZ) GBP206/12/G014 Grant - others:GA AV ČR(CZ) M200311201 Institutional support: RVO:68081715 Keywords : green fluorescent protein * atomic-force microscopy * capillary electrophoretic analysis Subject RIV: CB - Analytical Chemistry , Separation Impact factor: 45.661, year: 2013
Satori, Ch. P.; Henderson, M. M.; Krautkramer, E. A.; Košťál, Vratislav; Distefano, M. M.; Arriaga, E. A.
Roč. 113, č. 4 (2013), s. 2733-2811 ISSN 0009-2665 R&D Projects: GA ČR(CZ) GBP206/12/G014 Grant - others:GA AV ČR(CZ) M200311201 Institutional support: RVO:68081715 Keywords : green fluorescent protein * atomic-force microscopy * capillary electrophoretic analysis Subject RIV: CB - Analytical Chemistry, Separation Impact factor: 45.661, year: 2013
Ciobanu, Doina; Clum, Alicia; Singh, Vasanth; Salamov, Asaf; Han, James; Copeland, Alex; Grigoriev, Igor; James, Timothy; Singer, Steven; Woyke, Tanja; Malmstrom, Rex; Cheng, Jan-Fang
Despite their small size, unicellular eukaryotes have complex genomes with a high degree of plasticity that allow them to adapt quickly to environmental changes. Unicellular eukaryotes live with prokaryotes and higher eukaryotes, frequently in symbiotic or parasitic niches. To this day their contribution to the dynamics of the environmental communities remains to be understood. Unfortunately, the vast majority of eukaryotic microorganisms are either uncultured or unculturable, making genome sequencing impossible using traditional approaches. We have developed an approach to isolate unicellular eukaryotes of interest from environmental samples, and to sequence and analyze their genomes and transcriptomes. We have tested our methods with six species: an uncharacterized protist from cellulose-enriched compost identified as Platyophrya, a close relative of P. vorax; the fungus Metschnikowia bicuspidate, a parasite of water flea Daphnia; the mycoparasitic fungi Piptocephalis cylindrospora, a parasite of Cokeromyces and Mucor; Caulochytrium protosteloides, a parasite of Sordaria; Rozella allomycis, a parasite of the water mold Allomyces; and the microalgae Chlamydomonas reinhardtii. Here, we present the four components of our approach: pre-sequencing methods, sequence analysis for single cell genome assembly, sequence analysis of single cell transcriptomes, and genome annotation. This technology has the potential to uncover the complexity of single cell eukaryotes and their role in the environmental samples.
Roger Andrew J
Full Text Available Abstract Background Lateral gene transfer can introduce genes with novel functions into genomes or replace genes with functionally similar orthologs or paralogs. Here we present a study of the occurrence of the latter gene replacement phenomenon in the four gene families encoding different classes of glutamate dehydrogenase (GDH, to evaluate and compare the patterns and rates of lateral gene transfer (LGT in prokaryotes and eukaryotes. Results We extend the taxon sampling of gdh genes with nine new eukaryotic sequences and examine the phylogenetic distribution pattern of the various GDH classes in combination with maximum likelihood phylogenetic analyses. The distribution pattern analyses indicate that LGT has played a significant role in the evolution of the four gdh gene families. Indeed, a number of gene transfer events are identified by phylogenetic analyses, including numerous prokaryotic intra-domain transfers, some prokaryotic inter-domain transfers and several inter-domain transfers between prokaryotes and microbial eukaryotes (protists. Conclusion LGT has apparently affected eukaryotes and prokaryotes to a similar extent within the gdh gene families. In the absence of indications that the evolution of the gdh gene families is radically different from other families, these results suggest that gene transfer might be an important evolutionary mechanism in microbial eukaryote genome evolution.
Lechner, Marcus; Hernandez-Rosales, Maribel; Doerr, Daniel; Wieseke, Nicolas; Thévenin, Annelyse; Stoye, Jens; Hartmann, Roland K.; Prohaska, Sonja J.; Stadler, Peter F.
The elucidation of orthology relationships is an important step both in gene function prediction as well as towards understanding patterns of sequence evolution. Orthology assignments are usually derived directly from sequence similarities for large data because more exact approaches exhibit too high computational costs. Here we present PoFF, an extension for the standalone tool Proteinortho, which enhances orthology detection by combining clustering, sequence similarity, and synteny. In the ...
James, J.F.; Lammel, C.J.; Draper, D.L.; Brown, D.A.; Sweet, R.L.; Brooks, G.F.
The attachment of Neisseria gonorrhoeae to eukaryotic cells grown in tissue culture was analyzed by use of light and electron microscopy and by labeling of the bacteria with [ 3 H]- and [ 14 C]adenine. Isogenic piliated and nonpiliated N. gonorrhoeae from opaque and transparent colonies were studied. The results of light microscopy studies showed that the gonococci attached to cells of human origin, including Flow 2000, HeLa 229, and HEp 2. Studies using radiolabeled gonococci gave comparable results. Piliated N. gonorrhoeae usually attached in larger numbers than nonpiliated organisms, and those from opaque colonies attached more often than isogenic variants from transparent colonies. Day-to-day variation in rate of attachment was observed. Scanning electron microscopy studies showed the gonococcal attachment to be specific for microvilli of the host cells. It is concluded that more N. gonorrhoeae from opaque colonies, as compared with isogenic variants from transparent colonies, attach to eukaryotic cells grown in tissue culture
Patrícia M. Silva
Full Text Available Over the last years, antimicrobial peptides (AMPs have been the focus of intense research towards the finding of a viable alternative to current antifungal drugs. Defensins are one of the major families of AMPs and the most represented among all eukaryotic groups, providing an important first line of host defense against pathogenic microorganisms. Several of these cysteine-stabilized peptides present a relevant effect against fungi. Defensins are the AMPs with the broader distribution across all eukaryotic kingdoms, namely, Fungi, Plantæ and Animalia, and were recently shown to have an ancestor in a bacterial organism. As a part of the host defense, defensins act as an important vehicle of information between innate and adaptive immune system and have a role in immunomodulation. This multidimensionality represents a powerful host shield, hard for microorganisms to overcome using single approach resistance strategies. Pathogenic fungi resistance to conventional antimycotic drugs is becoming a major problem. Defensins, as other AMPs, have shown to be an effective alternative to the current antimycotic therapies, demonstrating potential as novel therapeutic agents or drug leads. In this review, we summarize the current knowledge on some eukaryotic defensins with antifungal action. An overview of the main targets in the fungal cell and the mechanism of action of these AMPs (namely, the selectivity for some fungal membrane components are presented. Additionally, recent works on antifungal defensins structure, activity and citotoxicity are also reviewed.
Marayati, Bahjat Fadi; Hoskins, Victoria; Boger, Robert W; Tucker, James F; Fishman, Emily S; Bray, Andrew S; Zhang, Ke
Meiosis is a highly regulated process by which genetic information is transmitted through sexual reproduction. It encompasses unique mechanisms that do not occur in vegetative cells, producing a distinct, well-regulated meiotic transcriptome. During vegetative growth, many meiotic genes are constitutively transcribed, but most of the resulting mRNAs are rapidly eliminated by the Mmi1-MTREC (Mtl1-Red1 core) complex. While Mmi1-MTREC targets premature meiotic RNAs for degradation by the nuclear 3'-5' exoribonuclease exosome during mitotic growth, its role in meiotic gene expression during meiosis is not known. Here, we report that Red5, an essential MTREC component, interacts with pFal1, an ortholog of eukaryotic translation initiation factor eIF4aIII in the fission yeast Schizosaccharomyces pombe In mammals, together with MAGO (Mnh1), Rnps1, and Y14, elF4AIII (pFal1) forms the core of the exon junction complex (EJC), which is essential for transcriptional surveillance and localization of mature mRNAs. In fission yeast, two EJC orthologs, pFal1 and Mnh1, are functionally connected with MTREC, specifically in the process of meiotic gene expression during meiosis. Although pFal1 interacts with Mnh1, Y14, and Rnps1, its association with Mnh1 is not disrupted upon loss of Y14 or Rnps1. Mutations of Red1, Red5, pFal1, or Mnh1 produce severe meiotic defects; the abundance of meiotic transcripts during meiosis decreases; and mRNA maturation processes such as splicing are impaired. Since studying meiosis in mammalian germline cells is difficult, our findings in fission yeast may help to define the general mechanisms involved in accurate meiotic gene expression in higher eukaryotes. © 2016 Marayati et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Full Text Available BACKGROUND: Accurate identification of orthologs is crucial for evolutionary studies and for functional annotation. Several algorithms have been developed for ortholog delineation, but so far, manually curated genome-scale biological databases of orthologous genes for algorithm evaluation have been lacking. We evaluated four popular ortholog prediction algorithms (MultiParanoid; and OrthoMCL; RBH: Reciprocal Best Hit; RSD: Reciprocal Smallest Distance; the last two extended into clustering algorithms cRBH and cRSD, respectively, so that they can predict orthologs across multiple taxa against a set of 2,723 groups of high-quality curated orthologs from 6 Saccharomycete yeasts in the Yeast Gene Order Browser. RESULTS: Examination of sensitivity [TP/(TP+FN], specificity [TN/(TN+FP], and accuracy [(TP+TN/(TP+TN+FP+FN] across a broad parameter range showed that cRBH was the most accurate and specific algorithm, whereas OrthoMCL was the most sensitive. Evaluation of the algorithms across a varying number of species showed that cRBH had the highest accuracy and lowest false discovery rate [FP/(FP+TP], followed by cRSD. Of the six species in our set, three descended from an ancestor that underwent whole genome duplication. Subsequent differential duplicate loss events in the three descendants resulted in distinct classes of gene loss patterns, including cases where the genes retained in the three descendants are paralogs, constituting 'traps' for ortholog prediction algorithms. We found that the false discovery rate of all algorithms dramatically increased in these traps. CONCLUSIONS: These results suggest that simple algorithms, like cRBH, may be better ortholog predictors than more complex ones (e.g., OrthoMCL and MultiParanoid for evolutionary and functional genomics studies where the objective is the accurate inference of single-copy orthologs (e.g., molecular phylogenetics, but that all algorithms fail to accurately predict orthologs when paralogy
van Noort Vera
Full Text Available Abstract Background Proteomes of thermophilic prokaryotes have been instrumental in structural biology and successfully exploited in biotechnology, however many proteins required for eukaryotic cell function are absent from bacteria or archaea. With Chaetomium thermophilum, Thielavia terrestris and Thielavia heterothallica three genome sequences of thermophilic eukaryotes have been published. Results Studying the genomes and proteomes of these thermophilic fungi, we found common strategies of thermal adaptation across the different kingdoms of Life, including amino acid biases and a reduced genome size. A phylogenetics-guided comparison of thermophilic proteomes with those of other, mesophilic Sordariomycetes revealed consistent amino acid substitutions associated to thermophily that were also present in an independent lineage of thermophilic fungi. The most consistent pattern is the substitution of lysine by arginine, which we could find in almost all lineages but has not been extensively used in protein stability engineering. By exploiting mutational paths towards the thermophiles, we could predict particular amino acid residues in individual proteins that contribute to thermostability and validated some of them experimentally. By determining the three-dimensional structure of an exemplar protein from C. thermophilum (Arx1, we could also characterise the molecular consequences of some of these mutations. Conclusions The comparative analysis of these three genomes not only enhances our understanding of the evolution of thermophily, but also provides new ways to engineer protein stability.
IONS: Identification of Orthologs by Neighborhood and Similarity-an Automated Method to Identify Orthologs in Chromosomal Regions of Common Evolutionary Ancestry and its Application to Hemiascomycetous Yeasts.
Seret, Marie-Line; Baret, Philippe V
Comparative sequence analysis is widely used to infer gene function and study genome evolution and requires proper ortholog identification across different genomes. We have developed a program for the Identification of Orthologs in one-to-one relationship by Neighborhood and Similarity (IONS) between closely related species. The algorithm combines two levels of evidence to determine co-ancestrality at the genome scale: sequence similarity and shared neighborhood. The method was initially designed to provide anchor points for syntenic blocks within the Génolevures project concerning nine hemiascomycetous yeasts (about 50,000 genes) and is applicable to different input databases. Comparison based on use of a Rand index shows that the results are highly consistent with the pillars of the Yeast Gene Order Browser, a manually curated database. Compared with SYNERGY, another algorithm reporting homology relationships, our method's main advantages are its automation and the absence of dataset-dependent parameters, facilitating consistent integration of newly released genomes.
Hindle, Matthew M; Martin, Sarah F; Noordally, Zeenat B; van Ooijen, Gerben; Barrios-Llerena, Martin E; Simpson, T Ian; Le Bihan, Thierry; Millar, Andrew J
The current knowledge of eukaryote signalling originates from phenotypically diverse organisms. There is a pressing need to identify conserved signalling components among eukaryotes, which will lead to the transfer of knowledge across kingdoms. Two useful properties of a eukaryote model for signalling are (1) reduced signalling complexity, and (2) conservation of signalling components. The alga Ostreococcus tauri is described as the smallest free-living eukaryote. With less than 8,000 genes, it represents a highly constrained genomic palette. Our survey revealed 133 protein kinases and 34 protein phosphatases (1.7% and 0.4% of the proteome). We conducted phosphoproteomic experiments and constructed domain structures and phylogenies for the catalytic protein-kinases. For each of the major kinases families we review the completeness and divergence of O. tauri representatives in comparison to the well-studied kinomes of the laboratory models Arabidopsis thaliana and Saccharomyces cerevisiae, and of Homo sapiens. Many kinase clades in O. tauri were reduced to a single member, in preference to the loss of family diversity, whereas TKL and ABC1 clades were expanded. We also identified kinases that have been lost in A. thaliana but retained in O. tauri. For three, contrasting eukaryotic pathways - TOR, MAPK, and the circadian clock - we established the subset of conserved components and demonstrate conserved sites of substrate phosphorylation and kinase motifs. We conclude that O. tauri satisfies our two central requirements. Several of its kinases are more closely related to H. sapiens orthologs than S. cerevisiae is to H. sapiens. The greatly reduced kinome of O. tauri is therefore a suitable model for signalling in free-living eukaryotes.
Williams, Tom A.; Embley, T. Martin
Current hypotheses about the history of cellular life are mainly based on analyses of cultivated organisms, but these represent only a small fraction of extant biodiversity. The sequencing of new environmental lineages therefore provides an opportunity to test, revise, or reject existing ideas about the tree of life and the origin of eukaryotes. According to the textbook three domains hypothesis, the eukaryotes emerge as the sister group to a monophyletic Archaea. However, recent analyses incorporating better phylogenetic models and an improved sampling of the archaeal domain have generally supported the competing eocyte hypothesis, in which core genes of eukaryotic cells originated from within the Archaea, with important implications for eukaryogenesis. Given this trend, it was surprising that a recent analysis incorporating new genomes from uncultivated Archaea recovered a strongly supported three domains tree. Here, we show that this result was due in part to the use of a poorly fitting phylogenetic model and also to the inclusion by an automated pipeline of genes of putative bacterial origin rather than nucleocytosolic versions for some of the eukaryotes analyzed. When these issues were resolved, analyses including the new archaeal lineages placed core eukaryotic genes within the Archaea. These results are consistent with a number of recent studies in which improved archaeal sampling and better phylogenetic models agree in supporting the eocyte tree over the three domains hypothesis. PMID:24532674
Full Text Available BACKGROUND: The replication of DNA in Archaea and eukaryotes requires several ancillary complexes, including proliferating cell nuclear antigen (PCNA, replication factor C (RFC, and the minichromosome maintenance (MCM complex. Bacterial DNA replication utilizes comparable proteins, but these are distantly related phylogenetically to their archaeal and eukaryotic counterparts at best. METHODOLOGY/PRINCIPAL FINDINGS: While the structures of each of the complexes do not differ significantly between the archaeal and eukaryotic versions thereof, the evolutionary dynamic in the two cases does. The number of subunits in each complex is constant across all taxa. However, they vary subtly with regard to composition. In some taxa the subunits are all identical in sequence, while in others some are homologous rather than identical. In the case of eukaryotes, there is no phylogenetic variation in the makeup of each complex-all appear to derive from a common eukaryotic ancestor. This is not the case in Archaea, where the relationship between the subunits within each complex varies taxon-to-taxon. We have performed a detailed phylogenetic analysis of these relationships in order to better understand the gene duplications and divergences that gave rise to the homologous subunits in Archaea. CONCLUSION/SIGNIFICANCE: This domain level difference in evolution suggests that different forces have driven the evolution of DNA replication proteins in each of these two domains. In addition, the phylogenies of all three gene families support the distinctiveness of the proposed archaeal phylum Thaumarchaeota.
Pedersen, Anders Gorm; Baldi, Pierre; Brunak, Søren
In this paper we utilize hidden Markov models (HMMs) and information theory to analyze prokaryotic and eukaryotic promoters. We perform this analysis with special emphasis on the fact that promoters are divided into a number of different classes, depending on which polymerase-associated factors...... that bind to them. We find that HMMs trained on such subclasses of Escherichia coli promoters (specifically, the so-called sigma-70 and sigma-54 classes) give an excellent classification of unknown promoters with respect to sigma-class. HMMs trained on eukaryotic sequences from human genes also model nicely...
Pedersen, Anders Gorm; Baldi, P.; Chauvin, Y.
In this paper we utilize hidden Markov models (HMMs) and information theory to analyze prokaryotic and eukaryotic promoters. We perform this analysis with special emphasis on the fact that promoters are divided into a number of different classes, depending on which polymerase-associated factors...... that bind to them. We find that HMMs trained on such subclasses of Escherichia coli promoters (specifically, the so-called sigma 70 and sigma 54 classes) give an excellent classification of unknown promoters with respect to sigma-class. HMMs trained on eukaryotic sequences from human genes also model nicely...
Full Text Available Abstract Background Signal peptide is one of the most important motifs involved in protein trafficking and it ultimately influences protein function. Considering the expected functional conservation among orthologs it was hypothesized that divergence in signal peptides within orthologous groups is mainly due to N-terminal protein sequence misannotation. Thus, discrepancies in signal peptide prediction of orthologous proteins were used to identify misannotated proteins in five Plasmodium species. Methods Signal peptide (SignalP and orthology (OrthoMCL were combined in an innovative strategy to identify orthologous groups showing discrepancies in signal peptide prediction among their protein members (Mixed groups. In a comparative analysis, multiple alignments for each of these groups and gene models were visually inspected in search of misannotated proteins and, whenever possible, alternative gene models were proposed. Thresholds for signal peptide prediction parameters were also modified to reduce their impact as a possible source of discrepancy among orthologs. Validation of new gene models was based on RT-PCR (few examples or on experimental evidence already published (ApiLoc. Results The rate of misannotated proteins was significantly higher in Mixed groups than in Positive or Negative groups, corroborating the proposed hypothesis. A total of 478 proteins were reannotated and change of signal peptide prediction from negative to positive was the most common. Reannotations triggered the conversion of almost 50% of all Mixed groups, which were further reduced by optimization of signal peptide prediction parameters. Conclusions The methodological novelty proposed here combining orthology and signal peptide prediction proved to be an effective strategy for the identification of proteins showing wrongly N-terminal annotated sequences, and it might have an important impact in the available data for genome-wide searching of potential vaccine and drug
Current analysis shows that the last eukaryotic common ancestor (LECA) was capable of full meiotic sex. The original eukaryotic life cycle can probably be described as clonal, interrupted by episodic sex triggered by external or internal stressors. The cycle could have started in a highly flexible
Gains and losses shape the gene complement of animal lineages and are a fundamental aspect of genomic evolution. Acquiring a comprehensive view of the evolution of gene repertoires is limited by the intrinsic limitations of common sequence similarity searches and available databases. Thus, a subset of the gene complement of an organism consists of hidden orthologs, i.e., those with no apparent homology to sequenced animal lineages—mistakenly considered new genes—but actually representing rapidly evolving orthologs or undetected paralogs. Here, we describe Leapfrog, a simple automated BLAST pipeline that leverages increased taxon sampling to overcome long evolutionary distances and identify putative hidden orthologs in large transcriptomic databases by transitive homology. As a case study, we used 35 transcriptomes of 29 flatworm lineages to recover 3427 putative hidden orthologs, some unidentified by OrthoFinder and HaMStR, two common orthogroup inference algorithms. Unexpectedly, we do not observe a correlation between the number of putative hidden orthologs in a lineage and its “average” evolutionary rate. Hidden orthologs do not show unusual sequence composition biases that might account for systematic errors in sequence similarity searches. Instead, gene duplication with divergence of one paralog and weak positive selection appear to underlie hidden orthology in Platyhelminthes. By using Leapfrog, we identify key centrosome-related genes and homeodomain classes previously reported as absent in free-living flatworms, e.g., planarians. Altogether, our findings demonstrate that hidden orthologs comprise a significant proportion of the gene repertoire in flatworms, qualifying the impact of gene losses and gains in gene complement evolution. PMID:28400424
Zhabinskaya, Dina; Benham, Craig; Madden, Sally
We analyze the competitive transitions to alternate secondary DNA structures in a negatively supercoiled DNA molecule of kilobase length and specified base sequence. We use statistical mechanics to calculate the competition among all regions within the sequence that are susceptible to transitions to alternate structures. We use an approximate numerical method since the calculation of an exact partition function is numerically cumbersome for DNA molecules of lengths longer than hundreds of base pairs. This method yields accurate results in reasonable computational times. We implement algorithms that calculate the competition between transitions to denatured states and to Z-form DNA. We analyze these transitions near the transcription start sites (TSS) of a set of eukaryotic genes. We find an enhancement of Z-forming regions upstream of the TSS and a depletion of denatured regions around the start sites. We confirm that these finding are statistically significant by comparing our results to a set of randomized genes with preserved base composition at each position relative to the gene start sites. When we study the correlation of these transitions in orthologous mouse and human genes we find a clear evolutionary conservation of both types of transitions around the TSS.
Woodhams, Michael D; Stadler, Peter F; Penny, David; Collins, Lesley J
Within eukaryotes there is a complex cascade of RNA-based macromolecules that process other RNA molecules, especially mRNA, tRNA and rRNA. An example is RNase MRP processing ribosomal RNA (rRNA) in ribosome biogenesis. One hypothesis is that this complexity was present early in eukaryotic evolution; an alternative is that an initial simpler network later gained complexity by gene duplication in lineages that led to animals, fungi and plants. Recently there has been a rapid increase in support for the complexity-early theory because the vast majority of these RNA-processing reactions are found throughout eukaryotes, and thus were likely to be present in the last common ancestor of living eukaryotes, herein called the Eukaryotic Ancestor. We present an overview of the RNA processing cascade in the Eukaryotic Ancestor and investigate in particular, RNase MRP which was previously thought to have evolved later in eukaryotes due to its apparent limited distribution in fungi and animals and plants. Recent publications, as well as our own genomic searches, find previously unknown RNase MRP RNAs, indicating that RNase MRP has a wide distribution in eukaryotes. Combining secondary structure and promoter region analysis of RNAs for RNase MRP, along with analysis of the target substrate (rRNA), allows us to discuss this distribution in the light of eukaryotic evolution. We conclude that RNase MRP can now be placed in the RNA-processing cascade of the Eukaryotic Ancestor, highlighting the complexity of RNA-processing in early eukaryotes. Promoter analyses of MRP-RNA suggest that regulation of the critical processes of rRNA cleavage can vary, showing that even these key cellular processes (for which we expect high conservation) show some species-specific variability. We present our consensus MRP-RNA secondary structure as a useful model for further searches.
particular functions and identify species that contain these proteins. For example, if users select two species, Homo sapiens and Mus musculus, and...Kerr AR, McCormack TJ, Riley M: Evolution by leaps: gene duplication in bacteria. Biol Direct 2009, 4:46. 12. Remm M, Storm CE, Sonnhammer EL
Doolittle, W Ford
Systematics balances uneasily between realism and nominalism, uncommitted as to whether biological taxa are discoveries or inventions. If the former, they might be taken as natural kinds. I briefly review some philosophers' concepts of natural kinds and then argue that several of these apply well enough to "eukaryote." Although there are some sticky issues around genomic chimerism and when eukaryotes first appeared, if we allow for degrees in the naturalness of kinds, existing eukaryotes rank highly, higher than prokaryotes. Most biologists feel this intuitively: All I attempt to do here is provide some conceptual justification. Copyright © 2014 Cold Spring Harbor Laboratory Press; all rights reserved.
Paul D Thomas
Full Text Available A recent paper (Nehrt et al., PLoS Comput. Biol. 7:e1002073, 2011 has proposed a metric for the "functional similarity" between two genes that uses only the Gene Ontology (GO annotations directly derived from published experimental results. Applying this metric, the authors concluded that paralogous genes within the mouse genome or the human genome are more functionally similar on average than orthologous genes between these genomes, an unexpected result with broad implications if true. We suggest, based on both theoretical and empirical considerations, that this proposed metric should not be interpreted as a functional similarity, and therefore cannot be used to support any conclusions about the "ortholog conjecture" (or, more properly, the "ortholog functional conservation hypothesis". First, we reexamine the case studies presented by Nehrt et al. as examples of orthologs with divergent functions, and come to a very different conclusion: they actually exemplify how GO annotations for orthologous genes provide complementary information about conserved biological functions. We then show that there is a global ascertainment bias in the experiment-based GO annotations for human and mouse genes: particular types of experiments tend to be performed in different model organisms. We conclude that the reported statistical differences in annotations between pairs of orthologous genes do not reflect differences in biological function, but rather complementarity in experimental approaches. Our results underscore two general considerations for researchers proposing novel types of analysis based on the GO: 1 that GO annotations are often incomplete, potentially in a biased manner, and subject to an "open world assumption" (absence of an annotation does not imply absence of a function, and 2 that conclusions drawn from a novel, large-scale GO analysis should whenever possible be supported by careful, in-depth examination of examples, to help ensure the
Mia M. Bengtsson
Full Text Available Eelgrass (Zostera marina is a marine foundation species essential for coastal ecosystem services around the northern hemisphere. Like all macroscopic organisms, it possesses a microbiome (here defined as an associated prokaryotic community which may play critical roles in modulating the interaction of eelgrass with its environment. For example, its leaf surface microbiome could inhibit or attract eukaryotic epibionts which may overgrow the eelgrass leading to reduced primary productivity and subsequent eelgrass meadow decline. We used amplicon sequencing of the 16S and 18S rRNA genes of prokaryotes and eukaryotes to assess the leaf surface microbiome (prokaryotes as well as eukaryotic epibionts in- and outside lagoons on the German Baltic Sea coast. Prokaryote microbiomes varied substantially both between sites inside lagoons and between open coastal and lagoon sites. Water depth, leaf area and biofilm chlorophyll a concentration explained a large amount of variation in both prokaryotic and eukaryotic community composition. The prokaryotic microbiome and eukaryotic epibiont communities were highly correlated, and network analysis revealed disproportionate co-occurrence between a limited number of eukaryotic taxa and several bacterial taxa. This suggests that eelgrass leaf surfaces are home to a mosaic of microbiomes of several epibiotic eukaryotes, in addition to the microbiome of the eelgrass itself. Our findings thereby underline that eukaryotic diversity should be taken into account in order to explain prokaryotic microbiome assembly and dynamics in aquatic environments.
Hervé, Cécile; Siméon, Amandine; Jam, Murielle
Arabinogalactan proteins (AGPs) are highly glycosylated, hydroxyproline-rich proteins found at the cell surface of plants, where they play key roles in developmental processes. Brown algae are marine, multicellular, photosynthetic eukaryotes. They belong to the phylum Stramenopiles, which...
Zhang, Guojie; Wang, Hongsheng; Shi, Junjie
melanogaster, Anopheles gambiae, Bombyx mori, Tribolium castaneum, and Apis mellifera were compared to the complete genomes of three non-insect eukaryotes (opisthokonts) Homo sapiens, Caenorhabditis elegans and Saccharomyces cerevisiae. This operation yielded 154 groups of orthologous proteins in Drosophila...
Chaudhuri, Rima; Sadrieh, Arash; Hoffman, Nolan J; Parker, Benjamin L; Humphrey, Sean J; Stöckli, Jacqueline; Hill, Adam P; James, David E; Yang, Jean Yee Hwa
Most biological processes are influenced by protein post-translational modifications (PTMs). Identifying novel PTM sites in different organisms, including humans and model organisms, has expedited our understanding of key signal transduction mechanisms. However, with increasing availability of deep, quantitative datasets in diverse species, there is a growing need for tools to facilitate cross-species comparison of PTM data. This is particularly important because functionally important modification sites are more likely to be evolutionarily conserved; yet cross-species comparison of PTMs is difficult since they often lie in structurally disordered protein domains. Current tools that address this can only map known PTMs between species based on known orthologous phosphosites, and do not enable the cross-species mapping of newly identified modification sites. Here, we addressed this by developing a web-based software tool, PhosphOrtholog ( www.phosphortholog.com ) that accurately maps protein modification sites between different species. This facilitates the comparison of datasets derived from multiple species, and should be a valuable tool for the proteomics community. Here we describe PhosphOrtholog, a web-based application for mapping known and novel orthologous PTM sites from experimental data obtained from different species. PhosphOrtholog is the only generic and automated tool that enables cross-species comparison of large-scale PTM datasets without relying on existing PTM databases. This is achieved through pairwise sequence alignment of orthologous protein residues. To demonstrate its utility we apply it to two sets of human and rat muscle phosphoproteomes generated following insulin and exercise stimulation, respectively, and one publicly available mouse phosphoproteome following cellular stress revealing high mapping and coverage efficiency. Although coverage statistics are dataset dependent, PhosphOrtholog increased the number of cross-species mapped sites
Lechner, Marcus; Hernandez-Rosales, Maribel; Doerr, Daniel; Wieseke, Nicolas; Thévenin, Annelyse; Stoye, Jens; Hartmann, Roland K; Prohaska, Sonja J; Stadler, Peter F
The elucidation of orthology relationships is an important step both in gene function prediction as well as towards understanding patterns of sequence evolution. Orthology assignments are usually derived directly from sequence similarities for large data because more exact approaches exhibit too high computational costs. Here we present PoFF, an extension for the standalone tool Proteinortho, which enhances orthology detection by combining clustering, sequence similarity, and synteny. In the course of this work, FFAdj-MCS, a heuristic that assesses pairwise gene order using adjacencies (a similarity measure related to the breakpoint distance) was adapted to support multiple linear chromosomes and extended to detect duplicated regions. PoFF largely reduces the number of false positives and enables more fine-grained predictions than purely similarity-based approaches. The extension maintains the low memory requirements and the efficient concurrency options of its basis Proteinortho, making the software applicable to very large datasets.
Full Text Available The elucidation of orthology relationships is an important step both in gene function prediction as well as towards understanding patterns of sequence evolution. Orthology assignments are usually derived directly from sequence similarities for large data because more exact approaches exhibit too high computational costs. Here we present PoFF, an extension for the standalone tool Proteinortho, which enhances orthology detection by combining clustering, sequence similarity, and synteny. In the course of this work, FFAdj-MCS, a heuristic that assesses pairwise gene order using adjacencies (a similarity measure related to the breakpoint distance was adapted to support multiple linear chromosomes and extended to detect duplicated regions. PoFF largely reduces the number of false positives and enables more fine-grained predictions than purely similarity-based approaches. The extension maintains the low memory requirements and the efficient concurrency options of its basis Proteinortho, making the software applicable to very large datasets.
Full Text Available BACKGROUND: The Usher syndrome (USH is the most frequent deaf-blindness hereditary disease in humans. Deafness is attributed to the disorganization of stereocilia in the inner ear. USH1, the most severe subtype, is associated with mutations in genes encoding myosin VIIa, harmonin, cadherin 23, protocadherin 15, and sans. Myosin VIIa, harmonin, cadherin 23, and protocadherin 15 physically interact in vitro and localize to stereocilia tips in vivo, indicating that they form functional complexes. Sans, in contrast, localizes to vesicle-like structures beneath the apical membrane of stereocilia-displaying hair cells. How mutations in sans result in deafness and blindness is not well understood. Orthologs of myosin VIIa and protocadherin 15 have been identified in Drosophila melanogaster and their genetic analysis has identified essential roles in auditory perception and microvilli morphogenesis, respectively. PRINCIPAL FINDINGS: Here, we have identified and characterized the Drosophila ortholog of human sans. Drosophila Sans is expressed in tubular organs of the embryo, in lens-secreting cone cells of the adult eye, and in microvilli-displaying follicle cells during oogenesis. Sans mutants are viable, fertile, and mutant follicle cells appear to form microvilli, indicating that Sans is dispensable for fly development and microvilli morphogenesis in the follicle epithelium. In follicle cells, Sans protein localizes, similar to its vertebrate ortholog, to intracellular punctate structures, which we have identified as early endosomes associated with the syntaxin Avalanche. CONCLUSIONS: Our work is consistent with an evolutionary conserved function of Sans in vesicle trafficking. Furthermore it provides a significant basis for further understanding of the role of this Usher syndrome ortholog in development and disease.
Full Text Available Historically, the members of the Agrobacterium genus have been considered the only bacterial species naturally able to transfer and integrate DNA into the genomes of their eukaryotic hosts. Yet, increasing evidence suggests that this ability to genetically transform eukaryotic host cells might be more widespread in the bacterial world. Indeed, analyses of accumulating genomic data reveal cases of horizontal gene transfer from bacteria to eukaryotes and suggest that it represents a significant force in adaptive evolution of eukaryotic species. Specifically, recent reports indicate that bacteria other than Agrobacterium, such as Bartonella henselae (a zoonotic pathogen, Rhizobium etli (a plant-symbiotic bacterium related to Agrobacterium, or even Escherichia coli, have the ability to genetically transform their host cells under laboratory conditions. This DNA transfer relies on type IV secretion systems (T4SSs, the molecular machines that transport macromolecules during conjugative plasmid transfer and also during transport of proteins and/or DNA to the eukaryotic recipient cells. In this review article, we explore the extent of possible transfer of genetic information from bacteria to eukaryotic cells as well as the evolutionary implications and potential applications of this transfer.
Natale, D A; Shankavaram, U T; Galperin, M Y; Wolf, Y I; Aravind, L; Koonin, E V
Standard archival sequence databases have not been designed as tools for genome annotation and are far from being optimal for this purpose. We used the database of Clusters of Orthologous Groups of proteins (COGs) to reannotate the genomes of two archaea, Aeropyrum pernix, the first member of the Crenarchaea to be sequenced, and Pyrococcus abyssi. A. pernix and P. abyssi proteins were assigned to COGs using the COGNITOR program; the results were verified on a case-by-case basis and augmented by additional database searches using the PSI-BLAST and TBLASTN programs. Functions were predicted for over 300 proteins from A. pernix, which could not be assigned a function using conventional methods with a conservative sequence similarity threshold, an approximately 50% increase compared to the original annotation. A. pernix shares most of the conserved core of proteins that were previously identified in the Euryarchaeota. Cluster analysis or distance matrix tree construction based on the co-occurrence of genomes in COGs showed that A. pernix forms a distinct group within the archaea, although grouping with the two species of Pyrococci, indicative of similar repertoires of conserved genes, was observed. No indication of a specific relationship between Crenarchaeota and eukaryotes was obtained in these analyses. Several proteins that are conserved in Euryarchaeota and most bacteria are unexpectedly missing in A. pernix, including the entire set of de novo purine biosynthesis enzymes, the GTPase FtsZ (a key component of the bacterial and euryarchaeal cell-division machinery), and the tRNA-specific pseudouridine synthase, previously considered universal. A. pernix is represented in 48 COGs that do not contain any euryarchaeal members. Many of these proteins are TCA cycle and electron transport chain enzymes, reflecting the aerobic lifestyle of A. pernix. Special-purpose databases organized on the basis of phylogenetic analysis and carefully curated with respect to known and
Arkhipova, Irina R; Batzer, Mark A; Brosius, Juergen; Feschotte, Cédric; Moran, John V; Schmitz, Jürgen; Jurka, Jerzy
The third international conference on the genomic impact of eukaryotic transposable elements (TEs) was held 24 to 28 February 2012 at the Asilomar Conference Center, Pacific Grove, CA, USA. Sponsored in part by the National Institutes of Health grant 5 P41 LM006252, the goal of the conference was to bring together researchers from around the world who study the impact and mechanisms of TEs using multiple computational and experimental approaches. The meeting drew close to 170 attendees and included invited floor presentations on the biology of TEs and their genomic impact, as well as numerous talks contributed by young scientists. The workshop talks were devoted to computational analysis of TEs with additional time for discussion of unresolved issues. Also, there was ample opportunity for poster presentations and informal evening discussions. The success of the meeting reflects the important role of Repbase in comparative genomic studies, and emphasizes the need for close interactions between experimental and computational biologists in the years to come.
Arkhipova Irina R
Full Text Available Abstract The third international conference on the genomic impact of eukaryotic transposable elements (TEs was held 24 to 28 February 2012 at the Asilomar Conference Center, Pacific Grove, CA, USA. Sponsored in part by the National Institutes of Health grant 5 P41 LM006252, the goal of the conference was to bring together researchers from around the world who study the impact and mechanisms of TEs using multiple computational and experimental approaches. The meeting drew close to 170 attendees and included invited floor presentations on the biology of TEs and their genomic impact, as well as numerous talks contributed by young scientists. The workshop talks were devoted to computational analysis of TEs with additional time for discussion of unresolved issues. Also, there was ample opportunity for poster presentations and informal evening discussions. The success of the meeting reflects the important role of Repbase in comparative genomic studies, and emphasizes the need for close interactions between experimental and computational biologists in the years to come.
We sequenced the genome of Theileria orientalis, a tick-borne apicomplexan protozoan parasite of cattle. The focus of this study was a comparative genome analysis of T. orientalis relative to other highly pathogenic Theileria species, T. parva and T. annulata. T. parva and T. annulata induce transformation of infected cells of lymphocyte or macrophage/monocyte lineages; in contrast, T. orientalis does not induce uncontrolled proliferation of infected leukocytes and multiplies predominantly within infected erythrocytes. While synteny across homologous chromosomes of the three Theileria species was found to be well conserved overall, subtelomeric structures were found to differ substantially, as T. orientalis lacks the large tandemly arrayed subtelomere-encoded variable secreted protein-encoding gene family. Moreover, expansion of particular gene families by gene duplication was found in the genomes of the two transforming Theileria species, most notably, the TashAT/TpHN and Tar/Tpr gene families. Gene families that are present only in T. parva and T. annulata and not in T. orientalis, Babesia bovis, or Plasmo-dium were also identified. Identification of differences between the genome sequences of Theileria species with different abilities to transform and immortalize bovine leukocytes will provide insight into proteins and mechanisms that have evolved to induce and regulate this process. The T. orientalis genome database is available at http://totdb.czc.hokudai.ac.jp/. 2012 Hayashida et al. T.
Kleinow, T.; Bhalerao, R.; Breuer, F.
Yeast Snf4 is a prototype of activating gamma-subunits of conserved Snf1/AMPK-related protein kinases (SnRKs) controlling glucose and stress signaling in eukaryotes. The catalytic subunits of Arabidopsis SnRKs, AKIN10 and AKIN11, interact with Snf4 and suppress the snf1 and snf4 mutations in yeast....... By expression of an Arabidopsis cDNA library in yeast, heterologous multicopy snf4 suppressors were isolated. In addition to AKIN10 and AKIN11, the deficiency of yeast snf4 mutant to grown on non-fermentable carbon source was suppressed by Arabidopsis Myb30, CAAT-binding factor Hap3b, casein kinase I, zinc......-finger factors AZF2 and ZAT10, as well as orthologs of hexose/UDP-hexose transporters, calmodulin, SMC1-cohesin and Snf4. Here we describe the characterization of AtSNF4, a functional Arabidopsis Snf4 ortholog, that interacts with yeast Snf1 and specifically binds to the C-terminal regulatory domain...
Breton, Sophie; Stewart, Donald T
Mitochondrial DNA (mtDNA) is predominantly maternally inherited in eukaryotes. Diverse molecular mechanisms underlying the phenomenon of strict maternal inheritance (SMI) of mtDNA have been described, but the evolutionary forces responsible for its predominance in eukaryotes remain to be elucidated. Exceptions to SMI have been reported in diverse eukaryotic taxa, leading to the prediction that several distinct molecular mechanisms controlling mtDNA transmission are present among the eukaryotes. We propose that these mechanisms will be better understood by studying the deviations from the predominating pattern of SMI. This minireview summarizes studies on eukaryote species with unusual or rare mitochondrial inheritance patterns, i.e., other than the predominant SMI pattern, such as maternal inheritance of stable heteroplasmy, paternal leakage of mtDNA, biparental and strictly paternal inheritance, and doubly uniparental inheritance of mtDNA. The potential genes and mechanisms involved in controlling mitochondrial inheritance in these organisms are discussed. The linkage between mitochondrial inheritance and sex determination is also discussed, given that the atypical systems of mtDNA inheritance examined in this minireview are frequently found in organisms with uncommon sexual systems such as gynodioecy, monoecy, or andromonoecy. The potential of deviations from SMI for facilitating a better understanding of a number of fundamental questions in biology, such as the evolution of mtDNA inheritance, the coevolution of nuclear and mitochondrial genomes, and, perhaps, the role of mitochondria in sex determination, is considerable.
Parfrey, Laura Wegener; Walters, William A.; Lauber, Christian L.; Clemente, Jose C.; Berg-Lyons, Donna; Teiling, Clotilde; Kodira, Chinnappa; Mohiuddin, Mohammed; Brunelle, Julie; Driscoll, Mark; Fierer, Noah; Gilbert, Jack A.; Knight, Rob
Eukaryotic microbes (protists) residing in the vertebrate gut influence host health and disease, but their diversity and distribution in healthy hosts is poorly understood. Protists found in the gut are typically considered parasites, but many are commensal and some are beneficial. Further, the hygiene hypothesis predicts that association with our co-evolved microbial symbionts may be important to overall health. It is therefore imperative that we understand the normal diversity of our eukaryotic gut microbiota to test for such effects and avoid eliminating commensal organisms. We assembled a dataset of healthy individuals from two populations, one with traditional, agrarian lifestyles and a second with modern, westernized lifestyles, and characterized the human eukaryotic microbiota via high-throughput sequencing. To place the human gut microbiota within a broader context our dataset also includes gut samples from diverse mammals and samples from other aquatic and terrestrial environments. We curated the SILVA ribosomal database to reflect current knowledge of eukaryotic taxonomy and employ it as a phylogenetic framework to compare eukaryotic diversity across environment. We show that adults from the non-western population harbor a diverse community of protists, and diversity in the human gut is comparable to that in other mammals. However, the eukaryotic microbiota of the western population appears depauperate. The distribution of symbionts found in mammals reflects both host phylogeny and diet. Eukaryotic microbiota in the gut are less diverse and more patchily distributed than bacteria. More broadly, we show that eukaryotic communities in the gut are less diverse than in aquatic and terrestrial habitats, and few taxa are shared across habitat types, and diversity patterns of eukaryotes are correlated with those observed for bacteria. These results outline the distribution and diversity of microbial eukaryotic communities in the mammalian gut and across
Xiong, Jinbo; Yu, Weina; Dai, Wenfang; Zhang, Jinjie; Qiu, Qiongfen; Ou, Changrong
One common notion is emerging that gut eukaryotes are commensal or beneficial, rather than detrimental. To date, however, surprisingly few studies have been taken to discern the factors that govern the assembly of gut eukaryotes, despite growing interest in the dysbiosis of gut microbiota-disease relationship. Herein, we firstly explored how the gut eukaryotic microbiotas were assembled over shrimp postlarval to adult stages and a disease progression. The gut eukaryotic communities changed markedly as healthy shrimp aged, and converged toward an adult-microbiota configuration. However, the adult-like stability was distorted by disease exacerbation. A null model untangled that the deterministic processes that governed the gut eukaryotic assembly tended to be more important over healthy shrimp development, whereas this trend was inverted as the disease progressed. After ruling out the baseline of gut eukaryotes over shrimp ages, we identified disease-discriminatory taxa (species level afforded the highest accuracy of prediction) that characteristic of shrimp health status. The profiles of these taxa contributed an overall 92.4% accuracy in predicting shrimp health status. Notably, this model can accurately diagnose the onset of shrimp disease. Interspecies interaction analysis depicted how the disease-discriminatory taxa interacted with one another in sustaining shrimp health. Taken together, our findings offer novel insights into the underlying ecological processes that govern the assembly of gut eukaryotes over shrimp postlarval to adult stages and a disease progression. Intriguingly, the established model can quantitatively and accurately predict the incidences of shrimp disease.
Mariscal, Carlos; Doolittle, W Ford
In the half century since the formulation of the prokaryote : eukaryote dichotomy, many authors have proposed that the former evolved from something resembling the latter, in defiance of common (and possibly common sense) views. In such 'eukaryotes first' (EF) scenarios, the last universal common ancestor is imagined to have possessed significantly many of the complex characteristics of contemporary eukaryotes, as relics of an earlier 'progenotic' period or RNA world. Bacteria and Archaea thus must have lost these complex features secondarily, through 'streamlining'. If the canonical three-domain tree in which Archaea and Eukarya are sisters is accepted, EF entails that Bacteria and Archaea are convergently prokaryotic. We ask what this means and how it might be tested. © 2015 The Author(s).
Kong, Jin Sun; Kang, Ji-Youn; Kim, Hye Lim; Kwon, O-Seob; Lee, Kon Ho; Park, Young Shik
6-Pyruvoyltetrahydropterin synthase (PTPS) catalyzes the second step of tetrahydrobiopterin (BH4) synthesis. We previously identified PTPS orthologs (bPTPS-Is) in bacteria which do not produce BH4. In this study we disrupted the gene encoding bPTPS-I in Synechococcus sp. PCC 7942, which produces BH4-glucoside. The mutant was normal in BH4-glucoside production, demonstrating that bPTPS-I does not participate in BH4 synthesis in vivo and bringing us a new PTPS ortholog (bPTPS-II) of a bimodular polypeptide. The recombinant Synechococcus bPTPS-II was assayed in vitro to show PTPS activity higher than human enzyme. Further computational analysis revealed the presence of mono and bimodular bPTPS-II orthologs mostly in green sulfur bacteria and cyanobacteria, respectively, which are well known for BH4-glycoside production. In summary we found new bacterial PTPS orthologs, having either a single or dual domain structure and being responsible for BH4 synthesis in vivo, thereby disclosing all the bacterial PTPS homologs.
This paper develops a conceptual framework for addressing questions about reproduction, individuality, and the units of selection in symbiotic associations, with special attention to the origin of the eukaryotic cell. Three kinds of reproduction are distinguished, and a possible evolutionary sequence giving rise to a mitochondrion-containing eukaryotic cell from an endosymbiotic partnership is analyzed as a series of transitions between each of the three forms of reproduction. The sequence of changes seen in this “egalitarian” evolutionary transition is compared with those that apply in “fraternal” transitions, such as the evolution of multicellularity in animals. PMID:26286983
IONS: Identification of Orthologs by Neighborhood and Similarity—an Automated Method to Identify Orthologs in Chromosomal Regions of Common Evolutionary Ancestry and its Application to Hemiascomycetous Yeasts
Seret, Marie-Line; Baret, Philippe V.
Comparative sequence analysis is widely used to infer gene function and study genome evolution and requires proper ortholog identification across different genomes. We have developed a program for the Identification of Orthologs in one-to-one relationship by Neighborhood and Similarity (IONS) between closely related species. The algorithm combines two levels of evidence to determine co-ancestrality at the genome scale: sequence similarity and shared neighborhood. The method was initially designed to provide anchor points for syntenic blocks within the Génolevures project concerning nine hemiascomycetous yeasts (about 50,000 genes) and is applicable to different input databases. Comparison based on use of a Rand index shows that the results are highly consistent with the pillars of the Yeast Gene Order Browser, a manually curated database. Compared with SYNERGY, another algorithm reporting homology relationships, our method’s main advantages are its automation and the absence of dataset-dependent parameters, facilitating consistent integration of newly released genomes. PMID:21918595
Full Text Available List Contact us MicrobeDB.jp Ortholog Data detail Data name Ortholog DOI 10.18908/lsdba.nbdc01181-010.V002 V...814 triples - About This Database Database Description Download License Update History of This Database Site Policy | Contact Us Ortholog - MicrobeDB.jp | LSDB Archive ...
The endosymbiotic hypothesis for the origin of the eukaryotic cell has been applied to the origin of the mitochondria and chloroplasts. However as has been pointed out by Mereschowsky in 1905, it should also be applied to the nucleus as well. If the nucleus, mitochondria and chloroplasts are endosymbionts, then it is likely that the organism that did the engulfing was not a DNA-based organism. In fact, it is useful to postulate that this organism was a primitive RNA-based organism. This hypothesis would explain the preponderance of RNA viruses found in eukaryotic cells. The centriole and basal body do not have a double membrane or DNA. Like all MTOCs (microtubule organising centres), they have a structural or morphic RNA implicated in their formation. This would argue for their origin in the early RNA-based organism rather than in an endosymbiotic event involving bacteria. Finally, the eukaryotic cell uses RNA in ways quite unlike bacteria, thus pointing to a greater emphasis of RNA in both control and structure in the cell. The origin of the eukaryotic cell may tell us why it rather than its prokaryotic relative evolved into the metazoans who are reading this paper.
The yeast Saccharomyces cerevisiae is one of the champions of basic biomedical research due to its compact eukaryotic genome and ease of experimental manipulation. Despite these immense strengths, its impact on understanding the genetic basis of natural phenotypic variation has been limited by strai...
Cabrera, Antonio; Kozik, Alex; Howad, Werner; Arus, Pere; Iezzoni, Amy F; van der Knaap, Esther
Detailed comparative genome analyses within the economically important Rosaceae family have not been conducted. This is largely due to the lack of conserved gene-based molecular markers that are transferable among the important crop genera within the family [e.g. Malus (apple), Fragaria (strawberry), and Prunus (peach, cherry, apricot and almond)]. The lack of molecular markers and comparative whole genome sequence analysis for this family severely hampers crop improvement efforts as well as QTL confirmation and validation studies. We identified a set of 3,818 rosaceaous unigenes comprised of two or more ESTs that correspond to single copy Arabidopsis genes. From this Rosaceae Conserved Orthologous Set (RosCOS), 1039 were selected from which 857 were used for the development of intron-flanking primers and allele amplification. This led to successful amplification and subsequent mapping of 613 RosCOS onto the Prunus TxE reference map resulting in a genome-wide coverage of 0.67 to 1.06 gene-based markers per cM per linkage group. Furthermore, the RosCOS primers showed amplification success rates from 23 to 100% across the family indicating that a substantial part of the RosCOS primers can be directly employed in other less studied rosaceaous crops. Comparisons of the genetic map positions of the RosCOS with the physical locations of the orthologs in the Populus trichocarpa genome identified regions of colinearity between the genomes of Prunus-Rosaceae and Populus-Salicaceae. Conserved orthologous genes are extremely useful for the analysis of genome evolution among closely and distantly related species. The results presented in this study demonstrate the considerable potential of the mapped Prunus RosCOS for genome-wide marker employment and comparative whole genome studies within the Rosaceae family. Moreover, these markers will also function as useful anchor points for the genome sequencing efforts currently ongoing in this family as well as for comparative QTL
Atteia, Ariane; van Lis, Robert; Tielens, Aloysius G M; Martin, William F
Anaerobic metabolic pathways allow unicellular organisms to tolerate or colonize anoxic environments. Over the past ten years, genome sequencing projects have brought a new light on the extent of anaerobic metabolism in eukaryotes. A surprising development has been that free-living unicellular algae capable of photoautotrophic lifestyle are, in terms of their enzymatic repertoire, among the best equipped eukaryotes known when it comes to anaerobic energy metabolism. Some of these algae are marine organisms, common in the oceans, others are more typically soil inhabitants. All these species are important from the ecological (O(2)/CO(2) budget), biotechnological, and evolutionary perspectives. In the unicellular algae surveyed here, mixed-acid type fermentations are widespread while anaerobic respiration, which is more typical of eukaryotic heterotrophs, appears to be rare. The presence of a core anaerobic metabolism among the algae provides insights into its evolutionary origin, which traces to the eukaryote common ancestor. The predicted fermentative enzymes often exhibit an amino acid extension at the N-terminus, suggesting that these proteins might be compartmentalized in the cell, likely in the chloroplast or the mitochondrion. The green algae Chlamydomonas reinhardtii and Chlorella NC64 have the most extended set of fermentative enzymes reported so far. Among the eukaryotes with secondary plastids, the diatom Thalassiosira pseudonana has the most pronounced anaerobic capabilities as yet. From the standpoints of genomic, transcriptomic, and biochemical studies, anaerobic energy metabolism in C. reinhardtii remains the best characterized among photosynthetic protists. This article is part of a Special Issue entitled: The evolutionary aspects of bioenergetic systems. Copyright © 2012 Elsevier B.V. All rights reserved.
Rasmussen, Jane Lind Nybo; Vesth, Tammi Camilla; Theobald, Sebastian
of genotype-to-phenotype. To achieve this, we have developed orthologous protein prediction software that utilizes genus-wide genetic diversity. The approach is optimized for large data sets, based on BLASTp considering protein identity and alignment coverage, and clustering using single linkage of bi......The Aspergillus genus contains leading industrial microorganisms, excelling in producing bioactive compounds and enzymes. Using synthetic biology and bioinformatics, we aim to re-engineer these organisms for applications within human health, pharmaceuticals, environmental engineering, and food......-directional hits. The result is orthologous protein families describing the genomic and functional features of individual species, clades and the core/pan genome of Aspergillus; and applicable to genotype-to-phenotype analyses in other microbial genera....
Giladi, Moshe; Sasson, Yehezkel; Fang, Xianyang; Hiller, Reuben; Buki, Tal; Wang, Yun-Xing; Hirsch, Joel A.; Khananshvili, Daniel
Na+/Ca2+ exchanger (NCX) proteins mediate Ca2+-fluxes across the cell membrane to maintain Ca2+ homeostasis in many cell types. Eukaryotic NCX contains Ca2+-binding regulatory domains, CBD1 and CBD2. Ca2+ binding to a primary sensor (Ca3-Ca4 sites) on CBD1 activates mammalian NCXs, whereas CALX, a Drosophila NCX ortholog, displays an inhibitory response to regulatory Ca2+. To further elucidate the underlying regulatory mechanisms, we determined the 2.7 Å crystal structure of mammalian CBD12-E454K, a two-domain construct that retains wild-type properties. In conjunction with stopped-flow kinetics and SAXS (small-angle X-ray scattering) analyses of CBD12 mutants, we show that Ca2+ binding to Ca3-Ca4 sites tethers the domains via a network of interdomain salt-bridges. This Ca2+-driven interdomain switch controls slow dissociation of “occluded” Ca2+ from the primary sensor and thus dictates Ca2+ sensing dynamics. In the Ca2+-bound conformation, the interdomain angle of CBD12 is very similar in NCX and CALX, meaning that the interdomain distances cannot account for regulatory diversity in NCX and CALX. Since the two-domain interface is nearly identical among eukaryotic NCXs, including CALX, we suggest that the Ca2+-driven interdomain switch described here represents a general mechanism for initial conduction of regulatory signals in NCX variants. PMID:22768191
Wang, Yongbo; Liu, Zexian; Cheng, Han; Gao, Tianshun; Pan, Zhicheng; Yang, Qing; Guo, Anyuan; Xue, Yu
Full Text Available Abstract Background In the Calvin cycle of eubacteria, the dephosphorylations of both fructose-1, 6-bisphosphate (FBP and sedoheptulose-1, 7-bisphosphate (SBP are catalyzed by the same bifunctional enzyme: fructose-1, 6-bisphosphatase/sedoheptulose-1, 7-bisphosphatase (F/SBPase, while in that of eukaryotic chloroplasts by two distinct enzymes: chloroplastic fructose-1, 6-bisphosphatase (FBPase and sedoheptulose-1, 7-bisphosphatase (SBPase, respectively. It was proposed that these two eukaryotic enzymes arose from the divergence of a common ancestral eubacterial bifunctional F/SBPase of mitochondrial origin. However, no specific affinity between SBPase and eubacterial FBPase or F/SBPase can be observed in the previous phylogenetic analyses, and it is hard to explain why SBPase and/or F/SBPase are/is absent from most extant nonphotosynthetic eukaryotes according to this scenario. Results Domain analysis indicated that eubacterial F/SBPase of two different resources contain distinct domains: proteobacterial F/SBPases contain typical FBPase domain, while cyanobacterial F/SBPases possess FBPase_glpX domain. Therefore, like prokaryotic FBPase, eubacterial F/SBPase can also be divided into two evolutionarily distant classes (Class I and II. Phylogenetic analysis based on a much larger taxonomic sampling than previous work revealed that all eukaryotic SBPase cluster together and form a close sister group to the clade of epsilon-proteobacterial Class I FBPase which are gluconeogenesis-specific enzymes, while all eukaryotic chloroplast FBPase group together with eukaryotic cytosolic FBPase and form another distinct clade which then groups with the Class I FBPase of diverse eubacteria. Motif analysis of these enzymes also supports these phylogenetic correlations. Conclusions There are two evolutionarily distant classes of eubacterial bifunctional F/SBPase. Eukaryotic FBPase and SBPase do not diverge from either of them but have two independent origins
Yang, Xiaowen; Wang, Jiawei; Bing, Guoxia; Bie, Pengfei; De, Yanyan; Lyu, Yanli; Wu, Qingmin
Bioinformatics and comparative genomics analysis methods were used to predict unknown pathogen genes based on homology with identified or functionally clustered genes. In this study, the genes of common pathogens were analyzed to screen and identify genes associated with intracellular survival through sequence similarity, phylogenetic tree analysis and the λ-Red recombination system test method. The total 38,952 protein-coding genes of common pathogens were divided into 19,775 clusters. As demonstrated through a COG analysis, information storage and processing genes might play an important role intracellular survival. Only 19 clusters were present in facultative intracellular pathogens, and not all were present in extracellular pathogens. Construction of a phylogenetic tree selected 18 of these 19 clusters. Comparisons with the DEG database and previous research revealed that seven other clusters are considered essential gene clusters and that seven other clusters are associated with intracellular survival. Moreover, this study confirmed that clusters screened by orthologs with similar function could be replaced with an approved uvrY gene and its orthologs, and the results revealed that the usg gene is associated with intracellular survival. The study improves the current understanding of intracellular pathogens characteristics and allows further exploration of the intracellular survival-related gene modules in these pathogens. Copyright © 2018. Published by Elsevier B.V.
OhEigeartaigh, Sean S
Abstract Background In standard BLAST searches, no information other than the sequences of the query and the database entries is considered. However, in situations where two genes from different species have only borderline similarity in a BLAST search, the discovery that the genes are located within a region of conserved gene order (synteny) can provide additional evidence that they are orthologs. Thus, for interpreting borderline search results, it would be useful to know whether the syntenic context of a database hit is similar to that of the query. This principle has often been used in investigations of particular genes or genomic regions, but to our knowledge it has never been implemented systematically. Results We made use of the synteny information contained in the Yeast Gene Order Browser database for 11 yeast species to carry out a systematic search for protein-coding genes that were overlooked in the original annotations of one or more yeast genomes but which are syntenic with their orthologs. Such genes tend to have been overlooked because they are short, highly divergent, or contain introns. The key features of our software - called SearchDOGS - are that the database entries are classified into sets of genomic segments that are already known to be orthologous, and that very weak BLAST hits are retained for further analysis if their genomic location is similar to that of the query. Using SearchDOGS we identified 595 additional protein-coding genes among the 11 yeast species, including two new genes in Saccharomyces cerevisiae. We found additional genes for the mating pheromone a-factor in six species including Kluyveromyces lactis. Conclusions SearchDOGS has proven highly successful for identifying overlooked genes in the yeast genomes. We anticipate that our approach can be adapted for study of further groups of species, such as bacterial genomes. More generally, the concept of doing sequence similarity searches against databases to which external
Full Text Available Eggplant, pepper and tomato are the most exploited berry-producing vegetables within the Solanaceae family. Their genomes differ in size, but each has 12 chromosomes which have undergone rearrangements causing a redistribution of loci. The genome sequences of all three species are available but differ in coverage, assembly quality and percentage of anchorage.Determining their syntenic relationship and QTL orthology will contribute to exploit genomic resources and genetic data for key agronomic traits.The syntenic analysis between tomato and pepper based on the alignment of 34,727 tomato CDS to the pepper genome sequence, identified 19,734 unique hits. The resulting synteny map confirmed the 14 inversions and 10 translocations previously documented, but also highlighted 3 new translocations and 4 major new inversions. Furthermore, each of the 12 chromosomes exhibited a number of rearrangements involving small regions of 0.5-0.7 Mbp.Due to high fragmentation of the publicly available eggplant genome sequence, physical localization of most eggplant QTL was not possible, thus, we compared the organization of the eggplant genetic map with the genome sequence of both tomato and pepper. The eggplant/tomato syntenic map confirmed all the 10 translocations but only 9 of the 14 known inversions; on the other hand, a newly detected inversion was recognized while another one was not confirmed. The eggplant/pepper syntenic map confirmed 10 translocations and 8 inversions already detected and suggested a putative new translocation.In order to perform the assessment of eggplant and pepper QTL orthology, the eggplant and pepper sequence-based markers located in their respective genetic map were aligned onto the pepper genome. GBrowse in pepper was used as reference platform for QTL positioning. A set of 151 pepper QTL were located as well as 212 eggplant QTL, including 76 major QTL (PVE ≥ 10% affecting key agronomic traits. Most were confirmed to cluster in
Huson, Daniel H; Tappu, Rewati; Bazinet, Adam L; Xie, Chao; Cummings, Michael P; Nieselt, Kay; Williams, Rohan
Microbiome sequencing projects typically collect tens of millions of short reads per sample. Depending on the goals of the project, the short reads can either be subjected to direct sequence analysis or be assembled into longer contigs. The assembly of whole genomes from metagenomic sequencing reads is a very difficult problem. However, for some questions, only specific genes of interest need to be assembled. This is then a gene-centric assembly where the goal is to assemble reads into contigs for a family of orthologous genes. We present a new method for performing gene-centric assembly, called protein-alignment-guided assembly, and provide an implementation in our metagenome analysis tool MEGAN. Genes are assembled on the fly, based on the alignment of all reads against a protein reference database such as NCBI-nr. Specifically, the user selects a gene family based on a classification such as KEGG and all reads binned to that gene family are assembled. Using published synthetic community metagenome sequencing reads and a set of 41 gene families, we show that the performance of this approach compares favorably with that of full-featured assemblers and that of a recently published HMM-based gene-centric assembler, both in terms of the number of reference genes detected and of the percentage of reference sequence covered. Protein-alignment-guided assembly of orthologous gene families complements whole-metagenome assembly in a new and very useful way.
Margulis, L.; Hinkle, G.
Ongoing work to test the hypothesis of the origin of eukaryotic cell organelles by microbial symbioses is discussed. Because of the widespread acceptance of the serial endosymbiotic theory (SET) of the origin of plastids and mitochondria, the idea of the symbiotic origin of the centrioles and axonemes for spirochete bacteria motility symbiosis was tested. Intracellular microtubular systems are purported to derive from symbiotic associations between ancestral eukaryotic cells and motile bacteria. Four lines of approach to this problem are being pursued: (1) cloning the gene of a tubulin-like protein discovered in Spirocheata bajacaliforniesis; (2) seeking axoneme proteins in spirochets by antibody cross-reaction; (3) attempting to cultivate larger, free-living spirochetes; and (4) studying in detail spirochetes (e.g., Cristispira) symbiotic with marine animals. Other aspects of the investigation are presented.
Liu, J.; Bolstad, D; Bolstad, E; Wright, D; Anderson, A
Trimethoprim, an antifolate commonly prescribed in combination with sulfamethoxazole, potently inhibits several prokaryotic species of dihydrofolate reductase (DHFR). However, several eukaryotic pathogenic organisms are resistant to trimethoprim, preventing its effective use as a therapeutic for those infections. We have been building a program to reengineer trimethoprim to more potently and selectively inhibit eukaryotic species of DHFR as a viable strategy for new drug discovery targeting several opportunistic pathogens. We have developed a series of compounds that exhibit potent and selective inhibition of DHFR from the parasitic protozoa Cryptosporidium and Toxoplasma as well as the fungus Candida glabrata. A comparison of the structures of DHFR from the fungal species Candida glabrata and Pneumocystis suggests that the compounds may also potently inhibit Pneumocystis DHFR.
Dick, Claudia F; Dos-Santos, André L A; Meyer-Fernandes, José R
Inorganic phosphate (Pi) is an essential nutrient for all organisms. The route of Pi utilization begins with Pi transport across the plasma membrane. Here, we analyzed the gene sequences and compared the biochemical profiles, including kinetic and modulator parameters, of Pi transporters in unicellular eukaryotes. The objective of this review is to evaluate the recent findings regarding Pi uptake mechanisms in microorganisms, such as the fungi Neurospora crassa and Saccharomyces cerevisiae and the parasite protozoans Trypanosoma cruzi, Trypanosoma rangeli, Leishmania infantum and Plasmodium falciparum. Pi uptake is the key step of Pi homeostasis and in the subsequent signaling event in eukaryotic microorganisms. Biochemical and structural studies are important for clarifying mechanisms of Pi homeostasis, as well as Pi sensor and downstream pathways, and raise possibilities for future studies in this field. Copyright © 2014 Elsevier B.V. All rights reserved.
Full Text Available The industrial production of fine chemicals, feed and food ingredients, pharmaceuticals, agrochemicals and their respective intermediates relies on an increasing application of biocatalysis, i.e. on enzyme or whole-cell catalyzed conversions of molecules. Simple procedures for discovery, cloning and over-expression as well as fast growth favour fungi, yeasts and especially bacteria as sources of biocatalysts. Higher eukaryotes also harbour an almost unlimited number of potential biocatalysts, although to date the limited supply of enzymes, the high heterogeneity of enzyme preparations and the hazard of infectious contaminants keep some interesting candidates out of reach for industrial bioprocesses. In the past only a few animal and plant enzymes from agricultural waste materials were employed in food processing. The use of bacterial expression strains or non-conventional yeasts for the heterologous production of efficient eukaryotic enzymes can overcome the bottleneck in enzyme supply and provide sufficient amounts of homogenous enzyme preparations for reliable and economically feasible applications at large scale. Ideal enzymatic processes represent an environmentally friendly, »near-to-completion« conversion of (mostly non-natural substrates to pure products. Recent developments demonstrate the commercial feasibility of large-scale biocatalytic processes employing enzymes from higher eukaryotes (e.g. plants, animals and also their usefulness in some small-scale industrial applications.
Full Text Available Arsenic and antimony are toxic metalloids, naturally present in the environment and all organisms have developed pathways for their detoxification. The most effective metalloid tolerance systems in eukaryotes include downregulation of metalloid uptake, efflux out of the cell, and complexation with phytochelatin or glutathione followed by sequestration into the vacuole. Understanding of arsenic and antimony transport system is of high importance due to the increasing usage of arsenic-based drugs in the treatment of certain types of cancer and diseases caused by protozoan parasites as well as for the development of bio- and phytoremediation strategies for metalloid polluted areas. However, in contrast to prokaryotes, the knowledge about specific transporters of arsenic and antimony and the mechanisms of metalloid transport in eukaryotes has been very limited for a long time. Here, we review the recent advances in understanding of arsenic and antimony transport pathways in eukaryotes, including a dual role of aquaglyceroporins in uptake and efflux of metalloids, elucidation of arsenic transport mechanism by the yeast Acr3 transporter and its role in arsenic hyperaccumulation in ferns, identification of vacuolar transporters of arsenic-phytochelatin complexes in plants and forms of arsenic substrates recognized by mammalian ABC transporters.
Maciaszczyk-Dziubinska, Ewa; Wawrzycka, Donata; Wysocki, Robert
Arsenic and antimony are toxic metalloids, naturally present in the environment and all organisms have developed pathways for their detoxification. The most effective metalloid tolerance systems in eukaryotes include downregulation of metalloid uptake, efflux out of the cell, and complexation with phytochelatin or glutathione followed by sequestration into the vacuole. Understanding of arsenic and antimony transport system is of high importance due to the increasing usage of arsenic-based drugs in the treatment of certain types of cancer and diseases caused by protozoan parasites as well as for the development of bio- and phytoremediation strategies for metalloid polluted areas. However, in contrast to prokaryotes, the knowledge about specific transporters of arsenic and antimony and the mechanisms of metalloid transport in eukaryotes has been very limited for a long time. Here, we review the recent advances in understanding of arsenic and antimony transport pathways in eukaryotes, including a dual role of aquaglyceroporins in uptake and efflux of metalloids, elucidation of arsenic transport mechanism by the yeast Acr3 transporter and its role in arsenic hyperaccumulation in ferns, identification of vacuolar transporters of arsenic-phytochelatin complexes in plants and forms of arsenic substrates recognized by mammalian ABC transporters. PMID:22489166
Full Text Available Abstract Background Since Darwin's Origin of Species, reconstructing the Tree of Life has been a goal of evolutionists, and tree-thinking has become a major concept of evolutionary biology. Practically, building the Tree of Life has proven to be tedious. Too few morphological characters are useful for conducting conclusive phylogenetic analyses at the highest taxonomic level. Consequently, molecular sequences (genes, proteins, and genomes likely constitute the only useful characters for constructing a phylogeny of all life. For this reason, tree-makers expect a lot from gene comparisons. The simultaneous study of the largest number of molecular markers possible is sometimes considered to be one of the best solutions in reconstructing the genealogy of organisms. This conclusion is a direct consequence of tree-thinking: if gene inheritance conforms to a tree-like model of evolution, sampling more of these molecules will provide enough phylogenetic signal to build the Tree of Life. The selection of congruent markers is thus a fundamental step in simultaneous analysis of many genes. Results Heat map analyses were used to investigate the congruence of orthologues in four datasets (archaeal, bacterial, eukaryotic and alpha-proteobacterial. We conclude that we simply cannot determine if a large portion of the genes have a common history. In addition, none of these datasets can be considered free of lateral gene transfer. Conclusion Our phylogenetic analyses do not support tree-thinking. These results have important conceptual and practical implications. We argue that representations other than a tree should be investigated in this case because a non-critical concatenation of markers could be highly misleading.
Gottwein, Eva; Mukherjee, Neelanjan; Sachse, Christoph; Frenzel, Corina; Majoros, William H.; Chi, Jen-Tsan A.; Braich, Ravi; Manoharan, Muthiah; Soutschek, Jürgen; Ohler, Uwe; Cullen, Bryan R.
All metazoan eukaryotes express microRNAs (miRNAs), ∼22 nt regulatory RNAs that can repress the expression of mRNAs bearing complementary sequences1. Several DNA viruses also express miRNAs in infected cells, suggesting a role in viral replication and pathogenesis2. While specific viral miRNAs have been shown to autoregulate viral mRNAs3,4 or downregulate cellular mRNAs5,6, the function of the majority of viral miRNAs remains unknown. Here, we report that the miR-K12−11 miRNA encoded by Kaposi's Sarcoma Associated Herpesvirus (KSHV) shows significant homology to cellular miR-155, including the entire miRNA “seed” region7. Using a range of assays, we demonstrate that expression of physiological levels of miR-K12−11 or miR-155 results in the downregulation of an extensive set of common mRNA targets, including genes with known roles in cell growth regulation. Our findings indicate that viral miR-K12−11 functions as an ortholog of cellular miR-155 and has likely evolved to exploit a pre-existing gene regulatory pathway in B-cells. Moreover, the known etiological role of miR-155 in B-cell transformation8-10 suggests that miR-K12−11 may contribute to the induction of KSHV-positive B-cell tumors in infected patients. PMID:18075594
Muller, J; Szklarczyk, D; Julien, P
The identification of orthologous relationships forms the basis for most comparative genomics studies. Here, we present the second version of the eggNOG database, which contains orthologous groups (OGs) constructed through identification of reciprocal best BLAST matches and triangular linkage...... of the tree of life; in addition to the species groups included in our first release (i.e. fungi, metazoa, insects, vertebrates and mammals), we have now constructed OGs for archaea, fishes, rodents and primates. We automatically annotate the non-supervised orthologous groups (NOGs) with functional...... descriptions, protein domains, and functional categories as defined initially for the COG/KOG database. In-depth analysis is facilitated by precomputed high-quality multiple sequence alignments and maximum-likelihood trees for each of the available OGs. Altogether, eggNOG covers 2,242 035 proteins (built from...
Jain, Sourabh; Panda, Arup; Colson, Philippe; Raoult, Didier; Pontarotti, Pierre
With the inclusion of new members, understanding about evolutionary mechanisms and processes by which members of the proposed order, Megavirales, have evolved has become a key area of interest. The central role of gene acquisition has been shown in previous studies. However, the major drawback in gene acquisition studies is the focus on few MV families or putative families with large variation in their genetic structure. Thus, here we have tried to develop a methodology by which we can detect horizontal gene transfers (HGTs), taking into consideration orthologous groups of distantly related Megavirale families. Here, we report an automated workflow MimiLook, prepared as a Perl command line program, that deduces orthologous groups (OGs) from ORFomes of Megavirales and constructs phylogenetic trees by performing alignment generation, alignment editing and protein-protein BLAST (BLASTP) searching across the National Center for Biotechnology Information (NCBI) non-redundant (nr) protein sequence database. Finally, this tool detects statistically validated events of gene acquisitions with the help of the T-REX algorithm by comparing individual gene tree with NCBI species tree. In between the steps, the workflow decides about handling paralogs, filtering outputs, identifying Megavirale specific OGs, detection of HGTs, along with retrieval of information about those OGs that are monophyletic with organisms from cellular domains of life. By implementing MimiLook, we noticed that nine percent of Megavirale gene families (i.e., OGs) have been acquired by HGT, 80% OGs were Megaviralespecific and eight percent were found to be sharing common ancestry with members of cellular domains (Eukaryote, Bacteria, Archaea, Phages or other viruses) and three percent were ambivalent. The results are briefly discussed to emphasize methodology. Also, MimiLook is relevant for detecting evolutionary scenarios in other targeted phyla with user defined modifications. It can be accessed at
Kuo, Alan; Grigoriev, Igor
Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentous ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~;;10k genes. ~;;12percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~;;9percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also,>90percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. Weconclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.
Alsmark, Cecilia; Foster, Peter G; Sicheritz-Pontén, Thomas
BACKGROUND: The influence of lateral gene transfer on gene origins and biology in eukaryotes is poorly understood compared with those of prokaryotes. A number of independent investigations focusing on specific genes, individual genomes, or specific functional categories from various eukaryotes have...... approach to systematically investigate lateral gene transfer affecting the proteomes of thirteen, mainly parasitic, microbial eukaryotes, representing four of the six eukaryotic super-groups. All of the genomes investigated have been significantly affected by prokaryote-to-eukaryote lateral gene transfers...... indicated that lateral gene transfer does indeed affect eukaryotic genomes. However, the lack of common methodology and criteria in these studies makes it difficult to assess the general importance and influence of lateral gene transfer on eukaryotic genome evolution. RESULTS: We used a phylogenomic...
Full Text Available 0”. This cluster ID is uniquely-assigned by the PGDBj Ortholog Database. Cluster size Number of proteins aff...r About This Database Database Description Download License Update History of This Database Site Policy | Contact Us Cluster (Viridiplantae) - PGDBj - Ortholog DB | LSDB Archive ... ...List Contact us PGDBj - Ortholog DB Cluster (Viridiplantae) Data detail Data name Cluster (Viridiplantae) DO...switchLanguage; BLAST Search Image Search Home About Archive Update History Data
Full Text Available 3090”. This cluster ID is uniquely-assigned by the PGDBj Ortholog Database. Cluster size Number of proteins ...ster About This Database Database Description Download License Update History of This Database Site Policy | Contact Us Cluster (Cyanobacteria) - PGDBj - Ortholog DB | LSDB Archive ... ...List Contact us PGDBj - Ortholog DB Cluster (Cyanobacteria) Data detail Data name Cluster (Cyanobacteria) DO...switchLanguage; BLAST Search Image Search Home About Archive Update History Data
Full Text Available Microorganisms represent the largest component of biodiversity in our world. For millions of years, prokaryotic microorganisms have functioned as a major selective force shaping eukaryotic evolution. Microbes that live inside and on animals outnumber the animals’ actual somatic and germ cells by an estimated 10-fold. Collectively, the intestinal microbiome represents a ‘forgotten organ’, functioning as an organ inside another that can execute many physiological responsibilities. The nature of primitive eukaryotes was drastically changed due to the association with symbiotic prokaryotes facilitating mutual coevolution of host and microbe. Phytophagous insects have long been used to test theories of evolutionary diversification; moreover, the diversification of a number of phytophagous insect lineages has been linked to mutualisms with microbes. From termites and honey bees to ruminants and mammals, depending on novel biochemistries provided by the prokaryotic microbiome, the association helps to metabolize several nutrients that the host cannot digest and converting these into useful end products (such as short chain fatty acids, a process which has huge impact on the biology and homeostasis of metazoans. More importantly, in a direct and/or indirect way, the intestinal microbiota influences the assembly of gut-associated lymphoid tissue, helps to educate immune system, affects the integrity of the intestinal mucosal barrier, modulates proliferation and differentiation of its epithelial lineages, regulates angiogenesis, and modifies the activity of enteric as well as the central nervous system,. Despite these important effects, the mechanisms by which the gut microbial community influences the host’s biology remains almost entirely unknown. Our aim here is to encourage empirical inquiry into the relationship between mutualism and evolutionary diversification between prokaryotes and eukaryotes which encourage us to postulate: Who is
Jarmuszkiewicz, Wieslawa; Woyda-Ploszczyca, Andrzej; Antos-Krzeminska, Nina; Sluse, Francis E
Uncoupling proteins (UCPs) are members of the mitochondrial anion carrier protein family that are present in the mitochondrial inner membrane and mediate free fatty acid (FFA)-activated, purine nucleotide (PN)-inhibited proton conductance. Since 1999, the presence of UCPs has been demonstrated in some non-photosynthesising unicellular eukaryotes, including amoeboid and parasite protists, as well as in non-fermentative yeast and filamentous fungi. In the mitochondria of these organisms, UCP activity is revealed upon FFA-induced, PN-inhibited stimulation of resting respiration and a decrease in membrane potential, which are accompanied by a decrease in membranous ubiquinone (Q) reduction level. UCPs in unicellular eukaryotes are able to divert energy from oxidative phosphorylation and thus compete for a proton electrochemical gradient with ATP synthase. Our recent work indicates that membranous Q is a metabolic sensor that might utilise its redox state to release the PN inhibition of UCP-mediated mitochondrial uncoupling under conditions of phosphorylation and resting respiration. The action of reduced Q (QH2) could allow higher or complete activation of UCP. As this regulatory feature was demonstrated for microorganism UCPs (A. castellanii UCP), plant and mammalian UCP1 analogues, and UCP1 in brown adipose tissue, the process could involve all UCPs. Here, we discuss the functional connection and physiological role of UCP and alternative oxidase, two main energy-dissipating systems in the plant-type mitochondrial respiratory chain of unicellular eukaryotes, including the control of cellular energy balance as well as preventive action against the production of reactive oxygen species. Copyright © 2009 Elsevier B.V. All rights reserved.
Full Text Available Junfeng Liu1,2, Liang Chen3, Hongyu Zhao4, Dirk F Moore1,2, Yong Lin1,2, Weichung Joe Shih1,21Biometrics Division, The Cancer, Institute of New Jersey, New Brunswick, NJ, USA; 2Department of Biostatistics, School of Public Health, University of Medicine and Dentistry of New Jersey, Piscataway, NJ, USA; 3Department of Biological Sciences, University of Southern California, Los Angeles, CA, USA; 4Department of Epidemiology and Public Health, Yale University School of Medicine, New Haven, CT, USAAbstract: Probabilistic DNA sequence models have been intensively applied to genome research. Within the evolutionary biology framework, this article investigates the feasibility for rigorously estimating the probability of a set of orthologous DNA sequences which evolve from a common progenitor. We propose Monte Carlo integration algorithms to sample the unknown ancestral and/or root sequences a posteriori conditional on a reference sequence and apply pairwise Needleman–Wunsch alignment between the sampled and nonreference species sequences to estimate the probability. We test our algorithms on both simulated and real sequences and compare calculated probabilities from Monte Carlo integration to those induced by single multiple alignment.Keywords: evolution, Jukes–Cantor model, Monte Carlo integration, Needleman–Wunsch alignment, orthologous
Bracchi-Ricard, V; Nguyen, K T; Zhou, Y; Rajagopalan, P T; Chakrabarti, D; Pei, D
Ribosomal protein synthesis in eubacteria and eukaryotic organelles initiates with an N-formylmethionyl-tRNA(i), resulting in N-terminal formylation of all nascent polypeptides. Peptide deformylase (PDF) catalyzes the subsequent removal of the N-terminal formyl group from the majority of bacterial proteins. Until recently, PDF has been thought as an enzyme unique to the bacterial kingdom. Searches of the genomic DNA databases identified several genes that encode proteins of high sequence homology to bacterial PDF from eukaryotic organisms. The cDNA encoding Plasmodium falciparum PDF (PfPDF) has been cloned and overexpressed in Escherichia coli. The recombinant protein is catalytically active in deformylating N-formylated peptides, shares many of the properties of bacterial PDF, and is inhibited by specific PDF inhibitors. Western blot analysis indicated expression of mature PfPDF in trophozoite, schizont, and segmenter stages of intraerythrocytic development. These results provide strong evidence that a functional PDF is present in P. falciparum. In addition, PDF inhibitors inhibited the growth of P. falciparum in the intraerythrocytic culture. (c)2001 Elsevier Science.
The complete and faithful transmission of eukaryotic genome to daughter cells involves the timely duplication of mother cell's DNA. DNA replication starts at multiple chromosomal positions called replication origin. From each activated replication origin two replication forks progress in opposite direction and duplicate the mother cell's DNA. While it is widely accepted that in eukaryotic organisms replication origins are activated in a stochastic manner, little is known on the sources of the observed stochasticity. It is often associated to the population variability to enter S phase. We extract from a growing Saccharomyces cerevisiae population the average rate of origin activation in a single cell by combining single molecule measurements and a numerical deconvolution technique. We show that the temporal profile of the rate of origin activation in a single cell is similar to the one extracted from a replicating cell population. Taking into account this observation we exclude the population variability as the origin of observed stochasticity in origin activation. We confirm that the rate of origin activation increases in the early stage of S phase and decreases at the latter stage. The population average activation rate extracted from single molecule analysis is in prefect accordance with the activation rate extracted from published micro-array data, confirming therefore the homogeneity and genome scale invariance of dynamic of replication process. All these observations point toward a possible role of replication fork to control the rate of origin activation.
Full Text Available Many plants and animals are defended from predation or herbivory by inhibitory secondary metabolites, which in the marine environment are very common among sessile organisms. Among bacteria, where there is the greatest metabolic potential, little is known about chemical defenses against bacterivorous consumers. An emerging hypothesis is that sessile bacterial communities organized as biofilms serve as bacterial refuge from predation. By testing growth and survival of two common bacterivorous nanoflagellates, we find evidence that chemically mediated resistance against protozoan predators is common among biofilm populations in a diverse set of marine bacteria. Using bioassay-guided chemical and genetic analysis, we identified one of the most effective antiprotozoal compounds as violacein, an alkaloid that we demonstrate is produced predominately within biofilm cells. Nanomolar concentrations of violacein inhibit protozoan feeding by inducing a conserved eukaryotic cell death program. Such biofilm-specific chemical defenses could contribute to the successful persistence of biofilm bacteria in various environments and provide the ecological and evolutionary context for a number of eukaryote-targeting bacterial metabolites.
Full Text Available BACKGROUND: The objective of this work was to investigate the hypothesis that eukaryotic Internal Ribosome Entry Sites (IRES lack secondary structure and to examine the generality of the hypothesis. METHODOLOGY/PRINCIPAL FINDINGS: IRESs of the yeast and the fruit fly are located in the 5'UTR immediately upstream of the initiation codon. The minimum folding energy (MFE of 60 nt RNA segments immediately upstream of the initiation codons was calculated as a proxy of secondary structure stability. MFE of the reverse complements of these 60 nt segments was also calculated. The relationship between MFE and empirically determined IRES activity was investigated to test the hypothesis that strong IRES activity is associated with weak secondary structure. We show that IRES activity in the yeast and the fruit fly correlates strongly with the structural stability, with highest IRES activity found in RNA segments that exhibit the weakest secondary structure. CONCLUSIONS: We found that a subset of eukaryotic IRESs exhibits very low secondary structure in the 5'-UTR sequences immediately upstream of the initiation codon. The consistency in results between the yeast and the fruit fly suggests a possible shared mechanism of cap-independent translation initiation that relies on an unstructured RNA segment.
Okamura, Masumi; Inose, Haruko; Masuda, Seiji
In eukaryotic cells, RNAs are transcribed in the nucleus and exported to the cytoplasm through the nuclear pore complex. The RNA molecules that are exported from the nucleus into the cytoplasm include messenger RNAs (mRNAs), ribosomal RNAs (rRNAs), transfer RNAs (tRNAs), small nuclear RNAs (snRNAs), micro RNAs (miRNAs), and viral mRNAs. Each RNA is transported by a specific nuclear export receptor. It is believed that most of the mRNAs are exported by Nxf1 (Mex67 in yeast), whereas rRNAs, snRNAs, and a certain subset of mRNAs are exported in a Crm1/Xpo1-dependent manner. tRNAs and miRNAs are exported by Xpot and Xpo5. However, multiple export receptors are involved in the export of some RNAs, such as 60S ribosomal subunit. In addition to these export receptors, some adapter proteins are required to export RNAs. The RNA export system of eukaryotic cells is also used by several types of RNA virus that depend on the machineries of the host cell in the nucleus for replication of their genome, therefore this review describes the RNA export system of two representative viruses. We also discuss the NPC anchoring-dependent mRNA export factors that directly recruit specific genes to the NPC.
Full Text Available Abstract Background Detailed comparative genome analyses within the economically important Rosaceae family have not been conducted. This is largely due to the lack of conserved gene-based molecular markers that are transferable among the important crop genera within the family [e.g. Malus (apple, Fragaria (strawberry, and Prunus (peach, cherry, apricot and almond]. The lack of molecular markers and comparative whole genome sequence analysis for this family severely hampers crop improvement efforts as well as QTL confirmation and validation studies. Results We identified a set of 3,818 rosaceaous unigenes comprised of two or more ESTs that correspond to single copy Arabidopsis genes. From this Rosaceae Conserved Orthologous Set (RosCOS, 1039 were selected from which 857 were used for the development of intron-flanking primers and allele amplification. This led to successful amplification and subsequent mapping of 613 RosCOS onto the Prunus TxE reference map resulting in a genome-wide coverage of 0.67 to 1.06 gene-based markers per cM per linkage group. Furthermore, the RosCOS primers showed amplification success rates from 23 to 100% across the family indicating that a substantial part of the RosCOS primers can be directly employed in other less studied rosaceaous crops. Comparisons of the genetic map positions of the RosCOS with the physical locations of the orthologs in the Populus trichocarpa genome identified regions of colinearity between the genomes of Prunus-Rosaceae and Populus-Salicaceae. Conclusion Conserved orthologous genes are extremely useful for the analysis of genome evolution among closely and distantly related species. The results presented in this study demonstrate the considerable potential of the mapped Prunus RosCOS for genome-wide marker employment and comparative whole genome studies within the Rosaceae family. Moreover, these markers will also function as useful anchor points for the genome sequencing efforts currently
Jagus, Rosemary; Bachvaroff, Tsvetan R; Joshi, Bhavesh; Place, Allen R
The greatest diversity of eukaryotic species is within the microbial eukaryotes, the protists, with plants and fungi/metazoa representing just two of the estimated seventy five lineages of eukaryotes. Protists are a diverse group characterized by unusual genome features and a wide range of genome sizes from 8.2 Mb in the apicomplexan parasite Babesia bovis to 112,000-220,050 Mb in the dinoflagellate Prorocentrum micans. Protists possess numerous cellular, molecular and biochemical traits not observed in "text-book" model organisms. These features challenge some of the concepts and assumptions about the regulation of gene expression in eukaryotes. Like multicellular eukaryotes, many protists encode multiple eIF4Es, but few functional studies have been undertaken except in parasitic species. An earlier phylogenetic analysis of protist eIF4Es indicated that they cannot be grouped within the three classes that describe eIF4E family members from multicellular organisms. Many more protist sequences are now available from which three clades can be recognized that are distinct from the plant/fungi/metazoan classes. Understanding of the protist eIF4Es will be facilitated as more sequences become available particularly for the under-represented opisthokonts and amoebozoa. Similarly, a better understanding of eIF4Es within each clade will develop as more functional studies of protist eIF4Es are completed.
Full Text Available of This Database Site Policy | Contact Us Taxon (Viridiplantae) - PGDBj - Ortholog DB | LSDB Archive ... ...List Contact us PGDBj - Ortholog DB Taxon (Viridiplantae) Data detail Data name Taxon (Viridiplantae) DOI 10...switchLanguage; BLAST Search Image Search Home About Archive Update History Data
Full Text Available e Description Download License Update History of This Database Site Policy | Contact Us Download - PGDBj - Ortholog DB | LSDB Archive ... ...List Contact us PGDBj - Ortholog DB Download First of all, please read the license of this database. Data na...switchLanguage; BLAST Search Image Search Home About Archive Update History Data
Full Text Available ut This Database Database Description Download License Update History of This Database Site Policy | Contact Us Protein (Cyanobacteria) - PGDBj - Ortholog DB | LSDB Archive ... ...List Contact us PGDBj - Ortholog DB Protein (Cyanobacteria) Data detail Data name Protein (Cyanobacteria) DO...switchLanguage; BLAST Search Image Search Home About Archive Update History Data
Full Text Available of This Database Site Policy | Contact Us Taxon (Cyanobacteria) - PGDBj - Ortholog DB | LSDB Archive ... ...List Contact us PGDBj - Ortholog DB Taxon (Cyanobacteria) Data detail Data name Taxon (Cyanobacteria) DOI 10...switchLanguage; BLAST Search Image Search Home About Archive Update History Data
Full Text Available ase Description Download License Update History of This Database Site Policy | Contact Us Protein (Viridiplantae) - PGDBj - Ortholog DB | LSDB Archive ... ...List Contact us PGDBj - Ortholog DB Protein (Viridiplantae) Data detail Data name Protein (Viridiplantae) DO...switchLanguage; BLAST Search Image Search Home About Archive Update History Data
Bourke, Michael F.; Marriott, Philip J.; Glud, Ronnie N.
Permeable sediments are common across continental shelves and are critical contributors to marine biogeochemical cycling. Organic matter in permeable sediments is dominated by microalgae, which as eukaryotes have different anaerobic metabolic pathways to prokaryotes such as bacteria and archaea....... Here we present analyses of flow-through reactor experiments showing that dissolved inorganic carbon is produced predominantly as a result of anaerobic eukaryotic metabolic activity. In our experiments, anaerobic production of dissolved inorganic carbon was consistently accompanied by large dissolved H....../hydrogenase pathway of fermentative eukaryotic H2 production, suggesting that pathway as the source of H2 and dissolved inorganic carbon production. Metabolomic analysis showed large increases in lipid production at the onset of anoxia, consistent with documented pathways of anoxic dark fermentation in microalgae...
Gupta, Akshari; Kitagawa, Daiju
Several decades of centriole research have revealed the beautiful symmetry present in these microtubule-based organelles, which are required to form centrosomes, cilia, and flagella in many eukaryotes. Centriole architecture is largely conserved across most organisms, however, individual centriolar features such as the central cartwheel or microtubule walls exhibit considerable variability when examined with finer resolution. Here, we review the ultrastructural characteristics of centrioles in commonly studied organisms, highlighting the subtle and not-so-subtle differences between specific structural components of these centrioles. Additionally, we survey some non-canonical centriole structures that have been discovered in various species, from the coaxial bicentrioles of protists and lower land plants to the giant irregular centrioles of the fungus gnat Sciara. Finally, we speculate on the functional significance of these differences between centrioles, and the contribution of individual structural elements such as the cartwheel or microtubules towards the stability of centrioles.Centriole structure, cartwheel, triplet microtubules, SAS-6, centrosome.
Starokadomskyy P. L.
Full Text Available Inteins, or protein introns, are parts of protein sequences that are post-translationally excised, their flanking regions (exteins being spliced together. This process was called protein splicing. Originally inteins were found in prokaryotic or unicellular eukaryotic organisms. But the general principles of post-translation protein rearrangement are evolving yielding different post-translation modification of proteins in multicellular organisms. For clarity, these non-intein mediated events call either protein rearrangements or protein editing. The most intriguing example of protein editing is proteasome-mediated splicing of antigens in vertebrates that may play important role in antigen presentation. Other examples of protein rearrangements are maturation of Hg-proteins (critical receptors in embryogenesis as well as maturation of several metabolic enzymes. Despite a lack of experimental data we try to analyze some intriguing examples of protein splicing evolution.
López-Mirabal, H Reynaldo; Winther, Jakob R
The eukaryotic cytoplasm has long been regarded as a cellular compartment in which the reduced state of protein cysteines is largely favored. Under normal conditions, the cytosolic low-molecular weight redox buffer, comprising primarily of glutathione, is highly reducing and reactive oxygen species...... (ROS) and glutathionylated proteins are maintained at very low levels. In the present review, recent progress in the understanding of the cytosolic thiol-disulfide redox metabolism and novel analytical approaches to studying cytosolic redox properties are discussed. We will focus on the yeast model...... organism, Saccharomyces cerevisiae, where the combination of genetic and biochemical approaches has brought us furthest in understanding the mechanisms underlying cellular redox regulation. It has been shown in yeast that, in addition to the enzyme glutathione reductase, other mechanisms may exist...
Derelle, R.; Torruella, G.; Klimeš, V.; Brinkmann, H.; Kim, E.; Vlček, Čestmír; Lang, B.F.; Eliáš, M.
Roč. 112, č. 7 (2015), E693-E699 ISSN 0027-8424 R&D Projects: GA ČR GA13-24983S Grant - others:GA MŠk(CZ) ED2.1.00/03.0100; Howard Hughes Medical Institute International Early Career Scientist Program(US) 55007424; Spanish Ministry of Economy and Competitiveness, European Molecular Biology Organization Young Investigator Program(ES) BFU2012-31329; Spanish Ministry of Economy and Competitiveness, "Centro de Excelencia Severo Ochoa" - European Regional Development Fund(ES) Sev-2012-0208, BES-2013-064004 Institutional support: RVO:68378050 Keywords : eukaryote phylogeny * phylogenomics * Opimoda * Diphoda * LECA Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 9.423, year: 2015
Pedersen, Anders Gorm; Baldi, Pierre; Chauvin, Yves
between functional promoters has been estimated to be in the range of 30-40 kilobases. Although it is conceivable that some of these predicted promoters correspond to cryptic initiation sites that are used in vivo, it is likely that most are false positives. This suggests that it is important to carefully......Computational prediction of eukaryotic promoters from the nucleotide sequence is one of the most attractive problems in sequence analysis today, but it is also a very difficult one. Thus, current methods predict in the order of one promoter per kilobase in human DNA, while the average distance...... reconsider the biological data that forms the basis of current algorithms, and we here present a review of data that may be useful in this regard. The review covers the following topics: (1) basal transcription and core promoters, (2) activated transcription and transcription factor binding sites, (3) Cp...
Full Text Available Inactivation of host Rho GTPases is a widespread strategy employed by bacterial pathogens to manipulate mammalian cellular functions and avoid immune defenses. Some bacterial toxins mimic eukaryotic Rho GTPase-activating proteins (GAPs to inactivate mammalian GTPases, probably as a result of evolutionary convergence. An intriguing question remains whether eukaryotic pathogens or parasites may use endogenous GAPs as immune-suppressive toxins to target the same key genes as bacterial pathogens. Interestingly, a RhoGAP domain-containing protein, LbGAP, was recently characterized from the parasitoid wasp Leptopilina boulardi, and shown to protect parasitoid eggs from the immune response of Drosophila host larvae. We demonstrate here that LbGAP has structural characteristics of eukaryotic RhoGAPs but that it acts similarly to bacterial RhoGAP toxins in mammals. First, we show by immunocytochemistry that LbGAP enters Drosophila immune cells, plasmatocytes and lamellocytes, and that morphological changes in lamellocytes are correlated with the quantity of LbGAP they contain. Demonstration that LbGAP displays a GAP activity and specifically interacts with the active, GTP-bound form of the two Drosophila Rho GTPases Rac1 and Rac2, both required for successful encapsulation of Leptopilina eggs, was then achieved using biochemical tests, yeast two-hybrid analysis, and GST pull-down assays. In addition, we show that the overall structure of LbGAP is similar to that of eukaryotic RhoGAP domains, and we identify distinct residues involved in its interaction with Rac GTPases. Altogether, these results show that eukaryotic parasites can use endogenous RhoGAPs as virulence factors and that despite their differences in sequence and structure, eukaryotic and bacterial RhoGAP toxins are similarly used to target the same immune pathways in insects and mammals.
Till now, in biological sciences, the term, transcription, mainly refers to DNA to RNA transcription. But our recently published experimental findings obtained from Plasmodium falciparum strongly suggest the existence of DNA to DNA transcription in the genome of eukaryotic cells, which could shed some light on the functions of certain noncoding DNA in the human and other eukaryotic genomes.
Rogozin, Igor B; Wolf, Yuri I; Sorokin, Alexander V; Mirkin, Boris G; Koonin, Eugene V
Sequencing of eukaryotic genomes allows one to address major evolutionary problems, such as the evolution of gene structure. We compared the intron positions in 684 orthologous gene sets from 8 complete genomes of animals, plants, fungi, and protists and constructed parsimonious scenarios of evolution of the exon-intron structure for the respective genes. Approximately one-third of the introns in the malaria parasite Plasmodium falciparum are shared with at least one crown group eukaryote; this number indicates that these introns have been conserved through >1.5 billion years of evolution that separate Plasmodium from the crown group. Paradoxically, humans share many more introns with the plant Arabidopsis thaliana than with the fly or nematode. The inferred evolutionary scenario holds that the common ancestor of Plasmodium and the crown group and, especially, the common ancestor of animals, plants, and fungi had numerous introns. Most of these ancestral introns, which are retained in the genomes of vertebrates and plants, have been lost in fungi, nematodes, arthropods, and probably Plasmodium. In addition, numerous introns have been inserted into vertebrate and plant genes, whereas, in other lineages, intron gain was much less prominent.
Javaux, E J; Knoll, A H; Walter, M R
Molecular phylogeny and biogeochemistry indicate that eukaryotes differentiated early in Earth history. Sequence comparisons of small-subunit ribosomal RNA genes suggest a deep evolutionary divergence of Eukarya and Archaea; C27-C29 steranes (derived from sterols synthesized by eukaryotes) and strong depletion of 13C (a biogeochemical signature of methanogenic Archaea) in 2,700 Myr old kerogens independently place a minimum age on this split. Steranes, large spheroidal microfossils, and rare macrofossils of possible eukaryotic origin occur in Palaeoproterozoic rocks. Until now, however, evidence for morphological and taxonomic diversification within the domain has generally been restricted to very late Mesoproterozoic and Neoproterozoic successions. Here we show that the cytoskeletal and ecological prerequisites for eukaryotic diversification were already established in eukaryotic microorganisms fossilized nearly 1,500 Myr ago in shales of the early Mesoproterozoic Roper Group in northern Australia.
Mihail R Halachev
Full Text Available Among proteins, orthologs are defined as those that are derived by vertical descent from a single progenitor in the last common ancestor of their host organisms. Our goal is to compute a complete set of protein orthologs derived from all currently available complete bacterial and archaeal genomes. Traditional approaches typically rely on all-against-all BLAST searching which is prohibitively expensive in terms of hardware requirements or computational time (requiring an estimated 18 months or more on a typical server. Here, we present xBASE-Orth, a system for ongoing ortholog annotation, which applies a "divide and conquer" approach and adopts a pragmatic scheme that trades accuracy for speed. Starting at species level, xBASE-Orth carefully constructs and uses pan-genomes as proxies for the full collections of coding sequences at each level as it progressively climbs the taxonomic tree using the previously computed data. This leads to a significant decrease in the number of alignments that need to be performed, which translates into faster computation, making ortholog computation possible on a global scale. Using xBASE-Orth, we analyzed an NCBI collection of 1,288 bacterial and 94 archaeal complete genomes with more than 4 million coding sequences in 5 weeks and predicted more than 700 million ortholog pairs, clustered in 175,531 orthologous groups. We have also identified sets of highly conserved bacterial and archaeal orthologs and in so doing have highlighted anomalies in genome annotation and in the proposed composition of the minimal bacterial genome. In summary, our approach allows for scalable and efficient computation of the bacterial and archaeal ortholog annotations. In addition, due to its hierarchical nature, it is suitable for incorporating novel complete genomes and alternative genome annotations. The computed ortholog data and a continuously evolving set of applications based on it are integrated in the xBASE database, available
Full Text Available The eukaryotic translation factor, eIF5A has been recently reported as a sequence-specific elongation factor that facilitates peptide bond formation at consecutive prolines in Saccharomyces cerevisiae, as its ortholog elongation factor P (EF-P does in bacteria. We have searched the genome databases of 35 representative organisms from six kingdoms of life for PPP (Pro-Pro-Pro and/or PPG (Pro-Pro-Gly-encoding genes whose expression is expected to depend on eIF5A. We have made detailed analyses of proteome data of 5 selected species, Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster, Mus musculus and Homo sapiens. The PPP and PPG motifs are low in the prokaryotic proteomes. However, their frequencies markedly increase with the biological complexity of eukaryotic organisms, and are higher in newly derived proteins than in those orthologous proteins commonly shared in all species. Ontology classifications of S. cerevisiae and human genes encoding the highest level of polyprolines reveal their strong association with several specific biological processes, including actin/cytoskeletal associated functions, RNA splicing/turnover, DNA binding/transcription and cell signaling. Previously reported phenotypic defects in actin polarity and mRNA decay of eIF5A mutant strains are consistent with the proposed role for eIF5A in the translation of the polyproline-containing proteins. Of all the amino acid tandem repeats (≥3 amino acids, only the proline repeat frequency correlates with functional complexity of the five organisms examined. Taken together, these findings suggest the importance of proline repeat-rich proteins and a potential role for eIF5A and its hypusine modification pathway in the course of eukaryotic evolution.
Furse, Samuel; Shearman, Gemma C
Successful passage through the cell cycle presents a number of structural challenges to the cell. Inceptive studies carried out in the last five years have produced clear evidence of modulations in the lipid profile (sometimes referred to as the lipidome) of eukaryotes as a function of the cell cycle. This mounting body of evidence indicates that lipids play key roles in the structural transformations seen across the cycle. The accumulation of this evidence coincides with a revolution in our understanding of how lipid composition regulates a plethora of biological processes ranging from protein activity through to cellular signalling and membrane compartmentalisation. In this review, we discuss evidence from biological, chemical and physical studies of the lipid fraction across the cell cycle that demonstrate that lipids are well-developed cellular components at the heart of the biological machinery responsible for managing progress through the cell cycle. Furthermore, we discuss the mechanisms by which this careful control is exercised. Crown Copyright © 2017. Published by Elsevier B.V. All rights reserved.
West, Patrick T; Probst, Alexander J; Grigoriev, Igor V; Thomas, Brian C; Banfield, Jillian F
Microbial eukaryotes are integral components of natural microbial communities, and their inclusion is critical for many ecosystem studies, yet the majority of published metagenome analyses ignore eukaryotes. In order to include eukaryotes in environmental studies, we propose a method to recover eukaryotic genomes from complex metagenomic samples. A key step for genome recovery is separation of eukaryotic and prokaryotic fragments. We developed a k -mer-based strategy, EukRep, for eukaryotic sequence identification and applied it to environmental samples to show that it enables genome recovery, genome completeness evaluation, and prediction of metabolic potential. We used this approach to test the effect of addition of organic carbon on a geyser-associated microbial community and detected a substantial change of the community metabolism, with selection against almost all candidate phyla bacteria and archaea and for eukaryotes. Near complete genomes were reconstructed for three fungi placed within the Eurotiomycetes and an arthropod. While carbon fixation and sulfur oxidation were important functions in the geyser community prior to carbon addition, the organic carbon-impacted community showed enrichment for secreted proteases, secreted lipases, cellulose targeting CAZymes, and methanol oxidation. We demonstrate the broader utility of EukRep by reconstructing and evaluating relatively high-quality fungal, protist, and rotifer genomes from complex environmental samples. This approach opens the way for cultivation-independent analyses of whole microbial communities. © 2018 West et al.; Published by Cold Spring Harbor Laboratory Press.
Wijffels, René H; Kruse, Olaf; Hellingwerf, Klaas J
Both cyanobacteria and eukaryotic microalgae are promising organisms for sustainable production of bulk products such as food, feed, materials, chemicals and fuels. In this review we will summarize the potential and current biotechnological developments. Cyanobacteria are promising host organisms for the production of small molecules that can be secreted such as ethanol, butanol, fatty acids and other organic acids. Eukaryotic microalgae are interesting for products for which cellular storage is important such as proteins, lipids, starch and alkanes. For the development of new and promising lines of production, strains of both cyanobacteria and eukaryotic microalgae have to be improved. Transformation systems have been much better developed in cyanobacteria. However, several products would be preferably produced with eukaryotic microalgae. In the case of cyanobacteria a synthetic-systems biology approach has a great potential to exploit cyanobacteria as cell factories. For eukaryotic microalgae transformation systems need to be further developed. A promising strategy is transformation of heterologous (prokaryotic and eukaryotic) genes in established eukaryotic hosts such as Chlamydomonas reinhardtii. Experimental outdoor pilots under containment for the production of genetically modified cyanobacteria and microalgae are in progress. For full scale production risks of release of genetically modified organisms need to be assessed. Copyright © 2013. Published by Elsevier Ltd.
Xu, Dapeng; Jiao, Nianzhi; Ren, Rui; Warren, Alan
Little is known about the biodiversity of microbial eukaryotes in the South China Sea, especially in waters at bathyal depths. Here, we employed SSU rDNA gene sequencing to reveal the diversity and community structure across depth and distance gradients in the South China Sea. Vertically, the highest alpha diversity was found at 75-m depth. The communities of microbial eukaryotes were clustered into shallow-, middle-, and deep-water groups according to the depth from which they were collected, indicating a depth-related diversity and distribution pattern. Rhizaria sequences dominated the microeukaryote community and occurred in all samples except those from less than 50-m deep, being most abundant near the sea floor where they contributed ca. 64-97% and 40-74% of the total sequences and OTUs recovered, respectively. A large portion of rhizarian OTUs has neither a nearest named neighbor nor a nearest neighbor in the GenBank database which indicated the presence of new phylotypes in the South China Sea. Given their overwhelming abundance and richness, further phylogenetic analysis of rhizarians were performed and three new genetic clusters were revealed containing sequences retrieved from the deep waters of the South China Sea. Our results shed light on the diversity and community structure of microbial eukaryotes in this not yet fully explored area. © 2016 The Author(s) Journal of Eukaryotic Microbiology © 2016 International Society of Protistologists.
Curtis, Darren S.; Phillips, Aaron R.; Callister, Stephen J.; Conlan, Sean; McCue, Lee Ann
At the rate that prokaryotic genomes can now be generated, comparative genomics studies require a flexible method for quickly and accurately predicting orthologs among the rapidly changing set of genomes available. SPOCS implements a graph-based ortholog prediction method to generate a simple tab-delimited table of orthologs and in addition, html files that provide a visualization of the predicted ortholog/paralog relationships to which gene/protein expression metadata may be overlaid. AVAILABILITY AND IMPLEMENTATION: A SPOCS web application is freely available at http://cbb.pnnl.gov/portal/tools/spocs.html. Source code for Linux systems is also freely available under an open source license at http://cbb.pnnl.gov/portal/software/spocs.html; the Boost C++ libraries and BLAST are required.
We have found that Rad51 and RecA Proteins form strikingly similar structures together with dsDNA and ATP. Their right handed helical nucleoprotein filaments extend the B-form DNA double helixes to 1.5 times in length and wind the helix. The similarity and uniqueness of their structures must reflect functional homologies between these proteins. Therefore, it is highly probable that similar recombination proteins are present in various organisms of different evolutional states. We have succeeded to clone RAD51 genes from human, mouse, chicken and fission yeast genes, and found that the homologues are widely distributed in eukaryotes. The HsRad51 and MmRad51 or ChRad51 proteins consist of 339 amino acids differing only by 4 or 12 amino acids, respectively, and highly homologous to both yeast proteins, but less so to Dmcl. All of these proteins are homologous to the region from residues 33 to 240 of RecA which was named ''homologous core. The homologous core is likely to be responsible for functions common for all of them, such as the formation of helical nucleoprotein filament that is considered to be involved in homologous pairing in the recombination reaction. The mouse gene is transcribed at a high level in thymus, spleen, testis, and ovary, at lower level in brain and at a further lower level in some other tissues. It is transcribed efficiently in recombination active tissues. A clear functional difference of Rad51 homologues from RecA was suggested by the failure of heterologous genes to complement the deficiency of Scrad51 mutants. This failure seems to reflect the absence of a compatible partner, such as ScRad52 protein in the case of ScRad51 protein, between different species. Thus, these discoveries play a role of the starting point to understand the fundamental gene targeting in mammalian cells and in gene therapy. (J.P.N.)
Full Text Available Eukaryotic translation is a complex process composed of three main steps: initiation, elongation, and termination. During infections by RNA- and DNA-viruses, the eukaryotic translation machinery is used to assure optimal viral protein synthesis. Human immunodeficiency virus type I (HIV-1 uses several non-canonical pathways to translate its own proteins, such as leaky scanning, frameshifting, shunt, and cap-independent mechanisms. Moreover, HIV-1 modulates the host translation machinery by targeting key translation factors and overcomes different cellular obstacles that affect protein translation. In this review, we describe how HIV-1 proteins target several components of the eukaryotic translation machinery, which consequently improves viral translation and replication.
Ho, Louis K; Nodwell, Justin R
Environmental microbes produce biologically active small molecules that have been mined extensively as antibiotics and a smaller number of drugs that act on eukaryotic cells. It is known that there are additional bioactives to be discovered from this source. While the discovery of new antibiotics is challenged by the frequent discovery of known compounds, we contend that the eukaryote-active compounds may be less saturated. Indeed, despite there being far fewer eukaryotic-active natural products these molecules interact with a far richer diversity of molecular and cellular targets.
Full Text Available To address whether there are differences of variation among repeat motif types and among taxonomic groups, we present here an analysis of variation and correlation of dinucleotide microsatellite repeats in eukaryotic genomes. Ten taxonomic groups were compared, those being primates, mammalia (excluding primates and rodentia, rodentia, birds, fish, amphibians and reptiles, insects, molluscs, plants and fungi, respectively. The data used in the analysis is from the literature published in the Journal of Molecular Ecology Notes. Analysis of variation reveals that there are no significant differences between AC and AG repeat motif types. Moreover, the number of alleles correlates positively with the copy number in both AG and AC repeats. Similar conclusions can be obtained from each taxonomic group. These results strongly suggest that the increase of SSR variation is almost linear with the increase of the copy number of each repeat motif. As well, the results suggest that the variability of SSR in the genomes of low-ranking species seem to be more than that of high-ranking species, excluding primates and fungi.
Su, Fei; Ou, Hong-Yu; Tao, Fei; Tang, Hongzhi; Xu, Ping
With genomic sequences of many closely related bacterial strains made available by deep sequencing, it is now possible to investigate trends in prokaryotic microevolution. Positive selection is a sub-process of microevolution, in which a particular mutation is favored, causing the allele frequency to continuously shift in one direction. Wide scanning of prokaryotic genomes has shown that positive selection at the molecular level is much more frequent than expected. Genes with significant positive selection may play key roles in bacterial adaption to different environmental pressures. However, selection pressure analyses are computationally intensive and awkward to configure. Here we describe an open access web server, which is designated as PSP (Positive Selection analysis for Prokaryotic genomes) for performing evolutionary analysis on orthologous coding genes, specially designed for rapid comparison of dozens of closely related prokaryotic genomes. Remarkably, PSP facilitates functional exploration at the multiple levels by assignments and enrichments of KO, GO or COG terms. To illustrate this user-friendly tool, we analyzed Escherichia coli and Bacillus cereus genomes and found that several genes, which play key roles in human infection and antibiotic resistance, show significant evidence of positive selection. PSP is freely available to all users without any login requirement at: http://db-mml.sjtu.edu.cn/PSP/. PSP ultimately allows researchers to do genome-scale analysis for evolutionary selection across multiple prokaryotic genomes rapidly and easily, and identify the genes undergoing positive selection, which may play key roles in the interactions of host-pathogen and/or environmental adaptation.
Wijffels, R.H.; Kruse, O.; Hellingwerf, K.J.
Both cyanobacteria and eukaryotic microalgae are promising organisms for sustainable production of bulk products such as food, feed, materials, chemicals and fuels. In this review we will summarize the potential and current biotechnological developments. Cyanobacteria are promising host organisms
Comparisons among a variety of eukaryotes have revealed considerable variability in the structures and processes involved in their meiosis. Nevertheless, conventional forms of meiosis occur in all major groups of eukaryotes, including early-branching protists. This finding confirms that meiosis originated in the common ancestor of all eukaryotes and suggests that primordial meiosis may have had many characteristics in common with conventional extant meiosis. However, it is possible that the synaptonemal complex and the delicate crossover control related to its presence were later acquisitions. Later still, modifications to meiotic processes occurred within different groups of eukaryotes. Better knowledge on the spectrum of derived and uncommon forms of meiosis will improve our understanding of many still mysterious aspects of the meiotic process and help to explain the evolutionary basis of functional adaptations to the meiotic program.
Liu, Dekang; Keijzers, Guido; Rasmussen, Lene Juel
in the clinic, and as a biomarker of cancer susceptibility in animal model systems. Prokaryotic MMR is well-characterized at the molecular and mechanistic level; however, MMR is considerably more complex in eukaryotic cells than in prokaryotic cells, and in recent years, it has become evident that MMR plays...... novel roles in eukaryotic cells, several of which are not yet well-defined or understood. Many MMR-deficient human cancer cells lack mutations in known human MMR genes, which strongly suggests that essential eukaryotic MMR components/cofactors remain unidentified and uncharacterized. Furthermore......, the mechanism by which the eukaryotic MMR machinery discriminates between the parental (template) and the daughter (nascent) DNA strand is incompletely understood and how cells choose between the EXO1-dependent and the EXO1–independent subpathways of MMR is not known. This review summarizes recent literature...
Kamp, Anja; Høgslund, Signe; Risgaard-Petersen, Nils
The microbial nitrogen cycle is one of the most complex and environmentally important element cycles on Earth and has long been thought to be mediated exclusively by prokaryotic microbes. Rather recently, it was discovered that certain eukaryotic microbes are able to store nitrate intracellularly......, suggesting that eukaryotes may rival prokaryotes in terms of dissimilatory nitrate reduction. Finally, this review article sketches some evolutionary perspectives of eukaryotic nitrate metabolism and identifies open questions that need to be addressed in future investigations....... and use it for dissimilatory nitrate reduction in the absence of oxygen. The paradigm shift that this entailed is ecologically significant because the eukaryotes in question comprise global players like diatoms, foraminifers, and fungi. This review article provides an unprecedented overview of nitrate...
de Villena Fernando
Full Text Available Abstract Background The central metabolic pathway of glycolysis converts glucose to pyruvate, with the net production of 2 ATP and 2 NADH per glucose molecule. Each of the ten reactions in this pathway is typically catalyzed by multiple isozymes encoded by a multigene family. Several isozymes in this pathway are expressed only during spermatogenesis, and gene targeting studies indicate that they are essential for sperm function and male fertility in mouse. At least three of the novel glycolytic isozymes are encoded by retrogenes (Pgk2, Aldoart1, and Aldoart2. Their restricted expression profile suggests that retrotransposition may play a significant role in the evolution of sperm glycolytic enzymes. Results We conducted a comprehensive genomic analysis of glycolytic enzymes in the human and mouse genomes and identified several intronless copies for all enzymes in the pathway, except Pfk. Within each gene family, a single orthologous gene was typically retrotransposed frequently and independently in both species. Several retroposed sequences maintained open reading frames (ORFs and/or provided evidence of alternatively spliced exons. We analyzed expression of sequences with ORFs and Gpi1 transcript in mouse spermatogenic cells. Conclusions Our analysis detected frequent, recent, and lineage-specific retrotransposition of orthologous glycolytic enzymes in the human and mouse genomes. Retrotransposition events are associated with LINE/LTR and genomic integration is random. We found evidence for the alternative splicing of parent genes. Many retroposed sequences have maintained ORFs, suggesting a functional role for these genes.
Full Text Available Abstract Background Insects constitute the vast majority of known species with their importance including biodiversity, agricultural, and human health concerns. It is likely that the successful adaptation of the Insecta clade depends on specific components in its proteome that give rise to specialized features. However, proteome determination is an intensive undertaking. Here we present results from a computational method that uses genome analysis to characterize insect and eukaryote proteomes as an approximation complementary to experimental approaches. Results Homologs in common to Drosophila melanogaster, Anopheles gambiae, Bombyx mori, Tribolium castaneum, and Apis mellifera were compared to the complete genomes of three non-insect eukaryotes (opisthokonts Homo sapiens, Caenorhabditis elegans and Saccharomyces cerevisiae. This operation yielded 154 groups of orthologous proteins in Drosophila to be insect-specific homologs; 466 groups were determined to be common to eukaryotes (represented by three opisthokonts. ESTs from the hemimetabolous insect Locust migratoria were also considered in order to approximate their corresponding genes in the insect-specific homologs. Stress and stimulus response proteins were found to constitute a higher fraction in the insect-specific homologs than in the homologs common to eukaryotes. Conclusion The significant representation of stress response and stimulus response proteins in proteins determined to be insect-specific, along with specific cuticle and pheromone/odorant binding proteins, suggest that communication and adaptation to environments may distinguish insect evolution relative to other eukaryotes. The tendency for low Ka/Ks ratios in the insect-specific protein set suggests purifying selection pressure. The generally larger number of paralogs in the insect-specific proteins may indicate adaptation to environment changes. Instances in our insect-specific protein set have been arrived at through
Huerta, Carlos; Borek, Dominika; Machius, Mischa; Grishin, Nick V.; Zhang, Hong
Flavin mononucleotide adenylyltransferase (FMNAT) catalyzes the formation of the essential flavocoenzyme FAD and plays an important role in flavocoenzyme homeostasis regulation. By sequence comparison, bacterial and eukaryotic FMNAT enzymes belong to two different protein superfamilies and apparently utilize different set of active site residues to accomplish the same chemistry. Here we report the first structural characterization of a eukaryotic FMNAT from a pathogenic yeast Candida glabrata...
Full Text Available Abstract Background Calpains are Ca2+-dependent cysteine proteases that participate in a range of crucial cellular processes. Dysfunction of these enzymes may cause, for instance, life-threatening diseases in humans, the loss of sex determination in nematodes and embryo lethality in plants. Although the calpain family is well characterized in animal and plant model organisms, there is a great lack of knowledge about these genes in unicellular eukaryote species (i.e. protists. Here, we study the distribution and evolution of calpain genes in a wide range of eukaryote genomes from major branches in the tree of life. Results Our investigations reveal 24 types of protein domains that are combined with the calpain-specific catalytic domain CysPc. In total we identify 41 different calpain domain architectures, 28 of these domain combinations have not been previously described. Based on our phylogenetic inferences, we propose that at least four calpain variants were established in the early evolution of eukaryotes, most likely before the radiation of all the major supergroups of eukaryotes. Many domains associated with eukaryotic calpain genes can be found among eubacteria or archaebacteria but never in combination with the CysPc domain. Conclusions The analyses presented here show that ancient modules present in prokaryotes, and a few de novo eukaryote domains, have been assembled into many novel domain combinations along the evolutionary history of eukaryotes. Some of the new calpain genes show a narrow distribution in a few branches in the tree of life, likely representing lineage-specific innovations. Hence, the functionally important classical calpain genes found among humans and vertebrates make up only a tiny fraction of the calpain family. In fact, a massive expansion of the calpain family occurred by domain shuffling among unicellular eukaryotes and contributed to a wealth of functionally different genes.
Zhao, Sen; Liang, Zhe; Demko, Viktor; Wilson, Robert; Johansen, Wenche; Olsen, Odd-Arne; Shalchian-Tabrizi, Kamran
Calpains are Ca2+-dependent cysteine proteases that participate in a range of crucial cellular processes. Dysfunction of these enzymes may cause, for instance, life-threatening diseases in humans, the loss of sex determination in nematodes and embryo lethality in plants. Although the calpain family is well characterized in animal and plant model organisms, there is a great lack of knowledge about these genes in unicellular eukaryote species (i.e. protists). Here, we study the distribution and evolution of calpain genes in a wide range of eukaryote genomes from major branches in the tree of life. Our investigations reveal 24 types of protein domains that are combined with the calpain-specific catalytic domain CysPc. In total we identify 41 different calpain domain architectures, 28 of these domain combinations have not been previously described. Based on our phylogenetic inferences, we propose that at least four calpain variants were established in the early evolution of eukaryotes, most likely before the radiation of all the major supergroups of eukaryotes. Many domains associated with eukaryotic calpain genes can be found among eubacteria or archaebacteria but never in combination with the CysPc domain. The analyses presented here show that ancient modules present in prokaryotes, and a few de novo eukaryote domains, have been assembled into many novel domain combinations along the evolutionary history of eukaryotes. Some of the new calpain genes show a narrow distribution in a few branches in the tree of life, likely representing lineage-specific innovations. Hence, the functionally important classical calpain genes found among humans and vertebrates make up only a tiny fraction of the calpain family. In fact, a massive expansion of the calpain family occurred by domain shuffling among unicellular eukaryotes and contributed to a wealth of functionally different genes.
Wang, H.; Perrault, A.R.; Takeda, Y.; Iliakis, G.
Cells of higher eukaryotes process within minutes double strand breaks (DSBs) in their genome using a NHEJ apparatus that engages DNA-PKcs, Ku, DNA ligase IV, XRCC4, and other as of yet unidentified factors. Although chemical inhibition, or mutation, in any of these factors delays processing, cells ultimately remove the majority of DNA DSBs using an alternative pathway operating with slower kinetics. This alternative pathway is active in mutants deficient in genes of the RAD52 epistasis group. We proposed, therefore, that it reflects an alternative form of NHEJ that operates as a backup (B-NHEJ) to the DNA-PK- dependent (D-NHEJ) pathway, rather than homology directed repair of DSBs. We studied the role of Ku and DNA-PKcs in the coordination of these pathways using as a model end joining of restriction endonuclease linearized plasmid DNA in whole cell extracts. Efficient error-free endjoining observed in such in-vitro reactions is strongly inhibited by anti-Ku antibodies. The inhibition requires DNA-PKcs, despite that fact that Ku efficiently binds DNA ends in the presence of antibodies, or in the absence of DNA-PKcs. Strong inhibition of DNA endjoining is also mediated by wortmannin, an inhibitor of DNA-PKcs, in the presence but not in the absence of Ku, and this inhibition can be rescued by pre-incubating the reaction with double stranded oligonucleotides. The results are compatible with a role of Ku in directing endjoining to a DNA-PK dependent pathway, mediated by efficient end binding and productive interactions with DNA-PKcs. On the other hand, efficient end joining is observed in extracts of cells lacking DNA-PKcs, as well as in Ku-depleted extracts sugggesting the operation of alternative pathways. Extracts depleted of Ku and DNA-PKcs rejoin blunt ends, as well as homologous ends with 3' or 5' protruding single strands with similar efficiency, but addition of Ku suppresses joining of blunt ends and homologous ends with 3' overhangs. We propose that the
Full Text Available Diversity is one of the most remarkable features of living organisms. Current assessments of eukaryote biodiversity reaches 1.5 million species, but the true figure could be several times that number. Diversity is ingrained in all stages and echelons of life, namely, the occupancy of ecological niches, behavioral patterns, body plans and organismal complexity, as well as metabolic needs and genetics. In this review, we will discuss that diversity also exists in a key biochemical process, translation, across eukaryotes. Translation is a fundamental process for all forms of life, and the basic components and mechanisms of translation in eukaryotes have been largely established upon the study of traditional, so-called model organisms. By using modern genome-wide, high-throughput technologies, recent studies of many nonmodel eukaryotes have unveiled a surprising diversity in the configuration of the translation apparatus across eukaryotes, showing that this apparatus is far from being evolutionarily static. For some of the components of this machinery, functional differences between different species have also been found. The recent research reviewed in this article highlights the molecular and functional diversification the translational machinery has undergone during eukaryotic evolution. A better understanding of all aspects of organismal diversity is key to a more profound knowledge of life.
Neil W. Blackstone
Full Text Available Two major obstacles hinder the application of evolutionary theory to the origin of eukaryotes. The first is more apparent than real—the endosymbiosis that led to the mitochondrion is often described as “non-Darwinian” because it deviates from the incremental evolution championed by the modern synthesis. Nevertheless, endosymbiosis can be accommodated by a multi-level generalization of evolutionary theory, which Darwin himself pioneered. The second obstacle is more serious—all of the major features of eukaryotes were likely present in the last eukaryotic common ancestor thus rendering comparative methods ineffective. In addition to a multi-level theory, the development of rigorous, sequence-based phylogenetic and comparative methods represents the greatest achievement of modern evolutionary theory. Nevertheless, the rapid evolution of major features in the eukaryotic stem group requires the consideration of an alternative framework. Such a framework, based on the contingent nature of these evolutionary events, is developed and illustrated with three examples: the putative intron proliferation leading to the nucleus and the cell cycle; conflict and cooperation in the origin of eukaryotic bioenergetics; and the inter-relationship between aerobic metabolism, sterol synthesis, membranes, and sex. The modern synthesis thus provides sufficient scope to develop an evolutionary framework to understand the origin of eukaryotes.
Costantini, Maria; Alvarez-Valin, Fernando; Costantini, Susan; Cammarano, Rosalia; Bernardi, Giorgio
The genomes of multicellular eukaryotes are compartmentalized in mosaics of isochores, large and fairly homogeneous stretches of DNA that belong to a small number of families characterized by different average GC levels, by different gene concentration (that increase with GC), different chromatin structures, different replication timing in the cell cycle, and other different properties. A question raised by these basic results concerns how far back in evolution the compartmentalized organization of the eukaryotic genomes arose. In the present work we approached this problem by studying the compositional organization of the genomes from the unicellular eukaryotes for which full sequences are available, the sample used being representative. The average GC levels of the genomes from unicellular eukaryotes cover an extremely wide range (19%-60% GC) and the compositional patterns of individual genomes are extremely different but all genomes tested show a compositional compartmentalization. The average GC range of the genomes of unicellular eukaryotes is very broad (as broad as that of prokaryotes) and individual compositional patterns cover a very broad range from very narrow to very complex. Both features are not surprising for organisms that are very far from each other both in terms of phylogenetic distances and of environmental life conditions. Most importantly, all genomes tested, a representative sample of all supergroups of unicellular eukaryotes, are compositionally compartmentalized, a major difference with prokaryotes.
Full Text Available WRKY transcription factors have been extensively characterized in the past 20 years, but in wheat, studies onWRKY genes and their function are lagging behind many other species. To explore the function of wheat WRKY genes, we identified a TaWRKY68 gene from a common wheat cultivar. It encodes a protein comprising 313 amino acids which harbors 19 conserved motifs or active sites. Gene expression patterns were determined by analyzing microarray data of TaWRKY68 in wheat and of orthologous genes from maize, rice and barley using Genevestigator. TaWRKY68 orthologs were identified and clustered using DELTA-BLAST and COBALT programs available at NCBI. The results showed that these genes, which are expressed in all tissues tested, had relatively higher levels in the roots and were up-regulated in response to biotic stresses. Bioinformatics results were confirmed by RT-PCR experiments using wheat plants infected by Agrobacterium tumefaciens and Blumeria graminis, or treated with Deoxynivalenol, a Fusarium graminearum-induced mycotoxin in wheat or barley. In summary,TaWRKY68 functions differ during plant developmental stages and might be representing a hub gene function in wheat responses to various biotic stresses. It was also found that including data from major cereal genes in the bioinformatics analysis gave more accurate and comprehensive predictions of wheat gene functions.
Novichkov, Pavel S.; Ratnere, Igor; Wolf, Yuri I.; Koonin, Eugene V.; Dubchak, Inna
The database of Alignable Tight Genomic Clusters (ATGCs) consists of closely related genomes of archaea and bacteria, and is a resource for research into prokaryotic microevolution. Construction of a data set with appropriate characteristics is a major hurdle for this type of studies. With the current rate of genome sequencing, it is difficult to follow the progress of the field and to determine which of the available genome sets meet the requirements of a given research project, in particular, with respect to the minimum and maximum levels of similarity between the included genomes. Additionally, extraction of specific content, such as genomic alignments or families of orthologs, from a selected set of genomes is a complicated and time-consuming process. The database addresses these problems by providing an intuitive and efficient web interface to browse precomputed ATGCs, select appropriate ones and access ATGC-derived data such as multiple alignments of orthologous proteins, matrices of pairwise intergenomic distances based on genome-wide analysis of synonymous and nonsynonymous substitution rates and others. The ATGC database will be regularly updated following new releases of the NCBI RefSeq. The database is hosted by the Genomics Division at Lawrence Berkeley National laboratory and is publicly available at http://atgc.lbl.gov.
Lohr M.; Schwender J.; Polle, J. E. W.
Isoprenoids are one of the largest groups of natural compounds and have a variety of important functions in the primary metabolism of land plants and algae. In recent years, our understanding of the numerous facets of isoprenoid metabolism in land plants has been rapidly increasing, while knowledge on the metabolic network of isoprenoids in algae still lags behind. Here, current views on the biochemistry and genetics of the core isoprenoid metabolism in land plants and in the major algal phyla are compared and some of the most pressing open questions are highlighted. Based on the different evolutionary histories of the various groups of eukaryotic phototrophs, we discuss the distribution and regulation of the mevalonate (MVA) and the methylerythritol phosphate (MEP) pathways in land plants and algae and the potential consequences of the loss of the MVA pathway in groups such as the green algae. For the prenyltransferases, serving as gatekeepers to the various branches of terpenoid biosynthesis in land plants and algae, we explore the minimal inventory necessary for the formation of primary isoprenoids and present a preliminary analysis of their occurrence and phylogeny in algae with primary and secondary plastids. The review concludes with some perspectives on genetic engineering of the isoprenoid metabolism in algae.
Full Text Available Progress in uncovering the protein interaction networks of several species has led to questions of what underlying principles might govern their organization. Few studies have tried to determine the impact of protein interaction network evolution on the observed physiological differences between species. Using comparative genomics and structural information, we show here that eukaryotic species have rewired their interactomes at a fast rate of approximately 10(-5 interactions changed per protein pair, per million years of divergence. For Homo sapiens this corresponds to 10(3 interactions changed per million years. Additionally we find that the specificity of binding strongly determines the interaction turnover and that different biological processes show significantly different link dynamics. In particular, human proteins involved in immune response, transport, and establishment of localization show signs of positive selection for change of interactions. Our analysis suggests that a small degree of molecular divergence can give rise to important changes at the network level. We propose that the power law distribution observed in protein interaction networks could be partly explained by the cell's requirement for different degrees of protein binding specificity.
Liu, Bernard A
Today there exists a rapidly expanding number of sequenced genomes. Cataloging protein interaction domains such as the Src Homology 2 (SH2) domain across these various genomes can be accomplished with ease due to existing algorithms and predictions models. An evolutionary analysis of SH2 domains provides a step towards understanding how SH2 proteins integrated with existing signaling networks to position phosphotyrosine signaling as a crucial driver of robust cellular communication networks in metazoans. However organizing and tracing SH2 domain across organisms and understanding their evolutionary trajectory remains a challenge. This chapter describes several methodologies towards analyzing the evolutionary trajectory of SH2 domains including a global SH2 domain classification system, which facilitates annotation of new SH2 sequences essential for tracing the lineage of SH2 domains throughout eukaryote evolution. This classification utilizes a combination of sequence homology, protein domain architecture and the boundary positions between introns and exons within the SH2 domain or genes encoding these domains. Discrete SH2 families can then be traced across various genomes to provide insight into its origins. Furthermore, additional methods for examining potential mechanisms for divergence of SH2 domains from structural changes to alterations in the protein domain content and genome duplication will be discussed. Therefore a better understanding of SH2 domain evolution may enhance our insight into the emergence of phosphotyrosine signaling and the expansion of protein interaction domains.
Audrey M V Ah-Fong
Full Text Available Cdc14 protein phosphatases are well known for regulating the eukaryotic cell cycle, particularly during mitosis. Here we reveal a distinctly new role for Cdc14 based on studies of the microbial eukaryote Phytophthora infestans, the Irish potato famine agent. While Cdc14 is transcribed constitutively in yeast and animal cells, the P. infestans ortholog is expressed exclusively in spore stages of the life cycle and not in vegetative hyphae where the bulk of mitosis takes place. PiCdc14 expression is first detected in nuclei at sporulation, and during zoospore formation the protein accumulates at the basal body, which is the site from which flagella develop. The association of PiCdc14 with basal bodies was supported by co-localization studies with the DIP13 basal body protein and flagellar β-tubulin, and by demonstrating the enrichment of PiCdc14 in purified flagella-basal body complexes. Overexpressing PiCdc14 did not cause defects in growth or mitosis in hyphae, but interfered with cytoplasmic partitioning during zoosporogenesis. This cytokinetic defect might relate to its ability to bind microtubules, which was shown using an in vitro cosedimentation assay. The use of gene silencing to reveal the precise function of PiCdc14 in flagella is not possible since we showed previously that silencing prevents the formation of the precursor stage, sporangia. Nevertheless, the association of Cdc14 with flagella and basal bodies is consistent with their phylogenetic distribution in eukaryotes, as species that lack the ability to produce flagella generally also lack Cdc14. An ancestral role of Cdc14 in the flagellar stage of eukaryotes is thereby proposed.
Tang, Mingyong; Tao, Yan-Bin; Fu, Qiantang; Song, Yaling; Niu, Longjian; Xu, Zeng-Fu
Jatropha curcas seeds are an excellent biofuel feedstock, but seed yields of Jatropha are limited by its poor flowering and fruiting ability. Thus, identifying genes controlling flowering is critical for genetic improvement of seed yield. We isolated the JcLFY, a Jatropha ortholog of Arabidopsis thaliana LEAFY (LFY), and identified JcLFY function by overexpressing it in Arabidopsis and Jatropha. JcLFY is expressed in Jatropha inflorescence buds, flower buds, and carpels, with highest expression in the early developmental stage of flower buds. JcLFY overexpression induced early flowering, solitary flowers, and terminal flowers in Arabidopsis, and also rescued the delayed flowering phenotype of lfy-15, a LFY loss-of-function Arabidopsis mutant. Microarray and qPCR analysis revealed several flower identity and flower organ development genes were upregulated in JcLFY-overexpressing Arabidopsis. JcLFY overexpression in Jatropha also induced early flowering. Significant changes in inflorescence structure, floral organs, and fruit shape occurred in JcLFY co-suppressed plants in which expression of several flower identity and floral organ development genes were changed. This suggests JcLFY is involved in regulating flower identity, floral organ patterns, and fruit shape, although JcLFY function in Jatropha floral meristem determination is not as strong as that of Arabidopsis.
Alagarasan, Ganesh; Dubey, Mahima; Aswathy, Kumar S; Chandel, Girish
Genes in the ZIP family encode transcripts to store and transport bivalent metal micronutrient, particularly iron (Fe) and or zinc (Zn). These transcripts are important for a variety of functions involved in the developmental and physiological processes in many plant species, including most, if not all, Poaceae plant species and the model species Arabidopsis. Here, we present the report of a genome wide investigation of orthologous ZIP genes in Setaria italica and the identification of 7 single copy genes. RT-PCR shows 4 of them could be used to increase the bio-availability of zinc and iron content in grains. Of 36 ZIP members, 25 genes have traces of signal peptide based sub-cellular localization, as compared to those of plant species studied previously, yet translocation of ions remains unclear. In silico analysis of gene structure and protein nature suggests that these two were preeminent in shaping the functional diversity of the ZIP gene family in S. italica . NAC, bZIP and bHLH are the predominant Fe and Zn responsive transcription factors present in SiZIP genes. Together, our results provide new insights into the signal peptide based/independent iron and zinc translocation in the plant system and allowed identification of ZIP genes that may be involved in the zinc and iron absorption from the soil, and thus transporting it to the cereal grain underlying high micronutrient accumulation.
Full Text Available Genes in the ZIP family encode transcripts to store and transport bivalent metal micronutrient, particularly iron (Fe and or zinc (Zn. These transcripts are important for a variety of functions involved in the developmental and physiological processes in many plant species, including most, if not all, Poaceae plant species and the model species Arabidopsis. Here, we present the report of a genome wide investigation of orthologous ZIP genes in Setaria italica and the identification of 7 single copy genes. RT-PCR shows 4 of them could be used to increase the bio-availability of zinc and iron content in grains. Of 36 ZIP members, 25 genes have traces of signal peptide based sub-cellular localization, as compared to those of plant species studied previously, yet translocation of ions remains unclear. In silico analysis of gene structure and protein nature suggests that these two were preeminent in shaping the functional diversity of the ZIP gene family in S. italica. NAC, bZIP and bHLH are the predominant Fe and Zn responsive transcription factors present in SiZIP genes. Together, our results provide new insights into the signal peptide based/independent iron and zinc translocation in the plant system and allowed identification of ZIP genes that may be involved in the zinc and iron absorption from the soil, and thus transporting it to the cereal grain underlying high micronutrient accumulation.
Zhang, Li-Na; Zhang, Xian-Zhi; Zhang, Yu-Xiao; Zeng, Chun-Xia; Ma, Peng-Fei; Zhao, Lei; Guo, Zhen-Hua; Li, De-Zhu
The temperate woody bamboos (Arundinarieae) are highly diverse in morphology but lack a substantial amount of genetic variation. The taxonomy of this lineage is intractable, and the relationships within the tribe have not been well resolved. Recent studies indicated that this tribe could have a complex evolutionary history. Although phylogenetic studies of the tribe have been carried out, most of these phylogenetic reconstructions were based on plastid data, which provide lower phylogenetic resolution compared with nuclear data. In this study, we intended to identify a set of desirable nuclear genes for resolving the phylogeny of the temperate woody bamboos. Using two different methodologies, we identified 209 and 916 genes, respectively, as putative single copy orthologous genes. A total of 112 genes was successfully amplified and sequenced by next-generation sequencing technologies in five species sampled from the tribe. As most of the genes exhibited intra-individual allele heterozygotes, we investigated phylogenetic utility by reconstructing the phylogeny based on individual genes. Discordance among gene trees was observed and, to resolve the conflict, we performed a range of analyses using BUCKy and HybTree. While caution should be taken when inferring a phylogeny from multiple conflicting genes, our analysis indicated that 74 of the 112 investigated genes are potential markers for resolving the phylogeny of the temperate woody bamboos. © 2014 John Wiley & Sons Ltd.
Full Text Available Abstract Background Ortholog assignment is a critical and fundamental problem in comparative genomics, since orthologs are considered to be functional counterparts in different species and can be used to infer molecular functions of one species from those of other species. MSOAR is a recently developed high-throughput system for assigning one-to-one orthologs between closely related species on a genome scale. It attempts to reconstruct the evolutionary history of input genomes in terms of genome rearrangement and gene duplication events. It assumes that a gene duplication event inserts a duplicated gene into the genome of interest at a random location (i.e., the random duplication model. However, in practice, biologists believe that genes are often duplicated by tandem duplications, where a duplicated gene is located next to the original copy (i.e., the tandem duplication model. Results In this paper, we develop MSOAR 2.0, an improved system for one-to-one ortholog assignment. For a pair of input genomes, the system first focuses on the tandemly duplicated genes of each genome and tries to identify among them those that were duplicated after the speciation (i.e., the so-called inparalogs, using a simple phylogenetic tree reconciliation method. For each such set of tandemly duplicated inparalogs, all but one gene will be deleted from the concerned genome (because they cannot possibly appear in any one-to-one ortholog pairs, and MSOAR is invoked. Using both simulated and real data experiments, we show that MSOAR 2.0 is able to achieve a better sensitivity and specificity than MSOAR. In comparison with the well-known genome-scale ortholog assignment tool InParanoid, Ensembl ortholog database, and the orthology information extracted from the well-known whole-genome multiple alignment program MultiZ, MSOAR 2.0 shows the highest sensitivity. Although the specificity of MSOAR 2.0 is slightly worse than that of InParanoid in the real data experiments
Nicholas J Hudson
Full Text Available Codon bias in the genome of an organism influences its phenome by changing the speed and efficiency of mRNA translation and hence protein abundance. We hypothesized that differences in codon bias, either between-species differences in orthologous genes, or within-species differences between genes, may play an evolutionary role. To explore this hypothesis, we compared the genome-wide codon bias in six species that occupy vital positions in the Eukaryotic Tree of Life. We acquired the entire protein coding sequences for these organisms, computed the codon bias for all genes in each organism and explored the output for relationships between codon bias and protein function, both within- and between-lineages. We discovered five notable coordinated patterns, with extreme codon bias most pronounced in traits considered highly characteristic of a given lineage. Firstly, the Homo sapiens genome had stronger codon bias for DNA-binding transcription factors than the Saccharomyces cerevisiae genome, whereas the opposite was true for ribosomal proteins--perhaps underscoring transcriptional regulation in the origin of complexity. Secondly, both mammalian species examined possessed extreme codon bias in genes relating to hair--a tissue unique to mammals. Thirdly, Arabidopsis thaliana showed extreme codon bias in genes implicated in cell wall formation and chloroplast function--which are unique to plants. Fourthly, Gallus gallus possessed strong codon bias in a subset of genes encoding mitochondrial proteins--perhaps reflecting the enhanced bioenergetic efficiency in birds that co-evolved with flight. And lastly, the G. gallus genome had extreme codon bias for the Ciliary Neurotrophic Factor--which may help to explain their spontaneous recovery from deafness. We propose that extreme codon bias in groups of genes that encode functionally related proteins has a pathway-level energetic explanation.
Full Text Available The glycerophospholipid cardiolipin is a unique constituent of bacterial and mitochondrial membranes. It is involved in forming and stabilizing high molecular mass membrane protein complexes and in maintaining membrane architecture. Absence of cardiolipin leads to reduced efficiency of the electron transport chain, decreased membrane potential, and, ultimately, impaired respiratory metabolism. For the protozoan parasite Trypanosoma brucei cardiolipin synthesis is essential for survival, indicating that the enzymes involved in cardiolipin production represent potential drug targets. T. brucei cardiolipin synthase (TbCLS is unique as it belongs to the family of phospholipases D (PLD, harboring a prokaryotic-type cardiolipin synthase (CLS active site domain. In contrast, most other eukaryotic CLS, including the yeast ortholog ScCrd1, are members of the CDP-alcohol phosphatidyl transferase family. To study if these mechanistically distinct CLS enzymes are able to catalyze cardiolipin production in a cell that normally expresses a different type of CLS, we expressed TbCLS and ScCrd1 in CLS-deficient yeast and trypanosome strains, respectively. Our results show that TbCLS complemented cardiolipin production in CRD1 knockout yeast and partly restored wild-type colony forming capability under stress conditions. Remarkably, CL remodeling appeared to be impaired in the transgenic construct, suggesting that CL production and remodeling are tightly coupled processes that may require a clustering of the involved proteins into specific CL-synthesizing domains. In contrast, no complementation was observed by heterologous expression of ScCrd1 in conditional TbCLS knockout trypanosomes, despite proper mitochondrial targeting of the protein.
Hartmann, Fanny E; Croll, Daniel
Differences in gene content are a significant source of variability within species and have an impact on phenotypic traits. However, little is known about the mechanisms responsible for the most recent gene gains and losses. We screened the genomes of 123 worldwide isolates of the major pathogen of wheat Zymoseptoria tritici for robust evidence of gene copy number variation. Based on orthology relationships in three closely related fungi, we identified 599 gene gains and 1,024 gene losses that have not yet reached fixation within the focal species. Our analyses of gene gains and losses segregating in populations showed that gene copy number variation arose preferentially in subtelomeres and in proximity to transposable elements. Recently lost genes were enriched in virulence factors and secondary metabolite gene clusters. In contrast, recently gained genes encoded mostly secreted protein lacking a conserved domain. We analyzed the frequency spectrum at loci segregating a gene presence-absence polymorphism in four worldwide populations. Recent gene losses showed a significant excess in low-frequency variants compared with genome-wide single nucleotide polymorphism, which is indicative of strong negative selection against gene losses. Recent gene gains were either under weak negative selection or neutral. We found evidence for strong divergent selection among populations at individual loci segregating a gene presence-absence polymorphism. Hence, gene gains and losses likely contributed to local adaptation. Our study shows that microbial eukaryotes harbor extensive copy number variation within populations and that functional differences among recently gained and lost genes led to distinct evolutionary trajectories. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Full Text Available Orthology detection requires more effective scaling algorithms. In this paper, a set of gene pair features based on similarity measures (alignment scores, sequence length, gene membership to conserved regions, and physicochemical profiles are combined in a supervised pairwise ortholog detection approach to improve effectiveness considering low ortholog ratios in relation to the possible pairwise comparison between two genomes. In this scenario, big data supervised classifiers managing imbalance between ortholog and nonortholog pair classes allow for an effective scaling solution built from two genomes and extended to other genome pairs. The supervised approach was compared with RBH, RSD, and OMA algorithms by using the following yeast genome pairs: Saccharomyces cerevisiae-Kluyveromyces lactis, Saccharomyces cerevisiae-Candida glabrata, and Saccharomyces cerevisiae-Schizosaccharomyces pombe as benchmark datasets. Because of the large amount of imbalanced data, the building and testing of the supervised model were only possible by using big data supervised classifiers managing imbalance. Evaluation metrics taking low ortholog ratios into account were applied. From the effectiveness perspective, MapReduce Random Oversampling combined with Spark SVM outperformed RBH, RSD, and OMA, probably because of the consideration of gene pair features beyond alignment similarities combined with the advances in big data supervised classification.
Walker, Giselle; Dorrell, Richard G; Schlacht, Alexander; Dacks, Joel B
Single-celled parasites like Entamoeba, Trypanosoma, Phytophthora and Plasmodium wreak untold havoc on human habitat and health. Understanding the position of the various protistan pathogens in the larger context of eukaryotic diversity informs our study of how these parasites operate on a cellular level, as well as how they have evolved. Here, we review the literature that has brought our understanding of eukaryotic relationships from an idea of parasites as primitive cells to a crystallized view of diversity that encompasses 6 major divisions, or supergroups, of eukaryotes. We provide an updated taxonomic scheme (for 2011), based on extensive genomic, ultrastructural and phylogenetic evidence, with three differing levels of taxonomic detail for ease of referencing and accessibility (see supplementary material at Cambridge Journals On-line). Two of the most pressing issues in cellular evolution, the root of the eukaryotic tree and the evolution of photosynthesis in complex algae, are also discussed along with ideas about what the new generation of genome sequencing technologies may contribute to the field of eukaryotic systematics. We hope that, armed with this user's guide, cell biologists and parasitologists will be encouraged about taking an increasingly evolutionary point of view in the battle against parasites representing real dangers to our livelihoods and lives.
Takeda, Kosuke; Shao, Danying; Adler, Micha; Charest, Pascale G; Loomis, William F; Levine, Herbert; Groisman, Alex; Rappel, Wouter-Jan; Firtel, Richard A
Adaptation in signaling systems, during which the output returns to a fixed baseline after a change in the input, often involves negative feedback loops and plays a crucial role in eukaryotic chemotaxis. We determined the dynamical response to a uniform change in chemoattractant concentration of a eukaryotic chemotaxis pathway immediately downstream from G protein-coupled receptors. The response of an activated Ras showed near-perfect adaptation, leading us to attempt to fit the results using mathematical models for the two possible simple network topologies that can provide perfect adaptation. Only the incoherent feedforward network accurately described the experimental results. This analysis revealed that adaptation in this Ras pathway is achieved through the proportional activation of upstream components and not through negative feedback loops. Furthermore, these results are consistent with a local excitation, global inhibition mechanism for gradient sensing, possibly with a Ras guanosine triphosphatase-activating protein acting as a global inhibitor.
Koonin, Eugene V; Wolf, Yuri I; Nagasaki, Keizo; Dolja, Valerian V
The recent discovery of RNA viruses in diverse unicellular eukaryotes and developments in evolutionary genomics have provided the means for addressing the origin of eukaryotic RNA viruses. The phylogenetic analyses of RNA polymerases and helicases presented in this Analysis article reveal close evolutionary relationships between RNA viruses infecting hosts from the Chromalveolate and Excavate supergroups and distinct families of picorna-like viruses of plants and animals. Thus, diversification of picorna-like viruses probably occurred in a 'Big Bang' concomitant with key events of eukaryogenesis. The origins of the conserved genes of picorna-like viruses are traced to likely ancestors including bacterial group II retroelements, the family of HtrA proteases and DNA bacteriophages.
Parreiras-e-Silva, Lucas T.; Gomes, Marcelo D.; Oliveira, Eduardo B.; Costa-Neto, Claudio M.
The eukaryotic translation initiation factor 5A (eIF5A) is a ubiquitous protein of eukaryotic and archaeal organisms which undergoes hypusination, a unique post-translational modification. We have generated a polyclonal antibody against murine eIF5A, which in immunocytochemical assays in B16-F10 cells revealed that the endogenous protein is preferentially localized to the nuclear region. We therefore analyzed possible structural features present in eIF5A proteins that could be responsible for that characteristic. Multiple sequence alignment analysis of eIF5A proteins from different eukaryotic and archaeal organisms showed that the former sequences have an extended N-terminal segment. We have then performed in silico prediction analyses and constructed different truncated forms of murine eIF5A to verify any possible role that the N-terminal extension might have in determining the subcellular localization of the eIF5A in eukaryotic organisms. Our results indicate that the N-terminal extension of the eukaryotic eIF5A contributes in signaling this protein to nuclear localization, despite of bearing no structural similarity with classical nuclear localization signals
Pernice, Massimo C; Giner, Caterina R; Logares, Ramiro; Perera-Bel, Júlia; Acinas, Silvia G; Duarte, Carlos M; Gasol, Josep M; Massana, Ramon
In this work, we study the diversity of bathypelagic microbial eukaryotes (0.8-20 μm) in the global ocean. Seawater samples from 3000 to 4000 m depth from 27 stations in the Atlantic, Pacific and Indian Oceans were analyzed by pyrosequencing the V4 region of the 18S ribosomal DNA. The relative abundance of the most abundant operational taxonomic units agreed with the results of a parallel metagenomic analysis, suggesting limited PCR biases in the tag approach. Although rarefaction curves for single stations were seldom saturated, the global analysis of all sequences together suggested an adequate recovery of bathypelagic diversity. Community composition presented a large variability among samples, which was poorly explained by linear geographic distance. In fact, the similarity between communities was better explained by water mass composition (26% of the variability) and the ratio in cell abundance between prokaryotes and microbial eukaryotes (21%). Deep diversity appeared dominated by four taxonomic groups (Collodaria, Chrysophytes, Basidiomycota and MALV-II) appearing in different proportions in each sample. Novel diversity amounted to 1% of the pyrotags and was lower than expected. Our study represents an essential step in the investigation of bathypelagic microbial eukaryotes, indicating dominating taxonomic groups and suggesting idiosyncratic assemblages in distinct oceanic regions.
Pernice, Massimo C.
In this work, we study the diversity of bathypelagic microbial eukaryotes (0.8–20 μm) in the global ocean. Seawater samples from 3000 to 4000 m depth from 27 stations in the Atlantic, Pacific and Indian Oceans were analyzed by pyrosequencing the V4 region of the 18S ribosomal DNA. The relative abundance of the most abundant operational taxonomic units agreed with the results of a parallel metagenomic analysis, suggesting limited PCR biases in the tag approach. Although rarefaction curves for single stations were seldom saturated, the global analysis of all sequences together suggested an adequate recovery of bathypelagic diversity. Community composition presented a large variability among samples, which was poorly explained by linear geographic distance. In fact, the similarity between communities was better explained by water mass composition (26% of the variability) and the ratio in cell abundance between prokaryotes and microbial eukaryotes (21%). Deep diversity appeared dominated by four taxonomic groups (Collodaria, Chrysophytes, Basidiomycota and MALV-II) appearing in different proportions in each sample. Novel diversity amounted to 1% of the pyrotags and was lower than expected. Our study represents an essential step in the investigation of bathypelagic microbial eukaryotes, indicating dominating taxonomic groups and suggesting idiosyncratic assemblages in distinct oceanic regions.The ISME Journal advance online publication, 9 October 2015; doi:10.1038/ismej.2015.170
Hu, Zhiqiang; Sun, Chen; Lu, Kuang-Chen; Chu, Xixia; Zhao, Yue; Lu, Jinyuan; Shi, Jianxin; Wei, Chaochun
Pan-genome analyses are routinely carried out for bacteria to interpret the within-species gene presence/absence variations (PAVs). However, pan-genome analyses are rare for eukaryotes due to the large sizes and higher complexities of their genomes. Here we proposed EUPAN, a eukaryotic pan-genome analysis toolkit, enabling automatic large-scale eukaryotic pan-genome analyses and detection of gene PAVs at a relatively low sequencing depth. In the previous studies, we demonstrated the effectiveness and high accuracy of EUPAN in the pan-genome analysis of 453 rice genomes, in which we also revealed widespread gene PAVs among individual rice genomes. Moreover, EUPAN can be directly applied to the current re-sequencing projects primarily focusing on single nucleotide polymorphisms. EUPAN is implemented in Perl, R and C ++. It is supported under Linux and preferred for a computer cluster with LSF and SLURM job scheduling system. EUPAN together with its standard operating procedure (SOP) is freely available for non-commercial use (CC BY-NC 4.0) at http://cgm.sjtu.edu.cn/eupan/index.html . email@example.com or firstname.lastname@example.org. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: email@example.com
Full Text Available This paper is a review of currently available data concerning interactions of tRNAs with the eukaryotic ribosome at various stages of translation. These data include the results obtained by means of cryo-electron microscopy and X-ray crystallography applied to various model ribosomal complexes, site-directed cross-linking with the use of tRNA derivatives bearing chemically or photochemically reactive groups in the CCA-terminal fragment and chemical probing of 28S rRNA in the region of the peptidyl transferase center. Similarities and differences in the interactions of tRNAs with prokaryotic and eukaryotic ribosomes are discussed with concomitant consideration of the extent of resemblance between molecular mechanisms of translation in eukaryotes and bacteria.
He, Mingyue; Edwards, Bryan M; Kastelic, Damjana; Taussig, Michael J
Ribosome display is a cell-free display technology for in vitro selection and optimisation of proteins from large diversified libraries. It operates through the formation of stable protein-ribosome-mRNA (PRM) complexes and selection of ligand-binding proteins, followed by DNA recovery from the selected genetic information. Both prokaryotic and eukaryotic ribosome display systems have been developed. In this chapter, we describe the eukaryotic rabbit reticulocyte method in which a distinct in situ single-primer RT-PCR procedure is used to recover DNA from the selected PRM complexes without the need for prior disruption of the ribosome.
Palmgren, Michael; Engström, Karin; Hallström, Björn M.
the evolutionary origin of AS3MT and assessed the ability of different genotypes to produce methylated arsenic metabolites. Phylogenetic analysis suggests that multiple, independent horizontal gene transfers between different bacteria, and from bacteria to eukaryotes, increased tolerance to environmental arsenic...
Full Text Available While the basal transcription machinery in archaea is eukaryal-like, transcription factors in archaea and their viruses are usually related to bacterial transcription factors. Nevertheless, some of these organisms show predicted classical zinc fingers motifs of the C2H2 type, which are almost exclusively found in proteins of eukaryotes and most often associated with transcription regulators. In this work, we focused on the protein AFV1p06 from the hyperthermophilic archaeal virus AFV1. The sequence of the protein consists of the classical eukaryotic C2H2 motif with the fourth histidine coordinating zinc missing, as well as of N- and C-terminal extensions. We showed that the protein AFV1p06 binds zinc and solved its solution structure by NMR. AFV1p06 displays a zinc finger fold with a novel structure extension and disordered N- and C-termini. Structure calculations show that a glutamic acid residue that coordinates zinc replaces the fourth histidine of the C2H2 motif. Electromobility gel shift assays indicate that the protein binds to DNA with different affinities depending on the DNA sequence. AFV1p06 is the first experimentally characterised archaeal zinc finger protein with a DNA binding activity. The AFV1p06 protein family has homologues in diverse viruses of hyperthermophilic archaea. A phylogenetic analysis points out a common origin of archaeal and eukaryotic C2H2 zinc fingers.
Hampl, Vladimir; Hug, Laura; Leigh, Jessica W; Dacks, Joel B; Lang, B Franz; Simpson, Alastair G B; Roger, Andrew J
Nearly all of eukaryotic diversity has been classified into 6 suprakingdom-level groups (supergroups) based on molecular and morphological/cell-biological evidence; these are Opisthokonta, Amoebozoa, Archaeplastida, Rhizaria, Chromalveolata, and Excavata. However, molecular phylogeny has not provided clear evidence that either Chromalveolata or Excavata is monophyletic, nor has it resolved the relationships among the supergroups. To establish the affinities of Excavata, which contains parasites of global importance and organisms regarded previously as primitive eukaryotes, we conducted a phylogenomic analysis of a dataset of 143 proteins and 48 taxa, including 19 excavates. Previous phylogenomic studies have not included all major subgroups of Excavata, and thus have not definitively addressed their interrelationships. The enigmatic flagellate Andalucia is sister to typical jakobids. Jakobids (including Andalucia), Euglenozoa and Heterolobosea form a major clade that we name Discoba. Analyses of the complete dataset group Discoba with the mitochondrion-lacking excavates or "metamonads" (diplomonads, parabasalids, and Preaxostyla), but not with the final excavate group, Malawimonas. This separation likely results from a long-branch attraction artifact. Gradual removal of rapidly-evolving taxa from the dataset leads to moderate bootstrap support (69%) for the monophyly of all Excavata, and 90% support once all metamonads are removed. Most importantly, Excavata robustly emerges between unikonts (Amoebozoa + Opisthokonta) and "megagrouping" of Archaeplastida, Rhizaria, and chromalveolates. Our analyses indicate that Excavata forms a monophyletic suprakingdom-level group that is one of the 3 primary divisions within eukaryotes, along with unikonts and a megagroup of Archaeplastida, Rhizaria, and the chromalveolate lineages.
Felder, Marius; Romualdi, Alessandro; Petzold, Andreas; Platzer, Matthias; Sühnel, Jürgen; Glöckner, Gernot
Many sequence data repositories can give a quick and easily accessible overview on genomes and their annotations. Less widespread is the possibility to compare related genomes with each other in a common database environment. We have previously described the GenColors database system (http://gencolors.fli-leibniz.de) and its applications to a number of bacterial genomes such as Borrelia, Legionella, Leptospira and Treponema. This system has an emphasis on genome comparison. It combines data from related genomes and provides the user with an extensive set of visualization and analysis tools. Eukaryote genomes are normally larger than prokaryote genomes and thus pose additional challenges for such a system. We have, therefore, adapted GenColors to also handle larger datasets of small eukaryotic genomes and to display eukaryotic gene structures. Further recent developments include whole genome views, genome list options and, for bacterial genome browsers, the display of horizontal gene transfer predictions. Two new GenColors-based databases for two fungal species (http://fgb.fli-leibniz.de) and for four social amoebas (http://sacgb.fli-leibniz.de) were set up. Both new resources open up a single entry point for related genomes for the amoebozoa and fungal research communities and other interested users. Comparative genomics approaches are greatly facilitated by these resources.
Bourke, Michael F.; Marriott, Philip J.; Glud, Ronnie N.; Hasler-Sheetal, Harald; Kamalanathan, Manoj; Beardall, John; Greening, Chris; Cook, Perran L. M.
Permeable sediments are common across continental shelves and are critical contributors to marine biogeochemical cycling. Organic matter in permeable sediments is dominated by microalgae, which as eukaryotes have different anaerobic metabolic pathways to bacteria and archaea. Here we present analyses of flow-through reactor experiments showing that dissolved inorganic carbon is produced predominantly as a result of anaerobic eukaryotic metabolic activity. In our experiments, anaerobic production of dissolved inorganic carbon was consistently accompanied by large dissolved H2 production rates, suggesting the presence of fermentation. The production of both dissolved inorganic carbon and H2 persisted following administration of broad spectrum bactericidal antibiotics, but ceased following treatment with metronidazole. Metronidazole inhibits the ferredoxin/hydrogenase pathway of fermentative eukaryotic H2 production, suggesting that pathway as the source of H2 and dissolved inorganic carbon production. Metabolomic analysis showed large increases in lipid production at the onset of anoxia, consistent with documented pathways of anoxic dark fermentation in microalgae. Cell counts revealed a predominance of microalgae in the sediments. H2 production was observed in dark anoxic cultures of diatoms (Fragilariopsis sp.) and a chlorophyte (Pyramimonas) isolated from the study site, substantiating the hypothesis that microalgae undertake fermentation. We conclude that microalgal dark fermentation could be an important energy-conserving pathway in permeable sediments.
Full Text Available The ability to predict gene content is highly desirable for characterization of not-yet sequenced genomes like those of dinoflagellates. Using data from completely sequenced and annotated genomes from phylogenetically diverse lineages, we investigated the relationship between gene content and genome size using regression analyses. Distinct relationships between log(10-transformed protein-coding gene number (Y' versus log(10-transformed genome size (X', genome size in kbp were found for eukaryotes and non-eukaryotes. Eukaryotes best fit a logarithmic model, Y' = ln(-46.200+22.678X', whereas non-eukaryotes a linear model, Y' = 0.045+0.977X', both with high significance (p0.91. Total gene number shows similar trends in both groups to their respective protein coding regressions. The distinct correlations reflect lower and decreasing gene-coding percentages as genome size increases in eukaryotes (82%-1% compared to higher and relatively stable percentages in prokaryotes and viruses (97%-47%. The eukaryotic regression models project that the smallest dinoflagellate genome (3x10(6 kbp contains 38,188 protein-coding (40,086 total genes and the largest (245x10(6 kbp 87,688 protein-coding (92,013 total genes, corresponding to 1.8% and 0.05% gene-coding percentages. These estimates do not likely represent extraordinarily high functional diversity of the encoded proteome but rather highly redundant genomes as evidenced by high gene copy numbers documented for various dinoflagellate species.
Li, Airong; Hooli, Basavaraj; Mullin, Kristina; Tate, Rebecca E; Bubnys, Adele; Kirchner, Rory; Chapman, Brad; Hofmann, Oliver; Hide, Winston; Tanzi, Rudolph E
SOX5 encodes a transcription factor that is expressed in multiple tissues including heart, lung and brain. Mutations in SOX5 have been previously found in patients with amyotrophic lateral sclerosis (ALS) and developmental delay, intellectual disability and dysmorphic features. To characterize the neuronal role of SOX5, we silenced the Drosophila ortholog of SOX5, Sox102F, by RNAi in various neuronal subtypes in Drosophila. Silencing of Sox102F led to misorientated and disorganized michrochaetes, neurons with shorter dendritic arborization (DA) and reduced complexity, diminished larval peristaltic contractions, loss of neuromuscular junction bouton structures, impaired olfactory perception, and severe neurodegeneration in brain. Silencing of SOX5 in human SH-SY5Y neuroblastoma cells resulted in a significant repression of WNT signaling activity and altered expression of WNT-related genes. Genetic association and meta-analyses of the results in several large family-based and case-control late-onset familial Alzheimer's disease (LOAD) samples of SOX5 variants revealed several variants that show significant association with AD disease status. In addition, analysis for rare and highly penetrate functional variants revealed four novel variants/mutations in SOX5, which taken together with functional prediction analysis, suggests a strong role of SOX5 causing AD in the carrier families. Collectively, these findings indicate that SOX5 is a novel candidate gene for LOAD with an important role in neuronal function. The genetic findings warrant further studies to identify and characterize SOX5 variants that confer risk for AD, ALS and intellectual disability. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: firstname.lastname@example.org.
Camarata, Troy; Vasilyev, Aleksandr; Hadjiargyrou, Michael
Mustn1 is a small nuclear protein that is involved in the development and regeneration of the musculoskeletal system. Previous work established a role for Mustn1 in myogenic and chondrogenic differentiation. In addition, recent evidence suggests a potential role for Mustn1 in cilia function in zebrafish. A detailed study of Mustn1 expression has yet to be conducted in zebrafish. As such, we report herein the cloning of the zebrafish Mustn1 orthologs, mustn1a and mustn1b, and their expression during zebrafish embryonic and larval development. Results indicate a 44% nucleotide identity between the two paralogs. Phylogenetic analysis further confirmed that the Mustn1a and 1b predicted proteins were highly related to other vertebrate members of the Mustn1 protein family. Whole mount in situ hybridization revealed expression of both mustn1a and 1b at the 7-somite stage through 72hpf in structures such as Kupffer's vesicle, segmental mesoderm, head structures, and otic vesicle. Additionally, in 5day old larva, mustn1a and 1b expression is detected in the neurocranium, otic capsule, and the gut. Although both were expressed in the neurocranium, mustn1a was localized in the hypophyseal fenestra whereas mustn1b was found near the posterior basicapsular commissure. mustn1b also displayed expression in the ceratohyal and ceratobranchial elements of the pharyngeal skeleton. These expression patterns were verified temporally by q-PCR analysis. Taken together, we conclude that Mustn1 expression is conserved in vertebrates and that the variations in expression of the two zebrafish paralogs suggest different modes of molecular regulation. Copyright © 2016 Elsevier B.V. All rights reserved.
Galagan James E
Full Text Available Abstract Background Natural selection has traditionally been understood as a force responsible for pushing genes to states of higher translational efficiency, whereas lower translational efficiency has been explained by neutral mutation and genetic drift. We looked for evidence of directional selection resulting in increased unpreferred codon usage (and presumably reduced translational efficiency in three divergent clusters of eukaryotic genomes using a simple optimal-codon-based metric (Kp/Ku. Results Here we show that for some genes natural selection is indeed responsible for causing accelerated unpreferred codon substitution, and document the scope of this selection. In Cryptococcus and to a lesser extent Drosophila, we find many genes showing a statistically significant signal of selection for unpreferred codon usage in one or more lineages. We did not find evidence for this type of selection in Saccharomyces. The signal of positive selection observed from unpreferred synonymous codon substitutions is coincident in Cryptococcus and Drosophila with the distribution of upstream open reading frames (uORFs, another genic feature known to reduce translational efficiency. Functional enrichment analysis of genes exhibiting low Kp/Ku ratios reveals that genes in regulatory roles are particularly subject to this type of selection. Conclusion Through genome-wide scans, we find recent selection for unpreferred codon usage at approximately 1% of genetic loci in a Cryptococcus and several genes in Drosophila. Unpreferred codons can impede translation efficiency, and we find that genes with translation-impeding uORFs are enriched for this selection signal. We find that regulatory genes are particularly likely to be subject to selection for unpreferred codon usage. Given that expression noise can propagate through regulatory cascades, and that low translational efficiency can reduce expression noise, this finding supports the hypothesis that translational
Haberer, Georg; Panda, Arup; Das Laha, Shayani; Ghosh, Tapas Chandra; Schäffner, Anton R.
The identification of functionally equivalent, orthologous genes (functional orthologs) across genomes is necessary for accurate transfer of experimental knowledge from well-characterized organisms to others. This frequently relies on automated, coding sequence-based approaches such as OrthoMCL, Inparanoid, and KOG, which usually work well for one-to-one homologous states. However, this strategy does not reliably work for plants due to the occurrence of extensive gene/genome duplication. Frequently, for one query gene, multiple orthologous genes are predicted in the other genome, and it is not clear a priori from sequence comparison and similarity which one preserves the ancestral function. We have studied 11 organ-dependent and stress-induced gene expression patterns of 286 Arabidopsis lyrata duplicated gene groups and compared them with the respective Arabidopsis (Arabidopsis thaliana) genes to predict putative expressologs and nonexpressologs based on gene expression similarity. Promoter sequence divergence as an additional tool to substantiate functional orthology only partially overlapped with expressolog classification. By cloning eight A. lyrata homologs and complementing them in the respective four Arabidopsis loss-of-function mutants, we experimentally proved that predicted expressologs are indeed functional orthologs, while nonexpressologs or nonfunctionalized orthologs are not. Our study demonstrates that even a small set of gene expression data in addition to sequence homologies are instrumental in the assignment of functional orthologs in the presence of multiple orthologs. PMID:27303025
species but also from accidental contamination from the genome of eukaryotic host cells. The latter scenario generally occurs in the case of host-associated metagenomes, e.g. microbes living in human gut. In such cases, one needs to identify and remove contaminating host DNA sequences, since the latter sequences will ...
Wijffels, R.H.; Kruse, O.; Hellingwerf, K.J.
Both cyanobacteria and eukaryotic microalgae are promising organisms for sustainable production of bulk products such as food, feed, materials, chemicals and fuels. In this review we will summarize the potential and current biotechnological developments.Cyanobacteria are promising host organisms for
Khalil, Ahmad S.; Lu, Timothy K.; Bashor, Caleb J.; Ramirez, Cherie L.; Pyenson, Nora C.; Joung, J. Keith; Collins, James J.
SUMMARY Eukaryotic transcription factors (TFs) perform complex and combinatorial functions within transcriptional networks. Here, we present a synthetic framework for systematically constructing eukaryotic transcription functions using artificial zinc fingers, modular DNA-binding domains found within many eukaryotic TFs. Utilizing this platform, we construct a library of orthogonal synthetic transcription factors (sTFs) and use these to wire synthetic transcriptional circuits in yeast. We engineer complex functions, such as tunable output strength and transcriptional cooperativity, by rationally adjusting a decomposed set of key component properties, e.g., DNA specificity, affinity, promoter design, protein-protein interactions. We show that subtle perturbations to these properties can transform an individual sTF between distinct roles (activator, cooperative factor, inhibitory factor) within a transcriptional complex, thus drastically altering the signal processing behavior of multi-input systems. This platform provides new genetic components for synthetic biology and enables bottom-up approaches to understanding the design principles of eukaryotic transcriptional complexes and networks. PMID:22863014
Melixetian, Marina; Helin, Kristian
Eukaryotes have evolved multiple mechanisms to restrict DNA replication to once per cell cycle. These mechanisms prevent relicensing of origins of replication after initiation of DNA replication in S phase until the end of mitosis. Most of our knowledge of mechanisms controlling prereplication...
Supplementary figure 1. Plots depicting the classification accuracy of Eu-Detect with various combinations of. 'cumulative sequence count' (40K, 50K, 60K, 70K, 80K) and 'coverage threshold' (20%, 30%, 40%, 50%, 60%, 70%,. 80%). While blue bars represent Eu-Detect's average classification accuracy with eukaryotic ...
Yardimci, Hasan; Loveland, Anna B.; Habuchi, Satoshi; van Oijen, Antoine M.; Walter, Johannes C.
The duplication of eukaryotic genomes involves the replication of DNA from multiple origins of replication. In S phase, two sister replisomes assemble at each active origin, and they replicate DNA in opposite directions. Little is known about the functional relationship between sister replisomes.
Gross, Jeferson; Bhattacharya, Debashish
Theories about eukaryote origins (eukaryogenesis) need to provide unified explanations for the emergence of diverse complex features that define this lineage. Models that propose a prokaryote-to-eukaryote transition are gridlocked between the opposing "phagocytosis first" and "mitochondria as seed" paradigms, neither of which fully explain the origins of eukaryote cell complexity. Sex (outcrossing with meiosis) is an example of an elaborate trait not yet satisfactorily addressed in theories about eukaryogenesis. The ancestral nature of meiosis and its dependence on eukaryote cell biology suggest that the emergence of sex and eukaryogenesis were simultaneous and synergic and may be explained by a common selective pressure. We propose that a local rise in oxygen levels, due to cyanobacterial photosynthesis in ancient Archean microenvironments, was highly toxic to the surrounding biota. This selective pressure drove the transformation of an archaeal (archaebacterial) lineage into the first eukaryotes. Key is that oxygen might have acted in synergy with environmental stresses such as ultraviolet (UV) radiation and/or desiccation that resulted in the accumulation of reactive oxygen species (ROS). The emergence of eukaryote features such as the endomembrane system and acquisition of the mitochondrion are posited as strategies to cope with a metabolic crisis in the cell plasma membrane and the accumulation of ROS, respectively. Selective pressure for efficient repair of ROS/UV-damaged DNA drove the evolution of sex, which required cell-cell fusions, cytoskeleton-mediated chromosome movement, and emergence of the nuclear envelope. Our model implies that evolution of sex and eukaryogenesis were inseparable processes. Several types of data can be used to test our hypothesis. These include paleontological predictions, simulation of ancient oxygenic microenvironments, and cell biological experiments with Archaea exposed to ROS and UV stresses. Studies of archaeal conjugation
Full Text Available Chitin synthases (CHSs are key enzymes in the biosynthesis of chitin, an important structural component of fungal cell walls that can trigger innate immune responses in host plants and animals. Members of CHS gene family perform various functions in fungal cellular processes. Previous studies focused primarily on classifying diverse CHSs into different classes, regardless of their functional diversification, or on characterizing their functions in individual fungal species. A complete and systematic comparative analysis of CHS genes based on their orthologous relationships will be valuable for elucidating the evolution and functions of different CHS genes in fungi. Here, we identified and compared members of the CHS gene family across the fungal tree of life, including 18 divergent fungal lineages. Phylogenetic analysis revealed that the fungal CHS gene family is comprised of at least 10 ancestral orthologous clades, which have undergone multiple independent duplications and losses in different fungal lineages during evolution. Interestingly, one of these CHS clades (class III was expanded in plant or animal pathogenic fungi belonging to different fungal lineages. Two clades (classes VIb and VIc identified for the first time in this study occurred mainly in plant pathogenic fungi from Sordariomycetes and Dothideomycetes. Moreover, members of classes III and VIb were specifically up-regulated during plant infection, suggesting important roles in pathogenesis. In addition, CHS-associated networks conserved among plant pathogenic fungi are involved in various biological processes, including sexual reproduction and plant infection. We also identified specificity-determining sites, many of which are located at or adjacent to important structural and functional sites that are potentially responsible for functional divergence of different CHS classes. Overall, our results provide new insights into the evolution and function of members of CHS gene
Finan Christopher L
Full Text Available Abstract Background In Micrococcus luteus growth and resuscitation from starvation-induced dormancy is controlled by the production of a secreted growth factor. This autocrine resuscitation-promoting factor (Rpf is the founder member of a family of proteins found throughout and confined to the actinobacteria (high G + C Gram-positive bacteria. The aim of this work was to search for and characterise a cognate gene family in the firmicutes (low G + C Gram-positive bacteria and obtain information about how they may control bacterial growth and resuscitation. Results In silico analysis of the accessory domains of the Rpf proteins permitted their classification into several subfamilies. The RpfB subfamily is related to a group of firmicute proteins of unknown function, represented by YabE of Bacillus subtilis. The actinobacterial RpfB and firmicute YabE proteins have very similar domain structures and genomic contexts, except that in YabE, the actinobacterial Rpf domain is replaced by another domain, which we have called Sps. Although totally unrelated in both sequence and secondary structure, the Rpf and Sps domains fulfil the same function. We propose that these proteins have undergone "non-orthologous domain displacement", a phenomenon akin to "non-orthologous gene displacement" that has been described previously. Proteins containing the Sps domain are widely distributed throughout the firmicutes and they too fall into a number of distinct subfamilies. Comparative analysis of the accessory domains in the Rpf and Sps proteins, together with their weak similarity to lytic transglycosylases, provide clear evidence that they are muralytic enzymes. Conclusions The results indicate that the firmicute Sps proteins and the actinobacterial Rpf proteins are cognate and that they control bacterial culturability via enzymatic modification of the bacterial cell envelope.
Yang, Xian-Xian; Zhang, Mei; Yan, Zhao-Wen; Zhang, Ru-Hong; Mu, Xiong-Zheng
To construct a high effective eukaryotic expressing plasmid PcDNA 3.1-MSX-2 encoding Sprague-Dawley rat MSX-2 gene for the further study of MSX-2 gene function. The full length SD rat MSX-2 gene was amplified by PCR, and the full length DNA was inserted in the PMD1 8-T vector. It was isolated by restriction enzyme digest with BamHI and Xhol, then ligated into the cloning site of the PcDNA3.1 expression plasmid. The positive recombinant was identified by PCR analysis, restriction endonudease analysis and sequence analysis. Expression of RNA and protein was detected by RT-PCR and Western blot analysis in PcDNA3.1-MSX-2 transfected HEK293 cells. Sequence analysis and restriction endonudease analysis of PcDNA3.1-MSX-2 demonstrated that the position and size of MSX-2 cDNA insertion were consistent with the design. RT-PCR and Western blot analysis showed specific expression of mRNA and protein of MSX-2 in the transfected HEK293 cells. The high effective eukaryotic expression plasmid PcDNA3.1-MSX-2 encoding Sprague-Dawley Rat MSX-2 gene which is related to craniofacial development can be successfully reconstructed. It may serve as the basis for the further study of MSX-2 gene function.
Poietti, S.; Bertini, L.; Ent, S. van der; Leon Reyes, H.A.; Pieterse, C.M.J.; Tucci, M.; Caporale, C.; Caruso, C.
WRKY proteins are transcription factors involved in many plant processes including plant responses to pathogens. Here, the cross activity of TaWRKY78 from the monocot wheat and AtWRKY20 from the dicot Arabidopsis on the cognate promoters of the orthologous PR4-type genes wPR4e and AtHEL of wheat and
Trachtulec, Zdeněk; Forejt, Jiří
Roč. 12, č. 3 (2001), s. 227-231 ISSN 0938-8990 Institutional research plan: CEZ:AV0Z5052915 Keywords : synteny of orthologous genes Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 2.318, year: 2001
Full Text Available e relevant data in the databases. By submitting queries to the PGDBj Ortholog DB with keywords or amino acid sequences, users... taxa including both model plants and crop plants. Following the links obtained, users can retrieve the actu
Full Text Available Abstract Background Comparison of completely sequenced microbial genomes has revealed how fluid these genomes are. Detecting synteny blocks requires reliable methods to determining the orthologs among the whole set of homologs detected by exhaustive comparisons between each pair of completely sequenced genomes. This is a complex and difficult problem in the field of comparative genomics but will help to better understand the way prokaryotic genomes are evolving. Results We have developed a suite of programs that automate three essential steps to study conservation of gene order, and validated them with a set of 107 bacteria and archaea that cover the majority of the prokaryotic taxonomic space. We identified the whole set of shared homologs between two or more species and computed the evolutionary distance separating each pair of homologs. We applied two strategies to extract from the set of homologs a collection of valid orthologs shared by at least two genomes. The first computes the Reciprocal Smallest Distance (RSD using the PAM distances separating pairs of homologs. The second method groups homologs in families and reconstructs each family's evolutionary tree, distinguishing bona fide orthologs as well as paralogs created after the last speciation event. Although the phylogenetic tree method often succeeds where RSD fails, the reverse could occasionally be true. Accordingly, we used the data obtained with either methods or their intersection to number the orthologs that are adjacent in for each pair of genomes, the Positional Orthologous Genes (POGs, and to further study their properties. Once all these synteny blocks have been detected, we showed that POGs are subject to more evolutionary constraints than orthologs outside synteny groups, whichever the taxonomic distance separating the compared organisms. Conclusion The suite of programs described in this paper allows a reliable detection of orthologs and is useful for evaluating gene
Santamaria, Monica; Fosso, Bruno; Licciulli, Flavio; Balech, Bachir; Larini, Ilaria; Grillo, Giorgio; De Caro, Giorgio; Liuni, Sabino; Pesole, Graziano
A holistic understanding of environmental communities is the new challenge of metagenomics. Accordingly, the amplicon-based or metabarcoding approach, largely applied to investigate bacterial microbiomes, is moving to the eukaryotic world too. Indeed, the analysis of metabarcoding data may provide a comprehensive assessment of both bacterial and eukaryotic composition in a variety of environments, including human body. In this respect, whereas hypervariable regions of the 16S rRNA are the de facto standard barcode for bacteria, the Internal Transcribed Spacer 1 (ITS1) of ribosomal RNA gene cluster has shown a high potential in discriminating eukaryotes at deep taxonomic levels. As metabarcoding data analysis rely on the availability of a well-curated barcode reference resource, a comprehensive collection of ITS1 sequences supplied with robust taxonomies, is highly needed. To address this issue, we created ITSoneDB (available at http://itsonedb.cloud.ba.infn.it/) which in its current version hosts 985 240 ITS1 sequences spanning over 134 000 eukaryotic species. Each ITS1 is mapped on the NCBI reference taxonomy with its start and end positions precisely annotated. ITSoneDB has been developed in agreement to the FAIR guidelines by enabling the users to query and download its content through a simple web-interface and access relevant metadata by cross-linking to European Nucleotide Archive. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Padieu, P.; Maume, B.F.
Carbon-14 labelled compounds in cell cultures are used to establish the interconnections between different metabolic pathways as well as the competitive action of effectors on these different pathways. Analysis was performed by the GC-MS combination. Identification was carried out by comparison with the mass spectra of d9-TMS, 35 Cl-TMS and 37 Cl-TMS derivatizations of the culture extracts. Examples are given of the metabolic study of hormonal steroids and of safrale, a carcinogenic compound, by differentiated eukaryotic cells in cultures from the rat
Platre, Matthieu Pierre; Jaillais, Yvon
A wide range of signaling processes occurs at the cell surface through the reversible association of proteins from the cytosol to the plasma membrane. Some low abundant lipids are enriched at the membrane of specific compartments and thereby contribute to the identity of cell organelles by acting as biochemical landmarks. Lipids also influence membrane biophysical properties, which emerge as an important feature in specifying cellular territories. Such parameters are crucial for signal transduction and include lipid packing, membrane curvature and electrostatics. In particular, membrane electrostatics specifies the identity of the plasma membrane inner leaflet. Membrane surface charges are carried by anionic phospholipids, however the exact nature of the lipid(s) that powers the plasma membrane electrostatic field varies among eukaryotes and has been hotly debated during the last decade. Herein, we discuss the role of anionic lipids in setting up plasma membrane electrostatics and we compare similarities and differences that were found in different eukaryotic cells.
Yeeles, Joseph T P; Deegan, Tom D; Janska, Agnieszka; Early, Anne; Diffley, John F X
Eukaryotic cells initiate DNA replication from multiple origins, which must be tightly regulated to promote precise genome duplication in every cell cycle. To accomplish this, initiation is partitioned into two temporally discrete steps: a double hexameric minichromosome maintenance (MCM) complex is first loaded at replication origins during G1 phase, and then converted to the active CMG (Cdc45-MCM-GINS) helicase during S phase. Here we describe the reconstitution of budding yeast DNA replication initiation with 16 purified replication factors, made from 42 polypeptides. Origin-dependent initiation recapitulates regulation seen in vivo. Cyclin-dependent kinase (CDK) inhibits MCM loading by phosphorylating the origin recognition complex (ORC) and promotes CMG formation by phosphorylating Sld2 and Sld3. Dbf4-dependent kinase (DDK) promotes replication by phosphorylating MCM, and can act either before or after CDK. These experiments define the minimum complement of proteins, protein kinase substrates and co-factors required for regulated eukaryotic DNA replication.
Full Text Available Abstract Background In mammals, pheromones play an important role in social and innate reproductive behavior within species. In rodents, vomeronasal receptor type 1 (V1R, which is specifically expressed in the vomeronasal organ, is thought to detect pheromones. The V1R gene repertoire differs dramatically between mammalian species, and the presence of species-specific V1R subfamilies in mouse and rat suggests that V1R plays a profound role in species-specific recognition of pheromones. In ruminants, however, the molecular mechanism(s for pheromone perception is not well understood. Interestingly, goat male pheromone, which can induce out-of-season ovulation in anestrous females, causes the same pheromone response in sheep, and vice versa, suggesting that there may be mechanisms for detecting "inter-species" pheromones among ruminant species. Results We isolated 23 goat and 21 sheep intact V1R genes based on sequence similarity with 32 cow V1R genes in the cow genome database. We found that all of the goat and sheep V1R genes have orthologs in their cross-species counterparts among these three ruminant species and that the sequence identity of V1R orthologous pairs among these ruminants is much higher than that of mouse-rat V1R orthologous pairs. Furthermore, all goat V1Rs examined thus far are expressed not only in the vomeronasal organ but also in the main olfactory epithelium. Conclusion Our results suggest that, compared with rodents, the repertoire of orthologous V1R genes is remarkably conserved among the ruminants cow, sheep and goat. We predict that these orthologous V1Rs can detect the same or closely related chemical compound(s within each orthologous set/pair. Furthermore, all identified goat V1Rs are expressed in the vomeronasal organ and the main olfactory epithelium, suggesting that V1R-mediated ligand information can be detected and processed by both the main and accessory olfactory systems. The fact that ruminant and rodent V1Rs
Abid Ali, Ferdos; Costa, Alessandro
The MCM motor of the CMG helicase powers ahead of the eukaryotic replication machinery to unwind DNA, in a process that requires ATP hydrolysis. The reconstitution of DNA replication in vitro has established the succession of events that lead to replication origin activation by the MCM and recent studies have started to elucidate the structural basis of duplex DNA unwinding. Despite the exciting progress, how the MCM translocates on DNA remains a matter of debate. Copyright © 2016. Published by Elsevier Ltd.
Oleg Mosin; Ignat Ignatov
Biologic influence of deuterium (D) on cells of various taxonomic groups of prokaryotic and eukaryotic microorganisms realizing methylotrophic, chemoheterotrophic, photo-organotrophic, and photosynthetic ways of assimilation of carbon substrates are investigated at growth on media with heavy water (D2О). The method of step by step adaptation technique of cells to D2О was developed, consisting in plating of cells on 2 % agarose nutrient media containing increasing gradient of concentration of ...
Neil W. Blackstone
Two major obstacles hinder the application of evolutionary theory to the origin of eukaryotes. The first is more apparent than real?the endosymbiosis that led to the mitochondrion is often described as ?non-Darwinian? because it deviates from the incremental evolution championed by the modern synthesis. Nevertheless, endosymbiosis can be accommodated by a multi-level generalization of evolutionary theory, which Darwin himself pioneered. The second obstacle is more serious?all of the major fea...
Full Text Available Ample evidence exists showing that eukaryotic signal molecules synthesized and released by the host can activate the virulence of opportunistic pathogens. The sensitivity of prokaryotes to host signal molecules requires the presence of bacterial sensors. These prokaryotic sensors, or receptors, have a double function: stereospecific recognition in a complex environment and transduction of the message in order to initiate bacterial physiological modifications. As messengers are generally unable to freely cross the bacterial membrane, they require either the presence of sensors anchored in the membrane or transporters allowing direct recognition inside the bacterial cytoplasm. Since the discovery of quorum sensing, it was established that the production of virulence factors by bacteria is tightly growth-phase regulated. It is now obvious that expression of bacterial virulence is also controlled by detection of the eukaryotic messengers released in the micro-environment as endocrine or neuro-endocrine modulators. In the presence of host physiological stress many eukaryotic factors are released and detected by Gram-negative bacteria which in return rapidly adapt their physiology. For instance, Pseudomonas aeruginosa can bind elements of the host immune system such as interferon-γ and dynorphin and then through quorum sensing circuitry enhance its virulence. Escherichia coli sensitivity to the neurohormones of the catecholamines family appears relayed by a recently identified bacterial adrenergic receptor. In the present review, we will describe the mechanisms by which various eukaryotic signal molecules produced by host may activate Gram-negative bacteria virulence. Particular attention will be paid to Pseudomonas, a genus whose representative species, P. aeruginosa, is a common opportunistic pathogen. The discussion will be particularly focused on the pivotal role played by these new types of pathogen sensors from the sensing to the transduction
Morrow, John F.; Cohen, Stanley N.; Chang, Annie C. Y.; Boyer, Herbert W.; Goodman, Howard M.; Helling, Robert B.
Fragments of amplified Xenopus laevis DNA, coding for 18S and 28S ribosomal RNA and generated by EcoRI restriction endonuclease, have been linked in vitro to the bacterial plasmid pSC101; and the recombinant molecular species have been introduced into E. coli by transformation. These recombinant plasmids, containing both eukaryotic and prokaryotic DNA, replicate stably in E. coli. RNA isolated from E. coli minicells harboring the plasmids hybridizes to amplified X. laevis rDNA. Images PMID:4600264
Flegontova, Olga; Flegontov, Pavel; Malviya, S.; Audic, S.; Wincker, P.; de Vargas, C.; Bowler, C.; Lukeš, Julius; Horák, Aleš
Roč. 26, č. 22 (2016), s. 3060-3065 ISSN 0960-9822 R&D Projects: GA ČR GPP506/12/P931; GA ČR(CZ) GA14-23986S Institutional support: RVO:60077344 Keywords : virus-sized particles * microbial eukaryotes * sea-floor * phytoplankton * communities * euglenozoa * dispersal * ecosystem Subject RIV: EG - Zoology Impact factor: 8.851, year: 2016
Simon, Martin; Plattner, Helmut
Unicellular eukaryotes have been appreciated as model systems for the analysis of crucial questions in cell and molecular biology. This includes Dictyostelium (chemotaxis, amoeboid movement, phagocytosis), Tetrahymena (telomere structure, telomerase function), Paramecium (variant surface antigens, exocytosis, phagocytosis cycle) or both ciliates (ciliary beat regulation, surface pattern formation), Chlamydomonas (flagellar biogenesis and beat), and yeast (S. cerevisiae) for innumerable aspects. Nowadays many problems may be tackled with "higher" eukaryotic/metazoan cells for which full genomic information as well as domain databases, etc., were available long before protozoa. Established molecular tools, commercial antibodies, and established pharmacology are additional advantages available for higher eukaryotic cells. Moreover, an increasing number of inherited genetic disturbances in humans have become elucidated and can serve as new models. Among lower eukaryotes, yeast will remain a standard model because of its peculiarities, including its reduced genome and availability in the haploid form. But do protists still have a future as models? This touches not only the basic understanding of biology but also practical aspects of research, such as fund raising. As we try to scrutinize, due to specific advantages some protozoa should and will remain favorable models for analyzing novel genes or specific aspects of cell structure and function. Outstanding examples are epigenetic phenomena-a field of rising interest. © 2014 Elsevier Inc. All rights reserved.
Full Text Available Abstract Background The number of completed eukaryotic genome sequences and cDNA projects has increased exponentially in the past few years although most of them have not been published yet. In addition, many microarray analyses yielded thousands of sequenced EST and cDNA clones. For the researcher interested in single gene analyses (from a phylogenetic, a structural biology or other perspective it is therefore important to have up-to-date knowledge about the various resources providing primary data. Description The database is built around 3 central tables: species, sequencing projects and publications. The species table contains commonly and alternatively used scientific names, common names and the complete taxonomic information. For projects the sequence type and links to species project web-sites and species homepages are stored. All publications are linked to projects. The web-interface provides comprehensive search modules with detailed options and three different views of the selected data. We have especially focused on developing an elaborate taxonomic tree search tool that allows the user to instantaneously identify e.g. the closest relative to the organism of interest. Conclusion We have developed a database, called diArk, to store, organize, and present the most relevant information about completed genome projects and EST/cDNA data from eukaryotes. Currently, diArk provides information about 415 eukaryotes, 823 sequencing projects, and 248 publications.
Qin, Jie; Lehr, Corinne R; Yuan, Chungang; Le, X Chris; McDermott, Timothy R; Rosen, Barry P
Arsenic is the most common toxic substance in the environment, ranking first on the Superfund list of hazardous substances. It is introduced primarily from geochemical sources and is acted on biologically, creating an arsenic biogeocycle. Geothermal environments are known for their elevated arsenic content and thus provide an excellent setting in which to study microbial redox transformations of arsenic. To date, most studies of microbial communities in geothermal environments have focused on Bacteria and Archaea, with little attention to eukaryotic microorganisms. Here, we show the potential of an extremophilic eukaryotic alga of the order Cyanidiales to influence arsenic cycling at elevated temperatures. Cyanidioschyzon sp. isolate 5508 oxidized arsenite [As(III)] to arsenate [As(V)], reduced As(V) to As(III), and methylated As(III) to form trimethylarsine oxide (TMAO) and dimethylarsenate [DMAs(V)]. Two arsenic methyltransferase genes, CmarsM7 and CmarsM8, were cloned from this organism and demonstrated to confer resistance to As(III) in an arsenite hypersensitive strain of Escherichia coli. The 2 recombinant CmArsMs were purified and shown to transform As(III) into monomethylarsenite, DMAs(V), TMAO, and trimethylarsine gas, with a T(opt) of 60-70 degrees C. These studies illustrate the importance of eukaryotic microorganisms to the biogeochemical cycling of arsenic in geothermal systems, offer a molecular explanation for how these algae tolerate arsenic in their environment, and provide the characterization of algal methyltransferases.
Niño, Carlos A; Chaparro, Jenny; Soffientini, Paolo; Polo, Simona; Wasserman, Moises
Ubiquitination is a highly dynamic and versatile posttranslational modification that regulates protein function, stability, and interactions. To investigate the roles of ubiquitination in a primitive eukaryotic lineage, we utilized the early-branching eukaryote Giardia intestinalis. Using a combination of biochemical, immunofluorescence-based, and proteomics approaches, we assessed the ubiquitination status during the process of differentiation in Giardia. We observed that different types of ubiquitin modifications present specific cellular and temporal distribution throughout the Giardia life cycle from trophozoites to cyst maturation. Ubiquitin signal was detected in the wall of mature cysts, and enzymes implicated in cyst wall biogenesis were identified as substrates for ubiquitination. Interestingly, inhibition of proteasome activity did not affect trophozoite replication and differentiation, while it caused a decrease in cyst viability, arguing for proteasome involvement in cyst wall maturation. Using a proteomics approach, we identified around 200 high-confidence ubiquitinated candidates that vary their ubiquitination status during differentiation. Our results indicate that ubiquitination is critical for several cellular processes in this primitive eukaryote. PMID:23613346
Gene transfer into eukaryotic cells plays an important role in cell biology. Over the last 30 years a number of transfection methods have been developed to mediate gene transfer into eukaryotic cells. Classical methods include co-precipitation of DNA with calcium phosphate, charge-dependent precipitation of DNA with DEAE-dextran, electroporation of nucleic acids, and formation of transfection complexes between DNA and cationic liposomes. Gene transfer technologies based on activated PAMAM-dendrimers provide another class of transfection reagents. PAMAM-dendrimers are highly branched, spherical molecules. Activation of newly synthesized dendrimers involves hydrolytic removal of some of the branches, and results in a molecule with a higher degree of flexibility. Activated dendrimers assemble DNA into compact structures via charge interactions. Activated dendrimer - DNA complexes bind to the cell membrane of eukaryotic cells, and are transported into the cell by non-specific endocytosis. A structural model of the activated dendrimer - DNA complex and a potential mechanism for its uptake into cells will be discussed.
Moriyama, Takashi; Sato, Naoki
Plastids and mitochondria possess their own genomes. Although the replication mechanisms of these organellar genomes remain unclear in photosynthetic eukaryotes, several organelle-localized enzymes related to genome replication, including DNA polymerase, DNA primase, DNA helicase, DNA topoisomerase, single-stranded DNA maintenance protein, DNA ligase, primer removal enzyme, and several DNA recombination-related enzymes, have been identified. In the reference Eudicot plant Arabidopsis thaliana, the replication-related enzymes of plastids and mitochondria are similar because many of them are dual targeted to both organelles, whereas in the red alga Cyanidioschyzon merolae, plastids and mitochondria contain different replication machinery components. The enzymes involved in organellar genome replication in green plants and red algae were derived from different origins, including proteobacterial, cyanobacterial, and eukaryotic lineages. In the present review, we summarize the available data for enzymes related to organellar genome replication in green plants and red algae. In addition, based on the type and distribution of replication enzymes in photosynthetic eukaryotes, we discuss the transitional history of replication enzymes in the organelles of plants.
Parker, Matthew W; Botchan, Michael R; Berger, James M
Cellular DNA replication is initiated through the action of multiprotein complexes that recognize replication start sites in the chromosome (termed origins) and facilitate duplex DNA melting within these regions. In a typical cell cycle, initiation occurs only once per origin and each round of replication is tightly coupled to cell division. To avoid aberrant origin firing and re-replication, eukaryotes tightly regulate two events in the initiation process: loading of the replicative helicase, MCM2-7, onto chromatin by the origin recognition complex (ORC), and subsequent activation of the helicase by its incorporation into a complex known as the CMG. Recent work has begun to reveal the details of an orchestrated and sequential exchange of initiation factors on DNA that give rise to a replication-competent complex, the replisome. Here, we review the molecular mechanisms that underpin eukaryotic DNA replication initiation - from selecting replication start sites to replicative helicase loading and activation - and describe how these events are often distinctly regulated across different eukaryotic model organisms.
Full Text Available Abstract Background The origin of eukaryotic cells was one of the most dramatic evolutionary transitions in the history of life. It is generally assumed that eukaryotes evolved later then prokaryotes by the transformation or fusion of prokaryotic lineages. However, as yet there is no consensus regarding the nature of the prokaryotic group(s ancestral to eukaryotes. Regardless of this, a hardly debatable fundamental novel characteristic of the last eukaryotic common ancestor was the ability to exploit prokaryotic biomass by the ingestion of entire cells, i.e. phagocytosis. The recent advances in our understanding of the social life of prokaryotes may help to explain the origin of this form of total exploitation. Presentation of the hypothesis Here I propose that eukaryotic cells originated in a social environment, a differentiated microbial mat or biofilm that was maintained by the cooperative action of its members. Cooperation was costly (e.g. the production of developmental signals or an extracellular matrix but yielded benefits that increased the overall fitness of the social group. I propose that eukaryotes originated as selfish cheaters that enjoyed the benefits of social aggregation but did not contribute to it themselves. The cheaters later evolved into predators that lysed other cells and eventually became professional phagotrophs. During several cycles of social aggregation and dispersal the number of cheaters was contained by a chicken game situation, i.e. reproductive success of cheaters was high when they were in low abundance but was reduced when they were over-represented. Radical changes in cell structure, including the loss of the rigid prokaryotic cell wall and the development of endomembranes, allowed the protoeukaryotes to avoid cheater control and to exploit nutrients more efficiently. Cellular changes were buffered by both the social benefits and the protective physico-chemical milieu of the interior of biofilms. Symbiosis
Nicholas J Marini
Full Text Available Computational predictions of the functional impact of genetic variation play a critical role in human genetics research. For nonsynonymous coding variants, most prediction algorithms make use of patterns of amino acid substitutions observed among homologous proteins at a given site. In particular, substitutions observed in orthologous proteins from other species are often assumed to be tolerated in the human protein as well. We examined this assumption by evaluating a panel of nonsynonymous mutants of a prototypical human enzyme, methylenetetrahydrofolate reductase (MTHFR, in a yeast cell-based functional assay. As expected, substitutions in human MTHFR at sites that are well-conserved across distant orthologs result in an impaired enzyme, while substitutions present in recently diverged sequences (including a 9-site mutant that "resurrects" the human-macaque ancestor result in a functional enzyme. We also interrogated 30 sites with varying degrees of conservation by creating substitutions in the human enzyme that are accepted in at least one ortholog of MTHFR. Quite surprisingly, most of these substitutions were deleterious to the human enzyme. The results suggest that selective constraints vary between phylogenetic lineages such that inclusion of distant orthologs to infer selective pressures on the human enzyme may be misleading. We propose that homologous proteins are best used to reconstruct ancestral sequences and infer amino acid conservation among only direct lineal ancestors of a particular protein. We show that such an "ancestral site preservation" measure outperforms other prediction methods, not only in our selected set for MTHFR, but also in an exhaustive set of E. coli LacI mutants.
Moreira, Maria-Céu; Klur, Sandra; Watanabe, Mitsunori; Németh, Andrea H; Le Ber, Isabelle; Moniz, José-Carlos; Tranchant, Christine; Aubourg, Patrick; Tazir, Meriem; Schöls, Lüdger; Pandolfo, Massimo; Schulz, Jörg B; Pouget, Jean; Calvas, Patrick; Shizuka-Ikeda, Masami; Shoji, Mikio; Tanaka, Makoto; Izatt, Louise; Shaw, Christopher E; M'Zahem, Abderrahim; Dunne, Eimear; Bomont, Pascale; Benhassine, Traki; Bouslam, Naïma; Stevanin, Giovanni; Brice, Alexis; Guimarães, João; Mendonça, Pedro; Barbot, Clara; Coutinho, Paula; Sequeiros, Jorge; Dürr, Alexandra; Warter, Jean-Marie; Koenig, Michel
Ataxia-ocular apraxia 2 (AOA2) was recently identified as a new autosomal recessive ataxia. We have now identified causative mutations in 15 families, which allows us to clinically define this entity by onset between 10 and 22 years, cerebellar atrophy, axonal sensorimotor neuropathy, oculomotor apraxia and elevated alpha-fetoprotein (AFP). Ten of the fifteen mutations cause premature termination of a large DEAxQ-box helicase, the human ortholog of yeast Sen1p, involved in RNA maturation and termination.
Fujiwara, Shoko; Takatsuka, Yukiko; Hirokawa, Yasutaka; Tsuzuki, Mikio; Takano, Tomoyuki; Kobayashi, Masaaki; Suda, Kunihiro; Asamizu, Erika; Yokoyama, Koji; Shibata, Daisuke; Tabata, Satoshi; Yano, Kentaro
Pleurochrysis is a coccolithophorid genus, which belongs to the Coccolithales in the Haptophyta. The genus has been used extensively for biological research, together with Emiliania in the Isochrysidales, to understand distinctive features between the two coccolithophorid-including orders. However, molecular biological research on Pleurochrysis such as elucidation of the molecular mechanism behind coccolith formation has not made great progress at least in part because of lack of comprehensive gene information. To provide such information to the research community, we built an open web database, the Pleurochrysome (http://bioinf.mind.meiji.ac.jp/phapt/), which currently stores 9,023 unique gene sequences (designated as UNIGENEs) assembled from expressed sequence tag sequences of P. haptonemofera as core information. The UNIGENEs were annotated with gene sequences sharing significant homology, conserved domains, Gene Ontology, KEGG Orthology, predicted subcellular localization, open reading frames and orthologous relationship with genes of 10 other algal species, a cyanobacterium and the yeast Saccharomyces cerevisiae. This sequence and annotation information can be easily accessed via several search functions. Besides fundamental functions such as BLAST and keyword searches, this database also offers search functions to explore orthologous genes in the 12 organisms and to seek novel genes. The Pleurochrysome will promote molecular biological and phylogenetic research on coccolithophorids and other haptophytes by helping scientists mine data from the primary transcriptome of P. haptonemofera. PMID:26746174
Lentz, Christian S; Halls, Victoria S; Hannam, Jeffrey S; Strassel, Silke; Lawrence, Sarah H; Jaffe, Eileen K; Famulok, Michael; Hoerauf, Achim; Pfarr, Kenneth M
The heme biosynthesis enzyme porphobilinogen synthase (PBGS) is a potential drug target in several human pathogens. wALADin1 benzimidazoles have emerged as species-selective PBGS inhibitors against Wolbachia endobacteria of filarial worms. In the present study, we have systematically tested wALADins against PBGS orthologs from bacteria, protozoa, metazoa, and plants to elucidate the inhibitory spectrum. However, the effect of wALADin1 on different PBGS orthologs was not limited to inhibition: several orthologs were stimulated by wALADin1; others remained unaffected. We demonstrate that wALADins allosterically modulate the PBGS homooligomeric equilibrium with inhibition mediated by favoring low-activity oligomers, while 5-aminolevulinic acid, Mg(2+), or K(+) stabilized high-activity oligomers. Pseudomonas aeruginosa PBGS could be inhibited or stimulated by wALADin1 depending on these factors and pH. We have defined the wALADin chemotypes responsible for either inhibition or stimulation, facilitating the design of tailored PBGS modulators for potential application as antimicrobial agents, herbicides, or drugs for porphyric disorders.
Hamidou Soumana, Illiassou; Tchicaya, Bernadette; Rialle, Stéphanie; Parrinello, Hugues; Geiger, Anne
Blood-feeding Glossina palpalis gambiense (Gpg) fly transmits the single-celled eukaryotic parasite Trypanosoma brucei gambiense (Tbg), the second Glossina fly African trypanosome pair being Glossina morsitans / T .brucei rhodesiense. Whatever the T. brucei subspecies, whereas the onset of their developmental program in the zoo-anthropophilic blood feeding flies does unfold in the fly midgut, its completion is taking place in the fly salivary gland where does emerge a low size metacyclic trypomastigote population displaying features that account for its establishment in mammals-human individuals included. Considering that the two Glossina - T. brucei pairs introduced above share similarity with respect to the developmental program of this African parasite, we were curious to map on the Glossina morsitans morsitans (Gmm), the Differentially Expressed Genes (DEGs) we listed in a previous study. Briefly, using the gut samples collected at days 3, 10, and 20 from Gpg that were fed or not at day 0 on Tbg-hosting mice, these DGE lists were obtained from RNA seq-based approaches. Here, post the mapping on the quality controlled DEGs on the Gmm genome, the identified ortholog genes were further annotated, the resulting datasets being compared. Around 50% of the Gpg DEGs were shown to have orthologs in the Gmm genome. Under one of the three Glossina midgut sampling conditions, the number of DEGs was even higher when mapping on the Gmm genome than initially recorded. Many Gmm genes annotated as "Hypothetical" were mapped and annotated on many distinct databases allowing some of them to be properly identified. We identify Glossina fly candidate genes encoding (a) a broad panel of proteases as well as (b) chitin-binding proteins, (c) antimicrobial peptide production-Pro3 protein, transferrin, mucin, atttacin, cecropin, etc-to further select in functional studies, the objectives being to probe and validated fly genome manipulation that prevents the onset of the developmental
Li Xinli; Zhu Ran; Zhu Wei; Fan Saijun; Meng Qinghui
Objective: To generate eukaryotic expression vector of pcDNA3-UHRF1(ubiquitin-like, containing PHD and RING finger domains 1, UHRF1) and testify its expression in breast cancer cells MDA-MB-231. Methods: A 2.3 kb cDNA fragment was amplified from the total RNA of the human breast cancer cells MCF-7 by the RT-PCR method and was cloned into the plasmid pcDNA3. The vector was identified by the double digestion with restriction enzymes Kpn I and Xho I and was sequenced. The cDNA of UHRF1 was transfected into human breast cancer cells MDA-MB-231 by Lipofactamin2000. The positive clones were selected by G418. The expression of the UHRF1 was detected by RT-PCR and Western blot analysis. Results: The recombinant eukaryotic expression vector pcDNA3-UHRF1 was digested with Kpn I and BamH I, and the electrophoresis of the digested products showed two fragments; 2.3kb fragment of UHRF1 and 5.4 kb fragment of pcDNA3, and the sequence inserted was identical to the published sequence. The MDA-MB-231 cells transfected with the pcDNA3-UHRF1 plasmid expressed a high level of the UHRF1 mRNA and protein. Conclusion: The recombinant eukaryotic cell expression vector of pcDNA3-UHRF1 is constructed successfully. The recombinant plasmid pcDNA3-UHRF1 can provide a very useful tool and lay an important foundation for the research on the function of UHRF1. (authors)
The eukaryotic cytoskeleton evolved from prokaryotic cytomotive filaments. Prokaryotic filament systems show bewildering structural and dynamic complexity and, in many aspects, prefigure the self-organizing properties of the eukaryotic cytoskeleton. Here, the dynamic properties of the prokaryotic and eukaryotic cytoskeleton are compared, and how these relate to function and evolution of organellar networks is discussed. The evolution of new aspects of filament dynamics in eukaryotes, including severing and branching, and the advent of molecular motors converted the eukaryotic cytoskeleton into a self-organizing "active gel," the dynamics of which can only be described with computational models. Advances in modeling and comparative genomics hold promise of a better understanding of the evolution of the self-organizing cytoskeleton in early eukaryotes, and its role in the evolution of novel eukaryotic functions, such as amoeboid motility, mitosis, and ciliary swimming. Copyright © 2014 Cold Spring Harbor Laboratory Press; all rights reserved.
The eocyte hypothesis, in which Eukarya emerged from within Archaea, has been boosted by the description of a new candidate archaeal phylum, “Lokiarchaeota”, from metagenomic data. Eukarya branch within Lokiarchaeota in a tree reconstructed from the concatenation of 36 universal proteins. However, individual phylogenies revealed that lokiarchaeal proteins sequences have different evolutionary histories. The individual markers phylogenies revealed at least two subsets of proteins, either supporting the Woese or the Eocyte tree of life. Strikingly, removal of a single protein, the elongation factor EF2, is sufficient to break the Eukaryotes-Lokiarchaea affiliation. Our analysis suggests that the three lokiarchaeal EF2 proteins have a chimeric organization that could be due to contamination and/or homologous recombination with patches of eukaryotic sequences. A robust phylogenetic analysis of RNA polymerases with a new dataset indicates that Lokiarchaeota and related phyla of the Asgard superphylum are sister group to Euryarchaeota, not to Eukarya, and supports the monophyly of Archaea with their rooting in the branch leading to Thaumarchaeota. PMID:28604769
Goodstadt, Leo; Ponting, Chris P
Accurate predictions of orthology and paralogy relationships are necessary to infer human molecular function from experiments in model organisms. Previous genome-scale approaches to predicting these relationships have been limited by their use of protein similarity and their failure to take into account multiple splicing events and gene prediction errors. We have developed PhyOP, a new phylogenetic orthology prediction pipeline based on synonymous rate estimates, which accurately predicts orthology and paralogy relationships for transcripts, genes, exons, or genomic segments between closely related genomes. We were able to identify orthologue relationships to human genes for 93% of all dog genes from Ensembl. Among 1:1 orthologues, the alignments covered a median of 97.4% of protein sequences, and 92% of orthologues shared essentially identical gene structures. PhyOP accurately recapitulated genomic maps of conserved synteny. Benchmarking against predictions from Ensembl and Inparanoid showed that PhyOP is more accurate, especially in its predictions of paralogy. Nearly half (46%) of PhyOP paralogy predictions are unique. Using PhyOP to investigate orthologues and paralogues in the human and dog genomes, we found that the human assembly contains 3-fold more gene duplications than the dog. Species-specific duplicate genes, or "in-paralogues," are generally shorter and have fewer exons than 1:1 orthologues, which is consistent with selective constraints and mutation biases based on the sizes of duplicated genes. In-paralogues have experienced elevated amino acid and synonymous nucleotide substitution rates. Duplicates possess similar biological functions for either the dog or human lineages. Having accounted for 2,954 likely pseudogenes and gene fragments, and after separating 346 erroneously merged genes, we estimated that the human genome encodes a minimum of 19,700 protein-coding genes, similar to the gene count of nematode worms. PhyOP is a fast and robust
Full Text Available Accurate predictions of orthology and paralogy relationships are necessary to infer human molecular function from experiments in model organisms. Previous genome-scale approaches to predicting these relationships have been limited by their use of protein similarity and their failure to take into account multiple splicing events and gene prediction errors. We have developed PhyOP, a new phylogenetic orthology prediction pipeline based on synonymous rate estimates, which accurately predicts orthology and paralogy relationships for transcripts, genes, exons, or genomic segments between closely related genomes. We were able to identify orthologue relationships to human genes for 93% of all dog genes from Ensembl. Among 1:1 orthologues, the alignments covered a median of 97.4% of protein sequences, and 92% of orthologues shared essentially identical gene structures. PhyOP accurately recapitulated genomic maps of conserved synteny. Benchmarking against predictions from Ensembl and Inparanoid showed that PhyOP is more accurate, especially in its predictions of paralogy. Nearly half (46% of PhyOP paralogy predictions are unique. Using PhyOP to investigate orthologues and paralogues in the human and dog genomes, we found that the human assembly contains 3-fold more gene duplications than the dog. Species-specific duplicate genes, or "in-paralogues," are generally shorter and have fewer exons than 1:1 orthologues, which is consistent with selective constraints and mutation biases based on the sizes of duplicated genes. In-paralogues have experienced elevated amino acid and synonymous nucleotide substitution rates. Duplicates possess similar biological functions for either the dog or human lineages. Having accounted for 2,954 likely pseudogenes and gene fragments, and after separating 346 erroneously merged genes, we estimated that the human genome encodes a minimum of 19,700 protein-coding genes, similar to the gene count of nematode worms. PhyOP is a
Kriventseva, Evgenia V; Tegenfeldt, Fredrik; Petty, Tom J; Waterhouse, Robert M; Simão, Felipe A; Pozdnyakov, Igor A; Ioannidis, Panagiotis; Zdobnov, Evgeny M
Orthology, refining the concept of homology, is the cornerstone of evolutionary comparative studies. With the ever-increasing availability of genomic data, inference of orthology has become instrumental for generating hypotheses about gene functions crucial to many studies. This update of the OrthoDB hierarchical catalog of orthologs (http://www.orthodb.org) covers 3027 complete genomes, including the most comprehensive set of 87 arthropods, 61 vertebrates, 227 fungi and 2627 bacteria (sampling the most complete and representative genomes from over 11,000 available). In addition to the most extensive integration of functional annotations from UniProt, InterPro, GO, OMIM, model organism phenotypes and COG functional categories, OrthoDB uniquely provides evolutionary annotations including rates of ortholog sequence divergence, copy-number profiles, sibling groups and gene architectures. We re-designed the entirety of the OrthoDB website from the underlying technology to the user interface, enabling the user to specify species of interest and to select the relevant orthology level by the NCBI taxonomy. The text searches allow use of complex logic with various identifiers of genes, proteins, domains, ontologies or annotation keywords and phrases. Gene copy-number profiles can also be queried. This release comes with the freely available underlying ortholog clustering pipeline (http://www.orthodb.org/software). © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Bruck, Irina; Perez-Arnaiz, Patricia; Colbert, Max K; Kaplan, Daniel L
The initiation of DNA replication is a highly regulated event in eukaryotic cells to ensure that the entire genome is copied once and only once during S phase. The primary target of cellular regulation of eukaryotic DNA replication initiation is the assembly and activation of the replication fork helicase, the 11-subunit assembly that unwinds DNA at a replication fork. The replication fork helicase, called CMG for Cdc45-Mcm2-7, and GINS, assembles in S phase from the constituent Cdc45, Mcm2-7, and GINS proteins. The assembly and activation of the CMG replication fork helicase during S phase is governed by 2 S-phase specific kinases, CDK and DDK. CDK stimulates the interaction between Sld2, Sld3, and Dpb11, 3 initiation factors that are each required for the initiation of DNA replication. DDK, on the other hand, phosphorylates the Mcm2, Mcm4, and Mcm6 subunits of the Mcm2-7 complex. Sld3 recruits Cdc45 to Mcm2-7 in a manner that depends on DDK, and recent work suggests that Sld3 binds directly to Mcm2-7 and also to single-stranded DNA. Furthermore, recent work demonstrates that Sld3 and its human homolog Treslin substantially stimulate DDK phosphorylation of Mcm2. These data suggest that the initiation factor Sld3/Treslin coordinates the assembly and activation of the eukaryotic replication fork helicase by recruiting Cdc45 to Mcm2-7, stimulating DDK phosphorylation of Mcm2, and binding directly to single-stranded DNA as the origin is melted.
Wuhe, Jike; Zefeng, Wu; Sanhong, Fan; Xuguang, Xi
The FANCJ-like protein family is a class of ATP-dependent helicases that can catalytically unwind duplex DNA along the 5'-3' direction. It is involved in the processes of DNA damage repair, homologous recombination and G-quadruplex DNA unwinding, and plays a critical role in maintaining genome integrity. In this study, we systemically analyzed FNACJ-like proteins from 47 eukaryotic species and discussed their sequences diversity, origin and evolution, motif organization patterns and spatial structure differences. Four members of FNACJ-like proteins, including XPD, CHL1, RTEL1 and FANCJ, were found in eukaryotes, but some of them were seriously deficient in most fungi and some insects. For example, the Zygomycota fungi lost RTEL1, Basidiomycota and Ascomycota fungi lost RTEL1 and FANCJ, and Diptera insect lost FANCJ. FANCJ-like proteins contain canonical motor domains HD1 and HD2, and the HD1 domain further integrates with three unique domains Fe-S, Arch and Extra-D. Fe-S and Arch domains are relatively conservative in all members of the family, but the Extra-D domain is lost in XPD and differs from one another in rest members. There are 7, 10 and 2 specific motifs found from the three unique domains respectively, while 5 and 12 specific motifs are found from HD1 and HD2 domains except the conserved motifs reported previously. By analyzing the arrangement pattern of these specific motifs, we found that RTEL1 and FANCJ are more closer and share two specific motifs Vb2 and Vc in HD2 domain, which are likely related with their G-quadruplex DNA unwinding activity. The evidence of evolution showed that FACNJ-like proteins were originated from a helicase, which has a HD1 domain inserted by extra Fe-S domain and Arch domain. By three continuous gene duplication events and followed specialization, eukaryotes finally possessed the current four members of FANCJ-like proteins.
Uldahl, Kristine Buch; Wu, Linping; Hall, Arnaldur
followed viral uptake, intracellular trafficking and cell viability in human endothelial cells of brain (hCMEC/D3 cells) and umbilical vein (HUVEC) origin. Whereas SMV1 is efficiently internalized into both types of human cells, SSV2 differentiates between HUVECs and hCMEC/D3 cells, thus opening a path......Viruses from the third domain of life, Archaea, exhibit unusual features including extreme stability that allow their survival in harsh environments. In addition, these species have never been reported to integrate into human or any other eukaryotic genomes, and could thus serve for exploration...
Feulner, Philine G. D.; Johnston, Susan E.; Santure, Anna W.; Smadja, Carole M.
Recombination, the exchange of DNA between maternal and paternal chromosomes during meiosis, is an essential feature of sexual reproduction in nearly all multicellular organisms. While the role of recombination in the evolution of sex has received theoretical and empirical attention, less is known about how recombination rate itself evolves and what influence this has on evolutionary processes within sexually reproducing organisms. Here, we explore the patterns of, and processes governing recombination in eukaryotes. We summarize patterns of variation, integrating current knowledge with an analysis of linkage map data in 353 organisms. We then discuss proximate and ultimate processes governing recombination rate variation and consider how these influence evolutionary processes. Genome-wide recombination rates (cM/Mb) can vary more than tenfold across eukaryotes, and there is large variation in the distribution of recombination events across closely related taxa, populations and individuals. We discuss how variation in rate and distribution relates to genome architecture, genetic and epigenetic mechanisms, sex, environmental perturbations and variable selective pressures. There has been great progress in determining the molecular mechanisms governing recombination, and with the continued development of new modelling and empirical approaches, there is now also great opportunity to further our understanding of how and why recombination rate varies. This article is part of the themed issue ‘Evolutionary causes and consequences of recombination rate variation in sexual organisms’. PMID:29109219
Full Text Available The MOB family includes a group of cell cycle-associated proteins highly conserved throughout eukaryotes, whose founding members are implicated in mitotic exit and co-ordination of cell cycle progression with cell polarity and morphogenesis. Here we report the characterization and evolution of the MOB domain-containing proteins as inferred from the 43 eukaryotic genomes so far sequenced. We show that genes for Mob-like proteins are present in at least 41 of these genomes, confi rming the universal distribution of this protein family and suggesting its prominent biological function. The phylogenetic analysis reveals fi ve distinct MOB domain classes, showing a progressive expansion of this family from unicellular to multicellular organisms, reaching the highest number in mammals. Plant Mob genes appear to have evolved from a single ancestor, most likely after the loss of one or more genes during the early stage of Viridiplantae evolutionary history. Three of the Mob classes are widespread among most of the analyzed organisms. The possible biological and molecular function of Mob proteins and their role in conserved signaling pathways related to cell proliferation, cell death and cell polarity are also presented and critically discussed.
Batista, Marcelo Bueno; Chandra, Govind; Monteiro, Rose Adele; de Souza, Emanuel Maltempi; Dixon, Ray
Bacteria adjust the composition of their electron transport chain (ETC) to efficiently adapt to oxygen gradients. This involves differential expression of various ETC components to optimize energy generation. In Herbaspirillum seropedicae, reprogramming of gene expression in response to oxygen availability is controlled at the transcriptional level by three Fnr orthologs. Here, we characterised Fnr regulons using a combination of RNA-Seq and ChIP-Seq analysis. We found that Fnr1 and Fnr3 directly regulate discrete groups of promoters (Groups I and II, respectively), and that a third group (Group III) is co-regulated by both transcription factors. Comparison of DNA binding motifs between the three promoter groups suggests Group III promoters are potentially co-activated by Fnr3-Fnr1 heterodimers. Specific interaction between Fnr1 and Fnr3, detected in two-hybrid assays, was dependent on conserved residues in their dimerization interfaces, indicative of heterodimer formation in vivo. The requirements for co-activation of the fnr1 promoter, belonging to Group III, suggest either sequential activation by Fnr3 and Fnr1 homodimers or the involvement of Fnr3-Fnr1 heterodimers. Analysis of Fnr proteins with swapped activation domains provides evidence that co-activation by Fnr1 and Fnr3 at Group III promoters optimises interactions with RNA polymerase to fine-tune transcription in response to prevailing oxygen concentrations.
Greganova, Eva; Steinmann, Michael; Mäser, Pascal; Fankhauser, Niklaus
Ion transporters are fundamental to life. Due to their ancient origin and conservation in sequence, ion transporters are also particularly well suited for comparative genomics of distantly related species. Here, we perform genome-wide ion transporter profiling as a basis for comparative genomics of eukaryotes. From a given predicted proteome, we identify all bona fide ion channels, ion porters, and ion pumps. Concentrating on unicellular eukaryotes (n = 37), we demonstrate that clustering of species according to their repertoire of ion transporters segregates obligate endoparasites (n = 23) on the one hand, from free-living species and facultative parasites (n = 14) on the other hand. This surprising finding indicates strong convergent evolution of the parasites regarding the acquisition and homeostasis of inorganic ions. Random forest classification identifies transporters of ammonia, plus transporters of iron and other transition metals, as the most informative for distinguishing the obligate parasites. Thus, in silico ionomics further underscores the importance of iron in infection biology and suggests access to host sources of nitrogen and transition metals to be selective forces in the evolution of parasitism. This finding is in agreement with the phenomenon of iron withholding as a primordial antimicrobial strategy of infected mammals.
Dieci, Giorgio; Preti, Milena; Montanini, Barbara
Small nucleolar RNAs (snoRNAs) are one of the most ancient and numerous families of non-protein-coding RNAs (ncRNAs). The main function of snoRNAs - to guide site-specific rRNA modification - is the same in Archaea and all eukaryotic lineages. In contrast, as revealed by recent genomic and RNomic studies, their genomic organization and expression strategies are the most varied. Seemingly snoRNA coding units have adopted, in the course of evolution, all the possible ways of being transcribed, thus providing a unique paradigm of gene expression flexibility. By focusing on representative fungal, plant and animal genomes, we review here all the documented types of snoRNA gene organization and expression, and we provide a comprehensive account of snoRNA expressional freedom by precisely estimating the frequency, in each genome, of each type of genomic organization. We finally discuss the relevance of snoRNA genomic studies for our general understanding of ncRNA family evolution and expression in eukaryotes.
Bonnet, Amandine; Grosso, Ana R; Elkaoutari, Abdessamad; Coleno, Emeline; Presle, Adrien; Sridhara, Sreerama C; Janbon, Guilhem; Géli, Vincent; de Almeida, Sérgio F; Palancade, Benoit
Transcription is a source of genetic instability that can notably result from the formation of genotoxic DNA:RNA hybrids, or R-loops, between the nascent mRNA and its template. Here we report an unexpected function for introns in counteracting R-loop accumulation in eukaryotic genomes. Deletion of endogenous introns increases R-loop formation, while insertion of an intron into an intronless gene suppresses R-loop accumulation and its deleterious impact on transcription and recombination in yeast. Recruitment of the spliceosome onto the mRNA, but not splicing per se, is shown to be critical to attenuate R-loop formation and transcription-associated genetic instability. Genome-wide analyses in a number of distant species differing in their intron content, including human, further revealed that intron-containing genes and the intron-richest genomes are best protected against R-loop accumulation and subsequent genetic instability. Our results thereby provide a possible rationale for the conservation of introns throughout the eukaryotic lineage. Copyright © 2017 Elsevier Inc. All rights reserved.
Full Text Available The detection and amplification of extracellular signals requires the involvement of multiple protein components. In mammalian cells the receptor of activated C kinase (RACK1 is an important scaffolding protein for signal transduction networks. Further, it also performs a critical function in regulating the cell cycle by modulating the G1/S transition. Many eukaryotic cells express RACK1 orthologs, with one example being Cpc2p in the fission yeast Schizosaccharomyces pombe. In contrast to RACK1, Cpc2p has been described to positively regulate, at the ribosomal level, cells entry into M phase. In addition, Cpc2p controls the stress response pathways through an interaction with Msa2p, and sexual development by modulating Ran1p/Pat1p. Here we describe investigations into the role, which Cpc2p performs in controlling the G protein-mediated mating response pathway. Despite structural similarity to Gβ-like subunits, Cpc2p appears not to function at the G protein level. However, upon pheromone stimulation, cells overexpressing Cpc2p display substantial cell morphology defects, disorientation of septum formation and a significantly protracted G1 arrest. Cpc2p has the potential to function at multiple positions within the pheromone response pathway. We provide a mechanistic interpretation of this novel data by linking Cpc2p function, during the mating response, with its previous described interactions with Ran1p/Pat1p. We suggest that overexpressing Cpc2p prolongs the stimulated state of pheromone-induced cells by increasing ste11 gene expression. These data indicate that Cpc2p regulates the pheromone-induced cell cycle arrest in fission yeast by delaying cells entry into S phase.
Zaiko, Anastasija; Schimanski, Kate; Pochon, Xavier; Hopkins, Grant A; Goldstien, Sharyn; Floerl, Oliver; Wood, Susanna A
In this experimental study the patterns in early marine biofouling communities and possible implications for surveillance and environmental management were explored using metabarcoding, viz. 18S ribosomal RNA gene barcoding in combination with high-throughput sequencing. The community structure of eukaryotic assemblages and the patterns of initial succession were assessed from settlement plates deployed in a busy port for one, five and 15 days. The metabarcoding results were verified with traditional morphological identification of taxa from selected experimental plates. Metabarcoding analysis identified > 400 taxa at a comparatively low taxonomic level and morphological analysis resulted in the detection of 25 taxa at varying levels of resolution. Despite the differences in resolution, data from both methods were consistent at high taxonomic levels and similar patterns in community shifts were observed. A high percentage of sequences belonging to genera known to contain non-indigenous species (NIS) were detected after exposure for only one day.
Harb, Omar S; Roos, David S
Over the past 20 years, advances in high-throughput biological techniques and the availability of computational resources including fast Internet access have resulted in an explosion of large genome-scale data sets "big data." While such data are readily available for download and personal use and analysis from a variety of repositories, often such analysis requires access to seldom-available computational skills. As a result a number of databases have emerged to provide scientists with online tools enabling the interrogation of data without the need for sophisticated computational skills beyond basic knowledge of Internet browser utility. This chapter focuses on the Eukaryotic Pathogen Databases (EuPathDB: http://eupathdb.org) Bioinformatic Resource Center (BRC) and illustrates some of the available tools and methods.
Miñarro-Gimenez, Jose Antonio; Egaña Aranguren, Mikel; Martínez Béjar, Rodrigo; Fernández-Breis, Jesualdo Tomás; Madrid, Marisa
Semantic Web technologies like RDF and OWL are currently applied in life sciences to improve knowledge management by integrating disparate information. Many of the systems that perform such task, however, only offer a SPARQL query interface, which is difficult to use for life scientists. We present the OGO system, which consists of a knowledge base that integrates information of orthologous sequences and genetic diseases, providing an easy to use ontology-constrain driven query interface. Such interface allows the users to define SPARQL queries through a graphical process, therefore not requiring SPARQL expertise. Copyright © 2011 Elsevier Inc. All rights reserved.
of these two genes with all other genes of the other one species. In addition, to be considered orthologs, the BBH pairs had to satisfy two conditions ...BBH pair computations employed as part of the outgroup and QuartetS methods, we used the same two conditions as the ones described above. In our...versus proteins. Genetica , 118, 209–216. 4. Serres,M.H., Kerr,A.R., McCormack,T.J. and Riley,M. (2009) Evolution by leaps: gene duplication in bacteria
Full Text Available List Contact us PGDBj - Ortholog DB License License to Use This Database Last updated : 2017/03/07 You may use this database...cifies the license terms regarding the use of this database and the requirements you must follow in using this database.... The license for this database is specified in the Creative Commons A...ttribution-Share Alike 4.0 International . If you use data from this database, please be sure attribute this database...hare Alike 4.0 International is found here . With regard to this database, you are licensed to: freely acces
Huang, Bi; Bao, Lang; Zhong, Qi; Shang, Zheng-ling; Zhang, Hui-dong; Zhang, Ying
To construct the eukaryotic experssion vector of LipL32 gene from Leptospira serovar Lai and express the recombinant plasmid in COS-7 cell. The LipL32 gene was amplified from Leptospira strain 017 genomic DNA by PCR and cloned into pcDNA3.1, through restriction nuclease enzyme digestion. Then the recombinant plasmid was transformed into E.coli DH5alpha. After identified by nuclease digestion, PCR and sequencing analysis, the recombinant vector was transfected into COS-7 cell with lipsome. The expression of the target gene was detected by RT-PCR and Western blot. The eukaryotic experssion vector pcDNA3.1-LipL32 was successfully constructed and stably expressed in COS-7 cell. The eukaryotic recombinant vector of outer membrane protein LipL32 gene from Leptospira serovar Lai can be expressed in mammalian cell, which provides an experimental basis for the application of the Leptospira DNA vaccine.
Jensen, Jacob Kruger [Michigan State Univ., East Lansing, MI (United States). Dept. of Plant Biology; Michigan State Univ., East Lansing, MI (United States). DOE Great Lakes Bioenergy Research Center; Busse-Wicher, Marta [Univ. of Cambridge (United Kingdom). Dept. of Biochemistry; Poulsen, Christian Peter [Carlsberg Research Lab., Copenhagen (Denmark); Fangel, Jonatan Ulrik [Carlsberg Research Lab., Copenhagen (Denmark); Smith, Peter James [Univ. of Georgia, Athens, GA (United States). Complex Carbohydrate Research Center; Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). BioEnergy Science Center (BESC); Yang, Jeong-Yeh [Univ. of Georgia, Athens, GA (United States). Complex Carbohydrate Research Center; Peña, Maria-Jesus [Univ. of Georgia, Athens, GA (United States). Complex Carbohydrate Research Center; Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). BioEnergy Science Center (BESC); Dinesen, Malene Hessellund [Carlsberg Research Lab., Copenhagen (Denmark); Martens, Helle Juel [Univ. of Copenhagen (Denmark). Dept. of Plant and Environmental Sciences; Melkonian, Michael [Univ. zu Koln (Germany). Botanical Inst., Dept. of Biological Sciences; Wong, Gane Ka-Shu [BGI-Shenzhen, Shenzhen, Guangdong (China); Moremen, Kelley W. [Univ. of Georgia, Athens, GA (United States). Complex Carbohydrate Research Center; Wilkerson, Curtis Gene [Michigan State Univ., East Lansing, MI (United States). Dept. of Plant Biology; Michigan State Univ., East Lansing, MI (United States). DOE Great Lakes Bioenergy Research Center; Michigan State Univ., East Lansing, MI (United States). Dept. of Biochemistry and Molecular Biology; Scheller, Henrik Vibe [Joint BioEnergy Inst. (JBEI), Emeryville, CA (United States); Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Environmental Genomics and Systems Biology Division; Dupree, Paul [Univ. of Cambridge (United Kingdom). Dept. of Biochemistry; Ulvskov, Peter [Univ. of Georgia, Athens, GA (United States). Complex Carbohydrate Research Center; Urbanowicz, Breeanna Rae [Univ. of Georgia, Athens, GA (United States). Complex Carbohydrate Research Center; Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). BioEnergy Science Center (BESC); Harholt, Jesper [Carlsberg Research Lab., Copenhagen (Denmark)
Insights into the evolution of plant cell walls have important implications for comprehending these diverse and abundant biological structures. In order to understand the evolving structure-function relationships of the plant cell wall, it is imperative to trace the origin of its different components. The present study is focused on plant 1,4-β-xylan, tracing its evolutionary origin by genome and transcriptome mining followed by phylogenetic analysis, utilizing a large selection of plants and algae. It substantiates the findings by heterologous expression and biochemical characterization of a charophyte alga xylan synthase. Of the 12 known gene classes involved in 1,4-β-xylan formation, XYS1/IRX10 in plants, IRX7, IRX8, IRX9, IRX14 and GUX occurred for the first time in charophyte algae. An XYS1/IRX10 ortholog from Klebsormidium flaccidum, designated K. flaccidumXYLAN SYNTHASE-1 (KfXYS1), possesses 1,4-β-xylan synthase activity, and 1,4-β-xylan occurs in the K. flaccidum cell wall. Finally, these data suggest that plant 1,4-β-xylan originated in charophytes and shed light on the origin of one of the key cell wall innovations to occur in charophyte algae, facilitating terrestrialization and emergence of polysaccharide-based plant cell walls.
Klionsky, Daniel J.; Tomashek, John J.
Describes a cooperative, interactive problem-solving exercise for studying eukaryotic cell structure and function. Highlights the dynamic aspects of movement through the cell. Contains 15 references. (WRM)
Zhang, Yan-Qiong; Wen, Jian-Fan
microRNAs (miRNAs) in higher multicellular eukaryotes have been extensively studied in recent years. Great progresses have also been achieved for miRNAs in unicellular eukaryotes. All these studies not only enrich our knowledge about the complex expression regulation system in diverse organisms, but also have evolutionary significance for understanding the origin of this system. In this review, Authors summarize the recent advance in the studies of miRNA in unicellular eukaryotes, including that on the most primitive unicellular eukaryote--Giardia. The origin and evolution of miRNA system is also discussed.
Zielezinski, Andrzej; Dziubek, Michal; Sliski, Jan; Karlowski, Wojciech M
ORCAN (ORtholog sCANner) is a web-based meta-server for one-click evolutionary and functional annotation of protein sequences. The server combines information from the most popular orthology-prediction resources, including four tools and four online databases. Functional annotation utilizes five additional comparisons between the query and identified homologs, including: sequence similarity, protein domain architectures, functional motifs, Gene Ontology term assignments and a list of associated articles. Furthermore, the server uses a plurality-based rating system to evaluate the orthology relationships and to rank the reference proteins by their evolutionary and functional relevance to the query. Using a dataset of ∼1 million true yeast orthologs as a sample reference set, we show that combining multiple orthology-prediction tools in ORCAN increases the sensitivity and precision by 1-2 percent points. The service is available for free at http://www.combio.pl/orcan/ . email@example.com. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: firstname.lastname@example.org
Moustacchi, E.; Waters, R.; Heude, M.; Chanet, R.
The repair mechanisms of altered DNA are generally less well understood for eukaryotes than they are for prokaryotes and bacteriophages. For mammalian cell lines cultured in vitro the specific labelling of DNA has allowed the biochemical analysis of some of the steps of the repair processes whereas the determination of their genetic controls is, with a few exceptions, obviously difficult. On the other hand, with fungi and more specifically with yeast taken as a model unicellular eukaryotic system, the genetic approach has been extensively explored: radiosensitive mutants are readily detected and genetically analyzed, double and multiple mutants can be constructed and from their responses to irradiation the number of repair pathways involved can be suggested. The lack of thymidine kinase in these organisms has hampered for a certain time the biochemical analysis of repair. However, the recent isolation of yeast strains capable of taking up and incorporating thymidine 5'-monophosphate into their DNA opens new possibilities for the future. In spite of this difficulty, attempts to measure the induction and removal of uv-induced pyrimidine dimers were performed by several groups during the last three years. The two main repair pathways described for E. coli, i.e., the excision-resynthesis and post-replicative recombinational repair pathways, do exist in yeast. The existence of the former pathway is supported not only by indirect evidence but also by biochemical analysis. The rad 1 and rad 2 mutants for instance have been shown to be blocked in the excision of uv-induced pyrimidine dimers. Other loci are epistatic to rad 1 and rad 2 (rad 3 , rad 4 ) and are likely to act on this excision pathway. The genetic control of the mitochondrial response to a uv treatment involves nuclear genes and mitochondrial determinants
Terkelsen, Kasper Munch; Krogh, A.
and quality of reliable gene annotation grows. Results We present a procedure, Agene, that automatically generates a species-specific gene predictor from a set of reliable mRNA sequences and a genome. We apply a Hidden Markov model (HMM) that implements explicit length distribution modelling for all gene......Background The number of sequenced eukaryotic genomes is rapidly increasing. This means that over time it will be hard to keep supplying customised gene finders for each genome. This calls for procedures to automatically generate species-specific gene finders and to re-train them as the quantity...... structure blocks using acyclic discrete phase type distributions. The state structure of the each HMM is generated dynamically from an array of sub-models to include only gene features represented in the training set. Conclusion Acyclic discrete phase type distributions are well suited to model sequence...
Teif, Vladimir B; Rippe, Karsten
A recent study of transcription regulation in Drosophila embryonic development revealed a complex non-monotonic dependence of gene expression on the distance between binding sites of repressor and activator proteins at the corresponding enhancer cis-regulatory modules (Fakhouri et al 2010 Mol. Syst. Biol. 6 341). The repressor efficiency was high at small separations, low around 30 bp, reached a maximum at 50–60 bp, and decreased at larger distances to the activator binding sites. Here, we propose a straightforward explanation for the distance dependence of repressor activity by considering the effect of the presence of a nucleosome. Using a method that considers partial unwrapping of nucleosomal DNA from the histone octamer core, we calculated the dependence of activator binding on the repressor–activator distance and found a quantitative agreement with the distance dependence reported for the Drosophila enhancer element. In addition, the proposed model offers explanations for other distance-dependent effects at eukaryotic enhancers. (communication)
Feng, Justin; Jester, Benjamin W; Tinberg, Christine E; Mandell, Daniel J; Antunes, Mauricio S; Chari, Raj; Morey, Kevin J; Rios, Xavier; Medford, June I; Church, George M; Fields, Stanley; Baker, David
Biosensors for small molecules can be used in applications that range from metabolic engineering to orthogonal control of transcription. Here, we produce biosensors based on a ligand-binding domain (LBD) by using a method that, in principle, can be applied to any target molecule. The LBD is fused to either a fluorescent protein or a transcriptional activator and is destabilized by mutation such that the fusion accumulates only in cells containing the target ligand. We illustrate the power of this method by developing biosensors for digoxin and progesterone. Addition of ligand to yeast, mammalian, or plant cells expressing a biosensor activates transcription with a dynamic range of up to ~100-fold. We use the biosensors to improve the biotransformation of pregnenolone to progesterone in yeast and to regulate CRISPR activity in mammalian cells. This work provides a general methodology to develop biosensors for a broad range of molecules in eukaryotes.
Kramerov, D A; Vassetzky, N S
Short interspersed elements (SINEs) are one of the two most prolific mobile genomic elements in most of the higher eukaryotes. Although their biology is still not thoroughly understood, unusual life cycle of these simple elements amplified as genomic parasites makes their evolution unique in many ways. In contrast to most genetic elements including other transposons, SINEs emerged de novo many times in evolution from available molecules (for example, tRNA). The involvement of reverse transcription in their amplification cycle, huge number of genomic copies and modular structure allow variation mechanisms in SINEs uncommon or rare in other genetic elements (module exchange between SINE families, dimerization, and so on.). Overall, SINE evolution includes their emergence, progressive optimization and counteraction to the cell's defense against mobile genetic elements.
Koslover, Elena; Mogre, Saurabh; Chan, Caleb; Theriot, Julie
The cytoplasm of a living cell is an active environment through which intracellular components move and mix. We explore, using theoretical modeling coupled with microrheological measurements, the efficiency of particle dispersion via different modes of transport within this active environment. In particular, we focus on the role of cytoplasmic flow over different scales in contributing to organelle transport within two different cell types. In motile neutrophil cells, we show that bulk fluid flow associated with rapid cell deformation enhances particle transport to and from the cell periphery. In narrow fungal hyphae, localized flows due to hydrodynamic entrainment are shown to contribute to optimally efficient organelle dispersion. Our results highlight the importance of non-traditional modes of transport associated with flow of the cytoplasmic fluid in the distribution of organelles throughout eukaryotic cells.
Azua-Bustos, A.; Gonzalez-Silva, C.; Mancilla, R.A.; Salas, L.; Palma, R.E.; Wynne, J.J.; McKay, C.P.; Vicuna, R.
Caves offer a stable and protected environment from harsh and changing outside prevailing conditions. Hence, they represent an interesting habitat for studying life in extreme environments. Here, we report the presence of a member of the ancient eukaryote red algae Cyanidium group in a coastal cave of the hyperarid Atacama Desert. This microorganism was found to form a seemingly monospecific biofilm growing under extremely low photon flux levels. Our work suggests that this species, Cyanidium sp. Atacama, is a new member of a recently proposed novel monophyletic lineage of mesophilic "cave" Cyanidium sp., distinct from the remaining three other lineages which are all thermo-acidophilic. The cave described in this work may represent an evolutionary island for life in the midst of the Atacama Desert. ?? Springer Science + Business Media, LLC 2009.
Full Text Available In eukaryotes, short RNAs guide a variety of enzymatic activities that range from RNA editing to translation repression. It is hypothesized that pre-existing proteins evolved to bind and use guide RNA during evolution. However, the capacity of modern proteins to adopt new RNA guides has never been demonstrated. Here we show that Rnt1p, the yeast orthologue of the bacterial dsRNA-specific RNase III, can bind short RNA transcripts and use them as guides for sequence-specific cleavage. Target cleavage occurred at a constant distance from the Rnt1p binding site, leaving the guide RNA intact for subsequent cleavage. Our results indicate that RNase III may trigger sequence-specific RNA degradation independent of the RNAi machinery, and they open the road for a new generation of precise RNA silencing tools that do not trigger a dsRNA-mediated immune response.
Welander, P. V.; Banta, A. B.; Lee, A. K.; Wei, J. H.
Lipid biomarkers are organic molecules preserved in sediments and sedimentary rocks that can function as geological proxies for certain microbial taxa or for specific environmental conditions. These molecular fossils provide a link between organisms and their environments in both modern and ancient settings and have afforded significant insight into ancient climatic events, mass extinctions, and various evolutionary transitions throughout Earth's history. However, the proper interpretation of lipid biomarkers is dependent on a broad understanding of their diagenetic precursors in modern systems. This includes understanding the taphonomic transformations that these molecules undergo, their biosynthetic pathways, and the ecological conditions that affect their cellular production. In this study, we focus on one group of lipid biomarkers - the sterols. These are polycyclic isoprenoidal lipids that have a high preservation potential and play a critical role in the physiology of most eukaryotes. However, the synthesis and function of these lipids in the bacterial domain has not been fully explored. Here we utilize a combination of bioinformatics, microbial genetics, and biochemistry to demonstrate that bacterial sterol producers are more prevalent in environmental metagenomic samples than in the genomic databases of cultured organisms and to identify novel proteins required to synthesize and modify sterols in bacteria. These proteins represent a distinct pathway for sterol synthesis exclusive to bacteria and indicate that sterol synthesis in bacteria may have evolved independently of eukaryotic sterol biosynthesis. Taken together, these results demonstrate how studies in extant bacteria can provide insight into the biological sources and the biosynthetic pathways of specific lipid biomarkers and in turn may allow for more robust interpretation of biomarker signatures.
Stone, Stacie; Sobeck, Alexandra; van Kogelenberg, Margriet; de Graaf, Bendert; Joenje, Hans; Christian, Jan; Hoatlin, Maureen E
Fanconi anemia (FA) is associated with variable developmental abnormalities, bone marrow failure and cancer susceptibility. FANCG/XRCC9 is member of the FA core complex, a group of proteins that control the monoubiquitylation of FANCD2, an event that plays a critical role in maintaining genomic stability. Here we report the identification of the Xenopus laevis ortholog of human FANCG (xFANCG), its expression during development, and its molecular interactions with a partner protein, xFANCA. The xFANCG protein sequence is 47% similar to its human ortholog, with highest conservation in the two putative N-terminal leucine zippers and the tetratricopeptide repeat (TPR) motifs. xFANCG is maternally and zygotically transcribed. Prior to the midblastula stage, a single xFANCG transcript is observed but two additional alternatively spliced mRNAs are detected after the midblastula transition. One of the variants is predicted to encode a novel isoform of xFANCG lacking exon 2. The mutual association between FANCG and FANCA required for their nuclear import is conserved in Xenopus egg extracts. Our data demonstrate that interactions between FANCA and FANCG occur at the earliest stage of vertebrate development and raise the possibility that functionally different isoforms of xFANCG may play a role in early development.
Full Text Available The human cytomegalovirus (HCMV-encoded protein kinase, pUL97, is considered a cyclin-dependent kinase (CDK ortholog, due to shared structural and functional characteristics. The primary mechanism of CDK activation is binding to corresponding cyclins, including cyclin T1, which is the usual regulatory cofactor of CDK9. This study provides evidence of direct interaction between pUL97 and cyclin T1 using yeast two-hybrid and co-immunoprecipitation analyses. Confocal immunofluorescence revealed partial colocalization of pUL97 with cyclin T1 in subnuclear compartments, most pronounced in viral replication centres. The distribution patterns of pUL97 and cyclin T1 were independent of HCMV strain and host cell type. The sequence domain of pUL97 responsible for the interaction with cyclin T1 was between amino acids 231–280. Additional co-immunoprecipitation analyses showed cyclin B1 and cyclin A as further pUL97 interaction partners. Investigation of the pUL97-cyclin T1 interaction in an ATP consumption assay strongly suggested phosphorylation of pUL97 by the CDK9/cyclin T1 complex in a substrate concentration-dependent manner. This is the first demonstration of interaction between a herpesviral CDK ortholog and cellular cyclins.
Full Text Available Background and objective Raf is a key molecule in the Ras-Raf-MEK-ERK signal transduction pathway and is highly activated in different human carcinomas. However, its biological functions and regulation mechanisms are still unclear. The aims of this study were to construct eukaryotic expression vectors with Raf full encoding region, truncated amino-terminus and carboxyl-terminus, respectively. Methods Eukaryotic expression vectors of pCMV-Tag2b-Raf-1, pCMV-Tag2b-N-Raf and pCMV-Tag2b-C-Raf were constructed by gene recombination technique and confirmed by restriction enzyme analysis and DNA sequencing. Furthermore, the expression of these fusion proteins was detected by western blot in transient transfected 293T cells. Results The sequences and open reading frames of these three vectors were completely consistent with experimental design. All target proteins can be detected in 293T cells. Conclusion Eukaryotic expression vectors of pCMV-Tag2b-Raf-1, pCMV-Tag2b-N-Raf and pCMV-Tag2b-C-Raf were successfully constructed and can be expressed in 293T cells.
Driscoll, Timothy P; Verhoeve, Victoria I; Guillotte, Mark L; Lehman, Stephanie S; Rennoll, Sherri A; Beier-Sexton, Magda; Rahman, M Sayeedur; Azad, Abdu F; Gillespie, Joseph J
Reductive genome evolution has purged many metabolic pathways from obligate intracellular Rickettsia ( Alphaproteobacteria ; Rickettsiaceae ). While some aspects of host-dependent rickettsial metabolism have been characterized, the array of host-acquired metabolites and their cognate transporters remains unknown. This dearth of information has thwarted efforts to obtain an axenic Rickettsia culture, a major impediment to conventional genetic approaches. Using phylogenomics and computational pathway analysis, we reconstructed the Rickettsia metabolic and transport network, identifying 51 host-acquired metabolites (only 21 previously characterized) needed to compensate for degraded biosynthesis pathways. In the absence of glycolysis and the pentose phosphate pathway, cell envelope glycoconjugates are synthesized from three imported host sugars, with a range of additional host-acquired metabolites fueling the tricarboxylic acid cycle. Fatty acid and glycerophospholipid pathways also initiate from host precursors, and import of both isoprenes and terpenoids is required for the synthesis of ubiquinone and the lipid carrier of lipid I and O-antigen. Unlike metabolite-provisioning bacterial symbionts of arthropods, rickettsiae cannot synthesize B vitamins or most other cofactors, accentuating their parasitic nature. Six biosynthesis pathways contain holes (missing enzymes); similar patterns in taxonomically diverse bacteria suggest alternative enzymes that await discovery. A paucity of characterized and predicted transporters emphasizes the knowledge gap concerning how rickettsiae import host metabolites, some of which are large and not known to be transported by bacteria. Collectively, our reconstructed metabolic network offers clues to how rickettsiae hijack host metabolic pathways. This blueprint for growth determinants is an important step toward the design of axenic media to rescue rickettsiae from the eukaryotic cell. IMPORTANCE A hallmark of obligate intracellular
Speijer, Dave; Lukeš, Julius; Eliáš, Marek
Sexual reproduction and clonality in eukaryotes are mostly seen as exclusive, the latter being rather exceptional. This view might be biased by focusing almost exclusively on metazoans. We analyze and discuss reproduction in the context of extant eukaryotic diversity, paying special attention to
Møller, Henrik D.; Bojsen, Rasmus Kenneth; Tachibana, Chris
Extrachromosomal circular DNAs (eccDNAs) are common genetic elements in Saccharomyces cerevisiae and are reported in other eukaryotes as well. EccDNAs contribute to genetic variation among somatic cells in multicellular organisms and to evolution of unicellular eukaryotes. Sensitive methods for d...
The present invention relates to the use of transcriptional activators from prokaryotic organisms for use in eukaryotic cells, such as yeast as sensors of intracellular and extracellular accumulation of a ligand or metabolite specifically activating this transcriptional activator in a eukaryot...
Møller, Henrik D.; Bojsen, Rasmus Kenneth; Tachibana, Chris
Extrachromosomal circular DNAs (eccDNAs) are common genetic elements in Saccharomyces cerevisiae and are reported in other eukaryotes as well. EccDNAs contribute to genetic variation among somatic cells in multicellular organisms and to evolution of unicellular eukaryotes. Sensitive methods...
Dey, Gautam; Thattai, Mukund; Baum, Buzz
If eukaryotes arose through a merger between archaea and bacteria, what did the first true eukaryotic cell look like? A major step toward an answer came with the discovery of Lokiarchaeum, an archaeon whose genome encodes small GTPases related to those used by eukaryotes to regulate membrane traffic. Although 'Loki' cells have yet to be seen, their existence has prompted the suggestion that the archaeal ancestor of eukaryotes engulfed the future mitochondrion by phagocytosis. We propose instead that the archaeal ancestor was a relatively simple cell, and that eukaryotic cellular organization arose as the result of a gradual transfer of bacterial genes and membranes driven by an ever-closer symbiotic partnership between a bacterium and an archaeon. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Liljeruhm, Josefine; Funk, Saskia K; Tietscher, Sandra; Edlund, Anders D; Jamal, Sabri; Wistrand-Yuen, Pikkei; Dyrhage, Karl; Gynnå, Arvid; Ivermark, Katarina; Lövgren, Jessica; Törnblom, Viktor; Virtanen, Anders; Lundin, Erik R; Wistrand-Yuen, Erik; Forster, Anthony C
Coral reefs are colored by eukaryotic chromoproteins (CPs) that are homologous to green fluorescent protein. CPs differ from fluorescent proteins (FPs) by intensely absorbing visible light to give strong colors in ambient light. This endows CPs with certain advantages over FPs, such as instrument-free detection uncomplicated by ultra-violet light damage or background fluorescence, efficient Förster resonance energy transfer (FRET) quenching, and photoacoustic imaging. Thus, CPs have found utility as genetic markers and in teaching, and are attractive for potential cell biosensor applications in the field. Most near-term applications of CPs require expression in a different domain of life: bacteria. However, it is unclear which of the eukaryotic CP genes might be suitable and how best to assay them. Here, taking advantage of codon optimization programs in 12 cases, we engineered 14 CP sequences (meffRed, eforRed, asPink, spisPink, scOrange, fwYellow, amilGFP, amajLime, cjBlue, meffBlue, aeBlue, amilCP, tsPurple and gfasPurple) into a palette of Escherichia coli BioBrick plasmids. BioBricks comply with synthetic biology's most widely used, simplified, cloning standard. Differences in color intensities, maturation times and fitness costs of expression were compared under the same conditions, and visible readout of gene expression was quantitated. A surprisingly large variation in cellular fitness costs was found, resulting in loss of color in some overnight liquid cultures of certain high-copy-plasmid-borne CPs, and cautioning the use of multiple CPs as markers in competition assays. We solved these two problems by integrating pairs of these genes into the chromosome and by engineering versions of the same CP with very different colors. Availability of 14 engineered CP genes compared in E. coli , together with chromosomal mutants suitable for competition assays, should simplify and expand CP study and applications. There was no single plasmid-borne CP that combined
Dominic J. Bennett
Full Text Available The exceptional increase in molecular DNA sequence data in open repositories is mirrored by an ever-growing interest among evolutionary biologists to harvest and use those data for phylogenetic inference. Many quality issues, however, are known and the sheer amount and complexity of data available can pose considerable barriers to their usefulness. A key issue in this domain is the high frequency of sequence mislabeling encountered when searching for suitable sequences for phylogenetic analysis. These issues include, among others, the incorrect identification of sequenced species, non-standardized and ambiguous sequence annotation, and the inadvertent addition of paralogous sequences by users. Taken together, these issues likely add considerable noise, error or bias to phylogenetic inference, a risk that is likely to increase with the size of phylogenies or the molecular datasets used to generate them. Here we present a software package, phylotaR that bypasses the above issues by using instead an alignment search tool to identify orthologous sequences. Our package builds on the framework of its predecessor, PhyLoTa, by providing a modular pipeline for identifying overlapping sequence clusters using up-to-date GenBank data and providing new features, improvements and tools. We demonstrate and test our pipeline’s effectiveness by presenting trees generated from phylotaR clusters for two large taxonomic clades: Palms and primates. Given the versatility of this package, we hope that it will become a standard tool for any research aiming to use GenBank data for phylogenetic analysis.
Hurni, Severine; Brunner, Susanne; Stirnweis, Daniel; Herren, Gerhard; Peditto, David; McIntosh, Robert A; Keller, Beat
The powdery mildew resistance gene Pm8 derived from rye is located on a 1BL.1RS chromosome translocation in wheat. However, some wheat lines with this translocation do not show resistance to isolates of the wheat powdery mildew pathogen avirulent to Pm8 due to an unknown genetically dominant suppression mechanism. Here we show that lines with suppressed Pm8 activity contain an intact and expressed Pm8 gene. Therefore, the absence of Pm8 function in certain 1BL.1RS-containing wheat lines is not the result of gene loss or mutation but is based on suppression. The wheat gene Pm3, an ortholog of rye Pm8, suppressed Pm8-mediated powdery mildew resistance in lines containing Pm8 in a transient single-cell expression assay. This result was further confirmed in transgenic lines with combined Pm8 and Pm3 transgenes. Expression analysis revealed that suppression is not the result of gene silencing, either in wheat 1BL.1RS translocation lines carrying Pm8 or in transgenic genotypes with both Pm8 and Pm3 alleles. In addition, a similar abundance of the PM8 and PM3 proteins in single or double homozygous transgenic lines suggested that a post-translational mechanism is involved in suppression of Pm8. Co-expression of Pm8 and Pm3 genes in Nicotiana benthamiana leaves followed by co-immunoprecipitation analysis showed that the two proteins interact. Therefore, the formation of a heteromeric protein complex might result in inefficient or absent signal transmission for the defense reaction. These data provide a molecular explanation for the suppression of resistance genes in certain genetic backgrounds and suggest ways to circumvent it in future plant breeding. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.
Wang, Xin-Cun; Liu, Chang; Huang, Liang; Bengtsson-Palme, Johan; Chen, Haimei; Zhang, Jian-Hui; Cai, Dayong; Li, Jian-Qin
A DNA barcode is a short piece of DNA sequence used for species determination and discovery. The internal transcribed spacer (ITS/ITS2) region has been proposed as the standard DNA barcode for fungi and seed plants and has been widely used in DNA barcoding analyses for other biological groups, for example algae, protists and animals. The ITS region consists of both ITS1 and ITS2 regions. Here, a large-scale meta-analysis was carried out to compare ITS1 and ITS2 from three aspects: PCR amplification, DNA sequencing and species discrimination, in terms of the presence of DNA barcoding gaps, species discrimination efficiency, sequence length distribution, GC content distribution and primer universality. In total, 85 345 sequence pairs in 10 major groups of eukaryotes, including ascomycetes, basidiomycetes, liverworts, mosses, ferns, gymnosperms, monocotyledons, eudicotyledons, insects and fishes, covering 611 families, 3694 genera, and 19 060 species, were analysed. Using similarity-based methods, we calculated species discrimination efficiencies for ITS1 and ITS2 in all major groups, families and genera. Using Fisher's exact test, we found that ITS1 has significantly higher efficiencies than ITS2 in 17 of the 47 families and 20 of the 49 genera, which are sample-rich. By in silico PCR amplification evaluation, primer universality of the extensively applied ITS1 primers was found superior to that of ITS2 primers. Additionally, shorter length of amplification product and lower GC content was discovered to be two other advantages of ITS1 for sequencing. In summary, ITS1 represents a better DNA barcode than ITS2 for eukaryotic species. © 2014 John Wiley & Sons Ltd.
Full Text Available Abstract Background Plant leucine-rich repeat receptor-like kinases (LRR-RLKs are receptor kinases that contain LRRs in their extracellular domain. In the last 15 years, many research groups have demonstrated major roles played by LRR-RLKs in plants during almost all developmental processes throughout the life of the plant and in defense/resistance against a large range of pathogens. Recently, a breakthrough has been made in this field that challenges the dogma of the specificity of plant LRR-RLKs. Results We analyzed ~1000 complete genomes and show that LRR-RK genes have now been identified in 8 non-plant genomes. We performed an exhaustive phylogenetic analysis of all of these receptors, revealing that all of the LRR-containing receptor subfamilies form lineage-specific clades. Our results suggest that the association of LRRs with RKs appeared independently at least four times in eukaryotic evolutionary history. Moreover, the molecular evolutionary history of the LRR-RKs found in oomycetes is reminiscent of the pattern observed in plants: expansion with amplification/deletion and evolution of the domain organization leading to the functional diversification of members of the gene family. Finally, the expression data suggest that oomycete LRR-RKs may play a role in several stages of the oomycete life cycle. Conclusions In view of the key roles that LRR-RLKs play throughout the entire lifetime of plants and plant-environment interactions, the emergence and expansion of this type of receptor in several phyla along the evolution of eukaryotes, and particularly in oomycete genomes, questions their intrinsic functions in mimicry and/or in the coevolution of receptors between hosts and pathogens.
Shemi, Adva; Ben-Dor, Shifra; Vardi, Assaf
Aquatic photosynthetic eukaryotes represent highly diverse groups (green, red, and chromalveolate algae) derived from multiple endosymbiosis events, covering a wide spectrum of the tree of life. They are responsible for about 50% of the global photosynthesis and serve as the foundation for oceanic and fresh water food webs. Although the ecophysiology and molecular ecology of some algal species are extensively studied, some basic aspects of algal cell biology are still underexplored. The recent wealth of genomic resources from algae has opened new frontiers to decipher the role of cell signaling pathways and their function in an ecological and biotechnological context. Here, we took a bioinformatic approach to explore the distribution and conservation of TOR and autophagy-related (ATG) proteins (Atg in yeast) in diverse algal groups. Our genomic analysis demonstrates conservation of TOR and ATG proteins in green algae. In contrast, in all 5 available red algal genomes, we could not detect the sequences that encode for any of the 17 core ATG proteins examined, albeit TOR and its interacting proteins are conserved. This intriguing data suggests that the autophagy pathway is not conserved in red algae as it is in the entire eukaryote domain. In contrast, chromalveolates, despite being derived from the red-plastid lineage, retain and express ATG genes, which raises a fundamental question regarding the acquisition of ATG genes during algal evolution. Among chromalveolates, Emiliania huxleyi (Haptophyta), a bloom-forming coccolithophore, possesses the most complete set of ATG genes, and may serve as a model organism to study autophagy in marine protists with great ecological significance. PMID:25915714
Greshake, Bastian; Zehr, Simonida; Dal Grande, Francesco; Meiser, Anjuli; Schmitt, Imke; Ebersberger, Ingo
Whole-genome shotgun sequencing of multispecies communities using only a single library layout is commonly used to assess taxonomic and functional diversity of microbial assemblages. Here, we investigate to what extent such metagenome skimming approaches are applicable for in-depth genomic characterizations of eukaryotic communities, for example lichens. We address how to best assemble a particular eukaryotic metagenome skimming data, what pitfalls can occur, and what genome quality can be expected from these data. To facilitate a project-specific benchmarking, we introduce the concept of twin sets, simulated data resembling the outcome of a particular metagenome sequencing study. We show that the quality of genome reconstructions depends essentially on assembler choice. Individual tools, including the metagenome assemblers Omega and MetaVelvet, are surprisingly sensitive to low and uneven coverages. In combination with the routine of assembly parameter choice to optimize the assembly N50 size, these tools can preclude an entire genome from the assembly. In contrast, MIRA, an all-purpose overlap assembler, and SPAdes, a multisized de Bruijn graph assembler, facilitate a comprehensive view on the individual genomes across a wide range of coverage ratios. Testing assemblers on a real-world metagenome skimming data from the lichen Lasallia pustulata demonstrates the applicability of twin sets for guiding method selection. Furthermore, it reveals that the assembly outcome for the photobiont Trebouxia sp. falls behind the a priori expectation given the simulations. Although the underlying reasons remain still unclear, this highlights that further studies on this organism require special attention during sequence data generation and downstream analysis. © 2015 John Wiley & Sons Ltd.
Han, S.; Tainer, J.A.
ADP-ribosylation is a widely occurring and biologically critical covalent chemical modification process in pathogenic mechanisms, intracellular signaling systems, DNA repair, and cell division. The reaction is catalyzed by ADP-ribosyltransferases, which transfer the ADP-ribose moiety of NAD to a target protein with nicotinamide release. A family of bacterial toxins and eukaryotic enzymes has been termed the mono-ADP-ribosyltransferases, in distinction to the poly-ADP-ribosyltransferases, which catalyze the addition of multiple ADP-ribose groups to the carboxyl terminus of eukaryotic nucleoproteins. Despite the limited primary sequence homology among the different ADP-ribosyltransferases, a central cleft bearing NAD-binding pocket formed by the two perpendicular b-sheet core has been remarkably conserved between bacterial toxins and eukaryotic mono- and poly-ADP-ribosyltransferases. The majority of bacterial toxins and eukaryotic mono-ADP-ribosyltransferases are characterized by conserved His and catalytic Glu residues. In contrast, Diphtheria toxin, Pseudomonas exotoxin A, and eukaryotic poly-ADP-ribosyltransferases are characterized by conserved Arg and catalytic Glu residues. The NAD-binding core of a binary toxin and a C3-like toxin family identified an ARTT motif (ADP-ribosylating turn-turn motif) that is implicated in substrate specificity and recognition by structural and mutagenic studies. Here we apply structure-based sequence alignment and comparative structural analyses of all known structures of ADP-ribosyltransfeases to suggest that this ARTT motif is functionally important in many ADP-ribosylating enzymes that bear a NAD binding cleft as characterized by conserved Arg and catalytic Glu residues. Overall, structure-based sequence analysis reveals common core structures and conserved active sites of ADP-ribosyltransferases to support similar NAD binding mechanisms but differing mechanisms of target protein binding via sequence variations within the ARTT
Yeats, Trevor H.; Huang, Wenlin; Chatterjee, Subhasish
synthases within the large GDSL superfamily. We demonstrate that members of this ancient and conserved family of cutin synthase‐like (CUS) proteins act as polyester synthases with negligible hydrolytic activity. Moreover, solution‐state NMR analysis indicates that CD1 catalyzes the formation of primarily...... of hydroxyacylglycerol precursors, catalyzed by the GDSL‐motif lipase/hydrolase family protein (GDSL) Cutin Deficient 1 (CD1). Here, we present additional biochemical characterization of CD1 and putative orthologs from Arabidopsis thaliana and the moss Physcomitrella patens, which represent a distinct clade of cutin...... linear cutin oligomeric products in vitro. These results reveal a conserved mechanism of cutin polyester synthesis in land plants, and suggest that elaborations of the linear polymer, such as branching or cross‐linking, may require additional, as yet unknown, factors....
Ondracek, Caitlin R; Frappier, Vincent; Ringel, Alison E; Wolberger, Cynthia; Guarente, Leonard
Sirtuin enzymes depend on NAD + to catalyze protein deacetylation. Therefore, the lowering of NAD + during aging leads to decreased sirtuin activity and may speed up aging processes in laboratory animals and humans. In this study, we used a genetic screen to identify two mutations in the catalytic domain of yeast Sir2 that allow the enzyme to function in an NAD + -depleted environment. These mutant enzymes give rise to a significant increase of yeast replicative lifespan and increase deacetylation by the Sir2 ortholog, SIRT1, in mammalian cells. Our data suggest that these mutations increase the stability of the conserved catalytic sirtuin domain, thereby increasing the catalytic efficiency of the mutant enzymes. Our approach to identifying sirtuin mutants that permit function in NAD + -limited environments may inform the design of small molecules that can maintain sirtuin activity in aging organisms. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Hiraiwa, Tetsuya; Nishikawa, Masatoshi; Shibata, Tatsuo; Nagamatsu, Akihiro; Akuzawa, Naohiro
Eukaryotic chemotaxis is usually mediated by intracellular signals that tend to localize at the front or back of the cell. Such intracellular polarities frequently require no extracellular guidance cues, indicating that spontaneous polarization occurs in the signal network. Spontaneous polarization activity is considered relevant to the persistent motions in random cell migrations and chemotaxis. In this study, we propose a theoretical model that connects spontaneous intracellular polarity and motile ability in a chemoattractant solution. We demonstrate that the intracellular polarity can enhance the accuracy of chemotaxis. Chemotactic accuracy should also depend on chemoattractant concentration through the concentration-dependent correlation time in the polarity direction. Both the polarity correlation time and the chemotactic accuracy depend on the degree of responsiveness to the chemical gradient. We show that optimally accurate chemotaxis occurs at an intermediate responsiveness of intracellular polarity. Experimentally, we find that the persistence time of randomly migrating Dictyostelium cells depends on the chemoattractant concentration, as predicted by our theory. At the optimum responsiveness, this ameboid cell can enhance its chemotactic accuracy tenfold. (paper)
Chau, Meihing; Cai, Meng Li; Timkovich, R.
1 H NMR spectroscopy has been used to examine ferrocytochrome c-551 from Pseudomonas aeruginosa (ATCC 19429) over the pH range 3.5-10.6 and the temperature range 4-60 degree C. Resonance assignments are proposed for main-chain and side-chain protons. Comparison of results for cytochrome c-551 to recently assigned spectra for horse cytochrome c and mutants of yeast iso-1 cytochrome reveals some unique resonances with unusual chemical shifts in all cytochromes that may serve as markers for the heme region. Results for cytochrome c-551 indicate that in the smaller prokaryotic cytochrome, all benzoid side chains are rapidly flipping on the NMR time scale. In contrast, in eukaryotic cytochromes there are some rings flipping slowly on the NMR time scale. The ferrocytochrome c-551 undergoes a transition linked to pH with a pK around 7. The pH behavior of assigned resonances provides evidence that the site of protonation is the inner or buried 17-propionic acid heme substituent (IUPAC-IUB porphyrin nomenclature). Conformational heterogeneity has been observed for segments near the inner heme propionate substituent
Welfle, H.; Boehm, S.; Damaschun, G.; Fabian, H.; Gast, K.; Misselwitz, R.; Mueller, J.J.; Zirwer, D.; Filimonov, V.V.; Venyaminov, S.Yu.; Zalkova, T.N.
5S RNAs from rat liver and E. coli have been studied by diffuse X-ray and dynamic light scattering and by infrared and Raman spectroscopy. Identical structures at a resolution of 1 nm can be deduced from the comparison of the experimental X-ray scattering curves and electron distance distribution functions and from the agreement of the shape parameters. A flat shape model with a compact central region and two protruding arms was derived. Double helical stems are eleven-fold helices with a mean base pair distance of 0.28 nm. The number of base pairs (26 GC, 9 AU for E. coli; 27 GC, 9 AU for rat liver) and the degree of base stacking are the same within the experimental error. A very high regularity in the ribophosphate backbone is indicated for both 5S RNAs. The observed structural similarity and the consensus secondary structure pattern derived from comparative sequence analyses suggest the conclusion that prokaryotic and eukaryotic 5S RNAs are in general very similar with respect to their fundamental structural features. (author)
Roelants, Sophie L K W; De Maeseneire, Sofie L; Ciesielska, Katarzyna; Van Bogaert, Inge N A; Soetaert, Wim
Biosurfactants (BSs) are a class of secondary metabolites representing a wide variety of structures that can be produced from renewable feedstock by a wide variety of micro-organisms. They have (potential) applications in the medical world, personal care sector, mining processes, food industry, cosmetics, crop protection, pharmaceuticals, bio-remediation, household detergents, paper and pulp industry, textiles, paint industries, etc. Especially glycolipid BSs like sophorolipids (SLs), rhamnolipids (RLs), mannosylerythritol lipids (MELs) and cellobioselipids (CBLs) have been described to provide significant opportunities to (partially) replace chemical surfactants. The major two factors currently limiting the penetration of BSs into the market are firstly the limited structural variety and secondly the rather high production price linked with the productivity. One of the keys to resolve the above mentioned bottlenecks can be found in the genetic engineering of natural producers. This could not only result in more efficient (economical) recombinant producers, but also in a diversification of the spectrum of available BSs as such resolving both limiting factors at once. Unraveling the genetics behind the biosynthesis of these interesting biological compounds is indispensable for the tinkering, fine tuning and rearrangement of these biological pathways with the aim of obtaining higher yields and a more extensive structural variety. Therefore, this review focuses on recent developments in the investigation of the biosynthesis, genetics and regulation of some important members of the family of the eukaryotic glycolipid BSs (MELs, CBLs and SLs). Moreover, recent biotechnological achievements and the industrial potential of engineered strains are discussed.
Das, Mitali; Singh, Sunita; Pradhan, Satyajit; Narayan, Gopeshwar
As a crucial component of DNA replication licensing system, minichromosome maintenance (MCM) 2-7 complex acts as the eukaryotic DNA replicative helicase. The six related MCM proteins form a heterohexamer and bind with ORC, CDC6, and Cdt1 to form the prereplication complex. Although the MCMs are well known as replicative helicases, their overabundance and distribution patterns on chromatin present a paradox called the "MCM paradox." Several approaches had been taken to solve the MCM paradox and describe the purpose of excess MCMs distributed beyond the replication origins. Alternative functions of these MCMs rather than a helicase had also been proposed. This review focuses on several models and concepts generated to solve the MCM paradox coinciding with their helicase function and provides insight into the concept that excess MCMs are meant for licensing dormant origins as a backup during replication stress. Finally, we extend our view towards the effect of alteration of MCM level. Though an excess MCM constituent is needed for normal cells to withstand stress, there must be a delineation of the threshold level in normal and malignant cells. This review also outlooks the future prospects to better understand the MCM biology.
Drawing on documents both published and archival, this paper explains how the prokaryote-eukaryote dichotomy of the 1960s was constructed, the purposes it served, and what it implied in terms of classification and phylogeny. In doing so, I first show how the concept was attributed to Edouard Chatton and the context in which he introduced the terms. Following, I examine the context in which the terms were reintroduced into biology in 1962 by Roger Stanier and C. B. van Niel. I study the discourse over the subsequent decade to understand how the organizational dichotomy took on the form of a natural classification as the kingdom Monera or superkingdom Procaryotae. Stanier and van Niel admitted that, in regard to constructing a natural classification of bacteria, structural characteristics were no more useful than physiological properties. They repeatedly denied that bacterial phylogenetics was possible. I thus examine the great historical irony that the "prokaryote," in both its organizational and phylogenetic senses, was defined (negatively) on the basis of structure. Finally, we see how phylogenetic research based on 16S rRNA led by Carl Woese and his collaborators confronted the prokaryote concept while moving microbiology to the center of evolutionary biology.
Bauer, Nicholas C; Corbett, Anita H; Doetsch, Paul W
DNA damage is a natural hazard of life. The most common DNA lesions are base, sugar, and single-strand break damage resulting from oxidation, alkylation, deamination, and spontaneous hydrolysis. If left unrepaired, such lesions can become fixed in the genome as permanent mutations. Thus, evolution has led to the creation of several highly conserved, partially redundant pathways to repair or mitigate the effects of DNA base damage. The biochemical mechanisms of these pathways have been well characterized and the impact of this work was recently highlighted by the selection of Tomas Lindahl, Aziz Sancar and Paul Modrich as the recipients of the 2015 Nobel Prize in Chemistry for their seminal work in defining DNA repair pathways. However, how these repair pathways are regulated and interconnected is still being elucidated. This review focuses on the classical base excision repair and strand incision pathways in eukaryotes, considering both Saccharomyces cerevisiae and humans, and extends to some important questions and challenges facing the field of DNA base damage repair. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Cabelli, D.E.; Testa, F.; Mastronicola, D.; Bordi, E.; Pucillo, L.P.; Sarti, P.; Saraiva, L.M.; Giuffre, A.; Teixeira, M.
Unlike superoxide dismutases (SODs), superoxidereductases (SORs) eliminate superoxide anion (O 2 # sm b ullet# - ) not through its dismutation, but via reduction to hydrogen peroxide (H 2 O 2 ) in the presence of an electron donor. The microaerobic protist Giardia intestinalis, responsible for a common intestinal disease in humans, though lacking SOD and other canonical reactive oxygen species-detoxifying systems, is among the very few eukaryotes encoding a SOR yet identified. In this study, the recombinant SOR from Giardia (SOR Gi ) was purified and characterized by pulse radiolysis and stopped-flow spectrophotometry. The protein, isolated in the reduced state, after oxidation by superoxide or hexachloroiridate(IV), yields a resting species (T final ) with Fe 3+ ligated to glutamate or hydroxide depending on pH (apparent pK a = 8.7). Although showing negligible SOD activity, reduced SOR Gi reacts with O 2 # sm b ullet# - with a pH-independent second-order rate constant k 1 = 1.0 x 10 9 M -1 s -1 and yields the ferric-(hydro)peroxo intermediate T 1 ; this in turn rapidly decays to the T final state with pH-dependent rates, without populating other detectable intermediates. Immunoblotting assays show that SOR Gi is expressed in the disease-causing trophozoite of Giardia. We propose that the superoxide-scavenging activity of SOR in Giardia may promote the survival of this air-sensitive parasite in the fairly aerobic proximal human small intestine during infection.
Flor-Parra, Ignacio; Iglesias-Romero, Ana Belén; Chang, Fred
The organization and number of microtubules (MTs) in a cell depend on the proper regulation of MT nucleation. Currently, the mechanism of nucleation is the most poorly understood aspect of MT dynamics. XMAP215/chTOG/Alp14/Stu2 proteins are MT polymerases that stimulate MT polymerization at MT plus ends by binding and releasing tubulin dimers. Although these proteins also localize to MT organizing centers and have nucleating activity in vitro, it is not yet clear whether these proteins participate in MT nucleation in vivo. Here, we demonstrate that in the fission yeast Schizosaccharomyces pombe, the XMAP215 ortholog Alp14 is critical for efficient MT nucleation in vivo. In multiple assays, loss of Alp14 function led to reduced nucleation rate and numbers of interphase MT bundles. Conversely, activation of Alp14 led to increased nucleation frequency. Alp14 associated with Mto1 and γ-tubulin complex components, and artificially targeting Alp14 to the γ-tubulin ring complexes (γ-TuRCs) stimulated nucleation. In imaging individual nucleation events, we found that Alp14 transiently associated with a γ-tubulin particle shortly before the appearance of a new MT. The transforming acidic coiled-coil (TACC) ortholog Alp7 mediated the localization of Alp14 at nucleation sites but not plus ends, and was required for efficient nucleation but not for MT polymerization. Our findings provide the strongest evidence to date that Alp14 serves as a critical MT nucleation factor in vivo. We suggest a model in which Alp14 associates with the γ-tubulin complex in an Alp7-dependent manner to facilitate the assembly or stabilization of the nascent MT. Copyright © 2018 Elsevier Ltd. All rights reserved.
Background Conserved ortholog set (COS) markers are an important functional genomics resource that has greatly improved orthology detection in Asterid species. A comprehensive list of these markers is available at Sol Genomics Network (http://solgenomics.net/) and many of these have been placed on the genetic maps of a number of solanaceous species. Results We amplified over 300 COS markers from eight potato accessions involving two diploid landraces of Solanum tuberosum Andigenum group (formerly classified as S. goniocalyx, S. phureja), and a dihaploid clone derived from a modern tetraploid cultivar of S. tuberosum and the wild species S. berthaultii, S. chomatophilum, and S. paucissectum. By BLASTn (Basic Local Alignment Search Tool of the NCBI, National Center for Biotechnology Information) algorithm we mapped the DNA sequences of these markers into the potato genome sequence. Additionally, we mapped a subset of these markers genetically in potato and present a comparison between the physical and genetic locations of these markers in potato and in comparison with the genetic location in tomato. We found that most of the COS markers are single-copy in the reference genome of potato and that the genetic location in tomato and physical location in potato sequence are mostly in agreement. However, we did find some COS markers that are present in multiple copies and those that map in unexpected locations. Sequence comparisons between species show that some of these markers may be paralogs. Conclusions The sequence-based physical map becomes helpful in identification of markers for traits of interest thereby reducing the number of markers to be tested for applications like marker assisted selection, diversity, and phylogenetic studies. PMID:23758607
Leal, Fernanda Munhoz Dos Anjos; Virginio, Veridiana Gomes; Martello, Carolina Lumertz; Paes, Jéssica Andrade; Borges, Thiago J; Jaeger, Natália; Bonorino, Cristina; Ferreira, Henrique Bunselmeyer
Mycoplasma hyopneumoniae and Mycoplasma flocculare are two genetically close species found in the swine respiratory tract. Despite their similarities, while M. hyopneumoniae is the causative agent of porcine enzootic pneumonia, M. flocculare is a commensal bacterium. Genomic and transcriptional comparative analyses so far failed to explain the difference in pathogenicity between these two species. We then hypothesized that such difference might be, at least in part, explained by amino acid sequence and immunological or functional differences between ortholog surface proteins. In line with that, it was verified that approximately 85% of the ortholog surface proteins from M. hyopneumoniae 7448 and M. flocculare present one or more differential domains. To experimentally assess possible immunological implications of this kind of difference, the extracellular differential domains from one pair of orthologous surface proteins (MHP7448_0612, from M. hyopneumoniae, and MF_00357, from M. flocculare) were expressed in E. coli and used to immunize mice. The recombinant polypeptides (rMHP61267-169 and rMF35767-196, respectively) induced distinct cellular immune responses. While, rMHP61267-169 induced both Th1 and Th2 responses, rMF35767-196 induced just an early pro-inflammatory response. These results indicate that immunological properties determined by differential domains in orthologous surface protein might play a role in pathogenicity, contributing to elicit specific and differential immune responses against each species. Copyright © 2016 Elsevier B.V. All rights reserved.
Chai, Zhao Yang; He, Zhi Li; Deng, Yun Yan; Yang, Yu Feng; Tang, Ying Zhong
Plankton diversity reflects the quality and health of waters and should be monitored as a critical feature of marine ecosystems. This study applied a pair of 28S rRNA gene-specific primers and pyrosequencing to assess the effects of large-scale cultivation of the seaweed Gracilaria lemaneiformis on the biodiversity of eukaryotic plankton community in the coastal water of Guangdong, China. With 1 million sequences (2,221 operational taxonomic units [OTUs]) obtained from 51 samples, we found that the biodiversity of eukaryotic plankton community was significantly higher in the seaweed cultivation area than that in the nearby control area as reflected in OTU richness, evenness (Shannon-Wiener index) and dominance (Simpson index) for total plankton community and its four subcategories when Gracilaria biomass reached the maximum, while no such a significant difference was observed before seaweed inoculation. Our laboratory experiment using an artificial phytoplankton community of nine species observed the same effects of Gracilaria exposure. Principal component analysis and principal coordinates analysis showed the plankton community structure in cultivation area markedly differed from the control area when Gracilaria biomass reached its maximum. Redundancy analysis showed that G. lemaneiformis was the critical factor in controlling the dynamics of eukaryotic plankton communities in the studied coastal ecosystem. Our results explicitly demonstrated G. lemaneiformis cultivation could enhance biodiversity of plankton community via allelopathy, which prevents one or several plankton species from blooming and consequently maintains a relatively higher biodiversity. Our study provided further support for using large-scale G. lemaneiformis cultivation as an effective approach for improving costal ecosystem health. © 2018 John Wiley & Sons Ltd.
Galpert, Deborah; Fernández, Alberto; Herrera, Francisco; Antunes, Agostinho; Molina-Ruiz, Reinaldo; Agüero-Chapin, Guillermin
The development of new ortholog detection algorithms and the improvement of existing ones are of major importance in functional genomics. We have previously introduced a successful supervised pairwise ortholog classification approach implemented in a big data platform that considered several pairwise protein features and the low ortholog pair ratios found between two annotated proteomes (Galpert, D et al., BioMed Research International, 2015). The supervised models were built and tested using a Saccharomycete yeast benchmark dataset proposed by Salichos and Rokas (2011). Despite several pairwise protein features being combined in a supervised big data approach; they all, to some extent were alignment-based features and the proposed algorithms were evaluated on a unique test set. Here, we aim to evaluate the impact of alignment-free features on the performance of supervised models implemented in the Spark big data platform for pairwise ortholog detection in several related yeast proteomes. The Spark Random Forest and Decision Trees with oversampling and undersampling techniques, and built with only alignment-based similarity measures or combined with several alignment-free pairwise protein features showed the highest classification performance for ortholog detection in three yeast proteome pairs. Although such supervised approaches outperformed traditional methods, there were no significant differences between the exclusive use of alignment-based similarity measures and their combination with alignment-free features, even within the twilight zone of the studied proteomes. Just when alignment-based and alignment-free features were combined in Spark Decision Trees with imbalance management, a higher success rate (98.71%) within the twilight zone could be achieved for a yeast proteome pair that underwent a whole genome duplication. The feature selection study showed that alignment-based features were top-ranked for the best classifiers while the runners-up were
Hong-Jing Li; Xiang-Hua Li; Jing-Hua Xiao; Rod A. Wing; Shi-Ping Wang
The rice disease resistance (R) gene Xa3/Xa26 (having also been named Xa3 and Xa26) against Xanthomonas oryzae pv.oryzae (Xoo),which causes bacterial blight disease,belongs to a multiple gene family clustered in chromosome 11 and is from an AA genome rice cultivar (Oryza sativa L.).This family encodes leucine-rich repeat (LRR) receptor kinasetype proteins.Here,we show that the orthologs (alleles) of Xa3/Xa26,Xa3/Xa26-2,and Xa3/Xa26-3,from wild Oryza species O.officinalis (CC genome) and O.minuta (BBCC genome),respectively,were also R genes against Xoo.Xa3/Xa26-2 and Xa3/Xa26-3 conferred resistance to 16 of the 18 Xoo strains examined.Comparative sequence analysis of the Xa3/Xa26 families in the two wild Oryza species showed that Xa3/Xa26-3 appeared to have originated from the CC genome of O.minuta.The predicted proteins encoded by Xa3/Xa26,Xa3/Xa26-2,and Xa3/Xa26-3 share 91-99％ sequence identity and 94-99％ sequence similarity.Transgenic plants carrying a single copy of Xa3/Xa26,Xa3/Xa26-2,or Xa3/Xa26-3,in the same genetic background,showed a similar resistance spectrum to a set of Xoo strains,although plants carrying Xa3/Xa26-2 or Xa3/Xa26-3 showed lower resistance levels than the plants carrying Xa3/Xa26.These results suggest that the Xa3/Xa26 locus predates the speciation of A and C genome,which is approximately 7.5 million years ago.Thus,the resistance specificity of this locus has been conserved for a long time.
Lacroix, Benoît; Citovsky, Vitaly
Besides the massive gene transfer from organelles to the nuclear genomes, which occurred during the early evolution of eukaryote lineages, the importance of horizontal gene transfer (HGT) in eukaryotes remains controversial. Yet, increasing amounts of genomic data reveal many cases of bacterium-to-eukaryote HGT that likely represent a significant force in adaptive evolution of eukaryotic species. However, DNA transfer involved in genetic transformation of plants by Agrobacterium species has traditionally been considered as the unique example of natural DNA transfer and integration into eukaryotic genomes. Recent discoveries indicate that the repertoire of donor bacterial species and of recipient eukaryotic hosts potentially are much wider than previously thought, including donor bacterial species, such as plant symbiotic nitrogen-fixing bacteria (e.g., Rhizobium etli) and animal bacterial pathogens (e.g., Bartonella henselae, Helicobacter pylori), and recipient species from virtually all eukaryotic clades. Here, we review the molecular pathways and potential mechanisms of these trans-kingdom HGT events and discuss their utilization in biotechnology and research.
Keeling Patrick J
Full Text Available Abstract Background Horizontal or lateral transfer of genetic material between distantly related prokaryotes has been shown to play a major role in the evolution of bacterial and archaeal genomes, but exchange of genes between prokaryotes and eukaryotes is not as well understood. In particular, gene flow from eukaryotes to prokaryotes is rarely documented with strong support, which is unusual since prokaryotic genomes appear to readily accept foreign genes. Results Here, we show that abundant marine cyanobacteria in the related genera Synechococcus and Prochlorococcus acquired a key Calvin cycle/glycolytic enzyme from a eukaryote. Two non-homologous forms of fructose bisphosphate aldolase (FBA are characteristic of eukaryotes and prokaryotes respectively. However, a eukaryotic gene has been inserted immediately upstream of the ancestral prokaryotic gene in several strains (ecotypes of Synechococcus and Prochlorococcus. In one lineage this new gene has replaced the ancestral gene altogether. The eukaryotic gene is most closely related to the plastid-targeted FBA from red algae. This eukaryotic-type FBA once replaced the plastid/cyanobacterial type in photosynthetic eukaryotes, hinting at a possible functional advantage in Calvin cycle reactions. The strains that now possess this eukaryotic FBA are scattered across the tree of Synechococcus and Prochlorococcus, perhaps because the gene has been transferred multiple times among cyanobacteria, or more likely because it has been selectively retained only in certain lineages. Conclusion A gene for plastid-targeted FBA has been transferred from red algae to cyanobacteria, where it has inserted itself beside its non-homologous, functional analogue. Its current distribution in Prochlorococcus and Synechococcus is punctate, suggesting a complex history since its introduction to this group.
Nedelcu, Aurora M
Programmed cell death (PCD) represents a significant component of normal growth and development in multicellular organisms. Recently, PCD-like processes have been reported in single-celled eukaryotes, implying that some components of the PCD machinery existed early in eukaryotic evolution. This study provides a comparative analysis of PCD-related sequences across more than 50 unicellular genera from four eukaryotic supergroups: Unikonts, Excavata, Chromalveolata, and Plantae. A complex set of PCD-related sequences that correspond to domains or proteins associated with all main functional classes--from ligands and receptors to executors of PCD--was found in many unicellular lineages. Several PCD domains and proteins previously thought to be restricted to animals or land plants are also present in unicellular species. Noteworthy, the yeast, Saccharomyces cerevisiae--used as an experimental model system for PCD research, has a rather reduced set of PCD-related sequences relative to other unicellular species. The phylogenetic distribution of the PCD-related sequences identified in unicellular lineages suggests that the genetic basis for the evolution of the complex PCD machinery present in extant multicellular lineages has been established early in the evolution of eukaryotes. The shaping of the PCD machinery in multicellular lineages involved the duplication, co-option, recruitment, and shuffling of domains already present in their unicellular ancestors.
Full Text Available Min Wu,1 Aruna Kalyanasundaram,2 Jie Zhu1 1Laboratory of Biomechanics and Engineering, Institute of Biophysics, College of Science, Northwest A&F University, Yangling, Shaanxi, People's Republic of China; 2College of Pharmacology, University of Illinois at Chicago, Chicago, IL, USA Abstract: Mitochondria serve as energy-producing organelles in eukaryotic cells. In addition to providing the energy supply for cells, the mitochondria are also involved in other processes, such as proliferation, differentiation, information transfer, and apoptosis, and play an important role in regulation of cell growth and the cell cycle. In order to achieve these functions, the mitochondria need to move to the corresponding location. Therefore, mitochondrial movement has a crucial role in normal physiologic activity, and any mitochondrial movement disorder will cause irreparable damage to the organism. For example, recent studies have shown that abnormal movement of the mitochondria is likely to be the reason for Charcot–Marie–Tooth disease, amyotrophic lateral sclerosis, Alzheimer's disease, Huntington's disease, Parkinson's disease, and schizophrenia. So, in the cell, especially in the particular polarized cell, the appropriate distribution of mitochondria is crucial to the function and survival of the cell. Mitochondrial movement is mainly associated with the cytoskeleton and related proteins. However, those components play different roles according to cell type. In this paper, we summarize the structural basis of mitochondrial movement, including microtubules, actin filaments, motor proteins, and adaptin, and review studies of the biomechanical mechanisms of mitochondrial movement in different types of cells. Keywords: mitochondrial movement, microtubules, actin filaments, motor proteins, adaptin
Full Text Available Abstract Background Translation initiation site (TIS identification is an important aspect of the gene annotation process, requisite for the accurate delineation of protein sequences from transcript data. We have developed the MetWAMer package for TIS prediction in eukaryotic open reading frames of non-viral origin. MetWAMer can be used as a stand-alone, third-party tool for post-processing gene structure annotations generated by external computational programs and/or pipelines, or directly integrated into gene structure prediction software implementations. Results MetWAMer currently implements five distinct methods for TIS prediction, the most accurate of which is a routine that combines weighted, signal-based translation initiation site scores and the contrast in coding potential of sequences flanking TISs using a perceptron. Also, our program implements clustering capabilities through use of the k-medoids algorithm, thereby enabling cluster-specific TIS parameter utilization. In practice, our static weight array matrix-based indexing method for parameter set lookup can be used with good results in data sets exhibiting moderate levels of 5'-complete coverage. Conclusion We demonstrate that improvements in statistically-based models for TIS prediction can be achieved by taking the class of each potential start-methionine into account pending certain testing conditions, and that our perceptron-based model is suitable for the TIS identification task. MetWAMer represents a well-documented, extensible, and freely available software system that can be readily re-trained for differing target applications and/or extended with existing and novel TIS prediction methods, to support further research efforts in this area.
Clark, Lindsay; Zahm, Jacob A.; Ali, Rustam; Kukula, Maciej; Bian, Liangqiao; Patrie, Steven M.; Gardner, Kevin H.; Rosen, Michael K.; Rosenbaum, Daniel M.
13 C Methyl TROSY NMR spectroscopy has emerged as a powerful method for studying the dynamics of large systems such as macromolecular assemblies and membrane proteins. Specific 13 C labeling of aliphatic methyl groups and perdeuteration has been limited primarily to proteins expressed in E. coli, preventing studies of many eukaryotic proteins of physiological and biomedical significance. We demonstrate the feasibility of efficient 13 C isoleucine δ1-methyl labeling in a deuterated background in an established eukaryotic expression host, Pichia pastoris, and show that this method can be used to label the eukaryotic protein actin, which cannot be expressed in bacteria. This approach will enable NMR studies of previously intractable targets
Ortega, Alvaro D.; Quereda, Juan J; Pucciarelli, M Graciela; García-del Portillo, Francisco
Intracellular bacterial pathogens have evolved distinct lifestyles inside eukaryotic cells. Some pathogens coexist with the infected cell in an obligate intracellular state, whereas others transit between the extracellular and intracellular environment. Adaptation to these intracellular lifestyles
Pearman, John K.; Irigoien, Xabier; Carvalho, Susana
The present study aims to characterize the benthic eukaryotic biodiversity patterns at a coarse taxonomic level in three areas of the central Red Sea (a lagoon, an offshore area in Thuwal and a shallow coastal area near Jeddah) based
Lukeš, Julius; López-García, P.; Louis, E.; Boekhout, T.
Roč. 167, č. 3 (2016), s. 217-221 ISSN 1434-4610 Institutional support: RVO:60077344 Keywords : protist * eukaryotic microorganisms * genomics Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 2.794, year: 2016
Microbial decay processes are one of the mechanisms whereby sewage contamination is reduced in the environment. This decomposition process involves a highly complex array of bacterial and eukaryotic communities from both sewage and ambient waters. However, relatively little is kn...
Eukaryotic microorganisms are naturally present in many water resources and can enter, grow and colonize water treatment and transport systems, including reservoirs, pipes and premise plumbing. In this study, we explored the eukaryotic microbial community structure in water during the (i) production of drinking water in a seawater desalination plant and (ii) transport of the drinking water in the distribution network. The desalination plant treatment involved pre-treatment (e.g. spruce filters), reverse osmosis (RO) membrane filtration and post-treatment steps (e.g. remineralization). 454 pyrosequencing analysis of the 18S rRNA gene revealed a highly diverse (35 phyla) and spatially variable eukaryotic community during water treatment and distribution. The desalination plant feed water contained a typical marine picoeukaryotic community dominated by Stramenopiles, Alveolates and Porifera. In the desalination plant Ascomycota was the most dominant phylum (15.5% relative abundance), followed by Alveolata (11.9%), unclassified fungi clade (10.9%) and Porifera (10.7%). In the drinking water distribution network, an uncultured fungi phylum was the major group (44.0%), followed by Chordata (17.0%), Ascomycota (11.0%) and Arthropoda (8.0%). Fungi constituted 40% of the total eukaryotic community in the treatment plant and the distribution network and their taxonomic composition was dominated by an uncultured fungi clade (55%). Comparing the plant effluent to the network samples, 84 OTUs (2.1%) formed the core eukaryotic community while 35 (8.4%) and 299 (71.5%) constituted unique OTUs in the produced water at the plant and combined tap water samples from the network, respectively. RO membrane filtration treatment significantly changed the water eukaryotic community composition and structure, highlighting the fact that (i) RO produced water is not sterile and (ii) the microbial community in the final tap water is influenced by the downstream distribution system. The study
Haas, B J; Salzberg, S L; Zhu, W; Pertea, M; Allen, J E; Orvis, J; White, O; Buell, C R; Wortman, J R
EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.
Christaki, Urania; Kormas, Konstantinos A; Genitsaris, Savvas; Georges, Clément; Sime-Ngando, Télesphore; Viscogliosi, Eric; Monchy, Sébastien
The objective of this study was to explore the succession of planktonic unicellular eukaryotes by means of 18S rRNA gene tag pyrosequencing in the eastern English Channel (EEC) during the winter to summer transition. The 59 most representative (>0.1%, representing altogether 95% of total reads), unique operational taxonomic units (OTUs) from all samples belonged to 18 known high-level taxonomic groups and 1 unaffiliated clade. The five most abundant OTUs (69.2% of total reads) belonged to Dinophyceae, Cercozoa, Haptophyceae, marine alveolate group I, and Fungi. Cluster and network analysis between samples distinguished the winter, the pre-bloom, the Phaeocystis globosa bloom and the post-bloom early summer conditions. The OTUs-based network revealed that P. globosa showed a relatively low number of connections-most of them negative-with all other OTUs. Fungi were linked to all major taxonomic groups, except Dinophyceae. Cercozoa mostly co-occurred with the Fungi, the Bacillariophyceae and several of the miscellaneous OTUs. This study provided a more detailed exploration into the planktonic succession pattern of the EEC due to its increased depth of taxonomic sampling over previous efforts based on classical monitoring observations. Data analysis implied that the food web concept in a coastal system based on predator-prey (e.g. grazer-phytoplankton) relationships is just a part of the ecological picture; and those organisms exploiting a variety of strategies, such as saprotrophy and parasitism, are persistent and abundant members of the community.
Zhu, Shijia; Beaulaurier, John; Deikus, Gintaras; Wu, Tao; Strahl, Maya; Hao, Ziyang; Luo, Guanzheng; Gregory, James A; Chess, Andrew; He, Chuan; Xiao, Andrew; Sebra, Robert; Schadt, Eric E; Fang, Gang
N6-methyladenine (m6dA) has been discovered as a novel form of DNA methylation prevalent in eukaryotes, however, methods for high resolution mapping of m6dA events are still lacking. Single-molecule real-time (SMRT) sequencing has enabled the detection of m6dA events at single-nucleotide resolution in prokaryotic genomes, but its application to detecting m6dA in eukaryotic genomes has not been rigorously examined. Herein, we identified unique characteristics of eukaryotic m6dA methylomes that fundamentally differ from those of prokaryotes. Based on these differences, we describe the first approach for mapping m6dA events using SMRT sequencing specifically designed for the study of eukaryotic genomes, and provide appropriate strategies for designing experiments and carrying out sequencing in future studies. We apply the novel approach to study two eukaryotic genomes. For green algae, we construct the first complete genome-wide map of m6dA at single nucleotide and single molecule resolution. For human lymphoblastoid cells (hLCLs), joint analyses of SMRT sequencing and independent sequencing data suggest that putative m6dA events are enriched in the promoters of young, full length LINE-1 elements (L1s). These analyses demonstrate a general method for rigorous mapping and characterization of m6dA events in eukaryotic genomes. Published by Cold Spring Harbor Laboratory Press.
Wu, Wenxue; Wang, Lei; Liao, Yu; Huang, Bangqin
To evaluate microbial eukaryotic diversity and distribution in mesoscale processes, we investigated 18S rDNA diversity in a river plume and cyclonic eddy-influenced ecosystem in the southwestern South China Sea (SCS). Restriction fragment length polymorphism analysis was carried out using multiple primer sets. Relative to a wide range of previous similar studies, we observed a significantly higher proportion of sequences of pigmented taxa. Among the photosynthetic groups, Haptophyta accounted for 27.7% of the sequenced clones, which belonged primarily to Prymnesiophyceae. Unexpectedly, five operational taxonomic units of Cryptophyta were closely related to freshwater species. The Chlorophyta mostly fell within the Prasinophyceae, which was comprised of six clades, including Clade III, which is detected in the SCS for the first time in this study. Among the photosynthetic stramenopiles, Chrysophyceae was the most diverse taxon, which included seven clades. The majority of 18S rDNA sequences affiliated with the Dictyochophyceae, Eustigmatophyceae, and Pelagophyceae were closely related to those of pure cultures. The results of redundancy analysis and the permutation Mantel test based on unweighted UniFrac distances, conducted for spatial analyses of the Haptophyta subclades suggested that the Mekong River plume and cyclonic eddy play important roles in regulating microbial eukaryotic diversity and distribution in the southwestern SCS. © 2015 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
J Peter Svensson
Full Text Available Studies in Saccharomyces cerevisiae show that many proteins influence cellular survival upon exposure to DNA damaging agents. We hypothesized that human orthologs of these S. cerevisiae proteins would also be required for cellular survival after treatment with DNA damaging agents. For this purpose, human homologs of S. cerevisiae proteins were identified and mapped onto the human protein-protein interaction network. The resulting human network was highly modular and a series of selection rules were implemented to identify 45 candidates for human toxicity-modulating proteins. The corresponding transcripts were targeted by RNA interference in human cells. The cell lines with depleted target expression were challenged with three DNA damaging agents: the alkylating agents MMS and 4-NQO, and the oxidizing agent t-BuOOH. A comparison of the survival revealed that the majority (74% of proteins conferred either sensitivity or resistance. The identified human toxicity-modulating proteins represent a variety of biological functions: autophagy, chromatin modifications, RNA and protein metabolism, and telomere maintenance. Further studies revealed that MMS-induced autophagy increase the survival of cells treated with DNA damaging agents. In summary, we show that damage recovery proteins in humans can be identified through homology to S. cerevisiae and that many of the same pathways are represented among the toxicity modulators.
Michelle L Castelletto
Full Text Available Based on metabolic and morphological similarities between infective third-stage larvae of parasitic nematodes and dauer larvae of Caenorhabditis elegans, it is hypothesized that similar genetic mechanisms control the development of these forms. In the parasite Strongyloides stercoralis, FKTF-1 is an ortholog of DAF-16, a forkhead transcription factor that regulates dauer larval development in C. elegans. Using transgenesis, we investigated the role of FKTF-1 in S. stercoralis' infective larval development. In first-stage larvae, GFP-tagged recombinant FKTF-1b localizes to the pharynx and hypodermis, tissues remodeled in infective larvae. Activating and inactivating mutations at predicted AKT phosphorylation sites on FKTF-1b give constitutive cytoplasmic and nuclear localization of the protein, respectively, indicating that its post-translational regulation is similar to other FOXO-class transcription factors. Mutant constructs designed to interfere with endogenous FKTF-1b function altered the intestinal and pharyngeal development of the larvae and resulted in some transgenic larvae failing to arrest in the infective stage. Our findings indicate that FKTF-1b is required for proper morphogenesis of S. stercoralis infective larvae and support the overall hypothesis of similar regulation of dauer development in C. elegans and the formation of infective larvae in parasitic nematodes.
Full Text Available One of the fundamental goals of genetics is to understand gene functions and their associated phenotypes. To achieve this goal, in this study we developed a computational algorithm that uses orthology and protein-protein interaction information to infer gene-phenotype associations for multiple species. Furthermore, we developed a web server that provides genome-wide phenotype inference for six species: fly, human, mouse, worm, yeast, and zebrafish. We evaluated our inference method by comparing the inferred results with known gene-phenotype associations. The high Area Under the Curve values suggest a significant performance of our method. By applying our method to two human representative diseases, Type 2 Diabetes and Breast Cancer, we demonstrated that our method is able to identify related Gene Ontology terms and Kyoto Encyclopedia of Genes and Genomes pathways. The web server can be used to infer functions and putative phenotypes of a gene along with the candidate genes of a phenotype, and thus aids in disease candidate gene discovery. Our web server is available at http://jjwanglab.org/PhenoPPIOrth.
Grabowski, Melissa M; Svrzikapa, Nenad; Tissenbaum, Heidi A
Bloom syndrome is caused by mutation of the Bloom helicase (BLM), a member of the RecQ helicase family. Loss of BLM function results in genomic instability that causes a high incidence of cancer. It has been demonstrated that BLM is important for maintaining genomic stability by playing a role in DNA recombination and repair; however, the exact function of BLM is not clearly understood. To determine the mechanism by which BLM controls genomic stability in vivo, we examined the phenotypes caused by mutation of the C. elegans BLM helicase ortholog, HIM-6. We find that the loss of HIM-6 leads to genomic instability as evidenced by an increased number of genomic insertions and deletions, which results in visible random mutant phenotypes. In addition to the mutator phenotype, him-6 mutants have a low brood size, a high incidence of males, a shortened life span, and an increased amount of germ line apoptosis. Upon exposure to high temperature, him-6 mutants that are serially passed become sterile demonstrating a mortal germ line phenotype. Our data suggest a model in which loss of HIM-6 results in genomic instability due to an increased number of DNA lesions, which either cannot be repaired and/or are introduced by low fidelity recombination events. The increased level of genomic instability that leads to him-6(ok412) mutants having a shortened life span.
Michael D Stubenvoll
Full Text Available Centrosomes are critical sites for orchestrating microtubule dynamics, and exhibit dynamic changes in size during the cell cycle. As cells progress to mitosis, centrosomes recruit more microtubules (MT to form mitotic bipolar spindles that ensure proper chromosome segregation. We report a new role for ATX-2, a C. elegans ortholog of Human Ataxin-2, in regulating centrosome size and MT dynamics. ATX-2, an RNA-binding protein, forms a complex with SZY-20 in an RNA-independent fashion. Depleting ATX-2 results in embryonic lethality and cytokinesis failure, and restores centrosome duplication to zyg-1 mutants. In this pathway, SZY-20 promotes ATX-2 abundance, which inversely correlates with centrosome size. Centrosomes depleted of ATX-2 exhibit elevated levels of centrosome factors (ZYG-1, SPD-5, γ-Tubulin, increasing MT nucleating activity but impeding MT growth. We show that ATX-2 influences MT behavior through γ-Tubulin at the centrosome. Our data suggest that RNA-binding proteins play an active role in controlling MT dynamics and provide insight into the control of proper centrosome size and MT dynamics.
Wolf Yuri I
Full Text Available Abstract Background Collections of Clusters of Orthologous Genes (COGs provide indispensable tools for comparative genomic analysis, evolutionary reconstruction and functional annotation of new genomes. Initially, COGs were made for all complete genomes of cellular life forms that were available at the time. However, with the accumulation of thousands of complete genomes, construction of a comprehensive COG set has become extremely computationally demanding and prone to error propagation, necessitating the switch to taxon-specific COG collections. Previously, we reported the collection of COGs for 41 genomes of Archaea (arCOGs. Here we present a major update of the arCOGs and describe evolutionary reconstructions to reveal general trends in the evolution of Archaea. Results The updated version of the arCOG database incorporates 91% of the pangenome of 120 archaea (251,032 protein-coding genes altogether into 10,335 arCOGs. Using this new set of arCOGs, we performed maximum likelihood reconstruction of the genome content of archaeal ancestral forms and gene gain and loss events in archaeal evolution. This reconstruction shows that the last Common Ancestor of the extant Archaea was an organism of greater complexity than most of the extant archaea, probably with over 2,500 protein-coding genes. The subsequent evolution of almost all archaeal lineages was apparently dominated by gene loss resulting in genome streamlining. Overall, in the evolution of Archaea as well as a representative set of bacteria that was similarly analyzed for comparison, gene losses are estimated to outnumber gene gains at least 4 to 1. Analysis of specific patterns of gene gain in Archaea shows that, although some groups, in particular Halobacteria, acquire substantially more genes than others, on the whole, gene exchange between major groups of Archaea appears to be largely random, with no major ‘highways’ of horizontal gene transfer. Conclusions The updated collection
Jonavičienė, Kristina; Studer, Bruno; Asp, Torben
In order to characterize the response of selected grasses to water stress, relative water content (RWC) in leaves and quantum efficiency of photosystem 2 (Fv/Fm) were measured in Phleum pratense L., P. bertolonii DC. and P. phleoides H. Karst. during 6 d of water stress. The results indicated...... differential responses to water stress among the three Phleum species with higher water deficit sensitivity of P. pratense and P. bertolonii than that of P. phleoides. The cDNA-amplified fragment length polymorphism (cDNA-AFLP) technique was applied to identify differentially expressed genes responding...... to water stress in P. pratense. Cloned and sequenced differentially expressed fragments (DEFs) were used for primer design in order to identify orthologous genes in Lolium perenne L. Twelve genes orthologous to P. pratense DEFs were mapped in the L. perenne mapping population VrnA based on a high...
Full Text Available AIM: To construct a recombinant eukaryotic expression vector of rat beta defensin-2(rBD-2, transfect it into the rat corneal epithelial cells with lipofection, determine the expression of target gene in the transfected cells, and discuss the potentiality of recombinant plasmid expressed in corneal epithelial cells, hoping to provide an experimental foundation for further study on the antimicrobial activity of rBD-2 in vitro and in vivo and to assess the probability of defensins as a new application for infectious corneal diseases in the future. METHODS: The synthetic rBD-2 DNA fragment was inserted between the XhoI and BamHI restriction enzyme cutting sites of eukaryotic expression vector pIRES2-ZsGreen1 to construct the recombinant plasmid pIRES2-ZsGreen1-rBD-2, then transformed it into E.coli DH5α, positive clones were screened by kanamycin and identified with restriction endonucleases and sequencing analysis. Transfection into the rat corneal epithelial cells was performed by lipofection. Then the experiment was divided into three groups: rat corneal epithelial cell was transfected with the recombinant plasmid pIRES2- ZsGreen1-rBD-2, rat corneal epithelial cell was transfected with the empty plasmid pIRES2-ZsGreen1 and the non-transfected group. The inverted fluorescence microscope was used to observe the transfection process. At last, the level of rBD-2 mRNA expressed in the transfected cells and the control groups are compared by the real-time fluoresence relative quantitative PCR. RESULTS: The recombinant eukaryotic expression vector of pIRES2-ZsGreen1-rBD-2 was successfully constructed. The level of rBD-2 mRNA in transfected cells was significantly higher than that in control groups through the real-time fluorescence relative quantitative PCR. CONCLUSION: The recombinant eukaryotic expression vector pIRES2-ZsGreen1-rBD-2 could be transfected into rat corneal epithelial cells, and exogenous rBD-2 gene could be transcripted into mRNA in
Rovenská, Miroslava; Hlouchová, Klára; Šácha, Pavel; Mlčochová, Petra; Horák, Vratislav; Zámečník, J.; Bařinka, C.; Konvalinka, Jan
Roč. 68, č. 2 (2008), s. 171-182 ISSN 0270-4137 R&D Projects: GA MŠk 1M0508; GA ČR GA524/04/0102 Institutional research plan: CEZ:AV0Z40550506; CEZ:AV0Z50450515 Keywords : prostate specific membrane antigen * glutamate carboxypeptidase II * animal orthologs * prostate cancer * animal model Subject RIV: CE - Biochemistry Impact factor: 3.069, year: 2008
Enciso-Rodríguez, Felix; Martínez, Rodrigo; Lobo, Mario; Barrero, Luz Stella
The Lulo or naranjilla (Solanum quitoense Lam.) and the tree tomato or tamarillo (Solanum betaceum Cav. Sendt.) are both Andean tropical fruit species with high nutritional value and the potential for becoming premium products in local and export markets. Herein, we present a report on the genetic characterization of 62 accessions of lulos (n = 32) and tree tomatoes (n = 30) through the use of PCR-based markers developed from single-copy conserved orthologous genes (COSII) in other Solanaceae...
S. Andrei Anghel
Full Text Available Members of the evolutionarily conserved Oxa1/Alb3/YidC family mediate membrane protein biogenesis at the mitochondrial inner membrane, chloroplast thylakoid membrane, and bacterial plasma membrane, respectively. Despite their broad phylogenetic distribution, no Oxa1/Alb3/YidC homologs are known to operate in eukaryotic cells outside the endosymbiotic organelles. Here, we present bioinformatic evidence that the tail-anchored protein insertion factor WRB/Get1, the “endoplasmic reticulum (ER membrane complex” subunit EMC3, and TMCO1 are ER-resident homologs of the Oxa1/Alb3/YidC family. Topology mapping and co-evolution-based modeling demonstrate that Get1, EMC3, and TMCO1 share a conserved Oxa1-like architecture. Biochemical analysis of human TMCO1, the only homolog not previously linked to membrane protein biogenesis, shows that it associates with the Sec translocon and ribosomes. These findings suggest a specific biochemical function for TMCO1 and define a superfamily of proteins—the “Oxa1 superfamily”—whose shared function is to facilitate membrane protein biogenesis.
Kaur, Charanpreet; Sharma, Shweta; Hasan, Mohammad Rokebul; Pareek, Ashwani; Singla-Pareek, Sneh L; Sopory, Sudhir K
The glyoxalase system is the ubiquitous pathway for the detoxification of methylglyoxal (MG) in the biological systems. It comprises two enzymes, glyoxalase I (GLYI) and glyoxalase II (GLYII), which act sequentially to convert MG into d-lactate, thereby helping living systems get rid of this otherwise cytotoxic byproduct of metabolism. In addition, a glutathione-independent GLYIII enzyme activity also exists in the biological systems that can directly convert MG to d-lactate. Humans and Escherichia coli possess a single copy of GLYI (encoding either the Ni- or Zn-dependent form) and GLYII genes, which through MG detoxification provide protection against various pathological and disease conditions. By contrast, the plant genome possesses multiple GLYI and GLYII genes with a role in abiotic stress tolerance. Plants possess both Ni 2+ - and Zn 2+ -dependent forms of GLYI, and studies on plant glyoxalases reveal the various unique features of these enzymes distinguishing them from prokaryotic and other eukaryotic glyoxalases. Through this review, we provide an overview of the plant glyoxalase family along with a comparative analysis of glyoxalases across various species, highlighting similarities as well as differences in the biochemical, molecular, and physiological properties of these enzymes. We believe that the evolution of multiple glyoxalases isoforms in plants is an important component of their robust defense strategies.
Zhou, Xiangjun; Sun, Tian-Hu; Wang, Ning; Ling, Hong-Qing; Lu, Shan; Li, Li
The cauliflower (Brassica oleracea var. botrytis) Orange (Or) gene affects plant growth and development in addition to conferring β-carotene accumulation. This study was undertaken to investigate the molecular basis for the effects of the Or gene mutation in on plant growth. The OR protein was found to interact with cauliflower and Arabidopsis eukaryotic release factor 1-2 (eRF1-2), a member of the eRF1 family, by yeast two-hybrid analysis and by bimolecular fluorescence complementation (BiFC) assay. Concomitantly, the Or mutant showed reduced expression of the BoeRF1 family genes. Transgenic cauliflower plants with suppressed expression of BoeRF1-2 and BoeRF1-3 were generated by RNA interference. Like the Or mutant, the BoeRF1 RNAi lines showed increased elongation of the leaf petiole. This long-petiole phenotype was largely caused by enhanced cell elongation, which resulted from increased cell length and elevated expression of genes involved in cell-wall loosening. These findings demonstrate that the cauliflower Or gene controls petiole elongation by suppressing the expression of eRF1 genes, and provide new insights into the molecular mechanism of leaf petiole regulation. © 2010 The Authors. New Phytologist © 2010 New Phytologist Trust.
Aspinall, Tanya V; Gordon, James M B; Bennett, Hayley J; Karahalios, Panagiotis; Bukowski, John-Paul; Walker, Scott C; Engelke, David R; Avis, Johanna M
Ribonuclease MRP is an endonuclease, related to RNase P, which functions in eukaryotic pre-rRNA processing. In Saccharomyces cerevisiae, RNase MRP comprises an RNA subunit and ten proteins. To improve our understanding of subunit roles and enzyme architecture, we have examined protein-protein and protein-RNA interactions in vitro, complementing existing yeast two-hybrid data. In total, 31 direct protein-protein interactions were identified, each protein interacting with at least three others. Furthermore, seven proteins self-interact, four strongly, pointing to subunit multiplicity in the holoenzyme. Six protein subunits interact directly with MRP RNA and four with pre-rRNA. A comparative analysis with existing data for the yeast and human RNase P/MRP systems enables confident identification of Pop1p, Pop4p and Rpp1p as subunits that lie at the enzyme core, with probable addition of Pop5p and Pop3p. Rmp1p is confirmed as an integral subunit, presumably associating preferentially with RNase MRP, rather than RNase P, via interactions with Snm1p and MRP RNA. Snm1p and Rmp1p may act together to assist enzyme specificity, though roles in substrate binding are also indicated for Pop4p and Pop6p. The results provide further evidence of a conserved eukaryotic RNase P/MRP architecture and provide a strong basis for studies of enzyme assembly and subunit function.
Yoon, Ho-Sung; Lee, In-Ae; Lee, Hyoshin; Lee, Byung-Hyun; Jo, Jinki
Glutathione reductase (GR) plays an essential role in a cell's defense against reactive oxygen metabolites by sustaining the reduced status of an important antioxidant glutathione. We constructed a recombinant plasmid based on the expression vector pET-18a that overexpresses a eukaryotic GR from Brassica campestris (BcGR) in Escherichia coli. For comparative analyses, E. coli GR (EcGR) was also subcloned in the same manner. The transformed E. coli with the recombinant constructs accumulated a high level of GR transcripts upon IPTG induction. Also, Western blot analysis showed overproduction of the BcGR protein in a soluble fraction of the transformed E. coli extract. When treated with oxidative stress generating reagents such as paraquat, salicylic acid, and cadmium, the BcGR overproducing E. coli exhibited a higher level of growth and survival rate than the control E. coli strain, but it was not as high as the E. coli strain transformed with the inducible EcGR. The translated amino acid sequences of BcGR and EcGR share 37.3% identity but all the functionally known important residues are conserved. It appears that eukaryotic BcGR functions in a prokaryotic system by providing protection against oxidative damages in E. coli
Full Text Available Unicellular eukaryotes are abundant in most aquatic environments and contribute in several ways to ecosystems. For example, they act as producers, as primary and secondary consumers, and they aid in the decrease and control of bacteria communities and in nutrient cycling. The aim of this study was to describe the abundance and distribution of unicellular eukaryotes in Ponta Porã stream and in its main tributary streams, as well as possible relationships between their presence and water quality on the sampled sites. We selected six sites (locals, considering headwaters (L1 and L2, downstreams (L5 and L6 and the main pollution source (L3. We used similarity and canonical correspondence analysis to evaluate relationships between microorganism abundance and distribution to abiotic variables (dissolved oxygen, pH, temperature, turbidity and conductivity. Abundance was related to the increase of organic and inorganic sediment due to decomposition and erosion. We had a variation on the genera distribution, with slight predominance of ciliates, and some genera used as eutrophicated environmental indicators. The presence of Vorticella, Saprodinium, Paramecium, Metopus and Chilodonella suggests an environment ranging from polysaprobic to olygosaprobic. Urostyla, that has been used as water quality indicators, the flagellate Cercomonas, that has broad dispersion in aquatic environments, Didinium, a ciliate predator widely dispersed, Synura, a chrysophyte sensitive to basic pH and high temperatures, and Amoeba, frequently found in clean or in depuration process waters, may suggest a recuperating environment, since they are also found on lesser impacted sites.
Senoh, Mitsutoshi; Hamabata, Takashi; Takeda, Yoshifumi
In our previous work, we demonstrated that viable but nonculturable (VBNC) Vibrio cholerae O1 and O139 were converted to culturable by coculture with eukaryotic cells. Furthermore, we isolated a factor converting VBNC V. cholerae to culturable (FCVC) from a eukaryotic cell line, HT-29. In this study, we purified FCVC by successive column chromatographies comprising UNO Q-6 anion exchange, Bio-Scale CHT2-1 hydroxyapatite, and Superdex 200 10/300 GL. Homogeneity of the purified FCVC was demonstrated by SDS-PAGE. Nano-LC MS/MS analysis showed that the purified FCVC was a human catalase. An experiment of RNAi knockdown of catalase mRNA from HT-29 cells and treatment of the purified FCVC with a catalase inhibitor, 3-amino-1,2,4-triazole confirmed that the FCVC was a catalase. A possible role of the catalase in converting a VBNC V. cholerae to a culturable state in the human intestine is discussed. © 2015 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
Liu, Bernard A.; Shah, Eshana; Jablonowski, Karl; Stergachis, Andrew; Engelmann, Brett; Nash, Piers D.
The Src homology 2 (SH2) domains are participants in metazoan signal transduction, acting as primary mediators for regulated protein-protein interactions with tyrosine-phosphorylated substrates. Here, we describe the origin and evolution of SH2 domain proteins by means of sequence analysis from 21 eukaryotic organisms from the basal unicellular eukaryotes, where SH2 domains first appeared, through the multicellular animals and increasingly complex metazoans. On the basis of our results, SH2 domains and phosphotyrosine signaling emerged in the early Unikonta, and the numbers of SH2 domains expanded in the choanoflagellate and metazoan lineages with the development of tyrosine kinases, leading to rapid elaboration of phosphotyrosine signaling in early multicellular animals. Our results also indicated that SH2 domains coevolved and the number of the domains expanded alongside protein tyrosine kinases and tyrosine phosphatases, thereby coupling phosphotyrosine signaling to downstream signaling networks. Gene duplication combined with domain gain or loss produced novel SH2-containing proteins that function within phosphotyrosine signaling, which likely have contributed to diversity and complexity in metazoans. We found that intra- and intermolecular interactions within and between SH2 domain proteins increased in prevalence along with organismal complexity and may function to generate more highly connected and robust phosphotyrosine signaling networks. PMID:22155787
Cutler Sean R
Full Text Available Abstract Background The carboxy termini of proteins are a frequent site of activity for a variety of biologically important functions, ranging from post-translational modification to protein targeting. Several short peptide motifs involved in protein sorting roles and dependent upon their proximity to the C-terminus for proper function have already been characterized. As a limited number of such motifs have been identified, the potential exists for genome-wide statistical analysis and comparative genomics to reveal novel peptide signatures functioning in a C-terminal dependent manner. We have applied a novel methodology to the prediction of C-terminal-anchored peptide motifs involving a simple z-statistic and several techniques for improving the signal-to-noise ratio. Results We examined the statistical over-representation of position-specific C-terminal tripeptides in 7 eukaryotic proteomes. Sequence randomization models and simple-sequence masking were applied to the successful reduction of background noise. Similarly, as C-terminal homology among members of large protein families may artificially inflate tripeptide counts in an irrelevant and obfuscating manner, gene-family clustering was performed prior to the analysis in order to assess tripeptide over-representation across protein families as opposed to across all proteins. Finally, comparative genomics was used to identify tripeptides significantly occurring in multiple species. This approach has been able to predict, to our knowledge, all C-terminally anchored targeting motifs present in the literature. These include the PTS1 peroxisomal targeting signal (SKL*, the ER-retention signal (K/HDEL*, the ER-retrieval signal for membrane bound proteins (KKxx*, the prenylation signal (CC* and the CaaX box prenylation motif. In addition to a high statistical over-representation of these known motifs, a collection of significant tripeptides with a high propensity for biological function exists
Full Text Available The human cytochrome P450 (CYP superfamily consisting of 57 functional genes is the most important group of Phase I drug metabolizing enzymes that oxidize a large number of xenobiotics and endogenous compounds, including therapeutic drugs and environmental toxicants. The CYP superfamily has been shown to expand itself through gene duplication, and some of them become pseudogenes due to gene mutations. Orthologs and paralogs are homologous genes resulting from speciation or duplication, respectively. To explore the evolutionary and functional relationships of human CYPs, we conducted this bioinformatic study to identify their corresponding paralogs, homologs, and orthologs. The functional implications and implications in drug discovery and evolutionary biology were then discussed. GeneCards and Ensembl were used to identify the paralogs of human CYPs. We have used a panel of online databases to identify the orthologs of human CYP genes: NCBI, Ensembl Compara, GeneCards, OMA (“Orthologous MAtrix” Browser, PATHER, TreeFam, EggNOG, and Roundup. The results show that each human CYP has various numbers of paralogs and orthologs using GeneCards and Ensembl. For example, the paralogs of CYP2A6 include CYP2A7, 2A13, 2B6, 2C8, 2C9, 2C18, 2C19, 2D6, 2E1, 2F1, 2J2, 2R1, 2S1, 2U1, and 2W1; CYP11A1 has 6 paralogs including CYP11B1, 11B2, 24A1, 27A1, 27B1, and 27C1; CYP51A1 has only three paralogs: CYP26A1, 26B1, and 26C1; while CYP20A1 has no paralog. The majority of human CYPs are well conserved from plants, amphibians, fishes, or mammals to humans due to their important functions in physiology and xenobiotic disposition. The data from different approaches are also cross-validated and validated when experimental data are available. These findings facilitate our understanding of the evolutionary relationships and functional implications of the human CYP superfamily in drug discovery.
Grandi, Nicole; Cadeddu, Marta; Blomberg, Jonas; Mayer, Jens; Tramontano, Enzo
The genomes of all vertebrates harbor remnants of ancient retroviral infections, having affected the germ line cells during the last 100 million years. These sequences, named Endogenous Retroviruses (ERVs), have been transmitted to the offspring in a Mendelian way, being relatively stable components of the host genome even long after their exogenous counterparts went extinct. Among human ERVs (HERVs), the HERV-W group is of particular interest for our physiology and pathology. A HERV-W provirus in locus 7q21.2 has been coopted during evolution to exert an essential role in placenta, and the group expression has been tentatively linked to Multiple Sclerosis and other diseases. Following up on a detailed analysis of 213 HERV-W insertions in the human genome, we now investigated the ERV-W group genomic spread within primate lineages. We analyzed HERV-W orthologous loci in the genome sequences of 12 non-human primate species belonging to Simiiformes (parvorders Catarrhini and Platyrrhini), Tarsiiformes and to the most primitive Prosimians. Analysis of HERV-W orthologous loci in non-human Catarrhini primates revealed species-specific insertions in the genomes of Chimpanzee (3), Gorilla (4), Orangutan (6), Gibbon (2) and especially Rhesus Macaque (66). Such sequences were acquired in a retroviral fashion and, in the majority of cases, by L1-mediated formation of processed pseudogenes. There were also a number of LTR-LTR homologous recombination events that occurred subsequent to separation of Catarrhini sub-lineages. Moreover, we retrieved 130 sequences in Marmoset and Squirrel Monkeys (family Cebidae, Platyrrhini parvorder), identified as ERV1-1_CJa based on RepBase annotations, which appear closely related to the ERV-W group. Such sequences were also identified in Atelidae and Pitheciidae, representative of the other Platyrrhini families. In contrast, no ERV-W-related sequences were found in genome sequence assemblies of Tarsiiformes and Prosimians. Overall, our
Full Text Available Abstract Background It has repeatedly been shown that interacting protein families tend to have similar phylogenetic trees. These similarities can be used to predicting the mapping between two families of interacting proteins (i.e. which proteins from one family interact with which members of the other. The correct mapping will be that which maximizes the similarity between the trees. The two families may eventually comprise orthologs and paralogs, if members of the two families are present in more than one organism. This fact can be exploited to restrict the possible mappings, simply by impeding links between proteins of different organisms. We present here an algorithm to predict the mapping between families of interacting proteins which is able to incorporate information regarding orthologues, or any other assignment of proteins to "classes" that may restrict possible mappings. Results For the first time in methods for predicting mappings, we have tested this new approach on a large number of interacting protein domains in order to statistically assess its performance. The method accurately predicts around 80% in the most favourable cases. We also analysed in detail the results of the method for a well defined case of interacting families, the sensor and kinase components of the Ntr-type two-component system, for which up to 98% of the pairings predicted by the method were correct. Conclusion Based on the well established relationship between tree similarity and interactions we developed a method for predicting the mapping between two interacting families using genomic information alone. The program is available through a web interface.
Vader, Anna; Laughinghouse, Haywood D; Griffiths, Colin; Jakobsen, Kjetill S; Gabrielsen, Tove M
Proton-pumping rhodopsins provide an alternative pathway to photosynthesis by which solar energy can enter the marine food web. Rhodopsin genes are widely found in marine bacteria, also in the Arctic, and were recently reported from several eukaryotic lineages. So far, little is known about rhodopsin expression in Arctic eukaryotes. In this study, we used metatranscriptomics and 18S rDNA tag sequencing to examine the mid-summer function and composition of marine protists (size 0.45-10 µm) in the high-Arctic Billefjorden (Spitsbergen), especially focussing on the expression of microbial proton-pumping rhodopsins. Rhodopsin transcripts were highly abundant, at a level similar to that of genes involved in photosynthesis. Phylogenetic analyses placed the environmental rhodopsins within disparate eukaryotic lineages, including dinoflagellates, stramenopiles, haptophytes and cryptophytes. Sequence comparison indicated the presence of several functional types, including xanthorhodopsins and a eukaryotic clade of proteorhodopsin. Transcripts belonging to the proteorhodopsin clade were also abundant in published metatranscriptomes from other oceanic regions, suggesting a global distribution. The diversity and abundance of rhodopsins show that these light-driven proton pumps play an important role in Arctic microbial eukaryotes. Understanding this role is imperative to predicting the future of the Arctic marine ecosystem faced by a changing light climate due to diminishing sea-ice. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.
Sommers, Pacifica; Darcy, John L; Gendron, Eli M S; Stanish, Lee F; Bagshaw, Elizabeth A; Porazinska, Dorota L; Schmidt, Steven K
Ice-lidded cryoconite holes on glaciers in the Taylor Valley, Antarctica, provide a unique system of natural mesocosms for studying community structure and assembly. We used high-throughput DNA sequencing to characterize both microbial eukaryotic communities and bacterial communities within cryoconite holes across three glaciers to study similarities in their spatial patterns. We expected that the alpha (phylogenetic diversity) and beta (pairwise community dissimilarity) diversity patterns of eukaryotes in cryoconite holes would be related to those of bacteria, and that they would be related to the biogeochemical gradient within the Taylor Valley. We found that eukaryotic alpha and beta diversity were strongly related to those of bacteria across scales ranging from 140 m to 41 km apart. Alpha diversity of both was significantly related to position in the valley and surface area of the cryoconite hole, with pH also significantly correlated with the eukaryotic diversity. Beta diversity for both bacteria and eukaryotes was significantly related to position in the valley, with bacterial beta diversity also related to nitrate. These results are consistent with transport of sediments onto glaciers occurring primarily at local scales relative to the size of the valley, thus creating feedbacks in local chemistry and diversity. © FEMS 2017. All rights reserved. For permissions, please e-mail: email@example.com.
Wallau, Gabriel Luz; Vieira, Cristina; Loreto, Élgion Lúcio Silva
All living species contain genetic information that was once shared by their common ancestor. DNA is being inherited through generations by vertical transmission (VT) from parents to offspring and from ancestor to descendant species. This process was considered the sole pathway by which biological entities exchange inheritable information. However, Horizontal Transfer (HT), the exchange of genetic information by other means than parents to offspring, was discovered in prokaryotes along with strong evidence showing that it is a very important process by which prokaryotes acquire new genes. For some time now, it has been a scientific consensus that HT events were rare and non-relevant for evolution of eukaryotic species, but there is growing evidence supporting that HT is an important and frequent phenomenon in eukaryotes as well. Here, we will discuss the latest findings regarding HT among eukaryotes, mainly HT of transposons (HTT), establishing HTT once and for all as an important phenomenon that should be taken into consideration to fully understand eukaryotes genome evolution. In addition, we will discuss the latest development methods to detect such events in a broader scale and highlight the new approaches which should be pursued by researchers to fill the knowledge gaps regarding HTT among eukaryotes.
Novák, Lukáš; Zubáčová, Zuzana; Karnkowska, Anna; Kolisko, Martin; Hroudová, Miluše; Stairs, Courtney W; Simpson, Alastair G B; Keeling, Patrick J; Roger, Andrew J; Čepička, Ivan; Hampl, Vladimír
Multiple prokaryotic lineages use the arginine deiminase (ADI) pathway for anaerobic energy production by arginine degradation. The distribution of this pathway among eukaryotes has been thought to be very limited, with only two specialized groups living in low oxygen environments (Parabasalia and Diplomonadida) known to possess the complete set of all three enzymes. We have performed an extensive survey of available sequence data in order to map the distribution of these enzymes among eukaryotes and to reconstruct their phylogenies. We have found genes for the complete pathway in almost all examined representatives of Metamonada, the anaerobic protist group that includes parabasalids and diplomonads. Phylogenetic analyses indicate the presence of the complete pathway in the last common ancestor of metamonads and heterologous transformation experiments suggest its cytosolic localization in the metamonad ancestor. Outside Metamonada, the complete pathway occurs rarely, nevertheless, it was found in representatives of most major eukaryotic clades. Phylogenetic relationships of complete pathways are consistent with the presence of the Archaea-derived ADI pathway in the last common ancestor of all eukaryotes, although other evolutionary scenarios remain possible. The presence of the incomplete set of enzymes is relatively common among eukaryotes and it may be related to the fact that these enzymes are involved in other cellular processes, such as the ornithine-urea cycle. Single protein phylogenies suggest that the evolutionary history of all three enzymes has been shaped by frequent gene losses and horizontal transfers, which may sometimes be connected with their diverse roles in cellular metabolism.
Full Text Available Alternative splicing, polyadenylation, and chemical modifications of RNA generate astonishing complexity within eukaryotic transcriptomes. The last decade has brought numerous advances in sequencing technologies that allow biologists to investigate these phenomena with greater depth and accuracy while reducing time and cost. A commensurate development in biochemical techniques for the enrichment and analysis of different RNA variants has accompanied the advancement of global sequencing analysis platforms. Here, we present a detailed overview of the latest biochemical methods, along with bioinformatics pipelines that have aided in identifying different RNA variants. We also highlight the ongoing developments and challenges associated with RNA variant detection and quantification, including sample heterogeneity and isolation, as well as ‘Omics’ big data handling.
Baker, Brett J; Tyson, Gene W; Goosherst, Lindsey; Banfield, Jillian F
Microscopic eukaryotes are known to have important ecosystem functions, but their diversity in most environments remains vastly unexplored. Here we analyzed an 18S rRNA gene library from a subsurface iron- and sulfur-oxidizing microbial community growing in highly acidic (pH morphological characterization. Results revealed that the populations vary significantly with the habitat and no group is ubiquitous. Surprisingly, many of the eukaryotic lineages (with the exception of the APC) are closely related to neutrophiles, suggesting that they recently adapted to this extreme environment. Molecular analyses presented here confirm that the number of eukaryotic species associated with the acid mine drainage (AMD) communities is low. This finding is consistent with previous results showing a limited diversity of archaea, bacteria, and viruses in AMD environments and suggests that the environmental pressures and interplay between the members of these communities limit species diversity at all trophic levels.
Andersen, Gorm; Bjornberg, Olof; Polakova, Silvia
Pyrimidine bases are the central precursors for RNA and DNA, and their intracellular pools are determined by de novo, salvage and catabolic pathways. In eukaryotes, degradation of uracil has been believed to proceed only via the reduction to dihydrouracil. Using a yeast model, Saccharomyces kluyv...... of the eukaryotic or prokaryotic genes involved in pyrimidine degradation described to date.......Pyrimidine bases are the central precursors for RNA and DNA, and their intracellular pools are determined by de novo, salvage and catabolic pathways. In eukaryotes, degradation of uracil has been believed to proceed only via the reduction to dihydrouracil. Using a yeast model, Saccharomyces......, respectively. The gene products of URC1 and URC4 are highly conserved proteins with so far unknown functions and they are present in a variety of prokaryotes and fungi. In bacteria and in some fungi, URC1 and URC4 are linked on the genome together with the gene for uracil phosphoribosyltransferase (URC6). Urc1...
Kevin J Liu
Full Text Available One outcome of interspecific hybridization and subsequent effects of evolutionary forces is introgression, which is the integration of genetic material from one species into the genome of an individual in another species. The evolution of several groups of eukaryotic species has involved hybridization, and cases of adaptation through introgression have been already established. In this work, we report on PhyloNet-HMM-a new comparative genomic framework for detecting introgression in genomes. PhyloNet-HMM combines phylogenetic networks with hidden Markov models (HMMs to simultaneously capture the (potentially reticulate evolutionary history of the genomes and dependencies within genomes. A novel aspect of our work is that it also accounts for incomplete lineage sorting and dependence across loci. Application of our model to variation data from chromosome 7 in the mouse (Mus musculus domesticus genome detected a recently reported adaptive introgression event involving the rodent poison resistance gene Vkorc1, in addition to other newly detected introgressed genomic regions. Based on our analysis, it is estimated that about 9% of all sites within chromosome 7 are of introgressive origin (these cover about 13 Mbp of chromosome 7, and over 300 genes. Further, our model detected no introgression in a negative control data set. We also found that our model accurately detected introgression and other evolutionary processes from synthetic data sets simulated under the coalescent model with recombination, isolation, and migration. Our work provides a powerful framework for systematic analysis of introgression while simultaneously accounting for dependence across sites, point mutations, recombination, and ancestral polymorphism.
Pemberton Trevor J
Full Text Available Abstract Background The peptidyl-prolyl cis/trans isomerase (PPIase class of proteins is present in all known eukaryotes, prokaryotes, and archaea, and it is comprised of three member families that share the ability to catalyze the cis/trans isomerisation of a prolyl bond. Some fungi have been used as model systems to investigate the role of PPIases within the cell, however how representative these repertoires are of other fungi or humans has not been fully investigated. Results PPIase numbers within these fungal repertoires appears associated with genome size and orthology between repertoires was found to be low. Phylogenetic analysis showed the single-domain FKBPs to evolve prior to the multi-domain FKBPs, whereas the multi-domain cyclophilins appear to evolve throughout cyclophilin evolution. A comparison of their known functions has identified, besides a common role within protein folding, multiple roles for the cyclophilins within pre-mRNA splicing and cellular signalling, and within transcription and cell cycle regulation for the parvulins. However, no such commonality was found with the FKBPs. Twelve of the 17 human cyclophilins and both human parvulins, but only one of the 13 human FKBPs, identified orthologues within these fungi. hPar14 orthologues were restricted to the Pezizomycotina fungi, and R. oryzae is unique in the known fungi in possessing an hCyp33 orthologue and a TPR-containing FKBP. The repertoires of Cryptococcus neoformans, Aspergillus fumigatus, and Aspergillus nidulans were found to exhibit the highest orthology to the human repertoire, and Saccharomyces cerevisiae one of the lowest. Conclusion Given this data, we would hypothesize that: (i the evolution of the fungal PPIases is driven, at least in part, by the size of the proteome, (ii evolutionary pressures differ both between the different PPIase families and the different fungi, and (iii whilst the cyclophilins and parvulins have evolved to perform conserved
Colussi, Timothy M; Costantino, David A; Zhu, Jianyu; Donohue, John Paul; Korostelev, Andrei A; Jaafar, Zane A; Plank, Terra-Dawn M; Noller, Harry F; Kieft, Jeffrey S
The central dogma of gene expression (DNA to RNA to protein) is universal, but in different domains of life there are fundamental mechanistic differences within this pathway. For example, the canonical molecular signals used to initiate protein synthesis in bacteria and eukaryotes are mutually exclusive. However, the core structures and conformational dynamics of ribosomes that are responsible for the translation steps that take place after initiation are ancient and conserved across the domains of life. We wanted to explore whether an undiscovered RNA-based signal might be able to use these conserved features, bypassing mechanisms specific to each domain of life, and initiate protein synthesis in both bacteria and eukaryotes. Although structured internal ribosome entry site (IRES) RNAs can manipulate ribosomes to initiate translation in eukaryotic cells, an analogous RNA structure-based mechanism has not been observed in bacteria. Here we report our discovery that a eukaryotic viral IRES can initiate translation in live bacteria. We solved the crystal structure of this IRES bound to a bacterial ribosome to 3.8 Å resolution, revealing that despite differences between bacterial and eukaryotic ribosomes this IRES binds directly to both and occupies the space normally used by transfer RNAs. Initiation in both bacteria and eukaryotes depends on the structure of the IRES RNA, but in bacteria this RNA uses a different mechanism that includes a form of ribosome repositioning after initial recruitment. This IRES RNA bridges billions of years of evolutionary divergence and provides an example of an RNA structure-based translation initiation signal capable of operating in two domains of life.
Horváth, Vivien; Merenciano, Miriam; González, Josefa
A relationship between transposable elements (TEs) and the eukaryotic stress response was suggested in the first publications describing TEs. Since then, it has often been assumed that TEs are activated by stress, and that this activation is often beneficial for the organism. In recent years, the availability of new high-throughput experimental techniques has allowed further interrogation of the relationship between TEs and stress. By reviewing the recent literature, we conclude that although there is evidence for a beneficial effect of TE activation under stress conditions, the relationship between TEs and the eukaryotic stress response is quite complex. Copyright © 2017 Elsevier Ltd. All rights reserved.
One of the fundamental problems of modern molecular cellular radiobiology is to reveal general and peculiar processes of the formation of gene mutations and chromosome aberrations in each stage of their formation in the irradiated genome of the higher eukaryots. The solution of the problems depends on the development of research within the framework of comparative radiobiology of genetic loci of the higher eukaryots that makes it possible to study quantitative regularities in the formation of gene (point) mutations and chromosome aberrations in one object and in the same experiment
Nielsen, Olaf; Løbner-Olesen, Anders
DNA replication is an extremely accurate process and cells have evolved intricate control mechanisms to ensure that each region of their genome is replicated only once during S phase. Here, we compare what is known about the processes that prevent re-replication in prokaryotic and eukaryotic cells...... prokaryotes and eukaryotes are inactivated until the next cell cycle. Furthermore, in both systems the beta-clamp of the replicative polymerase associates with enzymatic activities that contribute to the inactivation of the helicase loaders. Finally, recent studies suggest that the control mechanism...
Shemarova, Irina V
In unicellular eukaryotes, apoptosis-like cell death occurs during development, aging and reproduction, and can be induced by environmental stresses and exposure to toxic agents. The essence of the apoptotic machinery in unicellular organisms is similar to that in mammals, but the apoptotic signal network is less complex and of more ancient origin. The review summarizes current data about key apoptotic proteins and mechanisms of the transduction of apoptotic signals by caspase-like proteases and mitochondrial apoptogenic proteins in unicellular eukaryotes. The roles of receptor-dependent and receptor-independent caspase cascades are reviewed. 2010 Elsevier Inc. All rights reserved.
Krapp, S; Kelly, G; Reischl, J; Weinzierl, R O; Matthews, S
RNA polymerase II subunit RPB8 is an essential subunit that is highly conserved throughout eukaryotic evolution and is present in all three types of nuclear RNA polymerases. We report the first high resolution structural insight into eukaryotic RNA polymerase architecture with the solution structure of RPB8 from Saccharomyces cerevisiae. It consists of an eight stranded, antiparallel beta-barrel, four short helical regions and a large, unstructured omega-loop. The strands are connected in classic Greek-key fashion. The overall topology is unusual and contains a striking C2 rotational symmetry. Furthermore, it is most likely a novel associate of the oligonucleotide/oligosaccharide (OB) binding protein class.
Full Text Available Abstract Background Systemic acquired resistance (SAR is induced in non-inoculated leaves following infection with certain pathogenic strains. SAR is effective against many pathogens. Salicylic acid (SA is a signaling molecule of the SAR pathway. The development of SAR is associated with the induction of pathogenesis related (PR genes. Arabidopsis non-expressor of PR1 (NPR1 is a regulatory gene of the SA signal pathway 123. SAR in soybean was first reported following infection with Colletotrichum trancatum that causes anthracnose disease. We investigated if SAR in soybean is regulated by a pathway, similar to the one characterized in Arabidopsis. Results Pathogenesis-related gene GmPR1 is induced following treatment of soybean plants with the SAR inducer, 2,6-dichloroisonicotinic acid (INA or infection with the oomycete pathogen, Phytophthora sojae. In P. sojae-infected plants, SAR was induced against the bacterial pathogen, Pseudomonas syringae pv. glycinea. Soybean GmNPR1-1 and GmNPR1-2 genes showed high identities to Arabidopsis NPR1. They showed similar expression patterns among the organs, studied in this investigation. GmNPR1-1 and GmNPR1-2 are the only soybean homologues of NPR1and are located in homoeologous regions. In GmNPR1-1 and GmNPR1-2 transformed Arabidopsis npr1-1 mutant plants, SAR markers: (i PR-1 was induced following INA treatment and (ii BGL2 following infection with Pseudomonas syringae pv. tomato (Pst, and SAR was induced following Pst infection. Of the five cysteine residues, Cys82, Cys150, Cys155, Cys160, and Cys216 involved in oligomer-monomer transition in NPR1, Cys216 in GmNPR1-1 and GmNPR1-2 proteins was substituted to Ser and Leu, respectively. Conclusion Complementation analyses in Arabidopsis npr1-1 mutants revealed that homoeologous GmNPR1-1 and GmNPR1-2 genes are orthologous to Arabidopsis NPR1. Therefore, SAR pathway in soybean is most likely regulated by GmNPR1 genes. Substitution of Cys216 residue, essential
Sandhu, Devinder; Tasma, I Made; Frasch, Ryan; Bhattacharyya, Madan K
Systemic acquired resistance (SAR) is induced in non-inoculated leaves following infection with certain pathogenic strains. SAR is effective against many pathogens. Salicylic acid (SA) is a signaling molecule of the SAR pathway. The development of SAR is associated with the induction of pathogenesis related (PR) genes. Arabidopsis non-expressor of PR1 (NPR1) is a regulatory gene of the SA signal pathway 123. SAR in soybean was first reported following infection with Colletotrichum trancatum that causes anthracnose disease. We investigated if SAR in soybean is regulated by a pathway, similar to the one characterized in Arabidopsis. Pathogenesis-related gene GmPR1 is induced following treatment of soybean plants with the SAR inducer, 2,6-dichloroisonicotinic acid (INA) or infection with the oomycete pathogen, Phytophthora sojae. In P. sojae-infected plants, SAR was induced against the bacterial pathogen, Pseudomonas syringae pv. glycinea. Soybean GmNPR1-1 and GmNPR1-2 genes showed high identities to Arabidopsis NPR1. They showed similar expression patterns among the organs, studied in this investigation. GmNPR1-1 and GmNPR1-2 are the only soybean homologues of NPR1and are located in homoeologous regions. In GmNPR1-1 and GmNPR1-2 transformed Arabidopsis npr1-1 mutant plants, SAR markers: (i) PR-1 was induced following INA treatment and (ii) BGL2 following infection with Pseudomonas syringae pv. tomato (Pst), and SAR was induced following Pst infection. Of the five cysteine residues, Cys82, Cys150, Cys155, Cys160, and Cys216 involved in oligomer-monomer transition in NPR1, Cys216 in GmNPR1-1 and GmNPR1-2 proteins was substituted to Ser and Leu, respectively. Complementation analyses in Arabidopsis npr1-1 mutants revealed that homoeologous GmNPR1-1 and GmNPR1-2 genes are orthologous to Arabidopsis NPR1. Therefore, SAR pathway in soybean is most likely regulated by GmNPR1 genes. Substitution of Cys216 residue, essential for oligomer-monomer transition of Arabidopsis NPR1
Fan Ying; Shi Lichi; Ladizhansky, Vladimir; Brown, Leonid S.
Overexpression of isotope-labeled multi-spanning eukaryotic membrane proteins for structural NMR studies is often challenging. On the one hand, difficulties with achieving proper folding, membrane insertion, and native-like post-translational modifications frequently disqualify bacterial expression systems. On the other hand, eukaryotic cell cultures can be prohibitively expensive. One of the viable alternatives, successfully used for producing proteins for solution NMR studies, is yeast expression systems, particularly Pichia pastoris. We report on successful implementation and optimization of isotope labeling protocols, previously used for soluble secreted proteins, to produce homogeneous samples of a eukaryotic seven-transmembrane helical protein, rhodopsin from Leptosphaeria maculans. Even in shake-flask cultures, yields exceeded 5 mg of purified uniformly 13 C, 15 N-labeled protein per liter of culture. The protein was stable (at least several weeks at 5°C) and functionally active upon reconstitution into lipid membranes at high protein-to-lipid ratio required for solid-state NMR. The samples gave high-resolution 13 C and 15 N solid-state magic angle spinning NMR spectra, amenable to a detailed structural analysis. We believe that similar protocols can be adopted for challenging mammalian targets, which often resist characterization by other structural methods.
McCarthy, Matthew D.; Lehman, Jennifer; Kudela, Raphael
Stable nitrogen isotopic analysis of individual amino acids (δ15N-AA) has unique potential to elucidate the complexities of food webs, track heterotrophic transformations, and understand diagenesis of organic nitrogen (ON). While δ15N-AA patterns of autotrophs have been shown to be generally similar, prior work has also suggested that differences may exist between cyanobacteria and eukaryotic algae. However, δ15N-AA patterns in differing oceanic algal groups have never been closely examined. The overarching goals of this study were first to establish a more quantitative understanding of algal δ15N-AA patterns, and second to examine whether δ15N-AA patterns have potential as a new tracer for distinguishing prokaryotic vs. eukaryotic N sources. We measured δ15N-AA from prokaryotic and eukaryotic phytoplankton cultures and used a complementary set of statistical approaches (simple normalization, regression-derived fractionation factors, and multivariate analyses) to test for variations. A generally similar δ15N-AA pattern was confirmed for all algae, however significant AA-specific variation was also consistently identified between the two groups. The relative δ15N fractionation of Glx (glutamine + glutamic acid combined) vs. total proteinaceous N appeared substantially different, which we hypothesize could be related to differing enzymatic forms. In addition, the several other AA (most notably glycine and leucine) appeared to have strong biomarker potential. Finally, we observed that overall patterns of δ15N values in algae correspond well with the Trophic vs. Source-AA division now commonly used to describe variable AA δ15N changes with trophic transfer, suggesting a common mechanistic basis. Overall, these results show that autotrophic δ15N-AA patterns can differ between major algal evolutionary groupings for many AA. The statistically significant multivariate results represent a first approach for testing ideas about relative eukaryotic vs. prokaryotic
. The available literature indi- cated that the expression of eIF5A was temporal and spatial difference and suppressing eIF5A activation causes pleiotropic effects. Transcript analysis reveals that two tobacco eIF5A genes ...
Hudson, Brian D; Christiansen, Elisabeth; Tikhonova, Irina G
When it is difficult to develop selective ligands within a family of related G-protein-coupled receptors (GPCRs), chemically engineered receptors activated solely by synthetic ligands (RASSLs) are useful alternatives for probing receptor function. In the present work, we explored whether a RASSL...... on this receptor and demonstrates that exploitation of pharmacological variation between species orthologs is a powerful method to generate novel chemically engineered GPCRs.-Hudson, B. D., Christiansen, E., Tikhonova, I. G., Grundmann, M., Kostenis, E., Adams, D. R., Ulven, T., Milligan, G. Chemically engineering...
Full Text Available Phagosome maturation is a complex process enabling degradation of internalised particles. Our data obtained at the gene, protein and cellular level indicate that the set of components involved in this process and known up to now in mammalian cells is functioning in unicellular eukaryote. Rab7-interacting partners: homologues of its effector RILP (Rab-interacting lysosomal protein and LAMP-2 (lysosomal membrane protein 2 as well as a7 subunit of the 26S proteasome were revealed in Paramecium phagolysosomal compartment. We identified the gene/transcript fragments encoding RILP-related proteins (RILP1 and RILP2 in Paramecium by PCR/RT-PCR and sequencing. The deduced amino acid sequences of RILP1 and RILP2 show 60.5% and 58.3% similarity, respectively, to the region involved in regulating of lysosomal morphology and dynein-dynactin recruitment of human RILP. RILP colocalised with Rab7 in Paramecium lysosomes and at phagolysosomal membrane during phagocytosis of both the latex beads and bacteria. In the same compartment LAMP-2 was present and its expression during latex internalisation was 2.5-fold higher than in the control when P2 protein fractions (100 000 x g of equal load were quantified by immunoblotting. LAMP-2 crossreacting polypeptide of ~106 kDa was glycosylated as shown by fluorescent and Western analysis of the same blot preceded by PNGase F treatment. The a7 subunit of 26S proteasome was detected close to the phagosomal membrane in the small vesicles, in some of which it colocalised with Rab7. Immunoblotting confirmed presence of RILPrelated polypeptide and a7 subunit of 26S proteasome in Paramecium protein fractions. These results suggest that Rab7, RILP and LAMP-2 may be involved in phagosome maturation in Paramecium.
Full Text Available Abstract Background Free-living amoebae serve as a natural reservoir for some bacteria that have evolved into «amoeba-resistant» bacteria. Among these, some are strictly intra-amoebal, such as Candidatus "Protochlamydia amoebophila" (Candidatus "P. amoebophila", whose genomic sequence is available. We sequenced the genome of Legionella drancourtii (L. drancourtii, another recently described intra-amoebal bacterium. By comparing these two genomes with those of their closely related species, we were able to study the genetic characteristics specific to their amoebal lifestyle. Findings We identified a sterol delta-7 reductase-encoding gene common to these two bacteria and absent in their relatives. This gene encodes an enzyme which catalyses the last step of cholesterol biosynthesis in eukaryotes, and is probably functional within L. drancourtii since it is transcribed. The phylogenetic analysis of this protein suggests that it was acquired horizontally by a few bacteria from viridiplantae. This gene was also found in the Acanthamoeba polyphaga Mimivirus genome, a virus that grows in amoebae and possesses the largest viral genome known to date. Conclusion L. drancourtii acquired a sterol delta-7 reductase-encoding gene of viridiplantae origin. The most parsimonious hypothesis is that this gene was initially acquired by a Chlamydiales ancestor parasite of plants. Subsequently, its descendents transmitted this gene in amoebae to other intra-amoebal microorganisms, including L. drancourtii and Coxiella burnetii. The role of the sterol delta-7 reductase in prokaryotes is as yet unknown but we speculate that it is involved in host cholesterol parasitism.
Full Text Available Members of the Roseobacter clade which play a key role in the biogeochemical cycles of the ocean are diverse and abundant, comprising 10-25% of the bacterioplankton in most marine surface waters. The rapid accumulation of whole-genome sequence data for the Roseobacter clade allows us to obtain a clearer picture of its evolution.In this study about 1,200 likely orthologous protein families were identified from 17 Roseobacter bacteria genomes. Functional annotations for these genes are provided by iProClass. Phylogenetic trees were constructed for each gene using maximum likelihood (ML and neighbor joining (NJ. Putative organismal phylogenetic trees were built with phylogenomic methods. These trees were compared and analyzed using principal coordinates analysis (PCoA, approximately unbiased (AU and Shimodaira-Hasegawa (SH tests. A core set of 694 genes with vertical descent signal that are resistant to horizontal gene transfer (HGT is used to reconstruct a robust organismal phylogeny. In addition, we also discovered the most likely 109 HGT genes. The core set contains genes that encode ribosomal apparatus, ABC transporters and chaperones often found in the environmental metagenomic and metatranscriptomic data. These genes in the core set are spread out uniformly among the various functional classes and biological processes.Here we report a new multigene-derived phylogenetic tree of the Roseobacter clade. Of particular interest is the HGT of eleven genes involved in vitamin B12 synthesis as well as key enzynmes for dimethylsulfoniopropionate (DMSP degradation. These aquired genes are essential for the growth of Roseobacters and their eukaryotic partners.
Urban, J. M.; Foulk, M. S.; Casella, Cinzia
Origins of DNA replication (ORIs) occur at defined regions in the genome. Although DNA sequence defines the position of ORIs in budding yeast, the factors for ORI specification remain elusive in metazoa. Several methods have been used recently to map ORIs in metazoan genomes with the hope...... that features for ORI specification might emerge. These methods are reviewed here with analysis of their advantages and shortcomings. The various factors that may influence ORI selection for initiation of DNA replication are discussed....
Vandepoele, Klaas; Van Bel, Michiel; Richard, Guilhem; Van Landeghem, Sofie; Verhelst, Bram; Moreau, Hervé; Van de Peer, Yves; Grimsley, Nigel; Piganeau, Gwenael
With the advent of next generation genome sequencing, the number of sequenced algal genomes and transcriptomes is rapidly growing. Although a few genome portals exist to browse individual genome sequences, exploring complete genome information from multiple species for the analysis of user-defined sequences or gene lists remains a major challenge. pico-PLAZA is a web-based resource (http://bioinformatics.psb.ugent.be/pico-plaza/) for algal genomics that combines different data types with intuitive tools to explore genomic diversity, perform integrative evolutionary sequence analysis and study gene functions. Apart from homologous gene families, multiple sequence alignments, phylogenetic trees, Gene Ontology, InterPro and text-mining functional annotations, different interactive viewers are available to study genome organization using gene collinearity and synteny information. Different search functions, documentation pages, export functions and an extensive glossary are available to guide non-expert scientists. To illustrate the versatility of the platform, different case studies are presented demonstrating how pico-PLAZA can be used to functionally characterize large-scale EST/RNA-Seq data sets and to perform environmental genomics. Functional enrichments analysis of 16 Phaeodactylum tricornutum transcriptome libraries offers a molecular view on diatom adaptation to different environments of ecological relevance. Furthermore, we show how complementary genomic data sources can easily be combined to identify marker genes to study the diversity and distribution of algal species, for example in metagenomes, or to quantify intraspecific diversity from environmental strains. © 2013 John Wiley & Sons Ltd and Society for Applied Microbiology.
Full Text Available The carmine spider mite (CSM, Tetranychus cinnabarinus, is an important pest mite in agriculture, because it can develop insecticide resistance easily. To gain valuable gene information and molecular basis for the future insecticide resistance study of CSM, the first transcriptome analysis of CSM was conducted. A total of 45,016 contigs and 25,519 unigenes were generated from the de novo transcriptome assembly, and 15,167 unigenes were annotated via BLAST querying against current databases, including nr, SwissProt, the Clusters of Orthologous Groups (COGs, Kyoto Encyclopedia of Genes and Genomes (KEGG and Gene Ontology (GO. Aligning the transcript to Tetranychus urticae genome, the 19255 (75.45% of the transcripts had significant (e-value <10-5 matches to T. urticae DNA genome, 19111 sequences matched to T. urticae proteome with an average protein length coverage of 42.55%. Core Eukaryotic Genes Mapping Approach (CEGMA analysis identified 435 core eukaryotic genes (CEGs in the CSM dataset corresponding to 95% coverage. Ten gene categories that relate to insecticide resistance in arthropod were generated from CSM transcriptome, including 53 P450-, 22 GSTs-, 23 CarEs-, 1 AChE-, 7 GluCls-, 9 nAChRs-, 8 GABA receptor-, 1 sodium channel-, 6 ATPase- and 12 Cyt b genes. We developed significant molecular resources for T. cinnabarinus putatively involved in insecticide resistance. The transcriptome assembly analysis will significantly facilitate our study on the mechanism of adapting environmental stress (including insecticide in CSM at the molecular level, and will be very important for developing new control strategies against this pest mite.
Perederina, Anna; Esakova, Olga; Quan, Chao; Khanova, Elena; Krasilnikov, Andrey S
Ribonuclease (RNase) P is a site-specific endoribonuclease found in all kingdoms of life. Typical RNase P consists of a catalytic RNA component and a protein moiety. In the eukaryotes, the RNase P lineage has split into two, giving rise to a closely related enzyme, RNase MRP, which has similar components but has evolved to have different specificities. The eukaryotic RNases P/MRP have acquired an essential helix-loop-helix protein-binding RNA domain P3 that has an important function in eukaryotic enzymes and distinguishes them from bacterial and archaeal RNases P. Here, we present a crystal structure of the P3 RNA domain from Saccharomyces cerevisiae RNase MRP in a complex with RNase P/MRP proteins Pop6 and Pop7 solved to 2.7 A. The structure suggests similar structural organization of the P3 RNA domains in RNases P/MRP and possible functions of the P3 domains and proteins bound to them in the stabilization of the holoenzymes' structures as well as in interactions with substrates. It provides the first insight into the structural organization of the eukaryotic enzymes of the RNase P/MRP family.
Knyigavko, V.G.; Ponomarenko, N.S.; Meshcheryakova, O.P.; Protasenya, S.Yu.
A mathematical model of the processes determining reproductive death of the exposed cells was built. The model takes into account the phenomenon of saturation of the system of DNA radiation lesion reparation and structural functional peculiarities of chromatin structure in eukaryotes. The problem of assessment of the model parameters using experimental data was discussed.
Khare, Pranay D.; Russell, Stephen J.; Federspiel, Mark J.
Display technology refers to methods of generating libraries of modularly coded biomolecules and screening them for particular properties. Retroviruses are good candidates to be a eukaryotic viral platform for the display of polypeptides synthesized in eukaryotic cells. Here we demonstrate that avian leukosis virus (ALV) provides an ideal platform for display of nonviral polyaeptides expressed in a eukaryotic cell substrate. Different sizes of polypeptides were genetically fused to the extreme N-terminus of the ALV envelope glycoprotein in an ALV infectious clone containing an alkaline phosphatase reporter gene. The chimeric envelope glycoproteins were efficiently incorporated into virions and were stably displayed on the surface of the virions through multiple virus replication cycles. The foreign polypeptides did not interfere with the attachment and entry functions of the underlying ALV envelope glycoproteins. The displayed polypeptides were fully functional and could efficiently mediate attachment of the recombinant viruses to their respective cognate receptors. This study demonstrates that ALV is an ideal display platform for the generation and selection of libraries of polypeptides where there is a need for expression, folding, and posttranslational modification in the endoplasmic reticulum of eukaryotic cells
Full Text Available Abstract Background Microsatellites have immense utility as molecular markers in different fields like genome characterization and mapping, phylogeny and evolutionary biology. Existing microsatellite databases are of limited utility for experimental and computational biologists with regard to their content and information output. EuMicroSatdb (Eukaryotic MicroSatellite database http://ipu.ac.in/usbt/EuMicroSatdb.htm is a web based relational database for easy and efficient positional mining of microsatellites from sequenced eukaryotic genomes. Description A user friendly web interface has been developed for microsatellite data retrieval using Active Server Pages (ASP. The backend database codes for data extraction and assembly have been written using Perl based scripts and C++. Precise need based microsatellites data retrieval is possible using different input parameters like microsatellite type (simple perfect or compound perfect, repeat unit length (mono- to hexa-nucleotide, repeat number, microsatellite length and chromosomal location in the genome. Furthermore, information about clustering of different microsatellites in the genome can also be retrieved. Finally, to facilitate primer designing for PCR amplification of any desired microsatellite locus, 200 bp upstream and downstream sequences are provided. Conclusion The database allows easy systematic retrieval of comprehensive information about simple and compound microsatellites, microsatellite clusters and their locus coordinates in 31 sequenced eukaryotic genomes. The information content of the database is useful in different areas of research like gene tagging, genome mapping, population genetics, germplasm characterization and in understanding microsatellite dynamics in eukaryotic genomes.
Monier, Adam; Pagarete, António; de Vargas, Colomban; Allen, Michael J.; Read, Betsy; Claverie, Jean-Michel; Ogata, Hiroyuki
Interactions between viruses and phytoplankton, the main primary producers in the oceans, affect global biogeochemical cycles and climate. Recent studies are increasingly revealing possible cases of gene transfers between cyanobacteria and phages, which might have played significant roles in the evolution of cyanobacteria/phage systems. However, little has been documented about the occurrence of horizontal gene transfer in eukaryotic phytoplankton/virus systems. Here we report phylogenetic evidence for the transfer of seven genes involved in the sphingolipid biosynthesis pathway between the cosmopolitan eukaryotic microalga Emiliania huxleyi and its large DNA virus EhV. PCR assays indicate that these genes are prevalent in E. huxleyi and EhV strains isolated from different geographic locations. Patterns of protein and gene sequence conservation support that these genes are functional in both E. huxleyi and EhV. This is the first clear case of horizontal gene transfer of multiple functionally linked enzymes in a eukaryotic phytoplankton–virus system. We examine arguments for the possible direction of the gene transfer. The virus-to-host direction suggests the existence of ancient viruses that controlled the complex metabolic pathway in order to infect primitive eukaryotic cells. In contrast, the host-to-virus direction suggests that the serial acquisition of genes involved in the same metabolic pathway might have been a strategy for the ancestor of EhVs to stay ahead of their closest relatives in the great evolutionary race for survival. PMID:19451591
Li, Sanshu; Smith, Kathryn D; Davis, Jared H; Gordon, Patricia B; Breaker, Ronald R; Strobel, Scott A
Fluorine is an abundant element and is toxic to organisms from bacteria to humans, but the mechanisms by which eukaryotes resist fluoride toxicity are unknown. The Escherichia coli gene crcB was recently shown to be regulated by a fluoride-responsive riboswitch, implicating it in fluoride response. There are >8,000 crcB homologs across all domains of life, indicating that it has an important role in biology. Here we demonstrate that eukaryotic homologs [renamed FEX (fluoride exporter)] function in fluoride export. FEX KOs in three eukaryotic model organisms, Neurospora crassa, Saccharomyces cerevisiae, and Candida albicans, are highly sensitized to fluoride (>200-fold) but not to other halides. Some of these KO strains are unable to grow in fluoride concentrations found in tap water. Using the radioactive isotope of fluoride, (18)F, we developed an assay to measure the intracellular fluoride concentration and show that the FEX deletion strains accumulate fluoride in excess of the external concentration, providing direct evidence of FEX function in fluoride efflux. In addition, they are more sensitive to lower pH in the presence of fluoride. These results demonstrate that eukaryotic FEX genes encode a previously unrecognized class of fluoride exporter necessary for survival in standard environmental conditions.
Marques, Catarina A; Dickens, Nicholas J; Paape, Daniel; Campbell, Samantha J; McCulloch, Richard
DNA replication initiates on defined genome sites, termed origins. Origin usage appears to follow common rules in the eukaryotic organisms examined to date: all chromosomes are replicated from multiple origins, which display variations in firing efficiency and are selected from a larger pool of potential origins. To ask if these features of DNA replication are true of all eukaryotes, we describe genome-wide origin mapping in the parasite Leishmania. Origin mapping in Leishmania suggests a striking divergence in origin usage relative to characterized eukaryotes, since each chromosome appears to be replicated from a single origin. By comparing two species of Leishmania, we find evidence that such origin singularity is maintained in the face of chromosome fusion or fission events during evolution. Mapping Leishmania origins suggests that all origins fire with equal efficiency, and that the genomic sites occupied by origins differ from related non-origins sites. Finally, we provide evidence that origin location in Leishmania displays striking conservation with Trypanosoma brucei, despite the latter parasite replicating its chromosomes from multiple, variable strength origins. The demonstration of chromosome replication for a single origin in Leishmania, a microbial eukaryote, has implications for the evolution of origin multiplicity and associated controls, and may explain the pervasive aneuploidy that characterizes Leishmania chromosome architecture.
The Selfish Operon Model postulates that the organization of bacterial genes into operons is beneficial to the constituent genes in that proximity allows horizontal cotransfer of all genes required for a selectable phenotype; eukaryotic operons formed for very different reasons. Horizontal transfer of selfish operons most probably promotes bacterial diversification.
Barlow, Peter W; Baldelli, E; Baluška, Frantisek
The Energide concept, as well as the endosymbiotic theory of eukaryotic cell organization and evolution, proposes that present-day cells of eukaryotic organisms are mosaics of specialized and cooperating units, or organelles. Some of these units were originally free-living prokaryotes, which were engulfed during evolutionary time. Mitochondria represent one of these types of previously independent organisms, the Energide, is another type. This new perspective on the organization of the cell has been further expanded to reveal the concept of a public milieu, the cytosol, in which Energides and mitochondria live, each with their own private internal milieu. The present paper discusses how the endosymbiotic theory implicates a new hypothesis about the hierarchical and communicational organization of the integrated prokaryotic components of the eukaryotic cell and provides a new angle from which to consider the theory of evolution and its bearing upon cellular complexity. Thus, it is proposed that the “selfish gene” hypothesis of Dawkins1 is not the only possible perspective for comprehending genomic and cellular evolution. Our proposal is that maternal mitochondria are the selfish “master” entities of the eukaryotic cell with respect not only to their propagation from cell-to-cell and from generation-to-generation but also to their regulation of all other cellular functions. However, it should be recognized that the concept of “master” and “servant” cell components is a metaphor; in present-day living organisms their organellar components are considered to be interdependent and inseparable. PMID:19513277
Ros, V.I.D.; Hurst, G.D.D.
The expansion of genome sequencing projects has produced accumulating evidence for lateral transfer of genes between prokaryotic and eukaryotic genomes. However, it remains controversial whether these genes are of functional importance in their recipient host. Nikoh and Nakabachi, in a recent paper
Nielsen, Henrik; Engelbrecht, Jacob; Brunak, Søren
We have developed a new method for the identification of signal peptides and their cleavage based on neural networks trained on separate sets of prokaryotic and eukaryotic sequence. The method performs significantly better than previous prediction schemes and can easily be applied on genome...
Speijer, D.; Lukeš, Julius; Eliáš, M.
Roč. 112, č. 29 (2015), s. 8827-8834 ISSN 0027-8424 R&D Projects: GA MŠk LH12104; GA ČR GA15-21974S Institutional support: RVO:60077344 Keywords : reactive oxygen species * evolution * protists * eukaryotes * sex Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 9.423, year: 2015
Lyngsø, C.; Kjaerulff, S.; Muller, S.
in vivo selection system for folded proteins. It is based on genetic complementation of the Schizosaccharomyces pombe growth marker gene invertase fused C-terminally to a protein library. The fusion proteins are directed to the secretion system, utilizing the ability of the eukaryotic protein quality...
SINA M. ADL; ALASTAIR G. B. SIMPSON; MARK A. FARMER; ROBERT A. ANDERSEN; O. ROGER ANDERSON; JOHN R. BARTA; SAMUEL S. BOWSER; GUY BRUGEROLLE; ROBERT A. FENSOME; SUZANNE FREDERICQ; TIMOTHY Y. JAMES; SERGEI KARPOV; PAUL KUGRENS; JOHN KRUG; CHRISTOPHER E. LANE; LOUISE A. LEWIS; JEAN LODGE; DENIS H. LYNN; DAVID G. MANN; RICHARD M. MCCOURT; LEONEL MENDOZA; ØJVIND MOESTRUP; SHARON E. MOZLEY-STANDRIDGE; THOMAS A. NERAD; CAROL A. SHEARER; ALEXEY V. SMIRNOV; FREDERICK W. SPIEGEL; MAX F.J.R. TAYLOR
This revision of the classification of unicellular eukaryotes updates that of Levine et al. (1980) for the protozoa and expands it to include other protists. Whereas the previous revision was primarily to incorporate the results of ultrastructural studies, this revision incorporates results from both ultrastructural research since 1980 and molecular phylogenetic...
Sina M. Adl; Alastair G.B. Simpson; Mark A. Farmer; Robert A. Andersen; O. Roger Anderson; John R. Barta; Samuel S. Bowser; Guy Brugerolle; Robert A. Fensome; Suzanne Fredericq; Timothy Y. James; Sergei Karpov; Paul Kugrens; John Krug; Christopher E. Lane; Louise A. Lewis; Jean Lodge; Denis H. Lynn; David G. Mann; Richard M. McCourt; Leonel Mendoza; Ojvind Moestrup; Sharon E. Mozley-Standridge; Thomas A. Nerad; Carol A. Shearer; Alexey V. Smirnov; Frederick W. Speigel; Max F.J.R. Taylor
This revision of the classification of unicellular eukaryotes updates that of Levine et al. (1980) for the protozoa and expands it to include other protists. Whereas the previous revision was primarily to incorporate the results of ultrastructural studies, this revision incorporates results from both ultrastructural research since 1980 and molecular phylogenetic...
Rodríguez-Cruz, Maricela; Coral-Vázquez, Ramón M.; Hernández-Stengele, Gabriel; Sánchez, Raúl; Salazar, Emmanuel; Sanchez-Muñoz, Fausto; Encarnación-Guevara, Sergio; Ramírez-Salcedo, Jorge
The mammary gland (MG) undergoes functional and metabolic changes during the transition from pregnancy to lactation, possibly by regulation of conserved genes. The objective was to elucidate orthologous genes, chromosome clusters and putative conserved transcriptional modules during MG development. We analyzed expression of 22,000 transcripts using murine microarrays and RNA samples of MG from virgin, pregnant, and lactating rats by cross-species hybridization. We identified 521 transcripts differentially expressed; upregulated in early (78%) and midpregnancy (89%) and early lactation (64%), but downregulated in mid-lactation (61%). Putative orthologous genes were identified. We mapped the altered genes to orthologous chromosomal locations in human and mouse. Eighteen sets of conserved genes associated with key cellular functions were revealed and conserved transcription factor binding site search entailed possible coregulation among all eight block sets of genes. This study demonstrates that the use of heterologous array hybridization for screening of orthologous gene expression from rat revealed sets of conserved genes arranged in chromosomal order implicated in signaling pathways and functional ontology. Results demonstrate the utilization power of comparative genomics and prove the feasibility of using rodent microarrays to identification of putative coexpressed orthologous genes involved in the control of human mammary gland development. PMID:24288657
Fawcett, S. E.; Lomas, M. W.; Ward, B. B.; Sigman, D. M.
The Sargasso Sea is characterized by a short period of deep vertical mixing in the late winter and early spring, followed by strong thermal stratification during the summer. Stratification persists into the fall, impeding the upward flux of nitrate from depth so that recycled forms of nitrogen (N) such as ammonium are thought to support most primary production. We collected particles from surface waters during March, July, October, and December, used flow cytometry to separate the prokaryotic and eukaryotic phytoplankton, and analyzed their respective 15N/14N. In all months, the 15N/14N of the prokaryotic genera, Prochlorococcus and Synechococcus, was low, indicative of reliance on recycled N throughout the year. In July, the 15N/14N of eukaryotic phytoplankton was variable but consistently higher than that of the prokaryotes, reflecting eukaryotic consumption of subsurface nitrate. Two eukaryotic profiles from October and December were similar to those from July. In three other fall profiles, the eukaryotes had a 15N/14N similar to that of the prokaryotes, suggesting a switch toward greater reliance on recycled N. This change in the dominant N source supporting eukaryotic production appears to be driven by the density structure of the upper water column. The very shallow low-density surface "mixed layer" (≤20 m) that develops in early-to-mid summer does not contribute to stratification at the base of the euphotic zone, and subsurface nitrate can mix up into the lower euphotic zone, facilitating continued production. The deepening of the mixed layer into the fall, typically taken as an indication of weaker overall stratification, actually strengthens the isolation of the euphotic zone as a whole, reducing the upward supply of nitrate to the photosynthetically active layer. The same counterintuitive dynamic explains the latitudinal patterns in a set of three October depth profiles. Two northern stations (32°N and 27°N) were characterized by a thick, low
Hesketh, Andy; Vergnano, Marta; Wan, Chris; Oliver, Stephen G
We have engineered Saccharomyces cerevisiae to inducibly synthesize the prokaryotic signaling nucleotides cyclic di-GMP (cdiGMP), cdiAMP, and ppGpp in order to characterize the range of effects these nucleotides exert on eukaryotic cell function during bacterial pathogenesis. Synthetic genetic array (SGA) and transcriptome analyses indicated that, while these compounds elicit some common reactions in yeast, there are also complex and distinctive responses to each of the three nucleotides. All three are capable of inhibiting eukaryotic cell growth, with the guanine nucleotides exhibiting stronger effects than cdiAMP. Mutations compromising mitochondrial function and chromatin remodeling show negative epistatic interactions with all three nucleotides. In contrast, certain mutations that cause defects in chromatin modification and ribosomal protein function show positive epistasis, alleviating growth inhibition by at least two of the three nucleotides. Uniquely, cdiGMP is lethal both to cells growing by respiration on acetate and to obligately fermentative petite mutants. cdiGMP is also synthetically lethal with the ribonucleotide reductase (RNR) inhibitor hydroxyurea. Heterologous expression of the human ppGpp hydrolase Mesh1p prevented the accumulation of ppGpp in the engineered yeast and restored cell growth. Extensive in vivo interactions between bacterial signaling molecules and eukaryotic gene function occur, resulting in outcomes ranging from growth inhibition to death. cdiGMP functions through a mechanism that must be compensated by unhindered RNR activity or by functionally competent mitochondria. Mesh1p may be required for abrogating the damaging effects of ppGpp in human cells subjected to bacterial infection. IMPORTANCE During infections, pathogenic bacteria can release nucleotides into the cells of their eukaryotic hosts. These nucleotides are recognized as signals that contribute to the initiation of defensive immune responses that help the infected
Jeremy G Wideman
Full Text Available MAPL (mitochondria-associated protein ligase, also called MULAN/GIDE/MUL1 is a multifunctional mitochondrial outer membrane protein found in human cells that contains a unique BAM (beside a membrane domain and a C-terminal RING-finger domain. MAPL has been implicated in several processes that occur in animal cells such as NF-kB activation, innate immunity and antiviral signaling, suppression of PINK1/parkin defects, mitophagy in skeletal muscle, and caspase-dependent apoptosis. Previous studies demonstrated that the BAM domain is present in diverse organisms in which most of these processes do not occur, including plants, archaea, and bacteria. Thus the conserved function of MAPL and its BAM domain remains an open question. In order to gain insight into its conserved function, we investigated the evolutionary origins of MAPL by searching for homologues in predicted proteomes of diverse eukaryotes. We show that MAPL proteins with a conserved BAM-RING architecture are present in most animals, protists closely related to animals, a single species of fungus, and several multicellular plants and related green algae. Phylogenetic analysis demonstrated that eukaryotic MAPL proteins originate from a common ancestor and not from independent horizontal gene transfers from bacteria. We also determined that two independent duplications of MAPL occurred, one at the base of multicellular plants and another at the base of vertebrates. Although no other eukaryote genome examined contained a verifiable MAPL orthologue, BAM domain-containing proteins were identified in the protists Bigelowiella natans and Ectocarpus siliculosis. Phylogenetic analyses demonstrated that these proteins are more closely related to prokaryotic BAM proteins and therefore likely arose from independent horizontal gene transfers from bacteria. We conclude that MAPL proteins with BAM-RING architectures have been present in the holozoan and viridiplantae lineages since their very beginnings
Permal, Emmanuelle; Flutre, Timothée; Quesneville, Hadi
Current high-throughput techniques have made it feasible to sequence even the genomes of non-model organisms. However, the annotation process now represents a bottleneck to genome analysis, especially when dealing with transposable elements (TE). Combined approaches, using both de novo and knowledge-based methods to detect TEs, are likely to produce reasonably comprehensive and sensitive results. This chapter provides a roadmap for researchers involved in genome projects to address this issue. At each step of the TE annotation process, from the identification of TE families to the annotation of TE copies, we outline the tools and good practices to be used.
Pandey, Gaurav; Arora, Sonali; Manocha, Sahil; Whalen, Sean
Protein interaction networks are a promising type of data for studying complex biological systems. However, despite the rich information embedded in these networks, these networks face important data quality challenges of noise and incompleteness that adversely affect the results obtained from their analysis. Here, we apply a robust measure of local network structure called common neighborhood similarity (CNS) to address these challenges. Although several CNS measures have been proposed in the literature, an understanding of their relative efficacies for the analysis of interaction networks has been lacking. We follow the framework of graph transformation to convert the given interaction network into a transformed network corresponding to a variety of CNS measures evaluated. The effectiveness of each measure is then estimated by comparing the quality of protein function predictions obtained from its corresponding transformed network with those from the original network. Using a large set of human and fly protein interactions, and a set of over GO terms for both, we find that several of the transformed networks produce more accurate predictions than those obtained from the original network. In particular, the measure and other continuous CNS measures perform well this task, especially for large networks. Further investigation reveals that the two major factors contributing to this improvement are the abilities of CNS measures to prune out noisy edges and enhance functional coherence in the transformed networks. PMID:25275489
Full Text Available BACKGROUND: Microbial eukaryotes represent an important component of the human gut microbiome, with different beneficial or harmful roles; some species are commensal or mutualistic, whereas others are opportunistic or parasitic. The diversity of eukaryotes inhabiting humans remains relatively unexplored because of either the low abundance of these organisms in human gut or because they have received limited attention from a whole-community perspective. METHODOLOGY/PRINCIPAL FINDING: In this study, a single fecal sample from a healthy African male was studied using both culture-dependent methods and extended molecular methods targeting the 18S rRNA and ITS sequences. Our results revealed that very few fungi, including Candida spp., Galactomyces spp., and Trichosporon asahii, could be isolated using culture-based methods. In contrast, a relatively a high number of eukaryotic species could be identified in this fecal sample when culture-independent methods based on various primer sets were used. A total of 27 species from one sample were found among the 977 analyzed clones. The clone libraries were dominated by fungi (716 clones/977, 73.3%, corresponding to 16 different species. In addition, 187 sequences out of 977 (19.2% corresponded to 9 different species of plants; 59 sequences (6% belonged to other micro-eukaryotes in the gut, including Entamoeba hartmanni and Blastocystis sp; and only 15 clones/977 (1.5% were related to human 18S rRNA sequences. CONCLUSION: Our results revealed a complex eukaryotic community in the volunteer's gut, with fungi being the most abundant species in the stool sample. Larger investigations are needed to assess the generality of these results and to understand their roles in human health and disease.
Filker, Sabine; Sommaruga, Ruben; Vila, Irma; Stoeck, Thorsten
Microbial eukaryotes hold a key role in aquatic ecosystem functioning. Yet, their diversity in freshwater lakes, particularly in high-mountain lakes, is relatively unknown compared with the marine environment. Low nutrient availability, low water temperature and high ultraviolet radiation make most high-mountain lakes extremely challenging habitats for life and require specific molecular and physiological adaptations. We therefore expected that these ecosystems support a plankton diversity that differs notably from other freshwater lakes. In addition, we hypothesized that the communities under study exhibit geographic structuring. Our rationale was that geographic dispersal of small-sized eukaryotes in high-mountain lakes over continental distances seems difficult. We analysed hypervariable V4 fragments of the SSU rRNA gene to compare the genetic microbial eukaryote diversity in high-mountain lakes located in the European Alps, the Chilean Altiplano and the Ethiopian Bale Mountains. Microbial eukaryotes were not globally distributed corroborating patterns found for bacteria, multicellular animals and plants. Instead, the plankton community composition emerged as a highly specific fingerprint of a geographic region even on higher taxonomic levels. The intraregional heterogeneity of the investigated lakes was mirrored in shifts in microbial eukaryote community structure, which, however, was much less pronounced compared with interregional beta-diversity. Statistical analyses revealed that on a regional scale, environmental factors are strong predictors for plankton community structures in high-mountain lakes. While on long-distance scales (>10 000 km), isolation by distance is the most plausible scenario, on intermediate scales (up to 6000 km), both contemporary environmental factors and historical contingencies interact to shift plankton community structures. © 2016 John Wiley & Sons Ltd.
Carmel, Liran; Wolf, Yuri I; Rogozin, Igor B; Koonin, Eugene V
Several contrasting scenarios have been proposed for the origin and evolution of spliceosomal introns, a hallmark of eukaryotic genes. A comprehensive probabilistic model to obtain a definitive reconstruction of intron evolution was developed and applied to 391 sets of conserved genes from 19 eukaryotic species. It is inferred that a relatively high intron density was reached early, i.e., the last common ancestor of eukaryotes contained >2.15 introns/kilobase, and the last common ancestor of multicellular life forms harbored approximately 3.4 introns/kilobase, a greater intron density than in most of the extant fungi and in some animals. The rates of intron gain and intron loss appear to have been dropping during the last approximately 1.3 billion years, with the decline in the gain rate being much steeper. Eukaryotic lineages exhibit three distinct modes of evolution of the intron-exon structure. The primary, balanced mode, apparently, operates in all lineages. In this mode, intron gain and loss are strongly and positively correlated, in contrast to previous reports on inverse correlation between these processes. The second mode involves an elevated rate of intron loss and is prevalent in several lineages, such as fungi and insects. The third mode, characterized by elevated rate of intron gain, is seen only in deep branches of the tree, indicating that bursts of intron invasion occurred at key points in eukaryotic evolution, such as the origin of animals. Intron dynamics could depend on multiple mechanisms, and in the balanced mode, gain and loss of introns might share common mechanistic features.
Full Text Available Precise genome editing in livestock is of great value for the fundamental investigation of disease modeling. However, genetically modified pigs carrying subtle point mutations were still seldom reported despite the rapid development of programmable endonucleases. Here, we attempt to investigate single-stranded oligonucleotides (ssODN mediated knockin by introducing two orthologous pathogenic mutations, p.E693G for Alzheimer's disease and p.G2019S for Parkinson's disease, into porcine APP and LRRK2 loci, respectively. Desirable homology-directed repair (HDR efficiency was achieved in porcine fetal fibroblasts (PFFs by optimizing the dosage and length of ssODN templates. Interestingly, incomplete HDR alleles harboring partial point mutations were observed in single-cell colonies, which indicate the complex mechanism of ssODN-mediated HDR. The effect of mutation-to-cut distance on incorporation rate was further analyzed by deep sequencing. We demonstrated that a mutation-to-cut distance of 11 bp resulted in a remarkable difference in HDR efficiency between two point mutations. Finally, we successfully obtained one cloned piglet harboring the orthologous p.C313Y mutation at the MSTN locus via somatic cell nuclear transfer (SCNT. Our proof-of-concept study demonstrated efficient ssODN-mediated incorporation of pathogenic point mutations in porcine somatic cells, thus facilitating further development of disease modeling and genetic breeding in pigs.
Full Text Available Cell division cycle 25 (cdc25 is an evolutionarily conserved phosphatase that promotes cell cycle progression. Among the four cdc25 orthologs in Caenorhabditis elegans, we found that cdc-25.4 mutant males failed to produce outcrossed progeny. This was not caused by defects in sperm development, but by defects in male mating behavior. The cdc-25.4 mutant males showed various defects during male mating, including contact response, backing, turning, and vulva location. Aberrant turning behavior was the most prominent defect in the cdc-25.4 mutant males. We also found that cdc-25.4 is expressed in many neuronal cells throughout development. The turning defect in cdc-25.4 mutant males was recovered by cdc-25.4 transgenic expression in neuronal cells, suggesting that cdc-25.4 functions in neurons for male mating. However, the neuronal morphology of cdc-25.4 mutant males appeared to be normal, as examined with several neuronal markers. Also, RNAi depletion of wee-1.3, a C. elegans ortholog of Wee1/Myt1 kinase, failed to suppress the mating defects of cdc-25.4 mutant males. These findings suggest that, for successful male mating, cdc-25.4 does not target cell cycles that are required for neuronal differentiation and development. Rather, cdc-25.4 likely regulates noncanonical substrates in neuronal cells.
Li, Zhi-Yong; Wang, Yue-Zhu; He, Li-Ming; Zheng, Hua-Jun
The whole metabolism of a sponge holobiont and the respective contributions of prokaryotic and eukaryotic symbionts and their associations with the sponge host remain largely unclear. Meanwhile, compared with shallow water sponges, deep-sea sponges are rarely understood. Here we report the metagenomic exploration of deep-sea sponge Neamphius huxleyi at the whole community level. Metagenomic data showed phylogenetically diverse prokaryotes and eukaryotes in Neamphius huxleyi. MEGAN and gene enrichment analyses indicated different metabolic potentials of prokaryotic symbionts from eukaryotic symbionts, especially in nitrogen and carbon metabolisms, and their molecular interactions with the sponge host. These results supported the hypothesis that prokaryotic and eukaryotic symbionts have different ecological roles and relationships with sponge host. Moreover, vigorous denitrification, and CO2 fixation by chemoautotrophic prokaryotes were suggested for this deep-sea sponge. The study provided novel insights into the respective potentials of prokaryotic and eukaryotic symbionts and their associations with deep-sea sponge Neamphius huxleyi. PMID:24463735
Li, Zhi-Yong; Wang, Yue-Zhu; He, Li-Ming; Zheng, Hua-Jun
The whole metabolism of a sponge holobiont and the respective contributions of prokaryotic and eukaryotic symbionts and their associations with the sponge host remain largely unclear. Meanwhile, compared with shallow water sponges, deep-sea sponges are rarely understood. Here we report the metagenomic exploration of deep-sea sponge Neamphius huxleyi [corrected] . at the whole community level. Metagenomic data showed phylogenetically diverse prokaryotes and eukaryotes in Neamphius huxleyi [corrected]. MEGAN and gene enrichment analyses indicated different metabolic potentials of prokaryotic symbionts from eukaryotic symbionts, especially in nitrogen and carbon metabolisms, and their molecular interactions with the sponge host. These results supported the hypothesis that prokaryotic and eukaryotic symbionts have different ecological roles and relationships with sponge host. Moreover, vigorous denitrification, and CO2 fixation by chemoautotrophic prokaryotes were suggested for this deep-sea sponge. The study provided novel insights into the respective potentials of prokaryotic and eukaryotic symbionts and their associations with deep-sea sponge Neamphius huxleyi [corrected].
Li, Zhi-Yong; Wang, Yue-Zhu; He, Li-Ming; Zheng, Hua-Jun
The whole metabolism of a sponge holobiont and the respective contributions of prokaryotic and eukaryotic symbionts and their associations with the sponge host remain largely unclear. Meanwhile, compared with shallow water sponges, deep-sea sponges are rarely understood. Here we report the metagenomic exploration of deep-sea sponge Lamellomorpha sp. at the whole community level. Metagenomic data showed phylogenetically diverse prokaryotes and eukaryotes in Lamellomorpha sp.. MEGAN and gene enrichment analyses indicated different metabolic potentials of prokaryotic symbionts from eukaryotic symbionts, especially in nitrogen and carbon metabolisms, and their molecular interactions with the sponge host. These results supported the hypothesis that prokaryotic and eukaryotic symbionts have different ecological roles and relationships with sponge host. Moreover, vigorous denitrification, and CO2 fixation by chemoautotrophic prokaryotes were suggested for this deep-sea sponge. The study provided novel insights into the respective potentials of prokaryotic and eukaryotic symbionts and their associations with deep-sea sponge Lamellomorpha sp..
Full Text Available Abstract Sterols are key components of eukaryotic cellular membranes that are synthesized by multi-enzyme pathways that require molecular oxygen. Because prokaryotes fundamentally lack sterols, it is unclear how the vast diversity of bacterivorous eukaryotes that inhabit hypoxic environments obtain, or synthesize, sterols. Here we show that tetrahymanol, a triterpenoid that does not require molecular oxygen for its biosynthesis, likely functions as a surrogate of sterol in eukaryotes inhabiting oxygen-poor environments. Genes encoding the tetrahymanol synthesizing enzyme squalene-tetrahymanol cyclase were found from several phylogenetically diverged eukaryotes that live in oxygen-poor environments and appear to have been laterally transferred among such eukaryotes. Reviewers This article was reviewed by Eric Bapteste and Eugene Koonin.
Cristina D. Vianna-Soares
Full Text Available Two different hydrophilic acrylate beads were prepared via aqueous suspension polymerization. Beads produced of a hydroxypropyl methacrylate (HPMA and ethyleneglycol methacrylate (EDMA copolymer were obtained using a polyvinyl alcohol suspending medium. Copolymers of 2hydroxyethyl methacrylate (HEMA, methyl methacrylate (MMA and ethyleneglycol methacrylate (EDMA beads were obtained using magnesium hydroxide as the suspending agent. Following characterization by scanning electron microscopy (SEM, nitrogen sorption analysis (NSA and mercury intrusion porosimetry (MIP, the beads were cultured with monkey fibroblasts (COS7 to evaluate their ability to support cell growth, attachment and adhesion. Cell growth behavior onto small HPMA/EDMA copolymer beads and large HEMA/MMA/EDMA copolymer beads is evaluated regarding their hidrophilicity/hidrophobicity and surface roughness.
Garcia, Nathan S; Sexton, Julie; Riggins, Tracey; Brown, Jeff; Lomas, Michael W; Martiny, Adam C
Current hypotheses suggest that cellular elemental stoichiometry of marine eukaryotic phytoplankton such as the ratios of cellular carbon:nitrogen:phosphorus (C:N:P) vary between phylogenetic groups. To investigate how phylogenetic structure, cell volume, growth rate, and temperature interact to affect the cellular elemental stoichiometry of marine eukaryotic phytoplankton, we examined the C:N:P composition in 30 isolates across 7 classes of marine phytoplankton that were grown with a sufficient supply of nutrients and nitrate as the nitrogen source. The isolates covered a wide range in cell volume (5 orders of magnitude), growth rate (temperature (2-24°C). Our analysis indicates that C:N:P is highly variable, with statistical model residuals accounting for over half of the total variance and no relationship between phylogeny and elemental stoichiometry. Furthermore, our data indicated that variability in C:P, N:P, and C:N within Bacillariophyceae (diatoms) was as high as that among all of the isolates that we examined. In addition, a linear statistical model identified a positive relationship between diatom cell volume and C:P and N:P. Among all of the isolates that we examined, the statistical model identified temperature as a significant factor, consistent with the temperature-dependent translation efficiency model, but temperature only explained 5% of the total statistical model variance. While some of our results support data from previous field studies, the high variability of elemental ratios within Bacillariophyceae contradicts previous work that suggests that this cosmopolitan group of microalgae has consistently low C:P and N:P ratios in comparison with other groups.
Rochi, Lucia; Diéguez, María José; Burguener, Germán; Darino, Martín Alejandro; Pergolesi, María Fernanda; Ingala, Lorena Romina; Cuyeu, Alba Romina; Turjanski, Adrián; Kreff, Enrique Domingo; Sacco, Francisco
Rust fungi are one of the most devastating pathogens of crop plants. The biotrophic fungus Puccinia sorghi Schwein (Ps) is responsible for maize common rust, an endemic disease of maize (Zea mays L.) in Argentina that causes significant yield losses in corn production. In spite of this, the Ps genomic sequence was not available. We used Illumina sequencing to rapidly produce the 99.6Mbdraft genome sequence of Ps race RO10H11247, derived from a single-uredinial isolate from infected maize leaves collected in the Argentine Corn Belt Region during 2010. High quality reads were obtained from 200bppaired-end and 5000bpmate-paired libraries and assembled in 15,722 scaffolds. A pipeline which combined an ab initio program with homology-based models and homology to in planta enriched ESTs from four cereal pathogenic fungus (the three sequenced wheat rusts and Ustilago maydis) was used to identify 21,087 putative coding sequences, of which 1599 might be part of the Ps RO10H11247 secretome. Among the 458 highly conserved protein families from the euKaryotic Orthologous Groups (KOG) that occur in a wide range of eukaryotic organisms, 97.5% have at least one member with high homology in the Ps assembly (TBlastN, E-value⩽e-10) covering more than 50% of the length of the KOG protein. Comparative studies with the three sequenced wheat rust fungus, and microsynteny analysis involving Puccinia striiformis f. sp. tritici (Pst, wheat stripe rust fungus), support the quality achieved. The results presented here show the effectiveness of the Illumina strategy for sequencing dikaryotic genomes of non-model organisms and provides reliable DNA sequence information for genomic studies, including pathogenic mechanisms of this maize fungus and molecular marker design. Copyright © 2016 Elsevier Inc. All rights reserved.
Background Xanthophylls, oxygenated derivatives of carotenes, play critical roles in photosynthetic apparatus of cyanobacteria, algae, and higher plants. Although the xanthophylls biosynthetic pathway of algae is largely unknown, it is of particular interest because they have a very complicated evolutionary history. Carotenoid hydroxylase (CHY) is an important protein that plays essential roles in xanthophylls biosynthesis. With the availability of 18 sequenced algal genomes, we performed a comprehensive comparative analysis of chy genes and explored their distribution, structure, evolution, origins, and expression. Results Overall 60 putative chy genes were identified and classified into two major subfamilies (bch and cyp97) according to their domain structures. Genes in the bch subfamily were found in 10 green algae and 1 red alga, but absent in other algae. In the phylogenetic tree, bch genes of green algae and higher plants share a common ancestor and are of non-cyanobacterial origin, whereas that of red algae is of cyanobacteria. The homologs of cyp97a/c genes were widespread only in green algae, while cyp97b paralogs were seen in most of algae. Phylogenetic analysis on cyp97 genes supported the hypothesis that cyp97b is an ancient gene originated before the formation of extant algal groups. The cyp97a gene is more closely related to cyp97c in evolution than to cyp97b. The two cyp97 genes were isolated from the green alga Haematococcus pluvialis, and transcriptional expression profiles of chy genes were observed under high light stress of different wavelength. Conclusions Green algae received a β-xanthophylls biosynthetic pathway from host organisms. Although red algae inherited the pathway from cyanobacteria during primary endosymbiosis, it remains unclear in Chromalveolates. The α-xanthophylls biosynthetic pathway is a common feature in green algae and higher plants. The origination of cyp97a/c is most likely due to gene duplication before divergence of
Cui, Hongli; Yu, Xiaona; Wang, Yan; Cui, Yulin; Li, Xueqin; Liu, Zhaopu; Qin, Song
Xanthophylls, oxygenated derivatives of carotenes, play critical roles in photosynthetic apparatus of cyanobacteria, algae, and higher plants. Although the xanthophylls biosynthetic pathway of algae is largely unknown, it is of particular interest because they have a very complicated evolutionary history. Carotenoid hydroxylase (CHY) is an important protein that plays essential roles in xanthophylls biosynthesis. With the availability of 18 sequenced algal genomes, we performed a comprehensive comparative analysis of chy genes and explored their distribution, structure, evolution, origins, and expression. Overall 60 putative chy genes were identified and classified into two major subfamilies (bch and cyp97) according to their domain structures. Genes in the bch subfamily were found in 10 green algae and 1 red alga, but absent in other algae. In the phylogenetic tree, bch genes of green algae and higher plants share a common ancestor and are of non-cyanobacterial origin, whereas that of red algae is of cyanobacteria. The homologs of cyp97a/c genes were widespread only in green algae, while cyp97b paralogs were seen in most of algae. Phylogenetic analysis on cyp97 genes supported the hypothesis that cyp97b is an ancient gene originated before the formation of extant algal groups. The cyp97a gene is more closely related to cyp97c in evolution than to cyp97b. The two cyp97 genes were isolated from the green alga Haematococcus pluvialis, and transcriptional expression profiles of chy genes were observed under high light stress of different wavelength. Green algae received a β-xanthophylls biosynthetic pathway from host organisms. Although red algae inherited the pathway from cyanobacteria during primary endosymbiosis, it remains unclear in Chromalveolates. The α-xanthophylls biosynthetic pathway is a common feature in green algae and higher plants. The origination of cyp97a/c is most likely due to gene duplication before divergence of green algae and higher plants
Full Text Available Lepidopteran pheromone receptors (PRs, for which orthologies are evident among closely related species, provide an intriguing example of gene family evolution in terms of how new functions may arise. However, only a limited number of PRs have been functionally characterized so far and thus evolutionary scenarios suffer from elements of speculation. In this study we investigated the turnip moth Agrotis segetum, in which female moths produce a mixture of chemically related pheromone components that elicit specific responses from receptor cells on male antennae. We cloned nine A. segetum PR genes and the Orco gene by degenerate primer based RT-PCR. The nine PR genes, named as AsegOR1 and AsegOR3-10, fall into four distinct orthologous clusters of known lepidopteran PRs, of which one contains six paralogues. The paralogues are under relaxed selective pressure, contrasting with the purifying selection on other clusters. We identified the receptors AsegOR9, AsegOR4 and AsegOR5, specific for the respective homologous pheromone components (Z-5-decenyl, (Z-7-dodecenyl and (Z-9-tetradecenyl acetates, by two-electrode voltage clamp recording from Xenopus laevis oocytes co-expressing Orco and each PR candidate. These receptors occur in three different orthologous clusters. We also found that the six paralogues with high sequence similarity vary dramatically in ligand selectivity and sensitivity. Different from AsegOR9, AsegOR6 showed a relatively large response to the behavioural antagonist (Z-5-decenol, and a small response to (Z-5-decenyl acetate. AsegOR1 was broadly tuned, but most responsive to (Z-5-decenyl acetate, (Z-7-dodecenyl acetate and the behavioural antagonist (Z-8-dodecenyl acetate. AsegOR8 and AsegOR7, which differ from AsegOR6 and AsegOR1 by 7 and 10 aa respectively, showed much lower sensitivities. AsegOR10 showed only small responses to all the tested compounds. These results suggest that new receptors arise through gene duplication, and
Lee, Youngdeuk; Elvitigala, Don Anushka Sandaruwan; Whang, Ilson; Lee, Sukkyoung; Kim, Hyowon; Zoysa, Mahanama De; Oh, Chulhong; Kang, Do-Hyung; Lee, Jehee
Immune signaling cascades have an indispensable role in the host defense of almost all the organisms. Tumor necrosis factor (TNF) signaling is considered as a prominent signaling pathway in vertebrate as well as invertebrate species. Within the signaling cascade, TNF receptor-associated factor (TRAF) and TNF receptor-associated protein (TTRAP) has been shown to have a crucial role in the modulation of immune signaling in animals. Here, we attempted to characterize a novel molluskan ortholog of TTRAP (AbTTRAP) from disk abalone (Haliotis discus discus) and analyzed its expression levels under pathogenic stress. The complete coding sequence of AbTTRAP consisted of 1071 nucleotides, coding for a 357 amino acid peptide, with a predicted molecular mass of 40 kDa. According to our in-silico analysis, AbTTRAP resembled the typical TTRAP domain architecture, including a 5'-tyrosyl DNA phosphodiesterase domain. Moreover, phylogenetic analysis revealed its common ancestral invertebrate origin, where AbTTRAP was clustered with molluskan counterparts. Quantitative real time PCR showed universally distributed expression of AbTTRAP in selected tissues of abalone, from which more prominent expression was detected in hemocytes. Upon stimulation with two pathogen-derived mitogens, lipopolysaccharide (LPS) and polyinosinic:polycytidylic acid (poly I:C), transcript levels of AbTTRAP in hemocytes and gill tissues were differentially modulated with time. In addition, the recombinant protein of AbTTRAP exhibited prominent endonuclease activity against abalone genomic DNA, which was enhanced by the presence of Mg(2+) in the medium. Collectively, these results reinforce the existence of the TNF signaling cascade in mollusks like disk abalone, further implicating the putative regulatory behavior of TTRAP in invertebrate host pathology. Copyright © 2014 Elsevier Ltd. All rights reserved.
Buey, Rubén M.; Ledesma-Amaro, Rodrigo; Velázquez-Campoy, Adrián; Balsera, Mónica; Chagoyen, Mónica; de Pereda, José M.; Revuelta, José L.
Inosine-5'-monophosphate dehydrogenase (IMPDH) plays key roles in purine nucleotide metabolism and cell proliferation. Although IMPDH is a widely studied therapeutic target, there is limited information about its physiological regulation. Using Ashbya gossypii as a model, we describe the molecular mechanism and the structural basis for the allosteric regulation of IMPDH by guanine nucleotides. We report that GTP and GDP bind to the regulatory Bateman domain, inducing octamers with compromised catalytic activity. Our data suggest that eukaryotic and prokaryotic IMPDHs might have developed different regulatory mechanisms, with GTP/GDP inhibiting only eukaryotic IMPDHs. Interestingly, mutations associated with human retinopathies map into the guanine nucleotide-binding sites including a previously undescribed non-canonical site and disrupt allosteric inhibition. Together, our results shed light on the mechanisms of the allosteric regulation of enzymes mediated by Bateman domains and provide a molecular basis for certain retinopathies, opening the door to new therapeutic approaches.
Alkalaeva, Elena; Mikhailova, Tatiana
The genetic code determines how amino acids are encoded within mRNA. It is universal among the vast majority of organisms, although several exceptions are known. Variant genetic codes are found in ciliates, mitochondria, and numerous other organisms. All revealed genetic codes (standard and variant) have at least one codon encoding a translation stop signal. However, recently two new genetic codes with a reassignment of all three stop codons were revealed in studies examining the protozoa transcriptomes. Here, we discuss this finding and the recent studies of variant genetic codes in eukaryotes. We consider the possible molecular mechanisms allowing the use of certain codons as sense and stop signals simultaneously. The results obtained by studying these amazing organisms represent a new and exciting insight into the mechanism of stop codon decoding in eukaryotes. Also see the video abstract here. © 2017 WILEY Periodicals, Inc.
Takahashi, Hideo; Shimada, Ichio
The preparation of stable isotope-labeled proteins is necessary for the application of a wide variety of NMR methods, to study the structures and dynamics of proteins and protein complexes. The E. coli expression system is generally used for the production of isotope-labeled proteins, because of the advantages of ease of handling, rapid growth, high-level protein production, and low cost for isotope-labeling. However, many eukaryotic proteins are not functionally expressed in E. coli, due to problems related to disulfide bond formation, post-translational modifications, and folding. In such cases, other expression systems are required for producing proteins for biomolecular NMR analyses. In this paper, we review the recent advances in expression systems for isotopically labeled heterologous proteins, utilizing non-E. coli prokaryotic and eukaryotic cells.
Deng, Feilong; Chen, Shi-Yi; Wu, Zhou-Lin; Hu, Yongsong; Jia, Xianbo; Lai, Song-Jia
Owing to wide application of RNA sequencing (RNA-seq) technology, more and more eukaryotic genomes have been extensively annotated, such as the gene structure, alternative splicing, and noncoding loci. Annotation information of genome is prevalently stored as plain text in General Feature Format (GFF), which could be hundreds or thousands Mb in size. Therefore, it is a challenge for manipulating GFF file for biologists who have no bioinformatic skill. In this study, we provide a web server (GFFview) for parsing the annotation information of eukaryotic genome and then generating statistical description of six indices for visualization. GFFview is very useful for investigating quality and difference of the de novo assembled transcriptome in RNA-seq studies.
Khan, Mohammad Farhan; Spurgeon, Sarah; von der Haar, Tobias
Phosphorylation of eukaryotic translation initiation factor 2 (eIF2) is one of the best studied and most widely used means for regulating protein synthesis activity in eukaryotic cells. This pathway regulates protein synthesis in response to stresses, viral infections, and nutrient depletion, among others. We present analyses of an ordinary differential equation-based model of this pathway, which aim to identify its principal robustness-conferring features. Our analyses indicate that robustness is a distributed property, rather than arising from the properties of any one individual pathway species. However, robustness-conferring properties are unevenly distributed between the different species, and we identify a guanine nucleotide dissociation inhibitor (GDI) complex as a species that likely contributes strongly to the robustness of the pathway. Our analyses make further predictions on the dynamic response to different types of kinases that impinge on eIF2. Copyright © 2018 Elsevier Ltd. All rights reserved.
Goldar, A.; Arneodo, A.; Audit, B.; Argoul, F.; Rappailles, A.; Guilbaud, G.; Petryk, N.; Kahli, M.; Hyrien, O.
We propose a non-local model of DNA replication that takes into account the observed uncertainty on the position and time of replication initiation in eukaryote cell populations. By picturing replication initiation as a two-state system and considering all possible transition configurations, and by taking into account the chromatin’s fractal dimension, we derive an analytical expression for the rate of replication initiation. This model predicts with no free parameter the temporal profiles of initiation rate, replication fork density and fraction of replicated DNA, in quantitative agreement with corresponding experimental data from both S. cerevisiae and human cells and provides a quantitative estimate of initiation site redundancy. This study shows that, to a large extent, the program that regulates the dynamics of eukaryotic DNA replication is a collective phenomenon that emerges from the stochastic nature of replication origins initiation.
Currently, the best scenario for earliest forms of life is based on RNA molecules as they have the proven ability to catalyze enzymatic reactions and harbor genetic information. Evolutionary principles valid today become apparent in such models already. Furthermore, many features of eukaryotic genome architecture might have their origins in an RNA or RNA/protein (RNP) world, including the onset of a further transition, when DNA replaced RNA as the genetic bookkeeper of the cell. Chromosome maintenance, splicing, and regulatory function via RNA may be deeply rooted in the RNA/RNP worlds. Mostly in eukaryotes, conversion from RNA to DNA is still ongoing, which greatly impacts the plasticity of extant genomes. Raw material for novel genes encoding protein or RNA, or parts of genes including regulatory elements that selection can act on, continues to enter the evolutionary lottery. PMID:25081515
Qiu, Guo-Hua; Yang, Xiaoyan; Zheng, Xintian; Huang, Cuiqin
Traditionally, the genome has been described as the 'book of life'. However, the metaphor of a book may not reflect the dynamic nature of the structure and function of the genome. In the eukaryotic genome, the number of centrally located protein-coding sequences is relatively constant across species, but the amount of noncoding DNA increases considerably with the increase of organismal evolutional complexity. Therefore, it has been hypothesized that the abundant peripheral noncoding DNA protects the genome and the central protein-coding sequences in the eukaryotic genome. Upon comparison with the habitation, sociality and defense mechanisms of a social insect colony, it is found that the genome is similar to a social insect colony in various aspects. A social insect colony may thus be a better metaphor than a book to describe the spatial organization and physical functions of the genome. The potential implications of the metaphor are also discussed.
Lengsfeld, Bettina M.; Berry, Kayla N.; Ghosh, Sharmistha; Takahashi, Masateru; Francis, Nicole J.
Propagation of chromatin states through DNA replication is central to epigenetic regulation and can involve recruitment of chromatin proteins to replicating chromatin through interactions with replication fork components. Here we show using a fully reconstituted T7 bacteriophage system that eukaryotic proteins are not required to tether the Polycomb complex PRC1 to templates during DNA replication. Instead, DNA binding by PRC1 can withstand passage of a simple replication fork.
Mazroui, Rachid; Sukarieh, Rami; Bordeleau, Marie-Eve; Kaufman, Randal J.; Northcote, Peter; Tanaka, Junichi; Gallouzi, Imed; Pelletier, Jerry
Cytoplasmic aggregates known as stress granules (SGs) arise as a consequence of cellular stress and contain stalled translation preinitiation complexes. These foci are thought to serve as sites of mRNA storage or triage during the cell stress response. SG formation has been shown to require induction of eukaryotic initiation factor (eIF)2α phosphorylation. Herein, we investigate the potential role of other initiation factors in this process and demonstrate that interfering with eIF4A activity...
Full Text Available Ohr and OsmC proteins comprise two subfamilies within a large group of proteins that display Cys-based, thiol dependent peroxidase activity. These proteins were previously thought to be restricted to prokaryotes, but we show here, using iterated sequence searches, that Ohr/OsmC homologs are also present in 217 species of eukaryotes with a massive presence in Fungi (186 species. Many of these eukaryotic Ohr proteins possess an N-terminal extension that is predicted to target them to mitochondria. We obtained recombinant proteins for four eukaryotic members of the Ohr/OsmC family and three of them displayed lipoyl peroxidase activity. Further functional and biochemical characterization of the Ohr homologs from the ascomycete fungus Mycosphaerella fijiensis Mf_1 (MfOhr, the causative agent of Black Sigatoka disease in banana plants, was pursued. Similarly to what has been observed for the bacterial proteins, we found that: (i the peroxidase activity of MfOhr was supported by DTT or dihydrolipoamide (dithiols, but not by β-mercaptoethanol or GSH (monothiols, even in large excess; (ii MfOhr displayed preference for organic hydroperoxides (CuOOH and tBOOH over hydrogen peroxide; (iii MfOhr presented extraordinary reactivity towards linoleic acid hydroperoxides (k=3.18 (±2.13×108 M−1 s−1. Both Cys87 and Cys154 were essential to the peroxidase activity, since single mutants for each Cys residue presented no activity and no formation of intramolecular disulfide bond upon treatment with hydroperoxides. The pKa value of the Cysp residue was determined as 5.7±0.1 by a monobromobimane alkylation method. Therefore, eukaryotic Ohr peroxidases share several biochemical features with prokaryotic orthologues and are preferentially located in mitochondria. Keywords: Ohr/OsmC, Thiol-dependent peroxidases, Phylogeny
Li, Sanshu; Smith, Kathryn D.; Davis, Jared H.; Gordon, Patricia B.; Breaker, Ronald R.; Strobel, Scott A.
Although fluoride is plentiful in the environment and is commonly used at high concentrations in oral hygiene products, little has been known about how biological systems overcome the toxic effects of this anion. We demonstrate that a protein called FEX in many fungi is essential for cell survival in the presence of high fluoride concentrations. The protein is required for the rapid expulsion of cytoplasmic fluoride, indicating that many eukaryotic species that carry FEX genes likely avoid fl...
Meireles, D A; Domingos, R M; Gaiarsa, J W; Ragnoni, E G; Bannitz-Fernandes, R; da Silva Neto, J F; de Souza, R F; Netto, L E S
Ohr and OsmC proteins comprise two subfamilies within a large group of proteins that display Cys-based, thiol dependent peroxidase activity. These proteins were previously thought to be restricted to prokaryotes, but we show here, using iterated sequence searches, that Ohr/OsmC homologs are also present in 217 species of eukaryotes with a massive presence in Fungi (186 species). Many of these eukaryotic Ohr proteins possess an N-terminal extension that is predicted to target them to mitochondria. We obtained recombinant proteins for four eukaryotic members of the Ohr/OsmC family and three of them displayed lipoyl peroxidase activity. Further functional and biochemical characterization of the Ohr homologs from the ascomycete fungus Mycosphaerella fijiensis Mf_1 (MfOhr), the causative agent of Black Sigatoka disease in banana plants, was pursued. Similarly to what has been observed for the bacterial proteins, we found that: (i) the peroxidase activity of MfOhr was supported by DTT or dihydrolipoamide (dithiols), but not by β-mercaptoethanol or GSH (monothiols), even in large excess; (ii) MfOhr displayed preference for organic hydroperoxides (CuOOH and tBOOH) over hydrogen peroxide; (iii) MfOhr presented extraordinary reactivity towards linoleic acid hydroperoxides (k=3.18 (±2.13)×10 8 M -1 s -1 ). Both Cys 87 and Cys 154 were essential to the peroxidase activity, since single mutants for each Cys residue presented no activity and no formation of intramolecular disulfide bond upon treatment with hydroperoxides. The pK a value of the Cys p residue was determined as 5.7±0.1 by a monobromobimane alkylation method. Therefore, eukaryotic Ohr peroxidases share several biochemical features with prokaryotic orthologues and are preferentially located in mitochondria. Copyright © 2017. Published by Elsevier B.V.
Williamson, C M; Bramley, A J; Lax, A J
The lysostaphin gene of Staphylococcus simulans was cloned into Escherichia coli. The 5' end of the gene was modified to include a eukaryotic start codon, the Kozak expression start site consensus sequence, and an enzyme site to facilitate manipulation of the gene. Transcription of the modified gene in vitro yielded an RNA transcript which, when added to a rabbit reticulocyte cell-free translation system, directed the synthesis of several products. The largest product, migrating at approximat...
Cheng, Mary Hongying; Coalson, Rob D.
Early crystal structures of prokaryotic CLC proteins identified three Cl– binding sites: internal (Sint), central (Scen), and external (Sext). A conserved external GLU (GLUex) residue acts as a gate competing for Sext. Recently, the first crystal structure of a eukaryotic transporter, CmCLC, revealed that in this transporter GLUex competes instead for Scen. Here, we use molecular dynamics simulations to investigate Cl– transport through CmCLC. The gating and Cl–/H+ transport cycle are inferre...
Lengsfeld, Bettina M.
Propagation of chromatin states through DNA replication is central to epigenetic regulation and can involve recruitment of chromatin proteins to replicating chromatin through interactions with replication fork components. Here we show using a fully reconstituted T7 bacteriophage system that eukaryotic proteins are not required to tether the Polycomb complex PRC1 to templates during DNA replication. Instead, DNA binding by PRC1 can withstand passage of a simple replication fork.
Kuerner, Julia; Medalia, Ohad; Linaroudis, Alexandros A.; Baumeister, Wolfgang
Cryo-electron tomography (cryo-ET) is an emerging imaging technology that combines the potential of three-dimensional (3-D) imaging at molecular resolution (<5 nm) with a close-to-life preservation of the specimen. In conjunction with pattern recognition techniques, it enables us to map the molecular landscape inside cells. The application of cryo-ET to intact cells provides novel insights into the structure and the spatial organization of the cytoskeleton in prokaryotic and eukaryotic cells
Parris, Darren J.; Ganesh, Sangita; Edgcomb, Virginia P.; Stewart, Frank J.; DeLong, Edward
Molecular surveys are revealing diverse eukaryotic assemblages in oxygen-limited ocean waters. These communities may play pivotal ecological roles through autotrophy, feeding, and a wide range of symbiotic associations with prokaryotes. We used 18S rRNA gene sequencing to provide the first snapshot of pelagic microeukaryotic community structure in two cellular size fractions (0.2-1.6 µm, >1.6 µm) from seven depths through the anoxic oxygen minimum zone (OMZ) off northern Chile. Sequencing ...
Feng Hongqing; Wang Ruixue; Sun Peng; Wu Haiyan; Liu Qi; Li Fangting; Fang Jing; Zhang Jue; Zhu Weidong
The mechanisms of eukaryotic cell response to cold plasma are studied. A series of single gene mutants of eukaryotic model organism Saccharomyces cerevisiae are used to compare their sensitivity to plasma treatment with the wild type. We examined 12 mutants in the oxidative stress pathway and the cell cycle pathway, in which 8 are found to be hypersensitive to plasma processing. The mutated genes' roles in the two pathways are analyzed to understand the biological response mechanisms of plasma treatment. The results demonstrate that genes from both pathways are needed for the eukaryotic cells to survive the complex plasma treatment.
O'Toole, Nicholas; Barboza, Perry S.; Ungerfeld, Emilio; Leigh, Mary Beth; Selinger, L. Brent; Butler, Greg; Tsang, Adrian; McAllister, Tim A.; Forster, Robert J.
Background Herbivores rely on digestive tract lignocellulolytic microorganisms, including bacteria, fungi and protozoa, to derive energy and carbon from plant cell wall polysaccharides. Culture independent metagenomic studies have been used to reveal the genetic content of the bacterial species within gut microbiomes. However, the nature of the genes encoded by eukaryotic protozoa and fungi within these environments has not been explored using metagenomic or metatranscriptomic approaches. Methodology/Principal Findings In this study, a metatranscriptomic approach was used to investigate the functional diversity of the eukaryotic microorganisms within the rumen of muskoxen (Ovibos moschatus), with a focus on plant cell wall degrading enzymes. Polyadenylated RNA (mRNA) was sequenced on the Illumina Genome Analyzer II system and 2.8 gigabases of sequences were obtained and 59129 contigs assembled. Plant cell wall degrading enzyme modules including glycoside hydrolases, carbohydrate esterases and polysaccharide lyases were identified from over 2500 contigs. These included a number of glycoside hydrolase family 6 (GH6), GH48 and swollenin modules, which have rarely been described in previous gut metagenomic studies. Conclusions/Significance The muskoxen rumen metatranscriptome demonstrates a much higher percentage of cellulase enzyme discovery and an 8.7x higher rate of total carbohydrate active enzyme discovery per gigabase of sequence than previous rumen metagenomes. This study provides a snapshot of eukaryotic gene expression in the muskoxen rumen, and identifies a number of candidate genes coding for potentially valuable lignocellulolytic enzymes. PMID:21655220
Ray, Ann; Schwartz, Nika; de Souza Santos, Marcela; Zhang, Junmei; Orth, Kim; Salomon, Dor
Most type VI secretion systems (T6SSs) described to date are protein delivery apparatuses that mediate bactericidal activities. Several T6SSs were also reported to mediate virulence activities, although only few anti-eukaryotic effectors have been described. Here, we identify three T6SSs in the marine bacterium Vibrio proteolyticus and show that T6SS1 mediates bactericidal activities under warm marine-like conditions. Using comparative proteomics, we find nine potential T6SS1 effectors, five of which belong to the polymorphic MIX-effector class. Remarkably, in addition to six predicted bactericidal effectors, the T6SS1 secretome includes three putative anti-eukaryotic effectors. One of these is a MIX-effector containing a cytotoxic necrotizing factor 1 domain. We demonstrate that T6SS1 can use this MIX-effector to target phagocytic cells, resulting in morphological changes and actin cytoskeleton rearrangements. In conclusion, the V. proteolyticus T6SS1, a system homologous to one found in pathogenic vibrios, uses a suite of polymorphic effectors that target both bacteria and eukaryotic neighbors. © 2017 The Authors. Published under the terms of the CC BY 4.0 license.
Mouilleron, Hélène; Delcourt, Vivian; Roucou, Xavier
mRNAs carry the genetic information that is translated by ribosomes. The traditional view of a mature eukaryotic mRNA is a molecule with three main regions, the 5' UTR, the protein coding open reading frame (ORF) or coding sequence (CDS), and the 3' UTR. This concept assumes that ribosomes translate one ORF only, generally the longest one, and produce one protein. As a result, in the early days of genomics and bioinformatics, one CDS was associated with each protein-coding gene. This fundamental concept of a single CDS is being challenged by increasing experimental evidence indicating that annotated proteins are not the only proteins translated from mRNAs. In particular, mass spectrometry (MS)-based proteomics and ribosome profiling have detected productive translation of alternative open reading frames. In several cases, the alternative and annotated proteins interact. Thus, the expression of two or more proteins translated from the same mRNA may offer a mechanism to ensure the co-expression of proteins which have functional interactions. Translational mechanisms already described in eukaryotic cells indicate that the cellular machinery is able to translate different CDSs from a single viral or cellular mRNA. In addition to summarizing data showing that the protein coding potential of eukaryotic mRNAs has been underestimated, this review aims to challenge the single translated CDS dogma. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Ghosh, Amrita; Shrivastav, Anupama; Jose, D Amilan; Mishra, Sanjiv K; Chandrakanth, C K; Mishra, Sandhya; Das, Amitava
The chromogenic complex 1 x Zn (where 1 is (E)-4-(4-dimethylamino-phenylazo)-N,N-bispyridin-2-ylmethyl-benzenesulfonamide) showed high affinity toward the phosphate ion in tetrabutylammonium phosphate in acetonitrile solution and could preferentially bind to adenosine triphosphate (ATP) in aqueous solution at physiological pH. This binding caused a visual change in color, whereas no such change was noticed with other related anions (adenosine monophosphate, adenosine diphosphate, pyrophosphate, and phosphate) of biological significance. Thus, 1 x Zn could be used as a staining agent for different biological cells through binding to the ATP, generated in situ by the mitochondria (in eukaryotes). For prokaryotes (bacteria) the cell membrane takes care of the cells' energy conversion, since they lack mitochondria. ATP is produced in their unique cell structure on the cell membrane, which is not found in any eukaryotes. These stained cells could be viewed with normal light microscopy. This reagent could even be used for distinguishing the gram-positive and the gram-negative bacteria (prokaryotes). This dye was found to be nonlipophilic in nature and nontoxic to living microbes (eukaryotes and prokaryotes). Further, stained cells were found to grow in their respective media, and this confirmed the maintenance of viability of the microbes even after staining, unlike with many other dyes available commercially.
Leray, Matthieu; Knowlton, Nancy
PCR amplification followed by deep sequencing of homologous gene regions is increasingly used to characterize the diversity and taxonomic composition of marine eukaryotic communities. This approach may generate millions of sequences for hundreds of samples simultaneously. Therefore, tools that researchers can use to visualize complex patterns of diversity for these massive datasets are essential. Efforts by microbiologists to understand the Earth and human microbiomes using high-throughput sequencing of the 16S rRNA gene has led to the development of several user-friendly, open-source software packages that can be similarly used to analyze eukaryotic datasets. Quantitative Insights Into Microbial Ecology (QIIME) offers some of the most helpful data visualization tools. Here, we describe functionalities to import OTU tables generated with any molecular marker (e.g., 18S, COI, ITS) and associated metadata into QIIME. We then present a range of analytical tools implemented within QIIME that can be used to obtain insights about patterns of alpha and beta diversity for marine eukaryotes.
Full Text Available BACKGROUND: Herbivores rely on digestive tract lignocellulolytic microorganisms, including bacteria, fungi and protozoa, to derive energy and carbon from plant cell wall polysaccharides. Culture independent metagenomic studies have been used to reveal the genetic content of the bacterial species within gut microbiomes. However, the nature of the genes encoded by eukaryotic protozoa and fungi within these environments has not been explored using metagenomic or metatranscriptomic approaches. METHODOLOGY/PRINCIPAL FINDINGS: In this study, a metatranscriptomic approach was used to investigate the functional diversity of the eukaryotic microorganisms within the rumen of muskoxen (Ovibos moschatus, with a focus on plant cell wall degrading enzymes. Polyadenylated RNA (mRNA was sequenced on the Illumina Genome Analyzer II system and 2.8 gigabases of sequences were obtained and 59129 contigs assembled. Plant cell wall degrading enzyme modules including glycoside hydrolases, carbohydrate esterases and polysaccharide lyases were identified from over 2500 contigs. These included a number of glycoside hydrolase family 6 (GH6, GH48 and swollenin modules, which have rarely been described in previous gut metagenomic studies. CONCLUSIONS/SIGNIFICANCE: The muskoxen rumen metatranscriptome demonstrates a much higher percentage of cellulase enzyme discovery and an 8.7x higher rate of total carbohydrate active enzyme discovery per gigabase of sequence than previous rumen metagenomes. This study provides a snapshot of eukaryotic gene expression in the muskoxen rumen, and identifies a number of candidate genes coding for potentially valuable lignocellulolytic enzymes.
Nillegoda, Nadinath B; Stank, Antonia; Malinverni, Duccio; Alberts, Niels; Szlachcic, Anna; Barducci, Alessandro; De Los Rios, Paolo; Wade, Rebecca C; Bukau, Bernd
Hsp70 participates in a broad spectrum of protein folding processes extending from nascent chain folding to protein disaggregation. This versatility in function is achieved through a diverse family of J-protein cochaperones that select substrates for Hsp70. Substrate select