WorldWideScience

Sample records for proteins phylogenetic analysis

  1. Phylogenetic Analysis Using Protein Mass Spectrometry.

    Science.gov (United States)

    Ma, Shiyong; Downard, Kevin M; Wong, Jason W H

    2017-01-01

    Through advances in molecular biology, comparative analysis of DNA sequences is currently the cornerstone in the study of molecular evolution and phylogenetics. Nevertheless, protein mass spectrometry offers some unique opportunities to enable phylogenetic analyses in organisms where DNA may be difficult or costly to obtain. To date, the methods of phylogenetic analysis using protein mass spectrometry can be classified into three categories: (1) de novo protein sequencing followed by classical phylogenetic reconstruction, (2) direct phylogenetic reconstruction using proteolytic peptide mass maps, and (3) mapping of mass spectral data onto classical phylogenetic trees. In this chapter, we provide a brief description of the three methods and the protocol for each method along with relevant tools and algorithms.

  2. Molecular cloning, phylogenetic analysis and heat shock response of Babesia gibsoni heat shock protein 90.

    Science.gov (United States)

    Yamasaki, Masahiro; Tsuboi, Yoshihiro; Taniyama, Yusuke; Uchida, Naohiro; Sato, Reeko; Nakamura, Kensuke; Ohta, Hiroshi; Takiguchi, Mitsuyoshi

    2016-09-01

    The Babesia gibsoni heat shock protein 90 (BgHSP90) gene was cloned and sequenced. The length of the gene was 2,610 bp with two introns. This gene was amplified from cDNA corresponding to full length coding sequence (CDS) with an open reading frame of 2,148 bp. A phylogenetic analysis of the CDS of HSP90 gene showed that B. gibsoni was most closely related to B. bovis and Babesia sp. BQ1/Lintan and lies within a phylogenetic cluster of protozoa. Moreover, mRNA transcription profile for BgHSP90 exposed to high temperature were examined by quantitative real-time reverse transcription-polymerase chain reaction. BgHSP90 levels were elevated when the parasites were incubated at 43°C for 1 hr.

  3. Taxonomic colouring of phylogenetic trees of protein sequences

    Directory of Open Access Journals (Sweden)

    Andrade-Navarro Miguel A

    2006-02-01

    Full Text Available Abstract Background Phylogenetic analyses of protein families are used to define the evolutionary relationships between homologous proteins. The interpretation of protein-sequence phylogenetic trees requires the examination of the taxonomic properties of the species associated to those sequences. However, there is no online tool to facilitate this interpretation, for example, by automatically attaching taxonomic information to the nodes of a tree, or by interactively colouring the branches of a tree according to any combination of taxonomic divisions. This is especially problematic if the tree contains on the order of hundreds of sequences, which, given the accelerated increase in the size of the protein sequence databases, is a situation that is becoming common. Results We have developed PhyloView, a web based tool for colouring phylogenetic trees upon arbitrary taxonomic properties of the species represented in a protein sequence phylogenetic tree. Provided that the tree contains SwissProt, SpTrembl, or GenBank protein identifiers, the tool retrieves the taxonomic information from the corresponding database. A colour picker displays a summary of the findings and allows the user to associate colours to the leaves of the tree according to any number of taxonomic partitions. Then, the colours are propagated to the branches of the tree. Conclusion PhyloView can be used at http://www.ogic.ca/projects/phyloview/. A tutorial, the software with documentation, and GPL licensed source code, can be accessed at the same web address.

  4. Phylogenetic analysis of fungal ABC transporters.

    Science.gov (United States)

    Kovalchuk, Andriy; Driessen, Arnold J M

    2010-03-16

    The superfamily of ABC proteins is among the largest known in nature. Its members are mainly, but not exclusively, involved in the transport of a broad range of substrates across biological membranes. Many contribute to multidrug resistance in microbial pathogens and cancer cells. The diversity of ABC proteins in fungi is comparable with those in multicellular animals, but so far fungal ABC proteins have barely been studied. We performed a phylogenetic analysis of the ABC proteins extracted from the genomes of 27 fungal species from 18 orders representing 5 fungal phyla thereby covering the most important groups. Our analysis demonstrated that some of the subfamilies of ABC proteins remained highly conserved in fungi, while others have undergone a remarkable group-specific diversification. Members of the various fungal phyla also differed significantly in the number of ABC proteins found in their genomes, which is especially reduced in the yeast S. cerevisiae and S. pombe. Data obtained during our analysis should contribute to a better understanding of the diversity of the fungal ABC proteins and provide important clues about their possible biological functions.

  5. Phylogenetic analysis and protein structure modelling identifies distinct Ca(2+)/Cation antiporters and conservation of gene family structure within Arabidopsis and rice species.

    Science.gov (United States)

    Pittman, Jon K; Hirschi, Kendal D

    2016-12-01

    The Ca(2+)/Cation Antiporter (CaCA) superfamily is an ancient and widespread family of ion-coupled cation transporters found in nearly all kingdoms of life. In animals, K(+)-dependent and K(+)-indendent Na(+)/Ca(2+) exchangers (NCKX and NCX) are important CaCA members. Recently it was proposed that all rice and Arabidopsis CaCA proteins should be classified as NCX proteins. Here we performed phylogenetic analysis of CaCA genes and protein structure homology modelling to further characterise members of this transporter superfamily. Phylogenetic analysis of rice and Arabidopsis CaCAs in comparison with selected CaCA members from non-plant species demonstrated that these genes form clearly distinct families, with the H(+)/Cation exchanger (CAX) and cation/Ca(2+) exchanger (CCX) families dominant in higher plants but the NCKX and NCX families absent. NCX-related Mg(2+)/H(+) exchanger (MHX) and CAX-related Na(+)/Ca(2+) exchanger-like (NCL) proteins are instead present. Analysis of genomes of ten closely-related rice species and four Arabidopsis-related species found that CaCA gene family structures are highly conserved within related plants, apart from minor variation. Protein structures were modelled for OsCAX1a and OsMHX1. Despite exhibiting broad structural conservation, there are clear structural differences observed between the different CaCA types. Members of the CaCA superfamily form clearly distinct families with different phylogenetic, structural and functional characteristics, and therefore should not be simply classified as NCX proteins, which should remain as a separate gene family.

  6. PhyloSift: phylogenetic analysis of genomes and metagenomes.

    Science.gov (United States)

    Darling, Aaron E; Jospin, Guillaume; Lowe, Eric; Matsen, Frederick A; Bik, Holly M; Eisen, Jonathan A

    2014-01-01

    Like all organisms on the planet, environmental microbes are subject to the forces of molecular evolution. Metagenomic sequencing provides a means to access the DNA sequence of uncultured microbes. By combining DNA sequencing of microbial communities with evolutionary modeling and phylogenetic analysis we might obtain new insights into microbiology and also provide a basis for practical tools such as forensic pathogen detection. In this work we present an approach to leverage phylogenetic analysis of metagenomic sequence data to conduct several types of analysis. First, we present a method to conduct phylogeny-driven Bayesian hypothesis tests for the presence of an organism in a sample. Second, we present a means to compare community structure across a collection of many samples and develop direct associations between the abundance of certain organisms and sample metadata. Third, we apply new tools to analyze the phylogenetic diversity of microbial communities and again demonstrate how this can be associated to sample metadata. These analyses are implemented in an open source software pipeline called PhyloSift. As a pipeline, PhyloSift incorporates several other programs including LAST, HMMER, and pplacer to automate phylogenetic analysis of protein coding and RNA sequences in metagenomic datasets generated by modern sequencing platforms (e.g., Illumina, 454).

  7. PhyloSift: phylogenetic analysis of genomes and metagenomes

    Directory of Open Access Journals (Sweden)

    Aaron E. Darling

    2014-01-01

    Full Text Available Like all organisms on the planet, environmental microbes are subject to the forces of molecular evolution. Metagenomic sequencing provides a means to access the DNA sequence of uncultured microbes. By combining DNA sequencing of microbial communities with evolutionary modeling and phylogenetic analysis we might obtain new insights into microbiology and also provide a basis for practical tools such as forensic pathogen detection.In this work we present an approach to leverage phylogenetic analysis of metagenomic sequence data to conduct several types of analysis. First, we present a method to conduct phylogeny-driven Bayesian hypothesis tests for the presence of an organism in a sample. Second, we present a means to compare community structure across a collection of many samples and develop direct associations between the abundance of certain organisms and sample metadata. Third, we apply new tools to analyze the phylogenetic diversity of microbial communities and again demonstrate how this can be associated to sample metadata.These analyses are implemented in an open source software pipeline called PhyloSift. As a pipeline, PhyloSift incorporates several other programs including LAST, HMMER, and pplacer to automate phylogenetic analysis of protein coding and RNA sequences in metagenomic datasets generated by modern sequencing platforms (e.g., Illumina, 454.

  8. Phylogenetic analysis of fungal heterotrimeric G protein-encoding genes and their expression during dimorphism in Mucor circinelloides.

    Science.gov (United States)

    Valle-Maldonado, Marco Iván; Jácome-Galarza, Irvin Eduardo; Díaz-Pérez, Alma Laura; Martínez-Cadena, Guadalupe; Campos-García, Jesús; Ramírez-Díaz, Martha Isela; Reyes-De la Cruz, Homero; Riveros-Rosas, Héctor; Díaz-Pérez, César; Meza-Carmen, Víctor

    2015-12-01

    In fungi, heterotrimeric G proteins are key regulators of biological processes such as mating, virulence, morphology, among others. Mucor circinelloides is a model organism for many biological processes, and its genome contains the largest known repertoire of genes that encode putative heterotrimeric G protein subunits in the fungal kingdom: twelve Gα (McGpa1-12), three Gβ (McGpb1-3), and three Gγ (McGpg1-3). Phylogenetic analysis of fungal Gα showed that they are divided into four distinct groups as reported previously. Fungal Gβ and Gγ are also divided into four phylogenetic groups, and to our understanding this is the first report of a phylogenetic classification for fungal Gβ and Gγ subunits. Almost all genes that encode putative heterotrimeric G subunits in M. circinelloides are differentially expressed during dimorphic growth, except for McGpg1 (Gγ) that showed very low mRNA levels at all developmental stages. Moreover, several of the subunits are expressed in a similar pattern and at the same level, suggesting that they constitute discrete complexes. For example, McGpb3 (Gβ), and McGpg2 (Gγ), are co-expressed during mycelium growth, and McGpa1, McGpb2, and McGpg2, are co-expressed during yeast development. These findings provide the conceptual framework to study the biological role of these genes during M. circinelloides morphogenesis. Copyright © 2015 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.

  9. Correlated mutations in protein sequences: Phylogenetic and structural effects

    Energy Technology Data Exchange (ETDEWEB)

    Lapedes, A.S. [Los Alamos National Lab., NM (United States). Theoretical Div.]|[Santa Fe Inst., NM (United States); Giraud, B.G. [C.E.N. Saclay, Gif/Yvette (France). Service Physique Theorique; Liu, L.C. [Los Alamos National Lab., NM (United States). Theoretical Div.; Stormo, G.D. [Univ. of Colorado, Boulder, CO (United States). Dept. of Molecular, Cellular and Developmental Biology

    1998-12-01

    Covariation analysis of sets of aligned sequences for RNA molecules is relatively successful in elucidating RNA secondary structure, as well as some aspects of tertiary structure. Covariation analysis of sets of aligned sequences for protein molecules is successful in certain instances in elucidating certain structural and functional links, but in general, pairs of sites displaying highly covarying mutations in protein sequences do not necessarily correspond to sites that are spatially close in the protein structure. In this paper the authors identify two reasons why naive use of covariation analysis for protein sequences fails to reliably indicate sequence positions that are spatially proximate. The first reason involves the bias introduced in calculation of covariation measures due to the fact that biological sequences are generally related by a non-trivial phylogenetic tree. The authors present a null-model approach to solve this problem. The second reason involves linked chains of covariation which can result in pairs of sites displaying significant covariation even though they are not spatially proximate. They present a maximum entropy solution to this classic problem of causation versus correlation. The methodologies are validated in simulation.

  10. Contrasting HIV phylogenetic relationships and V3 loop protein similarities

    Energy Technology Data Exchange (ETDEWEB)

    Korber, B. (Los Alamos National Lab., NM (United States) Santa Fe Inst., NM (United States)); Myers, G. (Los Alamos National Lab., NM (United States))

    1992-01-01

    At least five distinct sequence subtypes of HIV-I can be identified from the major centers of the AMS pandemic. While it is too early to tell whether these subtypes are serologically or phenotypically similar or distinct in terms of properties such as pathogenicity and transmissibility, we can begin to investigate their potential for phenotypic divergence at the protein sequence level. Phylogenetic analysis of HIV DNA sequences is being widely used to examine lineages of different viral strains as they evolve and spread throughout the globe. We have identified five distinct HIV-1 subtypes (designated A-E), or clades, based on phylogenetic clustering patterns generated from genetic information from both the gag and envelope (env) genes from a spectrum of international isolates. Our initial observations concerning both HIV-1 and HIV-2 sequences indicate that conserved patterns in protein chemistry may indeed exist across distant lineages. Such patterns in V3 loop amino acid chemistry may be indicative of stable lineages or convergence within this highly variable, though functionally and immunologically critical, region. We think that there may be parallels between the apparently stable HIV-2 V3 lineage and the previously mentioned HIV-1 V3 loops which are very similar at the protein level despite being distant by cladistic analysis, and which do not possess the distinctive positively charged residues. Highly conserved V3 loop protein sequences are also encountered in SIVAGMs and CIVs (chimpanzee viral strains), which do not appear to be pathogenic in their wild-caught natural hosts.

  11. Contrasting HIV phylogenetic relationships and V3 loop protein similarities

    Energy Technology Data Exchange (ETDEWEB)

    Korber, B. [Los Alamos National Lab., NM (United States)]|[Santa Fe Inst., NM (United States); Myers, G. [Los Alamos National Lab., NM (United States)

    1992-12-31

    At least five distinct sequence subtypes of HIV-I can be identified from the major centers of the AMS pandemic. While it is too early to tell whether these subtypes are serologically or phenotypically similar or distinct in terms of properties such as pathogenicity and transmissibility, we can begin to investigate their potential for phenotypic divergence at the protein sequence level. Phylogenetic analysis of HIV DNA sequences is being widely used to examine lineages of different viral strains as they evolve and spread throughout the globe. We have identified five distinct HIV-1 subtypes (designated A-E), or clades, based on phylogenetic clustering patterns generated from genetic information from both the gag and envelope (env) genes from a spectrum of international isolates. Our initial observations concerning both HIV-1 and HIV-2 sequences indicate that conserved patterns in protein chemistry may indeed exist across distant lineages. Such patterns in V3 loop amino acid chemistry may be indicative of stable lineages or convergence within this highly variable, though functionally and immunologically critical, region. We think that there may be parallels between the apparently stable HIV-2 V3 lineage and the previously mentioned HIV-1 V3 loops which are very similar at the protein level despite being distant by cladistic analysis, and which do not possess the distinctive positively charged residues. Highly conserved V3 loop protein sequences are also encountered in SIVAGMs and CIVs (chimpanzee viral strains), which do not appear to be pathogenic in their wild-caught natural hosts.

  12. Automatic selection of reference taxa for protein-protein interaction prediction with phylogenetic profiling

    DEFF Research Database (Denmark)

    Simonsen, Martin; Maetschke, S.R.; Ragan, M.A.

    2012-01-01

    Motivation: Phylogenetic profiling methods can achieve good accuracy in predicting protein–protein interactions, especially in prokaryotes. Recent studies have shown that the choice of reference taxa (RT) is critical for accurate prediction, but with more than 2500 fully sequenced taxa publicly......: We present three novel methods for automating the selection of RT, using machine learning based on known protein–protein interaction networks. One of these methods in particular, Tree-Based Search, yields greatly improved prediction accuracies. We further show that different methods for constituting...... phylogenetic profiles often require very different RT sets to support high prediction accuracy....

  13. Detecting Network Communities: An Application to Phylogenetic Analysis

    Science.gov (United States)

    Andrade, Roberto F. S.; Rocha-Neto, Ivan C.; Santos, Leonardo B. L.; de Santana, Charles N.; Diniz, Marcelo V. C.; Lobão, Thierry Petit; Goés-Neto, Aristóteles; Pinho, Suani T. R.; El-Hani, Charbel N.

    2011-01-01

    This paper proposes a new method to identify communities in generally weighted complex networks and apply it to phylogenetic analysis. In this case, weights correspond to the similarity indexes among protein sequences, which can be used for network construction so that the network structure can be analyzed to recover phylogenetically useful information from its properties. The analyses discussed here are mainly based on the modular character of protein similarity networks, explored through the Newman-Girvan algorithm, with the help of the neighborhood matrix . The most relevant networks are found when the network topology changes abruptly revealing distinct modules related to the sets of organisms to which the proteins belong. Sound biological information can be retrieved by the computational routines used in the network approach, without using biological assumptions other than those incorporated by BLAST. Usually, all the main bacterial phyla and, in some cases, also some bacterial classes corresponded totally (100%) or to a great extent (>70%) to the modules. We checked for internal consistency in the obtained results, and we scored close to 84% of matches for community pertinence when comparisons between the results were performed. To illustrate how to use the network-based method, we employed data for enzymes involved in the chitin metabolic pathway that are present in more than 100 organisms from an original data set containing 1,695 organisms, downloaded from GenBank on May 19, 2007. A preliminary comparison between the outcomes of the network-based method and the results of methods based on Bayesian, distance, likelihood, and parsimony criteria suggests that the former is as reliable as these commonly used methods. We conclude that the network-based method can be used as a powerful tool for retrieving modularity information from weighted networks, which is useful for phylogenetic analysis. PMID:21573202

  14. Ixodes ricinus tick lipocalins: identification, cloning, phylogenetic analysis and biochemical characterization.

    Directory of Open Access Journals (Sweden)

    Jérôme Beaufays

    Full Text Available BACKGROUND: During their blood meal, ticks secrete a wide variety of proteins that interfere with their host's defense mechanisms. Among these proteins, lipocalins play a major role in the modulation of the inflammatory response. METHODOLOGY/PRINCIPAL FINDINGS: Screening a cDNA library in association with RT-PCR and RACE methodologies allowed us to identify 14 new lipocalin genes in the salivary glands of the Ixodes ricinus hard tick. A computational in-depth structural analysis confirmed that LIRs belong to the lipocalin family. These proteins were called LIR for "Lipocalin from I. ricinus" and numbered from 1 to 14 (LIR1 to LIR14. According to their percentage identity/similarity, LIR proteins may be assigned to 6 distinct phylogenetic groups. The mature proteins have calculated pM and pI varying from 21.8 kDa to 37.2 kDa and from 4.45 to 9.57 respectively. In a western blot analysis, all recombinant LIRs appeared as a series of thin bands at 50-70 kDa, suggesting extensive glycosylation, which was experimentally confirmed by treatment with N-glycosidase F. In addition, the in vivo expression analysis of LIRs in I. ricinus, examined by RT-PCR, showed homogeneous expression profiles for certain phylogenetic groups and relatively heterogeneous profiles for other groups. Finally, we demonstrated that LIR6 codes for a protein that specifically binds leukotriene B4. CONCLUSIONS/SIGNIFICANCE: This work confirms that, regarding their biochemical properties, expression profile, and sequence signature, lipocalins in Ixodes hard tick genus, and more specifically in the Ixodes ricinus species, are segregated into distinct phylogenetic groups suggesting potential distinct function. This was particularly demonstrated by the ability of LIR6 to scavenge leukotriene B4. The other LIRs did not bind any of the ligands tested, such as 5-hydroxytryptamine, ADP, norepinephrine, platelet activating factor, prostaglandins D2 and E2, and finally leukotrienes B4 and C

  15. Phylogenetic reconstruction of South American felids defined by protein electrophoresis.

    Science.gov (United States)

    Slattery, J P; Johnson, W E; Goldman, D; O'Brien, S J

    1994-09-01

    Phylogenetic associations among six closely related South American felid species were defined by changes in protein-encoding gene loci. We analyzed proteins isolated from skin fibroblasts using two-dimensional electrophoresis and allozymes extracted from blood cells. Genotypes were determined for multiple individuals of ocelot, margay, tigrina, Geoffroy's cat, kodkod, and pampas cat at 548 loci resolved by two-dimensional electrophoresis and 44 allozyme loci. Phenograms were constructed using the methods of Fitch-Margoliash and neighbor-joining on a matrix of Nei's unbiased genetic distances for all pairs of species. Results of a relative-rate test indicate changes in two-dimensional electrophoresis data are constant among all South American felids with respect to a hyena outgroup. Allelic frequencies were transformed to discrete character states for maximum parsimony analysis. Phylogenetic reconstruction indicates a major split occurred approximately 5-6 million years ago, leading to three groups within the ocelot lineage. The earliest divergence led to Leopardus tigrina, followed by a split between an ancestor of an unresolved trichotomy of three species (Oncifelis guigna, O. geoffroyi, and Lynchailuris colocolo) and a recent common ancestor of Leopardus pardalis and L. wiedii. The results suggest that modern South American felids are monophyletic and evolved rapidly after the formation of the Panama land bridge between North and South America.

  16. The fusion protein signal-peptide-coding region of canine distemper virus: a useful tool for phylogenetic reconstruction and lineage identification.

    Directory of Open Access Journals (Sweden)

    Nicolás Sarute

    Full Text Available Canine distemper virus (CDV; Paramyxoviridae, Morbillivirus is the etiologic agent of a multisystemic infectious disease affecting all terrestrial carnivore families with high incidence and mortality in domestic dogs. Sequence analysis of the hemagglutinin (H gene has been widely employed to characterize field strains, permitting the identification of nine CDV lineages worldwide. Recently, it has been established that the sequences of the fusion protein signal-peptide (Fsp coding region are extremely variable, suggesting that analysis of its sequence might be useful for strain characterization studies. However, the divergence of Fsp sequences among worldwide strains and its phylogenetic resolution has not yet been evaluated. We constructed datasets containing the Fsp-coding region and H gene sequences of the same strains belonging to eight CDV lineages. Both datasets were used to evaluate their phylogenetic resolution. The phylogenetic analysis revealed that both datasets clustered the same strains into eight different branches, corresponding to CDV lineages. The inter-lineage amino acid divergence was fourfold greater for the Fsp peptide than for the H protein. The likelihood mapping revealed that both datasets display strong phylogenetic signals in the region of well-resolved topologies. These features indicate that Fsp-coding region sequence analysis is suitable for evolutionary studies as it allows for straightforward identification of CDV lineages.

  17. The fusion protein signal-peptide-coding region of canine distemper virus: a useful tool for phylogenetic reconstruction and lineage identification.

    Science.gov (United States)

    Sarute, Nicolás; Calderón, Marina Gallo; Pérez, Ruben; La Torre, José; Hernández, Martín; Francia, Lourdes; Panzera, Yanina

    2013-01-01

    Canine distemper virus (CDV; Paramyxoviridae, Morbillivirus) is the etiologic agent of a multisystemic infectious disease affecting all terrestrial carnivore families with high incidence and mortality in domestic dogs. Sequence analysis of the hemagglutinin (H) gene has been widely employed to characterize field strains, permitting the identification of nine CDV lineages worldwide. Recently, it has been established that the sequences of the fusion protein signal-peptide (Fsp) coding region are extremely variable, suggesting that analysis of its sequence might be useful for strain characterization studies. However, the divergence of Fsp sequences among worldwide strains and its phylogenetic resolution has not yet been evaluated. We constructed datasets containing the Fsp-coding region and H gene sequences of the same strains belonging to eight CDV lineages. Both datasets were used to evaluate their phylogenetic resolution. The phylogenetic analysis revealed that both datasets clustered the same strains into eight different branches, corresponding to CDV lineages. The inter-lineage amino acid divergence was fourfold greater for the Fsp peptide than for the H protein. The likelihood mapping revealed that both datasets display strong phylogenetic signals in the region of well-resolved topologies. These features indicate that Fsp-coding region sequence analysis is suitable for evolutionary studies as it allows for straightforward identification of CDV lineages.

  18. Phylogenetic continuum indicates "galaxies" in the protein universe: preliminary results on the natural group structures of proteins.

    Science.gov (United States)

    Ladunga, I

    1992-04-01

    The markedly nonuniform, even systematic distribution of sequences in the protein "universe" has been analyzed by methods of protein taxonomy. Mapping of the natural hierarchical system of proteins has revealed some dense cores, i.e., well-defined clusterings of proteins that seem to be natural structural groupings, possibly seeds for a future protein taxonomy. The aim was not to force proteins into more or less man-made categories by discriminant analysis, but to find structurally similar groups, possibly of common evolutionary origin. Single-valued distance measures between pairs of superfamilies from the Protein Identification Resource were defined by two chi 2-like methods on tripeptide frequencies and the variable-length subsequence identity method derived from dot-matrix comparisons. Distance matrices were processed by several methods of cluster analysis to detect phylogenetic continuum between highly divergent proteins. Only well-defined clusters characterized by relatively unique structural, intracellular environmental, organismal, and functional attribute states were selected as major protein groups, including subsets of viral and Escherichia coli proteins, hormones, inhibitors, plant, ribosomal, serum and structural proteins, amino acid synthases, and clusters dominated by certain oxidoreductases and apolar and DNA-associated enzymes. The limited repertoire of functional patterns due to small genome size, the high rate of recombination, specific features of the bacterial membranes, or of the virus cycle canalize certain proteins of viruses and Gram-negative bacteria, respectively, to organismal groups.

  19. Computational identification and phylogenetic analysis of the oil-body structural proteins, oleosin and caleosin, in castor bean and flax.

    Science.gov (United States)

    Hyun, Tae Kyung; Kumar, Dhinesh; Cho, Young-Yeol; Hyun, Hae-Nam; Kim, Ju-Sung

    2013-02-25

    Oil bodies (OBs) are the intracellular particles derived from oilseeds. These OBs store lipids as a carbon resource, and have been exploited for a variety of industrial applications including biofuels. Oleosin and caleosin are the common OB structural proteins which are enabling biotechnological enhancement of oil content and OB-based pharmaceutical formations via stabilizing OBs. Although the draft whole genome sequence information for Ricinus communis L. (castor bean) and Linum usitatissimum L. (flax), important oil seed plants, is available in public database, OB-structural proteins in these plants are poorly indentified. Therefore, in this study, we performed a comprehensive bioinformatic analysis including analysis of the genome sequence, conserved domains and phylogenetic relationships to identify OB structural proteins in castor bean and flax genomes. Using comprehensive analysis, we have identified 6 and 15 OB-structural proteins from castor bean and flax, respectively. A complete overview of this gene family in castor bean and flax is presented, including the gene structures, phylogeny and conserved motifs, resulting in the presence of central hydrophobic regions with proline knot motif, providing an evolutionary proof that this central hydrophobic region had evolved from duplications in the primitive eukaryotes. In addition, expression analysis of L-oleosin and caleosin genes using quantitative real-time PCR demonstrated that seed contained their maximum expression, except that RcCLO-1 expressed maximum in cotyledon. Thus, our comparative genomics analysis of oleosin and caleosin genes and their putatively encoded proteins in two non-model plant species provides insights into the prospective usage of gene resources for improving OB-stability. Copyright © 2012 Elsevier B.V. All rights reserved.

  20. Analysis of Domain Architecture and Phylogenetics of Family 2 Glycoside Hydrolases (GH2.

    Directory of Open Access Journals (Sweden)

    David Talens-Perales

    Full Text Available In this work we report a detailed analysis of the topology and phylogenetics of family 2 glycoside hydrolases (GH2. We distinguish five topologies or domain architectures based on the presence and distribution of protein domains defined in Pfam and Interpro databases. All of them share a central TIM barrel (catalytic module with two β-sandwich domains (non-catalytic at the N-terminal end, but differ in the occurrence and nature of additional non-catalytic modules at the C-terminal region. Phylogenetic analysis was based on the sequence of the Pfam Glyco_hydro_2_C catalytic module present in most GH2 proteins. Our results led us to propose a model in which evolutionary diversity of GH2 enzymes is driven by the addition of different non-catalytic domains at the C-terminal region. This model accounts for the divergence of β-galactosidases from β-glucuronidases, the diversification of β-galactosidases with different transglycosylation specificities, and the emergence of bicistronic β-galactosidases. This study also allows the identification of groups of functionally uncharacterized protein sequences with potential biotechnological interest.

  1. phangorn: phylogenetic analysis in R.

    Science.gov (United States)

    Schliep, Klaus Peter

    2011-02-15

    phangorn is a package for phylogenetic reconstruction and analysis in the R language. Previously it was only possible to estimate phylogenetic trees with distance methods in R. phangorn, now offers the possibility of reconstructing phylogenies with distance based methods, maximum parsimony or maximum likelihood (ML) and performing Hadamard conjugation. Extending the general ML framework, this package provides the possibility of estimating mixture and partition models. Furthermore, phangorn offers several functions for comparing trees, phylogenetic models or splits, simulating character data and performing congruence analyses. phangorn can be obtained through the CRAN homepage http://cran.r-project.org/web/packages/phangorn/index.html. phangorn is licensed under GPL 2.

  2. The small heat shock proteins from Acidithiobacillus ferrooxidans: gene expression, phylogenetic analysis, and structural modeling

    Directory of Open Access Journals (Sweden)

    Ribeiro Daniela A

    2011-12-01

    Full Text Available Abstract Background Acidithiobacillus ferrooxidans is an acidophilic, chemolithoautotrophic bacterium that has been successfully used in metal bioleaching. In this study, an analysis of the A. ferrooxidans ATCC 23270 genome revealed the presence of three sHSP genes, Afe_1009, Afe_1437 and Afe_2172, that encode proteins from the HSP20 family, a class of intracellular multimers that is especially important in extremophile microorganisms. Results The expression of the sHSP genes was investigated in A. ferrooxidans cells submitted to a heat shock at 40°C for 15, 30 and 60 minutes. After 60 minutes, the gene on locus Afe_1437 was about 20-fold more highly expressed than the gene on locus Afe_2172. Bioinformatic and phylogenetic analyses showed that the sHSPs from A. ferrooxidans are possible non-paralogous proteins, and are regulated by the σ32 factor, a common transcription factor of heat shock proteins. Structural studies using homology molecular modeling indicated that the proteins encoded by Afe_1009 and Afe_1437 have a conserved α-crystallin domain and share similar structural features with the sHSP from Methanococcus jannaschii, suggesting that their biological assembly involves 24 molecules and resembles a hollow spherical shell. Conclusion We conclude that the sHSPs encoded by the Afe_1437 and Afe_1009 genes are more likely to act as molecular chaperones in the A. ferrooxidans heat shock response. In addition, the three sHSPs from A. ferrooxidans are not recent paralogs, and the Afe_1437 and Afe_1009 genes could be inherited horizontally by A. ferrooxidans.

  3. Open Reading Frame Phylogenetic Analysis on the Cloud

    Directory of Open Access Journals (Sweden)

    Che-Lun Hung

    2013-01-01

    Full Text Available Phylogenetic analysis has become essential in researching the evolutionary relationships between viruses. These relationships are depicted on phylogenetic trees, in which viruses are grouped based on sequence similarity. Viral evolutionary relationships are identified from open reading frames rather than from complete sequences. Recently, cloud computing has become popular for developing internet-based bioinformatics tools. Biocloud is an efficient, scalable, and robust bioinformatics computing service. In this paper, we propose a cloud-based open reading frame phylogenetic analysis service. The proposed service integrates the Hadoop framework, virtualization technology, and phylogenetic analysis methods to provide a high-availability, large-scale bioservice. In a case study, we analyze the phylogenetic relationships among Norovirus. Evolutionary relationships are elucidated by aligning different open reading frame sequences. The proposed platform correctly identifies the evolutionary relationships between members of Norovirus.

  4. Sequence comparison and phylogenetic analysis of core gene of ...

    African Journals Online (AJOL)

    Phylogenetic analysis suggests that our sequences are clustered with sequences reported from Japan. This is the first phylogenetic analysis of HCV core gene from Pakistani population. Our sequences and sequences from Japan are grouped into same cluster in the phylogenetic tree. Sequence comparison and ...

  5. Phylogenetic reconstruction of South American felids defined by protein electrophoresis

    OpenAIRE

    Pecon Slattery, J.; Johnson, W. E.; Goldman, D.; O'Brien, S. J.

    1994-01-01

    Phylogenetic associations among six closely related South American felid species were defined by changes in protein-encoding gene loci. We analyzed proteins isolated from skin fibroblasts using two-dimensional electrophoresis and allozymes extracted from blood cells. Genotypes were determined for multiple individuals of ocelot, margay, tigrina, Geoffroy's cat, kodkod, and pampas cat at 548 loci resolved by two-dimensional electrophoresis and 44 allozyme loci. Phenograms were constructed using...

  6. ProteinHistorian: tools for the comparative analysis of eukaryote protein origin.

    Directory of Open Access Journals (Sweden)

    John A Capra

    Full Text Available The evolutionary history of a protein reflects the functional history of its ancestors. Recent phylogenetic studies identified distinct evolutionary signatures that characterize proteins involved in cancer, Mendelian disease, and different ontogenic stages. Despite the potential to yield insight into the cellular functions and interactions of proteins, such comparative phylogenetic analyses are rarely performed, because they require custom algorithms. We developed ProteinHistorian to make tools for performing analyses of protein origins widely available. Given a list of proteins of interest, ProteinHistorian estimates the phylogenetic age of each protein, quantifies enrichment for proteins of specific ages, and compares variation in protein age with other protein attributes. ProteinHistorian allows flexibility in the definition of protein age by including several algorithms for estimating ages from different databases of evolutionary relationships. We illustrate the use of ProteinHistorian with three example analyses. First, we demonstrate that proteins with high expression in human, compared to chimpanzee and rhesus macaque, are significantly younger than those with human-specific low expression. Next, we show that human proteins with annotated regulatory functions are significantly younger than proteins with catalytic functions. Finally, we compare protein length and age in many eukaryotic species and, as expected from previous studies, find a positive, though often weak, correlation between protein age and length. ProteinHistorian is available through a web server with an intuitive interface and as a set of command line tools; this allows biologists and bioinformaticians alike to integrate these approaches into their analysis pipelines. ProteinHistorian's modular, extensible design facilitates the integration of new datasets and algorithms. The ProteinHistorian web server, source code, and pre-computed ages for 32 eukaryotic genomes are

  7. galaxieEST: addressing EST identity through automated phylogenetic analysis.

    Science.gov (United States)

    Nilsson, R Henrik; Rajashekar, Balaji; Larsson, Karl-Henrik; Ursing, Björn M

    2004-07-05

    Research involving expressed sequence tags (ESTs) is intricately coupled to the existence of large, well-annotated sequence repositories. Comparatively complete and satisfactory annotated public sequence libraries are, however, available only for a limited range of organisms, rendering the absence of sequences and gene structure information a tangible problem for those working with taxa lacking an EST or genome sequencing project. Paralogous genes belonging to the same gene family but distinguished by derived characteristics are particularly prone to misidentification and erroneous annotation; high but incomplete levels of sequence similarity are typically difficult to interpret and have formed the basis of many unsubstantiated assumptions of orthology. In these cases, a phylogenetic study of the query sequence together with the most similar sequences in the database may be of great value to the identification process. In order to facilitate this laborious procedure, a project to employ automated phylogenetic analysis in the identification of ESTs was initiated. galaxieEST is an open source Perl-CGI script package designed to complement traditional similarity-based identification of EST sequences through employment of automated phylogenetic analysis. It uses a series of BLAST runs as a sieve to retrieve nucleotide and protein sequences for inclusion in neighbour joining and parsimony analyses; the output includes the BLAST output, the results of the phylogenetic analyses, and the corresponding multiple alignments. galaxieEST is available as an on-line web service for identification of fungal ESTs and for download / local installation for use with any organism group at http://galaxie.cgb.ki.se/galaxieEST.html. By addressing sequence relatedness in addition to similarity, galaxieEST provides an integrative view on EST origin and identity, which may prove particularly useful in cases where similarity searches return one or more pertinent, but not full, matches and

  8. Comparative phylogenetic analysis of intergenic spacers and small ...

    African Journals Online (AJOL)

    The phylogenetic analysis of test isolates included assessment of variation in sequences and length of IGS and SSU-rRNA genes with reference to 16 different microsporidian sequences. The results proved that IGS sequences have more variation than SSU-rRNA gene sequences. Analysis of phylogenetic trees reveal that ...

  9. Data set for phylogenetic tree and RAMPAGE Ramachandran plot analysis of SODs in Gossypium raimondii and G. arboreum.

    Science.gov (United States)

    Wang, Wei; Xia, Minxuan; Chen, Jie; Deng, Fenni; Yuan, Rui; Zhang, Xiaopei; Shen, Fafu

    2016-12-01

    The data presented in this paper is supporting the research article "Genome-Wide Analysis of Superoxide Dismutase Gene Family in Gossypium raimondii and G. arboreum" [1]. In this data article, we present phylogenetic tree showing dichotomy with two different clusters of SODs inferred by the Bayesian method of MrBayes (version 3.2.4), "Bayesian phylogenetic inference under mixed models" [2], Ramachandran plots of G. raimondii and G. arboreum SODs, the protein sequence used to generate 3D sructure of proteins and the template accession via SWISS-MODEL server, "SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information." [3] and motif sequences of SODs identified by InterProScan (version 4.8) with the Pfam database, "Pfam: the protein families database" [4].

  10. Nucleotide and amino acid sequences of a coat protein of an Ukrainian isolate of Potato virus Y: comparison with homologous sequences of other isolates and phylogenetic analysis

    Directory of Open Access Journals (Sweden)

    Budzanivska I. G.

    2014-03-01

    Full Text Available Aim. Identification of the widespread Ukrainian isolate(s of PVY (Potato virus Y in different potato cultivars and subsequent phylogenetic analysis of detected PVY isolates based on NA and AA sequences of coat protein. Methods. ELISA, RT-PCR, DNA sequencing and phylogenetic analysis. Results. PVY has been identified serologically in potato cultivars of Ukrainian selection. In this work we have optimized a method for total RNA extraction from potato samples and offered a sensitive and specific PCR-based test system of own design for diagnostics of the Ukrainian PVY isolates. Part of the CP gene of the Ukrainian PVY isolate has been sequenced and analyzed phylogenetically. It is demonstrated that the Ukrainian isolate of Potato virus Y (CP gene has a higher percentage of homology with the recombinant isolates (strains of this pathogen (approx. 98.8– 99.8 % of homology for both nucleotide and translated amino acid sequences of the CP gene. The Ukrainian isolate of PVY is positioned in the separate cluster together with the isolates found in Syria, Japan and Iran; these isolates possibly have common origin. The Ukrainian PVY isolate is confirmed to be recombinant. Conclusions. This work underlines the need and provides the means for accurate monitoring of Potato virus Y in the agroecosystems of Ukraine. Most importantly, the phylogenetic analysis demonstrated the recombinant nature of this PVY isolate which has been attributed to the strain group O, subclade N:O.

  11. Mitochondrial DNA genomes organization and phylogenetic relationships analysis of eight anemonefishes (pomacentridae: amphiprioninae.

    Directory of Open Access Journals (Sweden)

    Jianlong Li

    Full Text Available Anemonefishes (Pomacentridae Amphiprioninae are a group of 30 valid coral reef fish species with their phylogenetic relationships still under debate. The eight available mitogenomes of anemonefishes were used to reconstruct the molecular phylogenetic tree; six were obtained from this study (Amphiprion clarkii, A. frenatus, A. percula, A. perideraion, A. polymnus and Premnas biaculeatus and two from GenBank (A. bicinctus and A. ocellaris. The seven Amphiprion species represent all four subgenera and P. biaculeatus is the only species from Premnas. The eight mitogenomes of anemonefishes encoded 13 protein-coding genes, two rRNA genes, 22 tRNA genes and two main non-coding regions, with the gene arrangement and translation direction basically identical to other typical vertebrate mitogenomes. Among the 13 protein-coding genes, A. ocellaris (AP006017 and A. percula (KJ174497 had the same length in ND5 with 1,866 bp, which were three nucleotides less than the other six anemonefishes. Both structures of ND5, however, could translate to amino acid successfully. Only four mitogenomes had the tandem repeats in D-loop; the tandem repeats were located in downstream after Conserved Sequence Block rather than the upstream and repeated in a simply way. The phylogenetic utility was tested with Bayesian and Maximum Likelihood methods using all 13 protein-coding genes. The results strongly supported that the subfamily Amphiprioninae was monophyletic and P. biaculeatus should be assigned to the genus Amphiprion. Premnas biaculeatus with the percula complex were revealed to be the ancient anemonefish species. The tree forms of ND1, COIII, ND4, Cytb, Cytb+12S rRNA, Cytb+COI and Cytb+COI+12S rRNA were similar to that 13 protein-coding genes, therefore, we suggested that the suitable single mitochondrial gene for phylogenetic analysis of anemonefishes maybe Cytb. Additional mitogenomes of anemonefishes with a combination of nuclear markers will be useful to

  12. Comparative genomic analysis reveals a novel mitochondrial isoform of human rTS protein and unusual phylogenetic distribution of the rTS gene

    Directory of Open Access Journals (Sweden)

    McGuire John J

    2005-09-01

    Full Text Available Abstract Background The rTS gene (ENOSF1, first identified in Homo sapiens as a gene complementary to the thymidylate synthase (TYMS mRNA, is known to encode two protein isoforms, rTSα and rTSβ. The rTSβ isoform appears to be an enzyme responsible for the synthesis of signaling molecules involved in the down-regulation of thymidylate synthase, but the exact cellular functions of rTS genes are largely unknown. Results Through comparative genomic sequence analysis, we predicted the existence of a novel protein isoform, rTS, which has a 27 residue longer N-terminus by virtue of utilizing an alternative start codon located upstream of the start codon in rTSβ. We observed that a similar extended N-terminus could be predicted in all rTS genes for which genomic sequences are available and the extended regions are conserved from bacteria to human. Therefore, we reasoned that the protein with the extended N-terminus might represent an ancestral form of the rTS protein. Sequence analysis strongly predicts a mitochondrial signal sequence in the extended N-terminal of human rTSγ, which is absent in rTSβ. We confirmed the existence of rTS in human mitochondria experimentally by demonstrating the presence of both rTSγ and rTSβ proteins in mitochondria isolated by subcellular fractionation. In addition, our comprehensive analysis of rTS orthologous sequences reveals an unusual phylogenetic distribution of this gene, which suggests the occurrence of one or more horizontal gene transfer events. Conclusion The presence of two rTS isoforms in mitochondria suggests that the rTS signaling pathway may be active within mitochondria. Our report also presents an example of identifying novel protein isoforms and for improving gene annotation through comparative genomic analysis.

  13. Comparative genomic analysis reveals a novel mitochondrial isoform of human rTS protein and unusual phylogenetic distribution of the rTS gene

    Science.gov (United States)

    Liang, Ping; Nair, Jayakumar R; Song, Lei; McGuire, John J; Dolnick, Bruce J

    2005-01-01

    Background The rTS gene (ENOSF1), first identified in Homo sapiens as a gene complementary to the thymidylate synthase (TYMS) mRNA, is known to encode two protein isoforms, rTSα and rTSβ. The rTSβ isoform appears to be an enzyme responsible for the synthesis of signaling molecules involved in the down-regulation of thymidylate synthase, but the exact cellular functions of rTS genes are largely unknown. Results Through comparative genomic sequence analysis, we predicted the existence of a novel protein isoform, rTS, which has a 27 residue longer N-terminus by virtue of utilizing an alternative start codon located upstream of the start codon in rTSβ. We observed that a similar extended N-terminus could be predicted in all rTS genes for which genomic sequences are available and the extended regions are conserved from bacteria to human. Therefore, we reasoned that the protein with the extended N-terminus might represent an ancestral form of the rTS protein. Sequence analysis strongly predicts a mitochondrial signal sequence in the extended N-terminal of human rTSγ, which is absent in rTSβ. We confirmed the existence of rTS in human mitochondria experimentally by demonstrating the presence of both rTSγ and rTSβ proteins in mitochondria isolated by subcellular fractionation. In addition, our comprehensive analysis of rTS orthologous sequences reveals an unusual phylogenetic distribution of this gene, which suggests the occurrence of one or more horizontal gene transfer events. Conclusion The presence of two rTS isoforms in mitochondria suggests that the rTS signaling pathway may be active within mitochondria. Our report also presents an example of identifying novel protein isoforms and for improving gene annotation through comparative genomic analysis. PMID:16162288

  14. Review and phylogenetic analysis of qac genes that reduce susceptibility to quaternary ammonium compounds in Staphylococcus i>species

    DEFF Research Database (Denmark)

    Wassenaar, Trudy M; Ussery, David; Nielsen, Lene Nørby

    2015-01-01

    described in the literature for qac detection may miss particular qac genes due to lack of DNA conservation. Despite their resemblance in substrate specificity, the Qac proteins belonging to the two protein families have little in common. QacA and QacB are highly conserved in Staphylococcus species, while...... variation, despite their short length, even within the Staphylococcus genus. Phylogenetic analysis of these genes identified similarity to a large number of other SMR members, found in staphylococci as well as in other genera. A number of phylogenetic trees of SMR Qac proteins are presented here, starting...... antibiotics. In Stapylococcus species, six different plasmid-encoded Qac efflux pumps have been described, and they belong to two major protein families. QacA and QacB are members of the Major Facilitator Superfamily, while QacC, QacG, QacH, and QacJ all belong to the Small Multidrug Resistance (SMR) family...

  15. Short segment search method for phylogenetic analysis using nested sliding windows

    Science.gov (United States)

    Iskandar, A. A.; Bustamam, A.; Trimarsanto, H.

    2017-10-01

    To analyze phylogenetics in Bioinformatics, coding DNA sequences (CDS) segment is needed for maximal accuracy. However, analysis by CDS cost a lot of time and money, so a short representative segment by CDS, which is envelope protein segment or non-structural 3 (NS3) segment is necessary. After sliding window is implemented, a better short segment than envelope protein segment and NS3 is found. This paper will discuss a mathematical method to analyze sequences using nested sliding window to find a short segment which is representative for the whole genome. The result shows that our method can find a short segment which more representative about 6.57% in topological view to CDS segment than an Envelope segment or NS3 segment.

  16. Protein analysis in dissolved organic matter: What proteins from organic debris, soil leachate and surface water can tell us - a perspective

    Directory of Open Access Journals (Sweden)

    W. X. Schulze

    2005-01-01

    Full Text Available Mass spectrometry based analysis of proteins is widely used to study cellular processes in model organisms. However, it has not yet routinely been applied in environmental research. Based on observations that protein can readily be detected as a component of dissolved organic matter (DOM, this article gives an example about the possible use of protein analysis in ecology and environmental sciences focusing on different terrestrial ecosystems. At this stage, there are two areas of interest: (1 the identification of phylogenetic groups contributing to the environmental protein pool, and (2 identification of the organismic origin of specific enzymes that are important for ecosystem processes. In this paper, mass spectrometric protein analysis was applied to identify proteins from decomposing plant material and DOM of soil leachates and surface water samples derived from different environments. It is concluded, that mass spectrometric protein analysis is capable of distinguishing phylogenetic origin of proteins from litter protein extracts, leachates of different soil horizons, and from various sources of terrestrial surface water. Current limitation is imposed by the limited knowledge of complete genomes of soil organisms. The protein analysis allows to relate protein presence to biogeochemical processes, and to identify the source organisms for specific active enzymes. Further applications, such as in pollution research are conceivable. In summary, the analysis of proteins opens a new area of research between the fields of microbiology and biogeochemistry.

  17. TREEFINDER: a powerful graphical analysis environment for molecular phylogenetics

    Directory of Open Access Journals (Sweden)

    von Haeseler Arndt

    2004-06-01

    Full Text Available Abstract Background Most analysis programs for inferring molecular phylogenies are difficult to use, in particular for researchers with little programming experience. Results TREEFINDER is an easy-to-use integrative platform-independent analysis environment for molecular phylogenetics. In this paper the main features of TREEFINDER (version of April 2004 are described. TREEFINDER is written in ANSI C and Java and implements powerful statistical approaches for inferring gene tree and related analyzes. In addition, it provides a user-friendly graphical interface and a phylogenetic programming language. Conclusions TREEFINDER is a versatile framework for analyzing phylogenetic data across different platforms that is suited both for exploratory as well as advanced studies.

  18. Molecular Phylogenetic: Organism Taxonomy Method Based on Evolution History

    Directory of Open Access Journals (Sweden)

    N.L.P Indi Dharmayanti

    2011-03-01

    Full Text Available Phylogenetic is described as taxonomy classification of an organism based on its evolution history namely its phylogeny and as a part of systematic science that has objective to determine phylogeny of organism according to its characteristic. Phylogenetic analysis from amino acid and protein usually became important area in sequence analysis. Phylogenetic analysis can be used to follow the rapid change of a species such as virus. The phylogenetic evolution tree is a two dimensional of a species graphic that shows relationship among organisms or particularly among their gene sequences. The sequence separation are referred as taxa (singular taxon that is defined as phylogenetically distinct units on the tree. The tree consists of outer branches or leaves that represents taxa and nodes and branch represent correlation among taxa. When the nucleotide sequence from two different organism are similar, they were inferred to be descended from common ancestor. There were three methods which were used in phylogenetic, namely (1 Maximum parsimony, (2 Distance, and (3 Maximum likehoood. Those methods generally are applied to construct the evolutionary tree or the best tree for determine sequence variation in group. Every method is usually used for different analysis and data.

  19. Revealing pancrustacean relationships: phylogenetic analysis of ribosomal protein genes places Collembola (springtails) in a monophyletic Hexapoda and reinforces the discrepancy between mitochondrial and nuclear DNA markers.

    Science.gov (United States)

    Timmermans, M J T N; Roelofs, D; Mariën, J; van Straalen, N M

    2008-03-12

    In recent years, several new hypotheses on phylogenetic relations among arthropods have been proposed on the basis of DNA sequences. One of the challenged hypotheses is the monophyly of hexapods. This discussion originated from analyses based on mitochondrial DNA datasets that, due to an unusual positioning of Collembola, suggested that the hexapod body plan evolved at least twice. Here, we re-evaluate the position of Collembola using ribosomal protein gene sequences. In total 48 ribosomal proteins were obtained for the collembolan Folsomia candida. These 48 sequences were aligned with sequence data on 35 other ecdysozoans. Each ribosomal protein gene was available for 25% to 86% of the taxa. However, the total sequence information was unequally distributed over the taxa and ranged between 4% and 100%. A concatenated dataset was constructed (5034 inferred amino acids in length), of which ~66% of the positions were filled. Phylogenetic tree reconstructions, using Maximum Likelihood, Maximum Parsimony, and Bayesian methods, resulted in a topology that supports monophyly of Hexapoda. Although ribosomal proteins in general may not evolve independently, they once more appear highly valuable for phylogenetic reconstruction. Our analyses clearly suggest that Hexapoda is monophyletic. This underpins the inconsistency between nuclear and mitochondrial datasets when analyzing pancrustacean relationships. Caution is needed when applying mitochondrial markers in deep phylogeny.

  20. Applying phylogenetic analysis to viral livestock diseases: moving beyond molecular typing.

    Science.gov (United States)

    Olvera, Alex; Busquets, Núria; Cortey, Marti; de Deus, Nilsa; Ganges, Llilianne; Núñez, José Ignacio; Peralta, Bibiana; Toskano, Jennifer; Dolz, Roser

    2010-05-01

    Changes in livestock production systems in recent years have altered the presentation of many diseases resulting in the need for more sophisticated control measures. At the same time, new molecular assays have been developed to support the diagnosis of animal viral disease. Nucleotide sequences generated by these diagnostic techniques can be used in phylogenetic analysis to infer phenotypes by sequence homology and to perform molecular epidemiology studies. In this review, some key elements of phylogenetic analysis are highlighted, such as the selection of the appropriate neutral phylogenetic marker, the proper phylogenetic method and different techniques to test the reliability of the resulting tree. Examples are given of current and future applications of phylogenetic reconstructions in viral livestock diseases. Copyright 2009 Elsevier Ltd. All rights reserved.

  1. [Genome-wide identification, phylogenetic analysis and expression profiling of the WOX family genes in Solanum lycopersicum].

    Science.gov (United States)

    Li, Xiao-xu; Liu, Cheng; Li, Wei; Zhang, Zeng-lin; Gao, Xiao-ming; Zhou, Hui; Guo, Yong-feng

    2016-05-01

    Members of the plant-specific WOX transcription factor family have been reported to play important roles in cell to cell communication as well as other physiological and developmental processes. In this study, ten members of the WOX transcription factor family were identified in Solanum lycopersicum with HMMER. Neighbor-joining phylogenetic tree, maximum-likelihood tree and Bayesian-inference tree were constructed and similar topologies were shown using the protein sequences of the homeodomain. Phylogenetic study revealed that the 25 WOX family members from Arabidopsis and tomato fall into three clades and nine subfamilies. The patterns of exon-intron structures and organization of conserved domains in Arabidopsis and tomato were consistent based on the phylogenetic results. Transcriptome analysis showed that the expression patterns of SlWOXs were different in different tissue types. Gene Ontology (GO) analysis suggested that, as transcription factors, the SlWOX family members could be involved in a number of biological processes including cell to cell communication and tissue development. Our results are useful for future studies on WOX family members in tomato and other plant species.

  2. Molecular phylogenetic and expression analysis of the complete WRKY transcription factor family in maize.

    Science.gov (United States)

    Wei, Kai-Fa; Chen, Juan; Chen, Yan-Feng; Wu, Ling-Juan; Xie, Dao-Xin

    2012-04-01

    The WRKY transcription factors function in plant growth and development, and response to the biotic and abiotic stresses. Although many studies have focused on the functional identification of the WRKY transcription factors, much less is known about molecular phylogenetic and global expression analysis of the complete WRKY family in maize. In this study, we identified 136 WRKY proteins coded by 119 genes in the B73 inbred line from the complete genome and named them in an orderly manner. Then, a comprehensive phylogenetic analysis of five species was performed to explore the origin and evolutionary patterns of these WRKY genes, and the result showed that gene duplication is the major driving force for the origin of new groups and subgroups and functional divergence during evolution. Chromosomal location analysis of maize WRKY genes indicated that 20 gene clusters are distributed unevenly in the genome. Microarray-based expression analysis has revealed that 131 WRKY transcripts encoded by 116 genes may participate in the regulation of maize growth and development. Among them, 102 transcripts are stably expressed with a coefficient of variation (CV) value of WRKY genes with the CV value of >15% are further analysed to discover new organ- or tissue-specific genes. In addition, microarray analyses of transcriptional responses to drought stress and fungal infection showed that maize WRKY proteins are involved in stress responses. All these results contribute to a deep probing into the roles of WRKY transcription factors in maize growth and development and stress tolerance.

  3. Revealing pancrustacean relationships: Phylogenetic analysis of ribosomal protein genes places Collembola (springtails in a monophyletic Hexapoda and reinforces the discrepancy between mitochondrial and nuclear DNA markers

    Directory of Open Access Journals (Sweden)

    Mariën J

    2008-03-01

    Full Text Available Abstract Background In recent years, several new hypotheses on phylogenetic relations among arthropods have been proposed on the basis of DNA sequences. One of the challenged hypotheses is the monophyly of hexapods. This discussion originated from analyses based on mitochondrial DNA datasets that, due to an unusual positioning of Collembola, suggested that the hexapod body plan evolved at least twice. Here, we re-evaluate the position of Collembola using ribosomal protein gene sequences. Results In total 48 ribosomal proteins were obtained for the collembolan Folsomia candida. These 48 sequences were aligned with sequence data on 35 other ecdysozoans. Each ribosomal protein gene was available for 25% to 86% of the taxa. However, the total sequence information was unequally distributed over the taxa and ranged between 4% and 100%. A concatenated dataset was constructed (5034 inferred amino acids in length, of which ~66% of the positions were filled. Phylogenetic tree reconstructions, using Maximum Likelihood, Maximum Parsimony, and Bayesian methods, resulted in a topology that supports monophyly of Hexapoda. Conclusion Although ribosomal proteins in general may not evolve independently, they once more appear highly valuable for phylogenetic reconstruction. Our analyses clearly suggest that Hexapoda is monophyletic. This underpins the inconsistency between nuclear and mitochondrial datasets when analyzing pancrustacean relationships. Caution is needed when applying mitochondrial markers in deep phylogeny.

  4. Penicillium simile sp. nov. revealed by morphological and phylogenetic analysis.

    Science.gov (United States)

    Davolos, Domenico; Pietrangeli, Biancamaria; Persiani, Anna Maria; Maggi, Oriana

    2012-02-01

    The morphology of three phenetically identical Penicillium isolates, collected from the bioaerosol in a restoration laboratory in Italy, displayed macro- and microscopic characteristics that were similar though not completely ascribable to Penicillium raistrickii. For this reason, a phylogenetic approach based on DNA sequencing analysis was performed to establish both the taxonomic status and the evolutionary relationships of these three peculiar isolates in relation to previously described species of the genus Penicillium. We used four nuclear loci (both rRNA and protein coding genes) that have previously proved useful for the molecular investigation of taxa belonging to the genus Penicillium at various evolutionary levels. The internal transcribed spacer region (ITS1-5.8S-ITS2), domains D1 and D2 of the 28S rDNA, a region of the tubulin beta chain gene (benA) and part of the calmodulin gene (cmd) were amplified by PCR and sequenced. Analysis of the rRNA genes and of the benA and cmd sequence data indicates the presence of three isogenic isolates belonging to a genetically distinct species of the genus Penicillium, here described and named Penicillium simile sp. nov. (ATCC MYA-4591(T)  = CBS 129191(T)). This novel species is phylogenetically different from P. raistrickii and other related species of the genus Penicillium (e.g. Penicillium scabrosum), from which it can be distinguished on the basis of morphological trait analysis.

  5. 16S rRNA phylogenetic analysis of actinomycetes isolated from ...

    African Journals Online (AJOL)

    Subsequently, phylogenetic tree was constructed using suitable bioinformatics tools to identify the similarity which showed 97% similarity between strains. Moreover, all the selected strains of actinomycetes were subjected to study the protein and plasmid DNA expression profiles which showed prominent bands with ...

  6. Phylogenetic comparative methods complement discriminant function analysis in ecomorphology.

    Science.gov (United States)

    Barr, W Andrew; Scott, Robert S

    2014-04-01

    In ecomorphology, Discriminant Function Analysis (DFA) has been used as evidence for the presence of functional links between morphometric variables and ecological categories. Here we conduct simulations of characters containing phylogenetic signal to explore the performance of DFA under a variety of conditions. Characters were simulated using a phylogeny of extant antelope species from known habitats. Characters were modeled with no biomechanical relationship to the habitat category; the only sources of variation were body mass, phylogenetic signal, or random "noise." DFA on the discriminability of habitat categories was performed using subsets of the simulated characters, and Phylogenetic Generalized Least Squares (PGLS) was performed for each character. Analyses were repeated with randomized habitat assignments. When simulated characters lacked phylogenetic signal and/or habitat assignments were random, ecomorphology. Copyright © 2013 Wiley Periodicals, Inc.

  7. Phylogenetic Analysis of Phytophthora Species Based on Mitochondrial and Nuclear DNA Sequences

    NARCIS (Netherlands)

    Kroon, L.P.N.M.; Bakker, F.T.; Bosch, van den G.B.M.; Bonants, P.J.M.; Flier, W.G.

    2004-01-01

    A molecular phylogenetic analysis of the genus Phytophthora was performed, 113 isolates from 48 Phytophthora species were included in this analysis. Phylogenetic analyses were performed on regions of mitochondrial (cytochrome c oxidase subunit 1; NADH dehydrogenase subunit 1) and nuclear gene

  8. Phylogenetic analysis of West Nile virus isolated in Italy in 2008.

    Science.gov (United States)

    Savini, G; Monaco, F; Calistri, P; Lelli, R

    2008-11-27

    In Italy the first occurrence of West Nile virus (WNV) infection was reported in Tuscany region during the late summer of 1998. In August 2008, the WNV infection re-emerged in Italy, in areas surrounding the Po river delta, and involving three regions Lombardy, Emilia Romagna and Veneto. WNV was isolated from blood and organs samples of one horse, one donkey, one pigeon (Columba livia) and three magpies (Pica pica). The phylogenetic analysis of the isolates, conducted on 255 bp in the region coding for the E protein, indicates that these isolates belong to the lineage I among the European strains. According to the analysis, both the 1998 and 2008 Italian strains as well as isolates from Romania, Russia, Senegal and Kenya fell in the same sub-cluster.

  9. The complete mitochondrial genome of Pallisentis celatus (Acanthocephala) with phylogenetic analysis of acanthocephalans and rotifers.

    Science.gov (United States)

    Pan, Ting Shuang; Nie, Pin

    2013-07-01

    Acanthocephalans are a small group of obligate endoparasites. They and rotifers are recently placed in a group called Syndermata. However, phylogenetic relationships within classes of acanthocephalans, and between them and rotifers, have not been well resolved, possibly due to the lack of molecular data suitable for such analysis. In this study, the mitochondrial (mt) genome was sequenced from Pallisentis celatus (Van Cleave, 1928), an acanthocephalan in the class Eoacanthocephala, an intestinal parasite of rice-field eel, Monopterus albus (Zuiew, 1793), in China. The complete mt genome sequence of P. celatus is 13 855 bp long, containing 36 genes including 12 protein-coding genes, 22 transfer RNAs (tRNAs) and 2 ribosomal RNAs (rRNAs) as reported for other acanthocephalan species. All genes are encoded on the same strand and in the same direction. Phylogenetic analysis indicated that acanthocephalans are closely related with a clade containing bdelloids, which then correlates with the clade containing monogononts. The class Eoacanthocephala, containing P. celatus and Paratenuisentis ambiguus (Van Cleave, 1921) was closely related to the Palaeacanthocephala. It is thus indicated that acanthocephalans may be just clustered among groups of rotifers. However, the resolving of phylogenetic relationship among all classes of acanthocephalans and between them and rotifers may require further sampling and more molecular data.

  10. BLAST-EXPLORER helps you building datasets for phylogenetic analysis

    Directory of Open Access Journals (Sweden)

    Claverie Jean-Michel

    2010-01-01

    Full Text Available Abstract Background The right sampling of homologous sequences for phylogenetic or molecular evolution analyses is a crucial step, the quality of which can have a significant impact on the final interpretation of the study. There is no single way for constructing datasets suitable for phylogenetic analysis, because this task intimately depends on the scientific question we want to address, Moreover, database mining softwares such as BLAST which are routinely used for searching homologous sequences are not specifically optimized for this task. Results To fill this gap, we designed BLAST-Explorer, an original and friendly web-based application that combines a BLAST search with a suite of tools that allows interactive, phylogenetic-oriented exploration of the BLAST results and flexible selection of homologous sequences among the BLAST hits. Once the selection of the BLAST hits is done using BLAST-Explorer, the corresponding sequence can be imported locally for external analysis or passed to the phylogenetic tree reconstruction pipelines available on the Phylogeny.fr platform. Conclusions BLAST-Explorer provides a simple, intuitive and interactive graphical representation of the BLAST results and allows selection and retrieving of the BLAST hit sequences based a wide range of criterions. Although BLAST-Explorer primarily aims at helping the construction of sequence datasets for further phylogenetic study, it can also be used as a standard BLAST server with enriched output. BLAST-Explorer is available at http://www.phylogeny.fr

  11. Phylogenetic and biogeographic analysis of sphaerexochine trilobites.

    Directory of Open Access Journals (Sweden)

    Curtis R Congreve

    Full Text Available BACKGROUND: Sphaerexochinae is a speciose and widely distributed group of cheirurid trilobites. Their temporal range extends from the earliest Ordovician through the Silurian, and they survived the end Ordovician mass extinction event (the second largest mass extinction in Earth history. Prior to this study, the individual evolutionary relationships within the group had yet to be determined utilizing rigorous phylogenetic methods. Understanding these evolutionary relationships is important for producing a stable classification of the group, and will be useful in elucidating the effects the end Ordovician mass extinction had on the evolutionary and biogeographic history of the group. METHODOLOGY/PRINCIPAL FINDINGS: Cladistic parsimony analysis of cheirurid trilobites assigned to the subfamily Sphaerexochinae was conducted to evaluate phylogenetic patterns and produce a hypothesis of relationship for the group. This study utilized the program TNT, and the analysis included thirty-one taxa and thirty-nine characters. The results of this analysis were then used in a Lieberman-modified Brooks Parsimony Analysis to analyze biogeographic patterns during the Ordovician-Silurian. CONCLUSIONS/SIGNIFICANCE: The genus Sphaerexochus was found to be monophyletic, consisting of two smaller clades (one composed entirely of Ordovician species and another composed of Silurian and Ordovician species. By contrast, the genus Kawina was found to be paraphyletic. It is a basal grade that also contains taxa formerly assigned to Cydonocephalus. Phylogenetic patterns suggest Sphaerexochinae is a relatively distinctive trilobite clade because it appears to have been largely unaffected by the end Ordovician mass extinction. Finally, the biogeographic analysis yields two major conclusions about Sphaerexochus biogeography: Bohemia and Avalonia were close enough during the Silurian to exchange taxa; and during the Ordovician there was dispersal between Eastern Laurentia and

  12. Computational-based structural, functional and phylogenetic analysis of Enterobacter phytases.

    Science.gov (United States)

    Pramanik, Krishnendu; Kundu, Shreyasi; Banerjee, Sandipan; Ghosh, Pallab Kumar; Maiti, Tushar Kanti

    2018-06-01

    Myo-inositol hexakisphosphate phosphohydrolases (i.e., phytases) are known to be a very important enzyme responsible for solubilization of insoluble phosphates. In the present study, Enterobacter phytases have characterized by different phylogenetic, structural and functional parameters using some standard bio-computational tools. Results showed that majority of the Enterobacter phytases are acidic in nature as most of the isoelectric points were under 7.0. The aliphatic indices predicted for the selected proteins were below 40 indicating their thermostable nature. The average molecular weight of the proteins was 48 kDa. The lower values of GRAVY of the said proteins implied that they have better interactions with water. Secondary structure prediction revealed that alpha-helical content was highest among the other forms such as sheets, coils, etc. Moreover, the predicted 3D structure of Enterobacter phytases divulged that the proteins consisted of four monomeric polypeptide chains i.e., it was a tetrameric protein. The predicted tertiary model of E. aerogenes (A0A0M3HCJ2) was deposited in Protein Model Database (Acc. No.: PM0080561) for further utilization after a thorough quality check from QMEAN and SAVES server. Functional analysis supported their classification as histidine acid phosphatases. Besides, multiple sequence alignment revealed that "DG-DP-LG" was the most highly conserved residues within the Enterobacter phytases. Thus, the present study will be useful in selecting suitable phytase-producing microbe exclusively for using in the animal food industry as a food additive.

  13. Sifting through genomes with iterative-sequence clustering produces a large, phylogenetically diverse protein-family resource.

    Science.gov (United States)

    Sharpton, Thomas J; Jospin, Guillaume; Wu, Dongying; Langille, Morgan G I; Pollard, Katherine S; Eisen, Jonathan A

    2012-10-13

    New computational resources are needed to manage the increasing volume of biological data from genome sequencing projects. One fundamental challenge is the ability to maintain a complete and current catalog of protein diversity. We developed a new approach for the identification of protein families that focuses on the rapid discovery of homologous protein sequences. We implemented fully automated and high-throughput procedures to de novo cluster proteins into families based upon global alignment similarity. Our approach employs an iterative clustering strategy in which homologs of known families are sifted out of the search for new families. The resulting reduction in computational complexity enables us to rapidly identify novel protein families found in new genomes and to perform efficient, automated updates that keep pace with genome sequencing. We refer to protein families identified through this approach as "Sifting Families," or SFams. Our analysis of ~10.5 million protein sequences from 2,928 genomes identified 436,360 SFams, many of which are not represented in other protein family databases. We validated the quality of SFam clustering through statistical as well as network topology-based analyses. We describe the rapid identification of SFams and demonstrate how they can be used to annotate genomes and metagenomes. The SFam database catalogs protein-family quality metrics, multiple sequence alignments, hidden Markov models, and phylogenetic trees. Our source code and database are publicly available and will be subject to frequent updates (http://edhar.genomecenter.ucdavis.edu/sifting_families/).

  14. Phylogenetic and recombination analysis of tomato spotted wilt virus.

    Directory of Open Access Journals (Sweden)

    Sen Lian

    Full Text Available Tomato spotted wilt virus (TSWV severely damages and reduces the yield of many economically important plants worldwide. In this study, we determined the whole-genome sequences of 10 TSWV isolates recently identified from various regions and hosts in Korea. Phylogenetic analysis of these 10 isolates as well as the three previously sequenced isolates indicated that the 13 Korean TSWV isolates could be divided into two groups reflecting either two different origins or divergences of Korean TSWV isolates. In addition, the complete nucleotide sequences for the 13 Korean TSWV isolates along with previously sequenced TSWV RNA segments from Korea and other countries were subjected to phylogenetic and recombination analysis. The phylogenetic analysis indicated that both the RNA L and RNA M segments of most Korean isolates might have originated in Western Europe and North America but that the RNA S segments for all Korean isolates might have originated in China and Japan. Recombination analysis identified a total of 12 recombination events among all isolates and segments and five recombination events among the 13 Korea isolates; among the five recombinants from Korea, three contained the whole RNA L segment, suggesting reassortment rather than recombination. Our analyses provide evidence that both recombination and reassortment have contributed to the molecular diversity of TSWV.

  15. New approaches to phylogenetic tree search and their application to large numbers of protein alignments.

    Science.gov (United States)

    Whelan, Simon

    2007-10-01

    Phylogenetic tree estimation plays a critical role in a wide variety of molecular studies, including molecular systematics, phylogenetics, and comparative genomics. Finding the optimal tree relating a set of sequences using score-based (optimality criterion) methods, such as maximum likelihood and maximum parsimony, may require all possible trees to be considered, which is not feasible even for modest numbers of sequences. In practice, trees are estimated using heuristics that represent a trade-off between topological accuracy and speed. I present a series of novel algorithms suitable for score-based phylogenetic tree reconstruction that demonstrably improve the accuracy of tree estimates while maintaining high computational speeds. The heuristics function by allowing the efficient exploration of large numbers of trees through novel hill-climbing and resampling strategies. These heuristics, and other computational approximations, are implemented for maximum likelihood estimation of trees in the program Leaphy, and its performance is compared to other popular phylogenetic programs. Trees are estimated from 4059 different protein alignments using a selection of phylogenetic programs and the likelihoods of the tree estimates are compared. Trees estimated using Leaphy are found to have equal to or better likelihoods than trees estimated using other phylogenetic programs in 4004 (98.6%) families and provide a unique best tree that no other program found in 1102 (27.1%) families. The improvement is particularly marked for larger families (80 to 100 sequences), where Leaphy finds a unique best tree in 81.7% of families.

  16. BioMatriX: Sequence analysis, structure visualization, phylogenetics ...

    African Journals Online (AJOL)

    bmx-biomatrix.blogspot.com) developed for biological science community to augment scientific research regarding genomics, proteomics, phylogenetics and linkage analysis in one platform. BioMatriX offers multi-functional services to perform ...

  17. DNA Translator and Aligner: HyperCard utilities to aid phylogenetic analysis of molecules.

    Science.gov (United States)

    Eernisse, D J

    1992-04-01

    DNA Translator and Aligner are molecular phylogenetics HyperCard stacks for Macintosh computers. They manipulate sequence data to provide graphical gene mapping, conversions, translations and manual multiple-sequence alignment editing. DNA Translator is able to convert documented GenBank or EMBL documented sequences into linearized, rescalable gene maps whose gene sequences are extractable by clicking on the corresponding map button or by selection from a scrolling list. Provided gene maps, complete with extractable sequences, consist of nine metazoan, one yeast, and one ciliate mitochondrial DNAs and three green plant chloroplast DNAs. Single or multiple sequences can be manipulated to aid in phylogenetic analysis. Sequences can be translated between nucleic acids and proteins in either direction with flexible support of alternate genetic codes and ambiguous nucleotide symbols. Multiple aligned sequence output from diverse sources can be converted to Nexus, Hennig86 or PHYLIP format for subsequent phylogenetic analysis. Input or output alignments can be examined with Aligner, a convenient accessory stack included in the DNA Translator package. Aligner is an editor for the manual alignment of up to 100 sequences that toggles between display of matched characters and normal unmatched sequences. DNA Translator also generates graphic displays of amino acid coding and codon usage frequency relative to all other, or only synonymous, codons for approximately 70 select organism-organelle combinations. Codon usage data is compatible with spreadsheet or UWGCG formats for incorporation of additional molecules of interest. The complete package is available via anonymous ftp and is free for non-commercial uses.

  18. Phylogenetic analysis of hepatitis B virus in pakistan

    International Nuclear Information System (INIS)

    Baig, S.; Hasnain, N.U.

    2008-01-01

    To identify the distribution pattern of Hepatitis B Virus (HBV) genotype in a group of patients and to study its phylogenetic divergence. Two hundred and one HBV infected patients were genotyped for this study. All HbsAg positive individuals, either healthy carriers or suffering from conditions such as acute or chronic hepatitis, cirrhosis and hepatocellular carcinoma were included. Hepatitis B patients co-infected with other hepatic viruses were excluded. Hepatitis B virus DNA was extracted from serum, and subjected to a nested PCR, using the primers type-specific for genotype detection. Phylogenetic analysis was performed in the pre-S1 through S genes of HBV. The divergence was studied through 15 sequences of 967bp submitted to the DBJ/EMBL/GenBank databases accessible under accession number EF584640 through EF584654. Out of 201 patients tested, 156 were males and 45 were females. Genotype D was the predominant type found in 128 (64%) patients followed by A in 47 (23%) and mixed A/D in 26 (13%). Phylogenetic analysis confirmed the dominance of genotype D and subtype ayw2. There was dominance of genotype D subtype ayw2. It had a close resemblance with HBV strains that circulate in Iran, India and Japan. (author)

  19. Sifting through genomes with iterative-sequence clustering produces a large, phylogenetically diverse protein-family resource

    Directory of Open Access Journals (Sweden)

    Sharpton Thomas J

    2012-10-01

    Full Text Available Abstract Background New computational resources are needed to manage the increasing volume of biological data from genome sequencing projects. One fundamental challenge is the ability to maintain a complete and current catalog of protein diversity. We developed a new approach for the identification of protein families that focuses on the rapid discovery of homologous protein sequences. Results We implemented fully automated and high-throughput procedures to de novo cluster proteins into families based upon global alignment similarity. Our approach employs an iterative clustering strategy in which homologs of known families are sifted out of the search for new families. The resulting reduction in computational complexity enables us to rapidly identify novel protein families found in new genomes and to perform efficient, automated updates that keep pace with genome sequencing. We refer to protein families identified through this approach as “Sifting Families,” or SFams. Our analysis of ~10.5 million protein sequences from 2,928 genomes identified 436,360 SFams, many of which are not represented in other protein family databases. We validated the quality of SFam clustering through statistical as well as network topology–based analyses. Conclusions We describe the rapid identification of SFams and demonstrate how they can be used to annotate genomes and metagenomes. The SFam database catalogs protein-family quality metrics, multiple sequence alignments, hidden Markov models, and phylogenetic trees. Our source code and database are publicly available and will be subject to frequent updates (http://edhar.genomecenter.ucdavis.edu/sifting_families/.

  20. SATCHMO-JS: a webserver for simultaneous protein multiple sequence alignment and phylogenetic tree construction.

    Science.gov (United States)

    Hagopian, Raffi; Davidson, John R; Datta, Ruchira S; Samad, Bushra; Jarvis, Glen R; Sjölander, Kimmen

    2010-07-01

    We present the jump-start simultaneous alignment and tree construction using hidden Markov models (SATCHMO-JS) web server for simultaneous estimation of protein multiple sequence alignments (MSAs) and phylogenetic trees. The server takes as input a set of sequences in FASTA format, and outputs a phylogenetic tree and MSA; these can be viewed online or downloaded from the website. SATCHMO-JS is an extension of the SATCHMO algorithm, and employs a divide-and-conquer strategy to jump-start SATCHMO at a higher point in the phylogenetic tree, reducing the computational complexity of the progressive all-versus-all HMM-HMM scoring and alignment. Results on a benchmark dataset of 983 structurally aligned pairs from the PREFAB benchmark dataset show that SATCHMO-JS provides a statistically significant improvement in alignment accuracy over MUSCLE, Multiple Alignment using Fast Fourier Transform (MAFFT), ClustalW and the original SATCHMO algorithm. The SATCHMO-JS webserver is available at http://phylogenomics.berkeley.edu/satchmo-js. The datasets used in these experiments are available for download at http://phylogenomics.berkeley.edu/satchmo-js/supplementary/.

  1. Phylogenetic characterization of Clonorchis sinensis proteins homologous to the sigma-class glutathione transferase and their differential expression profiles.

    Science.gov (United States)

    Bae, Young-An; Kim, Jeong-Geun; Kong, Yoon

    2016-01-01

    Glutathione transferase (GST) is one of the major antioxidant proteins with diverse supplemental activities including peroxidase, isomerase, and thiol transferase. GSTs are classified into multiple classes on the basis of their primary structures and substrate/inhibitor specificity. However, the evolutionary routes and physiological environments specific to each of the closely related bioactive enzymes remain elusive. The sigma-like GSTs exhibit amino acid conservation patterns similar to the prostaglandin D synthases (PGDSs). In this study, we analyzed the phylogenetic position of the GSTs of the biocarcinogenic liver fluke, Clonorchis sinensis. We also observed induction profile of the GSTs in association with the parasite's maturation and in response to exogenous oxidative stresses, with special attention to sigma-class GSTs and PGDSs. The C. sinensis genome encoded 12 GST protein species, which were separately assigned to cytosolic (two omega-, one zeta-, two mu-, and five sigma-class), mitochondrial (one kappa-class), and microsomal (one membrane-associated proteins in eicosanoid and glutathione metabolism-like protein) GST families. Multiple sigma GST (or PGDS) orthologs were also detected in Opisthorchis viverrini. Other trematode species possessed only a single sigma-like GST gene. A phylogenetic analysis demonstrated that one of the sigma GST lineages duplicated in the common ancestor of trematodes were specifically expanded in the opisthorchiids, but deleted in other trematodes. The induction profiles of these sigma GST genes along with the development and aging of C. sinensis, and against various exogenous chemical stimuli strongly suggest that the paralogous sigma GST genes might be undergone specialized evolution to cope with the diverse hostile biochemical environments within the mammalian hepatobiliary ductal system. Copyright © 2016 Elsevier B.V. All rights reserved.

  2. Characterization of bud emergence 46 (BEM46) protein: Sequence, structural, phylogenetic and subcellular localization analyses

    International Nuclear Information System (INIS)

    Kumar, Abhishek; Kollath-Leiß, Krisztina; Kempken, Frank

    2013-01-01

    Highlights: •All eukaryotes have at least a single copy of a bem46 ortholog. •The catalytic triad of BEM46 is illustrated using sequence and structural analysis. •We identified indels in the conserved domain of BEM46 protein. •Localization studies of BEM46 protein were carried out using GFP-fusion tagging. -- Abstract: The bud emergence 46 (BEM46) protein from Neurospora crassa belongs to the α/β-hydrolase superfamily. Recently, we have reported that the BEM46 protein is localized in the perinuclear ER and also forms spots close by the plasma membrane. The protein appears to be required for cell type-specific polarity formation in N. crassa. Furthermore, initial studies suggested that the BEM46 amino acid sequence is conserved in eukaryotes and is considered to be one of the widespread conserved “known unknown” eukaryotic genes. This warrants for a comprehensive phylogenetic analysis of this superfamily to unravel origin and molecular evolution of these genes in different eukaryotes. Herein, we observe that all eukaryotes have at least a single copy of a bem46 ortholog. Upon scanning of these proteins in various genomes, we find that there are expansions leading into several paralogs in vertebrates. Usingcomparative genomic analyses, we identified insertion/deletions (indels) in the conserved domain of BEM46 protein, which allow to differentiate fungal classes such as ascomycetes from basidiomycetes. We also find that exonic indels are able to differentiate BEM46 homologs of different eukaryotic lineage. Furthermore, we unravel that BEM46 protein from N. crassa possess a novel endoplasmic-retention signal (PEKK) using GFP-fusion tagging experiments. We propose that three residues namely a serine 188S, a histidine 292H and an aspartic acid 262D are most critical residues, forming a catalytic triad in BEM46 protein from N. crassa. We carried out a comprehensive study on bem46 genes from a molecular evolution perspective with combination of functional

  3. Characterization of bud emergence 46 (BEM46) protein: Sequence, structural, phylogenetic and subcellular localization analyses

    Energy Technology Data Exchange (ETDEWEB)

    Kumar, Abhishek; Kollath-Leiß, Krisztina; Kempken, Frank, E-mail: fkempken@bot.uni-kiel.de

    2013-08-30

    Highlights: •All eukaryotes have at least a single copy of a bem46 ortholog. •The catalytic triad of BEM46 is illustrated using sequence and structural analysis. •We identified indels in the conserved domain of BEM46 protein. •Localization studies of BEM46 protein were carried out using GFP-fusion tagging. -- Abstract: The bud emergence 46 (BEM46) protein from Neurospora crassa belongs to the α/β-hydrolase superfamily. Recently, we have reported that the BEM46 protein is localized in the perinuclear ER and also forms spots close by the plasma membrane. The protein appears to be required for cell type-specific polarity formation in N. crassa. Furthermore, initial studies suggested that the BEM46 amino acid sequence is conserved in eukaryotes and is considered to be one of the widespread conserved “known unknown” eukaryotic genes. This warrants for a comprehensive phylogenetic analysis of this superfamily to unravel origin and molecular evolution of these genes in different eukaryotes. Herein, we observe that all eukaryotes have at least a single copy of a bem46 ortholog. Upon scanning of these proteins in various genomes, we find that there are expansions leading into several paralogs in vertebrates. Usingcomparative genomic analyses, we identified insertion/deletions (indels) in the conserved domain of BEM46 protein, which allow to differentiate fungal classes such as ascomycetes from basidiomycetes. We also find that exonic indels are able to differentiate BEM46 homologs of different eukaryotic lineage. Furthermore, we unravel that BEM46 protein from N. crassa possess a novel endoplasmic-retention signal (PEKK) using GFP-fusion tagging experiments. We propose that three residues namely a serine 188S, a histidine 292H and an aspartic acid 262D are most critical residues, forming a catalytic triad in BEM46 protein from N. crassa. We carried out a comprehensive study on bem46 genes from a molecular evolution perspective with combination of functional

  4. Exopolysaccharide-associated protein sorting in environmental organisms: the PEP-CTERM/EpsH system. Application of a novel phylogenetic profiling heuristic

    Directory of Open Access Journals (Sweden)

    Ward Naomi

    2006-08-01

    Full Text Available Abstract Background Protein translocation to the proper cellular destination may be guided by various classes of sorting signals recognizable in the primary sequence. Detection in some genomes, but not others, may reveal sorting system components by comparison of the phylogenetic profile of the class of sorting signal to that of various protein families. Results We describe a short C-terminal homology domain, sporadically distributed in bacteria, with several key characteristics of protein sorting signals. The domain includes a near-invariant motif Pro-Glu-Pro (PEP. This possible recognition or processing site is followed by a predicted transmembrane helix and a cluster rich in basic amino acids. We designate this domain PEP-CTERM. It tends to occur multiple times in a genome if it occurs at all, with a median count of eight instances; Verrucomicrobium spinosum has sixty-five. PEP-CTERM-containing proteins generally contain an N-terminal signal peptide and exhibit high diversity and little homology to known proteins. All bacteria with PEP-CTERM have both an outer membrane and exopolysaccharide (EPS production genes. By a simple heuristic for screening phylogenetic profiles in the absence of pre-formed protein families, we discovered that a homolog of the membrane protein EpsH (exopolysaccharide locus protein H occurs in a species when PEP-CTERM domains are found. The EpsH family contains invariant residues consistent with a transpeptidase function. Most PEP-CTERM proteins are encoded by single-gene operons preceded by large intergenic regions. In the Proteobacteria, most of these upstream regions share a DNA sequence, a probable cis-regulatory site that contains a sigma-54 binding motif. The phylogenetic profile for this DNA sequence exactly matches that of three proteins: a sigma-54-interacting response regulator (PrsR, a transmembrane histidine kinase (PrsK, and a TPR protein (PrsT. Conclusion These findings are consistent with the hypothesis

  5. galaxie--CGI scripts for sequence identification through automated phylogenetic analysis.

    Science.gov (United States)

    Nilsson, R Henrik; Larsson, Karl-Henrik; Ursing, Björn M

    2004-06-12

    The prevalent use of similarity searches like BLAST to identify sequences and species implicitly assumes the reference database to be of extensive sequence sampling. This is often not the case, restraining the correctness of the outcome as a basis for sequence identification. Phylogenetic inference outperforms similarity searches in retrieving correct phylogenies and consequently sequence identities, and a project was initiated to design a freely available script package for sequence identification through automated Web-based phylogenetic analysis. Three CGI scripts were designed to facilitate qualified sequence identification from a Web interface. Query sequences are aligned to pre-made alignments or to alignments made by ClustalW with entries retrieved from a BLAST search. The subsequent phylogenetic analysis is based on the PHYLIP package for inferring neighbor-joining and parsimony trees. The scripts are highly configurable. A service installation and a version for local use are found at http://andromeda.botany.gu.se/galaxiewelcome.html and http://galaxie.cgb.ki.se

  6. Phylogenetic diversity analysis of Trichoderma species based on ...

    African Journals Online (AJOL)

    vi-4177/CSAU be assigned as the type strains of a species of genus Trichoderma based on phylogenetic tree analysis together with the 18S rRNA gene sequence search in Ribosomal Database Project, small subunit rRNA and large subunit ...

  7. [Phylogenetic analysis of genomes of Vibrio cholerae strains isolated on the territory of Rostov region].

    Science.gov (United States)

    Kuleshov, K V; Markelov, M L; Dedkov, V G; Vodop'ianov, A S; Kermanov, A V; Pisanov, R V; Kruglikov, V D; Mazrukho, A B; Maleev, V V; Shipulin, G A

    2013-01-01

    Determination of origin of 2 Vibrio cholerae strains isolated on the territory of Rostov region by using full genome sequencing data. Toxigenic strain 2011 EL- 301 V. cholerae 01 El Tor Inaba No. 301 (ctxAB+, tcpA+) and nontoxigenic strain V. cholerae O1 Ogawa P- 18785 (ctxAB-, tcpA+) were studied. Sequencing was carried out on the MiSeq platform. Phylogenetic analysis of the genomes obtained was carried out based on comparison of conservative part of the studied and 54 previously sequenced genomes. 2011EL-301 strain genome was presented by 164 contigs with an average coverage of 100, N50 parameter was 132 kb, for strain P- 18785 - 159 contigs with a coverage of69, N50 - 83 kb. The contigs obtained for strain 2011 EL-301 were deposited in DDBJ/EMBL/GenBank databases with access code AJFN02000000, for strain P-18785 - ANHS00000000. 716 protein-coding orthologous genes were detected. Based on phylogenetic analysis strain P- 18785 belongs to PG-1 subgroup (a group of predecessor strains of the 7th pandemic). Strain 2011EL-301 belongs to groups of strains of the 7th pandemic and is included into the cluster with later isolates that are associated with cases of cholera in South Africa and cases of import of cholera to the USA from Pakistan. The data obtained allows to establish phylogenetic connections with V cholerae strains isolated earlier.

  8. Phylogenetic analysis of ferlin genes reveals ancient eukaryotic origins

    Directory of Open Access Journals (Sweden)

    Lek Monkol

    2010-07-01

    Full Text Available Abstract Background The ferlin gene family possesses a rare and identifying feature consisting of multiple tandem C2 domains and a C-terminal transmembrane domain. Much currently remains unknown about the fundamental function of this gene family, however, mutations in its two most well-characterised members, dysferlin and otoferlin, have been implicated in human disease. The availability of genome sequences from a wide range of species makes it possible to explore the evolution of the ferlin family, providing contextual insight into characteristic features that define the ferlin gene family in its present form in humans. Results Ferlin genes were detected from all species of representative phyla, with two ferlin subgroups partitioned within the ferlin phylogenetic tree based on the presence or absence of a DysF domain. Invertebrates generally possessed two ferlin genes (one with DysF and one without, with six ferlin genes in most vertebrates (three DysF, three non-DysF. Expansion of the ferlin gene family is evident between the divergence of lamprey (jawless vertebrates and shark (cartilaginous fish. Common to almost all ferlins is an N-terminal C2-FerI-C2 sandwich, a FerB motif, and two C-terminal C2 domains (C2E and C2F adjacent to the transmembrane domain. Preservation of these structural elements throughout eukaryotic evolution suggests a fundamental role of these motifs for ferlin function. In contrast, DysF, C2DE, and FerA are optional, giving rise to subtle differences in domain topologies of ferlin genes. Despite conservation of multiple C2 domains in all ferlins, the C-terminal C2 domains (C2E and C2F displayed higher sequence conservation and greater conservation of putative calcium binding residues across paralogs and orthologs. Interestingly, the two most studied non-mammalian ferlins (Fer-1 and Misfire in model organisms C. elegans and D. melanogaster, present as outgroups in the phylogenetic analysis, with results suggesting

  9. Data for constructing insect genome content matrices for phylogenetic analysis and functional annotation

    Directory of Open Access Journals (Sweden)

    Jeffrey Rosenfeld

    2016-03-01

    Full Text Available Twenty one fully sequenced and well annotated insect genomes were used to construct genome content matrices for phylogenetic analysis and functional annotation of insect genomes. To examine the role of e-value cutoff in ortholog determination we used scaled e-value cutoffs and a single linkage clustering approach.. The present communication includes (1 a list of the genomes used to construct the genome content phylogenetic matrices, (2 a nexus file with the data matrices used in phylogenetic analysis, (3 a nexus file with the Newick trees generated by phylogenetic analysis, (4 an excel file listing the Core (CORE genes and Unique (UNI genes found in five insect groups, and (5 a figure showing a plot of consistency index (CI versus percent of unannotated genes that are apomorphies in the data set for gene losses and gains and bar plots of gains and losses for four consistency index (CI cutoffs.

  10. Soft-tissue anatomy of the extant hominoids: a review and phylogenetic analysis

    Science.gov (United States)

    Gibbs, S; Collard, M; Wood, B

    2002-01-01

    This paper reports the results of a literature search for information about the soft-tissue anatomy of the extant non-human hominoid genera, Pan, Gorilla, Pongo and Hylobates, together with the results of a phylogenetic analysis of these data plus comparable data for Homo. Information on the four extant non-human hominoid genera was located for 240 out of the 1783 soft-tissue structures listed in the Nomina Anatomica. Numerically these data are biased so that information about some systems (e.g. muscles) and some regions (e.g. the forelimb) are over-represented, whereas other systems and regions (e.g. the veins and the lymphatics of the vascular system, the head region) are either under-represented or not represented at all. Screening to ensure that the data were suitable for use in a phylogenetic analysis reduced the number of eligible soft-tissue structures to 171. These data, together with comparable data for modern humans, were converted into discontinuous character states suitable for phylogenetic analysis and then used to construct a taxon-by-character matrix. This matrix was used in two tests of the hypothesis that soft-tissue characters can be relied upon to reconstruct hominoid phylogenetic relationships. In the first, parsimony analysis was used to identify cladograms requiring the smallest number of character state changes. In the second, the phylogenetic bootstrap was used to determine the confidence intervals of the most parsimonious clades. The parsimony analysis yielded a single most parsimonious cladogram that matched the molecular cladogram. Similarly the bootstrap analysis yielded clades that were compatible with the molecular cladogram; a (Homo, Pan) clade was supported by 95% of the replicates, and a (Gorilla, Pan, Homo) clade by 96%. These are the first hominoid morphological data to provide statistically significant support for the clades favoured by the molecular evidence. PMID:11833653

  11. A phylogenetic analysis of normal modes evolution in enzymes and its relationship to enzyme function.

    Science.gov (United States)

    Lai, Jason; Jin, Jing; Kubelka, Jan; Liberles, David A

    2012-09-21

    Since the dynamic nature of protein structures is essential for enzymatic function, it is expected that functional evolution can be inferred from the changes in protein dynamics. However, dynamics can also diverge neutrally with sequence substitution between enzymes without changes of function. In this study, a phylogenetic approach is implemented to explore the relationship between enzyme dynamics and function through evolutionary history. Protein dynamics are described by normal mode analysis based on a simplified harmonic potential force field applied to the reduced C(α) representation of the protein structure while enzymatic function is described by Enzyme Commission numbers. Similarity of the binding pocket dynamics at each branch of the protein family's phylogeny was analyzed in two ways: (1) explicitly by quantifying the normal mode overlap calculated for the reconstructed ancestral proteins at each end and (2) implicitly using a diffusion model to obtain the reconstructed lineage-specific changes in the normal modes. Both explicit and implicit ancestral reconstruction identified generally faster rates of change in dynamics compared with the expected change from neutral evolution at the branches of potential functional divergences for the α-amylase, D-isomer-specific 2-hydroxyacid dehydrogenase, and copper-containing amine oxidase protein families. Normal mode analysis added additional information over just comparing the RMSD of static structures. However, the branch-specific changes were not statistically significant compared to background function-independent neutral rates of change of dynamic properties and blind application of the analysis would not enable prediction of changes in enzyme specificity. Copyright © 2012 Elsevier Ltd. All rights reserved.

  12. Characterization and phylogenetic analysis of α-gliadin gene ...

    Indian Academy of Sciences (India)

    Supplementary data: Characterization and phylogenetic analysis of α-gliadin gene sequences reveals significant genomic divergence in Triticeae species. Guang-Rong Li, Tao Lang, En-Nian Yang, Cheng Liu ... The MITE insertion at the 3 UTR is boxed. Figure 2. The secondary structure of MITE insertion in HM452949.

  13. Phylogenetic Reconstruction Shows Independent Evolutionary Origins of Mitochondrial Transcription Factors from an Ancient Family of RNA Methyltransferase Proteins.

    Science.gov (United States)

    Aj Harris; Goldman, Aaron David

    2018-04-25

    Here, we generate a robust phylogenetic framework for the rRNA adenine N(6)-methyltransferase (RAMTase) protein family that shows a more ancient and complex evolutionary history within the family than previously reported. RAMTases occur universally by descent across the three domains of life, and typical orthologs within the family perform methylation of the small subunits of ribosomal RNA (rRNA). However, within the RAMTase family, two different groups of mitochondrial transcription factors, mtTFB1 and mtTFB2, have evolved in eukaryotes through neofunctionalization. Previous phylogenetic analyses have suggested that mtTFB1 and mtTFB2 comprise sister clades that arose via gene duplication, which occurred sometime following the endosymbiosis event that produced the mitochondrion. Through dense and taxonomically broad sampling of RAMTase family members especially within bacteria, we found that these eukaryotic mitochondrial transcription factors, mtTFB1 and mtTFB2, have independent origins in phylogenetically distant clades such that their divergence most likely predates the last universal common ancestor of life. The clade of mtTFB2s comprises orthologs in Opisthokonts and the clade of mtTFB1s includes orthologs in Amoebozoa and Metazoa. Thus, we clearly demonstrate that the neofunctionalization producing the transcription factor function evolved twice independently within the RAMTase family. These results are consistent with and help to elucidate outcomes from prior experimental studies, which found that some members of mtTFB1 still perform the ancestral rRNA methylation function, and the results have broader implications for understanding the evolution of new protein functions. Our phylogenetic reconstruction is also in agreement with prior studies showing two independent origins of plastid RAMTases in Viridiplantae and other photosynthetic autotrophs. We believe that this updated phylogeny of RAMTases should provide a robust evolutionary framework for ongoing

  14. Phylogenetic analysis of vitamin B12-related metabolism in Mycobacterium tuberculosis

    OpenAIRE

    Young, Douglas B.; Comas, I?aki; de Carvalho, Luiz P. S.

    2015-01-01

    Comparison of genome sequences from clinical isolates of Mycobacterium tuberculosis with phylogenetically-related pathogens Mycobacterium marinum, Mycobacterium kansasii, and Mycobacterium leprae reveals diversity amongst genes associated with vitamin B12-related metabolism. Diversity is generated by gene deletion events, differential acquisition of genes by horizontal transfer, and single nucleotide polymorphisms (SNPs) with predicted impact on protein function and transcriptional regulation...

  15. Comparative analysis of mitochondrial genomes of five aphid species (Hemiptera: Aphididae and phylogenetic implications.

    Directory of Open Access Journals (Sweden)

    Yuan Wang

    Full Text Available Insect mitochondrial genomes (mitogenomes are of great interest in exploring molecular evolution, phylogenetics and population genetics. Only two mitogenomes have been previously released in the insect group Aphididae, which consists of about 5,000 known species including some agricultural, forestry and horticultural pests. Here we report the complete 16,317 bp mitogenome of Cavariella salicicola and two nearly complete mitogenomes of Aphis glycines and Pterocomma pilosum. We also present a first comparative analysis of mitochondrial genomes of aphids. Results showed that aphid mitogenomes share conserved genomic organization, nucleotide and amino acid composition, and codon usage features. All 37 genes usually present in animal mitogenomes were sequenced and annotated. The analysis of gene evolutionary rate revealed the lowest and highest rates for COI and ATP8, respectively. A unique repeat region exclusively in aphid mitogenomes, which included variable numbers of tandem repeats in a lineage-specific manner, was highlighted for the first time. This region may have a function as another origin of replication. Phylogenetic reconstructions based on protein-coding genes and the stem-loop structures of control regions confirmed a sister relationship between Cavariella and pterocommatines. Current evidence suggest that pterocommatines could be formally transferred into Macrosiphini. Our paper also offers methodological instructions for obtaining other Aphididae mitochondrial genomes.

  16. REFGEN and TREENAMER: Automated Sequence Data Handling for Phylogenetic Analysis in the Genomic Era

    Science.gov (United States)

    Leonard, Guy; Stevens, Jamie R.; Richards, Thomas A.

    2009-01-01

    The phylogenetic analysis of nucleotide sequences and increasingly that of amino acid sequences is used to address a number of biological questions. Access to extensive datasets, including numerous genome projects, means that standard phylogenetic analyses can include many hundreds of sequences. Unfortunately, most phylogenetic analysis programs do not tolerate the sequence naming conventions of genome databases. Managing large numbers of sequences and standardizing sequence labels for use in phylogenetic analysis programs can be a time consuming and laborious task. Here we report the availability of an online resource for the management of gene sequences recovered from public access genome databases such as GenBank. These web utilities include the facility for renaming every sequence in a FASTA alignment file, with each sequence label derived from a user-defined combination of the species name and/or database accession number. This facility enables the user to keep track of the branching order of the sequences/taxa during multiple tree calculations and re-optimisations. Post phylogenetic analysis, these webpages can then be used to rename every label in the subsequent tree files (with a user-defined combination of species name and/or database accession number). Together these programs drastically reduce the time required for managing sequence alignments and labelling phylogenetic figures. Additional features of our platform include the automatic removal of identical accession numbers (recorded in the report file) and generation of species and accession number lists for use in supplementary materials or figure legends. PMID:19812722

  17. A member of the HSP90 family from ovine Babesia in China: molecular characterization, phylogenetic analysis and antigenicity.

    Science.gov (United States)

    Guan, Guiquan; Liu, Junlong; Liu, Aihong; Li, Youquan; Niu, Qingli; Gao, Jinliang; Luo, Jianxun; Chauvin, Alain; Yin, Hong; Moreau, Emmanuelle

    2015-09-01

    Heat shock protein 90 (HSP90) is a key component of the molecular chaperone complex essential for activating many signalling proteins involved in the development and progression of pathogenic cellular transformation. A Hsp90 gene (BQHsp90) was cloned and characterized from Babesia sp. BQ1 (Lintan), an ovine Babesia isolate belonging to Babesia motasi-like group, by screening a cDNA expression library and performing rapid amplification of cDNA ends. The full-length cDNA of BQHsp90 is 2399 bp with an open reading frame of 2154 bp encoding a predicted 83 kDa polypeptide with 717 amino acid residues. It shows significant homology and similar structural characteristics to Hsp90 of other apicomplex organisms. Phylogenetic analysis, based on the HSP90 amino acid sequences, showed that the Babesia genus is clearly separated from other apicomplexa genera. Five Chinese ovine Babesia isolates were divided into 2 phylogenetic clusters, namely Babesia sp. Xinjiang (previously designated a new species) cluster and B. motasi-like cluster which could be further divided into 2 subclusters (Babesia sp. BQ1 (Lintan)/Babesia sp. Tianzhu and Babesia sp. BQ1 (Ningxian)/Babesia sp. Hebei). Finally, the antigenicity of rBQHSP90 protein from prokaryotic expression was also evaluated using western blot and enzyme-linked immunosorbent assay (ELISA).

  18. Identification, expression and phylogenetic analysis of EgG1Y162 from Echinococcus granulosus

    Science.gov (United States)

    Zhang, Fengbo; Ma, Xiumin; Zhu, Yuejie; Wang, Hongying; Liu, Xianfei; Zhu, Min; Ma, Haimei; Wen, Hao; Fan, Haining; Ding, Jianbing

    2014-01-01

    Objective: This study was to clone, identify and analyze the characteristics of egG1Y162 gene from Echinococcus granulosus. Methods: Genomic DNA and total RNAs were extracted from four different developmental stages of protoscolex, germinal layer, adult and egg of Echinococcus granulosus, respectively. Fluorescent quantitative PCR was used for analyzing the expression of egG1Y162 gene. Prokaryotic expression plasmid of pET41a-EgG1Y162 was constructed to express recombinant His-EgG1Y162 antigen. Western blot analysis was performed to detect antigenicity of EgG1Y162 antigen. Gene sequence, amino acid alignment and phylogenetic tree of EgG1Y162 were analyzed by BLAST, online Spidey and MEGA4 software, respectively. Results: EgG1Y162 gene was expressed in four developmental stages of Echinococcus granulosus. And, egG1Y162 gene expression was the highest in the adult stage, with the relative value of 19.526, significantly higher than other three stages. Additionally, Western blot analysis revealed that EgG1Y162 recombinant protein had good reaction with serum samples from Echinococcus granulosus infected human and dog. Moreover, EgG1Y162 antigen was phylogenetically closest to EmY162 antigen, with the similarity over 90%. Conclusion: Our study identified EgG1Y162 antigen in Echinococcus granulosus for the first time. EgG1Y162 antigen had a high similarity with EmY162 antigen, with the genetic differences mainly existing in the intron region. And, EgG1Y162 recombinant protein showed good antigenicity. PMID:25337206

  19. Identification, expression and phylogenetic analysis of EgG1Y162 from Echinococcus granulosus.

    Science.gov (United States)

    Zhang, Fengbo; Ma, Xiumin; Zhu, Yuejie; Wang, Hongying; Liu, Xianfei; Zhu, Min; Ma, Haimei; Wen, Hao; Fan, Haining; Ding, Jianbing

    2014-01-01

    This study was to clone, identify and analyze the characteristics of egG1Y162 gene from Echinococcus granulosus. Genomic DNA and total RNAs were extracted from four different developmental stages of protoscolex, germinal layer, adult and egg of Echinococcus granulosus, respectively. Fluorescent quantitative PCR was used for analyzing the expression of egG1Y162 gene. Prokaryotic expression plasmid of pET41a-EgG1Y162 was constructed to express recombinant His-EgG1Y162 antigen. Western blot analysis was performed to detect antigenicity of EgG1Y162 antigen. Gene sequence, amino acid alignment and phylogenetic tree of EgG1Y162 were analyzed by BLAST, online Spidey and MEGA4 software, respectively. EgG1Y162 gene was expressed in four developmental stages of Echinococcus granulosus. And, egG1Y162 gene expression was the highest in the adult stage, with the relative value of 19.526, significantly higher than other three stages. Additionally, Western blot analysis revealed that EgG1Y162 recombinant protein had good reaction with serum samples from Echinococcus granulosus infected human and dog. Moreover, EgG1Y162 antigen was phylogenetically closest to EmY162 antigen, with the similarity over 90%. Our study identified EgG1Y162 antigen in Echinococcus granulosus for the first time. EgG1Y162 antigen had a high similarity with EmY162 antigen, with the genetic differences mainly existing in the intron region. And, EgG1Y162 recombinant protein showed good antigenicity.

  20. The Complete Mitochondrial Genome of Corizus tetraspilus (Hemiptera: Rhopalidae) and Phylogenetic Analysis of Pentatomomorpha

    Science.gov (United States)

    Guo, Zhong-Long; Wang, Juan; Shen, Yu-Ying

    2015-01-01

    Insect mitochondrial genome (mitogenome) are the most extensively used genetic information for molecular evolution, phylogenetics and population genetics. Pentatomomorpha (>14,000 species) is the second largest infraorder of Heteroptera and of great economic importance. To better understand the diversity and phylogeny within Pentatomomorpha, we sequenced and annotated the complete mitogenome of Corizus tetraspilus (Hemiptera: Rhopalidae), an important pest of alfalfa in China. We analyzed the main features of the C. tetraspilus mitogenome, and provided a comparative analysis with four other Coreoidea species. Our results reveal that gene content, gene arrangement, nucleotide composition, codon usage, rRNA structures and sequences of mitochondrial transcription termination factor are conserved in Coreoidea. Comparative analysis shows that different protein-coding genes have been subject to different evolutionary rates correlated with the G+C content. All the transfer RNA genes found in Coreoidea have the typical clover leaf secondary structure, except for trnS1 (AGN) which lacks the dihydrouridine (DHU) arm and possesses a unusual anticodon stem (9 bp vs. the normal 5 bp). The control regions (CRs) among Coreoidea are highly variable in size, of which the CR of C. tetraspilus is the smallest (440 bp), making the C. tetraspilus mitogenome the smallest (14,989 bp) within all completely sequenced Coreoidea mitogenomes. No conserved motifs are found in the CRs of Coreoidea. In addition, the A+T content (60.68%) of the CR of C. tetraspilus is much lower than that of the entire mitogenome (74.88%), and is lowest among Coreoidea. Phylogenetic analyses based on mitogenomic data support the monophyly of each superfamily within Pentatomomorpha, and recognize a phylogenetic relationship of (Aradoidea + (Pentatomoidea + (Lygaeoidea + (Pyrrhocoroidea + Coreoidea)))). PMID:26042898

  1. Structural and Phylogenetic Analysis of Laccases from Trichoderma: A Bioinformatic Approach

    Science.gov (United States)

    Cázares-García, Saila Viridiana; Vázquez-Garcidueñas, Ma. Soledad; Vázquez-Marrufo, Gerardo

    2013-01-01

    The genus Trichoderma includes species of great biotechnological value, both for their mycoparasitic activities and for their ability to produce extracellular hydrolytic enzymes. Although activity of extracellular laccase has previously been reported in Trichoderma spp., the possible number of isoenzymes is still unknown, as are the structural and functional characteristics of both the genes and the putative proteins. In this study, the system of laccases sensu stricto in the Trichoderma species, the genomes of which are publicly available, were analyzed using bioinformatic tools. The intron/exon structure of the genes and the identification of specific motifs in the sequence of amino acids of the proteins generated in silico allow for clear differentiation between extracellular and intracellular enzymes. Phylogenetic analysis suggests that the common ancestor of the genus possessed a functional gene for each one of these enzymes, which is a characteristic preserved in T. atroviride and T. virens. This analysis also reveals that T. harzianum and T. reesei only retained the intracellular activity, whereas T. asperellum added an extracellular isoenzyme acquired through horizontal gene transfer during the mycoparasitic process. The evolutionary analysis shows that in general, extracellular laccases are subjected to purifying selection, and intracellular laccases show neutral evolution. The data provided by the present study will enable the generation of experimental approximations to better understand the physiological role of laccases in the genus Trichoderma and to increase their biotechnological potential. PMID:23383142

  2. The Development of Three Long Universal Nuclear Protein-Coding Locus Markers and Their Application to Osteichthyan Phylogenetics with Nested PCR

    Science.gov (United States)

    Zhang, Peng

    2012-01-01

    Background Universal nuclear protein-coding locus (NPCL) markers that are applicable across diverse taxa and show good phylogenetic discrimination have broad applications in molecular phylogenetic studies. For example, RAG1, a representative NPCL marker, has been successfully used to make phylogenetic inferences within all major osteichthyan groups. However, such markers with broad working range and high phylogenetic performance are still scarce. It is necessary to develop more universal NPCL markers comparable to RAG1 for osteichthyan phylogenetics. Methodology/Principal Findings We developed three long universal NPCL markers (>1.6 kb each) based on single-copy nuclear genes (KIAA1239, SACS and TTN) that possess large exons and exhibit the appropriate evolutionary rates. We then compared their phylogenetic utilities with that of the reference marker RAG1 in 47 jawed vertebrate species. In comparison with RAG1, each of the three long universal markers yielded similar topologies and branch supports, all in congruence with the currently accepted osteichthyan phylogeny. To compare their phylogenetic performance visually, we also estimated the phylogenetic informativeness (PI) profile for each of the four long universal NPCL markers. The PI curves indicated that SACS performed best over the whole timescale, while RAG1, KIAA1239 and TTN exhibited similar phylogenetic performances. In addition, we compared the success of nested PCR and standard PCR when amplifying NPCL marker fragments. The amplification success rate and efficiency of the nested PCR were overwhelmingly higher than those of standard PCR. Conclusions/Significance Our work clearly demonstrates the superiority of nested PCR over the conventional PCR in phylogenetic studies and develops three long universal NPCL markers (KIAA1239, SACS and TTN) with the nested PCR strategy. The three markers exhibit high phylogenetic utilities in osteichthyan phylogenetics and can be widely used as pilot genes for

  3. REFGEN and TREENAMER: Automated Sequence Data Handling for Phylogenetic Analysis in the Genomic Era

    Directory of Open Access Journals (Sweden)

    Guy Leonard

    2009-01-01

    Full Text Available The phylogenetic analysis of nucleotide sequences and increasingly that of amino acid sequences is used to address a number of biological questions. Access to extensive datasets, including numerous genome projects, means that standard phylogenetic analyses can include many hundreds of sequences. Unfortunately, most phylogenetic analysis programs do not tolerate the sequence naming conventions of genome databases. Managing large numbers of sequences and standardizing sequence labels for use in phylogenetic analysis programs can be a time consuming and laborious task. Here we report the availability of an online resource for the management of gene sequences recovered from public access genome databases such as GenBank. These web utilities include the facility for renaming every sequence in a FASTA alignment fi le, with each sequence label derived from a user-defined combination of the species name and/or database accession number. This facility enables the user to keep track of the branching order of the sequences/taxa during multiple tree calculations and re-optimisations. Post phylogenetic analysis, these webpages can then be used to rename every label in the subsequent tree fi les (with a user-defined combination of species name and/or database accession number. Together these programs drastically reduce the time required for managing sequence alignments and labelling phylogenetic figures. Additional features of our platform include the automatic removal of identical accession numbers (recorded in the report file and generation of species and accession number lists for use in supplementary materials or figure legends.

  4. Human parainfluenza virus type 2 hemagglutinin-neuramindase gene: sequence and phylogenetic analysis of the Saudi strain Riyadh 105/2009

    Directory of Open Access Journals (Sweden)

    Almajhdi Fahad N

    2012-12-01

    Full Text Available Abstract Background Although human parainfluenza type 2 (HPIV-2 virus is an important respiratory pathogen, a little is known about strains circulating in Saudi Arabia. Findings Among 180 nasopharyngeal aspirates collected from suspected cases in Riyadh, only one sample (0.56% was confirmed HPIV-2 positive by nested RT-PCR. The sample that was designated Riyadh 105/2009 was used for sequencing and phylogenetic analysis of the most variable virus gene; the haemagglutinin-neuramindase (HN. Comparison of HN gene of Riyadh 105/2009 strain and the relevant sequences available in GenBank revealed a strong relationship with Oklahoma-94-2009 strain. Phylogenetic analysis indicated four different clusters of HPIV-2 strains (G1-4. Twenty-three amino acid substitutions were recorded for Riyadh 105/2009, from which four are unique. The majority of substitutions (n=18 had changed their amino acids characteristics. By analyzing the effect of the recorded substitutions on the protein function using SIFT program, only two located at positions 360 and 571 were predicted to be deleterious. Conclusions The presented changes of Riyadh 105/2009 strain may possess potential effect on the protein structure and/or function level. This is the first report that describes partial characterization of Saudi HPIV-2 strain.

  5. Comprehensive Phylogenetic Analysis of Bovine Non-aureus Staphylococci Species Based on Whole-Genome Sequencing

    Science.gov (United States)

    Naushad, Sohail; Barkema, Herman W.; Luby, Christopher; Condas, Larissa A. Z.; Nobrega, Diego B.; Carson, Domonique A.; De Buck, Jeroen

    2016-01-01

    Non-aureus staphylococci (NAS), a heterogeneous group of a large number of species and subspecies, are the most frequently isolated pathogens from intramammary infections in dairy cattle. Phylogenetic relationships among bovine NAS species are controversial and have mostly been determined based on single-gene trees. Herein, we analyzed phylogeny of bovine NAS species using whole-genome sequencing (WGS) of 441 distinct isolates. In addition, evolutionary relationships among bovine NAS were estimated from multilocus data of 16S rRNA, hsp60, rpoB, sodA, and tuf genes and sequences from these and numerous other single genes/proteins. All phylogenies were created with FastTree, Maximum-Likelihood, Maximum-Parsimony, and Neighbor-Joining methods. Regardless of methodology, WGS-trees clearly separated bovine NAS species into five monophyletic coherent clades. Furthermore, there were consistent interspecies relationships within clades in all WGS phylogenetic reconstructions. Except for the Maximum-Parsimony tree, multilocus data analysis similarly produced five clades. There were large variations in determining clades and interspecies relationships in single gene/protein trees, under different methods of tree constructions, highlighting limitations of using single genes for determining bovine NAS phylogeny. However, based on WGS data, we established a robust phylogeny of bovine NAS species, unaffected by method or model of evolutionary reconstructions. Therefore, it is now possible to determine associations between phylogeny and many biological traits, such as virulence, antimicrobial resistance, environmental niche, geographical distribution, and host specificity. PMID:28066335

  6. Hal: an automated pipeline for phylogenetic analyses of genomic data.

    Science.gov (United States)

    Robbertse, Barbara; Yoder, Ryan J; Boyd, Alex; Reeves, John; Spatafora, Joseph W

    2011-02-07

    The rapid increase in genomic and genome-scale data is resulting in unprecedented levels of discrete sequence data available for phylogenetic analyses. Major analytical impasses exist, however, prior to analyzing these data with existing phylogenetic software. Obstacles include the management of large data sets without standardized naming conventions, identification and filtering of orthologous clusters of proteins or genes, and the assembly of alignments of orthologous sequence data into individual and concatenated super alignments. Here we report the production of an automated pipeline, Hal that produces multiple alignments and trees from genomic data. These alignments can be produced by a choice of four alignment programs and analyzed by a variety of phylogenetic programs. In short, the Hal pipeline connects the programs BLASTP, MCL, user specified alignment programs, GBlocks, ProtTest and user specified phylogenetic programs to produce species trees. The script is available at sourceforge (http://sourceforge.net/projects/bio-hal/). The results from an example analysis of Kingdom Fungi are briefly discussed.

  7. Phylogenetic analysis of VP2 gene of canine parvovirus and comparison with Indian and world isolates.

    Science.gov (United States)

    Kaur, G; Chandra, M; Dwivedi, P N

    2016-03-01

    Canine parvovirus (CPV) causes hemorrhagic enteritis, especially in young dogs, leading to high morbidity and mortality. It has four main antigenic types CPV-2, CPV-2a, CPV-2b and CPV-2c. Virus protein 2 (VP2) is the main capsid protein and mutations affecting VP2 gene are responsible for the evolution of various antigenic types of CPV. Full length VP2 gene from field isolates was amplified and cloned for sequence analysis. The sequences were submitted to the GenBank and were assigned Acc. Nos., viz. KP406928.1 for P12, KP406927.1 for P15, KP406930.1 for P32, KP406926.1 for Megavac-6 and KP406929.1 for NobivacDHPPi. Phylogenetic analysis indicated that the samples were forming a separate clad with vaccine strains. When the samples were compared with the world and Indian isolates, it was observed that samples formed a separate node indicating regional genetic variation in CPV.

  8. Phylogenetic and comparative gene expression analysis of barley (Hordeum vulgare)WRKY transcription factor family reveals putatively retained functions betweenmonocots and dicots

    Energy Technology Data Exchange (ETDEWEB)

    Mangelsen, Elke; Kilian, Joachim; Berendzen, Kenneth W.; Kolukisaoglu, Uner; Harter, Klaus; Jansson, Christer; Wanke, Dierk

    2008-02-01

    WRKY proteins belong to the WRKY-GCM1 superfamily of zinc finger transcription factors that have been subject to a large plant-specific diversification. For the cereal crop barley (Hordeum vulgare), three different WRKY proteins have been characterized so far, as regulators in sucrose signaling, in pathogen defense, and in response to cold and drought, respectively. However, their phylogenetic relationship remained unresolved. In this study, we used the available sequence information to identify a minimum number of 45 barley WRKY transcription factor (HvWRKY) genes. According to their structural features the HvWRKY factors were classified into the previously defined polyphyletic WRKY subgroups 1 to 3. Furthermore, we could assign putative orthologs of the HvWRKY proteins in Arabidopsis and rice. While in most cases clades of orthologous proteins were formed within each group or subgroup, other clades were composed of paralogous proteins for the grasses and Arabidopsis only, which is indicative of specific gene radiation events. To gain insight into their putative functions, we examined expression profiles of WRKY genes from publicly available microarray data resources and found group specific expression patterns. While putative orthologs of the HvWRKY transcription factors have been inferred from phylogenetic sequence analysis, we performed a comparative expression analysis of WRKY genes in Arabidopsis and barley. Indeed, highly correlative expression profiles were found between some of the putative orthologs. HvWRKY genes have not only undergone radiation in monocot or dicot species, but exhibit evolutionary traits specific to grasses. HvWRKY proteins exhibited not only sequence similarities between orthologs with Arabidopsis, but also relatedness in their expression patterns. This correlative expression is indicative for a putative conserved function of related WRKY proteins in mono- and dicot species.

  9. Utility of COX1 phylogenetics to differentiate between locally acquired and imported Plasmodium knowlesi infections in Singapore

    Science.gov (United States)

    Loh, Jin Phang; Gao, Qiu Han Christine; Lee, Vernon J; Tetteh, Kevin; Drakeley, Chris

    2016-01-01

    INTRODUCTION Although there have been several phylogenetic studies on Plasmodium knowlesi (P. knowlesi), only cytochrome c oxidase subunit 1 (COX1) gene analysis has shown some geographical differentiation between the isolates of different countries. METHODS Phylogenetic analysis of locally acquired P. knowlesi infections, based on circumsporozoite, small subunit ribosomal ribonucleic acid (SSU rRNA), merozoite surface protein 1 and COX1 gene targets, was performed. The results were compared with the published sequences of regional isolates from Malaysia and Thailand. RESULTS Phylogenetic analysis of the circumsporozoite, SSU rRNA and merozoite surface protein 1 gene sequences for regional P. knowlesi isolates showed no obvious differentiation that could be attributed to their geographical origin. However, COX1 gene analysis showed that it was possible to differentiate between Singapore-acquired P. knowlesi infections and P. knowlesi infections from Peninsular Malaysia and Sarawak, Borneo, Malaysia. CONCLUSION The ability to differentiate between locally acquired P. knowlesi infections and imported P. knowlesi infections has important utility for the monitoring of P. knowlesi malaria control programmes in Singapore. PMID:26805667

  10. Utility of COX1 phylogenetics to differentiate between locally acquired and imported Plasmodium knowlesi infections in Singapore.

    Science.gov (United States)

    Loh, Jin Phang; Gao, Qiu Han Christine; Lee, Vernon J; Tetteh, Kevin; Drakeley, Chris

    2016-12-01

    Although there have been several phylogenetic studies on Plasmodium knowlesi (P. knowlesi), only cytochrome c oxidase subunit 1 (COX1) gene analysis has shown some geographical differentiation between the isolates of different countries. Phylogenetic analysis of locally acquired P. knowlesi infections, based on circumsporozoite, small subunit ribosomal ribonucleic acid (SSU rRNA), merozoite surface protein 1 and COX1 gene targets, was performed. The results were compared with the published sequences of regional isolates from Malaysia and Thailand. Phylogenetic analysis of the circumsporozoite, SSU rRNA and merozoite surface protein 1 gene sequences for regional P. knowlesi isolates showed no obvious differentiation that could be attributed to their geographical origin. However, COX1 gene analysis showed that it was possible to differentiate between Singapore-acquired P. knowlesi infections and P. knowlesi infections from Peninsular Malaysia and Sarawak, Borneo, Malaysia. The ability to differentiate between locally acquired P. knowlesi infections and imported P. knowlesi infections has important utility for the monitoring of P. knowlesi malaria control programmes in Singapore. Copyright: © Singapore Medical Association

  11. Bayesian models for comparative analysis integrating phylogenetic uncertainty

    Directory of Open Access Journals (Sweden)

    Villemereuil Pierre de

    2012-06-01

    Full Text Available Abstract Background Uncertainty in comparative analyses can come from at least two sources: a phylogenetic uncertainty in the tree topology or branch lengths, and b uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow and inflated significance in hypothesis testing (e.g. p-values will be too small. Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. Methods We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. Results We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Conclusions Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. We show that BUGS is a useful, flexible

  12. Bayesian models for comparative analysis integrating phylogenetic uncertainty

    Science.gov (United States)

    2012-01-01

    Background Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. Methods We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. Results We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Conclusions Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. We show that BUGS is a useful, flexible general purpose tool for

  13. Phylogenetic analysis of Tomato spotted wilt virus (TSWV) NSs protein demonstrates the isolated emergence of resistance-breaking strains in pepper.

    Science.gov (United States)

    Almási, Asztéria; Csilléry, Gábor; Csömör, Zsófia; Nemes, Katalin; Palkovics, László; Salánki, Katalin; Tóbiás, István

    2015-02-01

    Resurgence of Tomato spotted wilt virus (TSWV) worldwide as well as in Hungary causing heavy economic losses directed the attention to the factors contributing to the outbreak of this serious epidemics. The introgression of Tsw resistance gene into various pepper cultivars seemed to solve TSWV control, but widely used resistant pepper cultivars bearing the same, unique resistance locus evoked the rapid emergence of resistance-breaking (RB) TSWV strains. In Hungary, the sporadic appearance of RB strains in pepper-producing region was first observed in 2010-2011, but in 2012 it was detected frequently. Previously, the non-structural protein (NSs) encoded by small RNA (S RNA) of TSWV was verified as the avirulence factor for Tsw resistance, therefore we analyzed the S RNA of the Hungarian RB and wild type (WT) isolates and compared to previously analyzed TSWV strains with RB properties from different geographical origins. Phylogenetic analysis demonstrated that the different RB strains had the closest relationship with the local WT isolates and there is no conserved mutation present in all the NSs genes of RB isolates from different geographical origins. According to these results, we concluded that the RB isolates evolved separately in geographic point of view, and also according to the RB mechanism.

  14. Sequencing, description and phylogenetic analysis of the mitochondrial genome of Sarcocheilichthys sinensis sinensis (Cypriniformes: Cyprinidae).

    Science.gov (United States)

    Li, Chen; He, Liping; Chen, Chong; Cai, Lingchao; Chen, Pingping; Yang, Shoubao

    2016-01-01

    Sarcocheilichthys sinensis sinensis (Bleeker, 1871), is a small benthopelagic freshwater species with high nutritional and ornamental value. In this study, the complete mitochondrial genome of S. sinensis sinensis was determined; the phylogenetic analysis with another individual and closely related species of Sarcocheilichthys fishes was carried out. The complete mitogenome of S. sinensis sinensis was 16683 bp in length, consist of 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and 2 non-coding regions: (D-loop and OL). It indicated that D-loop, ND2, and CytB may be appropriate molecular markers for studying population genetics and conservation biology of Sarcocheilichthys fishes.

  15. Phylogenetic analysis of anemone fishes of the Persian Gulf using ...

    African Journals Online (AJOL)

    STORAGESEVER

    2008-06-17

    Jun 17, 2008 ... genetic diversity among samples was investigated by phylogenetic analysis. Results show that there is ... more about the living organisms found in this region. Many marine ... Kish (modified from Pous et al., 2004). Table 2.

  16. Molecular Phylogenetics: Concepts for a Newcomer.

    Science.gov (United States)

    Ajawatanawong, Pravech

    Molecular phylogenetics is the study of evolutionary relationships among organisms using molecular sequence data. The aim of this review is to introduce the important terminology and general concepts of tree reconstruction to biologists who lack a strong background in the field of molecular evolution. Some modern phylogenetic programs are easy to use because of their user-friendly interfaces, but understanding the phylogenetic algorithms and substitution models, which are based on advanced statistics, is still important for the analysis and interpretation without a guide. Briefly, there are five general steps in carrying out a phylogenetic analysis: (1) sequence data preparation, (2) sequence alignment, (3) choosing a phylogenetic reconstruction method, (4) identification of the best tree, and (5) evaluating the tree. Concepts in this review enable biologists to grasp the basic ideas behind phylogenetic analysis and also help provide a sound basis for discussions with expert phylogeneticists.

  17. Sequence analysis of the L protein of the Ebola 2014 outbreak: Insight into conserved regions and mutations.

    Science.gov (United States)

    Ayub, Gohar; Waheed, Yasir

    2016-06-01

    The 2014 Ebola outbreak was one of the largest that have occurred; it started in Guinea and spread to Nigeria, Liberia and Sierra Leone. Phylogenetic analysis of the current virus species indicated that this outbreak is the result of a divergent lineage of the Zaire ebolavirus. The L protein of Ebola virus (EBOV) is the catalytic subunit of the RNA‑dependent RNA polymerase complex, which, with VP35, is key for the replication and transcription of viral RNA. Earlier sequence analysis demonstrated that the L protein of all non‑segmented negative‑sense (NNS) RNA viruses consists of six domains containing conserved functional motifs. The aim of the present study was to analyze the presence of these motifs in 2014 EBOV isolates, highlight their function and how they may contribute to the overall pathogenicity of the isolates. For this purpose, 81 2014 EBOV L protein sequences were aligned with 475 other NNS RNA viruses, including Paramyxoviridae and Rhabdoviridae viruses. Phylogenetic analysis of all EBOV outbreak L protein sequences was also performed. Analysis of the amino acid substitutions in the 2014 EBOV outbreak was conducted using sequence analysis. The alignment demonstrated the presence of previously conserved motifs in the 2014 EBOV isolates and novel residues. Notably, all the mutations identified in the 2014 EBOV isolates were tolerant, they were pathogenic with certain examples occurring within previously determined functional conserved motifs, possibly altering viral pathogenicity, replication and virulence. The phylogenetic analysis demonstrated that all sequences with the exception of the 2014 EBOV sequences were clustered together. The 2014 EBOV outbreak has acquired a great number of mutations, which may explain the reasons behind this unprecedented outbreak. Certain residues critical to the function of the polymerase remain conserved and may be targets for the development of antiviral therapeutic agents.

  18. Phylogenetic analysis of members of the Phycodnaviridae virus family, using amplified fragments of the major capsid protein gene.

    Science.gov (United States)

    Larsen, J B; Larsen, A; Bratbak, G; Sandaa, R-A

    2008-05-01

    Algal viruses are considered ecologically important by affecting host population dynamics and nutrient flow in aquatic food webs. Members of the family Phycodnaviridae are also interesting due to their extraordinary genome size. Few algal viruses in the Phycodnaviridae family have been sequenced, and those that have been have few genes in common and low gene homology. It has hence been difficult to design general PCR primers that allow further studies of their ecology and diversity. In this study, we screened the nine type I core genes of the nucleocytoplasmic large DNA viruses for sequences suitable for designing a general set of primers. Sequence comparison between members of the Phycodnaviridae family, including three partly sequenced viruses infecting the prymnesiophyte Pyramimonas orientalis and the haptophytes Phaeocystis pouchetii and Chrysochromulina ericina (Pyramimonas orientalis virus 01B [PoV-01B], Phaeocystis pouchetii virus 01 [PpV-01], and Chrysochromulina ericina virus 01B [CeV-01B], respectively), revealed eight conserved regions in the major capsid protein (MCP). Two of these regions also showed conservation at the nucleotide level, and this allowed us to design degenerate PCR primers. The primers produced 347- to 518-bp amplicons when applied to lysates from algal viruses kept in culture and from natural viral communities. The aim of this work was to use the MCP as a proxy to infer phylogenetic relationships and genetic diversity among members of the Phycodnaviridae family and to determine the occurrence and diversity of this gene in natural viral communities. The results support the current legitimate genera in the Phycodnaviridae based on alga host species. However, while placing the mimivirus in close proximity to the type species, PBCV-1, of Phycodnaviridae along with the three new viruses assigned to the family (PoV-01B, PpV-01, and CeV-01B), the results also indicate that the coccolithoviruses and phaeoviruses are more diverged from this

  19. Isolation and phylogenetic analysis of canine distemper virus among domestic dogs in Vietnam.

    Science.gov (United States)

    Nguyen, Dung Van; Suzuki, Junko; Minami, Shohei; Yonemitsu, Kenzo; Nagata, Nao; Kuwata, Ryusei; Shimoda, Hiroshi; Vu, Chien Kim; Truong, Thuy Quoc; Maeda, Ken

    2017-01-20

    Canine distemper virus (CDV) is one of the most serious pathogens found in many species of carnivores, including domestic dogs. In this study, hemagglutinin (H) genes were detected in five domestic Vietnamese dogs with diarrhea, and two CDVs were successfully isolated from dogs positive for H genes. The complete genome of one isolate, CDV/dog/HCM/33/140816, was determined. Phylogenetic analysis showed that all Vietnamese CDVs belonged to the Asia-1 genotype. In addition, the H proteins of Vietnamese CDV strains were the most homologous to those of Chinese CDVs (98.4% to 99.3% identity). These results indicated that the Asia-1 genotype of CDV was the predominant genotype circulating among the domestic dog population in Vietnam and that transboundary transmission of CDV has occurred between Vietnam and China.

  20. Sequencing and phylogenetic analysis of tobacco virus 2, a polerovirus from Nicotiana tabacum.

    Science.gov (United States)

    Zhou, Benguo; Wang, Fang; Zhang, Xuesong; Zhang, Lina; Lin, Huafeng

    2017-07-01

    The complete genome sequence of a new virus, provisionally named tobacco virus 2 (TV2), was determined and identified from leaves of tobacco (Nicotiana tabacum) exhibiting leaf mosaic, yellowing, and deformity, in Anhui Province, China. The genome sequence of TV2 comprises 5,979 nucleotides, with 87% nucleotide sequence identity to potato leafroll virus (PLRV). Its genome organization is similar to that of PLRV, containing six open reading frames (ORFs) that potentially encode proteins with putative functions in cell-to-cell movement and suppression of RNA silencing. Phylogenetic analysis of the nucleotide sequence placed TV2 alongside members of the genus Polerovirus in the family Luteoviridae. To the best our knowledge, this study is the first report of a complete genome sequence of a new polerovirus identified in tobacco.

  1. Genome-wide comparative analysis of phylogenetic trees: the prokaryotic forest of life.

    Science.gov (United States)

    Puigbò, Pere; Wolf, Yuri I; Koonin, Eugene V

    2012-01-01

    Genome-wide comparison of phylogenetic trees is becoming an increasingly common approach in evolutionary genomics, and a variety of approaches for such comparison have been developed. In this article, we present several methods for comparative analysis of large numbers of phylogenetic trees. To compare phylogenetic trees taking into account the bootstrap support for each internal branch, the Boot-Split Distance (BSD) method is introduced as an extension of the previously developed Split Distance method for tree comparison. The BSD method implements the straightforward idea that comparison of phylogenetic trees can be made more robust by treating tree splits differentially depending on the bootstrap support. Approaches are also introduced for detecting tree-like and net-like evolutionary trends in the phylogenetic Forest of Life (FOL), i.e., the entirety of the phylogenetic trees for conserved genes of prokaryotes. The principal method employed for this purpose includes mapping quartets of species onto trees to calculate the support of each quartet topology and so to quantify the tree and net contributions to the distances between species. We describe the application of these methods to analyze the FOL and the results obtained with these methods. These results support the concept of the Tree of Life (TOL) as a central evolutionary trend in the FOL as opposed to the traditional view of the TOL as a "species tree."

  2. Complete Mitochondrial Genome of the Red Fox (Vuples vuples) and Phylogenetic Analysis with Other Canid Species.

    Science.gov (United States)

    Zhong, Hua-Ming; Zhang, Hong-Hai; Sha, Wei-Lai; Zhang, Cheng-De; Chen, Yu-Cai

    2010-04-01

    The whole mitochondrial genome sequence of red fox (Vuples vuples) was determined. It had a total length of 16 723 bp. As in most mammal mitochondrial genome, it contained 13 protein coding genes, two ribosome RNA genes, 22 transfer RNA genes and one control region. The base composition was 31.3% A, 26.1% C, 14.8% G and 27.8% T, respectively. The codon usage of red fox, arctic fox, gray wolf, domestic dog and coyote followed the same pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 3 gene in the red fox. A long tandem repeat rich in AC was found between conserved sequence block 1 and 2 in the control region. In order to confirm the phylogenetic relationships of red fox to other canids, phylogenetic trees were reconstructed by neighbor-joining and maximum parsimony methods using 12 concatenated heavy-strand protein-coding genes. The result indicated that arctic fox was the sister group of red fox and they both belong to the red fox-like clade in family Canidae, while gray wolf, domestic dog and coyote belong to wolf-like clade. The result was in accordance with existing phylogenetic results.

  3. Phylogenetic analysis of the genus Hordeum using repetitive DNA sequences

    DEFF Research Database (Denmark)

    Svitashev, S.; Bryngelsson, T.; Vershinin, A.

    1994-01-01

    A set of six cloned barley (Hordeum vulgare) repetitive DNA sequences was used for the analysis of phylogenetic relationships among 31 species (46 taxa) of the genus Hordeum, using molecular hybridization techniques. In situ hybridization experiments showed dispersed organization of the sequences...

  4. In Silico Phylogenetic Analysis and Molecular Modelling Study of 2-Haloalkanoic Acid Dehalogenase Enzymes from Bacterial and Fungal Origin

    Directory of Open Access Journals (Sweden)

    Raghunath Satpathy

    2016-01-01

    Full Text Available 2-Haloalkanoic acid dehalogenase enzymes have broad range of applications, starting from bioremediation to chemical synthesis of useful compounds that are widely distributed in fungi and bacteria. In the present study, a total of 81 full-length protein sequences of 2-haloalkanoic acid dehalogenase from bacteria and fungi were retrieved from NCBI database. Sequence analysis such as multiple sequence alignment (MSA, conserved motif identification, computation of amino acid composition, and phylogenetic tree construction were performed on these primary sequences. From MSA analysis, it was observed that the sequences share conserved lysine (K and aspartate (D residues in them. Also, phylogenetic tree indicated a subcluster comprised of both fungal and bacterial species. Due to nonavailability of experimental 3D structure for fungal 2-haloalkanoic acid dehalogenase in the PDB, molecular modelling study was performed for both fungal and bacterial sources of enzymes present in the subcluster. Further structural analysis revealed a common evolutionary topology shared between both fungal and bacterial enzymes. Studies on the buried amino acids showed highly conserved Leu and Ser in the core, despite variation in their amino acid percentage. Additionally, a surface exposed tryptophan was conserved in all of these selected models.

  5. Phylogenetic analysis of the expansion of the MATH-BTB gene family in the grasses.

    Science.gov (United States)

    Juranić, Martina; Dresselhaus, Thomas

    2014-01-01

    MATH-BTB proteins are known to act as substrate-specific adaptors of cullin3 (CUL3)-based ubiquitin E3 ligases to target protein for ubiquitination. In a previous study we reported the presence of 31 MATH-BTB genes in the maize genome and determined the regulatory role of the MATH-BTB protein MAB1 during meiosis to mitosis transition. In contrast to maize, there are only 6 homologous genes in the model plant Arabidopsis, while this family has largely expanded in grasses. Here, we report a phylogenetic analysis of the MATH-BTB gene family in 9 land plant species including various mosses, eudicots, and grasses. We extend a previous classification of the plant MATH-BTB family and additionally arrange the expanded group into 5 grass-specific clades. Synteny studies indicate that expansion occurred to a large extent due to local gene duplications. Expression studies of 3 closely related MATH-BTB genes in maize (MAB1-3) indicate highly specific expression pattern. In summary, this work provides a solid base for further studies comparing genetic and functional information of the MATH-BTB family especially in the grasses.

  6. Solution structure and phylogenetics of Prod1, a member of the three-finger protein superfamily implicated in salamander limb regeneration.

    Directory of Open Access Journals (Sweden)

    Acely Garza-Garcia

    Full Text Available BACKGROUND: Following the amputation of a limb, newts and salamanders have the capability to regenerate the lost tissues via a complex process that takes place at the site of injury. Initially these cells undergo dedifferentiation to a state competent to regenerate the missing limb structures. Crucially, dedifferentiated cells have memory of their level of origin along the proximodistal (PD axis of the limb, a property known as positional identity. Notophthalmus viridescens Prod1 is a cell-surface molecule of the three-finger protein (TFP superfamily involved in the specification of newt limb PD identity. The TFP superfamily is a highly diverse group of metazoan proteins that includes snake venom toxins, mammalian transmembrane receptors and miscellaneous signaling molecules. METHODOLOGY/PRINCIPAL FINDINGS: With the aim of identifying potential orthologs of Prod1, we have solved its 3D structure and compared it to other known TFPs using phylogenetic techniques. The analysis shows that TFP 3D structures group in different categories according to function. Prod1 clusters with other cell surface protein TFP domains including the complement regulator CD59 and the C-terminal domain of urokinase-type plasminogen activator. To infer orthology, a structure-based multiple sequence alignment of representative TFP family members was built and analyzed by phylogenetic methods. Prod1 has been proposed to be the salamander CD59 but our analysis fails to support this association. Prod1 is not a good match for any of the TFP families present in mammals and this result was further supported by the identification of the putative orthologs of both CD59 and N. viridescens Prod1 in sequence data for the salamander Ambystoma tigrinum. CONCLUSIONS/SIGNIFICANCE: The available data suggest that Prod1, and thereby its role in encoding PD identity, is restricted to salamanders. The lack of comparable limb-regenerative capability in other adult vertebrates could be

  7. Phylogenetic analysis of Newcastle disease viruses isolated from commercial poultry in Mozambique, 2011 to 2016

    International Nuclear Information System (INIS)

    Mapaco, L.P.; Monjane, I.V.A.; Nhamusso, A.E.; Viljoen, G.J; Dundon, W.G.; Achá, S.J.

    2016-01-01

    Full text: The complete sequence of the fusion (F) protein gene from eleven Newcastle disease viruses (NDV) isolated from commercial poultry in Mozambique between 2011 and 2016 has been generated. The F gene cleavage site motif for all eleven isolates was 112RRRKRF117 indicating that the viruses are virulent. A phylogenetic analysis using the full F gene sequence revealed that the viruses clustered within genotype VIIh and showed a higher similarity to NDVs from South Africa, China and Southeast Asia than to viruses previously described in Mozambique in 1994 to 1995 and 2005. The characterization of these new NDVs has important implications for Newcastle disease management and control in Mozambique. (author)

  8. Phylo_dCor: distance correlation as a novel metric for phylogenetic profiling.

    Science.gov (United States)

    Sferra, Gabriella; Fratini, Federica; Ponzi, Marta; Pizzi, Elisabetta

    2017-09-05

    Elaboration of powerful methods to predict functional and/or physical protein-protein interactions from genome sequence is one of the main tasks in the post-genomic era. Phylogenetic profiling allows the prediction of protein-protein interactions at a whole genome level in both Prokaryotes and Eukaryotes. For this reason it is considered one of the most promising methods. Here, we propose an improvement of phylogenetic profiling that enables handling of large genomic datasets and infer global protein-protein interactions. This method uses the distance correlation as a new measure of phylogenetic profile similarity. We constructed robust reference sets and developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation that makes it applicable to large genomic data. Using Saccharomyces cerevisiae and Escherichia coli genome datasets, we showed that Phylo-dCor outperforms phylogenetic profiling methods previously described based on the mutual information and Pearson's correlation as measures of profile similarity. In this work, we constructed and assessed robust reference sets and propose the distance correlation as a measure for comparing phylogenetic profiles. To make it applicable to large genomic data, we developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation. Two R scripts that can be run on a wide range of machines are available upon request.

  9. Construction of phylogenetic trees by kernel-based comparative analysis of metabolic networks.

    Science.gov (United States)

    Oh, S June; Joung, Je-Gun; Chang, Jeong-Ho; Zhang, Byoung-Tak

    2006-06-06

    To infer the tree of life requires knowledge of the common characteristics of each species descended from a common ancestor as the measuring criteria and a method to calculate the distance between the resulting values of each measure. Conventional phylogenetic analysis based on genomic sequences provides information about the genetic relationships between different organisms. In contrast, comparative analysis of metabolic pathways in different organisms can yield insights into their functional relationships under different physiological conditions. However, evaluating the similarities or differences between metabolic networks is a computationally challenging problem, and systematic methods of doing this are desirable. Here we introduce a graph-kernel method for computing the similarity between metabolic networks in polynomial time, and use it to profile metabolic pathways and to construct phylogenetic trees. To compare the structures of metabolic networks in organisms, we adopted the exponential graph kernel, which is a kernel-based approach with a labeled graph that includes a label matrix and an adjacency matrix. To construct the phylogenetic trees, we used an unweighted pair-group method with arithmetic mean, i.e., a hierarchical clustering algorithm. We applied the kernel-based network profiling method in a comparative analysis of nine carbohydrate metabolic networks from 81 biological species encompassing Archaea, Eukaryota, and Eubacteria. The resulting phylogenetic hierarchies generally support the tripartite scheme of three domains rather than the two domains of prokaryotes and eukaryotes. By combining the kernel machines with metabolic information, the method infers the context of biosphere development that covers physiological events required for adaptation by genetic reconstruction. The results show that one may obtain a global view of the tree of life by comparing the metabolic pathway structures using meta-level information rather than sequence

  10. Construction of phylogenetic trees by kernel-based comparative analysis of metabolic networks

    Directory of Open Access Journals (Sweden)

    Chang Jeong-Ho

    2006-06-01

    Full Text Available Abstract Background To infer the tree of life requires knowledge of the common characteristics of each species descended from a common ancestor as the measuring criteria and a method to calculate the distance between the resulting values of each measure. Conventional phylogenetic analysis based on genomic sequences provides information about the genetic relationships between different organisms. In contrast, comparative analysis of metabolic pathways in different organisms can yield insights into their functional relationships under different physiological conditions. However, evaluating the similarities or differences between metabolic networks is a computationally challenging problem, and systematic methods of doing this are desirable. Here we introduce a graph-kernel method for computing the similarity between metabolic networks in polynomial time, and use it to profile metabolic pathways and to construct phylogenetic trees. Results To compare the structures of metabolic networks in organisms, we adopted the exponential graph kernel, which is a kernel-based approach with a labeled graph that includes a label matrix and an adjacency matrix. To construct the phylogenetic trees, we used an unweighted pair-group method with arithmetic mean, i.e., a hierarchical clustering algorithm. We applied the kernel-based network profiling method in a comparative analysis of nine carbohydrate metabolic networks from 81 biological species encompassing Archaea, Eukaryota, and Eubacteria. The resulting phylogenetic hierarchies generally support the tripartite scheme of three domains rather than the two domains of prokaryotes and eukaryotes. Conclusion By combining the kernel machines with metabolic information, the method infers the context of biosphere development that covers physiological events required for adaptation by genetic reconstruction. The results show that one may obtain a global view of the tree of life by comparing the metabolic pathway

  11. Phylogenetic comparative methods on phylogenetic networks with reticulations.

    Science.gov (United States)

    Bastide, Paul; Solís-Lemus, Claudia; Kriebel, Ricardo; Sparks, K William; Ané, Cécile

    2018-04-25

    The goal of Phylogenetic Comparative Methods (PCMs) is to study the distribution of quantitative traits among related species. The observed traits are often seen as the result of a Brownian Motion (BM) along the branches of a phylogenetic tree. Reticulation events such as hybridization, gene flow or horizontal gene transfer, can substantially affect a species' traits, but are not modeled by a tree. Phylogenetic networks have been designed to represent reticulate evolution. As they become available for downstream analyses, new models of trait evolution are needed, applicable to networks. One natural extension of the BM is to use a weighted average model for the trait of a hybrid, at a reticulation point. We develop here an efficient recursive algorithm to compute the phylogenetic variance matrix of a trait on a network, in only one preorder traversal of the network. We then extend the standard PCM tools to this new framework, including phylogenetic regression with covariates (or phylogenetic ANOVA), ancestral trait reconstruction, and Pagel's λ test of phylogenetic signal. The trait of a hybrid is sometimes outside of the range of its two parents, for instance because of hybrid vigor or hybrid depression. These two phenomena are rather commonly observed in present-day hybrids. Transgressive evolution can be modeled as a shift in the trait value following a reticulation point. We develop a general framework to handle such shifts, and take advantage of the phylogenetic regression view of the problem to design statistical tests for ancestral transgressive evolution in the evolutionary history of a group of species. We study the power of these tests in several scenarios, and show that recent events have indeed the strongest impact on the trait distribution of present-day taxa. We apply those methods to a dataset of Xiphophorus fishes, to confirm and complete previous analysis in this group. All the methods developed here are available in the Julia package PhyloNetworks.

  12. Phylogenetic analysis of the light-harvesting system in Chromera velia.

    Science.gov (United States)

    Pan, Hao; Slapeta, Jan; Carter, Dee; Chen, Min

    2012-03-01

    Chromera velia is a newly discovered photosynthetic eukaryotic alga that has functional chloroplasts closely related to the apicoplast of apicomplexan parasites. Recently, the chloroplast in C. velia was shown to be derived from the red algal lineage. Light-harvesting protein complexes (LHC), which are a group of proteins involved in photon capture and energy transfer in photosynthesis, are important for photosynthesis efficiency, photo-adaptation/accumulation and photo-protection. Although these proteins are encoded by genes located in the nucleus, LHC peptides migrate and function in the chloroplast, hence the LHC may have a different evolutionary history compared to chloroplast evolution. Here, we compare the phylogenetic relationship of the C. velia LHCs to LHCs from other photosynthetic organisms. Twenty-three LHC homologues retrieved from C. velia EST sequences were aligned according to their conserved regions. The C. velia LHCs are positioned in four separate groups on trees constructed by neighbour-joining, maximum likelihood and Bayesian methods. A major group of seventeen LHCs from C. velia formed a separate cluster that was closest to dinoflagellate LHC, and to LHC and fucoxanthin chlorophyll-binding proteins from diatoms. One C. velia LHC sequence grouped with LI1818/LI818-like proteins, which were recently identified as environmental stress-induced protein complexes. Only three LHC homologues from C. velia grouped with the LHCs from red algae.

  13. Genetic and phylogenetic analysis of ten Gobiidae species in China ...

    African Journals Online (AJOL)

    To study the genetic and phylogenetic relationship of gobioid fishes in China, the representatives of 10 gobioid fishes from 2 subfamilies in China were examined by amplified fragment length polymorphism (AFLP) analysis. We established 220 AFLP bands for 45 individuals from the 10 species, and the percentage of ...

  14. Analysis of plasmid genes by phylogenetic profiling and visualization of homology relationships using Blast2Network

    Directory of Open Access Journals (Sweden)

    Bazzicalupo Marco

    2008-12-01

    Full Text Available Abstract Background Phylogenetic methods are well-established bioinformatic tools for sequence analysis, allowing to describe the non-independencies of sequences because of their common ancestor. However, the evolutionary profiles of bacterial genes are often complicated by hidden paralogy and extensive and/or (multiple horizontal gene transfer (HGT events which make bifurcating trees often inappropriate. In this context, plasmid sequences are paradigms of network-like relationships characterizing the evolution of prokaryotes. Actually, they can be transferred among different organisms allowing the dissemination of novel functions, thus playing a pivotal role in prokaryotic evolution. However, the study of their evolutionary dynamics is complicated by the absence of universally shared genes, a prerequisite for phylogenetic analyses. Results To overcome such limitations we developed a bioinformatic package, named Blast2Network (B2N, allowing the automatic phylogenetic profiling and the visualization of homology relationships in a large number of plasmid sequences. The software was applied to the study of 47 completely sequenced plasmids coming from Escherichia, Salmonella and Shigella spps. Conclusion The tools implemented by B2N allow to describe and visualize in a new way some of the evolutionary features of plasmid molecules of Enterobacteriaceae; in particular it helped to shed some light on the complex history of Escherichia, Salmonella and Shigella plasmids and to focus on possible roles of unannotated proteins. The proposed methodology is general enough to be used for comparative genomic analyses of bacteria.

  15. Genome-wide identification, characterization and phylogenetic analysis of 50 catfish ATP-binding cassette (ABC) transporter genes.

    Science.gov (United States)

    Liu, Shikai; Li, Qi; Liu, Zhanjiang

    2013-01-01

    Although a large set of full-length transcripts was recently assembled in catfish, annotation of large gene families, especially those with duplications, is still a great challenge. Most often, complexities in annotation cause mis-identification and thereby much confusion in the scientific literature. As such, detailed phylogenetic analysis and/or orthology analysis are required for annotation of genes involved in gene families. The ATP-binding cassette (ABC) transporter gene superfamily is a large gene family that encodes membrane proteins that transport a diverse set of substrates across membranes, playing important roles in protecting organisms from diverse environment. In this work, we identified a set of 50 ABC transporters in catfish genome. Phylogenetic analysis allowed their identification and annotation into seven subfamilies, including 9 ABCA genes, 12 ABCB genes, 12 ABCC genes, 5 ABCD genes, 2 ABCE genes, 4 ABCF genes and 6 ABCG genes. Most ABC transporters are conserved among vertebrates, though cases of recent gene duplications and gene losses do exist. Gene duplications in catfish were found for ABCA1, ABCB3, ABCB6, ABCC5, ABCD3, ABCE1, ABCF2 and ABCG2. The whole set of catfish ABC transporters provide the essential genomic resources for future biochemical, toxicological and physiological studies of ABC drug efflux transporters. The establishment of orthologies should allow functional inferences with the information from model species, though the function of lineage-specific genes can be distinct because of specific living environment with different selection pressure.

  16. Protein-based stable isotope probing.

    Science.gov (United States)

    Jehmlich, Nico; Schmidt, Frank; Taubert, Martin; Seifert, Jana; Bastida, Felipe; von Bergen, Martin; Richnow, Hans-Hermann; Vogt, Carsten

    2010-12-01

    We describe a stable isotope probing (SIP) technique that was developed to link microbe-specific metabolic function to phylogenetic information. Carbon ((13)C)- or nitrogen ((15)N)-labeled substrates (typically with >98% heavy label) were used in cultivation experiments and the heavy isotope incorporation into proteins (protein-SIP) on growth was determined. The amount of incorporation provides a measure for assimilation of a substrate, and the sequence information from peptide analysis obtained by mass spectrometry delivers phylogenetic information about the microorganisms responsible for the metabolism of the particular substrate. In this article, we provide guidelines for incubating microbial cultures with labeled substrates and a protocol for protein-SIP. The protocol guides readers through the proteomics pipeline, including protein extraction, gel-free and gel-based protein separation, the subsequent mass spectrometric analysis of peptides and the calculation of the incorporation of stable isotopes into peptides. Extraction of proteins and the mass fingerprint measurements of unlabeled and labeled fractions can be performed in 2-3 d.

  17. Genetic diversity and phylogenetic analysis of Tams1 of Theileria annulata isolates from three continents between 2000 and 2012.

    Science.gov (United States)

    Wang, Jiay; Yang, Xianyong; Wang, Yuge; Jing, Zhihong; Meng, Kai; Liu, Jianzhu; Guo, Huijun; Xu, Ruixue; Cheng, Ziqiang

    2014-01-01

    Theileria annulata, which is part of the Theileria sergenti/Theileria buffeli/Theileria orientalis group, preferentially infects cattle and results in high mortality and morbidity in the Mediterranean, Middle East, and Central Asia. The polypeptide Tams1 is an immunodominant major merozoite piroplasm surface antigen of T. annulata that could be used as a marker for epidemiological studies and phylogenetic analysis. In the present study, a total of 155 Tams1 sequences were investigated for genetic diversity and phylogenetic relationships through phylogenetic analysis. Results showed that the Tams1 sequences were divided into two major groups and that distribution for some isolates also exhibited geographic specificity. As targeting polymorphic genes for parasite detection may result in underestimation of infection, polymerase chain reaction (PCR) assay using two different probes targeting tams-1 genes of these two groups can be more credible. In addition, the direction of the spread of the disease was discovered to be from the Mediterranean or the tropical zone to the Eurasian peninsula, Middle East, Southern Asia, and Africa, particularly for Group 2. A similar occurrence was also found between the Ms1 gene of Theileria lestoquardi and the Tams1 gene of T. annulata, which explains cross-immunogenicity to a certain extent. However, no potential glycosylation site in the Tams1 of T. annulata was found in this study, which illustrated that instead of N-glycosylation, other modifications have more significant effects on the immunogenicity of the Tams1 protein.

  18. Phylogenetic analysis of HSP70 and cyt b gene sequences for Chinese Leishmania isolates and ultrastructural characteristics of Chinese Leishmania sp.

    Science.gov (United States)

    Yuan, Dongmei; Qin, Hanxiao; Zhang, Jianguo; Liao, Lin; Chen, Qiwei; Chen, Dali; Chen, Jianping

    2017-02-01

    Leishmaniasis is a worldwide epidemic disease caused by the genus Leishmania, which is still endemic in the west and northwest areas of China. Some viewpoints of the traditional taxonomy of Chinese Leishmania have been challenged by recent phylogenetic researches based on different molecular markers. However, the taxonomic positions and phylogenetic relationships of Chinese Leishmania isolates remain controversial, which need for more data and further analysis. In this study, the heat shock protein 70 (HSP70) gene and cytochrome b (cyt b) gene were used for phylogenetic analysis of Chinese Leishmania isolates from patients, dogs, gerbils, and sand flies in different geographic origins. Besides, for the interesting Leishmania sp. in China, the ultrastructure of three Chinese Leishmania sp. strains (MHOM/CN/90/SC10H2, SD, GL) were observed by transmission electron microscopy. Bayesian trees from HSP70 and cyt b congruently indicated that the 14 Chinese Leishmania isolates belong to three Leishmania species including L. donovani complex, L. gerbilli, and L. (Sauroleishmania) sp. Their identity further confirmed that the undescribed Leishmania species causing visceral Leishmaniasis (VL) in China is closely related to L. tarentolae. The phylogenetic results from HSP70 also suggested the classification of subspecies within L. donovani complex: KXG-918, KXG-927, KXG-Liu, KXG-Xu, 9044, SC6, and KXG-65 belong to L. donovani; Cy, WenChuan, and 801 were proposed to be L. infantum. Through transmission electron microscopy, unexpectedly, the Golgi apparatus were not observed in SC10H2, SD, and GL, which was similar to previous reports of reptilian Leishmania. The statistical analysis of microtubule counts separated SC10H2, SD, and GL as one group from any other reference strain (L. donovani MHOM/IN/80/DD8; L. tropica MHOM/SU/74/K27; L. gerbilli MRHO/CN/60/GERBILLI). The ultrastructural characteristics of Leishmania sp. partly lend support to the phylogenetic inference that

  19. Phylogenetic analysis of West Nile virus, Nuevo Leon State, Mexico.

    Science.gov (United States)

    Blitvich, Bradley J; Fernández-Salas, Ildefonso; Contreras-Cordero, Juan F; Loroño-Pino, María A; Marlenee, Nicole L; Díaz, Francisco J; González-Rojas, José I; Obregón-Martínez, Nelson; Chiu-García, Jorge A; Black, William C; Beaty, Barry J

    2004-07-01

    West Nile virus RNA was detected in brain tissue from a horse that died in June 2003 in Nuevo Leon State, Mexico. Nucleotide sequencing and phylogenetic analysis of the premembrane and envelope genes showed that the virus was most closely related to West Nile virus isolates collected in Texas in 2002.

  20. Consequences of recombination on traditional phylogenetic analysis

    DEFF Research Database (Denmark)

    Schierup, M H; Hein, J

    2000-01-01

    We investigate the shape of a phylogenetic tree reconstructed from sequences evolving under the coalescent with recombination. The motivation is that evolutionary inferences are often made from phylogenetic trees reconstructed from population data even though recombination may well occur (mt......DNA or viral sequences) or does occur (nuclear sequences). We investigate the size and direction of biases when a single tree is reconstructed ignoring recombination. Standard software (PHYLIP) was used to construct the best phylogenetic tree from sequences simulated under the coalescent with recombination....... With recombination present, the length of terminal branches and the total branch length are larger, and the time to the most recent common ancestor smaller, than for a tree reconstructed from sequences evolving with no recombination. The effects are pronounced even for small levels of recombination that may...

  1. Molecular, phylogenetic and comparative genomic analysis of the cytokinin oxidase/dehydrogenase gene family in the Poaceae.

    Science.gov (United States)

    Mameaux, Sabine; Cockram, James; Thiel, Thomas; Steuernagel, Burkhard; Stein, Nils; Taudien, Stefan; Jack, Peter; Werner, Peter; Gray, John C; Greenland, Andy J; Powell, Wayne

    2012-01-01

    The genomes of cereals such as wheat (Triticum aestivum) and barley (Hordeum vulgare) are large and therefore problematic for the map-based cloning of agronomicaly important traits. However, comparative approaches within the Poaceae permit transfer of molecular knowledge between species, despite their divergence from a common ancestor sixty million years ago. The finding that null variants of the rice gene cytokinin oxidase/dehydrogenase 2 (OsCKX2) result in large yield increases provides an opportunity to explore whether similar gains could be achieved in other Poaceae members. Here, phylogenetic, molecular and comparative analyses of CKX families in the sequenced grass species rice, brachypodium, sorghum, maize and foxtail millet, as well as members identified from the transcriptomes/genomes of wheat and barley, are presented. Phylogenetic analyses define four Poaceae CKX clades. Comparative analyses showed that CKX phylogenetic groupings can largely be explained by a combination of local gene duplication, and the whole-genome duplication event that predates their speciation. Full-length OsCKX2 homologues in barley (HvCKX2.1, HvCKX2.2) and wheat (TaCKX2.3, TaCKX2.4, TaCKX2.5) are characterized, with comparative analysis at the DNA, protein and genetic/physical map levels suggesting that true CKX2 orthologs have been identified. Furthermore, our analysis shows CKX2 genes in barley and wheat have undergone a Triticeae-specific gene-duplication event. Finally, by identifying ten of the eleven CKX genes predicted to be present in barley by comparative analyses, we show that next-generation sequencing approaches can efficiently determine the gene space of large-genome crops. Together, this work provides the foundation for future functional investigation of CKX family members within the Poaceae. © 2011 National Institute of Agricultural Botany (NIAB). Plant Biotechnology Journal © 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell

  2. Genome-Wide Identification and Analysis of Genes Encoding PHD-Finger Protein in Tomato

    International Nuclear Information System (INIS)

    Hayat, S.; Cheng, Z.; Chen, X.

    2016-01-01

    The PHD-finger proteins are conserved in eukaryotic organisms and are involved in a variety of important functions in different biological processes in plants. However, the function of PHD fingers are poorly known in tomato (Solanum lycopersicum L.). In current study, we identified 45 putative genes coding Phd finger protein in tomato distributed on 11 chromosomes except for chromosome 8. Some of the genes encode other conserved key domains besides Phd-finger. Phylogenetic analysis of these 45 proteins resulted in seven clusters. Most Phd finger proteins were predicted to PML body location. These PHD-finger genes displayed differential expression either in various organs, at different development stages and under stresses in tomato. Our study provides the first systematic analysis of PHD-finger genes and proteins in tomato. This preliminary study provides a very useful reference information for Phd-finger proteins in tomato. They will be helpful for cloning and functional study of tomato PHD-finger genes. (author)

  3. Phylogenetic analysis of canine distemper virus in domestic dogs in Nanjing, China.

    Science.gov (United States)

    Bi, Zhenwei; Wang, Yongshan; Wang, Xiaoli; Xia, Xingxia

    2015-02-01

    Canine distemper virus (CDV) infects a broad range of carnivores, including wild and domestic Canidae. The hemagglutinin gene, which encodes the attachment protein that determines viral tropism, has been widely used to determine the relationship between CDV strains of different lineages circulating worldwide. We determined the full-length H gene sequences of seven CDV field strains detected in domestic dogs in Nanjing, China. A phylogenetic analysis of the H gene sequences of CDV strains from different geographic regions and vaccine strains was performed. Four of the seven CDV strains were grouped in the same cluster of the Asia-1 lineage to which the vast majority of Chinese CDV strains belong, whereas the other three were clustered within the Asia-4 lineage, which has never been detected in China. This represents the first record of detection of strains of the Asia-4 lineage in China since this lineage was reported in Thailand in 2013.

  4. A Distance Measure for Genome Phylogenetic Analysis

    Science.gov (United States)

    Cao, Minh Duc; Allison, Lloyd; Dix, Trevor

    Phylogenetic analyses of species based on single genes or parts of the genomes are often inconsistent because of factors such as variable rates of evolution and horizontal gene transfer. The availability of more and more sequenced genomes allows phylogeny construction from complete genomes that is less sensitive to such inconsistency. For such long sequences, construction methods like maximum parsimony and maximum likelihood are often not possible due to their intensive computational requirement. Another class of tree construction methods, namely distance-based methods, require a measure of distances between any two genomes. Some measures such as evolutionary edit distance of gene order and gene content are computational expensive or do not perform well when the gene content of the organisms are similar. This study presents an information theoretic measure of genetic distances between genomes based on the biological compression algorithm expert model. We demonstrate that our distance measure can be applied to reconstruct the consensus phylogenetic tree of a number of Plasmodium parasites from their genomes, the statistical bias of which would mislead conventional analysis methods. Our approach is also used to successfully construct a plausible evolutionary tree for the γ-Proteobacteria group whose genomes are known to contain many horizontally transferred genes.

  5. Phylogenetic and structural analysis of centromeric DNA and kinetochore proteins

    OpenAIRE

    Meraldi, Patrick; McAinsh, Andrew D; Rheinbay, Esther; Sorger, Peter K

    2006-01-01

    Background: Kinetochores are large multi-protein structures that assemble on centromeric DNA (CEN DNA) and mediate the binding of chromosomes to microtubules. Comprising 125 base-pairs of CEN DNA and 70 or more protein components, Saccharomyces cerevisiae kinetochores are among the best understood. In contrast, most fungal, plant and animal cells assemble kinetochores on CENs that are longer and more complex, raising the question of whether kinetochore architecture has been conserved through ...

  6. First phylogenetic analysis of Ehrlichia canis in dogs and ticks from Mexico. Preliminary study

    Directory of Open Access Journals (Sweden)

    Carolina G. Sosa-Gutiérrez

    2016-09-01

    Full Text Available Objective. Phylogenetic characterization of Ehrlichia canis in dogs naturally infected and ticks, diagnosed by PCR and sequencing of 16SrRNA gene; compare different isolates found in American countries. Materials and methods. Were collected Blood samples from 139 dogs with suggestive clinical manifestations of this disease and they were infested with ticks; part of 16SrRNA gene was sequenced and aligned, with 17 sequences reported in American countries. Two phylogenetic trees were constructed using the Maximum likelihood method, and Maximum parsimony. Results. They were positive to E. canis 25/139 (18.0% dogs and 29/139 (20.9% ticks. The clinical manifestations presented were fever, fatigue, depression and vomiting. Rhipicephalus sanguineus Dermacentor variabilis and Haemaphysalis leporis-palustris ticks were positive for E. canis. Phylogenetic analysis showed that the sequences of dogs and ticks in Mexico form a third group diverging of sequences from South America and USA. Conclusions. This is the first phylogenetic analysis of E. canis in Mexico. There are differences in the sequences of Mexico with those reported in South America and USA. This research lays the foundation for further study of genetic variability.

  7. New Insights into the Phylogeny and Gene Context Analysis of Binder of Sperm Proteins (BSPs.

    Directory of Open Access Journals (Sweden)

    Edith Serrano

    Full Text Available Seminal plasma (SP proteins support the survival of spermatozoa acting not only at the plasma membrane but also by inhibition of capacitation, resulting in higher fertilizing ability. Among SP proteins, BSP (binder of sperm proteins are the most studied, since they may be useful for the improvement of semen diluents, storage and subsequent fertilization results. However, an updated and detailed phylogenetic analysis of the BSP protein superfamily has not been carried out with all the sequences described in the main databases. The update view shows for the first time an equally distributed number of sequences between the three families: BSP, and their homologs 1 (BSPH1 and 2 (BSPH2. The BSP family is divided in four subfamilies, BSP1 subfamily being the predominant, followed by subfamilies BSP3, BSP5 and BSP2. BSPH proteins were found among placental mammals (Eutheria belonging to the orders Proboscidea, Primates, Lagomorpha, Rodentia, Chiroptera, Perissodactyla and Cetartiodactyla. However, BSPH2 proteins were also found in the Scandentia order and Metatheria clade. This phylogenetic analysis, when combined with a gene context analysis, showed a completely new evolutionary scenario for the BSP superfamily of proteins with three defined different gene patterns, one for BSPs, one for BSPH1/BSPH2/ELSPBP1 and another one for BSPH1/BSPH2 without ELSPBP1. In addition, the study has permitted to define concise conserved blocks for each family (BSP, BSPH1 and BSPH2, which could be used for a more reliable assignment for the incoming sequences, for data curation of current databases, and for cloning new BSPs, as the one described in this paper, ram seminal vesicle 20 kDa protein (RSVP20, Ovis aries BSP5b.

  8. DendroBLAST: approximate phylogenetic trees in the absence of multiple sequence alignments.

    Science.gov (United States)

    Kelly, Steven; Maini, Philip K

    2013-01-01

    The rapidly growing availability of genome information has created considerable demand for both fast and accurate phylogenetic inference algorithms. We present a novel method called DendroBLAST for reconstructing phylogenetic dendrograms/trees from protein sequences using BLAST. This method differs from other methods by incorporating a simple model of sequence evolution to test the effect of introducing sequence changes on the reliability of the bipartitions in the inferred tree. Using realistic simulated sequence data we demonstrate that this method produces phylogenetic trees that are more accurate than other commonly-used distance based methods though not as accurate as maximum likelihood methods from good quality multiple sequence alignments. In addition to tests on simulated data, we use DendroBLAST to generate input trees for a supertree reconstruction of the phylogeny of the Archaea. This independent analysis produces an approximate phylogeny of the Archaea that has both high precision and recall when compared to previously published analysis of the same dataset using conventional methods. Taken together these results demonstrate that approximate phylogenetic trees can be produced in the absence of multiple sequence alignments, and we propose that these trees will provide a platform for improving and informing downstream bioinformatic analysis. A web implementation of the DendroBLAST method is freely available for use at http://www.dendroblast.com/.

  9. DendroBLAST: approximate phylogenetic trees in the absence of multiple sequence alignments.

    Directory of Open Access Journals (Sweden)

    Steven Kelly

    Full Text Available The rapidly growing availability of genome information has created considerable demand for both fast and accurate phylogenetic inference algorithms. We present a novel method called DendroBLAST for reconstructing phylogenetic dendrograms/trees from protein sequences using BLAST. This method differs from other methods by incorporating a simple model of sequence evolution to test the effect of introducing sequence changes on the reliability of the bipartitions in the inferred tree. Using realistic simulated sequence data we demonstrate that this method produces phylogenetic trees that are more accurate than other commonly-used distance based methods though not as accurate as maximum likelihood methods from good quality multiple sequence alignments. In addition to tests on simulated data, we use DendroBLAST to generate input trees for a supertree reconstruction of the phylogeny of the Archaea. This independent analysis produces an approximate phylogeny of the Archaea that has both high precision and recall when compared to previously published analysis of the same dataset using conventional methods. Taken together these results demonstrate that approximate phylogenetic trees can be produced in the absence of multiple sequence alignments, and we propose that these trees will provide a platform for improving and informing downstream bioinformatic analysis. A web implementation of the DendroBLAST method is freely available for use at http://www.dendroblast.com/.

  10. The long and winding road of molecular data in phylogenetic analysis.

    Science.gov (United States)

    Suárez-Díaz, Edna

    2014-01-01

    The use of molecules and reactions as evidence, markers and/or traits for evolutionary processes has a history more than a century long. Molecules have been used in studies of intra-specific variation and studies of similarity among species that do not necessarily result in the analysis of phylogenetic relations. Promoters of the use of molecular data have sustained the need for quantification as the main argument to make use of them. Moreover, quantification has allowed intensive statistical analysis, as a condition and a product of increasing automation. All of these analyses are subject to the methodological anxiety characteristic of a community in search of objectivity (Suárez-Díaz and Anaya-Munoz, Stud Hist Philos Biol Biomed Sci 39:451–458, 2008). It is in this context that scientists compared and evaluated protein and nucleic acid sequence data with other types of molecular data – including immunological, electrophoretic and hybridization data. This paper argues that by looking at longterm historical processes, such as the use of molecular evidence in evolutionary biology, we gain valuable insights into the history of science. In that sense, it accompanies a growing concern among historians for big-pictures of science that incorporate the fruitful historical research on local cases of the last decades.

  11. A phylogenetic transform enhances analysis of compositional microbiota data.

    Science.gov (United States)

    Silverman, Justin D; Washburne, Alex D; Mukherjee, Sayan; David, Lawrence A

    2017-02-15

    Surveys of microbial communities (microbiota), typically measured as relative abundance of species, have illustrated the importance of these communities in human health and disease. Yet, statistical artifacts commonly plague the analysis of relative abundance data. Here, we introduce the PhILR transform, which incorporates microbial evolutionary models with the isometric log-ratio transform to allow off-the-shelf statistical tools to be safely applied to microbiota surveys. We demonstrate that analyses of community-level structure can be applied to PhILR transformed data with performance on benchmarks rivaling or surpassing standard tools. Additionally, by decomposing distance in the PhILR transformed space, we identified neighboring clades that may have adapted to distinct human body sites. Decomposing variance revealed that covariation of bacterial clades within human body sites increases with phylogenetic relatedness. Together, these findings illustrate how the PhILR transform combines statistical and phylogenetic models to overcome compositional data challenges and enable evolutionary insights relevant to microbial communities.

  12. Phylogenetic diversity and relationships among species of genus ...

    African Journals Online (AJOL)

    Fifty six Nicotiana species were used to construct phylogenetic trees and to asses the genetic relationships between them. Genetic distances estimated from RAPD analysis was used to construct phylogenetic trees using Phylogenetic Inference Package (PHYLIP). Since phylogenetic relationships estimated for closely ...

  13. Conus pennaceus : a phylogenetic analysis of the Mozambican ...

    African Journals Online (AJOL)

    The genus Conus has over 500 species and is the most species-rich taxon of marine invertebrates. Based on mitochondrial DNA, this study focuses on the phylogenetics of Conus, particularly the pennaceus complex collected along the Mozambican coast. Phylogenetic trees based on both the 16S and the 12S ribosomal ...

  14. Phylogenetic Analysis of Dengue Virus in Bangkalan, Madura Island, East Java Province, Indonesia.

    Science.gov (United States)

    Sucipto, Teguh Hari; Kotaki, Tomohiro; Mulyatno, Kris Cahyo; Churrotin, Siti; Labiqah, Amaliah; Soegijanto, Soegeng; Kameoka, Masanori

    2018-01-01

    Dengue virus (DENV) infection is a major health issue in tropical and subtropical areas. Indonesia is one of the biggest dengue endemic countries in the world. In the present study, the phylogenetic analysis of DENV in Bangkalan, Madura Island, Indonesia, was performed in order to obtain a clearer understanding of its dynamics in this country. A total of 359 blood samples from dengue-suspected patients were collected between 2012 and 2014. Serotyping was conducted using a multiplex Reverse Transcriptase-Polymerase Chain Reaction and a phylogenetic analysis of E gene sequences was performed using the Bayesian Markov chain Monte Carlo (MCMC) method. 17 out of 359 blood samples (4.7%) were positive for the isolation of DENV. Serotyping and the phylogenetic analysis revealed the predominance of DENV-1 genotype I (9/17, 52.9%), followed by DENV-2 Cosmopolitan type (7/17, 41.2%) and DENV-3 genotype I (1/17, 5.9%) . DENV-4 was not isolated. The Madura Island isolates showed high nucleotide similarity to other Indonesian isolates, indicating frequent virus circulation in Indonesia. The results of the present study highlight the importance of continuous viral surveillance in dengue endemic areas in order to obtain a clearer understanding of the dynamics of DENV in Indonesia.

  15. Phylogenetic diversity and biodiversity indices on phylogenetic networks.

    Science.gov (United States)

    Wicke, Kristina; Fischer, Mareike

    2018-04-01

    In biodiversity conservation it is often necessary to prioritize the species to conserve. Existing approaches to prioritization, e.g. the Fair Proportion Index and the Shapley Value, are based on phylogenetic trees and rank species according to their contribution to overall phylogenetic diversity. However, in many cases evolution is not treelike and thus, phylogenetic networks have been developed as a generalization of phylogenetic trees, allowing for the representation of non-treelike evolutionary events, such as hybridization. Here, we extend the concepts of phylogenetic diversity and phylogenetic diversity indices from phylogenetic trees to phylogenetic networks. On the one hand, we consider the treelike content of a phylogenetic network, e.g. the (multi)set of phylogenetic trees displayed by a network and the so-called lowest stable ancestor tree associated with it. On the other hand, we derive the phylogenetic diversity of subsets of taxa and biodiversity indices directly from the internal structure of the network. We consider both approaches that are independent of so-called inheritance probabilities as well as approaches that explicitly incorporate these probabilities. Furthermore, we introduce our software package NetDiversity, which is implemented in Perl and allows for the calculation of all generalized measures of phylogenetic diversity and generalized phylogenetic diversity indices established in this note that are independent of inheritance probabilities. We apply our methods to a phylogenetic network representing the evolutionary relationships among swordtails and platyfishes (Xiphophorus: Poeciliidae), a group of species characterized by widespread hybridization. Copyright © 2018 Elsevier Inc. All rights reserved.

  16. Phylogenetic analysis of the haemagglutinin gene of canine distemper virus strains detected from giant panda and raccoon dogs in China

    Science.gov (United States)

    2013-01-01

    Background Canine distemper virus (CDV) infects a variety of carnivores, including wild and domestic Canidae. In this study, we sequenced and phylogenetic analyses of the hemagglutinin (H) genes from eight canine distemper virus (CDV) isolates obtained from seven raccoon dogs (Nyctereutes procyonoides) and a giant panda (Ailuropoda melanoleuca) in China. Results Phylogenetic analysis of the partial hemagglutinin gene sequences showed close clustering for geographic lineages, clearly distinct from vaccine strains and other wild-type foreign CDV strains, all the CDV strains were characterized as Asia-1 genotype and were highly similar to each other (91.5-99.8% nt and 94.4-99.8% aa). The giant panda and raccoon dogs all were 549Y on the HA protein in this study, irrespective of the host species. Conclusions These findings enhance our knowledge of the genetic characteristics of Chinese CDV isolates, and may facilitate the development of effective strategies for monitoring and controlling CDV for wild canids and non-cainds in China. PMID:23566727

  17. Whole genome sequence phylogenetic analysis of four Mexican rabies viruses isolated from cattle.

    Science.gov (United States)

    Bárcenas-Reyes, I; Loza-Rubio, E; Cantó-Alarcón, G J; Luna-Cozar, J; Enríquez-Vázquez, A; Barrón-Rodríguez, R J; Milián-Suazo, F

    2017-08-01

    Phylogenetic analysis of the rabies virus in molecular epidemiology has been traditionally performed on partial sequences of the genome, such as the N, G, and P genes; however, that approach raises concerns about the discriminatory power compared to whole genome sequencing. In this study we characterized four strains of the rabies virus isolated from cattle in Querétaro, Mexico by comparing the whole genome sequence to that of strains from the American, European and Asian continents. Four cattle brain samples positive to rabies and characterized as AgV11, genotype 1, were used in the study. A cDNA sequence was generated by reverse transcription PCR (RT-PCR) using oligo dT. cDNA samples were sequenced in an Illumina NextSeq 500 platform. The phylogenetic analysis was performed with MEGA 6.0. Minimum evolution phylogenetic trees were constructed with the Neighbor-Joining method and bootstrapped with 1000 replicates. Three large and seven small clusters were formed with the 26 sequences used. The largest cluster grouped strains from different species in South America: Brazil, and the French Guyana. The second cluster grouped five strains from Mexico. A Mexican strain reported in a different study was highly related to our four strains, suggesting common source of infection. The phylogenetic analysis shows that the type of host is different for the different regions in the American Continent; rabies is more related to bats. It was concluded that the rabies virus in central Mexico is genetically stable and that it is transmitted by the vampire bat Desmodus rotundus. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. Ultrafast Approximation for Phylogenetic Bootstrap

    NARCIS (Netherlands)

    Bui Quang Minh, [No Value; Nguyen, Thi; von Haeseler, Arndt

    Nonparametric bootstrap has been a widely used tool in phylogenetic analysis to assess the clade support of phylogenetic trees. However, with the rapidly growing amount of data, this task remains a computational bottleneck. Recently, approximation methods such as the RAxML rapid bootstrap (RBS) and

  19. treespace: Statistical exploration of landscapes of phylogenetic trees.

    Science.gov (United States)

    Jombart, Thibaut; Kendall, Michelle; Almagro-Garcia, Jacob; Colijn, Caroline

    2017-11-01

    The increasing availability of large genomic data sets as well as the advent of Bayesian phylogenetics facilitates the investigation of phylogenetic incongruence, which can result in the impossibility of representing phylogenetic relationships using a single tree. While sometimes considered as a nuisance, phylogenetic incongruence can also reflect meaningful biological processes as well as relevant statistical uncertainty, both of which can yield valuable insights in evolutionary studies. We introduce a new tool for investigating phylogenetic incongruence through the exploration of phylogenetic tree landscapes. Our approach, implemented in the R package treespace, combines tree metrics and multivariate analysis to provide low-dimensional representations of the topological variability in a set of trees, which can be used for identifying clusters of similar trees and group-specific consensus phylogenies. treespace also provides a user-friendly web interface for interactive data analysis and is integrated alongside existing standards for phylogenetics. It fills a gap in the current phylogenetics toolbox in R and will facilitate the investigation of phylogenetic results. © 2017 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.

  20. System analysis of salt and osmotic stress induced proteins in Nostoc muscorum and Bradyrhizobium japonicum

    Directory of Open Access Journals (Sweden)

    Vipin Kaithwas

    2017-06-01

    Full Text Available In this study the proteome response of the two diazotrophic organism’s viz. Nostoc muscorum and Bradyrhizobium japonicum exposed to salt (NaCl and osmotic (sucrose stresses was compared. Out of the total over expressed proteins; we have selected only three over expressed proteins viz. GroEL chaperonin, nitrogenase Mo-Fe protein and argininosuccinate synthase for further analysis, and then we analyzed the amino acid frequencies of all the three over expressed proteins. That led to the conclusion that amino acids e.g. alanine, glycine and valine that were energetically cheaper to produce were showing higher frequencies. This study would help in tracing the phylogenetic relationship between protein families.

  1. Peptide Pattern Recognition for high-throughput protein sequence analysis and clustering

    DEFF Research Database (Denmark)

    Busk, Peter Kamp

    2017-01-01

    Large collections of protein sequences with divergent sequences are tedious to analyze for understanding their phylogenetic or structure-function relation. Peptide Pattern Recognition is an algorithm that was developed to facilitate this task but the previous version does only allow a limited...... number of sequences as input. I implemented Peptide Pattern Recognition as a multithread software designed to handle large numbers of sequences and perform analysis in a reasonable time frame. Benchmarking showed that the new implementation of Peptide Pattern Recognition is twenty times faster than...... the previous implementation on a small protein collection with 673 MAP kinase sequences. In addition, the new implementation could analyze a large protein collection with 48,570 Glycosyl Transferase family 20 sequences without reaching its upper limit on a desktop computer. Peptide Pattern Recognition...

  2. [A phylogenetic analysis of plant communities of Teberda Biosphere Reserve].

    Science.gov (United States)

    Shulakov, A A; Egorov, A V; Onipchenko, V G

    2016-01-01

    Phylogenetic analysis of communities is based on the comparison of distances on the phylogenetic tree between species of a community under study and those distances in random samples taken out of local flora. It makes it possible to determine to what extent a community composition is formed by more closely related species (i.e., "clustered") or, on the opposite, it is more even and includes species that are less related with each other. The first case is usually interpreted as a result of strong influence caused by abiotic factors, due to which species with similar ecology, a priori more closely related, would remain: In the second case, biotic factors, such as competition, may come to the fore and lead to forming a community out of distant clades due to divergence of their ecological niches: The aim of this' study Was Ad explore the phylogenetic structure in communities of the northwestern Caucasus at two spatial scales - the scale of area from 4 to 100 m2 and the smaller scale within a community. The list of local flora of the alpine belt has been composed using the database of geobotanic descriptions carried out in Teberda Biosphere Reserve at true altitudes exceeding.1800 m. It includes 585 species of flowering plants belonging to 57 families. Basal groups of flowering plants are.not represented in the list. At the scale of communities of three classes, namely Thlaspietea rotundifolii - commumties formed on screes and pebbles, Calluno-Ulicetea - alpine meadow, and Mulgedio-Aconitetea subalpine meadows, have not demonstrated significant distinction of phylogenetic structure. At intra level, for alpine meadows the larger share of closely related species. (clustered community) is detected. Significantly clustered happen to be those communities developing on rocks (class Asplenietea trichomanis) and alpine (class Juncetea trifidi). At the same time, alpine lichen proved to have even phylogenetic structure at the small scale. Alpine (class Salicetea herbaceae) that

  3. Phylogenetic Distribution of the Capsid Assembly Protein Gene (g20) of Cyanophages in Paddy Floodwaters in Northeast China

    Science.gov (United States)

    Jing, Ruiyong; Liu, Junjie; Yu, Zhenhua; Liu, Xiaobing; Wang, Guanghua

    2014-01-01

    Numerous studies have revealed the high diversity of cyanophages in marine and freshwater environments, but little is currently known about the diversity of cyanophages in paddy fields, particularly in Northeast (NE) China. To elucidate the genetic diversity of cyanophages in paddy floodwaters in NE China, viral capsid assembly protein gene (g20) sequences from five floodwater samples were amplified with the primers CPS1 and CPS8. Denaturing gradient gel electrophoresis (DGGE) was applied to distinguish different g20 clones. In total, 54 clones differing in g20 nucleotide sequences were obtained in this study. Phylogenetic analysis showed that the distribution of g20 sequences in this study was different from that in Japanese paddy fields, and all the sequences were grouped into Clusters α, β, γ and ε. Within Clusters α and β, three new small clusters (PFW-VII∼-IX) were identified. UniFrac analysis of g20 clone assemblages demonstrated that the community compositions of cyanophage varied among marine, lake and paddy field environments. In paddy floodwater, community compositions of cyanophage were also different between NE China and Japan. PMID:24533125

  4. Exploring the Genomic Roadmap and Molecular Phylogenetics Associated with MODY Cascades Using Computational Biology.

    Science.gov (United States)

    Chakraborty, Chiranjib; Bandyopadhyay, Sanghamitra; Doss, C George Priya; Agoramoorthy, Govindasamy

    2015-04-01

    Maturity onset diabetes of the young (MODY) is a metabolic and genetic disorder. It is different from type 1 and type 2 diabetes with low occurrence level (1-2%) among all diabetes. This disorder is a consequence of β-cell dysfunction. Till date, 11 subtypes of MODY have been identified, and all of them can cause gene mutations. However, very little is known about the gene mapping, molecular phylogenetics, and co-expression among MODY genes and networking between cascades. This study has used latest servers and software such as VarioWatch, ClustalW, MUSCLE, G Blocks, Phylogeny.fr, iTOL, WebLogo, STRING, and KEGG PATHWAY to perform comprehensive analyses of gene mapping, multiple sequences alignment, molecular phylogenetics, protein-protein network design, co-expression analysis of MODY genes, and pathway development. The MODY genes are located in chromosomes-2, 7, 8, 9, 11, 12, 13, 17, and 20. Highly aligned block shows Pro, Gly, Leu, Arg, and Pro residues are highly aligned in the positions of 296, 386, 437, 455, 456 and 598, respectively. Alignment scores inform us that HNF1A and HNF1B proteins have shown high sequence similarity among MODY proteins. Protein-protein network design shows that HNF1A, HNF1B, HNF4A, NEUROD1, PDX1, PAX4, INS, and GCK are strongly connected, and the co-expression analyses between MODY genes also show distinct association between HNF1A and HNF4A genes. This study has used latest tools of bioinformatics to develop a rapid method to assess the evolutionary relationship, the network development, and the associations among eleven MODY genes and cascades. The prediction of sequence conservation, molecular phylogenetics, protein-protein network and the association between the MODY cascades enhances opportunities to get more insights into the less-known MODY disease.

  5. Analysis of substructural variation in families of enzymatic proteins with applications to protein function prediction

    Directory of Open Access Journals (Sweden)

    Fofanov Viacheslav Y

    2010-05-01

    Full Text Available Abstract Background Structural variations caused by a wide range of physico-chemical and biological sources directly influence the function of a protein. For enzymatic proteins, the structure and chemistry of the catalytic binding site residues can be loosely defined as a substructure of the protein. Comparative analysis of drug-receptor substructures across and within species has been used for lead evaluation. Substructure-level similarity between the binding sites of functionally similar proteins has also been used to identify instances of convergent evolution among proteins. In functionally homologous protein families, shared chemistry and geometry at catalytic sites provide a common, local point of comparison among proteins that may differ significantly at the sequence, fold, or domain topology levels. Results This paper describes two key results that can be used separately or in combination for protein function analysis. The Family-wise Analysis of SubStructural Templates (FASST method uses all-against-all substructure comparison to determine Substructural Clusters (SCs. SCs characterize the binding site substructural variation within a protein family. In this paper we focus on examples of automatically determined SCs that can be linked to phylogenetic distance between family members, segregation by conformation, and organization by homology among convergent protein lineages. The Motif Ensemble Statistical Hypothesis (MESH framework constructs a representative motif for each protein cluster among the SCs determined by FASST to build motif ensembles that are shown through a series of function prediction experiments to improve the function prediction power of existing motifs. Conclusions FASST contributes a critical feedback and assessment step to existing binding site substructure identification methods and can be used for the thorough investigation of structure-function relationships. The application of MESH allows for an automated

  6. Molecular identification and phylogenetic analysis of Wuchereria bancrofti from human blood samples in Egypt.

    Science.gov (United States)

    Abdel-Shafi, Iman R; Shoieb, Eman Y; Attia, Samar S; Rubio, José M; Ta-Tang, Thuy-Huong; El-Badry, Ayman A

    2017-03-01

    Lymphatic filariasis (LF) is a serious vector-borne health problem, and Wuchereria bancrofti (W.b) is the major cause of LF worldwide and is focally endemic in Egypt. Identification of filarial infection using traditional morphologic and immunological criteria can be difficult and lead to misdiagnosis. The aim of the present study was molecular detection of W.b in residents in endemic areas in Egypt, sequence variance analysis, and phylogenetic analysis of W.b DNA. Collected blood samples from residents in filariasis endemic areas in five governorates were subjected to semi-nested PCR targeting repeated DNA sequence, for detection of W.b DNA. PCR products were sequenced; subsequently, a phylogenetic analysis of the obtained sequences was performed. Out of 300 blood samples, W.b DNA was identified in 48 (16%). Sequencing analysis confirmed PCR results identifying only W.b species. Sequence alignment and phylogenetic analysis indicated genetically distinct clusters of W.b among the study population. Study results demonstrated that the semi-nested PCR proved to be an effective diagnostic tool for accurate and rapid detection of W.b infections in nano-epidemics and is applicable for samples collected in the daytime as well as the night time. PCR products sequencing and phylogenitic analysis revealed three different nucleotide sequences variants. Further genetic studies of W.b in Egypt and other endemic areas are needed to distinguish related strains and the various ecological as well as drug effects exerted on them to support W.b elimination.

  7. Assessing the Goodness of Fit of Phylogenetic Comparative Methods: A Meta-Analysis and Simulation Study.

    Directory of Open Access Journals (Sweden)

    Dwueng-Chwuan Jhwueng

    Full Text Available Phylogenetic comparative methods (PCMs have been applied widely in analyzing data from related species but their fit to data is rarely assessed.Can one determine whether any particular comparative method is typically more appropriate than others by examining comparative data sets?I conducted a meta-analysis of 122 phylogenetic data sets found by searching all papers in JEB, Blackwell Synergy and JSTOR published in 2002-2005 for the purpose of assessing the fit of PCMs. The number of species in these data sets ranged from 9 to 117.I used the Akaike information criterion to compare PCMs, and then fit PCMs to bivariate data sets through REML analysis. Correlation estimates between two traits and bootstrapped confidence intervals of correlations from each model were also compared.For phylogenies of less than one hundred taxa, the Independent Contrast method and the independent, non-phylogenetic models provide the best fit.For bivariate analysis, correlations from different PCMs are qualitatively similar so that actual correlations from real data seem to be robust to the PCM chosen for the analysis. Therefore, researchers might apply the PCM they believe best describes the evolutionary mechanisms underlying their data.

  8. Comparison of Boolean analysis and standard phylogenetic methods using artificially evolved and natural mt-tRNA sequences from great apes.

    Science.gov (United States)

    Ari, Eszter; Ittzés, Péter; Podani, János; Thi, Quynh Chi Le; Jakó, Eena

    2012-04-01

    Boolean analysis (or BOOL-AN; Jakó et al., 2009. BOOL-AN: A method for comparative sequence analysis and phylogenetic reconstruction. Mol. Phylogenet. Evol. 52, 887-97.), a recently developed method for sequence comparison uses the Iterative Canonical Form of Boolean functions. It considers sequence information in a way entirely different from standard phylogenetic methods (i.e. Maximum Parsimony, Maximum-Likelihood, Neighbor-Joining, and Bayesian analysis). The performance and reliability of Boolean analysis were tested and compared with the standard phylogenetic methods, using artificially evolved - simulated - nucleotide sequences and the 22 mitochondrial tRNA genes of the great apes. At the outset, we assumed that the phylogeny of Hominidae is generally well established, and the guide tree of artificial sequence evolution can also be used as a benchmark. These offer a possibility to compare and test the performance of different phylogenetic methods. Trees were reconstructed by each method from 2500 simulated sequences and 22 mitochondrial tRNA sequences. We also introduced a special re-sampling method for Boolean analysis on permuted sequence sites, the P-BOOL-AN procedure. Considering the reliability values (branch support values of consensus trees and Robinson-Foulds distances) we used for simulated sequence trees produced by different phylogenetic methods, BOOL-AN appeared as the most reliable method. Although the mitochondrial tRNA sequences of great apes are relatively short (59-75 bases long) and the ratio of their constant characters is about 75%, BOOL-AN, P-BOOL-AN and the Bayesian approach produced the same tree-topology as the established phylogeny, while the outcomes of Maximum Parsimony, Maximum-Likelihood and Neighbor-Joining methods were equivocal. We conclude that Boolean analysis is a promising alternative to existing methods of sequence comparison for phylogenetic reconstruction and congruence analysis. Copyright © 2012 Elsevier Inc. All

  9. Phylogenetic inertia and Darwin's higher law.

    Science.gov (United States)

    Shanahan, Timothy

    2011-03-01

    The concept of 'phylogenetic inertia' is routinely deployed in evolutionary biology as an alternative to natural selection for explaining the persistence of characteristics that appear sub-optimal from an adaptationist perspective. However, in many of these contexts the precise meaning of 'phylogenetic inertia' and its relationship to selection are far from clear. After tracing the history of the concept of 'inertia' in evolutionary biology, I argue that treating phylogenetic inertia and natural selection as alternative explanations is mistaken because phylogenetic inertia is, from a Darwinian point of view, simply an expected effect of selection. Although Darwin did not discuss 'phylogenetic inertia,' he did assert the explanatory priority of selection over descent. An analysis of 'phylogenetic inertia' provides a perspective from which to assess Darwin's view. Copyright © 2010 Elsevier Ltd. All rights reserved.

  10. Analysis of chitin-binding proteins from Manduca sexta provides new insights into evolution of peritrophin A-type chitin-binding domains in insects.

    Science.gov (United States)

    Tetreau, Guillaume; Dittmer, Neal T; Cao, Xiaolong; Agrawal, Sinu; Chen, Yun-Ru; Muthukrishnan, Subbaratnam; Haobo, Jiang; Blissard, Gary W; Kanost, Michael R; Wang, Ping

    2015-07-01

    In insects, chitin is a major structural component of the cuticle and the peritrophic membrane (PM). In nature, chitin is always associated with proteins among which chitin-binding proteins (CBPs) are the most important for forming, maintaining and regulating the functions of these extracellular structures. In this study, a genome-wide search for genes encoding proteins with ChtBD2-type (peritrophin A-type) chitin-binding domains (CBDs) was conducted. A total of 53 genes encoding 56 CBPs were identified, including 15 CPAP1s (cuticular proteins analogous to peritrophins with 1 CBD), 11 CPAP3s (CPAPs with 3 CBDs) and 17 PMPs (PM proteins) with a variable number of CBDs, which are structural components of cuticle or of the PM. CBDs were also identified in enzymes of chitin metabolism including 6 chitinases and 7 chitin deacetylases encoded by 6 and 5 genes, respectively. RNA-seq analysis confirmed that PMP and CPAP genes have differential spatial expression patterns. The expression of PMP genes is midgut-specific, while CPAP genes are widely expressed in different cuticle forming tissues. Phylogenetic analysis of CBDs of proteins in insects belonging to different orders revealed that CPAP1s from different species constitute a separate family with 16 different groups, including 6 new groups identified in this study. The CPAP3s are clustered into a separate family of 7 groups present in all insect orders. Altogether, they reveal that duplication events of CBDs in CPAP1s and CPAP3s occurred prior to the evolutionary radiation of insect species. In contrast to the CPAPs, all CBDs from individual PMPs are generally clustered and distinct from other PMPs in the same species in phylogenetic analyses, indicating that the duplication of CBDs in each of these PMPs occurred after divergence of insect species. Phylogenetic analysis of these three CBP families showed that the CBDs in CPAP1s form a clearly separate family, while those found in PMPs and CPAP3s were clustered

  11. Orthology prediction at scalable resolution by phylogenetic tree analysis

    NARCIS (Netherlands)

    Heijden, R.T.J.M. van der; Snel, B.; Noort, V. van; Huynen, M.A.

    2007-01-01

    BACKGROUND: Orthology is one of the cornerstones of gene function prediction. Dividing the phylogenetic relations between genes into either orthologs or paralogs is however an oversimplification. Already in two-species gene-phylogenies, the complicated, non-transitive nature of phylogenetic

  12. A format for phylogenetic placements.

    Directory of Open Access Journals (Sweden)

    Frederick A Matsen

    Full Text Available We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g., short reads into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format, which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement.

  13. Host-range phylogenetic grouping of capripoxviruses. Genetic typing of CaPVs

    International Nuclear Information System (INIS)

    Le Goff, C.; Chadeyras, A.; Libeau, G.; Albina, E.; Fakhfakh, E.; Hammami, S.; Elexpeter Aba Adulugba; Diallo, A.

    2005-01-01

    Because of their close relationship, specific identification of the CaPVs genus inside the Poxviridae family relies mainly on molecular tools rather than on classical serology. We describe the suitability of the G protein-coupled chemokine receptor (GPCR), for host range phylogenetic grouping. The analysis of 26 CaPVs shows 3 tight genetic clusters consisting of goatpox virus (GPV), lumpy skin disease virus (LSDV), and sheeppox virus (SPV). (author)

  14. Phylogenetic reconstruction methods: an overview.

    Science.gov (United States)

    De Bruyn, Alexandre; Martin, Darren P; Lefeuvre, Pierre

    2014-01-01

    Initially designed to infer evolutionary relationships based on morphological and physiological characters, phylogenetic reconstruction methods have greatly benefited from recent developments in molecular biology and sequencing technologies with a number of powerful methods having been developed specifically to infer phylogenies from macromolecular data. This chapter, while presenting an overview of basic concepts and methods used in phylogenetic reconstruction, is primarily intended as a simplified step-by-step guide to the construction of phylogenetic trees from nucleotide sequences using fairly up-to-date maximum likelihood methods implemented in freely available computer programs. While the analysis of chloroplast sequences from various Vanilla species is used as an illustrative example, the techniques covered here are relevant to the comparative analysis of homologous sequences datasets sampled from any group of organisms.

  15. Detection and phylogenetic analysis of bacteriophage WO in spiders (Araneae).

    Science.gov (United States)

    Yan, Qian; Qiao, Huping; Gao, Jin; Yun, Yueli; Liu, Fengxiang; Peng, Yu

    2015-11-01

    Phage WO is a bacteriophage found in Wolbachia. Herein, we represent the first phylogenetic study of WOs that infect spiders (Araneae). Seven species of spiders (Araneus alternidens, Nephila clavata, Hylyphantes graminicola, Prosoponoides sinensis, Pholcus crypticolens, Coleosoma octomaculatum, and Nurscia albofasciata) from six families were infected by Wolbachia and WO, followed by comprehensive sequence analysis. Interestingly, WO could be only detected Wolbachia-infected spiders. The relative infection rates of those seven species of spiders were 75, 100, 88.9, 100, 62.5, 72.7, and 100 %, respectively. Our results indicated that both Wolbachia and WO were found in three different body parts of N. clavata, and WO could be passed to the next generation of H. graminicola by vertical transmission. There were three different sequences for WO infected in A. alternidens and two different WO sequences from C. octomaculatum. Only one sequence of WO was found for the other five species of spiders. The discovered sequence of WO ranged from 239 to 311 bp. Phylogenetic tree was generated using maximum likelihood (ML) based on the orf7 gene sequences. According to the phylogenetic tree, WOs in N. clavata and H. graminicola were clustered in the same group. WOs from A. alternidens (WAlt1) and C. octomaculatum (WOct2) were closely related to another clade, whereas WO in P. sinensis was classified as a sole cluster.

  16. A gateway for phylogenetic analysis powered by grid computing featuring GARLI 2.0.

    Science.gov (United States)

    Bazinet, Adam L; Zwickl, Derrick J; Cummings, Michael P

    2014-09-01

    We introduce molecularevolution.org, a publicly available gateway for high-throughput, maximum-likelihood phylogenetic analysis powered by grid computing. The gateway features a garli 2.0 web service that enables a user to quickly and easily submit thousands of maximum likelihood tree searches or bootstrap searches that are executed in parallel on distributed computing resources. The garli web service allows one to easily specify partitioned substitution models using a graphical interface, and it performs sophisticated post-processing of phylogenetic results. Although the garli web service has been used by the research community for over three years, here we formally announce the availability of the service, describe its capabilities, highlight new features and recent improvements, and provide details about how the grid system efficiently delivers high-quality phylogenetic results. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

  17. Hippo pathway phylogenetics predicts monoubiquitylation of Salvador and Merlin/Nf2.

    Directory of Open Access Journals (Sweden)

    Robert G Wisotzkey

    Full Text Available Recently we employed phylogenetics to predict that the cellular interpretation of TGF-β signals is modulated by monoubiquitylation cycles affecting the Smad4 signal transducer/tumor suppressor. This prediction was subsequently validated by experiments in flies, frogs and mammalian cells. Here we apply a phylogenetic approach to the Hippo pathway and predict that two of its signal transducers, Salvador and Merlin/Nf2 (also a tumor suppressor are regulated by monoubiquitylation. This regulatory mechanism does not lead to protein degradation but instead serves as a highly efficient "off/on" switch when the protein is subsequently deubiquitylated. Overall, our study shows that the creative application of phylogenetics can predict new roles for pathway components and new mechanisms for regulating intercellular signaling pathways.

  18. Phylogenetic molecular function annotation

    International Nuclear Information System (INIS)

    Engelhardt, Barbara E; Jordan, Michael I; Repo, Susanna T; Brenner, Steven E

    2009-01-01

    It is now easier to discover thousands of protein sequences in a new microbial genome than it is to biochemically characterize the specific activity of a single protein of unknown function. The molecular functions of protein sequences have typically been predicted using homology-based computational methods, which rely on the principle that homologous proteins share a similar function. However, some protein families include groups of proteins with different molecular functions. A phylogenetic approach for predicting molecular function (sometimes called 'phylogenomics') is an effective means to predict protein molecular function. These methods incorporate functional evidence from all members of a family that have functional characterizations using the evolutionary history of the protein family to make robust predictions for the uncharacterized proteins. However, they are often difficult to apply on a genome-wide scale because of the time-consuming step of reconstructing the phylogenies of each protein to be annotated. Our automated approach for function annotation using phylogeny, the SIFTER (Statistical Inference of Function Through Evolutionary Relationships) methodology, uses a statistical graphical model to compute the probabilities of molecular functions for unannotated proteins. Our benchmark tests showed that SIFTER provides accurate functional predictions on various protein families, outperforming other available methods.

  19. PALM: a paralleled and integrated framework for phylogenetic inference with automatic likelihood model selectors.

    Directory of Open Access Journals (Sweden)

    Shu-Hwa Chen

    Full Text Available BACKGROUND: Selecting an appropriate substitution model and deriving a tree topology for a given sequence set are essential in phylogenetic analysis. However, such time consuming, computationally intensive tasks rely on knowledge of substitution model theories and related expertise to run through all possible combinations of several separate programs. To ensure a thorough and efficient analysis and avert tedious manipulations of various programs, this work presents an intuitive framework, the phylogenetic reconstruction with automatic likelihood model selectors (PALM, with convincing, updated algorithms and a best-fit model selection mechanism for seamless phylogenetic analysis. METHODOLOGY: As an integrated framework of ClustalW, PhyML, MODELTEST, ProtTest, and several in-house programs, PALM evaluates the fitness of 56 substitution models for nucleotide sequences and 112 substitution models for protein sequences with scores in various criteria. The input for PALM can be either sequences in FASTA format or a sequence alignment file in PHYLIP format. To accelerate the computing of maximum likelihood and bootstrapping, this work integrates MPICH2/PhyML, PalmMonitor and Palm job controller across several machines with multiple processors and adopts the task parallelism approach. Moreover, an intuitive and interactive web component, PalmTree, is developed for displaying and operating the output tree with options of tree rooting, branches swapping, viewing the branch length values, and viewing bootstrapping score, as well as removing nodes to restart analysis iteratively. SIGNIFICANCE: The workflow of PALM is straightforward and coherent. Via a succinct, user-friendly interface, researchers unfamiliar with phylogenetic analysis can easily use this server to submit sequences, retrieve the output, and re-submit a job based on a previous result if some sequences are to be deleted or added for phylogenetic reconstruction. PALM results in an inference of

  20. Orthology prediction at scalable resolution by phylogenetic tree analysis

    Directory of Open Access Journals (Sweden)

    Huynen Martijn A

    2007-03-01

    Full Text Available Abstract Background Orthology is one of the cornerstones of gene function prediction. Dividing the phylogenetic relations between genes into either orthologs or paralogs is however an oversimplification. Already in two-species gene-phylogenies, the complicated, non-transitive nature of phylogenetic relations results in inparalogs and outparalogs. For situations with more than two species we lack semantics to specifically describe the phylogenetic relations, let alone to exploit them. Published procedures to extract orthologous groups from phylogenetic trees do not allow identification of orthology at various levels of resolution, nor do they document the relations between the orthologous groups. Results We introduce "levels of orthology" to describe the multi-level nature of gene relations. This is implemented in a program LOFT (Levels of Orthology From Trees that assigns hierarchical orthology numbers to genes based on a phylogenetic tree. To decide upon speciation and gene duplication events in a tree LOFT can be instructed either to perform classical species-tree reconciliation or to use the species overlap between partitions in the tree. The hierarchical orthology numbers assigned by LOFT effectively summarize the phylogenetic relations between genes. The resulting high-resolution orthologous groups are depicted in colour, facilitating visual inspection of (large trees. A benchmark for orthology prediction, that takes into account the varying levels of orthology between genes, shows that the phylogeny-based high-resolution orthology assignments made by LOFT are reliable. Conclusion The "levels of orthology" concept offers high resolution, reliable orthology, while preserving the relations between orthologous groups. A Windows as well as a preliminary Java version of LOFT is available from the LOFT website http://www.cmbi.ru.nl/LOFT.

  1. Expressed sequence tags as a tool for phylogenetic analysis of placental mammal evolution.

    Directory of Open Access Journals (Sweden)

    Morgan Kullberg

    Full Text Available BACKGROUND: We investigate the usefulness of expressed sequence tags, ESTs, for establishing divergences within the tree of placental mammals. This is done on the example of the established relationships among primates (human, lagomorphs (rabbit, rodents (rat and mouse, artiodactyls (cow, carnivorans (dog and proboscideans (elephant. METHODOLOGY/PRINCIPAL FINDINGS: We have produced 2000 ESTs (1.2 mega bases from a marsupial mouse and characterized the data for their use in phylogenetic analysis. The sequences were used to identify putative orthologous sequences from whole genome projects. Although most ESTs stem from single sequence reads, the frequency of potential sequencing errors was found to be lower than allelic variation. Most of the sequences represented slowly evolving housekeeping-type genes, with an average amino acid distance of 6.6% between human and mouse. Positive Darwinian selection was identified at only a few single sites. Phylogenetic analyses of the EST data yielded trees that were consistent with those established from whole genome projects. CONCLUSIONS: The general quality of EST sequences and the general absence of positive selection in these sequences make ESTs an attractive tool for phylogenetic analysis. The EST approach allows, at reasonable costs, a fast extension of data sampling from species outside the genome projects.

  2. PHYLOGENETIC RELATIONSHIPS AMONGST 10 Durio SPECIES BASED ON PCR-RFLP ANALYSIS OF TWO CHLOROPLAST GENES

    Directory of Open Access Journals (Sweden)

    Panca J. Santoso

    2013-07-01

    Full Text Available Twenty seven species of Durio have been identified in Sabah and Sarawak, Malaysia, but their relationships have not been studied. This study was conducted to analyse phylogenetic relationships amongst 10 Durio species in Malaysia using PCR-RFLP on two chloroplast DNA genes, i.e. ndhC-trnV and rbcL. DNAs were extracted from young leaves of 11 accessions from 10 Durio species collected from the Tenom Agriculture Research Station, Sabah, and University Agriculture Park, Universiti Putra Malaysia. Two pairs of oligonucleotide primers, N1-N2 and rbcL1-rbcL2, were used to flank the target regions ndhC-trnV and rbcL. Eight restriction enzymes, HindIII, BsuRI, PstI, TaqI, MspI, SmaI, BshNI, and EcoR130I, were used to digest the amplicons. Based on the results of PCR-RFLP on ndhC-trnV gene, the 10 Durio species were grouped into five distinct clusters, and the accessions generally showed high variations. However, based on the results of PCR-RFLP on the rbcL gene, the species were grouped into three distinct clusters, and generally showed low variations. This means that ndhC-trnV gene is more reliable for phylogenetic analysis in lower taxonomic level of Durio species or for diversity analysis, while rbcL gene is reliable marker for phylogenetic analysis at higher taxonomic level. PCR-RFLP on the ndhC-trnV and rbcL genes could therefore be considered as useful markers to phylogenetic analysis amongst Durio species. These finding might be used for further molecular marker assisted in Durio breeding program.

  3. GapCoder automates the use of indel characters in phylogenetic analysis.

    Science.gov (United States)

    Young, Nelson D; Healy, John

    2003-02-19

    Several ways of incorporating indels into phylogenetic analysis have been suggested. Simple indel coding has two strengths: (1) biological realism and (2) efficiency of analysis. In the method, each indel with different start and/or end positions is considered to be a separate character. The presence/absence of these indel characters is then added to the data set. We have written a program, GapCoder to automate this procedure. The program can input PIR format aligned datasets, find the indels and add the indel-based characters. The output is a NEXUS format file, which includes a table showing what region each indel characters is based on. If regions are excluded from analysis, this table makes it easy to identify the corresponding indel characters for exclusion. Manual implementation of the simple indel coding method can be very time-consuming, especially in data sets where indels are numerous and/or overlapping. GapCoder automates this method and is therefore particularly useful during procedures where phylogenetic analyses need to be repeated many times, such as when different alignments are being explored or when various taxon or character sets are being explored. GapCoder is currently available for Windows from http://www.home.duq.edu/~youngnd/GapCoder.

  4. aes, the gene encoding the esterase B in Escherichia coli, is a powerful phylogenetic marker of the species

    Directory of Open Access Journals (Sweden)

    Tuffery Pierre

    2009-12-01

    Full Text Available Abstract Background Previous studies have established a correlation between electrophoretic polymorphism of esterase B, and virulence and phylogeny of Escherichia coli. Strains belonging to the phylogenetic group B2 are more frequently implicated in extraintestinal infections and include esterase B2 variants, whereas phylogenetic groups A, B1 and D contain less virulent strains and include esterase B1 variants. We investigated esterase B as a marker of phylogeny and/or virulence, in a thorough analysis of the esterase B-encoding gene. Results We identified the gene encoding esterase B as the acetyl-esterase gene (aes using gene disruption. The analysis of aes nucleotide sequences in a panel of 78 reference strains, including the E. coli reference (ECOR strains, demonstrated that the gene is under purifying selection. The phylogenetic tree reconstructed from aes sequences showed a strong correlation with the species phylogenetic history, based on multi-locus sequence typing using six housekeeping genes. The unambiguous distinction between variants B1 and B2 by electrophoresis was consistent with Aes amino-acid sequence analysis and protein modelling, which showed that substituted amino acids in the two esterase B variants occurred mostly at different sites on the protein surface. Studies in an experimental mouse model of septicaemia using mutant strains did not reveal a direct link between aes and extraintestinal virulence. Moreover, we did not find any genes in the chromosomal region of aes to be associated with virulence. Conclusion Our findings suggest that aes does not play a direct role in the virulence of E. coli extraintestinal infection. However, this gene acts as a powerful marker of phylogeny, illustrating the extensive divergence of B2 phylogenetic group strains from the rest of the species.

  5. Nucleotide diversity and phylogenetic relationships among ...

    Indian Academy of Sciences (India)

    NIRAJ SINGH

    for phylogenetic analysis of Gladiolus and related taxa using combined datasets from chloroplast genome. The psbA–trnH ... phylogenetic relationships among cultivars could be useful for hybridization programmes for further improvement of the crop. [Singh N. ... breeding in nature, and exhibited diverse pollination mech-.

  6. Unrealistic phylogenetic trees may improve phylogenetic footprinting.

    Science.gov (United States)

    Nettling, Martin; Treutler, Hendrik; Cerquides, Jesus; Grosse, Ivo

    2017-06-01

    The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approaches based on phylogenetic footprinting become increasingly attractive. Phylogenetic footprinting requires phylogenetic trees with attached substitution probabilities for quantifying the evolution of binding sites, but these trees and substitution probabilities are typically not known and cannot be estimated easily. Here, we investigate the influence of phylogenetic trees with different substitution probabilities on the classification performance of phylogenetic footprinting using synthetic and real data. For synthetic data we find that the classification performance is highest when the substitution probability used for phylogenetic footprinting is similar to that used for data generation. For real data, however, we typically find that the classification performance of phylogenetic footprinting surprisingly increases with increasing substitution probabilities and is often highest for unrealistically high substitution probabilities close to one. This finding suggests that choosing realistic model assumptions might not always yield optimal predictions in general and that choosing unrealistically high substitution probabilities close to one might actually improve the classification performance of phylogenetic footprinting. The proposed PF is implemented in JAVA and can be downloaded from https://github.com/mgledi/PhyFoo. : martin.nettling@informatik.uni-halle.de. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.

  7. Phylogenetic analysis of several Thermus strains from Rehai of Tengchong, Yunnan, China.

    Science.gov (United States)

    Lin, Lianbing; Zhang, Jie; Wei, Yunlin; Chen, Chaoyin; Peng, Qian

    2005-10-01

    Several Thermus strains were isolated from 10 hot springs of the Rehai geothermal area in Tengchong, Yunnan province. The diversity of Thermus strains was examined by sequencing the 16S rRNA genes and comparing their sequences. Phylogenetic analysis showed that the 16S rDNA sequences from the Rehai geothermal isolates form four branches in the phylogenetic tree and had greater than 95.9% similarity in the phylogroup. Secondary structure comparison also indicated that the 16S rRNA from the Rehai geothermal isolates have unique secondary structure characteristics in helix 6, helix 9, and helix 10 (reference to Escherichia coli). This research is the first attempt to reveal the diversity of Thermus strains that are distributed in the Rehai geothermal area.

  8. Phylogenetic trees

    OpenAIRE

    Baños, Hector; Bushek, Nathaniel; Davidson, Ruth; Gross, Elizabeth; Harris, Pamela E.; Krone, Robert; Long, Colby; Stewart, Allen; Walker, Robert

    2016-01-01

    We introduce the package PhylogeneticTrees for Macaulay2 which allows users to compute phylogenetic invariants for group-based tree models. We provide some background information on phylogenetic algebraic geometry and show how the package PhylogeneticTrees can be used to calculate a generating set for a phylogenetic ideal as well as a lower bound for its dimension. Finally, we show how methods within the package can be used to compute a generating set for the join of any two ideals.

  9. Phylogenetic analysis of methanogens from the bovine rumen

    Directory of Open Access Journals (Sweden)

    Forster Robert J

    2001-05-01

    Full Text Available Abstract Background Interest in methanogens from ruminants has resulted from the role of methane in global warming and from the fact that cattle typically lose 6 % of ingested energy as methane. Several species of methanogens have been isolated from ruminants. However they are difficult to culture, few have been consistently found in high numbers, and it is likely that major species of rumen methanogens are yet to be identified. Results Total DNA from clarified bovine rumen fluid was amplified using primers specific for Archaeal 16S rRNA gene sequences (rDNA. Phylogenetic analysis of 41 rDNA sequences identified three clusters of methanogens. The largest cluster contained two distinct subclusters with rDNA sequences similar to Methanobrevibacter ruminantium 16S rDNA. A second cluster contained sequences related to 16S rDNA from Methanosphaera stadtmanae, an organism not previously described in the rumen. The third cluster contained rDNA sequences that may form a novel group of rumen methanogens. Conclusions The current set of 16S rRNA hybridization probes targeting methanogenic Archaea does not cover the phylogenetic diversity present in the rumen and possibly other gastro-intestinal tract environments. New probes and quantitative PCR assays are needed to determine the distribution of the newly identified methanogen clusters in rumen microbial communities.

  10. [BIOINFORMATIC SEARCH AND PHYLOGENETIC ANALYSIS OF THE CELLULOSE SYNTHASE GENES OF FLAX (LINUM USITATISSIMUM)].

    Science.gov (United States)

    Pydiura, N A; Bayer, G Ya; Galinousky, D V; Yemets, A I; Pirko, Ya V; Podvitski, T A; Anisimova, N V; Khotyleva, L V; Kilchevsky, A V; Blume, Ya B

    2015-01-01

    A bioinformatic search of sequences encoding cellulose synthase genes in the flax genome, and their comparison to dicots orthologs was carried out. The analysis revealed 32 cellulose synthase gene candidates, 16 of which are highly likely to encode cellulose synthases, and the remaining 16--cellulose synthase-like proteins (Csl). Phylogenetic analysis of gene products of cellulose synthase genes allowed distinguishing 6 groups of cellulose synthase genes of different classes: CesA1/10, CesA3, CesA4, CesA5/6/2/9, CesA7 and CesA8. Paralogous sequences within classes CesA1/10 and CesA5/6/2/9 which are associated with the primary cell wall formation are characterized by a greater similarity within these classes than orthologous sequences. Whereas the genes controlling the biosynthesis of secondary cell wall cellulose form distinct clades: CesA4, CesA7, and CesA8. The analysis of 16 identified flax cellulose synthase gene candidates shows the presence of at least 12 different cellulose synthase gene variants in flax genome which are represented in all six clades of cellulose synthase genes. Thus, at this point genes of all ten known cellulose synthase classes are identify in flax genome, but their correct classification requires additional research.

  11. PROCOV: maximum likelihood estimation of protein phylogeny under covarion models and site-specific covarion pattern analysis

    Directory of Open Access Journals (Sweden)

    Wang Huai-Chun

    2009-09-01

    Full Text Available Abstract Background The covarion hypothesis of molecular evolution holds that selective pressures on a given amino acid or nucleotide site are dependent on the identity of other sites in the molecule that change throughout time, resulting in changes of evolutionary rates of sites along the branches of a phylogenetic tree. At the sequence level, covarion-like evolution at a site manifests as conservation of nucleotide or amino acid states among some homologs where the states are not conserved in other homologs (or groups of homologs. Covarion-like evolution has been shown to relate to changes in functions at sites in different clades, and, if ignored, can adversely affect the accuracy of phylogenetic inference. Results PROCOV (protein covarion analysis is a software tool that implements a number of previously proposed covarion models of protein evolution for phylogenetic inference in a maximum likelihood framework. Several algorithmic and implementation improvements in this tool over previous versions make computationally expensive tree searches with covarion models more efficient and analyses of large phylogenomic data sets tractable. PROCOV can be used to identify covarion sites by comparing the site likelihoods under the covarion process to the corresponding site likelihoods under a rates-across-sites (RAS process. Those sites with the greatest log-likelihood difference between a 'covarion' and an RAS process were found to be of functional or structural significance in a dataset of bacterial and eukaryotic elongation factors. Conclusion Covarion models implemented in PROCOV may be especially useful for phylogenetic estimation when ancient divergences between sequences have occurred and rates of evolution at sites are likely to have changed over the tree. It can also be used to study lineage-specific functional shifts in protein families that result in changes in the patterns of site variability among subtrees.

  12. Cryptorchestia ruffoi sp. n. from the island of Rhodes (Greece, revealed by morphological and phylogenetic analysis (Crustacea, Amphipoda, Talitridae

    Directory of Open Access Journals (Sweden)

    Domenico Davolos

    2017-02-01

    Full Text Available A new Cryptorchestia species, Cryptorchestia ruffoi Latella & Vonk, sp. n. from the island of Rhodes in south-eastern Greece, can be distinguished on the basis of morphological and phylogenetic data. Morphological analysis and DNA sequencing of mitochondrial and nuclear protein-coding genes indicated that this species is related to C. cavimana (Cyprus and C. garbinii (Mediterranean regions, with a recent northward expansion. Results supported a genetic separation between the Cryptorchestia species of the east Mediterranean regions and those of the northeast Atlantic volcanic islands examined in this study (C. canariensis, C. gomeri, C. guancha, and C. stocki from the Canary islands, C. monticola from Madeira, and C. chevreuxi from the Azores. The Mediterranean and Atlantic Cryptorchestia species appear to be also morphologically distinct. Cryptorchestia ruffoi sp. n., C. cavimana, C. garbinii, and C. kosswigi (Turkish coast clearly have a small lobe on the male gnathopod 1 merus. This character was the main diagnostic difference between Cryptorchestia (sensu Lowry, 2013 and Orchestia. However, among the six northeast Atlantic island Cryptorchestia species only C. stocki has a small lobe on the merus of gnathopod 1. Reduction or loss of the lobe in the Atlantic Island species cannot be ruled out; however, molecular phylogenetic analysis leads us to presume that this lobe independently evolved between the east Mediterranean Cryptorchestia species and C. stocki from Gran Canaria.

  13. Phylogenetic inference of calyptrates, with the first mitogenomes for Gasterophilinae (Diptera: Oestridae) and Paramacronychiinae (Diptera: Sarcophagidae)

    Science.gov (United States)

    Zhang, Dong; Yan, Liping; Zhang, Ming; Chu, Hongjun; Cao, Jie; Li, Kai; Hu, Defu; Pape, Thomas

    2016-01-01

    The complete mitogenome of the horse stomach bot fly Gasterophilus pecorum (Fabricius) and a near-complete mitogenome of Wohlfahrt's wound myiasis fly Wohlfahrtia magnifica (Schiner) were sequenced. The mitogenomes contain the typical 37 mitogenes found in metazoans, organized in the same order and orientation as in other cyclorrhaphan Diptera. Phylogenetic analyses of mitogenomes from 38 calyptrate taxa with and without two non-calyptrate outgroups were performed using Bayesian Inference and Maximum Likelihood. Three sub-analyses were performed on the concatenated data: (1) not partitioned; (2) partitioned by gene; (3) 3rd codon positions of protein-coding genes omitted. We estimated the contribution of each of the mitochondrial genes for phylogenetic analysis, as well as the effect of some popular methodologies on calyptrate phylogeny reconstruction. In the favoured trees, the Oestroidea are nested within the muscoid grade. Relationships at the family level within Oestroidea are (remaining Calliphoridae (Sarcophagidae (Oestridae, Pollenia + Tachinidae))). Our mito-phylogenetic reconstruction of the Calyptratae presents the most extensive taxon coverage so far, and the risk of long-branch attraction is reduced by an appropriate selection of outgroups. We find that in the Calyptratae the ND2, ND5, ND1, COIII, and COI genes are more phylogenetically informative compared with other mitochondrial protein-coding genes. Our study provides evidence that data partitioning and the inclusion of conserved tRNA genes have little influence on calyptrate phylogeny reconstruction, and that the 3rd codon positions of protein-coding genes are not saturated and therefore should be included. PMID:27019632

  14. A global phylogenetic analysis in order to determine the host species and geography dependent features present in the evolution of avian H9N2 influenza hemagglutinin

    Directory of Open Access Journals (Sweden)

    Andrew R. Dalby

    2014-10-01

    Full Text Available A complete phylogenetic analysis of all of the H9N2 hemagglutinin sequences that were collected between 1966 and 2012 was carried out in order to build a picture of the geographical and host specific evolution of the hemagglutinin protein. To improve the quality and applicability of the output data the sequences were divided into subsets based upon location and host species.The phylogenetic analysis of hemagglutinin reveals that the protein has distinct lineages between China and the Middle East, and that wild birds in both regions retain a distinct form of the H9 molecule, from the same lineage as the ancestral hemagglutinin. The results add further evidence to the hypothesis that the current predominant H9N2 hemagglutinin lineage might have originated in Southern China. The study also shows that there are sampling problems that affect the reliability of this and any similar analysis. This raises questions about the surveillance of H9N2 and the need for wider sampling of the virus in the environment.The results of this analysis are also consistent with a model where hemagglutinin has predominantly evolved by neutral drift punctuated by occasional selection events. These selective events have produced the current pattern of distinct lineages in the Middle East, Korea and China. This interpretation is in agreement with existing studies that have shown that there is widespread intra-country sequence evolution.

  15. Molecular cytogenetic characterisation and phylogenetic analysis of the seven cultivated Vigna species (Fabaceae).

    Science.gov (United States)

    She, C-W; Jiang, X-H; Ou, L-J; Liu, J; Long, K-L; Zhang, L-H; Duan, W-T; Zhao, W; Hu, J-C

    2015-01-01

    The genomic organisation of the seven cultivated Vigna species, V. unguiculata, V. subterranea, V. angularis, V. umbellata, V. radiata, V. mungo and V. aconitifolia, was determined using sequential combined PI and DAPI (CPD) staining and dual-colour fluorescence in situ hybridisation (FISH) with 5S and 45S rDNA probes. For phylogenetic analyses, comparative genomic in situ hybridisation (cGISH) onto somatic chromosomes and sequence analysis of the internal transcribed spacer (ITS) of 45S rDNA were used. Quantitative karyotypes were established using chromosome measurements, fluorochrome bands and rDNA FISH signals. All species had symmetrical karyotypes composed of only metacentric or metacentric and submetacentric chromosomes. Distinct heterochromatin differentiation was revealed by CPD staining and DAPI counterstaining after FISH. The rDNA sites among all species differed in their number, location and size. cGISH of V. umbellata genomic DNA to the chromosomes of all species produced strong signals in all centromeric regions of V. umbellata and V. angularis, weak signals in all pericentromeric regions of V. aconitifolia, and CPD-banded proximal regions of V. mungo var. mungo. Molecular phylogenetic trees showed that V. angularis and V. umbellata were the closest relatives, and V. mungo and V. aconitifolia were relatively closely related; these species formed a group that was separated from another group comprising V. radiata, V. unguiculata ssp. sesquipedalis and V. subterranea. This result was consistent with the phylogenetic relationships inferred from the heterochromatin and cGISH patterns; thus, fluorochrome banding and cGISH are efficient tools for the phylogenetic analysis of Vigna species. © 2014 German Botanical Society and The Royal Botanical Society of the Netherlands.

  16. Phylogenetic analysis of P5 P-type ATPases, a eukaryotic lineage of secretory pathway pumps

    DEFF Research Database (Denmark)

    Møller, Annette; Asp, Torben; Holm, Preben Bach

    2008-01-01

    prokaryotic genome. Based on a protein alignment we could group the P5 ATPases into two subfamilies, P5A and P5B that, based on the number of negative charges in conserved trans-membrane segment 4, are likely to have different ion specificities. P5A ATPases are present in all eukaryotic genomes sequenced so......Eukaryotes encompass a remarkable variety of organisms and unresolved lineages. Different phylogenetic analyses have lead to conflicting conclusions as to the origin and associations between lineages and species. In this work, we investigated evolutionary relationship of a family of cation pumps...... exclusive for the secretory pathway of eukaryotes by combining the identification of lineage-specific genes with phylogenetic evolution of common genes. Sequences of P5 ATPases, which are regarded to be cation pumps in the endoplasmic reticulum (ER), were identified in all eukaryotic lineages but not in any...

  17. Phylogenetic Evidence for Lateral Gene Transfer in the Intestine of Marine Iguanas

    Science.gov (United States)

    Nelson, David M.; Cann, Isaac K. O.; Altermann, Eric; Mackie, Roderick I.

    2010-01-01

    Background Lateral gene transfer (LGT) appears to promote genotypic and phenotypic variation in microbial communities in a range of environments, including the mammalian intestine. However, the extent and mechanisms of LGT in intestinal microbial communities of non-mammalian hosts remains poorly understood. Methodology/Principal Findings We sequenced two fosmid inserts obtained from a genomic DNA library derived from an agar-degrading enrichment culture of marine iguana fecal material. The inserts harbored 16S rRNA genes that place the organism from which they originated within Clostridium cluster IV, a well documented group that habitats the mammalian intestinal tract. However, sequence analysis indicates that 52% of the protein-coding genes on the fosmids have top BLASTX hits to bacterial species that are not members of Clostridium cluster IV, and phylogenetic analysis suggests that at least 10 of 44 coding genes on the fosmids may have been transferred from Clostridium cluster XIVa to cluster IV. The fosmids encoded four transposase-encoding genes and an integrase-encoding gene, suggesting their involvement in LGT. In addition, several coding genes likely involved in sugar transport were probably acquired through LGT. Conclusion Our phylogenetic evidence suggests that LGT may be common among phylogenetically distinct members of the phylum Firmicutes inhabiting the intestinal tract of marine iguanas. PMID:20520734

  18. Phylogenetic evidence for lateral gene transfer in the intestine of marine iguanas.

    Directory of Open Access Journals (Sweden)

    David M Nelson

    Full Text Available BACKGROUND: Lateral gene transfer (LGT appears to promote genotypic and phenotypic variation in microbial communities in a range of environments, including the mammalian intestine. However, the extent and mechanisms of LGT in intestinal microbial communities of non-mammalian hosts remains poorly understood. METHODOLOGY/PRINCIPAL FINDINGS: We sequenced two fosmid inserts obtained from a genomic DNA library derived from an agar-degrading enrichment culture of marine iguana fecal material. The inserts harbored 16S rRNA genes that place the organism from which they originated within Clostridium cluster IV, a well documented group that habitats the mammalian intestinal tract. However, sequence analysis indicates that 52% of the protein-coding genes on the fosmids have top BLASTX hits to bacterial species that are not members of Clostridium cluster IV, and phylogenetic analysis suggests that at least 10 of 44 coding genes on the fosmids may have been transferred from Clostridium cluster XIVa to cluster IV. The fosmids encoded four transposase-encoding genes and an integrase-encoding gene, suggesting their involvement in LGT. In addition, several coding genes likely involved in sugar transport were probably acquired through LGT. CONCLUSION: Our phylogenetic evidence suggests that LGT may be common among phylogenetically distinct members of the phylum Firmicutes inhabiting the intestinal tract of marine iguanas.

  19. Phylogenetic evidence for lateral gene transfer in the intestine of marine iguanas.

    Science.gov (United States)

    Nelson, David M; Cann, Isaac K O; Altermann, Eric; Mackie, Roderick I

    2010-05-24

    Lateral gene transfer (LGT) appears to promote genotypic and phenotypic variation in microbial communities in a range of environments, including the mammalian intestine. However, the extent and mechanisms of LGT in intestinal microbial communities of non-mammalian hosts remains poorly understood. We sequenced two fosmid inserts obtained from a genomic DNA library derived from an agar-degrading enrichment culture of marine iguana fecal material. The inserts harbored 16S rRNA genes that place the organism from which they originated within Clostridium cluster IV, a well documented group that habitats the mammalian intestinal tract. However, sequence analysis indicates that 52% of the protein-coding genes on the fosmids have top BLASTX hits to bacterial species that are not members of Clostridium cluster IV, and phylogenetic analysis suggests that at least 10 of 44 coding genes on the fosmids may have been transferred from Clostridium cluster XIVa to cluster IV. The fosmids encoded four transposase-encoding genes and an integrase-encoding gene, suggesting their involvement in LGT. In addition, several coding genes likely involved in sugar transport were probably acquired through LGT. Our phylogenetic evidence suggests that LGT may be common among phylogenetically distinct members of the phylum Firmicutes inhabiting the intestinal tract of marine iguanas.

  20. The family of DOF transcription factors in Brachypodium distachyon: phylogenetic comparison with rice and barley DOFs and expression profiling

    Directory of Open Access Journals (Sweden)

    Hernando-Amado Sara

    2012-11-01

    Full Text Available Abstract Background Transcription factors (TFs are proteins that have played a central role both in evolution and in domestication, and are major regulators of development in living organisms. Plant genome sequences reveal that approximately 7% of all genes encode putative TFs. The DOF (DNA binding with One Finger TF family has been associated with vital processes exclusive to higher plants and to their close ancestors (algae, mosses and ferns. These are seed maturation and germination, light-mediated regulation, phytohormone and plant responses to biotic and abiotic stresses, etc. In Hordeum vulgare and Oryza sativa, 26 and 30 different Dof genes, respectively, have been annotated. Brachypodium distachyon has been the first Pooideae grass to be sequenced and, due to its genomic, morphological and physiological characteristics, has emerged as the model system for temperate cereals, such as wheat and barley. Results Through searches in the B. distachyon genome, 27 Dof genes have been identified and a phylogenetic comparison with the Oryza sativa and the Hordeum vulgare DOFs has been performed. To explore the evolutionary relationship among these DOF proteins, a combined phylogenetic tree has been constructed with the Brachypodium DOFs and those from rice and barley. This phylogenetic analysis has classified the DOF proteins into four Major Cluster of Orthologous Groups (MCOGs. Using RT-qPCR analysis the expression profiles of the annotated BdDof genes across four organs (leaves, roots, spikes and seeds has been investigated. These results have led to a classification of the BdDof genes into two groups, according to their expression levels. The genes highly or preferentially expressed in seeds have been subjected to a more detailed expression analysis (maturation, dry stage and germination. Conclusions Comparison of the expression profiles of the Brachypodium Dof genes with the published functions of closely related DOF sequences from the cereal

  1. Analysis of complete mitochondrial genome sequences increases phylogenetic resolution of bears (Ursidae, a mammalian family that experienced rapid speciation

    Directory of Open Access Journals (Sweden)

    Ryder Oliver A

    2007-10-01

    Full Text Available Abstract Background Despite the small number of ursid species, bear phylogeny has long been a focus of study due to their conservation value, as all bear genera have been classified as endangered at either the species or subspecies level. The Ursidae family represents a typical example of rapid evolutionary radiation. Previous analyses with a single mitochondrial (mt gene or a small number of mt genes either provide weak support or a large unresolved polytomy for ursids. We revisit the contentious relationships within Ursidae by analyzing complete mt genome sequences and evaluating the performance of both entire mt genomes and constituent mtDNA genes in recovering a phylogeny of extremely recent speciation events. Results This mitochondrial genome-based phylogeny provides strong evidence that the spectacled bear diverged first, while within the genus Ursus, the sloth bear is the sister taxon of all the other five ursines. The latter group is divided into the brown bear/polar bear and the two black bears/sun bear assemblages. These findings resolve the previous conflicts between trees using partial mt genes. The ability of different categories of mt protein coding genes to recover the correct phylogeny is concordant with previous analyses for taxa with deep divergence times. This study provides a robust Ursidae phylogenetic framework for future validation by additional independent evidence, and also has significant implications for assisting in the resolution of other similarly difficult phylogenetic investigations. Conclusion Identification of base composition bias and utilization of the combined data of whole mitochondrial genome sequences has allowed recovery of a strongly supported phylogeny that is upheld when using multiple alternative outgroups for the Ursidae, a mammalian family that underwent a rapid radiation since the mid- to late Pliocene. It remains to be seen if the reliability of mt genome analysis will hold up in studies of other

  2. Sequence features and phylogenetic analysis of the stress protein Hsp90α in chinook salmon Oncorhynchus tshawytscha, a poikilothermic vertebrate

    Science.gov (United States)

    Palmisano, Aldo N.; Winton, James R.; Dickhoff, Walton W.

    1999-01-01

    We cloned and sequenced a chinook salmon Hsp90 cDNA; sequence analysis shows it to be Hsp90??. Phylogenetic analysis supports the hypothesis that ?? and ?? paralogs of Hsp90 arose as a result of a gene duplication event and that they diverged early in the evolution of vertebrates, before tetrapods separated from the teleost lineage. Among several differences distinguishing poikilothermic Hsp90?? sequences from their bird and mammal orthologs, the teleost versions specifically lack a characteristic QTQDQP phosphorylation site near the N-terminus. We used the cDNA to develop an RNA (Northern) blot to quantify cellular Hsp90 mRNA levels. Chinook salmon embryonic (CHSE-214) cells responded to heat shock with a rapid rise in Hsp90 mRNA through 4 h, followed by a gradual decline over the next 20 h. Hsp90 mRNA level may be useful as a stress indicator, especially in a laboratory setting or in response to acute heat stress.

  3. How to find soluble proteins: a comprehensive analysis of alpha/beta hydrolases for recombinant expression in E. coli

    Directory of Open Access Journals (Sweden)

    Barth Sandra

    2005-04-01

    Full Text Available Abstract Background In screening of libraries derived by expression cloning, expression of active proteins in E. coli can be limited by formation of inclusion bodies. In these cases it would be desirable to enrich gene libraries for coding sequences with soluble gene products in E. coli and thus to improve the efficiency of screening. Previously Wilkinson and Harrison showed that solubility can be predicted from amino acid composition (Biotechnology 1991, 9(5:443–448. We have applied this analysis to members of the alpha/beta hydrolase fold family to predict their solubility in E. coli. alpha/beta hydrolases are a highly diverse family with more than 1800 proteins which have been grouped into homologous families and superfamilies. Results The predicted solubility in E. coli depends on hydrolase size, phylogenetic origin of the host organism, the homologous family and the superfamily, to which the hydrolase belongs. In general small hydrolases are predicted to be more soluble than large hydrolases, and eukaryotic hydrolases are predicted to be less soluble in E. coli than prokaryotic ones. However, combining phylogenetic origin and size leads to more complex conclusions. Hydrolases from prokaryotic, fungal and metazoan origin are predicted to be most soluble if they are of small, medium and large size, respectively. We observed large variations of predicted solubility between hydrolases from different homologous families and from different taxa. Conclusion A comprehensive analysis of all alpha/beta hydrolase sequences allows more efficient screenings for new soluble alpha/beta hydrolases by the use of libraries which contain more soluble gene products. Screening of hydrolases from families whose members are hard to express as soluble proteins in E. coli should first be done in coding sequences of organisms from phylogenetic groups with the highest average of predicted solubility for proteins of this family. The tools developed here can be used

  4. Prokaryotic diversity, composition structure, and phylogenetic analysis of microbial communities in leachate sediment ecosystems.

    Science.gov (United States)

    Liu, Jingjing; Wu, Weixiang; Chen, Chongjun; Sun, Faqian; Chen, Yingxu

    2011-09-01

    In order to obtain insight into the prokaryotic diversity and community in leachate sediment, a culture-independent DNA-based molecular phylogenetic approach was performed with archaeal and bacterial 16S rRNA gene clone libraries derived from leachate sediment of an aged landfill. A total of 59 archaeal and 283 bacterial rDNA phylotypes were identified in 425 archaeal and 375 bacterial analyzed clones. All archaeal clones distributed within two archaeal phyla of the Euryarchaeota and Crenarchaeota, and well-defined methanogen lineages, especially Methanosaeta spp., are the most numerically dominant species of the archaeal community. Phylogenetic analysis of the bacterial library revealed a variety of pollutant-degrading and biotransforming microorganisms, including 18 distinct phyla. A substantial fraction of bacterial clones showed low levels of similarity with any previously documented sequences and thus might be taxonomically new. Chemical characteristics and phylogenetic inferences indicated that (1) ammonium-utilizing bacteria might form consortia to alleviate or avoid the negative influence of high ammonium concentration on other microorganisms, and (2) members of the Crenarchaeota found in the sediment might be involved in ammonium oxidation. This study is the first to report the composition of the microbial assemblages and phylogenetic characteristics of prokaryotic populations extant in leachate sediment. Additional work on microbial activity and contaminant biodegradation remains to be explored.

  5. Incompletely resolved phylogenetic trees inflate estimates of phylogenetic conservatism.

    Science.gov (United States)

    Davies, T Jonathan; Kraft, Nathan J B; Salamin, Nicolas; Wolkovich, Elizabeth M

    2012-02-01

    The tendency for more closely related species to share similar traits and ecological strategies can be explained by their longer shared evolutionary histories and represents phylogenetic conservatism. How strongly species traits co-vary with phylogeny can significantly impact how we analyze cross-species data and can influence our interpretation of assembly rules in the rapidly expanding field of community phylogenetics. Phylogenetic conservatism is typically quantified by analyzing the distribution of species values on the phylogenetic tree that connects them. Many phylogenetic approaches, however, assume a completely sampled phylogeny: while we have good estimates of deeper phylogenetic relationships for many species-rich groups, such as birds and flowering plants, we often lack information on more recent interspecific relationships (i.e., within a genus). A common solution has been to represent these relationships as polytomies on trees using taxonomy as a guide. Here we show that such trees can dramatically inflate estimates of phylogenetic conservatism quantified using S. P. Blomberg et al.'s K statistic. Using simulations, we show that even randomly generated traits can appear to be phylogenetically conserved on poorly resolved trees. We provide a simple rarefaction-based solution that can reliably retrieve unbiased estimates of K, and we illustrate our method using data on first flowering times from Thoreau's woods (Concord, Massachusetts, USA).

  6. Phylogenetic relationships of Hemiptera inferred from mitochondrial and nuclear genes.

    Science.gov (United States)

    Song, Nan; Li, Hu; Cai, Wanzhi; Yan, Fengming; Wang, Jianyun; Song, Fan

    2016-11-01

    Here, we reconstructed the Hemiptera phylogeny based on the expanded mitochondrial protein-coding genes and the nuclear 18S rRNA gene, separately. The differential rates of change across lineages may associate with long-branch attraction (LBA) effect and result in conflicting estimates of phylogeny from different types of data. To reduce the potential effects of systematic biases on inferences of topology, various data coding schemes, site removal method, and different algorithms were utilized in phylogenetic reconstruction. We show that the outgroups Phthiraptera, Thysanoptera, and the ingroup Sternorrhyncha share similar base composition, and exhibit "long branches" relative to other hemipterans. Thus, the long-branch attraction between these groups is suspected to cause the failure of recovering Hemiptera under the homogeneous model. In contrast, a monophyletic Hemiptera is supported when heterogeneous model is utilized in the analysis. Although higher level phylogenetic relationships within Hemiptera remain to be answered, consensus between analyses is beginning to converge on a stable phylogeny.

  7. Pylogeny: an open-source Python framework for phylogenetic tree reconstruction and search space heuristics

    Directory of Open Access Journals (Sweden)

    Alexander Safatli

    2015-06-01

    Full Text Available Summary. Pylogeny is a cross-platform library for the Python programming language that provides an object-oriented application programming interface for phylogenetic heuristic searches. Its primary function is to permit both heuristic search and analysis of the phylogenetic tree search space, as well as to enable the design of novel algorithms to search this space. To this end, the framework supports the structural manipulation of phylogenetic trees, in particular using rearrangement operators such as NNI, SPR, and TBR, the scoring of trees using parsimony and likelihood methods, the construction of a tree search space graph, and the programmatic execution of a few existing heuristic programs. The library supports a range of common phylogenetic file formats and can be used for both nucleotide and protein data. Furthermore, it is also capable of supporting GPU likelihood calculation on nucleotide character data through the BEAGLE library.Availability. Existing development and source code is available for contribution and for download by the public from GitHub (http://github.com/AlexSafatli/Pylogeny. A stable release of this framework is available for download through PyPi (Python Package Index at http://pypi.python.org/pypi/pylogeny.

  8. PyElph - a software tool for gel images analysis and phylogenetics

    Directory of Open Access Journals (Sweden)

    Pavel Ana Brânduşa

    2012-01-01

    Full Text Available Abstract Background This paper presents PyElph, a software tool which automatically extracts data from gel images, computes the molecular weights of the analyzed molecules or fragments, compares DNA patterns which result from experiments with molecular genetic markers and, also, generates phylogenetic trees computed by five clustering methods, using the information extracted from the analyzed gel image. The software can be successfully used for population genetics, phylogenetics, taxonomic studies and other applications which require gel image analysis. Researchers and students working in molecular biology and genetics would benefit greatly from the proposed software because it is free, open source, easy to use, has a friendly Graphical User Interface and does not depend on specific image acquisition devices like other commercial programs with similar functionalities do. Results PyElph software tool is entirely implemented in Python which is a very popular programming language among the bioinformatics community. It provides a very friendly Graphical User Interface which was designed in six steps that gradually lead to the results. The user is guided through the following steps: image loading and preparation, lane detection, band detection, molecular weights computation based on a molecular weight marker, band matching and finally, the computation and visualization of phylogenetic trees. A strong point of the software is the visualization component for the processed data. The Graphical User Interface provides operations for image manipulation and highlights lanes, bands and band matching in the analyzed gel image. All the data and images generated in each step can be saved. The software has been tested on several DNA patterns obtained from experiments with different genetic markers. Examples of genetic markers which can be analyzed using PyElph are RFLP (Restriction Fragment Length Polymorphism, AFLP (Amplified Fragment Length Polymorphism, RAPD

  9. Phylogenetically informed logic relationships improve detection of biological network organization

    Science.gov (United States)

    2011-01-01

    Background A "phylogenetic profile" refers to the presence or absence of a gene across a set of organisms, and it has been proven valuable for understanding gene functional relationships and network organization. Despite this success, few studies have attempted to search beyond just pairwise relationships among genes. Here we search for logic relationships involving three genes, and explore its potential application in gene network analyses. Results Taking advantage of a phylogenetic matrix constructed from the large orthologs database Roundup, we invented a method to create balanced profiles for individual triplets of genes that guarantee equal weight on the different phylogenetic scenarios of coevolution between genes. When we applied this idea to LAPP, the method to search for logic triplets of genes, the balanced profiles resulted in significant performance improvement and the discovery of hundreds of thousands more putative triplets than unadjusted profiles. We found that logic triplets detected biological network organization and identified key proteins and their functions, ranging from neighbouring proteins in local pathways, to well separated proteins in the whole pathway, and to the interactions among different pathways at the system level. Finally, our case study suggested that the directionality in a logic relationship and the profile of a triplet could disclose the connectivity between the triplet and surrounding networks. Conclusion Balanced profiles are superior to the raw profiles employed by traditional methods of phylogenetic profiling in searching for high order gene sets. Gene triplets can provide valuable information in detection of biological network organization and identification of key genes at different levels of cellular interaction. PMID:22172058

  10. Complete mitochondrial genome of four pheretimoid earthworms (Clitellata: Oligochaeta) and their phylogenetic reconstruction.

    Science.gov (United States)

    Zhang, Liangliang; Jiang, Jibao; Dong, Yan; Qiu, Jiangping

    2015-12-15

    Among oligochaetes, the Pheretima complex within the Megascolecidae is a major earthworm group. Recently, however, the systematics of the Pheretima complex based on morphology are challenged by molecular studies. Since little comparative analysis of earthworm complete mitochondrial genomes has been reported yet, we sequenced mitogenomes of four pheretimoid earthworm species to explore their phylogenetic relationships. The general earthworm genomic features are also found in four earthworms: all genes transcribed from the same strand, the same initiation codon ATG for each PCGs, and conserved structures of RNA genes. Interestingly we find an extra potential tRNA-leucine (CUN) in Amynthas longisiphonus. The earthworm mitochondrial ATP8 exhibits the highest evolutionary rate, while the gene CO1 evolves slowest. Phylogenetic analysis based on protein-coding genes (PCGs) strongly supports the monophyly of the Clitellata, Hirudinea, Oligochaeta, Megascolecidae and Pheretima complex. Our analysis, however, reveals non-monophyly within the genara Amynthas and Metaphire. Thus the generic divisions based on morphology in the Pheretima complex should be reconsidered. Copyright © 2015 Elsevier B.V. All rights reserved.

  11. Estimating phylogenetic trees from genome-scale data.

    Science.gov (United States)

    Liu, Liang; Xi, Zhenxiang; Wu, Shaoyuan; Davis, Charles C; Edwards, Scott V

    2015-12-01

    The heterogeneity of signals in the genomes of diverse organisms poses challenges for traditional phylogenetic analysis. Phylogenetic methods known as "species tree" methods have been proposed to directly address one important source of gene tree heterogeneity, namely the incomplete lineage sorting that occurs when evolving lineages radiate rapidly, resulting in a diversity of gene trees from a single underlying species tree. Here we review theory and empirical examples that help clarify conflicts between species tree and concatenation methods, and misconceptions in the literature about the performance of species tree methods. Considering concatenation as a special case of the multispecies coalescent model helps explain differences in the behavior of the two methods on phylogenomic data sets. Recent work suggests that species tree methods are more robust than concatenation approaches to some of the classic challenges of phylogenetic analysis, including rapidly evolving sites in DNA sequences and long-branch attraction. We show that approaches, such as binning, designed to augment the signal in species tree analyses can distort the distribution of gene trees and are inconsistent. Computationally efficient species tree methods incorporating biological realism are a key to phylogenetic analysis of whole-genome data. © 2015 New York Academy of Sciences.

  12. Phylogenetic analysis of rabbit haemorrhagic disease virus (RHDV) strains isolated in Poland.

    Science.gov (United States)

    Fitzner, Andrzej; Niedbalski, Wieslaw

    2017-10-01

    The aim of this study was to characterise the nucleotide and amino acid sequence of complete genomes (7.5 kb) from RHDV strains isolated in Poland and estimate the genetic variability in different elements of the viral RNA. In addition, the sequence of Polish RHDV isolates isolated from 1988-2015 was compared with the sequences of other European RHDV, including the RHDVa and RHDV2/RHDVb subtypes. The complete sequence was developed by the compilation of partial nucleotide sequences. This sequence consisted of approximately 7428 nucleotides. For comparison of nucleotide sequences and the development of phylogenetic trees of Polish RHDV isolates and reference RHDV strains representing the main phylogenetic groups of classical RHDV, RHDVa and RHDV2 as well as the non-pathogenic rabbit lagovirus RCV, the BLAST software with blastn and MEGA6 with neighbour-joining method was applied. The complete nucleotide sequence of Polish isolates of RHDV has also been entered into GenBank. For comparative analysis, nineteen complete sequences representing the main RHDV genetic types available in GenBank were used. The results of phylogenetic analysis of Polish RHDV strains reveals the presence of three classical RHDV genogroups (G2, G4 and G5) and an RHDVa variant (G6). The oldest RHDV isolates (KGM 1988, PD 1989 and MAL 1994) belong to genogroup G2. It can be assumed that the elimination of these strains from the environment probably occurred at the turn of 1994 and 1995. Genogroup G2 was replaced by the phylogenetically younger BLA 1994 and OPO 2004 strains from genogroup G4, which probably originated from the G3 lineage, represented by the Italian strains BS89. The last representatives of classical RHDV in Poland are isolates GSK 1988 and ZD0 2000 from genogroup G5. A single clade contains the Polish RHDV strains from 2004-2015 (GRZ 2004, KRY 2004, L145 2004, W147 2005, SKO 2013, GLE 2013, RED1 2013, STR 2012, STR2 2013, STR 2014, BIE 2015) identified as RHDVa, which clustered

  13. Phylogenetic analysis of vitamin B12-related metabolism in Mycobacterium tuberculosis

    Directory of Open Access Journals (Sweden)

    Douglas B. Young

    2015-03-01

    Full Text Available Comparison of genome sequences from clinical isolates of Mycobacterium tuberculosis with phylogenetically-related pathogens Mycobacterium marinum, Mycobacterium kansasii and Mycobacterium leprae reveals diversity amongst genes associated with vitamin B12-related metabolism. Diversity is generated by gene deletion events, differential acquisition of genes by horizontal transfer, and single nucleotide polymorphisms with predicted impact on protein function and transcriptional regulation. Differences in the B12 synthesis pathway, methionine biosynthesis, fatty acid catabolism, and DNA repair and replication are consistent with adaptations to different environmental niches and pathogenic lifestyles. While there is no evidence of further gene acquisition during expansion of the M. tuberculosis complex, the emergence of other forms of genetic diversity provides insights into continuing host-pathogen co-evolution and has the potential to identify novel targets for disease intervention.

  14. Molecular phylogenetic analysis of non-sexually transmitted strains of Haemophilus ducreyi.

    Science.gov (United States)

    Gaston, Jordan R; Roberts, Sally A; Humphreys, Tricia L

    2015-01-01

    Haemophilus ducreyi, the etiologic agent of chancroid, has been previously reported to show genetic variance in several key virulence factors, placing strains of the bacterium into two genetically distinct classes. Recent studies done in yaws-endemic areas of the South Pacific have shown that H. ducreyi is also a major cause of cutaneous limb ulcers (CLU) that are not sexually transmitted. To genetically assess CLU strains relative to the previously described class I, class II phylogenetic hierarchy, we examined nucleotide sequence diversity at 11 H. ducreyi loci, including virulence and housekeeping genes, which encompass approximately 1% of the H. ducreyi genome. Sequences for all 11 loci indicated that strains collected from leg ulcers exhibit DNA sequences homologous to class I strains of H. ducreyi. However, sequences for 3 loci, including a hemoglobin receptor (hgbA), serum resistance protein (dsrA), and a collagen adhesin (ncaA) contained informative amounts of variation. Phylogenetic analyses suggest that these non-sexually transmitted strains of H. ducreyi comprise a sub-clonal population within class I strains of H. ducreyi. Molecular dating suggests that CLU strains are the most recently developed, having diverged approximately 0.355 million years ago, fourteen times more recently than the class I/class II divergence. The CLU strains' divergence falls after the divergence of humans from chimpanzees, making it the first known H. ducreyi divergence event directly influenced by the selective pressures accompanying human hosts.

  15. Molecular phylogenetic analysis of non-sexually transmitted strains of Haemophilus ducreyi.

    Directory of Open Access Journals (Sweden)

    Jordan R Gaston

    Full Text Available Haemophilus ducreyi, the etiologic agent of chancroid, has been previously reported to show genetic variance in several key virulence factors, placing strains of the bacterium into two genetically distinct classes. Recent studies done in yaws-endemic areas of the South Pacific have shown that H. ducreyi is also a major cause of cutaneous limb ulcers (CLU that are not sexually transmitted. To genetically assess CLU strains relative to the previously described class I, class II phylogenetic hierarchy, we examined nucleotide sequence diversity at 11 H. ducreyi loci, including virulence and housekeeping genes, which encompass approximately 1% of the H. ducreyi genome. Sequences for all 11 loci indicated that strains collected from leg ulcers exhibit DNA sequences homologous to class I strains of H. ducreyi. However, sequences for 3 loci, including a hemoglobin receptor (hgbA, serum resistance protein (dsrA, and a collagen adhesin (ncaA contained informative amounts of variation. Phylogenetic analyses suggest that these non-sexually transmitted strains of H. ducreyi comprise a sub-clonal population within class I strains of H. ducreyi. Molecular dating suggests that CLU strains are the most recently developed, having diverged approximately 0.355 million years ago, fourteen times more recently than the class I/class II divergence. The CLU strains' divergence falls after the divergence of humans from chimpanzees, making it the first known H. ducreyi divergence event directly influenced by the selective pressures accompanying human hosts.

  16. Phylogenetic and Diversity Analysis of Dactylis glomerata Subspecies Using SSR and IT-ISJ Markers.

    Science.gov (United States)

    Yan, Defei; Zhao, Xinxin; Cheng, Yajuan; Ma, Xiao; Huang, Linkai; Zhang, Xinquan

    2016-10-31

    The genus Dactylis , an important forage crop, has a wide geographical distribution in temperate regions. While this genus is thought to include a single species, Dactylis glomerata , this species encompasses many subspecies whose relationships have not been fully characterized. In this study, the genetic diversity and phylogenetic relationships of nine representative Dactylis subspecies were examined using SSR and IT-ISJ markers. In total, 21 pairs of SSR primers and 15 pairs of IT-ISJ primers were used to amplify 295 polymorphic bands with polymorphic rates of 100%. The average polymorphic information contents (PICs) of SSR and IT-ISJ markers were 0.909 and 0.780, respectively. The combined data of the two markers indicated a high level of genetic diversity among the nine D. glomerata subspecies, with a Nei's gene diversity index value of 0.283 and Shannon's diversity of 0.448. Preliminarily phylogenetic analysis results revealed that the 20 accessions could be divided into three groups (A, B, C). Furthermore, they could be divided into five clusters, which is similar to the structure analysis with K = 5. Phylogenetic placement in these three groups may be related to the distribution ranges and the climate types of the subspecies in each group. Group A contained eight accessions of four subspecies, originating from the west Mediterranean, while Group B contained seven accessions of three subspecies, originating from the east Mediterranean.

  17. Phylogenetic analysis and taxonomic revision of Physodactylinae (Coleoptera, Elateridae

    Directory of Open Access Journals (Sweden)

    Simone Policena Rosa

    2014-01-01

    Full Text Available A phylogeny based on male morphological characters and taxonomic revision of the Physodactylinae genera are presented. The phylogenetic analysis based on 66 male characters resulted in the polyphyly of Physodactylinae which comprises four independent lineages. Oligostethius and Idiotropia from Africa were found to be sister groups. Teslasena from Brazil was corroborated as belonging to Cardiophorinae clade. The South American genera Physodactylus and Dactylophysus were found to be sister groups and phylogenetically related to Heterocrepidius species. The Oriental Toxognathus resulted as sister group of that clade plus (Dicrepidius ramicornis (Lissomus sp, Physorhynus erythrocephalus. Taxonomic revisions include diagnoses and redescriptions of genera and distributional records and illustrations of species. Key to species of Teslasena, Toxognathus, Dactylophysus and Physodactylus are also provided. Teslasena lucasi is synonymized with T. femoralis. A new species of Dactylophysus is described, D. hirtus sp. nov., and lectotypes are designated to non-conspecific D. mendax sensu Fleutiaux and Heterocrepidius mendax Candèze. Physodactylus niger is removed from synonymy under P. oberthuri; P. carreti is synonymized with P. niger; P. obesus and P. testaceus are synonymized with P. sulcatus. Nine new species are described in Physodactylus: P. asper sp. nov., P. brunneus sp. nov., P. chassaini sp. nov., P. flavifrons sp. nov., P. girardi sp. nov., P. gounellei sp. nov., P. latithorax sp. nov., P. patens sp. nov. and P. tuberculatus sp. nov.

  18. Statistical assignment of DNA sequences using Bayesian phylogenetics

    DEFF Research Database (Denmark)

    Terkelsen, Kasper Munch; Boomsma, Wouter Krogh; Huelsenbeck, John P.

    2008-01-01

    We provide a new automated statistical method for DNA barcoding based on a Bayesian phylogenetic analysis. The method is based on automated database sequence retrieval, alignment, and phylogenetic analysis using a custom-built program for Bayesian phylogenetic analysis. We show on real data...... that the method outperforms Blast searches as a measure of confidence and can help eliminate 80% of all false assignment based on best Blast hit. However, the most important advance of the method is that it provides statistically meaningful measures of confidence. We apply the method to a re......-analysis of previously published ancient DNA data and show that, with high statistical confidence, most of the published sequences are in fact of Neanderthal origin. However, there are several cases of chimeric sequences that are comprised of a combination of both Neanderthal and modern human DNA....

  19. Phylogenetic analysis of Gossypium L. using restriction fragment length polymorphism of repeated sequences.

    Science.gov (United States)

    Zhang, Meiping; Rong, Ying; Lee, Mi-Kyung; Zhang, Yang; Stelly, David M; Zhang, Hong-Bin

    2015-10-01

    Cotton is the world's leading textile fiber crop and is also grown as a bioenergy and food crop. Knowledge of the phylogeny of closely related species and the genome origin and evolution of polyploid species is significant for advanced genomics research and breeding. We have reconstructed the phylogeny of the cotton genus, Gossypium L., and deciphered the genome origin and evolution of its five polyploid species by restriction fragment analysis of repeated sequences. Nuclear DNA of 84 accessions representing 35 species and all eight genomes of the genus were analyzed. The phylogenetic tree of the genus was reconstructed using the parsimony method on 1033 polymorphic repeated sequence restriction fragments. The genome origin of its polyploids was determined by calculating the diploid-polyploid restriction fragment correspondence (RFC). The tree is consistent with the morphological classification, genome designation and geographic distribution of the species at subgenus, section and subsection levels. Gossypium lobatum (D7) was unambiguously shown to have the highest RFC with the D-subgenomes of all five polyploids of the genus, while the common ancestor of Gossypium herbaceum (A1) and Gossypium arboreum (A2) likely contributed to the A-subgenomes of the polyploids. These results provide a comprehensive phylogenetic tree of the cotton genus and new insights into the genome origin and evolution of its polyploid species. The results also further demonstrate a simple, rapid and inexpensive method suitable for phylogenetic analysis of closely related species, especially congeneric species, and the inference of genome origin of polyploids that constitute over 70 % of flowering plants.

  20. Large-Scale Genomic Analysis of Codon Usage in Dengue Virus and Evaluation of Its Phylogenetic Dependence

    Science.gov (United States)

    Lara-Ramírez, Edgar E.; Salazar, Ma Isabel; López-López, María de Jesús; Salas-Benito, Juan Santiago; Sánchez-Varela, Alejandro

    2014-01-01

    The increasing number of dengue virus (DENV) genome sequences available allows identifying the contributing factors to DENV evolution. In the present study, the codon usage in serotypes 1–4 (DENV1–4) has been explored for 3047 sequenced genomes using different statistics methods. The correlation analysis of total GC content (GC) with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3) as well as the effective number of codons (ENC, ENCp) versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide position in the codon. The correspondence analysis (CA) and clustering analysis on relative synonymous codon usage (RSCU) within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our analysis of a large scale reveals new feature on DENV genomic evolution. PMID:25136631

  1. PHYLOGENETIC ANALYSIS OF LEARNING-RELATED NEUROMODULATION IN MOLLUSCAN MECHANOSENSORY NEURONS.

    Science.gov (United States)

    Wright, William G; Kirschman, David; Rozen, Danny; Maynard, Barbara

    1996-12-01

    In spite of significant advances in our understanding of mechanisms of learning and memory in a variety of organisms, little is known about how such mechanisms evolve. Even mechanisms of simple forms of learning, such as habituation and sensitization, have not been studied phylogenetically. Here we begin an evolutionary analysis of learning-related neuromodulation in species related to the well-studied opisthobranch gastropod, Aplysia californica. In Aplysia, increased spike duration and excitability in mechanosensory neurons contribute to several forms of learning-related changes to defensive withdrawal reflexes. The modulatory transmitter serotonin (5-hydroxytryptamine, or 5-HT), is thought to play a critical role in producing these firing property changes. In the present study, we tested mechanosensory homologs of the tail-withdrawal reflex in species related to Aplysia for 5-HT-mediated increases in spike duration and excitability. Criteria used to identify homologous tail-sensory neurons included position, relative size, resting electrical properties, expression of a sensory neuron-specific protein, neuroanatomy, and receptive field. The four ingroup species studied (Aplysia californica, Dolabella auricularia, Bursatella leachii, and Dolabrifera dolabrifera) belong to two clades (two species each) within the family Aplysiidae. In the first clade (Aplysia/Dolabella), we found that the tail-sensory neurons of A. californica and tail-sensory homologs of a closely related species, D. auricularia, responded to bath-applied serotonin in essentially similar fashion: significant increases in spike duration as well as excitability. In the other clade (Dolabrifera/Bursatella), more distantly related to Aplysia, one species (B. leachii) showed spike broadening and increased excitability. However, the other species (D. dolabrifera) showed neither spike broadening nor increased excitability. The firing properties of tail-sensory homologs of D. dolabrifera were insensitive

  2. Metagenomics and the protein universe

    Science.gov (United States)

    Godzik, Adam

    2011-01-01

    Metagenomics sequencing projects have dramatically increased our knowledge of the protein universe and provided over one-half of currently known protein sequences; they have also introduced a much broader phylogenetic diversity into the protein databases. The full analysis of metagenomic datasets is only beginning, but it has already led to the discovery of thousands of new protein families, likely representing novel functions specific to given environments. At the same time, a deeper analysis of such novel families, including experimental structure determination of some representatives, suggests that most of them represent distant homologs of already characterized protein families, and thus most of the protein diversity present in the new environments are due to functional divergence of the known protein families rather than the emergence of new ones. PMID:21497084

  3. YBYRÁ facilitates comparison of large phylogenetic trees.

    Science.gov (United States)

    Machado, Denis Jacob

    2015-07-01

    The number and size of tree topologies that are being compared by phylogenetic systematists is increasing due to technological advancements in high-throughput DNA sequencing. However, we still lack tools to facilitate comparison among phylogenetic trees with a large number of terminals. The "YBYRÁ" project integrates software solutions for data analysis in phylogenetics. It comprises tools for (1) topological distance calculation based on the number of shared splits or clades, (2) sensitivity analysis and automatic generation of sensitivity plots and (3) clade diagnoses based on different categories of synapomorphies. YBYRÁ also provides (4) an original framework to facilitate the search for potential rogue taxa based on how much they affect average matching split distances (using MSdist). YBYRÁ facilitates comparison of large phylogenetic trees and outperforms competing software in terms of usability and time efficiency, specially for large data sets. The programs that comprises this toolkit are written in Python, hence they do not require installation and have minimum dependencies. The entire project is available under an open-source licence at http://www.ib.usp.br/grant/anfibios/researchSoftware.html .

  4. Marine turtle mitogenome phylogenetics and evolution

    DEFF Research Database (Denmark)

    Duchene, Sebastián; Frey, Amy; Alfaro-Núñez, Luis Alonso

    2012-01-01

    The sea turtles are a group of cretaceous origin containing seven recognized living species: leatherback, hawksbill, Kemp's ridley, olive ridley, loggerhead, green, and flatback. The leatherback is the single member of the Dermochelidae family, whereas all other sea turtles belong in Cheloniidae...... distributions, shedding light on complex migration patterns and possible geographic or climatic events as driving forces of sea-turtle distribution. We have sequenced complete mitogenomes for all sea-turtle species, including samples from their geographic range extremes, and performed phylogenetic analyses...... to assess sea-turtle evolution with a large molecular dataset. We found variation in the length of the ATP8 gene and a highly variable site in ND4 near a proton translocation channel in the resulting protein. Complete mitogenomes show strong support and resolution for phylogenetic relationships among all...

  5. Analysis of Comparative Sequence and Genomic Data to Verify Phylogenetic Relationship and Explore a New Subfamily of Bacterial Lipases.

    Directory of Open Access Journals (Sweden)

    Malihe Masomian

    Full Text Available Thermostable and organic solvent-tolerant enzymes have significant potential in a wide range of synthetic reactions in industry due to their inherent stability at high temperatures and their ability to endure harsh organic solvents. In this study, a novel gene encoding a true lipase was isolated by construction of a genomic DNA library of thermophilic Aneurinibacillus thermoaerophilus strain HZ into Escherichia coli plasmid vector. Sequence analysis revealed that HZ lipase had 62% identity to putative lipase from Bacillus pseudomycoides. The closely characterized lipases to the HZ lipase gene are from thermostable Bacillus and Geobacillus lipases belonging to the subfamily I.5 with ≤ 57% identity. The amino acid sequence analysis of HZ lipase determined a conserved pentapeptide containing the active serine, GHSMG and a Ca(2+-binding motif, GCYGSD in the enzyme. Protein structure modeling showed that HZ lipase consisted of an α/β hydrolase fold and a lid domain. Protein sequence alignment, conserved regions analysis, clustal distance matrix and amino acid composition illustrated differences between HZ lipase and other thermostable lipases. Phylogenetic analysis revealed that this lipase represented a new subfamily of family I of bacterial true lipases, classified as family I.9. The HZ lipase was expressed under promoter Plac using IPTG and was characterized. The recombinant enzyme showed optimal activity at 65 °C and retained ≥ 97% activity after incubation at 50 °C for 1h. The HZ lipase was stable in various polar and non-polar organic solvents.

  6. Amino Acids Sequence Based in Silico Analysis of RuBisCO (Ribulose-1,5 Bisphosphate Carboxylase Oxygenase Proteins in Some Carthamus L. ssp.

    Directory of Open Access Journals (Sweden)

    Emre SEVİNDİK

    2017-06-01

    Full Text Available RuBisCO is an important enzyme for plants to photosynthesize and balance carbon dioxide in the atmosphere. This study aimed to perform sequence, physicochemical, phylogenetic and 3D (three-dimensional comparative analyses of RuBisCO proteins in the Carthamus ssp. using various bioinformatics tools. The sequence lengths of the RuBisCO proteins were between 166 and 477 amino acids, with an average length of 411.8 amino acids. Their molecular weights (Mw ranged from 18711.47 to 52843.09 Da; the most acidic and basic protein sequences were detected in C. tinctorius (pI = 5.99 and in C. tenuis (pI = 6.92, respectively. The extinction coefficients of RuBisCO proteins at 280 nm ranged from 17,670 to 69,830 M-1 cm-1, the instability index (II values for RuBisCO proteins ranged from 33.31 to 39.39, while the GRAVY values of RuBisCO proteins ranged from -0.313 to -0.250. The most abundant amino acid in the RuBisCO protein was Gly (9.7%, while the least amino acid ratio was Trp (1.6 %. The putative phosphorylation sites of RuBisCO proteins were determined by NetPhos 2.0. Phylogenetic analysis revealed that RuBisCO proteins formed two main clades. A RAMPAGE analysis revealed that 96.3%-97.6% of residues were located in the favoured region of RuBisCO proteins. To predict the three dimensional (3D structure of the RuBisCO proteins PyMOL was used. The results of the current study provide insights into fundamental characteristic of RuBisCO proteins in Carthamus ssp.

  7. Phylogenetic distribution and membrane topology of the LytR-CpsA-Psr protein family

    Directory of Open Access Journals (Sweden)

    Berger-Bächi Brigitte

    2008-12-01

    Full Text Available Abstract Background The bacterial cell wall is the target of many antibiotics and cell envelope constituents are critical to host-pathogen interactions. To combat resistance development and virulence, a detailed knowledge of the individual factors involved is essential. Members of the LytR-CpsA-Psr family of cell envelope-associated attenuators are relevant for β-lactam resistance, biofilm formation, and stress tolerance, and they are suggested to play a role in cell wall maintenance. However, their precise function is still unknown. This study addresses the occurrence as well as sequence-based characteristics of the LytR-CpsA-Psr proteins. Results A comprehensive list of LytR-CpsA-Psr proteins was established, and their phylogenetic distribution and clustering into subgroups was determined. LytR-CpsA-Psr proteins were present in all Gram-positive organisms, except for the cell wall-deficient Mollicutes and one strain of the Clostridiales. In contrast, the majority of Gram-negatives did not contain LytR-CpsA-Psr family members. Despite high sequence divergence, the LytR-CpsA-Psr domains of different subclusters shared a highly similar, predicted mixed a/β-structure, and conserved charged residues. PhoA fusion experiments, using MsrR of Staphylococcus aureus, confirmed membrane topology predictions and extracellular location of its LytR-CpsA-Psr domain. Conclusion The LytR-CpsA-Psr domain is unique to bacteria. The presence of diverse subgroups within the LytR-CpsA-Psr family might indicate functional differences, and could explain variations in phenotypes of respective mutants reported. The identified conserved structural elements and amino acids are likely to be important for the function of the domain and will help to guide future studies of the LytR-CpsA-Psr proteins.

  8. NGS combined with phylogenetic analysis to detect HIV-1 dual infection in Romanian people who inject drugs.

    Science.gov (United States)

    Popescu, Bogdan; Banica, Leontina; Nicolae, Ionelia; Radu, Eugen; Niculescu, Iulia; Abagiu, Adrian; Otelea, Dan; Paraschiv, Simona

    2018-04-04

    Dual HIV infections are possible and likely in people who inject drugs (PWID). Thirty-eight newly diagnosed patients, 19 PWID and 19 heterosexually HIV infected were analysed. V2-V3 loop of HIV-1 env gene was sequenced on the NGS platform 454 GSJunior (Roche). HIV-1 dual/multiple infections were identified in five PWID. For three of these patients, the reconstructed variants belonged to pure F1 subtype and CRF14_BG strains according to phylogenetic analysis. New recombinant forms between these parental strains were identified in two PWID samples. NGS data can provide, with the help of phylogenetic analysis, important insights about the intra-host sub-population structure. Copyright © 2018. Published by Elsevier Masson SAS.

  9. Functional & phylogenetic diversity of copepod communities

    Science.gov (United States)

    Benedetti, F.; Ayata, S. D.; Blanco-Bercial, L.; Cornils, A.; Guilhaumon, F.

    2016-02-01

    The diversity of natural communities is classically estimated through species identification (taxonomic diversity) but can also be estimated from the ecological functions performed by the species (functional diversity), or from the phylogenetic relationships among them (phylogenetic diversity). Estimating functional diversity requires the definition of specific functional traits, i.e., phenotypic characteristics that impact fitness and are relevant to ecosystem functioning. Estimating phylogenetic diversity requires the description of phylogenetic relationships, for instance by using molecular tools. In the present study, we focused on the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. First, we implemented a specific trait database for the most commonly-sampled and abundant copepod species of the Mediterranean Sea. Our database includes 191 species, described by seven traits encompassing diverse ecological functions: minimal and maximal body length, trophic group, feeding type, spawning strategy, diel vertical migration and vertical habitat. Clustering analysis in the functional trait space revealed that Mediterranean copepods can be gathered into groups that have different ecological roles. Second, we reconstructed a phylogenetic tree using the available sequences of 18S rRNA. Our tree included 154 of the analyzed Mediterranean copepod species. We used these two datasets to describe the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. The replacement component (turn-over) and the species richness difference component (nestedness) of the beta diversity indices were identified. Finally, by comparing various and complementary aspects of plankton diversity (taxonomic, functional, and phylogenetic diversity) we were able to gain a better understanding of the relationships among the zooplankton community, biodiversity, ecosystem function, and environmental forcing.

  10. Phylogenetic analyses of Vitis (Vitaceae) based on complete chloroplast genome sequences: effects of taxon sampling and phylogenetic methods on resolving relationships among rosids.

    Science.gov (United States)

    Jansen, Robert K; Kaittanis, Charalambos; Saski, Christopher; Lee, Seung-Bum; Tomkins, Jeffrey; Alverson, Andrew J; Daniell, Henry

    2006-04-09

    The Vitaceae (grape) is an economically important family of angiosperms whose phylogenetic placement is currently unresolved. Recent phylogenetic analyses based on one to several genes have suggested several alternative placements of this family, including sister to Caryophyllales, asterids, Saxifragales, Dilleniaceae or to rest of rosids, though support for these different results has been weak. There has been a recent interest in using complete chloroplast genome sequences for resolving phylogenetic relationships among angiosperms. These studies have clarified relationships among several major lineages but they have also emphasized the importance of taxon sampling and the effects of different phylogenetic methods for obtaining accurate phylogenies. We sequenced the complete chloroplast genome of Vitis vinifera and used these data to assess relationships among 27 angiosperms, including nine taxa of rosids. The Vitis vinifera chloroplast genome is 160,928 bp in length, including a pair of inverted repeats of 26,358 bp that are separated by small and large single copy regions of 19,065 bp and 89,147 bp, respectively. The gene content and order of Vitis is identical to many other unrearranged angiosperm chloroplast genomes, including tobacco. Phylogenetic analyses using maximum parsimony and maximum likelihood were performed on DNA sequences of 61 protein-coding genes for two datasets with 28 or 29 taxa, including eight or nine taxa from four of the seven currently recognized major clades of rosids. Parsimony and likelihood phylogenies of both data sets provide strong support for the placement of Vitaceae as sister to the remaining rosids. However, the position of the Myrtales and support for the monophyly of the eurosid I clade differs between the two data sets and the two methods of analysis. In parsimony analyses, the inclusion of Gossypium is necessary to obtain trees that support the monophyly of the eurosid I clade. However, maximum likelihood analyses place

  11. Phylogenetic analyses of Vitis (Vitaceae based on complete chloroplast genome sequences: effects of taxon sampling and phylogenetic methods on resolving relationships among rosids

    Directory of Open Access Journals (Sweden)

    Alverson Andrew J

    2006-04-01

    Full Text Available Abstract Background The Vitaceae (grape is an economically important family of angiosperms whose phylogenetic placement is currently unresolved. Recent phylogenetic analyses based on one to several genes have suggested several alternative placements of this family, including sister to Caryophyllales, asterids, Saxifragales, Dilleniaceae or to rest of rosids, though support for these different results has been weak. There has been a recent interest in using complete chloroplast genome sequences for resolving phylogenetic relationships among angiosperms. These studies have clarified relationships among several major lineages but they have also emphasized the importance of taxon sampling and the effects of different phylogenetic methods for obtaining accurate phylogenies. We sequenced the complete chloroplast genome of Vitis vinifera and used these data to assess relationships among 27 angiosperms, including nine taxa of rosids. Results The Vitis vinifera chloroplast genome is 160,928 bp in length, including a pair of inverted repeats of 26,358 bp that are separated by small and large single copy regions of 19,065 bp and 89,147 bp, respectively. The gene content and order of Vitis is identical to many other unrearranged angiosperm chloroplast genomes, including tobacco. Phylogenetic analyses using maximum parsimony and maximum likelihood were performed on DNA sequences of 61 protein-coding genes for two datasets with 28 or 29 taxa, including eight or nine taxa from four of the seven currently recognized major clades of rosids. Parsimony and likelihood phylogenies of both data sets provide strong support for the placement of Vitaceae as sister to the remaining rosids. However, the position of the Myrtales and support for the monophyly of the eurosid I clade differs between the two data sets and the two methods of analysis. In parsimony analyses, the inclusion of Gossypium is necessary to obtain trees that support the monophyly of the eurosid I clade

  12. Phylogenetic turnover during subtropical forest succession across environmental and phylogenetic scales.

    Science.gov (United States)

    Purschke, Oliver; Michalski, Stefan G; Bruelheide, Helge; Durka, Walter

    2017-12-01

    Although spatial and temporal patterns of phylogenetic community structure during succession are inherently interlinked and assembly processes vary with environmental and phylogenetic scales, successional studies of community assembly have yet to integrate spatial and temporal components of community structure, while accounting for scaling issues. To gain insight into the processes that generate biodiversity after disturbance, we combine analyses of spatial and temporal phylogenetic turnover across phylogenetic scales, accounting for covariation with environmental differences. We compared phylogenetic turnover, at the species- and individual-level, within and between five successional stages, representing woody plant communities in a subtropical forest chronosequence. We decomposed turnover at different phylogenetic depths and assessed its covariation with between-plot abiotic differences. Phylogenetic turnover between stages was low relative to species turnover and was not explained by abiotic differences. However, within the late-successional stages, there was high presence-/absence-based turnover (clustering) that occurred deep in the phylogeny and covaried with environmental differentiation. Our results support a deterministic model of community assembly where (i) phylogenetic composition is constrained through successional time, but (ii) toward late succession, species sorting into preferred habitats according to niche traits that are conserved deep in phylogeny, becomes increasingly important.

  13. Aujeszky's disease in red fox (Vulpes vulpes): phylogenetic analysis unravels an unexpected epidemiologic link.

    Science.gov (United States)

    Caruso, Claudio; Dondo, Alessandro; Cerutti, Francesco; Masoero, Loretta; Rosamilia, Alfonso; Zoppi, Simona; D'Errico, Valeria; Grattarola, Carla; Acutis, Pier Luigi; Peletto, Simone

    2014-07-01

    We describe Aujeszky's disease in a female of red fox (Vulpes vulpes). Although wild boar (Sus scrofa) would be the expected source of infection, phylogenetic analysis suggested a domestic rather than a wild source of virus, underscoring the importance of biosecurity measures in pig farms to prevent contact with wild animals.

  14. HIV forensics: pitfalls and acceptable standards in the use of phylogenetic analysis as evidence in criminal investigations of HIV transmission.

    Science.gov (United States)

    Bernard, E J; Azad, Y; Vandamme, A M; Weait, M; Geretti, A M

    2007-09-01

    Phylogenetic analysis - the study of the genetic relatedness between HIV strains - has recently been used in criminal prosecutions as evidence of responsibility for HIV transmission. In these trials, the expert opinion of virologists has been of critical importance. Phylogenetic analysis of HIV gene sequences is complex and its findings do not achieve the levels of certainty obtained with the forensic analysis of human DNA. Although two individuals may carry HIV strains that are closely related, these will not necessarily be unique to the two parties and could extend to other persons within the same transmission network. For forensic purposes, phylogenetic analysis should be conducted under strictly controlled conditions by laboratories with relevant expertise applying rigorous methods. It is vitally important to include the right controls, which should be epidemiologically and temporally relevant to the parties under investigation. Use of inappropriate controls can exaggerate any relatedness between the virus strains of the complainant and defendant as being strikingly unique. It will be often difficult to obtain the relevant controls. If convenient but less appropriate controls are used, interpretation of the findings should be tempered accordingly. Phylogenetic analysis cannot prove that HIV transmission occurred directly between two individuals. However, it can exonerate individuals by demonstrating that the defendant carries a virus strain unrelated to that of the complainant. Expert witnesses should acknowledge the limitations of the inferences that might be made and choose the correct language in both written and verbal testimony.

  15. Phylogenetic features of hemagglutin gene in canine distemper virus strains from different genetic lineages.

    Science.gov (United States)

    Liao, Peng; Guo, Li; Wen, Yongjun; Yang, Yangling; Cheng, Shipeng

    2015-01-01

    In the present study, the genotype of two Canine distemper virus (CDV) strains, namely, ZJJ-SD and ZJJ-LN, were investigated, based on the whole hemagglutinin (HA) gene. The CDV strains were obtained from two foxes in Shandong Province and Liaoning Province in 2011. Phylogenetic analyses were carried out for 260 CDV strains worldwide, and a statistical analysis was performed in the amino acid substitutions at positions 530 and 549 of the HA protein. Phylogenetic analyses revealed that the two strains, ZJJ-SD and ZJJ-LN, belonged to the CDV Asia I lineage. Site 530 of HA protein was found to be relatively conserved within CDV lineages in different host species by combining the genetic sequence data with the published data from 260 CDV strains worldwide. The data analysis showed a bias toward the predicted substitution Y549H for the non-dog strains in Asia I and Europe lineages. The ratio of site 549 genetic drift in the HA gene were significantly different between dogs and non-dogs in the two lineages. The strain ZJJ-SD, from wild canid, has an Y549H substitution. It is one of three Y549H substitution for wild canids in Asia I lineages. Site 530 of HA protein was not immediately relative to CDV genetic drift from dogs to non-dogs. Statistical analysis indicated that non-dog strains have a high probability to contain Y549H than dog strains in Asia I and Europe lineages. Thus, site 549 is considered important in genetic drift from dogs to non-dogs, at least in Asia I and Europe lineages.

  16. A bootstrap based analysis pipeline for efficient classification of phylogenetically related animal miRNAs

    Directory of Open Access Journals (Sweden)

    Gu Xun

    2007-03-01

    Full Text Available Abstract Background Phylogenetically related miRNAs (miRNA families convey important information of the function and evolution of miRNAs. Due to the special sequence features of miRNAs, pair-wise sequence identity between miRNA precursors alone is often inadequate for unequivocally judging the phylogenetic relationships between miRNAs. Most of the current methods for miRNA classification rely heavily on manual inspection and lack measurements of the reliability of the results. Results In this study, we designed an analysis pipeline (the Phylogeny-Bootstrap-Cluster (PBC pipeline to identify miRNA families based on branch stability in the bootstrap trees derived from overlapping genome-wide miRNA sequence sets. We tested the PBC analysis pipeline with the miRNAs from six animal species, H. sapiens, M. musculus, G. gallus, D. rerio, D. melanogaster, and C. elegans. The resulting classification was compared with the miRNA families defined in miRBase. The two classifications were largely consistent. Conclusion The PBC analysis pipeline is an efficient method for classifying large numbers of heterogeneous miRNA sequences. It requires minimum human involvement and provides measurements of the reliability of the classification results.

  17. Phylogenomic and MALDI-TOF MS analysis of Streptococcus sinensis HKU4T reveals a distinct phylogenetic clade in the genus Streptococcus.

    Science.gov (United States)

    Teng, Jade L L; Huang, Yi; Tse, Herman; Chen, Jonathan H K; Tang, Ying; Lau, Susanna K P; Woo, Patrick C Y

    2014-10-20

    Streptococcus sinensis is a recently discovered human pathogen isolated from blood cultures of patients with infective endocarditis. Its phylogenetic position, as well as those of its closely related species, remains inconclusive when single genes were used for phylogenetic analysis. For example, S. sinensis branched out from members of the anginosus, mitis, and sanguinis groups in the 16S ribosomal RNA gene phylogenetic tree, but it was clustered with members of the anginosus and sanguinis groups when groEL gene sequences used for analysis. In this study, we sequenced the draft genome of S. sinensis and used a polyphasic approach, including concatenated genes, whole genomes, and matrix-assisted laser desorption ionization-time of flight mass spectrometry to analyze the phylogeny of S. sinensis. The size of the S. sinensis draft genome is 2.06 Mb, with GC content of 42.2%. Phylogenetic analysis using 50 concatenated genes or whole genomes revealed that S. sinensis formed a distinct cluster with Streptococcus oligofermentans and Streptococcus cristatus, and these three streptococci were clustered with the "sanguinis group." As for phylogenetic analysis using hierarchical cluster analysis of the mass spectra of streptococci, S. sinensis also formed a distinct cluster with S. oligofermentans and S. cristatus, but these three streptococci were clustered with the "mitis group." On the basis of the findings, we propose a novel group, named "sinensis group," to include S. sinensis, S. oligofermentans, and S. cristatus, in the Streptococcus genus. Our study also illustrates the power of phylogenomic analyses for resolving ambiguities in bacterial taxonomy. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  18. Phylogenetic characterization of the ubiquitous electron transfer flavoprotein families ETF-alpha and ETF-beta.

    Science.gov (United States)

    Tsai, M H; Saier, M H

    1995-06-01

    Electron transfer flavoproteins (ETF) are alpha beta-heterodimers found in eukaryotic mitochondria and bacteria. We have identified currently sequenced protein members of the ETF-alpha and ETF-beta families. Members of these two families include (a) the ETF subunits of mammals and bacteria, (b) homologous pairs of proteins (FixB/FixA) that are essential for nitrogen fixation in some bacteria, and (c) a pair of carnitine-inducible proteins encoded by two open reading frames in Escherichia coli (YaaQ and YaaR). These three groups of proteins comprise three clusters on both the ETF-alpha and ETF-beta phylogenetic trees, separated from each other by comparable phylogenetic distances. This fact suggests that these two protein families evolved with similar overall rates of evolutionary divergence. Relative regions of sequence conservation are evaluated, and signature sequences for both families are derived.

  19. Molecular phylogenetics of mastodon and Tyrannosaurus rex.

    Science.gov (United States)

    Organ, Chris L; Schweitzer, Mary H; Zheng, Wenxia; Freimark, Lisa M; Cantley, Lewis C; Asara, John M

    2008-04-25

    We report a molecular phylogeny for a nonavian dinosaur, extending our knowledge of trait evolution within nonavian dinosaurs into the macromolecular level of biological organization. Fragments of collagen alpha1(I) and alpha2(I) proteins extracted from fossil bones of Tyrannosaurus rex and Mammut americanum (mastodon) were analyzed with a variety of phylogenetic methods. Despite missing sequence data, the mastodon groups with elephant and the T. rex groups with birds, consistent with predictions based on genetic and morphological data for mastodon and on morphological data for T. rex. Our findings suggest that molecular data from long-extinct organisms may have the potential for resolving relationships at critical areas in the vertebrate evolutionary tree that have, so far, been phylogenetically intractable.

  20. The space of ultrametric phylogenetic trees.

    Science.gov (United States)

    Gavryushkin, Alex; Drummond, Alexei J

    2016-08-21

    The reliability of a phylogenetic inference method from genomic sequence data is ensured by its statistical consistency. Bayesian inference methods produce a sample of phylogenetic trees from the posterior distribution given sequence data. Hence the question of statistical consistency of such methods is equivalent to the consistency of the summary of the sample. More generally, statistical consistency is ensured by the tree space used to analyse the sample. In this paper, we consider two standard parameterisations of phylogenetic time-trees used in evolutionary models: inter-coalescent interval lengths and absolute times of divergence events. For each of these parameterisations we introduce a natural metric space on ultrametric phylogenetic trees. We compare the introduced spaces with existing models of tree space and formulate several formal requirements that a metric space on phylogenetic trees must possess in order to be a satisfactory space for statistical analysis, and justify them. We show that only a few known constructions of the space of phylogenetic trees satisfy these requirements. However, our results suggest that these basic requirements are not enough to distinguish between the two metric spaces we introduce and that the choice between metric spaces requires additional properties to be considered. Particularly, that the summary tree minimising the square distance to the trees from the sample might be different for different parameterisations. This suggests that further fundamental insight is needed into the problem of statistical consistency of phylogenetic inference methods. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  1. Identification of a new genomic hot spot of evolutionary diversification of protein function.

    Directory of Open Access Journals (Sweden)

    Aline Winkelmann

    Full Text Available Establishment of phylogenetic relationships remains a challenging task because it is based on computational analysis of genomic hot spots that display species-specific sequence variations. Here, we identify a species-specific thymine-to-guanine sequence variation in the Glrb gene which gives rise to species-specific splice donor sites in the Glrb genes of mouse and bushbaby. The resulting splice insert in the receptor for the inhibitory neurotransmitter glycine (GlyR conveys synaptic receptor clustering and specific association with a particular synaptic plasticity-related splice variant of the postsynaptic scaffold protein gephyrin. This study identifies a new genomic hot spot which contributes to phylogenetic diversification of protein function and advances our understanding of phylogenetic relationships.

  2. Large-Scale Genomic Analysis of Codon Usage in Dengue Virus and Evaluation of Its Phylogenetic Dependence

    Directory of Open Access Journals (Sweden)

    Edgar E. Lara-Ramírez

    2014-01-01

    Full Text Available The increasing number of dengue virus (DENV genome sequences available allows identifying the contributing factors to DENV evolution. In the present study, the codon usage in serotypes 1–4 (DENV1–4 has been explored for 3047 sequenced genomes using different statistics methods. The correlation analysis of total GC content (GC with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3 as well as the effective number of codons (ENC, ENCp versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide position in the codon. The correspondence analysis (CA and clustering analysis on relative synonymous codon usage (RSCU within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our analysis of a large scale reveals new feature on DENV genomic evolution.

  3. A phylogenetic analysis of the sugar porters in hemiascomycetous yeasts.

    Science.gov (United States)

    Palma, Margarida; Goffeau, André; Spencer-Martins, Isabel; Baret, Philippe V

    2007-01-01

    A total of 214 members of the sugar porter (SP) family (TC 2.A.1.1) from eight hemiascomycetous yeasts: Saccharomyces cerevisiae, Candida glabrata, Kluyveromyces lactis, Ashbya (Eremothecium) gossypii, Debaryomyces hansenii, Yarrowia lipolytica, Candida albicans and Pichia stipitis, were identified. The yeast SPs were classified in 13 different phylogenetic clusters. Specific sugar substrates could be allocated to nine phylogenetic clusters, including two novel TC clusters that are specific to fungi, i.e. the glycerol:H(+) symporter (2.A.1.1.38) and the high-affinity glucose transporter (2.A.1.1.39). Four phylogenetic clusters are identified by the preliminary fifth number Z23, Z24, Z25 and Z26 and the substrates of their members remain undetermined. The amplification of the SP clusters across the Hemiascomycetes reflects adaptation to specific carbon and energy sources available in the habitat of each yeast species. (c) 2007 S. Karger AG, Basel.

  4. Complete Chloroplast Genomes of Papaver rhoeas and Papaver orientale: Molecular Structures, Comparative Analysis, and Phylogenetic Analysis

    Directory of Open Access Journals (Sweden)

    Jianguo Zhou

    2018-02-01

    Full Text Available Papaver rhoeas L. and P. orientale L., which belong to the family Papaveraceae, are used as ornamental and medicinal plants. The chloroplast genome has been used for molecular markers, evolutionary biology, and barcoding identification. In this study, the complete chloroplast genome sequences of P. rhoeas and P. orientale are reported. Results show that the complete chloroplast genomes of P. rhoeas and P. orientale have typical quadripartite structures, which are comprised of circular 152,905 and 152,799-bp-long molecules, respectively. A total of 130 genes were identified in each genome, including 85 protein-coding genes, 37 tRNA genes, and 8 rRNA genes. Sequence divergence analysis of four species from Papaveraceae indicated that the most divergent regions are found in the non-coding spacers with minimal differences among three Papaver species. These differences include the ycf1 gene and intergenic regions, such as rpoB-trnC, trnD-trnT, petA-psbJ, psbE-petL, and ccsA-ndhD. These regions are hypervariable regions, which can be used as specific DNA barcodes. This finding suggested that the chloroplast genome could be used as a powerful tool to resolve the phylogenetic positions and relationships of Papaveraceae. These results offer valuable information for future research in the identification of Papaver species and will benefit further investigations of these species.

  5. Bioinformatics analysis and construction of phylogenetic tree of aquaporins from Echinococcus granulosus.

    Science.gov (United States)

    Wang, Fen; Ye, Bin

    2016-09-01

    Cyst echinococcosis caused by the matacestodal larvae of Echinococcus granulosus (Eg), is a chronic, worldwide, and severe zoonotic parasitosis. The treatment of cyst echinococcosis is still difficult since surgery cannot fit the needs of all patients, and drugs can lead to serious adverse events as well as resistance. The screen of target proteins interacted with new anti-hydatidosis drugs is urgently needed to meet the prevailing challenges. Here, we analyzed the sequences and structure properties, and constructed a phylogenetic tree by bioinformatics methods. The MIP family signature and Protein kinase C phosphorylation sites were predicted in all nine EgAQPs. α-helix and random coil were the main secondary structures of EgAQPs. The numbers of transmembrane regions were three to six, which indicated that EgAQPs contained multiple hydrophobic regions. A neighbor-joining tree indicated that EgAQPs were divided into two branches, seven EgAQPs formed a clade with AQP1 from human, a "strict" aquaporins, other two EgAQPs formed a clade with AQP9 from human, an aquaglyceroporins. Unfortunately, homology modeling of EgAQPs was aborted. These results provide a foundation for understanding and researches of the biological function of E. granulosus.

  6. Phylogenetic and bioinformatic analysis of gap junction-related proteins, innexins, pannexins and connexins.

    Science.gov (United States)

    Fushiki, Daisuke; Hamada, Yasuo; Yoshimura, Ryoichi; Endo, Yasuhisa

    2010-04-01

    All multi-cellular animals, including hydra, insects and vertebrates, develop gap junctions, which communicate directly with neighboring cells. Gap junctions consist of protein families called connexins in vertebrates and innexins in invertebrates. Connexins and innexins have no homology in their amino acid sequence, but both are thought to have some similar characteristics, such as a tetra-membrane-spanning structure, formation of a channel by hexamer, and transmission of small molecules (e.g. ions) to neighboring cells. Pannexins were recently identified as a homolog of innexins in vertebrate genomes. Although pannexins are thought to share the function of intercellular communication with connexins and innexins, there is little information about the relationship among these three protein families of gap junctions. We phylgenetically and bioinformatically examined these protein families and other tetra-membrane-spanning proteins using a database and three analytical softwares. The clades formed by pannexin families do not belong to the species classification but do to paralogs of each member of pannexins. Amino acid sequences of pannexins are closely related to those of innexins but less to those of connexins. These data suggest that innexins and pannexins have a common origin, but the relationship between innexins/pannexins and connexins is as slight as that of other tetra-membrane-spanning members.

  7. SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data.

    Science.gov (United States)

    Lee, Tae-Ho; Guo, Hui; Wang, Xiyin; Kim, Changsoo; Paterson, Andrew H

    2014-02-26

    Phylogenetic trees are widely used for genetic and evolutionary studies in various organisms. Advanced sequencing technology has dramatically enriched data available for constructing phylogenetic trees based on single nucleotide polymorphisms (SNPs). However, massive SNP data makes it difficult to perform reliable analysis, and there has been no ready-to-use pipeline to generate phylogenetic trees from these data. We developed a new pipeline, SNPhylo, to construct phylogenetic trees based on large SNP datasets. The pipeline may enable users to construct a phylogenetic tree from three representative SNP data file formats. In addition, in order to increase reliability of a tree, the pipeline has steps such as removing low quality data and considering linkage disequilibrium. A maximum likelihood method for the inference of phylogeny is also adopted in generation of a tree in our pipeline. Using SNPhylo, users can easily produce a reliable phylogenetic tree from a large SNP data file. Thus, this pipeline can help a researcher focus more on interpretation of the results of analysis of voluminous data sets, rather than manipulations necessary to accomplish the analysis.

  8. Genome-wide analysis of eukaryote thaumatin-like proteins (TLPs with an emphasis on poplar

    Directory of Open Access Journals (Sweden)

    Duplessis Sébastien

    2011-02-01

    Full Text Available Abstract Background Plant inducible immunity includes the accumulation of a set of defense proteins during infection called pathogenesis-related (PR proteins, which are grouped into families termed PR-1 to PR-17. The PR-5 family is composed of thaumatin-like proteins (TLPs, which are responsive to biotic and abiotic stress and are widely studied in plants. TLPs were also recently discovered in fungi and animals. In the poplar genome, TLPs are over-represented compared with annual species and their transcripts strongly accumulate during stress conditions. Results Our analysis of the poplar TLP family suggests that the expansion of this gene family was followed by diversification, as differences in expression patterns and predicted properties correlate with phylogeny. In particular, we identified a clade of poplar TLPs that cluster to a single 350 kb locus of chromosome I and that are up-regulated by poplar leaf rust infection. A wider phylogenetic analysis of eukaryote TLPs - including plant, animal and fungi sequences - shows that TLP gene content and diversity increased markedly during land plant evolution. Mapping the reported functions of characterized TLPs to the eukaryote phylogenetic tree showed that antifungal or glycan-lytic properties are widespread across eukaryote phylogeny, suggesting that these properties are shared by most TLPs and are likely associated with the presence of a conserved acidic cleft in their 3D structure. Also, we established an exhaustive catalog of TLPs with atypical architectures such as small-TLPs, TLP-kinases and small-TLP-kinases, which have potentially developed alternative functions (such as putative receptor kinases for pathogen sensing and signaling. Conclusion Our study, based on the most recent plant genome sequences, provides evidence for TLP gene family diversification during land plant evolution. We have shown that the diverse functions described for TLPs are not restricted to specific clades but seem

  9. The complete chloroplast genome sequence of Ampelopsis: gene organization, comparative analysis and phylogenetic relationships to other angiosperms

    Directory of Open Access Journals (Sweden)

    Gurusamy eRaman

    2016-03-01

    Full Text Available Ampelopsis brevipedunculata is an economically important plant that belongs to the Vitaceae family of angiosperms. The phylogenetic placement of Vitaceae is still unresolved. Recent phylogenetic studies suggested that it should be placed in various alternative families including Caryophyllaceae, asteraceae, Saxifragaceae, Dilleniaceae, or with the rest of the rosid families. However, these analyses provided weak supportive results because they were based on only one of several genes. Accordingly, complete chloroplast genome sequences are required to resolve the phylogenetic relationships among angiosperms. Recent phylogenetic analyses based on the complete chloroplast genome sequence suggested strong support for the position of Vitaceae as the earliest diverging lineage of rosids and placed it as a sister to the remaining rosids. These studies also revealed relationships among several major lineages of angiosperms; however, they highlighted the significance of taxon sampling for obtaining accurate phylogenies. In the present study, we sequenced the complete chloroplast genome of A. brevipedunculata and used these data to assess the relationships among 32 angiosperms, including 18 taxa of rosids. The Ampelopsis chloroplast genome is 161,090 bp in length, and includes a pair of inverted repeats of 26,394 bp that are separated by small and large single copy regions of 19,036 bp and 89,266 bp, respectively. The gene content and order of Ampelopsis is identical to many other unrearranged angiosperm chloroplast genomes, including Vitis and tobacco. A phylogenetic tree constructed based on 70 protein-coding genes of 33 angiosperms showed that both Saxifragales and Vitaceae diverged from the rosid clade and formed two clades with 100% bootstrap value. The position of the Vitaceae is sister to Saxifragales, and both are the basal and earliest diverging lineages. Moreover, Saxifragales forms a sister clade to Vitaceae of rosids. Overall, the results of

  10. Protein clustering and RNA phylogenetic reconstruction of the influenza A [corrected] virus NS1 protein allow an update in classification and identification of motif conservation.

    Science.gov (United States)

    Sevilla-Reyes, Edgar E; Chavaro-Pérez, David A; Piten-Isidro, Elvira; Gutiérrez-González, Luis H; Santos-Mendoza, Teresa

    2013-01-01

    The non-structural protein 1 (NS1) of influenza A virus (IAV), coded by its third most diverse gene, interacts with multiple molecules within infected cells. NS1 is involved in host immune response regulation and is a potential contributor to the virus host range. Early phylogenetic analyses using 50 sequences led to the classification of NS1 gene variants into groups (alleles) A and B. We reanalyzed NS1 diversity using 14,716 complete NS IAV sequences, downloaded from public databases, without host bias. Removal of sequence redundancy and further structured clustering at 96.8% amino acid similarity produced 415 clusters that enhanced our capability to detect distinct subgroups and lineages, which were assigned a numerical nomenclature. Maximum likelihood phylogenetic reconstruction using RNA sequences indicated the previously identified deep branching separating group A from group B, with five distinct subgroups within A as well as two and five lineages within the A4 and A5 subgroups, respectively. Our classification model proposes that sequence patterns in thirteen amino acid positions are sufficient to fit >99.9% of all currently available NS1 sequences into the A subgroups/lineages or the B group. This classification reduces host and virus bias through the prioritization of NS1 RNA phylogenetics over host or virus phenetics. We found significant sequence conservation within the subgroups and lineages with characteristic patterns of functional motifs, such as the differential binding of CPSF30 and crk/crkL or the availability of a C-terminal PDZ-binding motif. To understand selection pressures and evolution acting on NS1, it is necessary to organize the available data. This updated classification may help to clarify and organize the study of NS1 interactions and pathogenic differences and allow the drawing of further functional inferences on sequences in each group, subgroup and lineage rather than on a strain-by-strain basis.

  11. Principal component analysis and the locus of the Fréchet mean in the space of phylogenetic trees.

    Science.gov (United States)

    Nye, Tom M W; Tang, Xiaoxian; Weyenberg, Grady; Yoshida, Ruriko

    2017-12-01

    Evolutionary relationships are represented by phylogenetic trees, and a phylogenetic analysis of gene sequences typically produces a collection of these trees, one for each gene in the analysis. Analysis of samples of trees is difficult due to the multi-dimensionality of the space of possible trees. In Euclidean spaces, principal component analysis is a popular method of reducing high-dimensional data to a low-dimensional representation that preserves much of the sample's structure. However, the space of all phylogenetic trees on a fixed set of species does not form a Euclidean vector space, and methods adapted to tree space are needed. Previous work introduced the notion of a principal geodesic in this space, analogous to the first principal component. Here we propose a geometric object for tree space similar to the [Formula: see text]th principal component in Euclidean space: the locus of the weighted Fréchet mean of [Formula: see text] vertex trees when the weights vary over the [Formula: see text]-simplex. We establish some basic properties of these objects, in particular showing that they have dimension [Formula: see text], and propose algorithms for projection onto these surfaces and for finding the principal locus associated with a sample of trees. Simulation studies demonstrate that these algorithms perform well, and analyses of two datasets, containing Apicomplexa and African coelacanth genomes respectively, reveal important structure from the second principal components.

  12. Phylogenetic utility of ribosomal genes for reconstructing the phylogeny of five Chinese satyrine tribes (Lepidoptera, Nymphalidae

    Directory of Open Access Journals (Sweden)

    Mingsheng Yang

    2015-03-01

    Full Text Available Satyrinae is one of twelve subfamilies of the butterfly family Nymphalidae, which currently includes nine tribes. However, phylogenetic relationships among them remain largely unresolved, though different researches have been conducted based on both morphological and molecular data. However, ribosomal genes have never been used in tribe level phylogenetic analyses of Satyrinae. In this study we investigate for the first time the phylogenetic relationships among the tribes Elymniini, Amathusiini, Zetherini and Melanitini which are indicated to be a monophyletic group, and the Satyrini, using two ribosomal genes (28s rDNA and 16s rDNA and four protein-coding genes (EF-1α, COI, COII and Cytb. We mainly aim to assess the phylogenetic informativeness of the ribosomal genes as well as clarify the relationships among different tribes. Our results show the two ribosomal genes generally have the same high phylogenetic informativeness compared with EF-1α; and we infer the 28s rDNA would show better informativeness if the 28s rDNA sequence data for each sampling taxon are obtained in this study. The placement of the monotypic genus Callarge Leech in Zetherini is confirmed for the first time based on molecular evidence. In addition, our maximum likelihood (ML and Bayesian inference (BI trees consistently show that the involved Satyrinae including the Amathusiini is monophyletic with high support values. Although the relationships among the five tribes are identical among ML and BI analyses and are mostly strongly-supported in BI analysis, those in ML analysis are lowly- or moderately- supported. Therefore, the relationships among the related five tribes recovered herein need further verification based on more sampling taxa.

  13. The origin of modern metabolic networks inferred from phylogenomic analysis of protein architecture.

    Science.gov (United States)

    Caetano-Anollés, Gustavo; Kim, Hee Shin; Mittenthal, Jay E

    2007-05-29

    Metabolism represents a complex collection of enzymatic reactions and transport processes that convert metabolites into molecules capable of supporting cellular life. Here we explore the origins and evolution of modern metabolism. Using phylogenomic information linked to the structure of metabolic enzymes, we sort out recruitment processes and discover that most enzymatic activities were associated with the nine most ancient and widely distributed protein fold architectures. An analysis of newly discovered functions showed enzymatic diversification occurred early, during the onset of the modern protein world. Most importantly, phylogenetic reconstruction exercises and other evidence suggest strongly that metabolism originated in enzymes with the P-loop hydrolase fold in nucleotide metabolism, probably in pathways linked to the purine metabolic subnetwork. Consequently, the first enzymatic takeover of an ancient biochemistry or prebiotic chemistry was related to the synthesis of nucleotides for the RNA world.

  14. Genetic characterization and phylogenetic analysis of porcine circovirus type 2 (PCV2) in Serbia.

    Science.gov (United States)

    Savic, Bozidar; Milicevic, Vesna; Jakic-Dimic, Dobrila; Bojkovski, Jovan; Prodanovic, Radisa; Kureljusic, Branislav; Potkonjak, Aleksandar; Savic, Borivoje

    2012-01-01

    Porcine circovirus type 2 (PCV2) is the main causative agent of postweaning multisystemic wasting syndrome (PMWS). To characterize and determine the genetic diversity of PCV2 in the porcine population of Serbia, nucleotide and deduced amino acid sequences of the open reading frame 2 (ORF2) of PCV2 collected from the tissues of pigs that either had died as a result of PMWS or did not exhibit disease symptoms were analyzed. Sequencing and phylogenetic analysis showed considerable diversity among PCV2 ORF2 sequences and the existence of two main PCV2 genotypes, PCV2b and PCV2a, with at least three clusters, 1A/B, 1C and 2D. In order to provide further proof that the 1C strain is circulating in the porcine population, the whole viral genome of one PCV2 isolate was sequenced. Genotyping and phylogenetic analysis using the entire viral genome sequences confirmed that there was a PMWS-associated 1C strain emerging in Serbia. Our analysis also showed that PCV2b is dominant in the porcine population, and that it is exclusively associated with PMWS occurrences in the country. These data constitute a useful basis for further epidemiological studies regarding the heterogeneity of PCV2 strains on the European continent.

  15. Mosaic amino acid conservation in 3D-structures of surface protein and polymerase of hepatitis B virus

    NARCIS (Netherlands)

    van Hemert, Formijn J.; Zaaijer, Hans L.; Berkhout, Ben; Lukashov, Vladimir V.

    2008-01-01

    Surface protein and polymerase of hepatitis B virus provide a striking example of gene overlap. Inclusion of more coding constraints in the phylogenetic analysis forces the tree toward accepted topology. Three-dimensional protein modeling demonstrates that participation in local protein function

  16. A database of phylogenetically atypical genes in archaeal and bacterial genomes, identified using the DarkHorse algorithm

    Directory of Open Access Journals (Sweden)

    Allen Eric E

    2008-10-01

    Full Text Available Abstract Background The process of horizontal gene transfer (HGT is believed to be widespread in Bacteria and Archaea, but little comparative data is available addressing its occurrence in complete microbial genomes. Collection of high-quality, automated HGT prediction data based on phylogenetic evidence has previously been impractical for large numbers of genomes at once, due to prohibitive computational demands. DarkHorse, a recently described statistical method for discovering phylogenetically atypical genes on a genome-wide basis, provides a means to solve this problem through lineage probability index (LPI ranking scores. LPI scores inversely reflect phylogenetic distance between a test amino acid sequence and its closest available database matches. Proteins with low LPI scores are good horizontal gene transfer candidates; those with high scores are not. Description The DarkHorse algorithm has been applied to 955 microbial genome sequences, and the results organized into a web-searchable relational database, called the DarkHorse HGT Candidate Resource http://darkhorse.ucsd.edu. Users can select individual genomes or groups of genomes to screen by LPI score, search for protein functions by descriptive annotation or amino acid sequence similarity, or select proteins with unusual G+C composition in their underlying coding sequences. The search engine reports LPI scores for match partners as well as query sequences, providing the opportunity to explore whether potential HGT donor sequences are phylogenetically typical or atypical within their own genomes. This information can be used to predict whether or not sufficient information is available to build a well-supported phylogenetic tree using the potential donor sequence. Conclusion The DarkHorse HGT Candidate database provides a powerful, flexible set of tools for identifying phylogenetically atypical proteins, allowing researchers to explore both individual HGT events in single genomes, and

  17. The combination of phylogenetic analysis with epidemiological and serological data to track HIV-1 transmission in a sexual transmission case.

    Directory of Open Access Journals (Sweden)

    Min Chen

    Full Text Available To investigate the linkage of HIV transmission from a man to a woman through unprotected sexual contact without disclosing his HIV-positive status.Combined with epidemiological information and serological tests, phylogenetic analysis was used to test the a priori hypothesis of HIV transmission from the man to the woman. Control subjects, infected with HIV through heterosexual intercourse, from the same location were also sampled. Phylogenetic analyses were performed using the consensus gag, pol and env sequences obtained from blood samples of the man, the woman and the local control subjects. The env quasispecies of the man, the woman, and two controls were also obtained using single genome amplification and sequencing (SGA/S to explore the paraphyletic relationship by phylogenetic analysis.Epidemiological information and serological tests indicated that the man was infected with HIV-1 earlier than the woman. Phylogenetic analyses of the consensus sequences showed a monophyletic cluster for the man and woman in all three genomic regions. Furthermore, gag sequences of the man and woman shared a unique recombination pattern from subtype B and C, which was different from those of CRF07_BC or CRF08_BC observed in the local samples. These indicated that the viral sequences from the two subjects display a high level of similarity. Further, viral quasispecies from the man exhibited a paraphyletic relationship with those from the woman in the Bayesian and maximum-likelihood (ML phylogenetic trees of the env region, which supported the transmission direction from the man to the woman.In the context of epidemiological and serological evidence, the results of phylogenetic analyses support the transmission from the man to the woman.

  18. Detection and phylogenetic analysis of infectious pancreatic necrosis virus in Chile.

    Science.gov (United States)

    Tapia, D; Eissler, Y; Torres, P; Jorquera, E; Espinoza, J C; Kuznar, J

    2015-10-27

    Infectious pancreatic necrosis virus (IPNV) is the etiological agent of a highly contagious disease that is endemic to salmon farming in Chile and causes great economic losses to the industry. Here we compared different diagnostic methods to detect IPNV in field samples, including 3 real-time reverse transcription PCR (qRT-PCR) assays, cell culture isolation, and indirect fluorescent antibody test (IFAT). Additionally, we performed a phylogenetic analysis to investigate the genogroups prevailing in Chile, as well as their geographic distribution and virulence. The 3 qRT-PCR assays used primers that targeted regions of the VP2 and VP1 genes of the virus and were tested in 46 samples, presenting a fair agreement within their results. All samples were positive for at least 2 of the qRT-PCR assays, 29 were positive for cell culture, and 23 for IFAT, showing less sensitivity for these latter 2 methods. For the phylogenetic analysis, portions of 1180 and 523 bp of the VP2 region of segment A were amplified by RT-PCR, sequenced and compared with sequences from reference strains and from isolates reported by previous studies carried out in Chile. Most of the sequenced isolates belonged to genogroup 5 (European origin), and 5 were classified within genogroup 1 (American origin). Chilean isolates formed clusters within each of the genogroups found, evidencing a clear differentiation from the reference strains. To our knowledge, this is the most extensive study completed for IPNV in Chile, covering isolates from sea- and freshwater salmon farms and showing a high prevalence of this virus in the country.

  19. Analysis of correlations between sites in models of protein sequences

    International Nuclear Information System (INIS)

    Giraud, B.G.; Lapedes, A.; Liu, L.C.

    1998-01-01

    A criterion based on conditional probabilities, related to the concept of algorithmic distance, is used to detect correlated mutations at noncontiguous sites on sequences. We apply this criterion to the problem of analyzing correlations between sites in protein sequences; however, the analysis applies generally to networks of interacting sites with discrete states at each site. Elementary models, where explicit results can be derived easily, are introduced. The number of states per site considered ranges from 2, illustrating the relation to familiar classical spin systems, to 20 states, suitable for representing amino acids. Numerical simulations show that the criterion remains valid even when the genetic history of the data samples (e.g., protein sequences), as represented by a phylogenetic tree, introduces nonindependence between samples. Statistical fluctuations due to finite sampling are also investigated and do not invalidate the criterion. A subsidiary result is found: The more homogeneous a population, the more easily its average properties can drift from the properties of its ancestor. copyright 1998 The American Physical Society

  20. A Phylogenetic and Phenotypic Analysis of Salmonella enterica Serovar Weltevreden, an Emerging Agent of Diarrheal Disease in Tropical Regions.

    Directory of Open Access Journals (Sweden)

    Carine Makendi

    2016-02-01

    Full Text Available Salmonella enterica serovar Weltevreden (S. Weltevreden is an emerging cause of diarrheal and invasive disease in humans residing in tropical regions. Despite the regional and international emergence of this Salmonella serovar, relatively little is known about its genetic diversity, genomics or virulence potential in model systems. Here we used whole genome sequencing and bioinformatics analyses to define the phylogenetic structure of a diverse global selection of S. Weltevreden. Phylogenetic analysis of more than 100 isolates demonstrated that the population of S. Weltevreden can be segregated into two main phylogenetic clusters, one associated predominantly with continental Southeast Asia and the other more internationally dispersed. Subcluster analysis suggested the local evolution of S. Weltevreden within specific geographical regions. Four of the isolates were sequenced using long read sequencing to produce high quality reference genomes. Phenotypic analysis in Hep-2 cells and in a murine infection model indicated that S. Weltevreden were significantly attenuated in these models compared to the classical S. Typhimurium reference strain SL1344. Our work outlines novel insights into this important emerging pathogen and provides a baseline understanding for future research studies.

  1. Phylogenetic analysis of of Sarcocystis nesbitti (Coccidia: Sarcocystidae) suggests a snake as its probable definitive host

    Science.gov (United States)

    Sarcocystis nesbitti was first described by Mandour in 1969 from rhesus monkey muscle. Its definitive host remains unknown. 18SrRNA gene of Sarcocystis nesbitti was amplified, sequenced, and subjected to phylogenetic analysis. Among those congeners available for comparison, it shares closest affinit...

  2. BIMLR: a method for constructing rooted phylogenetic networks from rooted phylogenetic trees.

    Science.gov (United States)

    Wang, Juan; Guo, Maozu; Xing, Linlin; Che, Kai; Liu, Xiaoyan; Wang, Chunyu

    2013-09-15

    Rooted phylogenetic trees constructed from different datasets (e.g. from different genes) are often conflicting with one another, i.e. they cannot be integrated into a single phylogenetic tree. Phylogenetic networks have become an important tool in molecular evolution, and rooted phylogenetic networks are able to represent conflicting rooted phylogenetic trees. Hence, the development of appropriate methods to compute rooted phylogenetic networks from rooted phylogenetic trees has attracted considerable research interest of late. The CASS algorithm proposed by van Iersel et al. is able to construct much simpler networks than other available methods, but it is extremely slow, and the networks it constructs are dependent on the order of the input data. Here, we introduce an improved CASS algorithm, BIMLR. We show that BIMLR is faster than CASS and less dependent on the input data order. Moreover, BIMLR is able to construct much simpler networks than almost all other methods. BIMLR is available at http://nclab.hit.edu.cn/wangjuan/BIMLR/. © 2013 Elsevier B.V. All rights reserved.

  3. Molecular characterization and phylogenetic analysis of human influenza A viruses isolated in Iran during the 2014-2015 season.

    Science.gov (United States)

    Moasser, Elham; Behzadian, Farida; Moattari, Afagh; Fotouhi, Fatemeh; Rahimi, Amir; Zaraket, Hassan; Hosseini, Seyed Younes

    2017-07-01

    Influenza A viruses are an important cause of severe infectious diseases in humans and are characterized by their fast evolution rate. Global monitoring of these viruses is critical to detect newly emerging variants during annual epidemics. Here, we sought to genetically characterize influenza A/H1N1pdm09 and A/H3N2 viruses collected in Iran during the 2014-2015 influenza season. A total of 200 nasopharyngeal swabs were collected from patients with influenza-like illnesses. Swabs were screened for influenza A and B using real-time PCR. Furthermore, positive specimens with high virus load underwent virus isolation and genetic characterization of their hemagglutinin (HA) and M genes. Of the 200 specimens, 80 were influenza A-positive, including 44 A/H1N1pdm09 and 36 A/H3N2, while 18 were influenza B-positive. Phylogenetic analysis of the HA genes of the A/H1N1pdm09 viruses revealed the circulation of clade 6C, characterized by amino acid substitutions D97N, V234I and K283E. Analysis of the A/H3N2 viruses showed a genetic drift from the vaccine strain A/Texas/50/2012 with 5 mutations (T128A, R142G, N145S, P198S and S219F) belonging to the antigenic sites A, B, and D of the HA protein. The A/H3N2 viruses belonged to phylogenetic clades 3C.2 and 3C.3. The M gene trees of the Iranian A/H1N1pdm09 and A/H3N2 mirrored the clustering patterns of their corresponding HA trees. Our results reveal co-circulation of several influenza A virus strains in Iran during the 2014-2015 influenza season.

  4. Auto-validating von Neumann rejection sampling from small phylogenetic tree spaces

    Directory of Open Access Journals (Sweden)

    York Thomas

    2009-01-01

    Full Text Available Abstract Background In phylogenetic inference one is interested in obtaining samples from the posterior distribution over the tree space on the basis of some observed DNA sequence data. One of the simplest sampling methods is the rejection sampler due to von Neumann. Here we introduce an auto-validating version of the rejection sampler, via interval analysis, to rigorously draw samples from posterior distributions over small phylogenetic tree spaces. Results The posterior samples from the auto-validating sampler are used to rigorously (i estimate posterior probabilities for different rooted topologies based on mitochondrial DNA from human, chimpanzee and gorilla, (ii conduct a non-parametric test of rate variation between protein-coding and tRNA-coding sites from three primates and (iii obtain a posterior estimate of the human-neanderthal divergence time. Conclusion This solves the open problem of rigorously drawing independent and identically distributed samples from the posterior distribution over rooted and unrooted small tree spaces (3 or 4 taxa based on any multiply-aligned sequence data.

  5. Phylogenetic Analysis of Apple scar skin viroid Isolates in Korea

    Directory of Open Access Journals (Sweden)

    Kang Hee Cho

    2015-12-01

    Full Text Available To identify genome sequences of Apple scar skin viroid (ASSVd isolates in Korea, the field survey was performed from ‘Hongro’ apple orchards located in eight sites in South Korea (Bongwha, Cheongsong, Dangjin, Gimchoen, Muju, Mungyeong, Suwon, and Yeongwol. ASSVd was detected by RT-PCR and PCR fragments were cloned into cloning vector. Full-length viral genomes of eight ASSVd isolates were sequenced and compared with 21 isolates reported previously from Korea, India, China, Japan and Greece. Eight isolates in this study showed 92.2-99.7% nucleotide sequence identities with those reported previously. Phylogenetic analysis showed that seven isolates reported in this study belong to the same group distinct from other groups.

  6. Phylogenetic analysis of Tibetan mastiffs based on mitochondrial hypervariable region I.

    Science.gov (United States)

    Ren, Zhanjun; Chen, Huiling; Yang, Xuejiao; Zhang, Chengdong

    2017-03-01

    Recently, the number of Tibetan mastiffs, which is a precious germplasm resource and cultural heritage, is decreasing sharply. Therefore, the genetic diversity of Tibetan mastiffs needs to be studied to clarify its phylogenetics relationships and lay the foundation for resource protection, rational development and utilization of Tibetan mastiffs. We sequenced hypervariable region I of mitochondrial DNA (mtDNA) of 110 individuals from Tibet region and Gansu province. A total of 12 polymorphic sites were identified which defined eight haplotypes of which H4 and H8 were unique to Tibetan population with H8 being identified first. The haplotype diversity (Hd: 0.808), nucleotide diversity (Pi: 0.603%), the average number of nucleotide difference (K: 3.917) of Tibetan mastiffs from Gansu were higher than those from Tibet region (Hd: 0.794; Pi: 0.589%; K: 3.831), which revealed higher genetic diversity in Gansu. In terms of total population, the genetic variation was low. The median-joining network and phylogenetic tree based on the mtDNA hypervariable region I showed that Tibetan mastiffs originated from grey wolves, as the other domestic dogs and had different history of maternal origin. The mismatch distribution analysis and neutrality tests indicated that Tibetan mastiffs were in genetic equilibrium or in a population decline.

  7. On universal common ancestry, sequence similarity, and phylogenetic structure: the sins of P-values and the virtues of Bayesian evidence

    Directory of Open Access Journals (Sweden)

    Theobald Douglas L

    2011-11-01

    Full Text Available Abstract Background The universal common ancestry (UCA of all known life is a fundamental component of modern evolutionary theory, supported by a wide range of qualitative molecular evidence. Nevertheless, recently both the status and nature of UCA has been questioned. In earlier work I presented a formal, quantitative test of UCA in which model selection criteria overwhelmingly choose common ancestry over independent ancestry, based on a dataset of universally conserved proteins. These model-based tests are founded in likelihoodist and Bayesian probability theory, in opposition to classical frequentist null hypothesis tests such as Karlin-Altschul E-values for sequence similarity. In a recent comment, Koonin and Wolf (K&W claim that the model preference for UCA is "a trivial consequence of significant sequence similarity". They support this claim with a computational simulation, derived from universally conserved proteins, which produces similar sequences lacking phylogenetic structure. The model selection tests prefer common ancestry for this artificial data set. Results For the real universal protein sequences, hierarchical phylogenetic structure (induced by genealogical history is the overriding reason for why the tests choose UCA; sequence similarity is a relatively minor factor. First, for cases of conflicting phylogenetic structure, the tests choose independent ancestry even with highly similar sequences. Second, certain models, like star trees and K&W's profile model (corresponding to their simulation, readily explain sequence similarity yet lack phylogenetic structure. However, these are extremely poor models for the real proteins, even worse than independent ancestry models, though they explain K&W's artificial data well. Finally, K&W's simulation is an implementation of a well-known phylogenetic model, and it produces sequences that mimic homologous proteins. Therefore the model selection tests work appropriately with the artificial

  8. Genetic diversity and phylogenetic analysis of Aleutian mink disease virus isolates in north-east China.

    Science.gov (United States)

    Leng, Xue; Liu, Dongxu; Li, Jianming; Shi, Kun; Zeng, Fanli; Zong, Ying; Liu, Yi; Sun, Zhibo; Zhang, Shanshan; Liu, Yadong; Du, Rui

    2018-05-01

    Aleutian mink disease is the most important disease in the mink-farming industry worldwide. So far, few large-scale molecular epidemiological studies of AMDV, based on the NS1 and VP2 genes, have been conducted in China. Here, eight new Chinese isolates of AMDV from three provinces in north-east China were analyzed to clarify the molecular epidemiology of AMDV. The seroprevalence of AMDV in north-east China was 41.8% according to counterimmuno-electrophoresis. Genetic variation analysis of the eight isolates showed significant non-synonymous substitutions in the NS1 and VP2 genes, especially in the NS1 gene. All eight isolates included the caspase-recognition sequence NS1:285 (DQTD↓S), but not the caspase recognition sequence NS1:227 (INTD↓S). The LN1 and LN2 strains had a new 10-amino-acid deletion in-between amino acids 28-37, while the JL3 strain had a one-amino-acid deletion at position 28 in the VP2 protein, compared with the AMDV-G strain. Phylogenetic analysis based on most of NS1 (1755 bp) and complete VP2 showed that the AMDV genotypes did not cluster according to their pathogenicity or geographic origin. Local and imported ADMV species are all prevalent in mink-farming populations in the north-east of China. This is the first study to report the molecular epidemiology of AMDV in north-east China based on most of NS1 and the complete VP2, and further provides information about polyG deletions and new variations in the amino acid sequences of NS1 and VP2 proteins. This report is a good foundation for further study of AMDV in China.

  9. Effect of site-specific heterogeneous evolution on phylogenetic reconstruction: a simple evaluation.

    Science.gov (United States)

    Cheng, Qiqun; Su, Zhixi; Zhong, Yang; Gu, Xun

    2009-07-15

    Recent studies have shown that heterogeneous evolution may mislead phylogenetic analysis, which has been neglected for a long time. We evaluate the effect of heterogeneous evolution on phylogenetic analysis, using 18 fish mitogenomic coding sequences as an example. Using the software DIVERGE, we identify 198 amino acid sites that have experienced heterogeneous evolution. After removing these sites, the rest of sites are shown to be virtually homogeneous in the evolutionary rate. There are some differences between phylogenetic trees built with heterogeneous sites ("before tree") and without heterogeneous sites ("after tree"). Our study demonstrates that for phylogenetic reconstruction, an effective approach is to identify and remove sites with heterogeneous evolution, and suggests that researchers can use the software DIVERGE to remove the influence of heterogeneous evolution before reconstructing phylogenetic trees.

  10. Phylogenetic analysis reveals a cryptic species Blastomyces gilchristii, sp. nov. within the human pathogenic fungus Blastomyces dermatitidis.

    Directory of Open Access Journals (Sweden)

    Elizabeth M Brown

    Full Text Available Analysis of the population genetic structure of microbial species is of fundamental importance to many scientific disciplines because it can identify cryptic species, reveal reproductive mode, and elucidate processes that contribute to pathogen evolution. Here, we examined the population genetic structure and geographic differentiation of the sexual, dimorphic fungus Blastomyces dermatitidis, the causative agent of blastomycosis.Criteria for Genealogical Concordance Phylogenetic Species Recognition (GCPSR applied to seven nuclear loci (arf6, chs2, drk1, fads, pyrF, tub1, and its-2 from 78 clinical and environmental isolates identified two previously unrecognized phylogenetic species. Four of seven single gene phylogenies examined (chs2, drk1, pyrF, and its-2 supported the separation of Phylogenetic Species 1 (PS1 and Phylogenetic Species 2 (PS2 which were also well differentiated in the concatenated chs2-drk1-fads-pyrF-tub1-arf6-its2 genealogy with all isolates falling into one of two evolutionarily independent lineages. Phylogenetic species were genetically distinct with interspecific divergence 4-fold greater than intraspecific divergence and a high Fst value (0.772, P<0.001 indicative of restricted gene flow between PS1 and PS2. Whereas panmixia expected of a single freely recombining population was not observed, recombination was detected when PS1 and PS2 were assessed separately, suggesting reproductive isolation. Random mating among PS1 isolates, which were distributed across North America, was only detected after partitioning isolates into six geographic regions. The PS2 population, found predominantly in the hyper-endemic regions of northwestern Ontario, Wisconsin, and Minnesota, contained a substantial clonal component with random mating detected only among unique genotypes in the population.These analyses provide evidence for a genetically divergent clade within Blastomyces dermatitidis, which we use to describe a novel species

  11. Are Ichthyosporea animals or fungi? Bayesian phylogenetic analysis of elongation factor 1alpha of Ichthyophonus irregularis.

    Science.gov (United States)

    Ragan, Mark A; Murphy, Colleen A; Rand, Thomas G

    2003-12-01

    Ichthyosporea is a recently recognized group of morphologically simple eukaryotes, many of which cause disease in aquatic organisms. Ribosomal RNA sequence analyses place Ichthyosporea near the divergence of the animal and fungal lineages, but do not allow resolution of its exact phylogenetic position. Some of the best evidence for a specific grouping of animals and fungi (Opisthokonta) has come from elongation factor 1alpha, not only phylogenetic analysis of sequences but also the presence or absence of short insertions and deletions. We sequenced the EF-1alpha gene from the ichthyosporean parasite Ichthyophonus irregularis and determined its phylogenetic position using neighbor-joining, parsimony and Bayesian methods. We also sequenced EF-1alpha genes from four chytrids to provide broader representation within fungi. Sequence analyses and the presence of a characteristic 12 amino acid insertion strongly indicate that I. irregularis is a member of Opisthokonta, but do not resolve whether I. irregularis is a specific relative of animals or of fungi. However, the EF-1alpha of I. irregularis exhibits a two amino acid deletion heretofore reported only among fungi.

  12. Genome-wide analysis of SINA family in plants and their phylogenetic relationships.

    Science.gov (United States)

    Wang, Meng; Jin, Ying; Fu, Junjie; Zhu, Yun; Zheng, Jun; Hu, Jian; Wang, Guoying

    2008-06-01

    SINA genes in plants are part of a multigene family with 5 members in Arabidopsis thaliana, 10 members in Populus trichocarpa, 6 members in Oryza sativa, at least 6 members in Zea mays and at least 1 member in Physcomitrella patens. Six members in maize were confirmed by RT-PCR. All SINAs have one RING domain and one SINA domain. These two domains are highly conserved in plants. According to the motif organization and phylogenetic tree, SINA family members were divided into 2 groups. In addition, through semi-quantitative RT-PCR analysis of maize members and Digital Northern analysis of Arabidopsis and rice members, we found that the tissue expression patterns are more diverse in monocot than in Arabidopsis.

  13. Identification of Tunisian Leishmania spp. by PCR amplification of cysteine proteinase B (cpb) genes and phylogenetic analysis.

    Science.gov (United States)

    Chaouch, Melek; Fathallah-Mili, Akila; Driss, Mehdi; Lahmadi, Ramzi; Ayari, Chiraz; Guizani, Ikram; Ben Said, Moncef; Benabderrazak, Souha

    2013-03-01

    Discrimination of the Old World Leishmania parasites is important for diagnosis and epidemiological studies of leishmaniasis. We have developed PCR assays that allow the discrimination between Leishmania major, Leishmania tropica and Leishmania infantum Tunisian species. The identification was performed by a simple PCR targeting cysteine protease B (cpb) gene copies. These PCR can be a routine molecular biology tools for discrimination of Leishmania spp. from different geographical origins and different clinical forms. Our assays can be an informative source for cpb gene studying concerning drug, diagnostics and vaccine research. The PCR products of the cpb gene and the N-acetylglucosamine-1-phosphate transferase (nagt) Leishmania gene were sequenced and aligned. Phylogenetic trees of Leishmania based cpb and nagt sequences are close in topology and present the classic distribution of Leishmania in the Old World. The phylogenetic analysis has enabled the characterization and identification of different strains, using both multicopy (cpb) and single copy (nagt) genes. Indeed, the cpb phylogenetic analysis allowed us to identify the Tunisian Leishmania killicki species, and a group which gathers the least evolved isolates of the Leishmania donovani complex, that was originated from East Africa. This clustering confirms the African origin for the visceralizing species of the L. donovani complex. Copyright © 2012 Elsevier B.V. All rights reserved.

  14. Conformation of phylogenetic relationship of Penaeidae shrimp based on morphometric and molecular investigations.

    Science.gov (United States)

    Rajakumaran, P; Vaseeharan, B; Jayakumar, R; Chidambara, R

    2014-01-01

    Understanding of accurate phylogenetic relationship among Penaeidae shrimp is important for academic and fisheries industry. The Morphometric and Randomly amplified polymorphic DNA (RAPD) analysis was used to make the phylogenetic relationsip among 13 Penaeidae shrimp. For morphometric analysis forty variables and total lengths of shrimp were measured for each species, and removed the effect of size variation. The size normalized values obtained was subjected to UPGMA (Unweighted Pair-Group Method with Arithmetic Mean) cluster analysis. For RAPD analysis, the four primers showed reliable differentiation between species, and used correlation coefficient between the DNA banding patterns of 13 Penaeidae species to construct UPGMA dendrogram. Phylogenetic relationship from morphometric and molecular analysis for Penaeidae species found to be congruent. We concluded that as the results from morphometry investigations concur with molecular one, phylogenetic relationship obtained for the studied Penaeidae are considered to be reliable.

  15. Proteomics analysis of "Rovabiot Excel", a secreted protein cocktail from the filamentous fungus Penicillium funiculosum grown under industrial process fermentation.

    Science.gov (United States)

    Guais, Olivier; Borderies, Gisèle; Pichereaux, Carole; Maestracci, Marc; Neugnot, Virginie; Rossignol, Michel; François, Jean Marie

    2008-12-01

    MS/MS techniques are well customized now for proteomic analysis, even for non-sequenced organisms, since peptide sequences obtained by these methods can be matched with those found in databases from closely related sequenced organisms. We used this approach to characterize the protein content of the "Rovabio Excel", an enzymatic cocktail produced by Penicillium funiculosum that is used as feed additive in animal nutrition. Protein separation by bi-dimensional electrophoresis yielded more than 100 spots, from which 37 proteins were unambiguously assigned from peptide sequences. By one-dimensional SDS-gel electrophoresis, 34 proteins were identified among which 8 were not found in the 2-DE analysis. A third method, termed 'peptidic shotgun', which consists in a direct treatment of the cocktail by trypsin followed by separation of the peptides on two-dimensional liquid chromatography, resulted in the identification of two additional proteins not found by the two other methods. Altogether, more than 50 proteins, among which several glycosylhydrolytic, hemicellulolytic and proteolytic enzymes, were identified by combining three separation methods in this enzymatic cocktail. This work confirmed the power of proteome analysis to explore the genome expression of a non-sequenced fungus by taking advantage of sequences from phylogenetically related filamentous fungi and pave the way for further functional analysis of P. funiculosum.

  16. Evolution of oil-producing trichomes in Sisyrinchium (Iridaceae): insights from the first comprehensive phylogenetic analysis of the genus

    Science.gov (United States)

    Chauveau, Olivier; Eggers, Lilian; Raquin, Christian; Silvério, Adriano; Brown, Spencer; Couloux, Arnaud; Cruaud, Corine; Kaltchuk-Santos, Eliane; Yockteng, Roxana; Souza-Chies, Tatiana T.; Nadot, Sophie

    2011-01-01

    Background and Aims Sisyrinchium (Iridaceae: Iridoideae: Sisyrinchieae) is one of the largest, most widespread and most taxonomically complex genera in Iridaceae, with all species except one native to the American continent. Phylogenetic relationships within the genus were investigated and the evolution of oil-producing structures related to specialized oil-bee pollination examined. Methods Phylogenetic analyses based on eight molecular markers obtained from 101 Sisyrinchium accessions representing 85 species were conducted in the first extensive phylogenetic analysis of the genus. Total evidence analyses confirmed the monophyly of the genus and retrieved nine major clades weakly connected to the subdivisions previously recognized. The resulting phylogenetic hypothesis was used to reconstruct biogeographical patterns, and to trace the evolutionary origin of glandular trichomes present in the flowers of several species. Key Results and Conclusions Glandular trichomes evolved three times independently in the genus. In two cases, these glandular trichomes are oil-secreting, suggesting that the corresponding flowers might be pollinated by oil-bees. Biogeographical patterns indicate expansions from Central America and the northern Andes to the subandean ranges between Chile and Argentina and to the extended area of the Paraná river basin. The distribution of oil-flower species across the phylogenetic trees suggests that oil-producing trichomes may have played a key role in the diversification of the genus, a hypothesis that requires future testing. PMID:21527419

  17. Aspergillus niger contains the cryptic phylogenetic species A. awamori.

    Science.gov (United States)

    Perrone, Giancarlo; Stea, Gaetano; Epifani, Filomena; Varga, János; Frisvad, Jens C; Samson, Robert A

    2011-11-01

    Aspergillus section Nigri is an important group of species for food and medical mycology, and biotechnology. The Aspergillus niger 'aggregate' represents its most complicated taxonomic subgroup containing eight morphologically indistinguishable taxa: A. niger, Aspergillus tubingensis, Aspergillus acidus, Aspergillus brasiliensis, Aspergillus costaricaensis, Aspergillus lacticoffeatus, Aspergillus piperis, and Aspergillus vadensis. Aspergillus awamori, first described by Nakazawa, has been compared taxonomically with other black aspergilli and recently it has been treated as a synonym of A. niger. Phylogenetic analyses of sequences generated from portions of three genes coding for the proteins β-tubulin (benA), calmodulin (CaM), and the translation elongation factor-1 alpha (TEF-1α) of a population of A. niger strains isolated from grapes in Europe revealed the presence of a cryptic phylogenetic species within this population, A. awamori. Morphological, physiological, ecological and chemical data overlap occurred between A. niger and the cryptic A. awamori, however the splitting of these two species was also supported by AFLP analysis of the full genome. Isolates in both phylospecies can produce the mycotoxins ochratoxin A and fumonisin B₂, and they also share the production of pyranonigrin A, tensidol B, funalenone, malformins, and naphtho-γ-pyrones. In addition, sequence analysis of four putative A. awamori strains from Japan, used in the koji industrial fermentation, revealed that none of these strains belong to the A. awamori phylospecies. Copyright © 2011 British Mycological Society. Published by Elsevier Ltd. All rights reserved.

  18. Molecular and phylogenetic characterization of the sieve element occlusion gene family in Fabaceae and non-Fabaceae plants.

    Science.gov (United States)

    Rüping, Boris; Ernst, Antonia M; Jekat, Stephan B; Nordzieke, Steffen; Reineke, Anna R; Müller, Boje; Bornberg-Bauer, Erich; Prüfer, Dirk; Noll, Gundula A

    2010-10-08

    The phloem of dicotyledonous plants contains specialized P-proteins (phloem proteins) that accumulate during sieve element differentiation and remain parietally associated with the cisternae of the endoplasmic reticulum in mature sieve elements. Wounding causes P-protein filaments to accumulate at the sieve plates and block the translocation of photosynthate. Specialized, spindle-shaped P-proteins known as forisomes that undergo reversible calcium-dependent conformational changes have evolved exclusively in the Fabaceae. Recently, the molecular characterization of three genes encoding forisome components in the model legume Medicago truncatula (MtSEO1, MtSEO2 and MtSEO3; SEO = sieve element occlusion) was reported, but little is known about the molecular characteristics of P-proteins in non-Fabaceae. We performed a comprehensive genome-wide comparative analysis by screening the M. truncatula, Glycine max, Arabidopsis thaliana, Vitis vinifera and Solanum phureja genomes, and a Malus domestica EST library for homologs of MtSEO1, MtSEO2 and MtSEO3 and identified numerous novel SEO genes in Fabaceae and even non-Fabaceae plants, which do not possess forisomes. Even in Fabaceae some SEO genes appear to not encode forisome components. All SEO genes have a similar exon-intron structure and are expressed predominantly in the phloem. Phylogenetic analysis revealed the presence of several subgroups with Fabaceae-specific subgroups containing all of the known as well as newly identified forisome component proteins. We constructed Hidden Markov Models that identified three conserved protein domains, which characterize SEO proteins when present in combination. In addition, one common and three subgroup specific protein motifs were found in the amino acid sequences of SEO proteins. SEO genes are organized in genomic clusters and the conserved synteny allowed us to identify several M. truncatula vs G. max orthologs as well as paralogs within the G. max genome. The unexpected

  19. Phylogenetic Framework and Molecular Signatures for the Main Clades of the Phylum Actinobacteria

    Science.gov (United States)

    Gao, Beile

    2012-01-01

    Summary: The phylum Actinobacteria harbors many important human pathogens and also provides one of the richest sources of natural products, including numerous antibiotics and other compounds of biotechnological interest. Thus, a reliable phylogeny of this large phylum and the means to accurately identify its different constituent groups are of much interest. Detailed phylogenetic and comparative analyses of >150 actinobacterial genomes reported here form the basis for achieving these objectives. In phylogenetic trees based upon 35 conserved proteins, most of the main groups of Actinobacteria as well as a number of their superageneric clades are resolved. We also describe large numbers of molecular markers consisting of conserved signature indels in protein sequences and whole proteins that are specific for either all Actinobacteria or their different clades (viz., orders, families, genera, and subgenera) at various taxonomic levels. These signatures independently support the existence of different phylogenetic clades, and based upon them, it is now possible to delimit the phylum Actinobacteria (excluding Coriobacteriia) and most of its major groups in clear molecular terms. The species distribution patterns of these markers also provide important information regarding the interrelationships among different main orders of Actinobacteria. The identified molecular markers, in addition to enabling the development of a stable and reliable phylogenetic framework for this phylum, also provide novel and powerful means for the identification of different groups of Actinobacteria in diverse environments. Genetic and biochemical studies on these Actinobacteria-specific markers should lead to the discovery of novel biochemical and/or other properties that are unique to different groups of Actinobacteria. PMID:22390973

  20. Phylogenetics of neotropical Platymiscium (Leguminosae

    DEFF Research Database (Denmark)

    Saslis-Lagoudakis, C. Haris; Chase, Mark W; Robinson, Daniel N

    2008-01-01

    Platymiscium is a neotropical legume genus of forest trees in the Pterocarpus clade of the pantropical "dalbergioid" clade. It comprises 19 species (29 taxa), distributed from Mexico to southern Brazil. This study presents a molecular phylogenetic analysis of Platymiscium and allies inferred from...

  1. Mitochondrial genome of the stonefly Kamimuria wangi (Plecoptera: Perlidae) and phylogenetic position of plecoptera based on mitogenomes.

    Science.gov (United States)

    Yu-Han, Qian; Hai-Yan, Wu; Xiao-Yu, Ji; Wei-Wei, Yu; Yu-Zhou, Du

    2014-01-01

    This study determined the mitochondrial genome sequence of the stonefly, Kamimuria wangi. In order to investigate the relatedness of stonefly to other members of Neoptera, a phylogenetic analysis was undertaken based on 13 protein-coding genes of mitochondrial genomes in 13 representative insects. The mitochondrial genome of the stonefly is a circular molecule consisting of 16,179 nucleotides and contains the 37 genes typically found in other insects. A 10-bp poly-T stretch was observed in the A+T-rich region of the K. wangi mitochondrial genome. Downstream of the poly-T stretch, two regions were located with potential ability to form stem-loop structures; these were designated stem-loop 1 (positions 15848-15651) and stem-loop 2 (15965-15998). The arrangement of genes and nucleotide composition of the K. wangi mitogenome are similar to those in Pteronarcys princeps, suggesting a conserved genome evolution within the Plecoptera. Phylogenetic analysis using maximum likelihood and Bayesian inference of 13 protein-coding genes supported a novel relationship between the Plecoptera and Ephemeroptera. The results contradict the existence of a monophyletic Plectoptera and Plecoptera as sister taxa to Embiidina, and thus requires further analyses with additional mitogenome sampling at the base of the Neoptera.

  2. Mitochondrial genome of the stonefly Kamimuria wangi (Plecoptera: Perlidae and phylogenetic position of plecoptera based on mitogenomes.

    Directory of Open Access Journals (Sweden)

    Qian Yu-Han

    Full Text Available This study determined the mitochondrial genome sequence of the stonefly, Kamimuria wangi. In order to investigate the relatedness of stonefly to other members of Neoptera, a phylogenetic analysis was undertaken based on 13 protein-coding genes of mitochondrial genomes in 13 representative insects. The mitochondrial genome of the stonefly is a circular molecule consisting of 16,179 nucleotides and contains the 37 genes typically found in other insects. A 10-bp poly-T stretch was observed in the A+T-rich region of the K. wangi mitochondrial genome. Downstream of the poly-T stretch, two regions were located with potential ability to form stem-loop structures; these were designated stem-loop 1 (positions 15848-15651 and stem-loop 2 (15965-15998. The arrangement of genes and nucleotide composition of the K. wangi mitogenome are similar to those in Pteronarcys princeps, suggesting a conserved genome evolution within the Plecoptera. Phylogenetic analysis using maximum likelihood and Bayesian inference of 13 protein-coding genes supported a novel relationship between the Plecoptera and Ephemeroptera. The results contradict the existence of a monophyletic Plectoptera and Plecoptera as sister taxa to Embiidina, and thus requires further analyses with additional mitogenome sampling at the base of the Neoptera.

  3. Identification and phylogenetic analysis of novel cytochrome P450 1A genes from ungulate species.

    Science.gov (United States)

    Darwish, Wageh Sobhy; Kawai, Yusuke; Ikenaka, Yoshinori; Yamamoto, Hideaki; Muroya, Tarou; Ishizuka, Mayumi

    2010-09-01

    As part of an ongoing effort to understand the biological response of wild and domestic ungulates to different environmental pollutants such as dioxin-like compounds, cDNAs encoding for CYP1A1 and CYP1A2 were cloned and characterized. Four novel CYP1A cDNA fragments from the livers of four wild ungulates (elephant, hippopotamus, tapir and deer) were identified. Three fragments from hippopotamus, tapir and deer were classified as CYP1A2, and the other fragment from elephant was designated as CYP1A1/2. The deduced amino acid sequences of these fragment CYP1As showed identities ranging from 76 to 97% with other animal CYP1As. The phylogenetic analysis of these fragments showed that both elephant and hippopotamus CYP1As made separate branches, while tapir and deer CYP1As were located beside that of horse and cattle respectively in the phylogenetic tree. Analysis of dN/dS ratio among the identified CYP1As indicated that odd toed ungulate CYP1A2s were exposed to different selection pressure.

  4. A coevolution analysis for identifying protein-protein interactions by Fourier transform

    Science.gov (United States)

    Yin, Changchuan; Yau, Stephen S. -T.

    2017-01-01

    Protein-protein interactions (PPIs) play key roles in life processes, such as signal transduction, transcription regulations, and immune response, etc. Identification of PPIs enables better understanding of the functional networks within a cell. Common experimental methods for identifying PPIs are time consuming and expensive. However, recent developments in computational approaches for inferring PPIs from protein sequences based on coevolution theory avoid these problems. In the coevolution theory model, interacted proteins may show coevolutionary mutations and have similar phylogenetic trees. The existing coevolution methods depend on multiple sequence alignments (MSA); however, the MSA-based coevolution methods often produce high false positive interactions. In this paper, we present a computational method using an alignment-free approach to accurately detect PPIs and reduce false positives. In the method, protein sequences are numerically represented by biochemical properties of amino acids, which reflect the structural and functional differences of proteins. Fourier transform is applied to the numerical representation of protein sequences to capture the dissimilarities of protein sequences in biophysical context. The method is assessed for predicting PPIs in Ebola virus. The results indicate strong coevolution between the protein pairs (NP-VP24, NP-VP30, NP-VP40, VP24-VP30, VP24-VP40, and VP30-VP40). The method is also validated for PPIs in influenza and E.coli genomes. Since our method can reduce false positive and increase the specificity of PPI prediction, it offers an effective tool to understand mechanisms of disease pathogens and find potential targets for drug design. The Python programs in this study are available to public at URL (https://github.com/cyinbox/PPI). PMID:28430779

  5. A coevolution analysis for identifying protein-protein interactions by Fourier transform.

    Directory of Open Access Journals (Sweden)

    Changchuan Yin

    Full Text Available Protein-protein interactions (PPIs play key roles in life processes, such as signal transduction, transcription regulations, and immune response, etc. Identification of PPIs enables better understanding of the functional networks within a cell. Common experimental methods for identifying PPIs are time consuming and expensive. However, recent developments in computational approaches for inferring PPIs from protein sequences based on coevolution theory avoid these problems. In the coevolution theory model, interacted proteins may show coevolutionary mutations and have similar phylogenetic trees. The existing coevolution methods depend on multiple sequence alignments (MSA; however, the MSA-based coevolution methods often produce high false positive interactions. In this paper, we present a computational method using an alignment-free approach to accurately detect PPIs and reduce false positives. In the method, protein sequences are numerically represented by biochemical properties of amino acids, which reflect the structural and functional differences of proteins. Fourier transform is applied to the numerical representation of protein sequences to capture the dissimilarities of protein sequences in biophysical context. The method is assessed for predicting PPIs in Ebola virus. The results indicate strong coevolution between the protein pairs (NP-VP24, NP-VP30, NP-VP40, VP24-VP30, VP24-VP40, and VP30-VP40. The method is also validated for PPIs in influenza and E.coli genomes. Since our method can reduce false positive and increase the specificity of PPI prediction, it offers an effective tool to understand mechanisms of disease pathogens and find potential targets for drug design. The Python programs in this study are available to public at URL (https://github.com/cyinbox/PPI.

  6. Phylogenetic Analysis, Lineage-Specific Expansion and Functional Divergence of seed dormancy 4-Like Genes in Plants.

    Directory of Open Access Journals (Sweden)

    Saminathan Subburaj

    Full Text Available The rice gene seed dormancy 4 (OsSdr4 functions in seed dormancy and is a major factor associated with pre-harvest sprouting (PHS. Although previous studies of this protein family were reported for rice and other species, knowledge of the evolution of genes homologous to OsSdr4 in plants remains inadequate. Fifty four Sdr4-like (hereafter designated Sdr4L genes were identified in nine plant lineages including 36 species. Phylogenetic analysis placed these genes in eight subfamilies (I-VIII. Genes from the same lineage clustered together, supported by analysis of conserved motifs and exon-intron patterns. Segmental duplications were present in both dicot and monocot clusters, while tandemly duplicated genes occurred only in monocot clusters indicating that both tandem and segmental duplications contributed to expansion of the grass I and II subfamilies. Estimation of the approximate ages of the duplication events indicated that ancestral Sdr4 genes evolved from a common angiosperm ancestor, about 160 million years ago (MYA. Moreover, diversification of Sdr4L genes in mono and dicot plants was mainly associated with genome-wide duplication and speciation events. Functional divergence was observed in all subfamily pairs, except IV/VIIIa. Further analysis indicated that functional constraints between subfamily pairs I/II, I/VIIIb, II/VI, II/VIIIb, II/IV, and VI/VIIIb were statistically significant. Site and branch-site model analyses of positive selection suggested that these genes were under strong adaptive selection pressure. Critical amino acids detected for both functional divergence and positive selection were mostly located in the loops, pointing to functional importance of these regions in this protein family. In addition, differential expression studies by transcriptome atlas of 11 Sdr4L genes showed that the duplicated genes may have undergone divergence in expression between plant species. Our findings showed that Sdr4L genes are

  7. Phylogenetic turnover during subtropical forest succession across environmental and phylogenetic scales

    OpenAIRE

    Purschke, Oliver; Michalski, Stefan G.; Bruelheide, Helge; Durka, Walter

    2017-01-01

    Abstract Although spatial and temporal patterns of phylogenetic community structure during succession are inherently interlinked and assembly processes vary with environmental and phylogenetic scales, successional studies of community assembly have yet to integrate spatial and temporal components of community structure, while accounting for scaling issues. To gain insight into the processes that generate biodiversity after disturbance, we combine analyses of spatial and temporal phylogenetic ...

  8. Application of the major capsid protein as a marker of the phylogenetic diversity of Emiliania huxleyi viruses.

    Science.gov (United States)

    Rowe, Janet M; Fabre, Marie-Françoise; Gobena, Daniel; Wilson, William H; Wilhelm, Steven W

    2011-05-01

    Studies of the Phycodnaviridae have traditionally relied on the DNA polymerase (pol) gene as a biomarker. However, recent investigations have suggested that the major capsid protein (MCP) gene may be a reliable phylogenetic biomarker. We used MCP gene amplicons gathered across the North Atlantic to assess the diversity of Emiliania huxleyi-infecting Phycodnaviridae. Nucleotide sequences were examined across >6000 km of open ocean, with comparisons between concentrates of the virus-size fraction of seawater and of lysates generated by exposing host strains to these same virus concentrates. Analyses revealed that many sequences were only sampled once, while several were over-represented. Analyses also revealed nucleotide sequences distinct from previous coastal isolates. Examination of lysed cultures revealed a new richness in phylogeny, as MCP sequences previously unrepresented within the existing collection of E. huxleyi viruses (EhV) were associated with viruses lysing cultures. Sequences were compared with previously described EhV MCP sequences from the North Sea and a Norwegian Fjord, as well as from the Gulf of Maine. Principal component analysis indicates that location-specific distinctions exist despite the presence of sequences common across these environments. Overall, this investigation provides new sequence data and an assessment on the use of the MCP gene. © 2011 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved.

  9. In Silico Characterization of Pectate Lyase Protein Sequences from Different Source Organisms

    Directory of Open Access Journals (Sweden)

    Amit Kumar Dubey

    2010-01-01

    Full Text Available A total of 121 protein sequences of pectate lyases were subjected to homology search, multiple sequence alignment, phylogenetic tree construction, and motif analysis. The phylogenetic tree constructed revealed different clusters based on different source organisms representing bacterial, fungal, plant, and nematode pectate lyases. The multiple accessions of bacterial, fungal, nematode, and plant pectate lyase protein sequences were placed closely revealing a sequence level similarity. The multiple sequence alignment of these pectate lyase protein sequences from different source organisms showed conserved regions at different stretches with maximum homology from amino acid residues 439–467, 715–816, and 829–910 which could be used for designing degenerate primers or probes specific for pectate lyases. The motif analysis revealed a conserved Pec_Lyase_C domain uniformly observed in all pectate lyases irrespective of variable sources suggesting its possible role in structural and enzymatic functions.

  10. A measure of the denseness of a phylogenetic network. [by sequenced proteins from extant species

    Science.gov (United States)

    Holmquist, R.

    1978-01-01

    An objective measure of phylogenetic denseness is developed to examine various phylogenetic criteria: alpha- and beta-hemoglobin, myoglobin, cytochrome c, and the parvalbumin family. Attention is given to the number of nucleotide replacements separating homologous sequences, and to the topology of the network (in other words, to the qualitative nature of the network as defined by how closely the studied species are related). Applications include quantitative comparisons of species origin, relation, and rates of evolution.

  11. The TyrA family of aromatic-pathway dehydrogenases in phylogenetic context

    Directory of Open Access Journals (Sweden)

    Wolinsky Murray

    2005-05-01

    Full Text Available Abstract Background The TyrA protein family includes members that catalyze two dehydrogenase reactions in distinct pathways leading to L-tyrosine and a third reaction that is not part of tyrosine biosynthesis. Family members share a catalytic core region of about 30 kDa, where inhibitors operate competitively by acting as substrate mimics. This protein family typifies many that are challenging for bioinformatic analysis because of relatively modest sequence conservation and small size. Results Phylogenetic relationships of TyrA domains were evaluated in the context of combinatorial patterns of specificity for the two substrates, as well as the presence or absence of a variety of fusions. An interactive tool is provided for prediction of substrate specificity. Interactive alignments for a suite of catalytic-core TyrA domains of differing specificity are also provided to facilitate phylogenetic analysis. tyrA membership in apparent operons (or supraoperons was examined, and patterns of conserved synteny in relationship to organismal positions on the 16S rRNA tree were ascertained for members of the domain Bacteria. A number of aromatic-pathway genes (hisHb, aroF, aroQ have fused with tyrA, and it must be more than coincidental that the free-standing counterparts of all of the latter fused genes exhibit a distinct trace of syntenic association. Conclusion We propose that the ancestral TyrA dehydrogenase had broad specificity for both the cyclohexadienyl and pyridine nucleotide substrates. Indeed, TyrA proteins of this type persist today, but it is also common to find instances of narrowed substrate specificities, as well as of acquisition via gene fusion of additional catalytic domains or regulatory domains. In some clades a qualitative change associated with either narrowed substrate specificity or gene fusion has produced an evolutionary "jump" in the vertical genealogy of TyrA homologs. The evolutionary history of gene organizations that include

  12. A taxonomic and phylogenetic re-appraisal of the genus Curvularia

    Science.gov (United States)

    Species of Curvularia are important plant and human pathogens worldwide. In this study, the genus Curvularia is re-assessed based on molecular phylogenetic analysis and morphological observations of available isolates and specimens. A multi-gene phylogenetic tree inferred from ITS, TEF and GPDH gene...

  13. The complete mitochondrial genome of rabbit pinworm Passalurus ambiguus: genome characterization and phylogenetic analysis.

    Science.gov (United States)

    Liu, Guo-Hua; Li, Sheng; Zou, Feng-Cai; Wang, Chun-Ren; Zhu, Xing-Quan

    2016-01-01

    Passalurus ambiguus (Nematda: Oxyuridae) is a common pinworm which parasitizes in the caecum and colon of rabbits. Despite its significance as a pathogen, the epidemiology, genetics, systematics, and biology of this pinworm remain poorly understood. In the present study, we sequenced the complete mitochondrial (mt) genome of P. ambiguus. The circular mt genome is 14,023 bp in size and encodes of 36 genes, including 12 protein-coding, two ribosomal RNA, and 22 transfer RNA genes. The mt gene order of P. ambiguus is the same as that of Wellcomia siamensis, but distinct from that of Enterobius vermicularis. Phylogenetic analyses based on concatenated amino acid sequences of 12 protein-coding genes by Bayesian inference (BI) showed that P. ambiguus was more closely related to W. siamensis than to E. vermicularis. This mt genome provides novel genetic markers for studying the molecular epidemiology, population genetics, systematics of pinworm of animals and humans, and should have implications for the diagnosis, prevention, and control of passaluriasis in rabbits and other animals.

  14. Epitope discovery with phylogenetic hidden Markov models.

    LENUS (Irish Health Repository)

    Lacerda, Miguel

    2010-05-01

    Existing methods for the prediction of immunologically active T-cell epitopes are based on the amino acid sequence or structure of pathogen proteins. Additional information regarding the locations of epitopes may be acquired by considering the evolution of viruses in hosts with different immune backgrounds. In particular, immune-dependent evolutionary patterns at sites within or near T-cell epitopes can be used to enhance epitope identification. We have developed a mutation-selection model of T-cell epitope evolution that allows the human leukocyte antigen (HLA) genotype of the host to influence the evolutionary process. This is one of the first examples of the incorporation of environmental parameters into a phylogenetic model and has many other potential applications where the selection pressures exerted on an organism can be related directly to environmental factors. We combine this novel evolutionary model with a hidden Markov model to identify contiguous amino acid positions that appear to evolve under immune pressure in the presence of specific host immune alleles and that therefore represent potential epitopes. This phylogenetic hidden Markov model provides a rigorous probabilistic framework that can be combined with sequence or structural information to improve epitope prediction. As a demonstration, we apply the model to a data set of HIV-1 protein-coding sequences and host HLA genotypes.

  15. Comparative analyses of the complete mitochondrial genomes of Dosinia clams and their phylogenetic position within Veneridae.

    Science.gov (United States)

    Lv, Changda; Li, Qi; Kong, Lingfeng

    2018-01-01

    Mitochondrial genomes have proved to be a powerful tool in resolving phylogenetic relationship. In order to understand the mitogenome characteristics and phylogenetic position of the genus Dosinia, we sequenced the complete mitochondrial genomes of Dosinia altior and Dosinia troscheli (Bivalvia: Veneridae), compared them with that of Dosinia japonica and established a phylogenetic tree for Veneridae. The mitogenomes of D. altior (17,536 bp) and D. troscheli (17,229 bp) are the two smallest in Veneridae, which include 13 protein-coding genes, 2 ribosomal RNA genes, 22 tRNA genes, and non-coding regions. The mitogenomes of the Dosinia species are similar in size, gene content, AT content, AT- and GC- skews, and gene arrangement. The phylogenetic relationships of family Veneridae were established based on 12 concatenated protein-coding genes using maximum likelihood and Bayesian analyses, which supported that Dosininae and Meretricinae have a closer relationship, with Tapetinae being the sister taxon. The information obtained in this study will contribute to further understanding of the molecular features of bivalve mitogenomes and the evolutionary history of the genus Dosinia.

  16. A RAD-based phylogenetics for Orestias fishes from Lake Titicaca.

    Science.gov (United States)

    Takahashi, Tetsumi; Moreno, Edmundo

    2015-12-01

    The fish genus Orestias is endemic to the Andes highlands, and Lake Titicaca is the centre of the species diversity of the genus. Previous phylogenetic studies based on a single locus of mitochondrial and nuclear DNA strongly support the monophyly of a group composed of many of species endemic to the Lake Titicaca basin (the Lake Titicaca radiation), but the relationships among the species in the radiation remain unclear. Recently, restriction site-associated DNA (RAD) sequencing, which can produce a vast number of short sequences from various loci of nuclear DNA, has emerged as a useful way to resolve complex phylogenetic problems. To propose a new phylogenetic hypothesis of Orestias fishes of the Lake Titicaca radiation, we conducted a cluster analysis based on morphological similarities among fish samples and a molecular phylogenetic analysis based on RAD sequencing. From a morphological cluster analysis, we recognised four species groups in the radiation, and three of the four groups were resolved as monophyletic groups in maximum-likelihood trees based on RAD sequencing data. The other morphology-based group was not resolved as a monophyletic group in molecular phylogenies, and some members of the group were diverged from its sister group close to the root of the Lake Titicaca radiation. The evolution of these fishes is discussed from the phylogenetic relationships. Copyright © 2015 Elsevier Inc. All rights reserved.

  17. PhyDesign: an online application for profiling phylogenetic informativeness

    Directory of Open Access Journals (Sweden)

    Townsend Jeffrey P

    2011-05-01

    Full Text Available Abstract Background The rapid increase in number of sequenced genomes for species across of the tree of life is revealing a diverse suite of orthologous genes that could potentially be employed to inform molecular phylogenetic studies that encompass broader taxonomic sampling. Optimal usage of this diversity of loci requires user-friendly tools to facilitate widespread cost-effective locus prioritization for phylogenetic sampling. The Townsend (2007 phylogenetic informativeness provides a unique empirical metric for guiding marker selection. However, no software or automated methodology to evaluate sequence alignments and estimate the phylogenetic informativeness metric has been available. Results Here, we present PhyDesign, a platform-independent online application that implements the Townsend (2007 phylogenetic informativeness analysis, providing a quantitative prediction of the utility of loci to solve specific phylogenetic questions. An easy-to-use interface facilitates uploading of alignments and ultrametric trees to calculate and depict profiles of informativeness over specified time ranges, and provides rankings of locus prioritization for epochs of interest. Conclusions By providing these profiles, PhyDesign facilitates locus prioritization increasing the efficiency of sequencing for phylogenetic purposes compared to traditional studies with more laborious and low capacity screening methods, as well as increasing the accuracy of phylogenetic studies. Together with a manual and sample files, the application is freely accessible at http://phydesign.townsend.yale.edu.

  18. Lipase genes in Mucor circinelloides: identification, sub-cellular location, phylogenetic analysis and expression profiling during growth and lipid accumulation.

    Science.gov (United States)

    Zan, Xinyi; Tang, Xin; Chu, Linfang; Zhao, Lina; Chen, Haiqin; Chen, Yong Q; Chen, Wei; Song, Yuanda

    2016-10-01

    Lipases or triacylglycerol hydrolases are widely spread in nature and are particularly common in the microbial world. The filamentous fungus Mucor circinelloides is a potential lipase producer, as it grows well in triacylglycerol-contained culture media. So far only one lipase from M. circinelloides has been characterized, while the majority of lipases remain unknown in this fungus. In the present study, 47 potential lipase genes in M. circinelloides WJ11 and 30 potential lipase genes in M. circinelloides CBS 277.49 were identified by extensive bioinformatics analysis. An overview of these lipases is presented, including several characteristics, sub-cellular location, phylogenetic analysis and expression profiling of the lipase genes during growth and lipid accumulation. All of these proteins contained the consensus sequence for a classical lipase (GXSXG motif) and were divided into four types including α/β-hydrolase_1, α/β-hydrolase_3, class_3 and GDSL lipase (GDSL) based on gene annotations. Phylogenetic analyses revealed that class_3 family and α/β-hydrolase_3 family were the conserved lipase family in M. circinelloides. Additionally, some lipases also contained a typical acyltransferase motif of H-(X) 4-D, and these lipases may play a dual role in lipid metabolism, catalyzing both lipid hydrolysis and transacylation reactions. The differential expression of all lipase genes were confirmed by quantitative real-time PCR, and the expression profiling were analyzed to predict the possible biological roles of these lipase genes in lipid metabolism in M. circinelloides. We preliminarily hypothesized that lipases may be involved in triacylglycerol degradation, phospholipid synthesis and beta-oxidation. Moreover, the results of sub-cellular localization, the presence of signal peptide and transcriptional analyses of lipase genes indicated that four lipase in WJ11 most likely belong to extracellular lipases with a signal peptide. These findings provide a platform

  19. Sequence comparison and phylogenetic analysis of core gene of ...

    African Journals Online (AJOL)

    STORAGESEVER

    2010-07-19

    Jul 19, 2010 ... and antisense primers, a single band of 573 base pairs .... Amino acid sequence alignment of Cluster I and Cluster II of phylogenetic tree. First ten sequences ... sequence weighting, postion-spiecific gap penalties and weight.

  20. Phylogenetic analysis of Dengue virus 1 isolated from South Minas Gerais, Brazil.

    Science.gov (United States)

    Drumond, Betania Paiva; Fagundes, Luiz Gustavo da Silva; Rocha, Raissa Prado; Fumagalli, Marcilio Jorge; Araki, Carlos Shigueru; Colombo, Tatiana Elisa; Nogueira, Mauricio Lacerda; Castilho, Thiago Elias; da Silveira, Nelson José Freitas; Malaquias, Luiz Cosme Cotta; Coelho, Luiz Felipe Leomil

    2016-01-01

    Dengue is a major worldwide public health problem, especially in the tropical and subtropical regions of the world. Primary infection with a single Dengue virus serotype causes a mild, self-limiting febrile illness called dengue fever. However, a subset of patients who experience secondary infection with a different serotype can progress to a more severe form of the disease, called dengue hemorrhagic fever. The four Dengue virus serotypes (1-4) are antigenically and genetically distinct and each serotype is composed of multiple genotypes. In this study we isolated one Dengue virus 1 serotype, named BR/Alfenas/2012, from a patient with dengue hemorrhagic fever in Alfenas, South Minas Gerais, Brazil and molecular identification was performed based on the analysis of NS5 gene. Swiss mice were infected with this isolate to verify its potential to induce histopathological alterations characteristic of dengue. Liver histopathological analysis of infected animals showed the presence of inflammatory infiltrates, hepatic steatosis, as well as edema, hemorrhage and necrosis focal points. Phylogenetic and evolutionary analyses based on the envelope gene provided evidence that the isolate BR/Alfenas/2012 belongs to genotype V, lineage I and it is probably derived from isolates of Rio de Janeiro, Brazil. The isolate BR/Alfenas/2012 showed two unique amino acids substitutions (SER222THRE and PHE306SER) when compared to other Brazilian isolates from the same genotype/lineage. Molecular models were generated for the envelope protein indicating that the amino acid alteration PHE 306 SER could contribute to a different folding in this region located within the domain III. Further genetic and animal model studies using BR/Alfenas/2012 and other isolates belonging to the same lineage/genotype could help determine the relation of these genetic alterations and dengue hemorrhagic fever in a susceptible population. Copyright © 2015 Sociedade Brasileira de Microbiologia. Published by

  1. Phylogenetic analysis of Dengue virus 1 isolated from South Minas Gerais, Brazil

    Directory of Open Access Journals (Sweden)

    Betania Paiva Drumond

    2016-03-01

    Full Text Available Abstract Dengue is a major worldwide public health problem, especially in the tropical and subtropical regions of the world. Primary infection with a single Dengue virus serotype causes a mild, self-limiting febrile illness called dengue fever. However, a subset of patients who experience secondary infection with a different serotype can progress to a more severe form of the disease, called dengue hemorrhagic fever. The four Dengue virus serotypes (1–4 are antigenically and genetically distinct and each serotype is composed of multiple genotypes. In this study we isolated one Dengue virus 1 serotype, named BR/Alfenas/2012, from a patient with dengue hemorrhagic fever in Alfenas, South Minas Gerais, Brazil and molecular identification was performed based on the analysis of NS5 gene. Swiss mice were infected with this isolate to verify its potential to induce histopathological alterations characteristic of dengue. Liver histopathological analysis of infected animals showed the presence of inflammatory infiltrates, hepatic steatosis, as well as edema, hemorrhage and necrosis focal points. Phylogenetic and evolutionary analyses based on the envelope gene provided evidence that the isolate BR/Alfenas/2012 belongs to genotype V, lineage I and it is probably derived from isolates of Rio de Janeiro, Brazil. The isolate BR/Alfenas/2012 showed two unique amino acids substitutions (SER222THRE and PHE306SER when compared to other Brazilian isolates from the same genotype/lineage. Molecular models were generated for the envelope protein indicating that the amino acid alteration PHE 306 SER could contribute to a different folding in this region located within the domain III. Further genetic and animal model studies using BR/Alfenas/2012 and other isolates belonging to the same lineage/genotype could help determine the relation of these genetic alterations and dengue hemorrhagic fever in a susceptible population.

  2. Phylogenetic analysis of Bacillus subtilis strains applicable to natto (fermented soybean) production.

    Science.gov (United States)

    Kubo, Yuji; Rooney, Alejandro P; Tsukakoshi, Yoshiki; Nakagawa, Rikio; Hasegawa, Hiromasa; Kimura, Keitarou

    2011-09-01

    Spore-forming Bacillus strains that produce extracellular poly-γ-glutamic acid were screened for their application to natto (fermented soybean food) fermentation. Among the 424 strains, including Bacillus subtilis and B. amyloliquefaciens, which we isolated from rice straw, 59 were capable of fermenting natto. Biotin auxotrophism was tightly linked to natto fermentation. A multilocus nucleotide sequence of six genes (rpoB, purH, gyrA, groEL, polC, and 16S rRNA) was used for phylogenetic analysis, and amplified fragment length polymorphism (AFLP) analysis was also conducted on the natto-fermenting strains. The ability to ferment natto was inferred from the two principal components of the AFLP banding pattern, and natto-fermenting strains formed a tight cluster within the B. subtilis subsp. subtilis group.

  3. High genetic diversity of equine infectious anaemia virus strains from Slovenia revealed upon phylogenetic analysis of the p15 gag gene region.

    Science.gov (United States)

    Kuhar, U; Malovrh, T

    2016-03-01

    The equine infectious anaemia virus (EIAV), which belongs to the Retroviridae family, infects equids almost worldwide. Every year, sporadic EIAV cases are detected in Slovenia. To characterise the Slovenian EIAV strains in the p15 gag gene region phylogenetically in order to compare the Slovenian EIAV strains with EIAV strains from abroad, especially with the recently published European strains. Cross-sectional study using material derived from post mortem examination. In total, 29 EIAV serologically positive horses from 18 different farms were examined in this study. Primers were designed to amplify the p15 gag gene region. Amplicons of 28 PCRs were subjected to direct DNA sequencing and phylogenetic analysis. Altogether, 28 EIAV sequences were obtained from 17 different farms and were distributed between 4 separate monophyletic groups and 9 branches upon phylogenetic analysis. Among EIAV strains from abroad, the closest relatives to Slovenian EIAV strains were European EIAV strains from Italy. Phylogenetic analysis also showed that some animals from distantly located farms were most probably infected with the same EIAV strains, as well as animals from the same farm and animals from farms located in the same geographical region. This is the first report of such high genetic diversity of EIAV strains from one country. This led to speculation that there is a potential virus reservoir among the populations of riding horses, horses kept for pleasure and horses for meat production, with some farmers or horse-owners not following legislation, thus enabling the spread of infection with EIAV. The low sensitivity of the agar gel immunodiffusion test may also contribute to the spread of infection with EIAV, because some infected horses might have escaped detection. The results of the phylogenetic analysis also provide additional knowledge about the highly heterogeneous nature of the EIAV genome. © 2015 EVJ Ltd.

  4. Molecular and phylogenetic analysis of HIV-1 variants circulating in Italy

    Directory of Open Access Journals (Sweden)

    Sbreglia Costanza

    2008-10-01

    Full Text Available Abstract Objective The continuous identification of HIV-1 non-B subtypes and recombinant forms in Italy indicates the need of constant molecular epidemiology survey of genetic forms circulating and transmitted in the resident population. Methods The distribution of HIV-1 subtypes has been evaluated in 25 seropositive individuals residing in Italy, most of whom were infected through a sexual route during the 1995–2005 period. Each sample has been characterized by detailed molecular and phylogenetic analyses. Results 18 of the 25 samples were positive at HIV-1 PCR amplification. Three samples showed a nucleotide divergence compatible with a non-B subtype classification. The phylogenetic analysis, performed on both HIV-1 env and gag regions, confirms the molecular sub-typing prediction, given that 1 sample falls into the C subtype and 2 into the G subtype. The B subtype isolates show high levels of intra-subtype nucleotide divergence, compatible with a long-lasting epidemic and a progressive HIV-1 molecular diversification. Conclusion The Italian HIV-1 epidemic is still mostly attributable to the B subtype, regardless the transmission route, which shows an increasing nucleotide heterogeneity. Heterosexual transmission and the interracial blending, however, are slowly introducing novel HIV-1 subtypes. Therefore, a molecular monitoring is needed to follow the constant evolution of the HIV-1 epidemic.

  5. In vitro and in silico cloning of Xenopus laevis SOD2 cDNA and its phylogenetic analysis.

    Science.gov (United States)

    Purrello, Michele; Di Pietro, Cinzia; Ragusa, Marco; Pulvirenti, Alfredo; Giugno, Rosalba; Di Pietro, Valentina; Emmanuele, Giovanni; Travali, Salvo; Scalia, Marina; Shasha, Dennis; Ferro, Alfredo

    2005-02-01

    By using the methodology of both wet and dry biology (i.e., RT-PCR and cycle sequencing, and biocomputational technology, respectively) and the data obtained through the Genome Projects, we have cloned Xenopus laevis SOD2 (MnSOD) cDNA and determined its nucleotide sequence. These data and the deduced protein primary structure were compared with all the other SOD2 nucleotide and amino acid sequences from eukaryotes and prokaryotes, published in public databases. The analysis was performed by using both Clustal W, a well known and widely used program for sequence analysis, and AntiClustAl, a new algorithm recently created and implemented by our group. Our results demonstrate a very high conservation of the enzyme amino acid sequence during evolution, which proves a close structure-function relationship. This is to be expected for very ancient molecules endowed with critical biological functions, performed through a specific structural organization. The nucleotide sequence conservation is less pronounced: this too was foreseeable, due to neutral mutations and to the species-specific codon usage. The data obtained by using AntiClustAl are comparable with those produced with Clustal W, which validates this algorithm as an important new tool for biocomputational analysis. Finally, it is noteworthy that evolutionary trees, drawn by using all the available data on SOD2 nucleotide sequences and amino acid and either Clustal W or AntiClustAl, are comparable to those obtained through phylogenetic analysis based on fossil records.

  6. A molecular phylogenetic analysis of the Scarabaeinae (dung beetles).

    Science.gov (United States)

    Monaghan, Michael T; Inward, Daegan J G; Hunt, Toby; Vogler, Alfried P

    2007-11-01

    The dung beetles (Scarabaeinae) include ca. 5000 species and exhibit a diverse array of morphologies and behaviors. This variation presumably reflects the adaptation to a diversity of food types and the different strategies used to avoid competition for vertebrate dung, which is the primary breeding environment for most species. The current classification gives great weight to the major behavioral types, separating the ball rollers and the tunnelers, but existing phylogenetic studies have been based on limited taxonomic or biogeographic sampling and have been contradictory. Here, we present a molecular phylogenetic analysis of 214 species of Scarabaeinae, representing all 12 traditionally recognized tribes and six biogeographical regions, using partial gene sequences from one nuclear (28S) and two mitochondrial (cox1, rrnL) genes. Length variation in 28S (588-621 bp) and rrnL (514-523 bp) was subjected to a thorough evaluation of alternative alignments, gap-coding methods, and tree searches using model-based (Bayesian and likelihood), maximum parsimony, and direct optimization analyses. The small-bodied, non-dung-feeding Sarophorus+Coptorhina were basal in all reconstructions. These were closely related to rolling Odontoloma+Dicranocara, suggesting an early acquisition of rolling behavior. Smaller tribes and most genera were monophyletic, while Canthonini and Dichotomiini each consisted of multiple paraphyletic lineages at hierarchical levels equivalent to the smaller tribes. Plasticity of rolling and tunneling was evidenced by a lack of monophyly (S-H test, p > 0.05) and several reversals within clades. The majority of previously unrecognized clades were geographical, including the well-supported Neotropical Phanaeini+Eucraniini, and a large Australian clade of rollers as well as tunneling Coptodactyla and Demarziella. Only three lineages, Gymnopleurini, Copris+Microcopris and Onthophagus, were widespread and therefore appear to be dispersive at a global scale. A

  7. Simultaneous analysis of five molecular markers provides a well-supported phylogenetic hypothesis for the living bony-tongue fishes (Osteoglossomorpha: Teleostei).

    Science.gov (United States)

    Lavoué, Sébastien; Sullivan, John P

    2004-10-01

    Fishes of the Superorder Osteoglossomorpha (the "bonytongues") constitute a morphologically heterogeneous group of basal teleosts, including highly derived subgroups such as African electric fishes, the African butterfly fish, and Old World knifefishes. Lack of consensus among hypotheses of osteoglossomorph relationships advanced during the past 30 years may be due in part to the difficulty of identifying shared derived characters among the morphologically differentiated extant families of this group. In this study, we present a novel phylogenetic hypothesis for this group, based on the analysis of more than 4000 characters from five molecular markers (the mitochondrial cytochrome b, 12S and 16S rRNA genes, and the nuclear genes RAG2 and MLL). Our taxonomic sampling includes one representative of each extant non-mormyrid osteoglossomorph genus, one representative for the monophyletic family Mormyridae, and four outgroup taxa within the basal Teleostei. Maximum parsimony analysis of combined and equally weighted characters from the five molecular markers and Bayesian analysis provide a single, well-supported, hypothesis of osteoglossomorph interrelationships and show the group to be monophyletic. The tree topology is the following: (Hiodon alosoides, (Pantodon buchholzi, (((Osteoglossum bicirrhosum, Scleropages sp.), (Arapaima gigas, Heterotis niloticus)), ((Gymnarchus niloticus, Ivindomyrus opdenboschi), ((Notopterus notopterus, Chitala ornata), (Xenomystus nigri, Papyrocranus afer)))))). We compare our results with previously published phylogenetic hypotheses based on morpho-anatomical data. Additionally, we explore the consequences of the long terminal branch length for the taxon Pantodon buchholzi in our phylogenetic reconstruction and we use the obtained phylogenetic tree to reconstruct the evolutionary history of electroreception in the Notopteroidei.

  8. In Silico Identification, Phylogenetic and Bioinformatic Analysis of Argonaute Genes in Plants

    Directory of Open Access Journals (Sweden)

    Khaled Mirzaei

    2014-01-01

    Full Text Available Argonaute protein family is the key players in pathways of gene silencing and small regulatory RNAs in different organisms. Argonaute proteins can bind small noncoding RNAs and control protein synthesis, affect messenger RNA stability, and even participate in the production of new forms of small RNAs. The aim of this study was to characterize and perform bioinformatic analysis of Argonaute proteins in 32 plant species that their genome was sequenced. A total of 437 Argonaute genes were identified and were analyzed based on lengths, gene structure, and protein structure. Results showed that Argonaute proteins were highly conserved across plant kingdom. Phylogenic analysis divided plant Argonautes into three classes. Argonaute proteins have three conserved domains PAZ, MID and PIWI. In addition to three conserved domains namely, PAZ, MID, and PIWI, we identified few more domains in AGO of some plant species. Expression profile analysis of Argonaute proteins showed that expression of these genes varies in most of tissues, which means that these proteins are involved in regulation of most pathways of the plant system. Numbers of alternative transcripts of Argonaute genes were highly variable among the plants. A thorough analysis of large number of putative Argonaute genes revealed several interesting aspects associated with this protein and brought novel information with promising usefulness for both basic and biotechnological applications.

  9. 70 kD stress protein (Hsp70) analysis in living shallow-water benthic foraminifera

    Digital Repository Service at National Institute of Oceanography (India)

    Heinz, P.; Marten, R.A; Linshy, V.N.; Haap, T.; Geslin, E.; Kohler, H-R.

    Hsp70 is a phylogenetically highly conserved protein family present in all eukaryotic organisms tested so far. Its synthesis is induced by proteotoxic stress. The detection of Hsp70 in foraminifera is presented here. We introduce a standard...

  10. The complete genome structure and phylogenetic relationship of infectious hematopoietic necrosis virus

    Science.gov (United States)

    Morzunov , Sergey P.; Winton, James R.; Nichol, Stuart T.

    1995-01-01

    Infectious hematopoietic necrosis virus (IHNV), a member of the family Rhabdoviridae, causes a severe disease with high mortality in salmonid fish. The nucleotide sequence (11, 131 bases) of the entire genome was determined for the pathogenic WRAC strain of IHNV from southern Idaho. This allowed detailed analysis of all 6 genes, the deduced amino acid sequences of their encoded proteins, and important control motifs including leader, trailer and gene junction regions. Sequence analysis revealed that the 6 virus genes are located along the genome in the 3′ to 5′ order: nucleocapsid (N), polymerase-associated phosphoprotein (P or M1), matrix protein (M or M2), surface glycoprotein (G), a unique non-virion protein (NV) and virus polymerase (L). The IHNV genome RNA was found to have highly complementary termini (15 of 16 nucleotides). The gene junction regions display the highly conserved sequence UCURUC(U)7RCCGUG(N)4CACR (in the vRNA sense), which includes the typical rhabdovirus transcription termination/polyadenylation signal and a novel putative transcription initiation signal. Phylogenetic analysis of M, G and L protein sequences allowed insights into the evolutionary and taxonomic relationship of rhabdoviruses of fish relative to those of insects or mammals, and a broader sense of the relationship of non-segmented negative-strand RNA viruses. Based on these data, a new genus, piscivirus, is proposed which will initially contain IHNV, viral hemorrhagic septicemia virus and Hirame rhabdovirus.

  11. Phylogenetic analysis and DNA-based species confirmation in Anopheles (Nyssorhynchus.

    Directory of Open Access Journals (Sweden)

    Peter G Foster

    Full Text Available Specimens of neotropical Anopheles (Nyssorhynchus were collected and identified morphologically. We amplified three genes for phylogenetic analysis-the single copy nuclear white and CAD genes, and the COI barcode region. Since we had multiple specimens for most species we were able to test how well the single or combined genes were able to corroborate morphologically defined species by placing the species into exclusive groups. We found that single genes, including the COI barcode region, were poor at confirming species, but that the three genes combined were able to do so much better. This has implications for species identification, species delimitation, and species discovery, and we caution that single genes are not enough. Higher level groupings were partially resolved with some well-supported groupings, whereas others were found to be either polyphyletic or paraphyletic. There were examples of known groups, such as the Myzorhynchella Section, which were poorly supported with single genes but were well supported with combined genes. From this we can infer that more sequence data will be needed in order to show more higher-level groupings with good support. We got unambiguously good support (0.94-1.0 Bayesian posterior probability from all DNA-based analyses for a grouping of An. dunhami with An. nuneztovari and An. goeldii, and because of this and because of morphological similarities we propose that An. dunhami be included in the Nuneztovari Complex. We obtained phylogenetic corroboration for new species which had been recognised by morphological differences; these will need to be formally described and named.

  12. [Phylogenetic analysis of closely related Leuconostoc citreum species based on partial housekeeping genes].

    Science.gov (United States)

    Lv, Qiang; Chen, Ming; Xu, Haiyan; Song, Yuqin; Sun, Zhihong; Dan, Tong; Sun, Tiansong

    2013-07-04

    Using the 16S rRNA, dnaA, murC and pyrG gene sequences, we identified the phylogenetic relationship among closely related Leuconostoc citreum species. Seven Leu. citreum strains originally isolated from sourdough were characterized by PCR methods to amplify the dnaA, murC and pyrG gene sequences, which were determined to assess the suitability as phylogenetic markers. Then, we estimated the genetic distance and constructed the phylogenetic trees including 16S rRNA and above mentioned three housekeeping genes combining with published corresponding sequences. By comparing the phylogenetic trees, the topology of three housekeeping genes trees were consistent with that of 16S rRNA gene. The homology of closely related Leu. citreum species among dnaA, murC, pyrG and 16S rRNA gene sequences were different, ranged from75.5% to 97.2%, 50.2% to 99.7%, 65.0% to 99.8% and 98.5% 100%, respectively. The phylogenetic relationship of three housekeeping genes sequences were highly consistent with the results of 16S rRNA gene sequence, while the genetic distance of these housekeeping genes were extremely high than 16S rRNA gene. Consequently, the dnaA, murC and pyrG gene are suitable for classification and identification closely related Leu. citreum species.

  13. Confirmation of a novel siadenovirus species detected in raptors: partial sequence and phylogenetic analysis.

    Science.gov (United States)

    Kovács, Endre R; Benko, Mária

    2009-03-01

    Partial genome characterisation of a novel adenovirus, found recently in organ samples of multiple species of dead birds of prey, was carried out by sequence analysis of PCR-amplified DNA fragments. The virus, named as raptor adenovirus 1 (RAdV-1), has originally been detected by a nested PCR method with consensus primers targeting the adenoviral DNA polymerase gene. Phylogenetic analysis with the deduced amino acid sequence of the small PCR product has implied a new siadenovirus type present in the samples. Since virus isolation attempts remained unsuccessful, further characterisation of this putative novel siadenovirus was carried out with the use of PCR on the infected organ samples. The DNA sequence of the central genome part of RAdV-1, encompassing nine full (pTP, 52K, pIIIa, III, pVII, pX, pVI, hexon, protease) and two partial (DNA polymerase and DBP) genes and exceeding 12 kb pairs in size, was determined. Phylogenetic tree reconstructions, based on several genes, unambiguously confirmed the preliminary classification of RAdV-1 as a new species within the genus Siadenovirus. Further study of RAdV-1 is of interest since it represents a rare adenovirus genus of yet undetermined host origin.

  14. Phylogenetic analysis of influenza A viruses (H3N2 circulating in Zhytomyr region during 2013–2014 epidemic season

    Directory of Open Access Journals (Sweden)

    Boyalska O. G.

    2015-06-01

    Full Text Available Aim. To perform phylogenetic analysis of the hemagglutinin (HA and neuraminidase (NA genes of influenza A(H3N2 viruses circulating in the Zhytomyr region during 2013–2014 epidemic season. To make comparison of the HA and NA genes sequences of the Zhytomyr region isolates with the HA and NA genes sequences of influenza viruses circulating in the world. Methods. Laboratory diagnosis was conducted by real-time polymerase chain reaction (RT-PCR. In this study the sequencing and phylogenetic analysis were carried out. Results. For the first time the genes of influenza A(H3N2 viruses isolated in the Zhytomyr region during 2013–2014 epidemic season, coding hemagglutinin and neuraminidase were compared with their orthologs. According to the results of this comparison the phylogenetic tree was constructed. Additionally, the amino acid substitutions of the influenza viruses circulating in Ukraine and worldwide were analyzed. Conclusions. The nucleotide sequences of the influenza A(H3N2 viruses genes HA and NA isolated in the Zhytomyr region were identified. Based on the nucleotide sequences of HA and NA we constructed the influenza virus phylogenetic tree demonstrating that the virus isolated in the Zhytomyr region was closely related to the Ukrainian isolate from Kharkov and in the world to the isolates from Germany, Romania, Italy.

  15. Using phylogenetically-informed annotation (PIA) to search for light-interacting genes in transcriptomes from non-model organisms.

    Science.gov (United States)

    Speiser, Daniel I; Pankey, M Sabrina; Zaharoff, Alexander K; Battelle, Barbara A; Bracken-Grissom, Heather D; Breinholt, Jesse W; Bybee, Seth M; Cronin, Thomas W; Garm, Anders; Lindgren, Annie R; Patel, Nipam H; Porter, Megan L; Protas, Meredith E; Rivera, Ajna S; Serb, Jeanne M; Zigler, Kirk S; Crandall, Keith A; Oakley, Todd H

    2014-11-19

    Tools for high throughput sequencing and de novo assembly make the analysis of transcriptomes (i.e. the suite of genes expressed in a tissue) feasible for almost any organism. Yet a challenge for biologists is that it can be difficult to assign identities to gene sequences, especially from non-model organisms. Phylogenetic analyses are one useful method for assigning identities to these sequences, but such methods tend to be time-consuming because of the need to re-calculate trees for every gene of interest and each time a new data set is analyzed. In response, we employed existing tools for phylogenetic analysis to produce a computationally efficient, tree-based approach for annotating transcriptomes or new genomes that we term Phylogenetically-Informed Annotation (PIA), which places uncharacterized genes into pre-calculated phylogenies of gene families. We generated maximum likelihood trees for 109 genes from a Light Interaction Toolkit (LIT), a collection of genes that underlie the function or development of light-interacting structures in metazoans. To do so, we searched protein sequences predicted from 29 fully-sequenced genomes and built trees using tools for phylogenetic analysis in the Osiris package of Galaxy (an open-source workflow management system). Next, to rapidly annotate transcriptomes from organisms that lack sequenced genomes, we repurposed a maximum likelihood-based Evolutionary Placement Algorithm (implemented in RAxML) to place sequences of potential LIT genes on to our pre-calculated gene trees. Finally, we implemented PIA in Galaxy and used it to search for LIT genes in 28 newly-sequenced transcriptomes from the light-interacting tissues of a range of cephalopod mollusks, arthropods, and cubozoan cnidarians. Our new trees for LIT genes are available on the Bitbucket public repository ( http://bitbucket.org/osiris_phylogenetics/pia/ ) and we demonstrate PIA on a publicly-accessible web server ( http://galaxy-dev.cnsi.ucsb.edu/pia/ ). Our new

  16. Phylogenetic analysis of the diacylglycerol kinase family of proteins and identification of multiple highly-specific conserved inserts and deletions within the catalytic domain that are distinctive characteristics of different classes of DGK homologs.

    Directory of Open Access Journals (Sweden)

    Radhey S Gupta

    Full Text Available Diacylglycerol kinase (DGK family of proteins, which phosphorylates diacylglycerol into phosphatidic acid, play important role in controlling diverse cellular processes in eukaryotic organisms. Most vertebrate species contain 10 different DGK isozymes, which are grouped into 5 different classes based on the presence or absence of specific functional domains. However, the relationships among different DGK isozymes or how they have evolved from a common ancestor is unclear. The catalytic domain constitutes the single largest sequence element within the DGK proteins that is commonly and uniquely shared by all family members, but there is limited understanding of the overall function of this domain. In this work, we have used the catalytic domain sequences to construct a phylogenetic tree for the DGK family members from representatives of the main vertebrate classes and have also examined the distributions of various DGK isozymes in eukaryotic phyla. In a tree based on catalytic domain sequences, the DGK homologs belonging to different classes formed strongly supported clusters which were separated by long branches, and the different isozymes within each class also generally formed monophyletic groupings. Further, our analysis of the sequence alignments of catalytic domains has identified >10 novel sequence signatures consisting of conserved signature indels (inserts or deletions, CSIs that are distinctive characteristics of either particular classes of DGK isozymes, or are commonly shared by members of two or more classes of DGK isozymes. The conserved indels in protein sequences are known to play important functional roles in the proteins/organisms where they are found. Thus, our identification of multiple highly specific CSIs that are distinguishing characteristics of different classes of DGK homologs points to the existence of important differences in the catalytic domain function among the DGK isozymes. The identified CSIs in conjunction with

  17. Phylogenetic reconstruction and polymorphism analysis of BK virus VP2 gene isolated from renal transplant recipients in China.

    Science.gov (United States)

    Wang, Zhang-Yang; Hong, Wei-Long; Zhu, Zhe-Hui; Chen, Yun-Hao; Ye, Wen-LE; Chu, Guang-Yu; Li, Jia-Lin; Chen, Bi-Cheng; Xia, Peng

    2015-11-01

    BK polyomavirus (BKV) is important pathogen for kidney transplant recipients, as it is frequently re-activated, leading to nephropathy. The aim of this study was to investigate the phylogenetic reconstruction and polymorphism of the VP2 gene in BKV isolated from Chinese kidney transplant recipients. Phylogenetic analysis was carried out in the VP2 region from 135 BKV-positive samples and 28 reference strains retrieved from GenBank. The unweighted pair-group method with arithmetic mean (UPGMA) grouped all strains into subtypes, but failed to subdivide strains into subgroups. Among the plasma and urine samples, all plasma (23/23) and 82 urine samples (82/95) were identified to contain subtype I; the other 10 urine samples contained subtype IV. A 86-bp fragment was identified as a highly conserved sequence. Following alignment with 36 published BKV sequences from China, 92 sites of polymorphism were identified, including 11 single nucleotide polymorphisms (SNPs) prevalent in Chinese individuals and 30 SNPs that were specific to the two predominant subtypes I and IV. The limitations of the VP2 gene segment in subgrouping were confirmed by phylogenetic analysis. The conserved sequence and polymorphism identified in this study may be helpful in the detection and genotyping of BKV.

  18. Untangling hybrid phylogenetic signals: horizontal gene transfer and artifacts of phylogenetic reconstruction.

    Science.gov (United States)

    Beiko, Robert G; Ragan, Mark A

    2009-01-01

    Phylogenomic methods can be used to investigate the tangled evolutionary relationships among genomes. Building 'all the trees of all the genes' can potentially identify common pathways of horizontal gene transfer (HGT) among taxa at varying levels of phylogenetic depth. Phylogenetic affinities can be aggregated and merged with the information about genetic linkage and biochemical function to examine hypotheses of adaptive evolution via HGT. Additionally, the use of many genetic data sets increases the power of statistical tests for phylogenetic artifacts. However, large-scale phylogenetic analyses pose several challenges, including the necessary abandonment of manual validation techniques, the need to translate inferred phylogenetic discordance into inferred HGT events, and the challenges involved in aggregating results from search-based inference methods. In this chapter we describe a tree search procedure to recover the most parsimonious pathways of HGT, and examine some of the assumptions that are made by this method.

  19. Genomewide identification and expression analysis of the ARF gene ...

    Indian Academy of Sciences (India)

    Figure 1. Phylogenetic relation of apple ARF genes. The phylogenetic tree was constructed based on a complete protein sequence align- ment of MdARFs by the neighbour-joining method with bootstrapping analysis (1000 replicates). The scale bar represents 0.05 amino acid substitutions per site. Paralogous gene pairs ...

  20. Phylogenetic analysis at deep timescales: unreliable gene trees, bypassed hidden support, and the coalescence/concatalescence conundrum.

    Science.gov (United States)

    Gatesy, John; Springer, Mark S

    2014-11-01

    Large datasets are required to solve difficult phylogenetic problems that are deep in the Tree of Life. Currently, two divergent systematic methods are commonly applied to such datasets: the traditional supermatrix approach (= concatenation) and "shortcut" coalescence (= coalescence methods wherein gene trees and the species tree are not co-estimated). When applied to ancient clades, these contrasting frameworks often produce congruent results, but in recent phylogenetic analyses of Placentalia (placental mammals), this is not the case. A recent series of papers has alternatively disputed and defended the utility of shortcut coalescence methods at deep phylogenetic scales. Here, we examine this exchange in the context of published phylogenomic data from Mammalia; in particular we explore two critical issues - the delimitation of data partitions ("genes") in coalescence analysis and hidden support that emerges with the combination of such partitions in phylogenetic studies. Hidden support - increased support for a clade in combined analysis of all data partitions relative to the support evident in separate analyses of the various data partitions, is a hallmark of the supermatrix approach and a primary rationale for concatenating all characters into a single matrix. In the most extreme cases of hidden support, relationships that are contradicted by all gene trees are supported when all of the genes are analyzed together. A valid fear is that shortcut coalescence methods might bypass or distort character support that is hidden in individual loci because small gene fragments are analyzed in isolation. Given the extensive systematic database for Mammalia, the assumptions and applicability of shortcut coalescence methods can be assessed with rigor to complement a small but growing body of simulation work that has directly compared these methods to concatenation. We document several remarkable cases of hidden support in both supermatrix and coalescence paradigms and argue

  1. Genetic Diversity and Phylogenetic Analysis of the Iranian Leishmania Parasites Based on HSP70 Gene PCR-RFLP and Sequence Analysis.

    Science.gov (United States)

    Nemati, Sara; Fazaeli, Asghar; Hajjaran, Homa; Khamesipour, Ali; Anbaran, Mohsen Falahati; Bozorgomid, Arezoo; Zarei, Fatah

    2017-08-01

    Despite the broad distribution of leishmaniasis among Iranians and animals across the country, little is known about the genetic characteristics of the causative agents. Applying both HSP70 PCR-RFLP and sequence analyses, this study aimed to evaluate the genetic diversity and phylogenetic relationships among Leishmania spp. isolated from Iranian endemic foci and available reference strains. A total of 36 Leishmania isolates from almost all districts across the country were genetically analyzed for the HSP70 gene using both PCR-RFLP and sequence analysis. The original HSP70 gene sequences were aligned along with homologous Leishmania sequences retrieved from NCBI, and subjected to the phylogenetic analysis. Basic parameters of genetic diversity were also estimated. The HSP70 PCR-RFLP presented 3 different electrophoretic patterns, with no further intraspecific variation, corresponding to 3 Leishmania species available in the country, L. tropica, L. major, and L. infantum. Phylogenetic analyses presented 5 major clades, corresponding to 5 species complexes. Iranian lineages, including L. major, L. tropica, and L. infantum, were distributed among 3 complexes L. major, L. tropica, and L. donovani. However, within the L. major and L. donovani species complexes, the HSP70 phylogeny was not able to distinguish clearly between the L. major and L. turanica isolates, and between the L. infantum, L. donovani, and L. chagasi isolates, respectively. Our results indicated that both HSP70 PCR-RFLP and sequence analyses are medically applicable tools for identification of Leishmania species in Iranian patients. However, the reduced genetic diversity of the target gene makes it inevitable that its phylogeny only resolves the major groups, namely, the species complexes.

  2. Diversity structure of culturable bacteria isolated from the Fildes Peninsula (King George Island, Antarctica): A phylogenetic analysis perspective.

    Science.gov (United States)

    González-Rocha, Gerardo; Muñoz-Cartes, Gabriel; Canales-Aguirre, Cristian B; Lima, Celia A; Domínguez-Yévenes, Mariana; Bello-Toledo, Helia; Hernández, Cristián E

    2017-01-01

    It has been proposed that Antarctic environments select microorganisms with unique biochemical adaptations, based on the tenet 'Everything is everywhere, but, the environment selects' by Baas-Becking. However, this is a hypothesis that has not been extensively evaluated. This study evaluated the fundamental prediction contained in this hypothesis-in the sense that species are structured in the landscape according to their local habitats-, using as study model the phylogenetic diversity of the culturable bacteria of Fildes Peninsula (King George Island, Antarctica). Eighty bacterial strains isolated from 10 different locations in the area, were recovered. Based on phylogenetic analysis of 16S rRNA gene sequences, the isolates were grouped into twenty-six phylotypes distributed in three main clades, of which only six are exclusive to Antarctica. Results showed that phylotypes do not group significantly by habitat type; however, local habitat types had phylogenetic signal, which support the phylogenetic niche conservatism hypothesis and not a selective role of the environment like the Baas-Becking hypothesis suggests. We propose that, more than habitat selection resulting in new local adaptations and diversity, local historical colonization and species sorting (i.e. differences in speciation and extinction rates that arise by interaction of species level traits with the environment) play a fundamental role on the culturable bacterial diversity in Antarctica.

  3. The complete mitochondrial genome of the land snail Cornu aspersum (Helicidae: Mollusca: intra-specific divergence of protein-coding genes and phylogenetic considerations within Euthyneura.

    Directory of Open Access Journals (Sweden)

    Juan Diego Gaitán-Espitia

    Full Text Available The complete sequences of three mitochondrial genomes from the land snail Cornu aspersum were determined. The mitogenome has a length of 14050 bp, and it encodes 13 protein-coding genes, 22 transfer RNA genes and two ribosomal RNA genes. It also includes nine small intergene spacers, and a large AT-rich intergenic spacer. The intra-specific divergence analysis revealed that COX1 has the lower genetic differentiation, while the most divergent genes were NADH1, NADH3 and NADH4. With the exception of Euhadra herklotsi, the structural comparisons showed the same gene order within the family Helicidae, and nearly identical gene organization to that found in order Pulmonata. Phylogenetic reconstruction recovered Basommatophora as polyphyletic group, whereas Eupulmonata and Pulmonata as paraphyletic groups. Bayesian and Maximum Likelihood analyses showed that C. aspersum is a close relative of Cepaea nemoralis, and with the other Helicidae species form a sister group of Albinaria caerulea, supporting the monophyly of the Stylommatophora clade.

  4. A preliminary mitochondrial genome phylogeny of Orthoptera (Insecta) and approaches to maximizing phylogenetic signal found within mitochondrial genome data.

    Science.gov (United States)

    Fenn, J Daniel; Song, Hojun; Cameron, Stephen L; Whiting, Michael F

    2008-10-01

    The phylogenetic utility of mitochondrial genomes (mtgenomes) is examined using the framework of a preliminary phylogeny of Orthoptera. This study presents five newly sequenced genomes from four orthopteran families. While all ensiferan and polyneopteran taxa retain the ancestral gene order, all caeliferan lineages including the newly sequenced caeliferan species contain a tRNA rearrangement from the insect ground plan tRNA(Lys)(K)-tRNA(Asp)(D) swapping to tRNA(Asp) (D)-tRNA(Lys) (K) confirming that this rearrangement is a possible molecular synapomorphy for this suborder. The phylogenetic signal in mtgenomes is rigorously examined under the analytical regimens of parsimony, maximum likelihood and Bayesian inference, along with how gene inclusion/exclusion, data recoding, gap coding, and different partitioning schemes influence the phylogenetic reconstruction. When all available data are analyzed simultaneously, the monophyly of Orthoptera and its two suborders, Caelifera and Ensifera, are consistently recovered in the context of our taxon sampling, regardless of the optimality criteria. When protein-coding genes are analyzed as a single partition, nearly identical topology to the combined analyses is recovered, suggesting that much of the signals of the mtgenome come from the protein-coding genes. Transfer and ribosomal RNAs perform poorly when analyzed individually, but contribute signal when analyzed in combination with the protein-coding genes. Inclusion of third codon position of the protein-coding genes does not negatively affect the phylogenetic reconstruction when all genes are analyzed together, whereas recoding of the protein-coding genes into amino acid sequences introduces artificial resolution. Over-partitioning in a Bayesian framework appears to have a negative effect in achieving convergence. Our findings suggest that the best phylogenetic inferences are made when all available nucleotide data from the mtgenome are analyzed simultaneously, and that

  5. The best of both worlds: Phylogenetic eigenvector regression and mapping

    Directory of Open Access Journals (Sweden)

    José Alexandre Felizola Diniz Filho

    2015-09-01

    Full Text Available Eigenfunction analyses have been widely used to model patterns of autocorrelation in time, space and phylogeny. In a phylogenetic context, Diniz-Filho et al. (1998 proposed what they called Phylogenetic Eigenvector Regression (PVR, in which pairwise phylogenetic distances among species are submitted to a Principal Coordinate Analysis, and eigenvectors are then used as explanatory variables in regression, correlation or ANOVAs. More recently, a new approach called Phylogenetic Eigenvector Mapping (PEM was proposed, with the main advantage of explicitly incorporating a model-based warping in phylogenetic distance in which an Ornstein-Uhlenbeck (O-U process is fitted to data before eigenvector extraction. Here we compared PVR and PEM in respect to estimated phylogenetic signal, correlated evolution under alternative evolutionary models and phylogenetic imputation, using simulated data. Despite similarity between the two approaches, PEM has a slightly higher prediction ability and is more general than the original PVR. Even so, in a conceptual sense, PEM may provide a technique in the best of both worlds, combining the flexibility of data-driven and empirical eigenfunction analyses and the sounding insights provided by evolutionary models well known in comparative analyses.

  6. Molecular typing of canine parvovirus from Sulaimani, Iraq and phylogenetic analysis using partial VP2 gene

    Directory of Open Access Journals (Sweden)

    M.O.Baba Sheikh

    2017-09-01

    Full Text Available Canine parvovirus (CPV remains the most significant viral cause of haemorrhagic enteritis and bloody diarrhoea in puppies over the age of 12 weeks. The objective of the present study was to detect and genotype CPV-2 by polymerase chain reaction (PCR and to perform phylogenetic analysis using partial VP2 gene sequences. We analysed eight faecal samples of unvaccinated dogs with signs of vomiting and bloody diarrhoea during the period from December 2013 to May 2014 in different locations in Sulaimani, Kurdistan, Iraq. After PCR detection, we found that all viral sequences in our study were CPV-2b variants, which differed genetically by 0.8% to 3.6% from five commercially available vaccines. Alignment between eight nucleotides of field virus sequences showed 95% to 99.5% similarity. The phylogenetic analysis for the 8 field sequences formed two distinct clusters with two sequences belonging to strains from China and Thailand and the other six – with a strain from Egypt. Molecular characterisation and CPV typing are crucial in epidemiological studies for future prevention and control of the disease.

  7. Molecular and phylogenetic analysis of HIV-1 variants circulating among injecting drug users in Mashhad-Iran

    Directory of Open Access Journals (Sweden)

    Buonaguro FM

    2006-09-01

    Full Text Available Abstract Genetic and phylogenetic information on the HIV-1 epidemic in Middle-East Countries, and in particular in Iran, are extremely limited. By March 2004, the Iranian Ministry of Health officially reported a cumulative number of 6'532 HIV positive individuals and 214 AIDS cases in the Iranian HIV-1 epidemic. The intra-venous drug users (IDUs represent the group at highest risk for HIV-1 infection in Iran, accounting for almost 63% of all HIV-infected population. In this regards, a molecular phylogenetic study has been performed on a sentinel cohort of HIV-1 seropositive IDUs enrolled at the end of 2005 at the University of Mashhad, the largest city North East of Tehran. The study has been performed on both gag and env subgenomic regions amplified by Polymerase Chain Reaction (PCR from peripheral blood mononuclear cells (PBMCs and characterized by direct DNA sequence analysis. The results reported here show that the HIV-1 subtype A is circulating in this IDUs sentinel cohort. Moreover, the single phylogenetic cluster as well as the intra-group low nucleotide divergence is indicative of a recent outbreak. Unexpectedly, the Iranian samples appear to be phylogenetically derived from African Sub-Saharan subtype A viruses, raising stirring speculations on HIV-1 introduction into the IDUs epidemic in Mashhad. This sentinel study could represent the starting point for a wider molecular survey of the HIV-1 epidemics in Iran to evaluate in detail the distribution of genetic subtypes and possible natural drug-resistant variants, which are extremely helpful information to design diagnostic and therapeutic strategies.

  8. Phylogenetic analysis of the sharpshooter genus Subrasaca Young, 1977 (Hemiptera, Cicadellidae, Cicadellini)

    Science.gov (United States)

    da Silva, Roberta dos Santos; Mejdalani, Gabriel; Cavichioli, Rodney R.

    2015-01-01

    Abstract The South American sharpshooter genus Subrasaca comprises 14 species. Some species of this genus are quite common in the Brazilian Atlantic Rainforest. In this paper, a phylogenetic analysis of Subrasaca, based on a matrix of 20 terminal taxa and 72 morphological characters of the head, thorax, and male and female genitalia, is presented. The analysis yielded six equally most parsimonious trees (197 steps, CI = 0.6091, RI = 0.5722, and RC = 0.3486). The results suggest that Subrasaca is a monophyletic taxon, although the genus branch is not robust. The clade showing the highest bootstrap and Bremer scores is formed by species with longitudinal dark brown to black stripes on the forewings (Subrasaca bimaculata, Subrasaca constricta, Subrasaca curvovittata, and Subrasaca flavolineata), followed by Subrasaca atronasa + Subrasaca austera. PMID:25829841

  9. Phylogenetic Analysis of the Bee Tribe Anthidiini | Combey | Journal ...

    African Journals Online (AJOL)

    The phylogenetic relationships among members of long tongue bee tribe Anthidiini (Megachilidae: Megachilinae) were investigated at the Department of Entomology and Wildlife, University of Cape Coast (Ghana) and the Agricultural Research Council, Pretoria (South Af-rica) from July, 2006 to May, 2007. Ten museums ...

  10. Phylogenetic analysis of Demodex caprae based on mitochondrial 16S rDNA sequence.

    Science.gov (United States)

    Zhao, Ya-E; Hu, Li; Ma, Jun-Xian

    2013-11-01

    Demodex caprae infests the hair follicles and sebaceous glands of goats worldwide, which not only seriously impairs goat farming, but also causes a big economic loss. However, there are few reports on the DNA level of D. caprae. To reveal the taxonomic position of D. caprae within the genus Demodex, the present study conducted phylogenetic analysis of D. caprae based on mt16S rDNA sequence data. D. caprae adults and eggs were obtained from a skin nodule of the goat suffering demodicidosis. The mt16S rDNA sequences of individual mite were amplified using specific primers, and then cloned, sequenced, and aligned. The sequence divergence, genetic distance, and transition/transversion rate were computed, and the phylogenetic trees in Demodex were reconstructed. Results revealed the 339-bp partial sequences of six D. caprae isolates were obtained, and the sequence identity was 100% among isolates. The pairwise divergences between D. caprae and Demodex canis or Demodex folliculorum or Demodex brevis were 22.2-24.0%, 24.0-24.9%, and 22.9-23.2%, respectively. The corresponding average genetic distances were 2.840, 2.926, and 2.665, and the average transition/transversion rates were 0.70, 0.55, and 0.54, respectively. The divergences, genetic distances, and transition/transversion rates of D. caprae versus the other three species all reached interspecies level. The five phylogenetic trees all presented that D. caprae clustered with D. brevis first, and then with D. canis, D. folliculorum, and Demodex injai in sequence. In conclusion, D. caprae is an independent species, and it is closer to D. brevis than to D. canis, D. folliculorum, or D. injai.

  11. Phylogenetic versus functional signals in the evolution of form-function relationships in terrestrial vision.

    Science.gov (United States)

    Motani, Ryosuke; Schmitz, Lars

    2011-08-01

    Phylogeny is deeply pertinent to evolutionary studies. Traits that perform a body function are expected to be strongly influenced by physical "requirements" of the function. We investigated if such traits exhibit phylogenetic signals, and, if so, how phylogenetic noises bias quantification of form-function relationships. A form-function system that is strongly influenced by physics, namely the relationship between eye morphology and visual optics in amniotes, was used. We quantified the correlation between form (i.e., eye morphology) and function (i.e., ocular optics) while varying the level of phylogenetic bias removal through adjusting Pagel's λ. Ocular soft-tissue dimensions exhibited the highest correlation with ocular optics when 1% of phylogenetic bias expected from Brownian motion was removed (i.e., λ= 0.01); the value for hard-tissue data were 8%. A small degree of phylogenetic bias therefore exists in morphology despite of the stringent functional constraints. We also devised a phylogenetically informed discriminant analysis and recorded the effects of phylogenetic bias on this method using the same data. Use of proper λ values during phylogenetic bias removal improved misidentification rates in resulting classifications when prior probabilities were assumed to be equal. Even a small degree of phylogenetic bias affected the classification resulting from phylogenetically informed discriminant analysis. © 2011 The Author(s). Evolution© 2011 The Society for the Study of Evolution.

  12. Nonbinary Tree-Based Phylogenetic Networks.

    Science.gov (United States)

    Jetten, Laura; van Iersel, Leo

    2018-01-01

    Rooted phylogenetic networks are used to describe evolutionary histories that contain non-treelike evolutionary events such as hybridization and horizontal gene transfer. In some cases, such histories can be described by a phylogenetic base-tree with additional linking arcs, which can, for example, represent gene transfer events. Such phylogenetic networks are called tree-based. Here, we consider two possible generalizations of this concept to nonbinary networks, which we call tree-based and strictly-tree-based nonbinary phylogenetic networks. We give simple graph-theoretic characterizations of tree-based and strictly-tree-based nonbinary phylogenetic networks. Moreover, we show for each of these two classes that it can be decided in polynomial time whether a given network is contained in the class. Our approach also provides a new view on tree-based binary phylogenetic networks. Finally, we discuss two examples of nonbinary phylogenetic networks in biology and show how our results can be applied to them.

  13. Sequence and 3D structure based analysis of TNT degrading proteins in Arabidopsis thaliana.

    Science.gov (United States)

    Bhattacherjee, Amrita; Mandal, Rahul Shubhra; Das, Santasabuj; Kundu, Sudip

    2014-03-01

    TNT, accidentally released at several manufacturing sites, contaminates ground water and soil. It has a toxic effect to algae and invertebrate, and chronic exposure to TNT also causes harmful effects to human. On the other hand, many plants including Arabidopsis thaliana have the ability to metabolize TNT either completely or at least to a reduced less toxic form. In A. thaliana, the enzyme UDP glucosyltransferase (UDPGT) can further conjugate the reduced forms 2-HADNT and 4-HADNT (2-hydroxylamino-4, 6- dinitrotoluene and 4-hydroxylamino-2, 6- dinitrotoluene) of TNT. Based on the experimental analysis, existing literature and phylogenetic analysis, it is evident that among 107 UDPGT proteins only six are involved in the TNT degrading process. A total of 13 UDPGT proteins including five of these TNT degrading proteins fall within the same group of phylogeny. Thus, these 13 UDPGT proteins have been classified into two groups, TNT-degrading and TNT-non-degrading proteins. To understand the differences in TNT-degrading capacities; using homology modeling we first predicted two structures, taking one representative sequence from both the groups. Next, we performed molecular docking of the modeled structure and TNT reduced form 2-hydroxylamino-4, 6- dinitrotoluene (2-HADNT). We observed that while the Trp residue located within the active site region of the TNT- degrading protein showed π-Cation interaction; such type of interaction was absent in TNT-non-degrading protein, as the respective Trp residue lay outside of the pocket in this case. We observed the conservation of this π-Cation interaction during MD simulation of TNT-degrading protein. Thus, the position and the orientation of the active site residue Trp could explain the presence and absence of TNT-degrading capacity of the UDPGT proteins.

  14. Middle Pleistocene protein sequences from the rhinoceros genus Stephanorhinus and the phylogeny of extant and extinct Middle/Late Pleistocene Rhinocerotidae.

    Science.gov (United States)

    Welker, Frido; Smith, Geoff M; Hutson, Jarod M; Kindler, Lutz; Garcia-Moreno, Alejandro; Villaluenga, Aritza; Turner, Elaine; Gaudzinski-Windheuser, Sabine

    2017-01-01

    Ancient protein sequences are increasingly used to elucidate the phylogenetic relationships between extinct and extant mammalian taxa. Here, we apply these recent developments to Middle Pleistocene bone specimens of the rhinoceros genus Stephanorhinus . No biomolecular sequence data is currently available for this genus, leaving phylogenetic hypotheses on its evolutionary relationships to extant and extinct rhinoceroses untested. Furthermore, recent phylogenies based on Rhinocerotidae (partial or complete) mitochondrial DNA sequences differ in the placement of the Sumatran rhinoceros ( Dicerorhinus sumatrensis ). Therefore, studies utilising ancient protein sequences from Middle Pleistocene contexts have the potential to provide further insights into the phylogenetic relationships between extant and extinct species, including Stephanorhinus and Dicerorhinus . ZooMS screening (zooarchaeology by mass spectrometry) was performed on several Late and Middle Pleistocene specimens from the genus Stephanorhinus , subsequently followed by liquid chromatography-tandem mass spectrometry (LC-MS/MS) to obtain ancient protein sequences from a Middle Pleistocene Stephanorhinus specimen. We performed parallel analysis on a Late Pleistocene woolly rhinoceros specimen and extant species of rhinoceroses, resulting in the availability of protein sequence data for five extant species and two extinct genera. Phylogenetic analysis additionally included all extant Perissodactyla genera ( Equus , Tapirus ), and was conducted using Bayesian (MrBayes) and maximum-likelihood (RAxML) methods. Various ancient proteins were identified in both the Middle and Late Pleistocene rhinoceros samples. Protein degradation and proteome complexity are consistent with an endogenous origin of the identified proteins. Phylogenetic analysis of informative proteins resolved the Perissodactyla phylogeny in agreement with previous studies in regards to the placement of the families Equidae, Tapiridae, and

  15. Avian papillomaviruses: the parrot Psittacus erithacus papillomavirus (PePV genome has a unique organization of the early protein region and is phylogenetically related to the chaffinch papillomavirus

    Directory of Open Access Journals (Sweden)

    Jenson A Bennett

    2002-07-01

    Full Text Available Abstract Background An avian papillomavirus genome has been cloned from a cutaneous exophytic papilloma from an African grey parrot (Psittacus erithacus. The nucleotide sequence, genome organization, and phylogenetic position of the Psittacus erithacus papillomavirus (PePV were determined. This PePV sequence represents the first complete avian papillomavirus genome defined. Results The PePV genome (7304 basepairs differs from other papillomaviruses, in that it has a unique organization of the early protein region lacking classical E6 and E7 open reading frames. Phylogenetic comparison of the PePV sequence with partial E1 and L1 sequences of the chaffinch (Fringilla coelebs papillomavirus (FPV reveals that these two avian papillomaviruses form a monophyletic cluster with a common branch that originates near the unresolved center of the papillomavirus evolutionary tree. Conclusions The PePV genome has a unique layout of the early protein region which represents a novel prototypic genomic organization for avian papillomaviruses. The close relationship between PePV and FPV, and between their Psittaciformes and Passeriformes hosts, supports the hypothesis that papillomaviruses have co-evolved and speciated together with their host species throughout evolution.

  16. MulRF: a software package for phylogenetic analysis using multi-copy gene trees.

    Science.gov (United States)

    Chaudhary, Ruchi; Fernández-Baca, David; Burleigh, John Gordon

    2015-02-01

    MulRF is a platform-independent software package for phylogenetic analysis using multi-copy gene trees. It seeks the species tree that minimizes the Robinson-Foulds (RF) distance to the input trees using a generalization of the RF distance to multi-labeled trees. The underlying generic tree distance measure and fast running time make MulRF useful for inferring phylogenies from large collections of gene trees, in which multiple evolutionary processes as well as phylogenetic error may contribute to gene tree discord. MulRF implements several features for customizing the species tree search and assessing the results, and it provides a user-friendly graphical user interface (GUI) with tree visualization. The species tree search is implemented in C++ and the GUI in Java Swing. MulRF's executable as well as sample datasets and manual are available at http://genome.cs.iastate.edu/CBL/MulRF/, and the source code is available at https://github.com/ruchiherself/MulRFRepo. ruchic@ufl.edu Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  17. A comparative phylogenetic analysis of full-length mariner elements

    Indian Academy of Sciences (India)

    Mariner like elements (MLEs) are widely distributed type II transposons with an open reading frame (ORF) for transposase. We studied comparative phylogenetic evolution and inverted terminal repeat (ITR) conservation of MLEs from Indian saturniid silkmoth, Antheraea mylitta with other full length MLEs submitted in the ...

  18. The complete chloroplast genome sequence of Dodonaea viscosa: comparative and phylogenetic analyses.

    Science.gov (United States)

    Saina, Josphat K; Gichira, Andrew W; Li, Zhi-Zhong; Hu, Guang-Wan; Wang, Qing-Feng; Liao, Kuo

    2018-02-01

    The plant chloroplast (cp) genome is a highly conserved structure which is beneficial for evolution and systematic research. Currently, numerous complete cp genome sequences have been reported due to high throughput sequencing technology. However, there is no complete chloroplast genome of genus Dodonaea that has been reported before. To better understand the molecular basis of Dodonaea viscosa chloroplast, we used Illumina sequencing technology to sequence its complete genome. The whole length of the cp genome is 159,375 base pairs (bp), with a pair of inverted repeats (IRs) of 27,099 bp separated by a large single copy (LSC) 87,204 bp, and small single copy (SSC) 17,972 bp. The annotation analysis revealed a total of 115 unique genes of which 81 were protein coding, 30 tRNA, and four ribosomal RNA genes. Comparative genome analysis with other closely related Sapindaceae members showed conserved gene order in the inverted and single copy regions. Phylogenetic analysis clustered D. viscosa with other species of Sapindaceae with strong bootstrap support. Finally, a total of 249 SSRs were detected. Moreover, a comparison of the synonymous (Ks) and nonsynonymous (Ka) substitution rates in D. viscosa showed very low values. The availability of cp genome reported here provides a valuable genetic resource for comprehensive further studies in genetic variation, taxonomy and phylogenetic evolution of Sapindaceae family. In addition, SSR markers detected will be used in further phylogeographic and population structure studies of the species in this genus.

  19. Molecular characterization and phylogenetic relationships among ...

    African Journals Online (AJOL)

    Molecular characterization and phylogenetic relationships among and within species of Phalaenopsis (Epidendroideae: Orchidaceae) based on RAPD analysis. ... Ph. parishii, Ph. labbi nepal, Ph. speciosa, Ph. lobbi yellow, Ph. venosa, Ph. hieroglyphica, and Ph. maculata; the third group consisted of Ph. minho princess, ...

  20. Phylogenetic signals in the climatic niches of the world's amphibians

    DEFF Research Database (Denmark)

    Hof, Christian; Rahbek, Carsten; Araújo, Miguel B.

    2010-01-01

    amphibian orders and across biogeographical regions. To our knowledge, this is the first study providing a comprehensive analysis of the phylogenetic signal in species climatic niches for an entire clade across the world. Even though our results do not provide a strong test of the niche conservatism......The question of whether closely related species share similar ecological requirements has attracted increasing attention, because of its importance for understanding global diversity gradients and the impacts of climate change on species distributions. In fact, the assumption that related species...... are also ecologically similar has often been made, although the prevalence of such a phylogenetic signal in ecological niches remains heavily debated. Here, we provide a global analysis of phylogenetic niche relatedness for the world's amphibians. In particular, we assess which proportion of the variance...

  1. Phylogenetic analysis of emergent Streptococcus pneumoniae serotype 22F causing invasive pneumococcal disease using whole genome sequencing.

    Directory of Open Access Journals (Sweden)

    Walter H B Demczuk

    Full Text Available Since implementation of the 13-valent polyvalent conjugate vaccine (PCV13 in Canada during 2010, the proportion of PCV13 serotypes causing invasive pneumococcal disease (IPD has declined from 55% (n = 1492 in 2010 to 31% (n = 764 in 2014. A concurrent increase of non-PCV13 serotypes has occurred and 22F has become the most prevalent serotype in Canada increasing from 7% (n = 183 to 11% (n = 283. Core single nucleotide variant phylogenetic analysis was performed on 137 Streptococcus pneumoniae serotype 22F isolates collected across Canada from 2005-2015. Six phylogenetic lineages (n = 117 were identified among a serotype 22F/ST433 clonal complex (CC, including a recently expanding erythromycin-resistant clone. Erythromycin-resistance was observed in 25 isolates possessing ermB, mef or a 23S rRNA A2061G point mutation; 2 penicillin-resistant isolates had recombinant pbp1a, pbp2a and/or pbp2x; 3 tetracycline-resistant isolates contained tetM; and 1 isolate was multidrug-resistant. Virulence factor analysis indicated a high level of homogeneity among the 22F/ST433 clonal complex strains. A group of 6 phylogenetic outlier strains had differing MLST, antimicrobial resistance and molecular profiles suggestive of capsule switching events. While capsule switch events among S. pneumoniae serotype 22F has been observed, increasing prevalence of S. pneumoniae serotype 22F can be attributed to an evolving homogenous clone expanding nationally through local transmission events.

  2. Discrimination and chemical phylogenetic study of seven species of Dendrobium using infrared spectroscopy combined with cluster analysis

    Science.gov (United States)

    Luo, Congpei; He, Tao; Chun, Ze

    2013-04-01

    Dendrobium is a commonly used and precious herb in Traditional Chinese Medicine. The high biodiversity of Dendrobium and the therapeutic needs require tools for the correct and fast discrimination of different Dendrobium species. This study investigates Fourier transform infrared spectroscopy followed by cluster analysis for discrimination and chemical phylogenetic study of seven Dendrobium species. Despite the general pattern of the IR spectra, different intensities, shapes, peak positions were found in the IR spectra of these samples, especially in the range of 1800-800 cm-1. The second derivative transformation and alcoholic extracting procedure obviously enlarged the tiny spectral differences among these samples. The results indicated each Dendrobium species had a characteristic IR spectra profile, which could be used to discriminate them. The similarity coefficients among the samples were analyzed based on their second derivative IR spectra, which ranged from 0.7632 to 0.9700, among the seven Dendrobium species, and from 0.5163 to 0.9615, among the ethanol extracts. A dendrogram was constructed based on cluster analysis the IR spectra for studying the chemical phylogenetic relationships among the samples. The results indicated that D. denneanum and D. crepidatum could be the alternative resources to substitute D. chrysotoxum, D. officinale and D. nobile which were officially recorded in Chinese Pharmacopoeia. In conclusion, with the advantages of high resolution, speediness and convenience, the experimental approach can successfully discriminate and construct the chemical phylogenetic relationships of the seven Dendrobium species.

  3. Analyzing Phylogenetic Trees with Timed and Probabilistic Model Checking: The Lactose Persistence Case Study.

    Science.gov (United States)

    Requeno, José Ignacio; Colom, José Manuel

    2014-12-01

    Model checking is a generic verification technique that allows the phylogeneticist to focus on models and specifications instead of on implementation issues. Phylogenetic trees are considered as transition systems over which we interrogate phylogenetic questions written as formulas of temporal logic. Nonetheless, standard logics become insufficient for certain practices of phylogenetic analysis since they do not allow the inclusion of explicit time and probabilities. The aim of this paper is to extend the application of model checking techniques beyond qualitative phylogenetic properties and adapt the existing logical extensions and tools to the field of phylogeny. The introduction of time and probabilities in phylogenetic specifications is motivated by the study of a real example: the analysis of the ratio of lactose intolerance in some populations and the date of appearance of this phenotype.

  4. Visualizing phylogenetic tree landscapes.

    Science.gov (United States)

    Wilgenbusch, James C; Huang, Wen; Gallivan, Kyle A

    2017-02-02

    Genomic-scale sequence alignments are increasingly used to infer phylogenies in order to better understand the processes and patterns of evolution. Different partitions within these new alignments (e.g., genes, codon positions, and structural features) often favor hundreds if not thousands of competing phylogenies. Summarizing and comparing phylogenies obtained from multi-source data sets using current consensus tree methods discards valuable information and can disguise potential methodological problems. Discovery of efficient and accurate dimensionality reduction methods used to display at once in 2- or 3- dimensions the relationship among these competing phylogenies will help practitioners diagnose the limits of current evolutionary models and potential problems with phylogenetic reconstruction methods when analyzing large multi-source data sets. We introduce several dimensionality reduction methods to visualize in 2- and 3-dimensions the relationship among competing phylogenies obtained from gene partitions found in three mid- to large-size mitochondrial genome alignments. We test the performance of these dimensionality reduction methods by applying several goodness-of-fit measures. The intrinsic dimensionality of each data set is also estimated to determine whether projections in 2- and 3-dimensions can be expected to reveal meaningful relationships among trees from different data partitions. Several new approaches to aid in the comparison of different phylogenetic landscapes are presented. Curvilinear Components Analysis (CCA) and a stochastic gradient decent (SGD) optimization method give the best representation of the original tree-to-tree distance matrix for each of the three- mitochondrial genome alignments and greatly outperformed the method currently used to visualize tree landscapes. The CCA + SGD method converged at least as fast as previously applied methods for visualizing tree landscapes. We demonstrate for all three mtDNA alignments that 3D

  5. On the use of cartographic projections in visualizing phylo-genetic tree space

    Directory of Open Access Journals (Sweden)

    Clement Mark

    2010-06-01

    Full Text Available Abstract Phylogenetic analysis is becoming an increasingly important tool for biological research. Applications include epidemiological studies, drug development, and evolutionary analysis. Phylogenetic search is a known NP-Hard problem. The size of the data sets which can be analyzed is limited by the exponential growth in the number of trees that must be considered as the problem size increases. A better understanding of the problem space could lead to better methods, which in turn could lead to the feasible analysis of more data sets. We present a definition of phylogenetic tree space and a visualization of this space that shows significant exploitable structure. This structure can be used to develop search methods capable of handling much larger data sets.

  6. Inclusion of chloroplast genes that have undergone expansion misleads phylogenetic reconstruction in the Chlorophyta.

    Science.gov (United States)

    Novis, Phil M; Smissen, Rob; Buckley, Thomas R; Gopalakrishnan, Kishore; Visnovsky, Gabriel

    2013-11-01

    Chlorophytes comprise a substantial proportion of green plant diversity. However, sister-group relationships and circumscription of the classes Chlorophyceae, Trebouxiophyceae, and Ulvophyceae have been problematic to resolve. Some analyses support a sister relationship between the trebouxiophycean Leptosira and chlorophyceans, potentially altering the circumscription of two classes, also supported by a shared fragmentation in the chloroplast gene rpoB. We sought to determine whether the latter is a synapomorphy or whether the supporting analyses are vulnerable to systematic bias. We sequenced a portion of rpoB spanning the fragmented region in strains for which it had not previously been sampled: four Chlorophyceae, six counterclockwise (CCW) group (ulvophyceans and trebouxiophyceans) and one streptophyte. We then explored the effect of subsampling proteins and taxa on phylogenetic reconstruction from a data set of 41 chloroplast proteins. None of the CCW or streptophyte strains possessed the split in rpoB, including inferred near relatives of Leptosira, but it was found in all chlorophycean strains. We reconstructed alternative phylogenies (Leptosira + Chlorophyceae and Leptosira + Chlorellales) using two different protein groups (Rpo and Rps), both subject to coding-region expansion. A conserved region of RpoB remained suitable for analysis of more recent divergences. The Rps sequences can explain earlier findings linking Leptosira with the Chlorophyceae and should be excluded from phylogenetic analyses attempting to resolve deep nodes because their expansion violates the assumptions of substitution models. We reaffirm that Leptosira is a trebouxiophycean and that fragmentation of rpoB has occurred at least twice in chlorophyte evolution.

  7. Solving Classification Problems for Large Sets of Protein Sequences with the Example of Hox and ParaHox Proteins

    Directory of Open Access Journals (Sweden)

    Stefanie D. Hueber

    2016-02-01

    Full Text Available Phylogenetic methods are key to providing models for how a given protein family evolved. However, these methods run into difficulties when sequence divergence is either too low or too high. Here, we provide a case study of Hox and ParaHox proteins so that additional insights can be gained using a new computational approach to help solve old classification problems. For two (Gsx and Cdx out of three ParaHox proteins the assignments differ between the currently most established view and four alternative scenarios. We use a non-phylogenetic, pairwise-sequence-similarity-based method to assess which of the previous predictions, if any, are best supported by the sequence-similarity relationships between Hox and ParaHox proteins. The overall sequence-similarities show Gsx to be most similar to Hox2–3, and Cdx to be most similar to Hox4–8. The results indicate that a purely pairwise-sequence-similarity-based approach can provide additional information not only when phylogenetic inference methods have insufficient information to provide reliable classifications (as was shown previously for central Hox proteins, but also when the sequence variation is so high that the resulting phylogenetic reconstructions are likely plagued by long-branch-attraction artifacts.

  8. An autochthonous case of hepatitis C virus genotype 5a in Brazil: phylogenetic analysis

    DEFF Research Database (Denmark)

    Ribeiro, L.C.; Souto, F.J.D.; do Espirito-Santo, M.P.

    2009-01-01

    Genotype 5 of hepatitis C virus (HCV) has been rarely identified in South America. A female of African descent who never left Brazil was found to be infected by this genotype in Mato Grosso state, Central Brazil. The patient denied drug injections and revealed that she had received blood...... transfusions several years before. One of her blood donors was identified and tested negative for anti-HCV and HCV RNA, as were her husband and offspring. Phylogenetic analysis of the E1 and NS5B regions confirmed that this HCV strain belonged to genotype 5a. However, the E1 region analysis indicates that our...

  9. Partial characterization of Maize rayado fino virus isolates from Ecuador: phylogenetic analysis supports a Central American origin of the virus.

    Science.gov (United States)

    Chicas, Mauricio; Caviedes, Mario; Hammond, Rosemarie; Madriz, Kenneth; Albertazzi, Federico; Villalobos, Heydi; Ramírez, Pilar

    2007-06-01

    Maize rayado fino virus (MRFV) infects maize and appears to be restricted to, yet widespread in, the Americas. MRFV was previously unreported from Ecuador. Maize plants exhibiting symptoms of MRFV infection were collected at the Santa Catalina experiment station in Quito, Ecuador. RT-PCR reactions were performed on total RNA extracted from the symptomatic leaves using primers specific for the capsid protein (CP) gene and 3' non-translated region of MRFV and first strand cDNA as a template. Nucleotide sequence comparisons to previously sequenced MRFV isolates from other geographic regions revealed 88-91% sequence identity. Phylogenetic trees constructed using Maximum Likelihood, UPGMA, Minimal Evolution, Neighbor Joining, and Maximum Parsimony methods separated the MRFV isolates into four groups. These groups may represent geographic isolation generated by the mountainous chains of the American continent. Analysis of the sequences and the genetic distances among the different isolates suggests that MRFV may have originated in Mexico and/or Guatemala and from there it dispersed to the rest of the Americas.

  10. Phylogenetic paleobiogeography of Late Ordovician Laurentian brachiopods

    Directory of Open Access Journals (Sweden)

    Jennifer E. Bauer

    2014-12-01

    Full Text Available Phylogenetic biogeographic analysis of four brachiopod genera was used to uncover large-scale geologic drivers of Late Ordovician biogeographic differentiation in Laurentia. Previously generated phylogenetic hypotheses were converted into area cladograms, ancestral geographic ranges were optimized and speciation events characterized as via dispersal or vicariance, when possible. Area relationships were reconstructed using Lieberman-modified Brooks Parsimony Analysis. The resulting area cladograms indicate tectonic and oceanographic changes were the primary geologic drivers of biogeographic patterns within the focal taxa. The Taconic tectophase contributed to the separation of the Appalachian and Central basins as well as the two midcontinent basins, whereas sea level rise following the Boda Event promoted interbasinal dispersal. Three migration pathways into the Cincinnati Basin were recognized, which supports the multiple pathway hypothesis for the Richmondian Invasion.

  11. Phylogenetic Inference of HIV Transmission Clusters

    Directory of Open Access Journals (Sweden)

    Vlad Novitsky

    2017-10-01

    Full Text Available Better understanding the structure and dynamics of HIV transmission networks is essential for designing the most efficient interventions to prevent new HIV transmissions, and ultimately for gaining control of the HIV epidemic. The inference of phylogenetic relationships and the interpretation of results rely on the definition of the HIV transmission cluster. The definition of the HIV cluster is complex and dependent on multiple factors, including the design of sampling, accuracy of sequencing, precision of sequence alignment, evolutionary models, the phylogenetic method of inference, and specified thresholds for cluster support. While the majority of studies focus on clusters, non-clustered cases could also be highly informative. A new dimension in the analysis of the global and local HIV epidemics is the concept of phylogenetically distinct HIV sub-epidemics. The identification of active HIV sub-epidemics reveals spreading viral lineages and may help in the design of targeted interventions.HIVclustering can also be affected by sampling density. Obtaining a proper sampling density may increase statistical power and reduce sampling bias, so sampling density should be taken into account in study design and in interpretation of phylogenetic results. Finally, recent advances in long-range genotyping may enable more accurate inference of HIV transmission networks. If performed in real time, it could both inform public-health strategies and be clinically relevant (e.g., drug-resistance testing.

  12. Comparative genome analysis reveals a conserved family of actin-like proteins in apicomplexan parasites

    Directory of Open Access Journals (Sweden)

    Sibley L David

    2005-12-01

    Full Text Available Abstract Background The phylum Apicomplexa is an early-branching eukaryotic lineage that contains a number of important human and animal pathogens. Their complex life cycles and unique cytoskeletal features distinguish them from other model eukaryotes. Apicomplexans rely on actin-based motility for cell invasion, yet the regulation of this system remains largely unknown. Consequently, we focused our efforts on identifying actin-related proteins in the recently completed genomes of Toxoplasma gondii, Plasmodium spp., Cryptosporidium spp., and Theileria spp. Results Comparative genomic and phylogenetic studies of apicomplexan genomes reveals that most contain only a single conventional actin and yet they each have 8–10 additional actin-related proteins. Among these are a highly conserved Arp1 protein (likely part of a conserved dynactin complex, and Arp4 and Arp6 homologues (subunits of the chromatin-remodeling machinery. In contrast, apicomplexans lack canonical Arp2 or Arp3 proteins, suggesting they lost the Arp2/3 actin polymerization complex on their evolutionary path towards intracellular parasitism. Seven of these actin-like proteins (ALPs are novel to apicomplexans. They show no phylogenetic associations to the known Arp groups and likely serve functions specific to this important group of intracellular parasites. Conclusion The large diversity of actin-like proteins in apicomplexans suggests that the actin protein family has diverged to fulfill various roles in the unique biology of intracellular parasites. Conserved Arps likely participate in vesicular transport and gene expression, while apicomplexan-specific ALPs may control unique biological traits such as actin-based gliding motility.

  13. An improved model for whole genome phylogenetic analysis by Fourier transform.

    Science.gov (United States)

    Yin, Changchuan; Yau, Stephen S-T

    2015-10-07

    and demonstrates that the improved DFT dissimilarity measure is an efficient and effective similarity measure of DNA sequences. Due to its high efficiency and accuracy, the proposed DFT similarity measure is successfully applied on phylogenetic analysis for individual genes and large whole bacterial genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.

  14. Comparative analysis of DNA polymorphisms and phylogenetic relationships among Syzygium cumini Skeels based on phenotypic characters and RAPD technique.

    Science.gov (United States)

    Singh, Jitendra P; Singh, Ak; Bajpai, Anju; Ahmad, Iffat Zareen

    2014-01-01

    The Indian black berry (Syzygium cumini Skeels) has a great nutraceutical and medicinal properties. As in other fruit crops, the fruit characteristics are important attributes for differentiation were also determined for different accessions of S. cumini. The fruit weight, length, breadth, length: breadth ratio, pulp weight, pulp content, seed weight and pulp: seed ratio significantly varied in different accessions. Molecular characterization was carried out using PCR based RAPD technique. Out of 80 RAPD primers, only 18 primers produced stable polymorphisms that were used to examine the phylogenetic relationship. A sum of 207 loci were generated out of which 201 loci found polymorphic. The average genetic dissimilarity was 97 per cent among jamun accessions. The phylogenetic relationship was also determined by principal coordinates analysis (PCoA) that explained 46.95 per cent cumulative variance. The two-dimensional PCoA analysis showed grouping of the different accessions that were plotted into four sub-plots, representing clustering of accessions. The UPGMA (r = 0.967) and NJ (r = 0.987) dendrogram constructed based on the dissimilarity matrix revealed a good degree of fit with the cophenetic correlation value. The dendrogram grouped the accessions into three main clusters according to their eco-geographical regions which given useful insight into their phylogenetic relationships.

  15. Expression and phylogenetic analyses of the Gel/Gas proteins of Tuber melanosporum provide insights into the function and evolution of glucan remodeling enzymes in fungi.

    Science.gov (United States)

    Sillo, Fabiano; Gissi, Carmela; Chignoli, Daniele; Ragni, Enrico; Popolo, Laura; Balestrini, Raffaella

    2013-04-01

    The β(1,3)-glucanosyltransferases of the GH72 family are redundant enzymes that are essential for the formation and dynamic remodeling of the fungal wall during different stages of the life cycle. Four putative genes encoding glycosylphosphatidylinositol (GPI)-anchored β(1,3)-glucanosyltransferases, designated TmelGEL1, TmelGEL2, TmelGEL4 and TmelGAS4, have been annotated in the genome of Tuber melanosporum, an ectomycorrhizal fungus that also produces a hypogeous fruiting body (FB) of great commercial value (black truffle). This work focuses on the characterization and expression of this multigene family by taking advantage of a laser microdissection (LMD) technology that has been used to separate two distinct compartments in the FB, the hyphae and the asci containing the ascospores. Of the four genes, TmelGEL1 was the most up-regulated in the FB compared to the free-living mycelium. Inside the FB, the expression of TmelGEL1 was restricted to the hyphal compartment. A phylogenetic analysis of the Gel/Gas protein family of T. melanosporum was also carried out. A total of 237 GH72 proteins from 51 Ascomycotina and 3 Basidiomycota (outgroup) species were analyzed. The resulting tree provides insight into the evolution of the T. melanosporum proteins and identifies new GH72 paralogs/subfamilies. Moreover, it represents a starting point to formulate new hypotheses on the significance of the striking GH72 gene redundancy in fungal biology. Copyright © 2013 Elsevier Inc. All rights reserved.

  16. Acquired Thermotolerance and Heat Shock Proteins in Thermophiles from the Three Phylogenetic Domains

    DEFF Research Database (Denmark)

    Trent, Jonathan D.; Gabrielsen, Mette; Jensen, Bo

    1994-01-01

    Thermophilic organisms from each of the three phylogenetic domains (Bacteria, Archaea, and Eucarya) acquired thermotolerance after heat shock. Bacillus caldolyticus grown at 60 degrees C and heat shocked at 69 degrees C for 10 min showed thermotolerance at 74 degrees C, Sulfolobus shibatae grown...

  17. Transforming phylogenetic networks: Moving beyond tree space

    OpenAIRE

    Huber, Katharina T.; Moulton, Vincent; Wu, Taoyang

    2016-01-01

    Phylogenetic networks are a generalization of phylogenetic trees that are used to represent reticulate evolution. Unrooted phylogenetic networks form a special class of such networks, which naturally generalize unrooted phylogenetic trees. In this paper we define two operations on unrooted phylogenetic networks, one of which is a generalization of the well-known nearest-neighbor interchange (NNI) operation on phylogenetic trees. We show that any unrooted phylogenetic network can be transforme...

  18. Adaptive Evolution of Eel Fluorescent Proteins from Fatty Acid Binding Proteins Produces Bright Fluorescence in the Marine Environment.

    Directory of Open Access Journals (Sweden)

    David F Gruber

    Full Text Available We report the identification and characterization of two new members of a family of bilirubin-inducible fluorescent proteins (FPs from marine chlopsid eels and demonstrate a key region of the sequence that serves as an evolutionary switch from non-fluorescent to fluorescent fatty acid-binding proteins (FABPs. Using transcriptomic analysis of two species of brightly fluorescent Kaupichthys eels (Kaupichthys hyoproroides and Kaupichthys n. sp., two new FPs were identified, cloned and characterized (Chlopsid FP I and Chlopsid FP II. We then performed phylogenetic analysis on 210 FABPs, spanning 16 vertebrate orders, and including 163 vertebrate taxa. We show that the fluorescent FPs diverged as a protein family and are the sister group to brain FABPs. Our results indicate that the evolution of this family involved at least three gene duplication events. We show that fluorescent FABPs possess a unique, conserved tripeptide Gly-Pro-Pro sequence motif, which is not found in non-fluorescent fatty acid binding proteins. This motif arose from a duplication event of the FABP brain isoforms and was under strong purifying selection, leading to the classification of this new FP family. Residues adjacent to the motif are under strong positive selection, suggesting a further refinement of the eel protein's fluorescent properties. We present a phylogenetic reconstruction of this emerging FP family and describe additional fluorescent FABP members from groups of distantly related eels. The elucidation of this class of fish FPs with diverse properties provides new templates for the development of protein-based fluorescent tools. The evolutionary adaptation from fatty acid-binding proteins to fluorescent fatty acid-binding proteins raises intrigue as to the functional role of bright green fluorescence in this cryptic genus of reclusive eels that inhabit a blue, nearly monochromatic, marine environment.

  19. Isolation and phylogenetic characterization of Canine distemper virus from India.

    Science.gov (United States)

    Swati; Deka, Dipak; Uppal, Sanjeev Kumar; Verma, Ramneek

    2015-09-01

    Canine distemper (CD), caused by canine distemper virus (CDV) is a highly contagious disease that infects a variety of carnivores. Sequence analysis of CDVs from different geographical areas has shown a lot of variation in the genome of the virus especially in haemagglutinin gene which might be one of the causes of vaccine failure. In this study, we isolated the virus (place: Ludhiana, Punjab; year: 2014) and further cloned, sequenced and analyzed partial haemagglutinin (H) gene and full length genes for fusion protein (F), phosphoprotein (P) and matrix protein (M) from an Indian wild-type CDV. Higher sequence homology was observed with the strains from Switzerland, Hungary, Germany; and lower with the vaccine strains like Ondersteport, CDV3, Convac for all the genes. The multiple sequence alignment showed more variation in partial H (45 nucleotide and 5 amino acid substitutions) and complete F (79 nucleotide and 30 amino acid substitutions) than in complete P (44 nucleotide and 22 amino acid substitutions) and complete M (22 nucleotide and 4 amino acid substitutions) gene/protein. Predicted potential N-linked glycosylation sites in H, F, M and P proteins were similar to the previously known wild-type CDVs but different from the vaccine strains. The Indian CDV formed a distinct clade in the phylogenetic tree clearly separated from the previously known wild-type and vaccine strains.

  20. Bioinformatic Analysis of Strawberry GSTF12 Gene

    Science.gov (United States)

    Wang, Xiran; Jiang, Leiyu; Tang, Haoru

    2018-01-01

    GSTF12 has always been known as a key factor of proanthocyanins accumulate in plant testa. Through bioinformatics analysis of the nucleotide and encoded protein sequence of GSTF12, it is more advantageous to the study of genes related to anthocyanin biosynthesis accumulation pathway. Therefore, we chosen GSTF12 gene of 11 kinds species, downloaded their nucleotide and protein sequence from NCBI as the research object, found strawberry GSTF12 gene via bioinformation analyse, constructed phylogenetic tree. At the same time, we analysed the strawberry GSTF12 gene of physical and chemical properties and its protein structure and so on. The phylogenetic tree showed that Strawberry and petunia were closest relative. By the protein prediction, we found that the protein owed one proper signal peptide without obvious transmembrane regions.

  1. Phylogenetic and functional analysis of the bacteriophage P1 single-stranded DNA-binding protein

    DEFF Research Database (Denmark)

    Bendtsen, Jannick Dyrløv; Nilsson, A.S.; Lehnherr, H.

    2002-01-01

    and does not represent a recent acquirement of the phage. The P1 and E. coli SSB proteins are fully functionally interchangeable. SSB-P1 is nonessential for phage growth in an exponentially growing E. coli host, and it is sufficient to promote bacterial growth in the absence of the E. coli SSB protein....... Expression studies showed that the P1 ssb gene is transcribed only, in an rpoS-independent fashion, during stationary-phase growth in E. coli. Mixed infection experiments demonstrated that a wild-type phage has a selective advantage over an ssb-null mutant when exposed to a bacterial host in the stationary...

  2. Phylogenetic Analysis of C Type Lectin from Toxocara canis Infective Larvae and Comparison with the C Type Lectin Fam-ily in the Immune System of Mouse and Human

    Directory of Open Access Journals (Sweden)

    Fazeleh ETEBAR

    2018-03-01

    Full Text Available Background: C type lectin (CTL family is a type of calcium-dependent proteins found in vertebrates and invertebrates. The objective of this study was to perform a comparative analysis and phylogenetic inferring for understanding the similarities and differences of carbohydrate recognition domain (CRD domain of Toxocara canis CTL and other nematodes, and similar C type lectin involved in the immune system of mouse and human as their host.Methods: The female T. canis was retrieved from the 2-6 months puppies (Department of Parasitology, Faculty of Veterinary Medicine, University of Tehran, 2015. To collect T. canis eggs, the worms were cultured for 5 d until they were embryonated. The hatching process was accelerated for collecting the stage 2 larvae, and the larvae were cultured for a week. A cDNA library was made from the total mRNA of T. canis infective larvae. The PCR amplification for C type lectin gene was performed and the amino acids were analyzed using the alignment method and the construction of phylogenetic tree.Results: The suspension sample maintained at 30 ºC for four weeks could embryonate 90%-100% of eggs. T. canis CTL gene was 657 bp in length and encoded a protein with 219 amino acids. The CTL of species of Strongylida order were closely placed in the tree, whereas the members of Ascaridida orders were located in a separate branch. High levels of similarity (36%-44% and conservation of C type lectin from T. canis with mouse and human C type lectins. Its C type lectin showed a higher similarity with asialoglycoprotein receptor (ASGPR, macrophage lectin, dendritic cell-specific intercellular adhesion molecule 3-grabbing nonintegrin (DC-SIGN, MINCLE receptor of mouse and human.Conclusion: Analysis of CRD domain of C type lectin protein could make a better understanding of their role in the interaction of nematode parasite with their hosts.

  3. Phylogenetic analysis reveals two genotypes of the emerging fungus Mucor indicus, an opportunistic human pathogen in immunocompromised patients

    NARCIS (Netherlands)

    Taj-Aldeen, Saad J.; Almaslamani, Muna; Theelen, B.J.F.; Boekhout, Teun

    2017-01-01

    Mucormycosis is a rare fungal infection caused by Mucor indicus. Phylogenetic analysis of many M. indicus isolates, mainly sampled from different clinical and environmental specimens collected worldwide, revealed two genotypes, I and II, based on ITS and D1/D2 LSU rDNA sequences. A retrospective

  4. Large-scale inference of gene function through phylogenetic annotation of Gene Ontology terms: case study of the apoptosis and autophagy cellular processes.

    Science.gov (United States)

    Feuermann, Marc; Gaudet, Pascale; Mi, Huaiyu; Lewis, Suzanna E; Thomas, Paul D

    2016-01-01

    We previously reported a paradigm for large-scale phylogenomic analysis of gene families that takes advantage of the large corpus of experimentally supported Gene Ontology (GO) annotations. This 'GO Phylogenetic Annotation' approach integrates GO annotations from evolutionarily related genes across ∼100 different organisms in the context of a gene family tree, in which curators build an explicit model of the evolution of gene functions. GO Phylogenetic Annotation models the gain and loss of functions in a gene family tree, which is used to infer the functions of uncharacterized (or incompletely characterized) gene products, even for human proteins that are relatively well studied. Here, we report our results from applying this paradigm to two well-characterized cellular processes, apoptosis and autophagy. This revealed several important observations with respect to GO annotations and how they can be used for function inference. Notably, we applied only a small fraction of the experimentally supported GO annotations to infer function in other family members. The majority of other annotations describe indirect effects, phenotypes or results from high throughput experiments. In addition, we show here how feedback from phylogenetic annotation leads to significant improvements in the PANTHER trees, the GO annotations and GO itself. Thus GO phylogenetic annotation both increases the quantity and improves the accuracy of the GO annotations provided to the research community. We expect these phylogenetically based annotations to be of broad use in gene enrichment analysis as well as other applications of GO annotations.Database URL: http://amigo.geneontology.org/amigo. © The Author(s) 2016. Published by Oxford University Press.

  5. Short communication. Genotyping and phylogenetic analysis of bovine viral diarrhea virus (BVDV) isolates in Kosovo

    OpenAIRE

    Izedin Goga; Kristaq Berxholi; Beqe Hulaj; Driton Sylejmani; Boris Yakobson; Yehuda Stram

    2014-01-01

    Three serum samples positive in Antigen ELISA BVDV have been tested to characterise genetic diversity of bovine viral diarrhea virus (BVDV) in Kosovo. Samples were obtained in 2011 from heifers and were amplified by reverse transcription-polymerase chain reaction, sequenced and analysed by computer-assisted phylogenetic analysis. Amplified products and nucleotide sequence showed that all 3 isolates belonged to BVDV 1 genotype and 1b sub genotype. These results enrich the extant knowledge of B...

  6. Phylogenetic and Pathotypic Characterization of a Newcastle Disease Virus Strain Isolated from Ducks and Pigeons in Hubei, China

    Directory of Open Access Journals (Sweden)

    Y Wang

    Full Text Available ABSTRACT Newcastle disease is a highly contagious disease responsible for major outbreaks and considerable economic losses in the poultry industry in China. There is still little information available regarding gene characterization of the NDV, especially in ducks and pigeons. Therefore, the aim of this study was to investigate NDV isolated from ducks and pigeons in Hubei, China. In this study, three NDVs from ducks and pigeons were isolated between 2013 and 2015.The fusion protein (F gene of the NDV isolates was sequenced and phylogenetically analyzed. The clinical signs and gross histopathological lesions were examined. Phylogenetic analysis of these strains indicated that all the sequences are classified as genotype II. The isolates shared a 112 G-R-Q-G-R-L 117motif at the F protein cleavage site, indicating that these three isolates strains are lentogenic. Necropsy and histopathology showed the typical pathological changes. It was concluded that commercial ducks and pigeons in Hubei province carry lentogenic NDV strains with regular genetic divergence, indicating that these species may act as the main reservoirs of NDV in poultry. Therefore, strategies and surveillance should be undertaken to reduce the risk of ND outbreaks.

  7. Using genes as characters and a parsimony analysis to explore the phylogenetic position of turtles.

    Directory of Open Access Journals (Sweden)

    Bin Lu

    Full Text Available The phylogenetic position of turtles within the vertebrate tree of life remains controversial. Conflicting conclusions from different studies are likely a consequence of systematic error in the tree construction process, rather than random error from small amounts of data. Using genomic data, we evaluate the phylogenetic position of turtles with both conventional concatenated data analysis and a "genes as characters" approach. Two datasets were constructed, one with seven species (human, opossum, zebra finch, chicken, green anole, Chinese pond turtle, and western clawed frog and 4584 orthologous genes, and the second with four additional species (soft-shelled turtle, Nile crocodile, royal python, and tuatara but only 1638 genes. Our concatenated data analysis strongly supported turtle as the sister-group to archosaurs (the archosaur hypothesis, similar to several recent genomic data based studies using similar methods. When using genes as characters and gene trees as character-state trees with equal weighting for each gene, however, our parsimony analysis suggested that turtles are possibly sister-group to diapsids, archosaurs, or lepidosaurs. None of these resolutions were strongly supported by bootstraps. Furthermore, our incongruence analysis clearly demonstrated that there is a large amount of inconsistency among genes and most of the conflict relates to the placement of turtles. We conclude that the uncertain placement of turtles is a reflection of the true state of nature. Concatenated data analysis of large and heterogeneous datasets likely suffers from systematic error and over-estimates of confidence as a consequence of a large number of characters. Using genes as characters offers an alternative for phylogenomic analysis. It has potential to reduce systematic error, such as data heterogeneity and long-branch attraction, and it can also avoid problems associated with computation time and model selection. Finally, treating genes as

  8. Using Genes as Characters and a Parsimony Analysis to Explore the Phylogenetic Position of Turtles

    Science.gov (United States)

    Lu, Bin; Yang, Weizhao; Dai, Qiang; Fu, Jinzhong

    2013-01-01

    The phylogenetic position of turtles within the vertebrate tree of life remains controversial. Conflicting conclusions from different studies are likely a consequence of systematic error in the tree construction process, rather than random error from small amounts of data. Using genomic data, we evaluate the phylogenetic position of turtles with both conventional concatenated data analysis and a “genes as characters” approach. Two datasets were constructed, one with seven species (human, opossum, zebra finch, chicken, green anole, Chinese pond turtle, and western clawed frog) and 4584 orthologous genes, and the second with four additional species (soft-shelled turtle, Nile crocodile, royal python, and tuatara) but only 1638 genes. Our concatenated data analysis strongly supported turtle as the sister-group to archosaurs (the archosaur hypothesis), similar to several recent genomic data based studies using similar methods. When using genes as characters and gene trees as character-state trees with equal weighting for each gene, however, our parsimony analysis suggested that turtles are possibly sister-group to diapsids, archosaurs, or lepidosaurs. None of these resolutions were strongly supported by bootstraps. Furthermore, our incongruence analysis clearly demonstrated that there is a large amount of inconsistency among genes and most of the conflict relates to the placement of turtles. We conclude that the uncertain placement of turtles is a reflection of the true state of nature. Concatenated data analysis of large and heterogeneous datasets likely suffers from systematic error and over-estimates of confidence as a consequence of a large number of characters. Using genes as characters offers an alternative for phylogenomic analysis. It has potential to reduce systematic error, such as data heterogeneity and long-branch attraction, and it can also avoid problems associated with computation time and model selection. Finally, treating genes as characters

  9. Molecular Characterization and Functional Analysis of PR-1-Like Proteins Identified from the Wheat Head Blight Fungus Fusarium graminearum.

    Science.gov (United States)

    Lu, Shunwen; Edwards, Michael C

    2018-04-01

    The group 1 pathogenesis-related (PR-1) proteins originally identified from plants and their homologs are also found in other eukaryotic kingdoms. Studies on nonplant PR-1-like (PR-1L) proteins have been pursued widely in humans and animals but rarely in filamentous ascomycetes. Here, we report the characterization of four PR-1L proteins identified from the ascomycete fungus Fusarium graminearum, the primary cause of Fusarium head blight of wheat and barley (designated FgPR-1L). Molecular cloning revealed that the four FgPR-1L proteins are all encoded by small open reading frames (612 to 909 bp) that are often interrupted by introns, in contrast to plant PR-1 genes that lack introns. Sequence analysis indicated that all FgPR-1L proteins contain the PR-1-specific three-dimensional structure, and one of them features a C-terminal transmembrane (TM) domain that has not been reported for any stand-alone PR-1 proteins. Transcriptional analysis revealed that the four FgPR-1L genes are expressed in axenic cultures and in planta with different spatial or temporal expression patterns. Phylogenetic analysis indicated that fungal PR-1L proteins fall into three major groups, one of which harbors FgPR-1L-2-related TM-containing proteins from both phytopathogenic and human-pathogenic ascomycetes. Low-temperature sodium dodecyl sulfate polyacrylamide gel electrophoresis and proteolytic assays indicated that the recombinant FgPR-1L-4 protein exists as a monomer and is resistant to subtilisin of the serine protease family. Functional analysis confirmed that deletion of the FgPR-1L-4 gene from the fungal genome results in significantly reduced virulence on susceptible wheat. This study provides the first example that the F. graminearum-wheat interaction involves a pathogen-derived PR-1L protein that affects fungal virulence on the host.

  10. Molecular identification and phylogenetic study of Demodex caprae.

    Science.gov (United States)

    Zhao, Ya-E; Cheng, Juan; Hu, Li; Ma, Jun-Xian

    2014-10-01

    The DNA barcode has been widely used in species identification and phylogenetic analysis since 2003, but there have been no reports in Demodex. In this study, to obtain an appropriate DNA barcode for Demodex, molecular identification of Demodex caprae based on mitochondrial cox1 was conducted. Firstly, individual adults and eggs of D. caprae were obtained for genomic DNA (gDNA) extraction; Secondly, mitochondrial cox1 fragment was amplified, cloned, and sequenced; Thirdly, cox1 fragments of D. caprae were aligned with those of other Demodex retrieved from GenBank; Finally, the intra- and inter-specific divergences were computed and the phylogenetic trees were reconstructed to analyze phylogenetic relationship in Demodex. Results obtained from seven 429-bp fragments of D. caprae showed that sequence identities were above 99.1% among three adults and four eggs. The intraspecific divergences in D. caprae, Demodex folliculorum, Demodex brevis, and Demodex canis were 0.0-0.9, 0.5-0.9, 0.0-0.2, and 0.0-0.5%, respectively, while the interspecific divergences between D. caprae and D. folliculorum, D. canis, and D. brevis were 20.3-20.9, 21.8-23.0, and 25.0-25.3, respectively. The interspecific divergences were 10 times higher than intraspecific ones, indicating considerable barcoding gap. Furthermore, the phylogenetic trees showed that four Demodex species gathered separately, representing independent species; and Demodex folliculorum gathered with canine Demodex, D. caprae, and D. brevis in sequence. In conclusion, the selected 429-bp mitochondrial cox1 gene is an appropriate DNA barcode for molecular classification, identification, and phylogenetic analysis of Demodex. D. caprae is an independent species and D. folliculorum is closer to D. canis than to D. caprae or D. brevis.

  11. Enumerating all maximal frequent subtrees in collections of phylogenetic trees.

    Science.gov (United States)

    Deepak, Akshay; Fernández-Baca, David

    2014-01-01

    A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees.

  12. PhyLIS: a simple GNU/Linux distribution for phylogenetics and phyloinformatics.

    Science.gov (United States)

    Thomson, Robert C

    2009-07-30

    PhyLIS is a free GNU/Linux distribution that is designed to provide a simple, standardized platform for phylogenetic and phyloinformatic analysis. The operating system incorporates most commonly used phylogenetic software, which has been pre-compiled and pre-configured, allowing for straightforward application of phylogenetic methods and development of phyloinformatic pipelines in a stable Linux environment. The software is distributed as a live CD and can be installed directly or run from the CD without making changes to the computer. PhyLIS is available for free at http://www.eve.ucdavis.edu/rcthomson/phylis/.

  13. SWPhylo - A Novel Tool for Phylogenomic Inferences by Comparison of Oligonucleotide Patterns and Integration of Genome-Based and Gene-Based Phylogenetic Trees.

    Science.gov (United States)

    Yu, Xiaoyu; Reva, Oleg N

    2018-01-01

    Modern phylogenetic studies may benefit from the analysis of complete genome sequences of various microorganisms. Evolutionary inferences based on genome-scale analysis are believed to be more accurate than the gene-based alternative. However, the computational complexity of current phylogenomic procedures, inappropriateness of standard phylogenetic tools to process genome-wide data, and lack of reliable substitution models which correlates with alignment-free phylogenomic approaches deter microbiologists from using these opportunities. For example, the super-matrix and super-tree approaches of phylogenomics use multiple integrated genomic loci or individual gene-based trees to infer an overall consensus tree. However, these approaches potentially multiply errors of gene annotation and sequence alignment not mentioning the computational complexity and laboriousness of the methods. In this article, we demonstrate that the annotation- and alignment-free comparison of genome-wide tetranucleotide frequencies, termed oligonucleotide usage patterns (OUPs), allowed a fast and reliable inference of phylogenetic trees. These were congruent to the corresponding whole genome super-matrix trees in terms of tree topology when compared with other known approaches including 16S ribosomal RNA and GyrA protein sequence comparison, complete genome-based MAUVE, and CVTree methods. A Web-based program to perform the alignment-free OUP-based phylogenomic inferences was implemented at http://swphylo.bi.up.ac.za/. Applicability of the tool was tested on different taxa from subspecies to intergeneric levels. Distinguishing between closely related taxonomic units may be enforced by providing the program with alignments of marker protein sequences, eg, GyrA.

  14. Avian influenza A (H9N2: computational molecular analysis and phylogenetic characterization of viral surface proteins isolated between 1997 and 2009 from the human population

    Directory of Open Access Journals (Sweden)

    Idrees Muhammad

    2010-11-01

    Full Text Available Abstract Background H9N2 avian influenza A viruses have become panzootic in Eurasia over the last decade and have caused several human infections in Asia since 1998. To study their evolution and zoonotic potential, we conducted an in silico analysis of H9N2 viruses that have infected humans between 1997 and 2009 and identified potential novel reassortments. Results A total of 22 hemagglutinin (HA and neuraminidase (NA nucleotide and deduced amino acid sequences were retrieved from the NCBI flu database. It was identified that mature peptide sequences of HA genes isolated from humans in 2009 had glutamine at position 226 (H3 of the receptor binding site, indicating a preference to bind to the human α (2-6 sialic acid receptors, which is different from previously isolated viruses and studies where the presence of leucine at the same position contributes to preference for human receptors and presence of glutamine towards avian receptors. Similarly, strains isolated in 2009 possessed new motif R-S-N-R in spite of typical R-S-S-R at the cleavage site of HA, which isn't reported before for H9N2 cases in humans. Other changes involved loss, addition, and variations in potential glycosylation sites as well as in predicted epitopes. The results of phylogenetic analysis indicated that HA and NA gene segments of H9N2 including those from current and proposed vaccine strains belong to two different Eurasian phylogenetic lineages confirming possible genetic reassortments. Conclusions These findings support the continuous evolution of avian H9N2 viruses towards human as host and are in favor of effective surveillance and better characterization studies to address this issue.

  15. The performance of the Congruence Among Distance Matrices (CADM) test in phylogenetic analysis

    Science.gov (United States)

    2011-01-01

    Background CADM is a statistical test used to estimate the level of Congruence Among Distance Matrices. It has been shown in previous studies to have a correct rate of type I error and good power when applied to dissimilarity matrices and to ultrametric distance matrices. Contrary to most other tests of incongruence used in phylogenetic analysis, the null hypothesis of the CADM test assumes complete incongruence of the phylogenetic trees instead of congruence. In this study, we performed computer simulations to assess the type I error rate and power of the test. It was applied to additive distance matrices representing phylogenies and to genetic distance matrices obtained from nucleotide sequences of different lengths that were simulated on randomly generated trees of varying sizes, and under different evolutionary conditions. Results Our results showed that the test has an accurate type I error rate and good power. As expected, power increased with the number of objects (i.e., taxa), the number of partially or completely congruent matrices and the level of congruence among distance matrices. Conclusions Based on our results, we suggest that CADM is an excellent candidate to test for congruence and, when present, to estimate its level in phylogenomic studies where numerous genes are analysed simultaneously. PMID:21388552

  16. The performance of the Congruence Among Distance Matrices (CADM test in phylogenetic analysis

    Directory of Open Access Journals (Sweden)

    Lapointe François-Joseph

    2011-03-01

    Full Text Available Abstract Background CADM is a statistical test used to estimate the level of Congruence Among Distance Matrices. It has been shown in previous studies to have a correct rate of type I error and good power when applied to dissimilarity matrices and to ultrametric distance matrices. Contrary to most other tests of incongruence used in phylogenetic analysis, the null hypothesis of the CADM test assumes complete incongruence of the phylogenetic trees instead of congruence. In this study, we performed computer simulations to assess the type I error rate and power of the test. It was applied to additive distance matrices representing phylogenies and to genetic distance matrices obtained from nucleotide sequences of different lengths that were simulated on randomly generated trees of varying sizes, and under different evolutionary conditions. Results Our results showed that the test has an accurate type I error rate and good power. As expected, power increased with the number of objects (i.e., taxa, the number of partially or completely congruent matrices and the level of congruence among distance matrices. Conclusions Based on our results, we suggest that CADM is an excellent candidate to test for congruence and, when present, to estimate its level in phylogenomic studies where numerous genes are analysed simultaneously.

  17. On Nakhleh's metric for reduced phylogenetic networks

    OpenAIRE

    Cardona, Gabriel; Llabrés, Mercè; Rosselló, Francesc; Valiente Feruglio, Gabriel Alejandro

    2009-01-01

    We prove that Nakhleh’s metric for reduced phylogenetic networks is also a metric on the classes of tree-child phylogenetic networks, semibinary tree-sibling time consistent phylogenetic networks, and multilabeled phylogenetic trees. We also prove that it separates distinguishable phylogenetic networks. In this way, it becomes the strongest dissimilarity measure for phylogenetic networks available so far. Furthermore, we propose a generalization of that metric that separates arbitrary phyl...

  18. Phylogenetic rooting using minimal ancestor deviation.

    Science.gov (United States)

    Tria, Fernando Domingues Kümmel; Landan, Giddy; Dagan, Tal

    2017-06-19

    Ancestor-descendent relations play a cardinal role in evolutionary theory. Those relations are determined by rooting phylogenetic trees. Existing rooting methods are hampered by evolutionary rate heterogeneity or the unavailability of auxiliary phylogenetic information. Here we present a rooting approach, the minimal ancestor deviation (MAD) method, which accommodates heterotachy by using all pairwise topological and metric information in unrooted trees. We demonstrate the performance of the method, in comparison to existing rooting methods, by the analysis of phylogenies from eukaryotes and prokaryotes. MAD correctly recovers the known root of eukaryotes and uncovers evidence for the origin of cyanobacteria in the ocean. MAD is more robust and consistent than existing methods, provides measures of the root inference quality and is applicable to any tree with branch lengths.

  19. Towards an integrated phylogenetic classification of the Tremellomycetes.

    Science.gov (United States)

    Liu, X-Z; Wang, Q-M; Göker, M; Groenewald, M; Kachalkin, A V; Lumbsch, H T; Millanes, A M; Wedin, M; Yurkov, A M; Boekhout, T; Bai, F-Y

    2015-06-01

    Families and genera assigned to Tremellomycetes have been mainly circumscribed by morphology and for the yeasts also by biochemical and physiological characteristics. This phenotype-based classification is largely in conflict with molecular phylogenetic analyses. Here a phylogenetic classification framework for the Tremellomycetes is proposed based on the results of phylogenetic analyses from a seven-genes dataset covering the majority of tremellomycetous yeasts and closely related filamentous taxa. Circumscriptions of the taxonomic units at the order, family and genus levels recognised were quantitatively assessed using the phylogenetic rank boundary optimisation (PRBO) and modified general mixed Yule coalescent (GMYC) tests. In addition, a comprehensive phylogenetic analysis on an expanded LSU rRNA (D1/D2 domains) gene sequence dataset covering as many as available teleomorphic and filamentous taxa within Tremellomycetes was performed to investigate the relationships between yeasts and filamentous taxa and to examine the stability of undersampled clades. Based on the results inferred from molecular data and morphological and physiochemical features, we propose an updated classification for the Tremellomycetes. We accept five orders, 17 families and 54 genera, including seven new families and 18 new genera. In addition, seven families and 17 genera are emended and one new species name and 185 new combinations are proposed. We propose to use the term pro tempore or pro tem. in abbreviation to indicate the species names that are temporarily maintained.

  20. Detection, molecular typing and phylogenetic analysis of Leishmania isolated from cases of leishmaniasis among Syrian refugees in Lebanon

    Directory of Open Access Journals (Sweden)

    Tamara Salloum

    2016-06-01

    Two molecular typing methods of 39 FFPE Leishmania isolates were used: the ITS1-PCR RFLP and the nested ITS1-5.8S rDNA gene amplification followed by sequencing and phylogenetic analysis. The efficiency of these two techniques in Leishmania identification was compared and the phylogenetic relationships among these isolates were illustrated based on the neighbor-joining (NJ method. The results were statistically correlated with the parasitic index (PI. The DNA storage in formalin-fixed paraffin embedded (FFPE tissues was assessed as well. The parasites identified were all L. tropica as determined by both techniques. ITS1-5.8S rDNA gene based typing proved to be more sensitive in the detection of parasites (positive in 69.2% of the isolates as opposed to the ITS1-PCR RFLP method that was successful in identifying L. tropica in only 43.6% of the isolates. Sequencing and phylogenetic analysis revealed high levels of heterogeneity. A statistically significant correlation was observed between PI and the results of the nested ITS1-5.8S rDNA gene PCR. Genotyping at the species level is essential for monitoring the relative frequency of CL in the Mediterranean area that is correlated to three different Leishmania species (Leishmania infantum, Leishmania major and L. tropica, each characterized by distinct epidemiological features. The obtained results highlight the need to find a universally accepted diagnostic tool for Leishmania typing.

  1. Phylogenetic analysis of HIV-1 pol gene: first subgenomic evidence of CRF29-BF among Iranian HIV-1 patients

    Directory of Open Access Journals (Sweden)

    Kazem Baesi

    2014-09-01

    Full Text Available Objective: To identify the dominant subtype among the HIV-1 strains circulation in Iran. Methods: In this cross sectional study 100 HIV positive patients participated. HIV-1 RNA was extracted from plasma. RT nested-PCR was performed and the final products were sequenced and phylogenetically analyzed; reference sequences were downloaded from Los Alamos, aligned with Iranian pol sequences in the study and analyzed by neighbor-joining method. Results: The results of the phylogenetic analysis showed that HIV-1 subtype CRF-35AD was the dominant subtype among HIV-1 infected patients in Iran; this analysis also suggested a new circulating recombinant form that had not previously been identified in Iran: CRF-29BF. Conclusions: The impact of HIV diversity on pathogenesis, transmission and clinical management have been discussed in different studies; therefore, analyses of HIV genetic diversity is required to design effective antiretroviral strategies for different HIV subtypes.

  2. Phylogenetic Analysis of Bacillus subtilis Strains Applicable to Natto (Fermented Soybean) Production ▿

    Science.gov (United States)

    Kubo, Yuji; Rooney, Alejandro P.; Tsukakoshi, Yoshiki; Nakagawa, Rikio; Hasegawa, Hiromasa; Kimura, Keitarou

    2011-01-01

    Spore-forming Bacillus strains that produce extracellular poly-γ-glutamic acid were screened for their application to natto (fermented soybean food) fermentation. Among the 424 strains, including Bacillus subtilis and B. amyloliquefaciens, which we isolated from rice straw, 59 were capable of fermenting natto. Biotin auxotrophism was tightly linked to natto fermentation. A multilocus nucleotide sequence of six genes (rpoB, purH, gyrA, groEL, polC, and 16S rRNA) was used for phylogenetic analysis, and amplified fragment length polymorphism (AFLP) analysis was also conducted on the natto-fermenting strains. The ability to ferment natto was inferred from the two principal components of the AFLP banding pattern, and natto-fermenting strains formed a tight cluster within the B. subtilis subsp. subtilis group. PMID:21764950

  3. The power and pitfalls of HIV phylogenetics in public health.

    Science.gov (United States)

    Brooks, James I; Sandstrom, Paul A

    2013-07-25

    Phylogenetics is the application of comparative studies of genetic sequences in order to infer evolutionary relationships among organisms. This tool can be used as a form of molecular epidemiology to enhance traditional population-level communicable disease surveillance. Phylogenetic study has resulted in new paradigms being created in the field of communicable diseases and this commentary aims to provide the reader with an explanation of how phylogenetics can be used in tracking infectious diseases. Special emphasis will be placed upon the application of phylogenetics as a tool to help elucidate HIV transmission patterns and the limitations to these methods when applied to forensic analysis. Understanding infectious disease epidemiology in order to prevent new transmissions is the sine qua non of public health. However, with increasing epidemiological resolution, there may be an associated potential loss of privacy to the individual. It is within this context that we aim to promote the discussion on how to use phylogenetics to achieve important public health goals, while at the same time protecting the rights of the individual.

  4. The origin and evolution of Basigin(BSG) gene: A comparative genomic and phylogenetic analysis.

    Science.gov (United States)

    Zhu, Xinyan; Wang, Shenglan; Shao, Mingjie; Yan, Jie; Liu, Fei

    2017-07-01

    Basigin (BSG), also known as extracellular matrix metalloproteinase inducer (EMMPRIN) or cluster of differentiation 147 (CD147), plays various fundamental roles in the intercellular recognition involved in immunologic phenomena, differentiation, and development. In this study, we aimed to compare the similarities and differences of BSG among organisms and explore possible evolutionary relationships based on the comparison result. We used the extensive BLAST tool to search the metazoan genomes, N-glycosylation sites, the transmembrane region and other functional sites. We then identified BSG homologs from genomic sequences and analyzed their phylogenetic relationships. We identified that BSG genes exist not only in the vertebrate metazoans but also in the invertebrate metazoans such as Amphioxus B. floridae, D. melanogaster, A. mellifera, S. japonicum, C. gigas, and T. patagoniensis. After sequence analysis, we confirmed that only vertebrate metazoans and Cephalochordate (amphioxus B. floridae) have the classic structure (a signal peptide, two Ig-like domains (IgC2 and IgI), a transmembrane region, and an intracellular domain). The invertebrate metazoans (excluding amphioxus B. floridae) lack the N-terminal signal peptides and IgC2 domain. We then generated a phylogenetic tree, genome organization comparison, and chromosomal disposition analysis based on the biological information obtained from the NCBI and Ensembl databases. Finally, we established the possible evolutionary scenario of the BSG gene, which showed the restricted exon rearrangement that has occurred during evolution, forming the present-day BSG gene. Copyright © 2017 Elsevier Ltd. All rights reserved.

  5. Integrated Automatic Workflow for Phylogenetic Tree Analysis Using Public Access and Local Web Services.

    Science.gov (United States)

    Damkliang, Kasikrit; Tandayya, Pichaya; Sangket, Unitsa; Pasomsub, Ekawat

    2016-11-28

    At the present, coding sequence (CDS) has been discovered and larger CDS is being revealed frequently. Approaches and related tools have also been developed and upgraded concurrently, especially for phylogenetic tree analysis. This paper proposes an integrated automatic Taverna workflow for the phylogenetic tree inferring analysis using public access web services at European Bioinformatics Institute (EMBL-EBI) and Swiss Institute of Bioinformatics (SIB), and our own deployed local web services. The workflow input is a set of CDS in the Fasta format. The workflow supports 1,000 to 20,000 numbers in bootstrapping replication. The workflow performs the tree inferring such as Parsimony (PARS), Distance Matrix - Neighbor Joining (DIST-NJ), and Maximum Likelihood (ML) algorithms of EMBOSS PHYLIPNEW package based on our proposed Multiple Sequence Alignment (MSA) similarity score. The local web services are implemented and deployed into two types using the Soaplab2 and Apache Axis2 deployment. There are SOAP and Java Web Service (JWS) providing WSDL endpoints to Taverna Workbench, a workflow manager. The workflow has been validated, the performance has been measured, and its results have been verified. Our workflow's execution time is less than ten minutes for inferring a tree with 10,000 replicates of the bootstrapping numbers. This paper proposes a new integrated automatic workflow which will be beneficial to the bioinformaticians with an intermediate level of knowledge and experiences. All local services have been deployed at our portal http://bioservices.sci.psu.ac.th.

  6. Phylogenetic analysis of canine parvovirus isolates from Sichuan and Gansu provinces of China in 2011.

    Science.gov (United States)

    Xu, J; Guo, H-C; Wei, Y-Q; Shu, L; Wang, J; Li, J-S; Cao, S-Z; Sun, S-Q

    2015-02-01

    Canine parvovirus causes serious disease in dogs. Study of the genetic variation in emerging CPV strains is important for disease control strategy. The antigenic property of CPV is connected with specific amino acid changes, mainly in the capsid protein VP2. This study was carried out to characterize VP2 gene of CPV viruses from two provinces of China in 2011. The complete VP2 genes of the CPV-positive samples were amplified and sequenced. Genetic analysis based on the VP2 genes of CPV was conducted. All of the isolates screened and sequenced in this study were typed as CPV-2a except GS-K11 strain, which was typed as CPV-2b. Sequence comparison showed nucleotide identities of 98.8-100% among CPV strains, whereas the Aa similarities were 99.6-100%. Compared with the reference strains, there are three distinctive amino acid changes at VP2 gene residue 267, 324 and 440 of the strains isolated in this study. Of the 27 strains, fourteen (51.85%) had the 267 (Phe-Tyr) and 440 (Thr-Ala) substitution, all the 27 (100%) had 324 (Tyr-Ile) substitution. Phylogenetically, all of the strains isolated in this study formed a major monophyletic cluster together with one South Korean isolate, two Thailand isolates and four Chinese former isolates. © 2013 Blackwell Verlag GmbH.

  7. CRTAC1 homolog proteins are conserved from cyanobacteria to man and secreted by the teleost fish pituitary gland.

    Science.gov (United States)

    Redruello, Begoña; Louro, Bruno; Anjos, Liliana; Silva, Nádia; Greenwell, Roger S; Canario, Adelino V M; Power, Deborah M

    2010-05-15

    Cartilage acidic protein 1 (CRTAC1) gene expression is used as a marker for chondrocyte differentiation in stem cell-based tissue engineering. It is also transcribed outside the skeleton where at least two different transcripts are expressed in lung and brain. In the pituitary gland of the teleost fish sea bream Sparus auratus, we have found a transcript with a high degree of sequence identity to CRTAC1 family members but lacking the EGF-like calcium-binding domain encoding sequence of CRTAC1 and designated it as CRTAC2. Database searches revealed many previously unidentified members of the CRTAC1 and CRTAC2 in phylogenetically distant organisms, such as cyanobacteria, bryophyta, lancelets, and diverse representatives of vertebrates. Phylogenetic analyses showed that the genes encoding CRTAC1 and CRTAC2 proteins coexist in teleost fish genomes. Structural prediction analysis identified the N-terminal region of the CRTAC1/CRTAC2 family members as a potential seven-bladed beta-propeller structure, closely related to those of integrin alpha chains and glycosylphosphatidylinositol-specific phospholipase D1 protein families. This relationship is confirmed by phylogenetic analysis with the N-terminal domain of sea bream CRTAC2 as the most divergent sequence. Because teleost fishes are the only phylogenetic group where both CRTAC1 and CRTAC2 genes are present, they occupy a pivotal position in studies of the mechanisms governing the specific expression patterns of each gene/protein subfamily. This will be essential to elucidate their respective biological roles. Copyright 2010 Elsevier B.V. All rights reserved.

  8. Phylogenetic Analysis of Petunia sensu Jussieu (Solanaceae) using Chloroplast DNA RFLP

    OpenAIRE

    ANDO, TOSHIO; KOKUBUN, HISASHI; WATANABE, HITOSHI; TANAKA, NORIO; YUKAWA, TOMOHISA; HASHIMOTO, GORO; MARCHESI, EDUARDO; SUÁREZ, ENRIQUE; BASUALDO, ISABEL L.

    2005-01-01

    • Background and Aims The phylogenetic relationships of Petunia sensu Jussieu (Petunia sensu Wijsman plus Calibrachoa) are unclear. This study aimed to resolve this uncertainty using molecular evidence.

  9. Comparison of sequence-based and structure-based phylogenetic ...

    Indian Academy of Sciences (India)

    Prakash

    phylogenetic tree construction methods, has been considered as an equivalent of .... Further detailed analysis described is restricted to the first two groups only. ..... Aspartate-ammonia ligase. Plant virus ..... enzymatic activities?; Trends ...

  10. Development of a multiplex real-time PCR assay for phylogenetic analysis of Uropathogenic Escherichia coli.

    Science.gov (United States)

    Hasanpour, Mojtaba; Najafi, Akram

    2017-06-01

    Uropathogenic Escherichia coli (UPEC) is among major pathogens causing 80-90% of all episodes of urinary tract infections (UTIs). Recently, E. coli strains are divided into eight main phylogenetic groups including A, B1, B2, C, D, E, F, and clade I. This study was aimed to develop a rapid, sensitive, and specific multiplex real time PCR method capable of detecting phylogenetic groups of E. coli strains. This study was carried out on E. coli strains (isolated from the patient with UTI) in which the presence of all seven target genes had been confirmed in our previous phylogenetic study. An EvaGreen-based singleplex and multiplex real-time PCR with melting curve analysis was designed for simultaneous detection and differentiation of these genes. The primers were selected mainly based on the production of amplicons with melting temperatures (T m ) ranging from 82°C to 93°C and temperature difference of more than 1.5°C between each peak.The multiplex real-time PCR assays that have been developed in the present study were successful in detecting the eight main phylogenetic groups. Seven distinct melting peaks were discriminated, with Tm value of 93±0.8 for arpA, 89.2±0.1for chuA, 86.5±0.1 for yjaA, 82.3±0.2 for TspE4C2, 87.8±0.1for trpAgpC, 85.4±0.6 for arpAgpE genes, and 91±0.5 for the internal control. To our knowledge, this study is the first melting curve-based real-time PCR assay developed for simultaneous and discrete detection of these seven target genes. Our findings showed that this assay has the potential to be a rapid, reliable and cost-effective alternative for routine phylotyping of E. coli strains. Copyright © 2017 Elsevier B.V. All rights reserved.

  11. Transforming phylogenetic networks: Moving beyond tree space.

    Science.gov (United States)

    Huber, Katharina T; Moulton, Vincent; Wu, Taoyang

    2016-09-07

    Phylogenetic networks are a generalization of phylogenetic trees that are used to represent reticulate evolution. Unrooted phylogenetic networks form a special class of such networks, which naturally generalize unrooted phylogenetic trees. In this paper we define two operations on unrooted phylogenetic networks, one of which is a generalization of the well-known nearest-neighbor interchange (NNI) operation on phylogenetic trees. We show that any unrooted phylogenetic network can be transformed into any other such network using only these operations. This generalizes the well-known fact that any phylogenetic tree can be transformed into any other such tree using only NNI operations. It also allows us to define a generalization of tree space and to define some new metrics on unrooted phylogenetic networks. To prove our main results, we employ some fascinating new connections between phylogenetic networks and cubic graphs that we have recently discovered. Our results should be useful in developing new strategies to search for optimal phylogenetic networks, a topic that has recently generated some interest in the literature, as well as for providing new ways to compare networks. Copyright © 2016 Elsevier Ltd. All rights reserved.

  12. Social Mating System and Sex-Biased Dispersal in Mammals and Birds: A Phylogenetic Analysis

    Science.gov (United States)

    Mabry, Karen E.; Shelley, Erin L.; Davis, Katie E.; Blumstein, Daniel T.; Van Vuren, Dirk H.

    2013-01-01

    The hypothesis that patterns of sex-biased dispersal are related to social mating system in mammals and birds has gained widespread acceptance over the past 30 years. However, two major complications have obscured the relationship between these two behaviors: 1) dispersal frequency and dispersal distance, which measure different aspects of the dispersal process, have often been confounded, and 2) the relationship between mating system and sex-biased dispersal in these vertebrate groups has not been examined using modern phylogenetic comparative methods. Here, we present a phylogenetic analysis of the relationship between mating system and sex-biased dispersal in mammals and birds. Results indicate that the evolution of female-biased dispersal in mammals may be more likely on monogamous branches of the phylogeny, and that females may disperse farther than males in socially monogamous mammalian species. However, we found no support for a relationship between social mating system and sex-biased dispersal in birds when the effects of phylogeny are taken into consideration. We caution that although there are larger-scale behavioral differences in mating system and sex-biased dispersal between mammals and birds, mating system and sex-biased dispersal are far from perfectly associated within these taxa. PMID:23483957

  13. Revised phylogenetic analysis of the Aetosauria (Archosauria: Pseudosuchia; assessing the effects of incongruent morphological character sets

    Directory of Open Access Journals (Sweden)

    William G. Parker

    2016-01-01

    Full Text Available Aetosauria is an early-diverging clade of pseudosuchians (crocodile-line archosaurs that had a global distribution and high species diversity as a key component of various Late Triassic terrestrial faunas. It is one of only two Late Triassic clades of large herbivorous archosaurs, and thus served a critical ecological role. Nonetheless, aetosaur phylogenetic relationships are still poorly understood, owing to an overreliance on osteoderm characters, which are often poorly constructed and suspected to be highly homoplastic. A new phylogenetic analysis of the Aetosauria, comprising 27 taxa and 83 characters, includes more than 40 new characters that focus on better sampling the cranial and endoskeletal regions, and represents the most comprenhensive phylogeny of the clade to date. Parsimony analysis recovered three most parsimonious trees; the strict consensus of these trees finds an Aetosauria that is divided into two main clades: Desmatosuchia, which includes the Desmatosuchinae and the Stagonolepidinae, and Aetosaurinae, which includes the Typothoracinae. As defined Desmatosuchinae now contains Neoaetosauroides engaeus and several taxa that were previously referred to the genus Stagonolepis, and a new clade, Desmatosuchini, is erected for taxa more closely related to Desmatosuchus. Overall support for some clades is still weak, and Partitioned Bremer Support (PBS is applied for the first time to a strictly morphological dataset demonstrating that this weak support is in part because of conflict in the phylogenetic signals of cranial versus postcranial characters. PBS helps identify homoplasy among characters from various body regions, presumably the result of convergent evolution within discrete anatomical modules. It is likely that at least some of this character conflict results from different body regions evolving at different rates, which may have been under different selective pressures.

  14. Pan-Domain Analysis of ZIP Zinc Transporters

    Directory of Open Access Journals (Sweden)

    Laura E. Lehtovirta-Morley

    2017-12-01

    Full Text Available The ZIP (Zrt/Irt-like protein family of zinc transporters is found in all three domains of life. However, little is known about the phylogenetic relationship amongst ZIP transporters, their distribution, or their origin. Here we employed phylogenetic analysis to explore the evolution of ZIP transporters, with a focus on the major human fungal pathogen, Candida albicans. Pan-domain analysis of bacterial, archaeal, fungal, and human proteins revealed a complex relationship amongst the ZIP family members. Here we report (i a eukaryote-wide group of cellular zinc importers, (ii a fungal-specific group of zinc importers having genetic association with the fungal zincophore, and, (iii a pan-kingdom supercluster made up of two distinct subgroups with orthologues in bacterial, archaeal, and eukaryotic phyla.

  15. Component identification of electron transport chains in curdlan-producing Agrobacterium sp. ATCC 31749 and its genome-specific prediction using comparative genome and phylogenetic trees analysis.

    Science.gov (United States)

    Zhang, Hongtao; Setubal, Joao Carlos; Zhan, Xiaobei; Zheng, Zhiyong; Yu, Lijun; Wu, Jianrong; Chen, Dingqiang

    2011-06-01

    Agrobacterium sp. ATCC 31749 (formerly named Alcaligenes faecalis var. myxogenes) is a non-pathogenic aerobic soil bacterium used in large scale biotechnological production of curdlan. However, little is known about its genomic information. DNA partial sequence of electron transport chains (ETCs) protein genes were obtained in order to understand the components of ETC and genomic-specificity in Agrobacterium sp. ATCC 31749. Degenerate primers were designed according to ETC conserved sequences in other reported species. DNA partial sequences of ETC genes in Agrobacterium sp. ATCC 31749 were cloned by the PCR method using degenerate primers. Based on comparative genomic analysis, nine electron transport elements were ascertained, including NADH ubiquinone oxidoreductase, succinate dehydrogenase complex II, complex III, cytochrome c, ubiquinone biosynthesis protein ubiB, cytochrome d terminal oxidase, cytochrome bo terminal oxidase, cytochrome cbb (3)-type terminal oxidase and cytochrome caa (3)-type terminal oxidase. Similarity and phylogenetic analyses of these genes revealed that among fully sequenced Agrobacterium species, Agrobacterium sp. ATCC 31749 is closest to Agrobacterium tumefaciens C58. Based on these results a comprehensive ETC model for Agrobacterium sp. ATCC 31749 is proposed.

  16. Functional and phylogenetic ecology in R

    CERN Document Server

    Swenson, Nathan G

    2014-01-01

    Functional and Phylogenetic Ecology in R is designed to teach readers to use R for phylogenetic and functional trait analyses. Over the past decade, a dizzying array of tools and methods were generated to incorporate phylogenetic and functional information into traditional ecological analyses. Increasingly these tools are implemented in R, thus greatly expanding their impact. Researchers getting started in R can use this volume as a step-by-step entryway into phylogenetic and functional analyses for ecology in R. More advanced users will be able to use this volume as a quick reference to understand particular analyses. The volume begins with an introduction to the R environment and handling relevant data in R. Chapters then cover phylogenetic and functional metrics of biodiversity; null modeling and randomizations for phylogenetic and functional trait analyses; integrating phylogenetic and functional trait information; and interfacing the R environment with a popular C-based program. This book presents a uni...

  17. An attempt to reconstruct phylogenetic relationships within Caribbean nummulitids: simulating relationships and tracing character evolution

    Science.gov (United States)

    Eder, Wolfgang; Ives Torres-Silva, Ana; Hohenegger, Johann

    2017-04-01

    Phylogenetic analysis and trees based on molecular data are broadly applied and used to infer genetical and biogeographic relationship in recent larger foraminifera. Molecular phylogenetic is intensively used within recent nummulitids, however for fossil representatives these trees are only of minor informational value. Hence, within paleontological studies a phylogenetic approach through morphometric analysis is of much higher value. To tackle phylogenetic relationships within the nummulitid family, a much higher number of morphological character must be measured than are commonly used in biometric studies, where mostly parameters describing embryonic size (e.g., proloculus diameter, deuteroloculus diameter) and/or the marginal spiral (e.g., spiral diagrams, spiral indices) are studied. For this purpose 11 growth-independent and/or growth-invariant characters have been used to describe the morphological variability of equatorial thin sections of seven Carribbean nummulitid taxa (Nummulites striatoreticulatus, N. macgillavry, Palaeonummulites willcoxi, P.floridensis, P. soldadensis, P.trinitatensis and P.ocalanus) and one outgroup taxon (Ranikothalia bermudezi). Using these characters, phylogenetic trees were calculated using a restricted maximum likelihood algorithm (REML), and results are cross-checked by ordination and cluster analysis. Square-change parsimony method has been run to reconstruct ancestral states, as well as to simulate the evolution of the chosen characters along the calculated phylogenetic tree and, independent - contrast analysis was used to estimate confidence intervals. Based on these simulations, phylogenetic tendencies of certain characters proposed for nummulitids (e.g., Cope's rule or nepionic acceleration) can be tested, whether these tendencies are valid for the whole family or only for certain clades. At least, within the Carribean nummulitids, phylogenetic trends along some growth-independent characters of the embryo (e.g., first

  18. Phylogenetic classification of the halichondrids (Porifera, Demospongiae)

    NARCIS (Netherlands)

    Soest, van R.W.M.; Díaz, Maria Cristina; Pomponi, Shirley A.

    1990-01-01

    Using a multicharacter approach and numerical cladistic computer programs a phylogenetic analysis is made of a newly defined order Halichondrida (which includes all Halichondrida and parts of the Axinellida sensu Lévi, 1973), with emphasis on the newly defined family Halichondriidae (which includes

  19. The covert world of fish biofluorescence: a phylogenetically widespread and phenotypically variable phenomenon.

    Directory of Open Access Journals (Sweden)

    John S Sparks

    Full Text Available The discovery of fluorescent proteins has revolutionized experimental biology. Whereas the majority of fluorescent proteins have been identified from cnidarians, recently several fluorescent proteins have been isolated across the animal tree of life. Here we show that biofluorescence is not only phylogenetically widespread, but is also phenotypically variable across both cartilaginous and bony fishes, highlighting its evolutionary history and the possibility for discovery of numerous novel fluorescent proteins. Fish biofluorescence is especially common and morphologically variable in cryptically patterned coral-reef lineages. We identified 16 orders, 50 families, 105 genera, and more than 180 species of biofluorescent fishes. We have also reconstructed our current understanding of the phylogenetic distribution of biofluorescence for ray-finned fishes. The presence of yellow long-pass intraocular filters in many biofluorescent fish lineages and the substantive color vision capabilities of coral-reef fishes suggest that they are capable of detecting fluoresced light. We present species-specific emission patterns among closely related species, indicating that biofluorescence potentially functions in intraspecific communication and evidence that fluorescence can be used for camouflage. This research provides insight into the distribution, evolution, and phenotypic variability of biofluorescence in marine lineages and examines the role this variation may play.

  20. The phylogenetic position of the roughskin skate Dipturus trachyderma (Krefft & Stehmann, 1975) (Rajiformes, Rajidae) inferred from the mitochondrial genome.

    Science.gov (United States)

    Vargas-Caro, Carolina; Bustamante, Carlos; Lamilla, Julio; Bennett, Michael B; Ovenden, Jennifer R

    2016-07-01

    The complete mitochondrial genome of the roughskin skate Dipturus trachyderma is described from 1 455 724 sequences obtained using Illumina NGS technology. Total length of the mitogenome was 16 909 base pairs, comprising 2 rRNAs, 13 protein-coding genes, 22 tRNAs and 2 non-coding regions. Phylogenetic analysis based on mtDNA revealed low genetic divergence among longnose skates, in particular, those dwelling the continental shelf and slope off the coasts of Chile and Argentina.

  1. SWPhylo – A Novel Tool for Phylogenomic Inferences by Comparison of Oligonucleotide Patterns and Integration of Genome-Based and Gene-Based Phylogenetic Trees

    Science.gov (United States)

    Yu, Xiaoyu; Reva, Oleg N

    2018-01-01

    Modern phylogenetic studies may benefit from the analysis of complete genome sequences of various microorganisms. Evolutionary inferences based on genome-scale analysis are believed to be more accurate than the gene-based alternative. However, the computational complexity of current phylogenomic procedures, inappropriateness of standard phylogenetic tools to process genome-wide data, and lack of reliable substitution models which correlates with alignment-free phylogenomic approaches deter microbiologists from using these opportunities. For example, the super-matrix and super-tree approaches of phylogenomics use multiple integrated genomic loci or individual gene-based trees to infer an overall consensus tree. However, these approaches potentially multiply errors of gene annotation and sequence alignment not mentioning the computational complexity and laboriousness of the methods. In this article, we demonstrate that the annotation- and alignment-free comparison of genome-wide tetranucleotide frequencies, termed oligonucleotide usage patterns (OUPs), allowed a fast and reliable inference of phylogenetic trees. These were congruent to the corresponding whole genome super-matrix trees in terms of tree topology when compared with other known approaches including 16S ribosomal RNA and GyrA protein sequence comparison, complete genome-based MAUVE, and CVTree methods. A Web-based program to perform the alignment-free OUP-based phylogenomic inferences was implemented at http://swphylo.bi.up.ac.za/. Applicability of the tool was tested on different taxa from subspecies to intergeneric levels. Distinguishing between closely related taxonomic units may be enforced by providing the program with alignments of marker protein sequences, eg, GyrA. PMID:29511354

  2. Toward a comprehensive phylogenetic reconstruction of the evolutionary history of mitogen-activated protein kinases in the plant kingdom.

    Science.gov (United States)

    Janitza, Philipp; Ullrich, Kristian Karsten; Quint, Marcel

    2012-01-01

    The mitogen-activated protein kinase (MAPK) pathway is a three-tier signaling cascade that transmits cellular information from the plasma membrane to the cytoplasm where it triggers downstream responses. The MAPKs represent the last step in this cascade and are activated when both tyrosine and threonine residues in a conserved TxY motif are phosphorylated by MAPK kinases, which in turn are themselves activated by phosphorylation by MAPK kinase kinases. To understand the molecular evolution of MAPKs in the plant kingdom, we systematically conducted a Hidden-Markov-Model based screen to identify MAPKs in 13 completely sequenced plant genomes. In this analysis, we included green algae, bryophytes, lycophytes, and several mono- and eudicotyledonous species covering >800 million years of evolution. The phylogenetic relationships of the 204 identified MAPKs based on Bayesian inference facilitated the retraction of the sequence of emergence of the four major clades that are characterized by the presence of a TDY or TEY-A/TEY-B/TEY-C type kinase activation loop. We present evidence that after the split of TDY- and TEY-type MAPKs, initially the TEY-C clade emerged. This was followed by the TEY-B clade in early land plants until the TEY-A clade finally emerged in flowering plants. In addition to these well characterized clades, we identified another highly conserved clade of 45 MAPK-likes, members of which were previously described as Mak-homologous kinases. In agreement with their essential functions, molecular population genetic analysis of MAPK genes in Arabidopsis thaliana accessions reveal that purifying selection drove the evolution of the MAPK family, implying strong functional constraints on MAPK genes. Closely related MAPKs most likely subfunctionalized, a process in which differential transcriptional regulation of duplicates may be involved.

  3. Towards a comprehensive phylogenetic reconstruction of the evolutionary history of mitogen-activated protein kinases in the plant kingdom

    Directory of Open Access Journals (Sweden)

    Philipp eJanitza

    2012-12-01

    Full Text Available The mitogen-activated protein kinase (MAPK pathway is a three-tier signaling cascade that transmits cellular information from the plasma membrane to the cytoplasm where it triggers downstream responses. The MAPKs represent the last step in this cascade and are activated when both tyrosine and threonine residues in a conserved TxY motif are phosphorylated by MAPK kinases, which in turn are themselves activated by phosphorylation by MAPK kinase kinases. To understand the molecular evolution of MAPKs in the plant kingdom, we systematically conducted a Hidden-Markov-Model based screen to identify MAPKs in 13 completely sequenced plant genomes. In this analysis, we included green algae, bryophytes, lycophytes, and several mono- and dicotyledonous species covering >800 million years of evolution. The phylogenetic relationships of the 204 identified MAPKs based on Bayesian inference facilitated the retraction of the sequence of emergence of the four major clades that are characterized by the presence of a TDY or TEY-A/TEY-B/TEY-C type kinase activation loop. We present evidence that after the split of TDY- and TEY-type MAPKs, initially the TEY-C clade emerged. This was followed by the TEY-B clade in early land plants until the TEY-A clade finally emerged in flowering plants. In addition to these well characterized clades, we identified another highly conserved clade of 45 MAPK-likes, members of which were previously described as MHKs. In agreement with their essential functions, molecular population genetic analysis of MAPK genes in Arabidopsis thaliana accessions reveal that purifying selection drove the evolution of the MAPK family, implying strong functional constraints on MAPK genes. Closely related MAPKs most likely subfunctionalized, a process in which differential transcriptional regulation of duplicates may be involved.

  4. [Identification and phylogenetic analysis of one strain of Lactobacillus delbrueckii subsp. bulgaricus separated from yoghourt].

    Science.gov (United States)

    Wang, Chuan; Zhang, Chaowu; Pei, Xiaofang; Liu, Hengchuan

    2007-11-01

    For being further applied and studied, one strain of Lactobacillus delbrueckii subsp. bulgaricus (wch9901) separated from yoghourt which had been identified by phenotype characteristic analysis was identified by 16S rDNA and phylogenetic analyzed. The 16S rDNA of wch9901 was amplified with the genomic DNA of wch9901 as template, and the conservative sequences of the 16S rDNA as primers. Inserted 16S rDNA amplified into clonal vector pGEM-T under the function of T4 DNA ligase to construct recombined plasmid pGEM-wch9901 16S rDNA. The recombined plasmid was identified by restriction enzyme digestion, and the eligible plasmid was presented to sequencing company for DNA sequencing. Nucleic acid sequence was blast in GenBank and phylogenetic tree was constructed using neighbor-joining method of distance methods by Mega3.1 soft. Results of blastn showed that the homology of 16S rDNA of wch9901 with the 16S rDNA of Lactobacillus delbrueckii subsp. bulgaricus strains was higher than 96%. On the phylogenetic tree, wch9901 formed a separate branch and located between Lactobacillus delbrueckii subsp. bulgaricus LGM2 evolution branch and another evolution branch which was composed of Lactobacillus delbrueckii subsp. bulgaricus DL2 evolution cluster and Lactobacillus delbrueckii subsp. bulgaricus JSQ evolution cluster. The distance between wch9901 evolution branch and Lactobacillus delbrueckii subsp. bulgaricus LGM2 evolution branch was the closest. wch9901 belonged to Lactobacillus delbrueckii subsp. bulgaricus. wch9901 showed the closest evolution relationship to Lactobacillus delbrueckii subsp. bulgaricus LGM2.

  5. Enumerating all maximal frequent subtrees in collections of phylogenetic trees

    Science.gov (United States)

    2014-01-01

    Background A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. Results We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Conclusions Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees. PMID:25061474

  6. PhyLIS: A Simple GNU/Linux Distribution for Phylogenetics and Phyloinformatics

    Directory of Open Access Journals (Sweden)

    Robert C. Thomson

    2009-01-01

    Full Text Available PhyLIS is a free GNU/Linux distribution that is designed to provide a simple, standardized platform for phylogenetic and phyloinformatic analysis. The operating system incorporates most commonly used phylogenetic software, which has been pre-compiled and pre-configured, allowing for straightforward application of phylogenetic methods and development of phyloinformatic pipelines in a stable Linux environment. The software is distributed as a live CD and can be installed directly or run from the CD without making changes to the computer. PhyLIS is available for free at http://www.eve.ucdavis.edu/rcthomson/phylis/.

  7. Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling.

    Directory of Open Access Journals (Sweden)

    Junha Shin

    Full Text Available Phylogenetic profiling, a network inference method based on gene inheritance profiles, has been widely used to construct functional gene networks in microbes. However, its utility for network inference in higher eukaryotes has been limited. An improved algorithm with an in-depth understanding of pathway evolution may overcome this limitation. In this study, we investigated the effects of taxonomic structures on co-inheritance analysis using 2,144 reference species in four query species: Escherichia coli, Saccharomyces cerevisiae, Arabidopsis thaliana, and Homo sapiens. We observed three clusters of reference species based on a principal component analysis of the phylogenetic profiles, which correspond to the three domains of life-Archaea, Bacteria, and Eukaryota-suggesting that pathways inherit primarily within specific domains or lower-ranked taxonomic groups during speciation. Hence, the co-inheritance pattern within a taxonomic group may be eroded by confounding inheritance patterns from irrelevant taxonomic groups. We demonstrated that co-inheritance analysis within domains substantially improved network inference not only in microbe species but also in the higher eukaryotes, including humans. Although we observed two sub-domain clusters of reference species within Eukaryota, co-inheritance analysis within these sub-domain taxonomic groups only marginally improved network inference. Therefore, we conclude that co-inheritance analysis within domains is the optimal approach to network inference with the given reference species. The construction of a series of human gene networks with increasing sample sizes of the reference species for each domain revealed that the size of the high-accuracy networks increased as additional reference species genomes were included, suggesting that within-domain co-inheritance analysis will continue to expand human gene networks as genomes of additional species are sequenced. Taken together, we propose that co

  8. The transposition distance for phylogenetic trees

    OpenAIRE

    Rossello, Francesc; Valiente, Gabriel

    2006-01-01

    The search for similarity and dissimilarity measures on phylogenetic trees has been motivated by the computation of consensus trees, the search by similarity in phylogenetic databases, and the assessment of clustering results in bioinformatics. The transposition distance for fully resolved phylogenetic trees is a recent addition to the extensive collection of available metrics for comparing phylogenetic trees. In this paper, we generalize the transposition distance from fully resolved to arbi...

  9. Phylogenetic analysis of molecular and morphological data highlights uncertainty in the relationships of fossil and living species of Elopomorpha (Actinopterygii: Teleostei).

    Science.gov (United States)

    Dornburg, Alex; Friedman, Matt; Near, Thomas J

    2015-08-01

    Elopomorpha is one of the three main clades of living teleost fishes and includes a range of disparate lineages including eels, tarpons, bonefishes, and halosaurs. Elopomorphs were among the first groups of fishes investigated using Hennigian phylogenetic methods and continue to be the object of intense phylogenetic scrutiny due to their economic significance, diversity, and crucial evolutionary status as the sister group of all other teleosts. While portions of the phylogenetic backbone for Elopomorpha are consistent between studies, the relationships among Albula, Pterothrissus, Notacanthiformes, and Anguilliformes remain contentious and difficult to evaluate. This lack of phylogenetic resolution is problematic as fossil lineages are often described and placed taxonomically based on an assumed sister group relationship between Albula and Pterothrissus. In addition, phylogenetic studies using morphological data that sample elopomorph fossil lineages often do not include notacanthiform or anguilliform lineages, potentially introducing a bias toward interpreting fossils as members of the common stem of Pterothrissus and Albula. Here we provide a phylogenetic analysis of DNA sequences sampled from multiple nuclear genes that include representative taxa from Albula, Pterothrissus, Notacanthiformes and Anguilliformes. We integrate our molecular dataset with a morphological character matrix that spans both living and fossil elopomorph lineages. Our results reveal substantial uncertainty in the placement of Pterothrissus as well as all sampled fossil lineages, questioning the stability of the taxonomy of fossil Elopomorpha. However, despite topological uncertainty, our integration of fossil lineages into a Bayesian time calibrated framework provides divergence time estimates for the clade that are consistent with previously published age estimates based on the elopomorph fossil record and molecular estimates resulting from traditional node-dating methods. Copyright

  10. Protein space: a natural method for realizing the nature of protein universe.

    Science.gov (United States)

    Yu, Chenglong; Deng, Mo; Cheng, Shiu-Yuen; Yau, Shek-Chung; He, Rong L; Yau, Stephen S-T

    2013-02-07

    Current methods cannot tell us what the nature of the protein universe is concretely. They are based on different models of amino acid substitution and multiple sequence alignment which is an NP-hard problem and requires manual intervention. Protein structural analysis also gives a direction for mapping the protein universe. Unfortunately, now only a minuscule fraction of proteins' 3-dimensional structures are known. Furthermore, the phylogenetic tree representations are not unique for any existing tree construction methods. Here we develop a novel method to realize the nature of protein universe. We show the protein universe can be realized as a protein space in 60-dimensional Euclidean space using a distance based on a normalized distribution of amino acids. Every protein is in one-to-one correspondence with a point in protein space, where proteins with similar properties stay close together. Thus the distance between two points in protein space represents the biological distance of the corresponding two proteins. We also propose a natural graphical representation for inferring phylogenies. The representation is natural and unique based on the biological distances of proteins in protein space. This will solve the fundamental question of how proteins are distributed in the protein universe. Copyright © 2012 Elsevier Ltd. All rights reserved.

  11. Ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses.

    Science.gov (United States)

    Fouquier, Jennifer; Rideout, Jai Ram; Bolyen, Evan; Chase, John; Shiffer, Arron; McDonald, Daniel; Knight, Rob; Caporaso, J Gregory; Kelley, Scott T

    2016-02-24

    Fungi play critical roles in many ecosystems, cause serious diseases in plants and animals, and pose significant threats to human health and structural integrity problems in built environments. While most fungal diversity remains unknown, the development of PCR primers for the internal transcribed spacer (ITS) combined with next-generation sequencing has substantially improved our ability to profile fungal microbial diversity. Although the high sequence variability in the ITS region facilitates more accurate species identification, it also makes multiple sequence alignment and phylogenetic analysis unreliable across evolutionarily distant fungi because the sequences are hard to align accurately. To address this issue, we created ghost-tree, a bioinformatics tool that integrates sequence data from two genetic markers into a single phylogenetic tree that can be used for diversity analyses. Our approach starts with a "foundation" phylogeny based on one genetic marker whose sequences can be aligned across organisms spanning divergent taxonomic groups (e.g., fungal families). Then, "extension" phylogenies are built for more closely related organisms (e.g., fungal species or strains) using a second more rapidly evolving genetic marker. These smaller phylogenies are then grafted onto the foundation tree by mapping taxonomic names such that each corresponding foundation-tree tip would branch into its new "extension tree" child. We applied ghost-tree to graft fungal extension phylogenies derived from ITS sequences onto a foundation phylogeny derived from fungal 18S sequences. Our analysis of simulated and real fungal ITS data sets found that phylogenetic distances between fungal communities computed using ghost-tree phylogenies explained significantly more variance than non-phylogenetic distances. The phylogenetic metrics also improved our ability to distinguish small differences (effect sizes) between microbial communities, though results were similar to non-phylogenetic

  12. Molecular cytogenetic (FISH and genome analysis of diploid wheatgrasses and their phylogenetic relationship.

    Directory of Open Access Journals (Sweden)

    Gabriella Linc

    Full Text Available This paper reports detailed FISH-based karyotypes for three diploid wheatgrass species Agropyron cristatum (L. Beauv., Thinopyrum bessarabicum (Savul.&Rayss A. Löve, Pseudoroegneria spicata (Pursh A. Löve, the supposed ancestors of hexaploid Thinopyrum intermedium (Host Barkworth & D.R.Dewey, compiled using DNA repeats and comparative genome analysis based on COS markers. Fluorescence in situ hybridization (FISH with repetitive DNA probes proved suitable for the identification of individual chromosomes in the diploid JJ, StSt and PP genomes. Of the seven microsatellite markers tested only the (GAAn trinucleotide sequence was appropriate for use as a single chromosome marker for the P. spicata AS chromosome. Based on COS marker analysis, the phylogenetic relationship between diploid wheatgrasses and the hexaploid bread wheat genomes was established. These findings confirmed that the J and E genomes are in neighbouring clusters.

  13. Spatial phylogenetics of the vascular flora of Chile.

    Science.gov (United States)

    Scherson, Rosa A; Thornhill, Andrew H; Urbina-Casanova, Rafael; Freyman, William A; Pliscoff, Patricio A; Mishler, Brent D

    2017-07-01

    Current geographic patterns of biodiversity are a consequence of the evolutionary history of the lineages that comprise them. This study was aimed at exploring how evolutionary features of the vascular flora of Chile are distributed across the landscape. Using a phylogeny at the genus level for 87% of the Chilean vascular flora, and a geographic database of sample localities, we calculated phylogenetic diversity (PD), phylogenetic endemism (PE), relative PD (RPD), and relative PE (RPE). Categorical Analyses of Neo- and Paleo-Endemism (CANAPE) were also performed, using a spatial randomization to assess statistical significance. A cluster analysis using range-weighted phylogenetic turnover was used to compare among grid cells, and with known Chilean bioclimates. PD patterns were concordant with known centers of high taxon richness and the Chilean biodiversity hotspot. In addition, several other interesting areas of concentration of evolutionary history were revealed as potential conservation targets. The south of the country shows areas of significantly high RPD and a concentration of paleo-endemism, and the north shows areas of significantly low PD and RPD, and a concentration of neo-endemism. Range-weighted phylogenetic turnover shows high congruence with the main macrobioclimates of Chile. Even though the study was done at the genus level, the outcome provides an accurate outline of phylogenetic patterns that can be filled in as more fine-scaled information becomes available. Copyright © 2017 Elsevier Inc. All rights reserved.

  14. Hot-spot analysis for drug discovery targeting protein-protein interactions.

    Science.gov (United States)

    Rosell, Mireia; Fernández-Recio, Juan

    2018-04-01

    Protein-protein interactions are important for biological processes and pathological situations, and are attractive targets for drug discovery. However, rational drug design targeting protein-protein interactions is still highly challenging. Hot-spot residues are seen as the best option to target such interactions, but their identification requires detailed structural and energetic characterization, which is only available for a tiny fraction of protein interactions. Areas covered: In this review, the authors cover a variety of computational methods that have been reported for the energetic analysis of protein-protein interfaces in search of hot-spots, and the structural modeling of protein-protein complexes by docking. This can help to rationalize the discovery of small-molecule inhibitors of protein-protein interfaces of therapeutic interest. Computational analysis and docking can help to locate the interface, molecular dynamics can be used to find suitable cavities, and hot-spot predictions can focus the search for inhibitors of protein-protein interactions. Expert opinion: A major difficulty for applying rational drug design methods to protein-protein interactions is that in the majority of cases the complex structure is not available. Fortunately, computational docking can complement experimental data. An interesting aspect to explore in the future is the integration of these strategies for targeting PPIs with large-scale mutational analysis.

  15. Phylogenetic Trees From Sequences

    Science.gov (United States)

    Ryvkin, Paul; Wang, Li-San

    In this chapter, we review important concepts and approaches for phylogeny reconstruction from sequence data.We first cover some basic definitions and properties of phylogenetics, and briefly explain how scientists model sequence evolution and measure sequence divergence. We then discuss three major approaches for phylogenetic reconstruction: distance-based phylogenetic reconstruction, maximum parsimony, and maximum likelihood. In the third part of the chapter, we review how multiple phylogenies are compared by consensus methods and how to assess confidence using bootstrapping. At the end of the chapter are two sections that list popular software packages and additional reading.

  16. The phylogenetic likelihood library.

    Science.gov (United States)

    Flouri, T; Izquierdo-Carrasco, F; Darriba, D; Aberer, A J; Nguyen, L-T; Minh, B Q; Von Haeseler, A; Stamatakis, A

    2015-03-01

    We introduce the Phylogenetic Likelihood Library (PLL), a highly optimized application programming interface for developing likelihood-based phylogenetic inference and postanalysis software. The PLL implements appropriate data structures and functions that allow users to quickly implement common, error-prone, and labor-intensive tasks, such as likelihood calculations, model parameter as well as branch length optimization, and tree space exploration. The highly optimized and parallelized implementation of the phylogenetic likelihood function and a thorough documentation provide a framework for rapid development of scalable parallel phylogenetic software. By example of two likelihood-based phylogenetic codes we show that the PLL improves the sequential performance of current software by a factor of 2-10 while requiring only 1 month of programming time for integration. We show that, when numerical scaling for preventing floating point underflow is enabled, the double precision likelihood calculations in the PLL are up to 1.9 times faster than those in BEAGLE. On an empirical DNA dataset with 2000 taxa the AVX version of PLL is 4 times faster than BEAGLE (scaling enabled and required). The PLL is available at http://www.libpll.org under the GNU General Public License (GPL). © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

  17. Forensic application of phylogenetic analyses - Exploration of suspected HIV-1 transmission case.

    Science.gov (United States)

    Siljic, Marina; Salemovic, Dubravka; Cirkovic, Valentina; Pesic-Pavlovic, Ivana; Ranin, Jovan; Todorovic, Marija; Nikolic, Slobodan; Jevtovic, Djordje; Stanojevic, Maja

    2017-03-01

    Transmission of human immunodeficiency virus (HIV) between individuals may have important legal implications and therefore may come to require forensic investigation based upon phylogenetic analysis. In criminal trials results of phylogenetic analyses have been used as evidence of responsibility for HIV transmission. In Serbia, as in many countries worldwide, exposure and deliberate transmission of HIV are criminalized. We present the results of applying state of the art phylogenetic analyses, based on pol and env genetic sequences, in exploration of suspected HIV transmission among three subjects: a man and two women, with presumed assumption of transmission direction from one woman to a man. Phylogenetic methods included relevant neighbor-joining (NJ), maximum likelihood (ML) and Bayesian methods of phylogenetic trees reconstruction and hypothesis testing, that has been shown to be the most sensitive for the reconstruction of epidemiological links mostly from sexually infected individuals. End-point limiting-dilution PCR (EPLD-PCR) assay, generating the minimum of 10 sequences per genetic region per subject, was performed to assess HIV quasispecies distribution and to explore the direction of HIV transmission between three subjects. Phylogenetic analysis revealed that the viral sequences from the three subjects were more genetically related to each other than to other strains circulating in the same area with the similar epidemiological profile, forming strongly supported transmission chain, which could be in favour of a priori hypothesis of one of the women infecting the man. However, in the EPLD based phylogenetic trees for both pol and env genetic region, viral sequences of one subject (man) were paraphyletic to those of two other subjects (women), implying the direction of transmission opposite to the a priori assumption. The dated tree in our analysis confirmed the clustering pattern of query sequences. Still, in the context of unsampled sequences and

  18. Identification and phylogenetic analysis of a novel starch synthase in maize

    Directory of Open Access Journals (Sweden)

    Hanmei eLiu

    2015-11-01

    Full Text Available Starch is an important reserve of carbon and energy in plants, providing the majority of calories in the human diet and animal feed. Its synthesis is orchestrated by several key enzymes, and the amount and structure of starch, affecting crop yield and quality, are determined mainly by starch synthase (SS activity. To date, five SS isoforms, including SSI-IV and Granule Bound Starch Synthase (GBSS have been identified and their physiological functions have been well characterized. Here, we report the identification of a new SS isoform in maize, designated SSV. By searching sequenced genomes, SSV has been found in all green plants with conserved sequences and gene structures. Our phylogenetic analysis based on 780 base pairs has suggested that SSIV and SSV resulted from a gene duplication event, which may have occurred before the algae formation. An expression profile analysis of SSV in maize has indicated that ZmSSV is mainly transcribed in the kernel and ear leaf during the grain filling stage, which is partly similar to other SS isoforms. Therefore, it is likely that SSV may play an important role in starch biosynthesis. Subsequent analysis of SSV function may facilitate understanding the mechanism of starch granules formation, number and structure.

  19. Characterization and evolutionary analysis of tributyltin-binding protein and pufferfish saxitoxin and tetrodotoxin-binding protein genes in toxic and nontoxic pufferfishes.

    Science.gov (United States)

    Hashiguchi, Y; Lee, J M; Shiraishi, M; Komatsu, S; Miki, S; Shimasaki, Y; Mochioka, N; Kusakabe, T; Oshima, Y

    2015-05-01

    Understanding the evolutionary mechanisms of toxin accumulation in pufferfishes has been long-standing problem in toxicology and evolutionary biology. Pufferfish saxitoxin and tetrodotoxin-binding protein (PSTBP) is involved in the transport and accumulation of tetrodotoxin and is one of the most intriguing proteins related to the toxicity of pufferfishes. PSTBPs are fusion proteins consisting of two tandem repeated tributyltin-binding protein type 2 (TBT-bp2) domains. In this study, we examined the evolutionary dynamics of TBT-bp2 and PSTBP genes to understand the evolution of toxin accumulation in pufferfishes. Database searches and/or PCR-based cDNA cloning in nine pufferfish species (6 toxic and 3 nontoxic) revealed that all species possessed one or more TBT-bp2 genes, but PSTBP genes were found only in 5 toxic species belonging to genus Takifugu. These toxic Takifugu species possessed two or three copies of PSTBP genes. Phylogenetic analysis of TBT-bp2 and PSTBP genes suggested that PSTBPs evolved in the common ancestor of Takifugu species by repeated duplications and fusions of TBT-bp2 genes. In addition, a detailed comparison of Takifugu TBT-bp2 and PSTBP gene sequences detected a signature of positive selection under the pressure of gene conversion. The complicated evolutionary dynamics of TBT-bp2 and PSTBP genes may reflect the diversity of toxicity in pufferfishes. © 2015 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2015 European Society For Evolutionary Biology.

  20. Subfamily-specific adaptations in the structures of two penicillin-binding proteins from Mycobacterium tuberculosis.

    Directory of Open Access Journals (Sweden)

    Daniil M Prigozhin

    Full Text Available Beta-lactam antibiotics target penicillin-binding proteins including several enzyme classes essential for bacterial cell-wall homeostasis. To better understand the functional and inhibitor-binding specificities of penicillin-binding proteins from the pathogen, Mycobacterium tuberculosis, we carried out structural and phylogenetic analysis of two predicted D,D-carboxypeptidases, Rv2911 and Rv3330. Optimization of Rv2911 for crystallization using directed evolution and the GFP folding reporter method yielded a soluble quadruple mutant. Structures of optimized Rv2911 bound to phenylmethylsulfonyl fluoride and Rv3330 bound to meropenem show that, in contrast to the nonspecific inhibitor, meropenem forms an extended interaction with the enzyme along a conserved surface. Phylogenetic analysis shows that Rv2911 and Rv3330 belong to different clades that emerged in Actinobacteria and are not represented in model organisms such as Escherichia coli and Bacillus subtilis. Clade-specific adaptations allow these enzymes to fulfill distinct physiological roles despite strict conservation of core catalytic residues. The characteristic differences include potential protein-protein interaction surfaces and specificity-determining residues surrounding the catalytic site. Overall, these structural insights lay the groundwork to develop improved beta-lactam therapeutics for tuberculosis.

  1. Locating a tree in a phylogenetic network

    NARCIS (Netherlands)

    Iersel, van L.J.J.; Semple, C.; Steel, M.A.

    2010-01-01

    Phylogenetic trees and networks are leaf-labelled graphs that are used to describe evolutionary histories of species. The Tree Containment problem asks whether a given phylogenetic tree is embedded in a given phylogenetic network. Given a phylogenetic network and a cluster of species, the Cluster

  2. Global phylogenetic analysis of contemporary aleutian mink disease viruses (AMDVs)

    DEFF Research Database (Denmark)

    Ryt-Hansen, Pia; Hagberg, E. E.; Chriél, Mariann

    2017-01-01

    a strain originating from Sweden. In contrast, we did not identify any potential source for the other and more widespread outbreak strain. To the authors knowledge this is the first major global phylogenetic study of contemporary AMDV partial NS1 sequences. The study proved that partial NS1 sequencing can...

  3. XplorSeq: a software environment for integrated management and phylogenetic analysis of metagenomic sequence data.

    Science.gov (United States)

    Frank, Daniel N

    2008-10-07

    Advances in automated DNA sequencing technology have accelerated the generation of metagenomic DNA sequences, especially environmental ribosomal RNA gene (rDNA) sequences. As the scale of rDNA-based studies of microbial ecology has expanded, need has arisen for software that is capable of managing, annotating, and analyzing the plethora of diverse data accumulated in these projects. XplorSeq is a software package that facilitates the compilation, management and phylogenetic analysis of DNA sequences. XplorSeq was developed for, but is not limited to, high-throughput analysis of environmental rRNA gene sequences. XplorSeq integrates and extends several commonly used UNIX-based analysis tools by use of a Macintosh OS-X-based graphical user interface (GUI). Through this GUI, users may perform basic sequence import and assembly steps (base-calling, vector/primer trimming, contig assembly), perform BLAST (Basic Local Alignment and Search Tool; 123) searches of NCBI and local databases, create multiple sequence alignments, build phylogenetic trees, assemble Operational Taxonomic Units, estimate biodiversity indices, and summarize data in a variety of formats. Furthermore, sequences may be annotated with user-specified meta-data, which then can be used to sort data and organize analyses and reports. A document-based architecture permits parallel analysis of sequence data from multiple clones or amplicons, with sequences and other data stored in a single file. XplorSeq should benefit researchers who are engaged in analyses of environmental sequence data, especially those with little experience using bioinformatics software. Although XplorSeq was developed for management of rDNA sequence data, it can be applied to most any sequencing project. The application is available free of charge for non-commercial use at http://vent.colorado.edu/phyloware.

  4. Trophic phylogenetics: evolutionary influences on body size, feeding, and species associations in grassland arthropods.

    Science.gov (United States)

    Lind, Eric M; Vincent, John B; Weiblen, George D; Cavender-Bares, Jeannine; Borer, Elizabeth T

    2015-04-01

    Contemporary animal-plant interactions such as herbivory are widely understood to be shaped by evolutionary history. Yet questions remain about the role of plant phylogenetic diversity in generating and maintaining herbivore diversity, and whether evolutionary relatedness of producers might predict the composition of consumer communities. We tested for evidence of evolutionary associations among arthropods and the plants on which they were found, using phylogenetic analysis of naturally occurring arthropod assemblages sampled from a plant-diversity manipulation experiment. Considering phylogenetic relationships among more than 900 arthropod consumer taxa and 29 plant species in the experiment, we addressed several interrelated questions. First, our results support the hypothesis that arthropod functional traits such as body size and trophic role are phylogenetically conserved in community ecological samples. Second, herbivores tended to cooccur with closer phylogenetic relatives than would be expected at random, whereas predators and parasitoids did not show phylogenetic association patterns. Consumer specialization, as measured by association through time with monocultures of particular host plant species, showed significant phylogenetic signal, although the. strength of this association varied among plant species. Polycultures of phylogenetically dissimilar plant species supported more phylogenetically dissimilar consumer communities than did phylogenetically similar polycultures. Finally, we separated the effects of plant species richness and relatedness in predicting the phylogenetic distribution of the arthropod assemblages in this experiment. The phylogenetic diversity of plant communities predicted the phylogenetic diversity of herbivore communities even after accounting for plant species richness. The phylogenetic diversity of secondary consumers differed by guild, with predator phylogenetic diversity responding to herbivore relatedness, while parasitoid

  5. Combining protein sequence, structure, and dynamics: A novel approach for functional evolution analysis of PAS domain superfamily.

    Science.gov (United States)

    Dong, Zheng; Zhou, Hongyu; Tao, Peng

    2018-02-01

    PAS domains are widespread in archaea, bacteria, and eukaryota, and play important roles in various functions. In this study, we aim to explore functional evolutionary relationship among proteins in the PAS domain superfamily in view of the sequence-structure-dynamics-function relationship. We collected protein sequences and crystal structure data from RCSB Protein Data Bank of the PAS domain superfamily belonging to three biological functions (nucleotide binding, photoreceptor activity, and transferase activity). Protein sequences were aligned and then used to select sequence-conserved residues and build phylogenetic tree. Three-dimensional structure alignment was also applied to obtain structure-conserved residues. The protein dynamics were analyzed using elastic network model (ENM) and validated by molecular dynamics (MD) simulation. The result showed that the proteins with same function could be grouped by sequence similarity, and proteins in different functional groups displayed statistically significant difference in their vibrational patterns. Interestingly, in all three functional groups, conserved amino acid residues identified by sequence and structure conservation analysis generally have a lower fluctuation than other residues. In addition, the fluctuation of conserved residues in each biological function group was strongly correlated with the corresponding biological function. This research suggested a direct connection in which the protein sequences were related to various functions through structural dynamics. This is a new attempt to delineate functional evolution of proteins using the integrated information of sequence, structure, and dynamics. © 2017 The Protein Society.

  6. Locating a tree in a phylogenetic network

    OpenAIRE

    van Iersel, Leo; Semple, Charles; Steel, Mike

    2010-01-01

    Phylogenetic trees and networks are leaf-labelled graphs that are used to describe evolutionary histories of species. The Tree Containment problem asks whether a given phylogenetic tree is embedded in a given phylogenetic network. Given a phylogenetic network and a cluster of species, the Cluster Containment problem asks whether the given cluster is a cluster of some phylogenetic tree embedded in the network. Both problems are known to be NP-complete in general. In this article, we consider t...

  7. Nonbinary tree-based phylogenetic networks

    OpenAIRE

    Jetten, Laura; van Iersel, Leo

    2016-01-01

    Rooted phylogenetic networks are used to describe evolutionary histories that contain non-treelike evolutionary events such as hybridization and horizontal gene transfer. In some cases, such histories can be described by a phylogenetic base-tree with additional linking arcs, which can for example represent gene transfer events. Such phylogenetic networks are called tree-based. Here, we consider two possible generalizations of this concept to nonbinary networks, which we call tree-based and st...

  8. Encoding phylogenetic trees in terms of weighted quartets.

    Science.gov (United States)

    Grünewald, Stefan; Huber, Katharina T; Moulton, Vincent; Semple, Charles

    2008-04-01

    One of the main problems in phylogenetics is to develop systematic methods for constructing evolutionary or phylogenetic trees. For a set of species X, an edge-weighted phylogenetic X-tree or phylogenetic tree is a (graph theoretical) tree with leaf set X and no degree 2 vertices, together with a map assigning a non-negative length to each edge of the tree. Within phylogenetics, several methods have been proposed for constructing such trees that work by trying to piece together quartet trees on X, i.e. phylogenetic trees each having four leaves in X. Hence, it is of interest to characterise when a collection of quartet trees corresponds to a (unique) phylogenetic tree. Recently, Dress and Erdös provided such a characterisation for binary phylogenetic trees, that is, phylogenetic trees all of whose internal vertices have degree 3. Here we provide a new characterisation for arbitrary phylogenetic trees.

  9. NAPS: Network Analysis of Protein Structures

    Science.gov (United States)

    Chakrabarty, Broto; Parekh, Nita

    2016-01-01

    Traditionally, protein structures have been analysed by the secondary structure architecture and fold arrangement. An alternative approach that has shown promise is modelling proteins as a network of non-covalent interactions between amino acid residues. The network representation of proteins provide a systems approach to topological analysis of complex three-dimensional structures irrespective of secondary structure and fold type and provide insights into structure-function relationship. We have developed a web server for network based analysis of protein structures, NAPS, that facilitates quantitative and qualitative (visual) analysis of residue–residue interactions in: single chains, protein complex, modelled protein structures and trajectories (e.g. from molecular dynamics simulations). The user can specify atom type for network construction, distance range (in Å) and minimal amino acid separation along the sequence. NAPS provides users selection of node(s) and its neighbourhood based on centrality measures, physicochemical properties of amino acids or cluster of well-connected residues (k-cliques) for further analysis. Visual analysis of interacting domains and protein chains, and shortest path lengths between pair of residues are additional features that aid in functional analysis. NAPS support various analyses and visualization views for identifying functional residues, provide insight into mechanisms of protein folding, domain-domain and protein–protein interactions for understanding communication within and between proteins. URL:http://bioinf.iiit.ac.in/NAPS/. PMID:27151201

  10. The complete mitochondrial genome of Somanniathelphusa boyangensis and phylogenetic analysis of Genus Somanniathelphusa (Crustacea: Decapoda: Parathelphusidae.

    Directory of Open Access Journals (Sweden)

    Xin-Nan Jia

    Full Text Available In this study, the authors first obtained the mitochondrial genome of Somanniathelphusa boyangensis. The results showed that the mitochondrial genome is 17,032bp in length, included 13 protein-coding genes, 2 rRNAs genes, 22 tRNAs genes and 1 putative control region, and it has the characteristics of the metazoan mitochondrial genome A+T bias. All tRNA genes display the typical clover-leaf secondary structure except tRNASer(AGN, which has lost the dihydroxyuridine arm. The GenBank database contains the mitochondrial genomes of representatives of approximately 22 families of Brachyura, comprising 56 species, including 4 species of freshwater crab. The authors established the phylogenetic relationships using the maximum likelihood and Bayesian inference methods. The phylogenetic relationship indicated that the molecular taxonomy of S. boyangensis is consistent with current morphological classification, and Parathelphusidae and Potamidae are derived within the freshwater clade or as part of it. In addition, the authors used the COX1 sequence of Somanniathelphusa in GenBank and the COX1 sequence of S. boyangensis to estimated the divergence time of this genus. The result displayed that the divergence time of Somanniathelphusa qiongshanensis is consistent with the separation of Hainan Island from mainland China in the Beibu Gulf, and the divergence time for Somanniathelphusa taiwanensis and Somanniathelphusa amoyensis is consistent with the separation of Taiwan Province from Mainland China at Fujian Province. These data indicate that geologic events influenced speciation of the genus Somanniathelphusa.

  11. Molecular and phylogenetic analysis of Anaplasma spp. in sheep and goats from six provinces of China.

    Science.gov (United States)

    Zhang, Yan; Lv, Yali; Zhang, Feifei; Zhang, Wenjing; Wang, Jinhong; Cui, Yanyan; Wang, Rongjun; Jian, Fuchun; Zhang, Longxian; Ning, Changshen

    2016-12-30

    Members of the genus Anaplasma are important emerging tick-borne pathogens in both humans and animals in tropical and subtropical areas. Here, we investigated the presence of Anaplasma spp. in 621 sheep and 710 goats from six provinces of China. Polymerase chain reaction (PCR) and DNA sequencing were conducted to determine the prevalence of Anaplasma (A.) phagocytophilum, A. ovis and A. bovis targeting the 16S ribosomal RNA or the major surface protein 4 gene. PCR revealed Anaplasma in 39.0% (240/621) of sheep and 45.5% (323/710) of goats. The most frequently detected species was A. ovis (88/621, 14.2% for sheep; 129/710, 18.2% for goats), followed by A. bovis (60/621, 9.7% for sheep; 74/710, 10.4% for goats) and A. phagocytophilum (33/621, 5.3% for sheep; 15/710, 2.1% for goats). Additionally, eight sheep and 20 goats were found to be infected with three pathogens simultaneously. DNA sequencing confirmed the presence of these three Anaplasma species in the investigated areas, and phylogenetic analysis indicated that there was geographic segregation to a certain extent, as well as a relationship between the host and cluster of A. ovis. The results of the present study provide valuable data that helps understand the epidemiology of anaplasmosis in ruminants from China.

  12. Sub-grouping and sub-functionalization of the RIFIN multi-copy protein family

    Directory of Open Access Journals (Sweden)

    Sonnhammer Erik L

    2008-01-01

    Full Text Available Abstract Background Parasitic protozoans possess many multicopy gene families which have central roles in parasite survival and virulence. The number and variability of members of these gene families often make it difficult to predict possible functions of the encoded proteins. The families of extra-cellular proteins that are exposed to a host immune response have been driven via immune selection to become antigenically variant, and thereby avoid immune recognition while maintaining protein function to establish a chronic infection. Results We have combined phylogenetic and function shift analyses to study the evolution of the RIFIN proteins, which are antigenically variant and are encoded by the largest multicopy gene family in Plasmodium falciparum. We show that this family can be subdivided into two major groups that we named A- and B-RIFIN proteins. This suggested sub-grouping is supported by a recently published study that showed that, despite the presence of the Plasmodium export (PEXEL motif in all RIFIN variants, proteins from each group have different cellular localizations during the intraerythrocytic life cycle of the parasite. In the present study we show that function shift analysis, a novel technique to predict functional divergence between sub-groups of a protein family, indicates that RIFINs have undergone neo- or sub-functionalization. Conclusion These results question the general trend of clustering large antigenically variant protein groups into homogenous families. Assigning functions to protein families requires their subdivision into meaningful groups such as we have shown for the RIFIN protein family. Using phylogenetic and function shift analysis methods, we identify new directions for the investigation of this broad and complex group of proteins.

  13. Global patterns of amphibian phylogenetic diversity

    DEFF Research Database (Denmark)

    Fritz, Susanne; Rahbek, Carsten

    2012-01-01

    Aim  Phylogenetic diversity can provide insight into how evolutionary processes may have shaped contemporary patterns of species richness. Here, we aim to test for the influence of phylogenetic history on global patterns of amphibian species richness, and to identify areas where macroevolutionary...... processes such as diversification and dispersal have left strong signatures on contemporary species richness. Location  Global; equal-area grid cells of approximately 10,000 km2. Methods  We generated an amphibian global supertree (6111 species) and repeated analyses with the largest available molecular...... phylogeny (2792 species). We combined each tree with global species distributions to map four indices of phylogenetic diversity. To investigate congruence between global spatial patterns of amphibian species richness and phylogenetic diversity, we selected Faith’s phylogenetic diversity (PD) index...

  14. Analysis of Acorus calamus chloroplast genome and its phylogenetic implications.

    Science.gov (United States)

    Goremykin, Vadim V; Holland, Barbara; Hirsch-Ernst, Karen I; Hellwig, Frank H

    2005-09-01

    Determining the phylogenetic relationships among the major lines of angiosperms is a long-standing problem, yet the uncertainty as to the phylogenetic affinity of these lines persists. While a number of studies have suggested that the ANITA (Amborella-Nymphaeales-Illiciales-Trimeniales-Aristolochiales) grade is basal within angiosperms, studies of complete chloroplast genome sequences also suggested an alternative tree, wherein the line leading to the grasses branches first among the angiosperms. To improve taxon sampling in the existing chloroplast genome data, we sequenced the chloroplast genome of the monocot Acorus calamus. We generated a concatenated alignment (89,436 positions for 15 taxa), encompassing almost all sequences usable for phylogeny reconstruction within spermatophytes. The data still contain support for both the ANITA-basal and grasses-basal hypotheses. Using simulations we can show that were the ANITA-basal hypothesis true, parsimony (and distance-based methods with many models) would be expected to fail to recover it. The self-evident explanation for this failure appears to be a long-branch attraction (LBA) between the clade of grasses and the out-group. However, this LBA cannot explain the discrepancies observed between tree topology recovered using the maximum likelihood (ML) method and the topologies recovered using the parsimony and distance-based methods when grasses are deleted. Furthermore, the fact that neither maximum parsimony nor distance methods consistently recover the ML tree, when according to the simulations they would be expected to, when the out-group (Pinus) is deleted, suggests that either the generating tree is not correct or the best symmetric model is misspecified (or both). We demonstrate that the tree recovered under ML is extremely sensitive to model specification and that the best symmetric model is misspecified. Hence, we remain agnostic regarding phylogenetic relationships among basal angiosperm lineages.

  15. Phylogenetic distribution of large-scale genome patchiness

    Directory of Open Access Journals (Sweden)

    Hackenberg Michael

    2008-04-01

    Full Text Available Abstract Background The phylogenetic distribution of large-scale genome structure (i.e. mosaic compositional patchiness has been explored mainly by analytical ultracentrifugation of bulk DNA. However, with the availability of large, good-quality chromosome sequences, and the recently developed computational methods to directly analyze patchiness on the genome sequence, an evolutionary comparative analysis can be carried out at the sequence level. Results The local variations in the scaling exponent of the Detrended Fluctuation Analysis are used here to analyze large-scale genome structure and directly uncover the characteristic scales present in genome sequences. Furthermore, through shuffling experiments of selected genome regions, computationally-identified, isochore-like regions were identified as the biological source for the uncovered large-scale genome structure. The phylogenetic distribution of short- and large-scale patchiness was determined in the best-sequenced genome assemblies from eleven eukaryotic genomes: mammals (Homo sapiens, Pan troglodytes, Mus musculus, Rattus norvegicus, and Canis familiaris, birds (Gallus gallus, fishes (Danio rerio, invertebrates (Drosophila melanogaster and Caenorhabditis elegans, plants (Arabidopsis thaliana and yeasts (Saccharomyces cerevisiae. We found large-scale patchiness of genome structure, associated with in silico determined, isochore-like regions, throughout this wide phylogenetic range. Conclusion Large-scale genome structure is detected by directly analyzing DNA sequences in a wide range of eukaryotic chromosome sequences, from human to yeast. In all these genomes, large-scale patchiness can be associated with the isochore-like regions, as directly detected in silico at the sequence level.

  16. Exploring the relationship between sequence similarity and accurate phylogenetic trees.

    Science.gov (United States)

    Cantarel, Brandi L; Morrison, Hilary G; Pearson, William

    2006-11-01

    We have characterized the relationship between accurate phylogenetic reconstruction and sequence similarity, testing whether high levels of sequence similarity can consistently produce accurate evolutionary trees. We generated protein families with known phylogenies using a modified version of the PAML/EVOLVER program that produces insertions and deletions as well as substitutions. Protein families were evolved over a range of 100-400 point accepted mutations; at these distances 63% of the families shared significant sequence similarity. Protein families were evolved using balanced and unbalanced trees, with ancient or recent radiations. In families sharing statistically significant similarity, about 60% of multiple sequence alignments were 95% identical to true alignments. To compare recovered topologies with true topologies, we used a score that reflects the fraction of clades that were correctly clustered. As expected, the accuracy of the phylogenies was greatest in the least divergent families. About 88% of phylogenies clustered over 80% of clades in families that shared significant sequence similarity, using Bayesian, parsimony, distance, and maximum likelihood methods. However, for protein families with short ancient branches (ancient radiation), only 30% of the most divergent (but statistically significant) families produced accurate phylogenies, and only about 70% of the second most highly conserved families, with median expectation values better than 10(-60), produced accurate trees. These values represent upper bounds on expected tree accuracy for sequences with a simple divergence history; proteins from 700 Giardia families, with a similar range of sequence similarities but considerably more gaps, produced much less accurate trees. For our simulated insertions and deletions, correct multiple sequence alignments did not perform much better than those produced by T-COFFEE, and including sequences with expressed sequence tag-like sequencing errors did not

  17. The limitations of ontogenetic data in phylogenetic analyses

    NARCIS (Netherlands)

    Koenemann, Stefan; Schram, Frederick R.

    2002-01-01

    The analysis of consecutive ontogenetic stages, or events, introduces a new class of data to phylogenetic systematics that are distinctly different from traditional morphological characters and molecular sequence data. Ontogenetic event sequences are distinguished by varying degrees of both a

  18. Measures of phylogenetic differentiation provide robust and complementary insights into microbial communities.

    Science.gov (United States)

    Parks, Donovan H; Beiko, Robert G

    2013-01-01

    High-throughput sequencing techniques have made large-scale spatial and temporal surveys of microbial communities routine. Gaining insight into microbial diversity requires methods for effectively analyzing and visualizing these extensive data sets. Phylogenetic β-diversity measures address this challenge by allowing the relationship between large numbers of environmental samples to be explored using standard multivariate analysis techniques. Despite the success and widespread use of phylogenetic β-diversity measures, an extensive comparative analysis of these measures has not been performed. Here, we compare 39 measures of phylogenetic β diversity in order to establish the relative similarity of these measures along with key properties and performance characteristics. While many measures are highly correlated, those commonly used within microbial ecology were found to be distinct from those popular within classical ecology, and from the recently recommended Gower and Canberra measures. Many of the measures are surprisingly robust to different rootings of the gene tree, the choice of similarity threshold used to define operational taxonomic units, and the presence of outlying basal lineages. Measures differ considerably in their sensitivity to rare organisms, and the effectiveness of measures can vary substantially under alternative models of differentiation. Consequently, the depth of sequencing required to reveal underlying patterns of relationships between environmental samples depends on the selected measure. Our results demonstrate that using complementary measures of phylogenetic β diversity can further our understanding of how communities are phylogenetically differentiated. Open-source software implementing the phylogenetic β-diversity measures evaluated in this manuscript is available at http://kiwi.cs.dal.ca/Software/ExpressBetaDiversity.

  19. Mitochondrial DNA sequence-based phylogenetic relationship ...

    Indian Academy of Sciences (India)

    cophaga ranges from 0.037–0.106 and 0.049–0.207 for COI and ND5 genes, respectively (tables 2 and 3). Analysis of genetic distance on the basis of sequence difference for both the mitochondrial genes shows very little genetic difference. The discrepancy in the phylogenetic trees based on individ- ual genes may be due ...

  20. PHYTOCHEMICAL AND PHYLOGENETIC ANALYSIS OF Spondias(Anacardiaceae

    Directory of Open Access Journals (Sweden)

    Cristiane Pereira

    2015-07-01

    Full Text Available This paper describes the correlation between the phenolic composition and the molecular phylogenetic reconstruction of five Spondias species (Anacardiaceae. Two of these species (S. venulosa and Spondias sp. occur in rainforest areas and the other three are widely distributed in Brazil (S. dulcis, S.mombin, and S. purpurea. The flavonoid enriched fraction of the S. venulosa leaf extract also underwent a chemical study. The results indicate that the presence of flavonol 3-O-glycosides are a synapomorphic character of the studied American Spondias and the production of rhamnetin 3-O-rutinoside is a synapomorphy of the Atlantic forest species. This is the first report of flavonoids in S. venulosa, an endemic species from the Brazilian Atlantic rainforest.

  1. Phylogenetic Relationships of Pseudorasbora, Pseudopungtungia, and Pungtungia (Teleostei; Cypriniformes; Gobioninae Inferred from Multiple Nuclear Gene Sequences

    Directory of Open Access Journals (Sweden)

    Keun-Yong Kim

    2013-01-01

    Full Text Available Gobionine species belonging to the genera Pseudorasbora, Pseudopungtungia, and Pungtungia (Teleostei; Cypriniformes; Cyprinidae have been heavily studied because of problems on taxonomy, threats of extinction, invasion, and human health. Nucleotide sequences of three nuclear genes, that is, recombination activating protein gene 1 (rag1, recombination activating gene 2 (rag2, and early growth response 1 gene (egr1, from Pseudorasbora, Pseudopungtungia, and Pungtungia species residing in China, Japan, and Korea, were analyzed to elucidate their intergeneric and interspecific phylogenetic relationships. In the phylogenetic tree inferred from their multiple gene sequences, Pseudorasbora, Pseudopungtungia and Pungtungia species ramified into three phylogenetically distinct clades; the “tenuicorpa” clade composed of Pseudopungtungia tenuicorpa, the “parva” clade composed of all Pseudorasbora species/subspecies, and the “herzi” clade composed of Pseudopungtungia nigra, and Pungtungia herzi. The genus Pseudorasbora was recovered as monophyletic, while the genus Pseudopungtungia was recovered as polyphyletic. Our phylogenetic result implies the unstable taxonomic status of the genus Pseudopungtungia.

  2. Tree-Based Unrooted Phylogenetic Networks.

    Science.gov (United States)

    Francis, A; Huber, K T; Moulton, V

    2018-02-01

    Phylogenetic networks are a generalization of phylogenetic trees that are used to represent non-tree-like evolutionary histories that arise in organisms such as plants and bacteria, or uncertainty in evolutionary histories. An unrooted phylogenetic network on a non-empty, finite set X of taxa, or network, is a connected, simple graph in which every vertex has degree 1 or 3 and whose leaf set is X. It is called a phylogenetic tree if the underlying graph is a tree. In this paper we consider properties of tree-based networks, that is, networks that can be constructed by adding edges into a phylogenetic tree. We show that although they have some properties in common with their rooted analogues which have recently drawn much attention in the literature, they have some striking differences in terms of both their structural and computational properties. We expect that our results could eventually have applications to, for example, detecting horizontal gene transfer or hybridization which are important factors in the evolution of many organisms.

  3. Molecular characterization and phylogenetic analysis of small ruminant lentiviruses isolated from Canadian sheep and goats

    Directory of Open Access Journals (Sweden)

    Bertoni Giuseppe

    2011-06-01

    Full Text Available Abstract Background Small Ruminant Lentiviruses (SRLV are widespread in Canadian sheep and goats and represent an important health issue in these animals. There is however no data about the genetic diversity of Caprine Arthritis Encephalitis Virus (CAEV or Maedi Visna Virus (MVV in this country. Findings We performed a molecular and phylogenetic analysis of sheep and goat lentiviruses from a small geographic area in Canada using long sequences from the gag region of 30 infected sheep and 36 infected goats originating from 14 different flocks. Pairwise DNA distance and phylogenetic analyses revealed that all SRLV sequences obtained from sheep clustered tightly with prototypical Maedi visna sequences from America. Similarly, all SRLV strains obtained from goats clustered tightly with prototypical US CAEV-Cork strain. Conclusions The data reported in this study suggests that Canadian and US SRLV strains share common origins. In addition, the molecular data failed to bring to light any evidence of past cross species transmission between sheep and goats, which is consistent with the type of farming practiced in this part of the country where single species flocks predominate and where opportunities of cross species transmissions are proportionately low.

  4. Complete mitochondrial genome of the Indian peafowl (Pavo cristatus), with phylogenetic analysis in phasianidae.

    Science.gov (United States)

    Zhou, Tai-Cheng; Sha, Tao; Irwin, David M; Zhang, Ya-Ping

    2015-01-01

    Pavo cristatus, known as the Indian peafowl, is endemic to India and Sri Lanka and has been domesticated for its ornamental and food value. However, its phylogenetic status is still debated. Here, to clarify the phylogenetic status of P. cristatus within Phasianidae, we analyzed its mitochondrial genome (mtDNA). The complete mitochondrial DNA (mtDNA) genome was determined using 34 pairs of primers. Our data show that the mtDNA genome of P. cristatus is 16,686 bp in length. Molecular phylogenetic analyses of P. cristatus was performed along with 22 complete mtDNA genomes belonging to other species in Phasianidae using Bayesian and maximum likelihood methods, where Aythya americana and Anas platyrhynchos were used as outgroups. Our results show that P. critatus has its closest genetic affinity with Pavo muticus and belongs to clade that contains Gallus, Bambusicola and Francolinus.

  5. A New Perspective on Polyploid Fragaria (Strawberry) Genome Composition Based on Large-Scale, Multi-Locus Phylogenetic Analysis

    OpenAIRE

    Yang, Yilong; Davis, Thomas M

    2017-01-01

    Abstract The subgenomic compositions of the octoploid (2n = 8× = 56) strawberry (Fragaria) species, including the economically important cultivated species Fragaria x ananassa, have been a topic of long-standing interest. Phylogenomic approaches utilizing next-generation sequencing technologies offer a new window into species relationships and the subgenomic compositions of polyploids. We have conducted a large-scale phylogenetic analysis of Fragaria (strawberry) species using the Fluidigm Ac...

  6. Short communication. Genotyping and phylogenetic analysis of bovine viral diarrhea virus (BVDV isolates in Kosovo

    Directory of Open Access Journals (Sweden)

    Izedin Goga

    2014-03-01

    Full Text Available Three serum samples positive in Antigen ELISA BVDV have been tested to characterise genetic diversity of bovine viral diarrhea virus (BVDV in Kosovo. Samples were obtained in 2011 from heifers and were amplified by reverse transcription-polymerase chain reaction, sequenced and analysed by computer-assisted phylogenetic analysis. Amplified products and nucleotide sequence showed that all 3 isolates belonged to BVDV 1 genotype and 1b sub genotype. These results enrich the extant knowledge of BVDV and represent the first documented data about Kosovo BVDV isolates.

  7. Comparative sequence analysis of acid sensitive/resistance proteins in Escherichia coli and Shigella flexneri

    Science.gov (United States)

    Manikandan, Selvaraj; Balaji, Seetharaaman; Kumar, Anil; Kumar, Rita

    2007-01-01

    The molecular basis for the survival of bacteria under extreme conditions in which growth is inhibited is a question of great current interest. A preliminary study was carried out to determine residue pattern conservation among the antiporters of enteric bacteria, responsible for extreme acid sensitivity especially in Escherichia coli and Shigella flexneri. Here we found the molecular evidence that proved the relationship between E. coli and S. flexneri. Multiple sequence alignment of the gadC coded acid sensitive antiporter showed many conserved residue patterns at regular intervals at the N-terminal region. It was observed that as the alignment approaches towards the C-terminal, the number of conserved residues decreases, indicating that the N-terminal region of this protein has much active role when compared to the carboxyl terminal. The motif, FHLVFFLLLGG, is well conserved within the entire gadC coded protein at the amino terminal. The motif is also partially conserved among other antiporters (which are not coded by gadC) but involved in acid sensitive/resistance mechanism. Phylogenetic cluster analysis proves the relationship of Escherichia coli and Shigella flexneri. The gadC coded proteins are converged as a clade and diverged from other antiporters belongs to the amino acid-polyamine-organocation (APC) superfamily. PMID:21670792

  8. Phylogenetic relationships within the cyst-forming nematodes (Nematoda, Heteroderidae) based on analysis of sequences from the ITS regions of ribosomal DNA.

    Science.gov (United States)

    Subbotin, S A; Vierstraete, A; De Ley, P; Rowe, J; Waeyenberge, L; Moens, M; Vanfleteren, J R

    2001-10-01

    The ITS1, ITS2, and 5.8S gene sequences of nuclear ribosomal DNA from 40 taxa of the family Heteroderidae (including the genera Afenestrata, Cactodera, Heterodera, Globodera, Punctodera, Meloidodera, Cryphodera, and Thecavermiculatus) were sequenced and analyzed. The ITS regions displayed high levels of sequence divergence within Heteroderinae and compared to outgroup taxa. Unlike recent findings in root knot nematodes, ITS sequence polymorphism does not appear to complicate phylogenetic analysis of cyst nematodes. Phylogenetic analyses with maximum-parsimony, minimum-evolution, and maximum-likelihood methods were performed with a range of computer alignments, including elision and culled alignments. All multiple alignments and phylogenetic methods yielded similar basic structure for phylogenetic relationships of Heteroderidae. The cyst-forming nematodes are represented by six main clades corresponding to morphological characters and host specialization, with certain clades assuming different positions depending on alignment procedure and/or method of phylogenetic inference. Hypotheses of monophyly of Punctoderinae and Heteroderinae are, respectively, strongly and moderately supported by the ITS data across most alignments. Close relationships were revealed between the Avenae and the Sacchari groups and between the Humuli group and the species H. salixophila within Heteroderinae. The Goettingiana group occupies a basal position within this subfamily. The validity of the genera Afenestrata and Bidera was tested and is discussed based on molecular data. We conclude that ITS sequence data are appropriate for studies of relationships within the different species groups and less so for recovery of more ancient speciations within Heteroderidae. Copyright 2001 Academic Press.

  9. Occurrence and phylogenetic analysis of ‘Candidatus Mycoplasma haemominutum’ in wild felines from Paraná, Brazil

    Directory of Open Access Journals (Sweden)

    Claudia Mello Ribeiro

    2017-08-01

    Full Text Available Hemoplasma infections are emerging and wild fauna can represent an important reservoir of these pathogens. However, there are very few epidemiological studies about the occurrence of hemoplasmas in wild cats around the world. The purpose of this study is twofold: (1 evaluate the occurrence and phylogeny of hemoplasmas in captive wild felines at a zoo in the state of Paraná, Brazil, and (2 verify the correlation between subpopulations of these bacteria and the hematological and biochemical parameters of the animals. PCR was used to detect hemoplasmas in the blood of three cougars (Puma concolor, a jaguar (Panthera onca, a tiger (Panthera tigris and a lion (Panthera leo, followed by sequencing and phylogenetic analysis. The cougars and jaguar were found to be hemoplasma-positive by PCR. The phylogenetic analysis of the 16S rRNA gene sequences enabled the identification of genotypes of ‘Candidatus Mycoplasma haemominutum’ circulating in this zoo. The identified sequences were closely related to hemoplasma sequences originating from domestic cats and other wild cats, but the infected cougars and jaguar were healthy and showed no hematological or biochemical changes. It was concluded that P. concolor and P. onca are exposed to ‘Candidatus Mycoplasma haemominutum’ in Paraná, but further research is suggested to assess the resistance of wild cats to different hemoplasma subpopulations.

  10. Phylogenetic selection of target species in Amaryllidaceae tribe Haemantheae for acetylcholinesterase inhibition and affinity to the serotonin reuptake transport protein

    Science.gov (United States)

    We present phylogenetic analyses of 37 taxa of Amaryllidaceae, tribe Haemantheae and Amaryllis belladonna L. as an outgroup, in order to provide a phylogenetic framework for the selection of candidate plants for lead discoveries in relation to Alzheimer´s disease and depression. DNA sequences from t...

  11. The complete mitochondrial genome of Sesarmops sinensis reveals gene rearrangements and phylogenetic relationships in Brachyura.

    Science.gov (United States)

    Tang, Bo-Ping; Xin, Zhao-Zhe; Liu, Yu; Zhang, Dai-Zhen; Wang, Zheng-Fei; Zhang, Hua-Bin; Chai, Xin-Yue; Zhou, Chun-Lin; Liu, Qiu-Ning

    2017-01-01

    Mitochondrial genome (mitogenome) is very important to understand molecular evolution and phylogenetics. Herein, in this study, the complete mitogenome of Sesarmops sinensis was reported. The mitogenome was 15,905 bp in size, and contained 13 protein-coding genes (PCGs), two ribosomal RNA (rRNA) genes, 22 transfer RNA (tRNA) genes, and a control region (CR). The AT skew and the GC skew are both negative in the mitogenomes of S. sinensis. The nucleotide composition of the S. sinensis mitogenome was also biased toward A + T nucleotides (75.7%). All tRNA genes displayed a typical mitochondrial tRNA cloverleaf structure, except for the trnS1 gene, which lacked a dihydroxyuridine arm. S. sinensis exhibits a novel rearrangement compared with the Pancrustacean ground pattern and other Brachyura species. Based on the 13 PCGs, the phylogenetic analysis showed that S. sinensis and Sesarma neglectum were clustered on one branch with high nodal support values, indicating that S. sinensis and S. neglectum have a sister group relationship. The group (S. sinensis + S. neglectum) was sister to (Parasesarmops tripectinis + Metopaulias depressus), suggesting that S. sinensis belongs to Grapsoidea, Sesarmidae. Phylogenetic trees based on amino acid sequences and nucleotide sequences of mitochondrial 13 PCGs using BI and ML respectively indicate that section Eubrachyura consists of four groups clearly. The resulting phylogeny supports the establishment of a separate subsection Potamoida. These four groups correspond to four subsections of Raninoida, Heterotremata, Potamoida, and Thoracotremata.

  12. Chloroplast Genome of the Folk Medicine and Vegetable Plant Talinum paniculatum (Jacq.) Gaertn.: Gene Organization, Comparative and Phylogenetic Analysis.

    Science.gov (United States)

    Liu, Xia; Li, Yuan; Yang, Hongyuan; Zhou, Boyang

    2018-04-09

    The complete chloroplast (cp) genome of Talinum paniculatum (Caryophyllale), a source of pharmaceutical efficacy similar to ginseng, and a widely distributed and planted edible vegetable, were sequenced and analyzed. The cp genome size of T. paniculatum is 156,929 bp, with a pair of inverted repeats (IRs) of 25,751 bp separated by a large single copy (LSC) region of 86,898 bp and a small single copy (SSC) region of 18,529 bp. The genome contains 83 protein-coding genes, 37 transfer RNA (tRNA) genes, eight ribosomal RNA (rRNA) genes and four pseudogenes. Fifty one (51) repeat units and ninety two (92) simple sequence repeats (SSRs) were found in the genome. The pseudogene rpl23 (Ribosomal protein L23) was insert AATT than other Caryophyllale species by sequence alignment, which located in IRs region. The gene of trnK-UUU (tRNA-Lys) and rpl16 (Ribosomal protein L16) have larger introns in T. paniculatum , and the existence of matK (maturase K) genes, which usually located in the introns of trnK-UUU , rich sequence divergence in Caryophyllale. Complete cp genome comparison with other eight Caryophyllales species indicated that the differences between T. paniculatum and P. oleracea were very slight, and the most highly divergent regions occurred in intergenic spacers. Comparisons of IR boundaries among nine Caryophyllales species showed that T. paniculatum have larger IRs region and the contraction is relatively slight. The phylogenetic analysis among 35 Caryophyllales species and two outgroup species revealed that T. paniculatum and P. oleracea do not belong to the same family. All these results give good opportunities for future identification, barcoding of Talinum species, understanding the evolutionary mode of Caryophyllale cp genome and molecular breeding of T. paniculatum with high pharmaceutical efficacy.

  13. Genome-wide identification and expression analysis of the mitogen-activated protein kinase gene family in cassava

    Directory of Open Access Journals (Sweden)

    Yan Yan

    2016-08-01

    Full Text Available Mitogen-activated protein kinases (MAPKs play central roles in plant developmental processes, hormone signaling transduction, and responses to abiotic stress. However, no data are currently available about the MAPK family in cassava, an important tropical crop. Herein, 21 MeMAPK genes were identified from cassava. Phylogenetic analysis indicated that MeMAPKs could be classified into four subfamilies. Gene structure analysis demonstrated that the number of introns in MeMAPK genes ranged from 1 to 10, suggesting large variation among cassava MAPK genes. Conserved motif analysis indicated that all MeMAPKs had typical protein kinase domains. Transcriptomic analysis suggested that MeMAPK genes showed differential expression patterns in distinct tissues and in response to drought stress between wild subspecies and cultivated varieties. Interaction networks and co-expression analyses revealed that crucial pathways controlled by MeMAPK networks may be involved in the differential response to drought stress in different accessions of cassava. Expression of nine selected MAPK genes showed that these genes could comprehensively respond to osmotic, salt, cold, oxidative stressors, and abscisic acid (ABA signaling. These findings yield new insights into the transcriptional control of MAPK gene expression, provide an improved understanding of abiotic stress responses and signaling transduction in cassava, and lead to potential applications in the genetic improvement of cassava cultivars.

  14. Genetic characterization of the non-structural protein-3 gene of bluetongue virus serotype-2 isolate from India

    Directory of Open Access Journals (Sweden)

    Raghavendra Sumanth Pudupakam

    2017-03-01

    Full Text Available Aim: Sequence analysis and phylogenetic studies based on non-structural protein-3 (NS3 gene are important in understanding the evolution and epidemiology of bluetongue virus (BTV. This study was aimed at characterizing the NS3 gene sequence of Indian BTV serotype-2 (BTV2 to elucidate its genetic relationship to global BTV isolates. Materials and Methods: The NS3 gene of BTV2 was amplified from infected BHK-21 cell cultures, cloned and subjected to sequence analysis. The generated NS3 gene sequence was compared with the corresponding sequences of different BTV serotypes across the world, and a phylogenetic relationship was established. Results: The NS3 gene of BTV2 showed moderate levels of variability in comparison to different BTV serotypes, with nucleotide sequence identities ranging from 81% to 98%. The region showed high sequence homology of 93-99% at amino acid level with various BTV serotypes. The PPXY/PTAP late domain motifs, glycosylation sites, hydrophobic domains, and the amino acid residues critical for virus-host interactions were conserved in NS3 protein. Phylogenetic analysis revealed that BTV isolates segregate into four topotypes and that the Indian BTV2 in subclade IA is closely related to Asian and Australian origin strains. Conclusion: Analysis of the NS3 gene indicated that Indian BTV2 isolate is closely related to strains from Asia and Australia, suggesting a common origin of infection. Although the pattern of evolution of BTV2 isolate is different from other global isolates, the deduced amino acid sequence of NS3 protein demonstrated high molecular stability.

  15. Genetic characterization of the non-structural protein-3 gene of bluetongue virus serotype-2 isolate from India.

    Science.gov (United States)

    Pudupakam, Raghavendra Sumanth; Raghunath, Shobana; Pudupakam, Meghanath; Daggupati, Sreenivasulu

    2017-03-01

    Sequence analysis and phylogenetic studies based on non-structural protein-3 (NS3) gene are important in understanding the evolution and epidemiology of bluetongue virus (BTV). This study was aimed at characterizing the NS3 gene sequence of Indian BTV serotype-2 (BTV2) to elucidate its genetic relationship to global BTV isolates. The NS3 gene of BTV2 was amplified from infected BHK-21 cell cultures, cloned and subjected to sequence analysis. The generated NS3 gene sequence was compared with the corresponding sequences of different BTV serotypes across the world, and a phylogenetic relationship was established. The NS3 gene of BTV2 showed moderate levels of variability in comparison to different BTV serotypes, with nucleotide sequence identities ranging from 81% to 98%. The region showed high sequence homology of 93-99% at amino acid level with various BTV serotypes. The PPXY/PTAP late domain motifs, glycosylation sites, hydrophobic domains, and the amino acid residues critical for virus-host interactions were conserved in NS3 protein. Phylogenetic analysis revealed that BTV isolates segregate into four topotypes and that the Indian BTV2 in subclade IA is closely related to Asian and Australian origin strains. Analysis of the NS3 gene indicated that Indian BTV2 isolate is closely related to strains from Asia and Australia, suggesting a common origin of infection. Although the pattern of evolution of BTV2 isolate is different from other global isolates, the deduced amino acid sequence of NS3 protein demonstrated high molecular stability.

  16. Nodal distances for rooted phylogenetic trees.

    Science.gov (United States)

    Cardona, Gabriel; Llabrés, Mercè; Rosselló, Francesc; Valiente, Gabriel

    2010-08-01

    Dissimilarity measures for (possibly weighted) phylogenetic trees based on the comparison of their vectors of path lengths between pairs of taxa, have been present in the systematics literature since the early seventies. For rooted phylogenetic trees, however, these vectors can only separate non-weighted binary trees, and therefore these dissimilarity measures are metrics only on this class of rooted phylogenetic trees. In this paper we overcome this problem, by splitting in a suitable way each path length between two taxa into two lengths. We prove that the resulting splitted path lengths matrices single out arbitrary rooted phylogenetic trees with nested taxa and arcs weighted in the set of positive real numbers. This allows the definition of metrics on this general class of rooted phylogenetic trees by comparing these matrices through metrics in spaces M(n)(R) of real-valued n x n matrices. We conclude this paper by establishing some basic facts about the metrics for non-weighted phylogenetic trees defined in this way using L(p) metrics on M(n)(R), with p [epsilon] R(>0).

  17. Identification and phylogenetic analysis of Dirofilaria ursi (Nematoda: Filarioidea) from Wisconsin black bears (Ursus americanus) and its Wolbachia endosymbiont.

    Science.gov (United States)

    Michalski, Michelle L; Bain, Odile; Fischer, Kerstin; Fischer, Peter U; Kumar, Sanjay; Foster, Jeremy M

    2010-04-01

    Dirofilaria ursi is a filarial nematode of American black bears (Ursus americanus Pallas, 1780) that is vectored by black flies (Simuliidae) in many parts of the United States. In northwestern Wisconsin, the prevalence of microfilaremic bears during the fall hunting season was 21% (n = 47). Unsheathed blood microfilariae from Wisconsin bears possess characters consistent with the original description of D. ursi, as do adult worms observed histologically and grossly. Immunohistochemistry was used to identify the Wolbachia endosymbiont in the hypodermis and lateral cords of an adult female D. ursi. Amplification of wsp, gatB, coxA, fbpA, and ftsZ bacterial sequences from parasite DNA confirmed the presence of Wolbachia, and molecular phylogenetic analysis of the Wolbachia ftsZ gene groups the endosymbiont with Wolbachia from D. immitis and D. repens. Phylogenetic analysis of D. ursi 5s rDNA sequence confirms the morphological observations grouping this parasite as a member of Dirofilaria, and within the Dirofilaria - Onchocerca clade of filarial nematodes. This is the first report of Wolbachia characterization and molecular phylogeny information for D. ursi.

  18. Ant-Based Phylogenetic Reconstruction (ABPR: A new distance algorithm for phylogenetic estimation based on ant colony optimization

    Directory of Open Access Journals (Sweden)

    Karla Vittori

    2008-12-01

    Full Text Available We propose a new distance algorithm for phylogenetic estimation based on Ant Colony Optimization (ACO, named Ant-Based Phylogenetic Reconstruction (ABPR. ABPR joins two taxa iteratively based on evolutionary distance among sequences, while also accounting for the quality of the phylogenetic tree built according to the total length of the tree. Similar to optimization algorithms for phylogenetic estimation, the algorithm allows exploration of a larger set of nearly optimal solutions. We applied the algorithm to four empirical data sets of mitochondrial DNA ranging from 12 to 186 sequences, and from 898 to 16,608 base pairs, and covering taxonomic levels from populations to orders. We show that ABPR performs better than the commonly used Neighbor-Joining algorithm, except when sequences are too closely related (e.g., population-level sequences. The phylogenetic relationships recovered at and above species level by ABPR agree with conventional views. However, like other algorithms of phylogenetic estimation, the proposed algorithm failed to recover expected relationships when distances are too similar or when rates of evolution are very variable, leading to the problem of long-branch attraction. ABPR, as well as other ACO-based algorithms, is emerging as a fast and accurate alternative method of phylogenetic estimation for large data sets.

  19. Phylogenetic resolution and habitat specificity of members of the Photobacterium phosphoreum species group.

    Science.gov (United States)

    Ast, Jennifer C; Dunlap, Paul V

    2005-10-01

    Substantial ambiguity exists regarding the phylogenetic status of facultatively psychrophilic luminous bacteria identified as Photobacterium phosphoreum, a species thought to be widely distributed in the world's oceans and believed to be the specific bioluminescent light-organ symbiont of several deep-sea fishes. Members of the P. phosphoreum species group include luminous and non-luminous strains identified phenotypically from a variety of different habitats as well as phylogenetically defined lineages that appear to be evolutionarily distinct. To resolve this ambiguity and to begin developing a meaningful knowledge of the geographic distributions, habitats and symbiotic relationships of bacteria in the P. phosphoreum species group, we carried out a multilocus, fine-scale phylogenetic analysis based on sequences of the 16S rRNA, gyrB and luxABFE genes of many newly isolated luminous strains from symbiotic and saprophytic habitats, together with previously isolated luminous and non-luminous strains identified as P. phosphoreum from these and other habitats. Parsimony analysis unambiguously resolved three evolutionarily distinct clades, phosphoreum, iliopiscarium and kishitanii. The tight phylogenetic clustering within these clades and the distinct separation between them indicates they are different species, P. phosphoreum, Photobacterium iliopiscarium and the newly recognized 'Photobacterium kishitanii'. Previously reported non-luminous strains, which had been identified phenotypically as P. phosphoreum, resolved unambiguously as P. iliopiscarium, and all examined deep-sea fishes (specimens of families Chlorophthalmidae, Macrouridae, Moridae, Trachichthyidae and Acropomatidae) were found to harbour 'P. kishitanii', not P. phosphoreum, in their light organs. This resolution revealed also that 'P. kishitanii' is cosmopolitan in its geographic distribution. Furthermore, the lack of phylogenetic variation within 'P. kishitanii' indicates that this facultatively

  20. Phylogeny-dominant classification of J-proteins in Arabidopsis thaliana and Brassica oleracea.

    Science.gov (United States)

    Zhang, Bin; Qiu, Han-Lin; Qu, Dong-Hai; Ruan, Ying; Chen, Dong-Hong

    2018-04-05

    Hsp40s or DnaJ/J-proteins are evolutionarily conserved in all organisms as co-chaperones of molecular chaperone HSP70s that mainly participate in maintaining cellular protein homeostasis, such as protein folding, assembly, stabilization, and translocation under normal conditions as well as refolding and degradation under environmental stresses. It has been reported that Arabidopsis J-proteins are classified into four classes (types A-D) according to domain organization, but their phylogenetic relationships are unknown. Here, we identified 129 J-proteins in the world-wide popular vegetable Brassica oleracea, a close relative of the model plant Arabidopsis, and also revised the information of Arabidopsis J-proteins based on the latest online bioresources. According to phylogenetic analysis with domain organization and gene structure as references, the J-proteins from Arabidopsis and B. oleracea were classified into 15 main clades (I-XV) separated by a number of undefined small branches with remote relationship. Based on the number of members, they respectively belong to multigene clades, oligo-gene clades, and mono-gene clades. The J-protein genes from different clades may function together or separately to constitute a complicated regulatory network. This study provides a constructive viewpoint for J-protein classification and an informative platform for further functional dissection and resistant genes discovery related to genetic improvement of crop plants.

  1. Prevalence, Associated Risk Factors, and Phylogenetic Analysis of Toxocara vitulorum Infection in Yaks on the Qinghai Tibetan Plateau, China.

    Science.gov (United States)

    Li, Kun; Lan, Yanfang; Luo, Houqiang; Zhang, Hui; Liu, Dongyu; Zhang, Lihong; Gui, Rui; Wang, Lei; Shahzad, Muhammad; Sizhu, Suolang; Li, Jiakui; Chamba, Yangzom

    2016-10-01

    Toxocara vitulorum has been rarely reported in yaks at high altitudes and remote areas of Sichuan Province of Tibetan Plateau of China. The current study was designed to investigate the prevalence, associated risk factors, and phylogenetic characteristics of T. vitulorum in yak calves on the Qinghai Tibetan plateau. Fecal samples were collected from 891 yak calves and were examined for the presence of T. vitulorum eggs by the McMaster technique. A multivariable logistic regression model was employed to explore variables potentially associated with exposure to T. vitulorum infection. T. vitulorum specimens were collected from the feces of yaks in Hongyuan of Sichuan Province, China. DNA was extracted from ascaris. After PCR amplification, the sequencing of ND1 gene was carried out and phylogenetic analyses was performed by MEGA 6.0 software. The results showed that 64 (20.1%; 95% CI 15.8-24.9%), 75 (17.2; 13.8-21.1), 29 (40.9; 29.3-53.2), and 5 (7.6; 2.5-16.8) yak calves were detected out to excrete T. vitulorum eggs in yak calve feces in Qinghai, Tibet, Sichuan, and Gansu, respectively. The present study revealed that high infection and mortality by T. vitulorum is wildly spread on the Qinghai Tibetan plateau, China by fecal examination. Geographical origin, ages, and fecal consistencies are the risk factors associated with T. vitulorum prevalence by logistic regression analysis. Molecular detection and phylogenetic analysis of ND1 gene of T. vitulorum indicated that T. vitulorum in the yak calves on the Qinghai Tibetan plateau are homologous to preveiously studies reported.

  2. SICLE: a high-throughput tool for extracting evolutionary relationships from phylogenetic trees.

    Science.gov (United States)

    DeBlasio, Dan F; Wisecaver, Jennifer H

    2016-01-01

    We present the phylogeny analysis software SICLE (Sister Clade Extractor), an easy-to-use, high-throughput tool to describe the nearest neighbors to a node of interest in a phylogenetic tree as well as the support value for the relationship. The application is a command line utility that can be embedded into a phylogenetic analysis pipeline or can be used as a subroutine within another C++ program. As a test case, we applied this new tool to the published phylome of Salinibacter ruber, a species of halophilic Bacteriodetes, identifying 13 unique sister relationships to S. ruber across the 4,589 gene phylogenies. S. ruber grouped with bacteria, most often other Bacteriodetes, in the majority of phylogenies, but 91 phylogenies showed a branch-supported sister association between S. ruber and Archaea, an evolutionarily intriguing relationship indicative of horizontal gene transfer. This test case demonstrates how SICLE makes it possible to summarize the phylogenetic information produced by automated phylogenetic pipelines to rapidly identify and quantify the possible evolutionary relationships that merit further investigation. SICLE is available for free for noncommercial use at http://eebweb.arizona.edu/sicle/.

  3. SICLE: a high-throughput tool for extracting evolutionary relationships from phylogenetic trees

    Directory of Open Access Journals (Sweden)

    Dan F. DeBlasio

    2016-08-01

    Full Text Available We present the phylogeny analysis software SICLE (Sister Clade Extractor, an easy-to-use, high-throughput tool to describe the nearest neighbors to a node of interest in a phylogenetic tree as well as the support value for the relationship. The application is a command line utility that can be embedded into a phylogenetic analysis pipeline or can be used as a subroutine within another C++ program. As a test case, we applied this new tool to the published phylome of Salinibacter ruber, a species of halophilic Bacteriodetes, identifying 13 unique sister relationships to S. ruber across the 4,589 gene phylogenies. S. ruber grouped with bacteria, most often other Bacteriodetes, in the majority of phylogenies, but 91 phylogenies showed a branch-supported sister association between S. ruber and Archaea, an evolutionarily intriguing relationship indicative of horizontal gene transfer. This test case demonstrates how SICLE makes it possible to summarize the phylogenetic information produced by automated phylogenetic pipelines to rapidly identify and quantify the possible evolutionary relationships that merit further investigation. SICLE is available for free for noncommercial use at http://eebweb.arizona.edu/sicle/.

  4. Phylogenetic analysis of some Newcastle disease virus isolates from the Sudan

    Directory of Open Access Journals (Sweden)

    N.A. Elmardi

    2016-06-01

    Full Text Available A reverse transcription-polymerase chain reaction (RT-PCR was used to amplify 1412 bp of the fusion protein gene (F gene of four Newcastle disease virus (NDV isolates; two velogenic (TY-1/90 and DIK-90 and two lentogenic isolates (Dongla 88/1 and GD.S.1. Following sequencing, nucleotide sequences were annotated and 894 bp were compared phylogenetically with those from strains previously reported in the Sudan and the virus strains published on the GenBank. It could be demonstrated that TY-1/90 and DIK-90 strains belong to the genotype VI of NDV and are in close genetic relationship to sub- genotype VIb. TY-1/90 and DIK-90 strains were observed to be genetically unrelated to the earlier Sudanese isolates of 1970/80s and the late of 2000s suggesting a different origin. The close genetic relationship to the European and African pigeon paramyxovirus type 1 (PPMV-1 suggests a common ancestor. Dongola, GD.S.1 strains were classified into genotype II that comprises non-pathogenic lentogenic NDV strains. The present genetic classification of NDV isolates of the Sudan provides valuable information on genotypes of NDV. Further molecular epidemiological investigations of the recent outbreaks of Newcastle disease in the Sudan are needed in order to improve the efficiency of control strategies and vaccine development.

  5. Fusion of the subunits α and β of succinyl-CoA synthetase as a phylogenetic marker for Pezizomycotina fungi

    Directory of Open Access Journals (Sweden)

    Amanda M. Koire

    2011-01-01

    Full Text Available Gene fusions, yielding the formation of multidomain proteins, are evolutionary events that can be utilized as phylogenetic markers. Here we describe a fusion gene comprising the α and β subunits of succinyl-coA synthetase, an enzyme of the TCA cycle, in Pezizomycotina fungi. This fusion is present in all Pezizomycotina with complete genome sequences and absent from all other organisms. Phylogenetic analysis of the α and β subunits of succinyl-CoA synthetase suggests that both subunits were duplicated and retained in Pezizomycotina while one copy was lost from other fungi. One of the duplicated copies was then fused in Pezizomycotina. Our results suggest that the fusion of the α and β subunits of succinyl-CoA synthetase can be used as a molecular marker for membership in the Pezizomycotina subphylum. If a species has the fusion it can be reliably classified as Pezizomycotina, while the absence of the fusion is suggestive that the species is not a member of this subphylum.

  6. Molecular phylogenetics of porcini mushrooms (Boletus section Boletus).

    Science.gov (United States)

    Dentinger, Bryn T M; Ammirati, Joseph F; Both, Ernst E; Desjardin, Dennis E; Halling, Roy E; Henkel, Terry W; Moreau, Pierre-Arthur; Nagasawa, Eiji; Soytong, Kasem; Taylor, Andy F; Watling, Roy; Moncalvo, Jean-Marc; McLaughlin, David J

    2010-12-01

    Porcini (Boletus section Boletus: Boletaceae: Boletineae: Boletales) are a conspicuous group of wild, edible mushrooms characterized by fleshy fruiting bodies with a poroid hymenophore that is "stuffed" with white hyphae when young. Their reported distribution is with ectomycorrhizal plants throughout the Northern Hemisphere. Little progress has been made on the systematics of this group using modern molecular phylogenetic tools because sampling has been limited primarily to European species and the genes employed were insufficient to resolve the phylogeny. We examined the evolutionary history of porcini by using a global geographic sampling of most known species, new discoveries from little explored areas, and multiple genes. We used 78 sequences from the fast-evolving nuclear internal transcribed spacers and are able to recognize 18 reciprocally monophyletic species. To address whether or not porcini form a monophyletic group, we compiled a broadly sampled dataset of 41 taxa, including other members of the Boletineae, and used separate and combined phylogenetic analysis of sequences from the nuclear large subunit ribosomal DNA, the largest subunit of RNA polymerase II, and the mitochondrial ATPase subunit six gene. Contrary to previous studies, our separate and combined phylogenetic analyses support the monophyly of porcini. We also report the discovery of two taxa that expand the known distribution of porcini to Australia and Thailand and have ancient phylogenetic connections to the rest of the group. A relaxed molecular clock analysis with these new taxa dates the origin of porcini to between 42 and 54 million years ago, coinciding with the initial diversification of angiosperms, during the Eocene epoch when the climate was warm and humid. These results reveal an unexpected diversity, distribution, and ancient origin of a group of commercially valuable mushrooms that may provide an economic incentive for conservation and support the hypothesis of a tropical

  7. Odorant binding proteins of the red imported fire ant, Solenopsis invicta: an example of the problems facing the analysis of widely divergent proteins.

    Directory of Open Access Journals (Sweden)

    Dietrich Gotzek

    Full Text Available We describe the odorant binding proteins (OBPs of the red imported fire ant, Solenopsis invicta, obtained from analyses of an EST library and separate 454 sequencing runs of two normalized cDNA libraries. We identified a total of 18 putative functional OBPs in this ant. A third of the fire ant OBPs are orthologs to honey bee OBPs. Another third of the OBPs belong to a lineage-specific expansion, which is a common feature of insect OBP evolution. Like other OBPs, the different fire ant OBPs share little sequence similarity (∼ 20%, rendering evolutionary analyses difficult. We discuss the resulting problems with sequence alignment, phylogenetic analysis, and tests of selection. As previously suggested, our results underscore the importance for careful exploration of the sensitivity to the effects of alignment methods for data comprising widely divergent sequences.

  8. Characterization of the complete mitochondrial genome of Acanthoscelides obtectus (Coleoptera: Chrysomelidae: Bruchinae) with phylogenetic analysis.

    Science.gov (United States)

    Yao, Jie; Yang, Hong; Dai, Renhuai

    2017-10-01

    Acanthoscelides obtectus is a common species of the subfamily Bruchinae and a worldwide-distributed seed-feeding beetle. The complete mitochondrial genome of A. obtectus is 16,130 bp in length with an A + T content of 76.4%. It contains a positive AT skew and a negative GC skew. The mitogenome of A. obtectus contains 13 protein-coding genes (PCGs), 22 tRNA genes, two rRNA genes and a non-coding region (D-loop). All PCGs start with an ATN codon, and seven (ND3, ATP6, COIII, ND3, ND4L, ND6, and Cytb) of them terminate with TAA, while the remaining five (COI, COII, ND1, ND4, and ND5) terminate with a single T, ATP8 terminates with TGA. Except tRNA Ser , the secondary structures of 21 tRNAs that can be folded into a typical clover-leaf structure were identified. The secondary structures of lrRNA and srRNA were also predicted in this study. There are six domains with 48 helices in lrRNA and three domains with 32 helices in srRNA. The control region of A. obtectus is 1354 bp in size with the highest A + T content (83.5%) in a mitochondrial gene. Thirteen PCGs in 19 species have been used to infer their phylogenetic relationships. Our results show that A. obtectus belongs to the family Chrysomelidae (subfamily-Bruchinae). This is the first study on phylogenetic analyses involving the mitochondrial genes of A. obtectus and could provide basic data for future studies of mitochondrial genome diversities and the evolution of related insect lineages.

  9. Phylogenetic relationships of Chaetomium isolates based on the ...

    African Journals Online (AJOL)

    Biotech Unit

    2013-02-27

    Feb 27, 2013 ... Phylogenetic analysis of Chaetomium species. The evolutionary history was inferred using the maximum parsimony method. The bootstrap consensus tree inferred from. 1000 replicates is taken to represent the evolutionary history of the taxa analyzed (Felsenstein, 1985). The MP tree was obtained using.

  10. SigTree: A Microbial Community Analysis Tool to Identify and Visualize Significantly Responsive Branches in a Phylogenetic Tree

    OpenAIRE

    Stevens, John R.; Jones, Todd R.; Lefevre, Michael; Ganesan, Balasubramanian; Weimer, Bart C.

    2017-01-01

    Microbial community analysis experiments to assess the effect of a treatment intervention (or environmental change) on the relative abundance levels of multiple related microbial species (or operational taxonomic units) simultaneously using high throughput genomics are becoming increasingly common. Within the framework of the evolutionary phylogeny of all species considered in the experiment, this translates to a statistical need to identify the phylogenetic branches that exhibit a significan...

  11. The phylogenetic relationships among infraorders and superfamilies of Diptera based on morphological evidence

    DEFF Research Database (Denmark)

    Lambkin, Christine L.; Sinclair, Bradley J.; Pape, Thomas

    2013-01-01

    Members of the megadiverse insect order Diptera (flies) have successfully colonized all continents and nearly all habitats. There are more than 154 000 described fly species, representing 1012% of animal species. Elucidating the phylogenetic relationships of such a large component of global...... biodiversity is challenging, but significant advances have been made in the last few decades. Since Hennig first discussed the monophyly of major groupings, Diptera has attracted much study, but most researchers have used non-numerical qualitative methods to assess morphological data. More recently......, quantitative phylogenetic methods have been used on both morphological and molecular data. All previous quantitative morphological studies addressed narrower phylogenetic problems, often below the suborder or infraorder level. Here we present the first numerical analysis of phylogenetic relationships...

  12. On Tree-Based Phylogenetic Networks.

    Science.gov (United States)

    Zhang, Louxin

    2016-07-01

    A large class of phylogenetic networks can be obtained from trees by the addition of horizontal edges between the tree edges. These networks are called tree-based networks. We present a simple necessary and sufficient condition for tree-based networks and prove that a universal tree-based network exists for any number of taxa that contains as its base every phylogenetic tree on the same set of taxa. This answers two problems posted by Francis and Steel recently. A byproduct is a computer program for generating random binary phylogenetic networks under the uniform distribution model.

  13. Genetic and phylogenetic analysis of multi-continent human influenza A(H1N2) reassortant viruses isolated in 2001 through 2003.

    Science.gov (United States)

    Chen, M-J; La, T; Zhao, P; Tam, J S; Rappaport, R; Cheng, S-M

    2006-12-01

    Genetic analyses were performed on 228 influenza A(H1) viruses derived from clinical subjects participating in an experimental vaccine trial conducted in 20 countries on four continents between 2001 and 2003. HA1 phylogenetic analysis of these viruses showed multiple clades circulated around the world with regional prevalence patterns. Sixty-five of the A(H1) viruses were identified as A(H1N2), 40 of which were isolated from South Africa. The A(H1) sequences of these viruses cluster with published H1N2 viruses phylogenetically and share with them diagnostic signature V169A and A193T changes. The results also showed for the first time that H1N2 viruses were prominent in South Africa during the 2001-2002 influenza season, accounting for over 90% of the A(H1) cases in our study, and infecting both children (29/31) and the elderly (11/13). Phylogenetic analysis of the 65 H1N2 viruses we identified, in conjunction with the 56 recent H1N2 viruses currently available in the database, provided a comprehensive view of the circulation and evolution of distinct clades of H1N2 viruses in a temporal manner between early 2001 and mid-2003, shortly after the appearance of these recent reassortant viruses in or near year 2000.

  14. MPBoot: fast phylogenetic maximum parsimony tree inference and bootstrap approximation.

    Science.gov (United States)

    Hoang, Diep Thi; Vinh, Le Sy; Flouri, Tomáš; Stamatakis, Alexandros; von Haeseler, Arndt; Minh, Bui Quang

    2018-02-02

    The nonparametric bootstrap is widely used to measure the branch support of phylogenetic trees. However, bootstrapping is computationally expensive and remains a bottleneck in phylogenetic analyses. Recently, an ultrafast bootstrap approximation (UFBoot) approach was proposed for maximum likelihood analyses. However, such an approach is still missing for maximum parsimony. To close this gap we present MPBoot, an adaptation and extension of UFBoot to compute branch supports under the maximum parsimony principle. MPBoot works for both uniform and non-uniform cost matrices. Our analyses on biological DNA and protein showed that under uniform cost matrices, MPBoot runs on average 4.7 (DNA) to 7 times (protein data) (range: 1.2-20.7) faster than the standard parsimony bootstrap implemented in PAUP*; but 1.6 (DNA) to 4.1 times (protein data) slower than the standard bootstrap with a fast search routine in TNT (fast-TNT). However, for non-uniform cost matrices MPBoot is 5 (DNA) to 13 times (protein data) (range:0.3-63.9) faster than fast-TNT. We note that MPBoot achieves better scores more frequently than PAUP* and fast-TNT. However, this effect is less pronounced if an intensive but slower search in TNT is invoked. Moreover, experiments on large-scale simulated data show that while both PAUP* and TNT bootstrap estimates are too conservative, MPBoot bootstrap estimates appear more unbiased. MPBoot provides an efficient alternative to the standard maximum parsimony bootstrap procedure. It shows favorable performance in terms of run time, the capability of finding a maximum parsimony tree, and high bootstrap accuracy on simulated as well as empirical data sets. MPBoot is easy-to-use, open-source and available at http://www.cibiv.at/software/mpboot .

  15. Molecular phylogenetics and historical biogeography of Rhinolophus bats.

    Science.gov (United States)

    Stoffberg, Samantha; Jacobs, David S; Mackie, Iain J; Matthee, Conrad A

    2010-01-01

    The phylogenetic relationships within the horseshoe bats (genus Rhinolophus) are poorly resolved, particularly at deeper levels within the tree. We present a better-resolved phylogenetic hypothesis for 30 rhinolophid species based on parsimony and Bayesian analyses of the mitochondrial cytochrome b gene and three nuclear introns (TG, THY and PRKC1). Strong support was found for the existence of two geographic clades within the monophyletic Rhinolophidae: an African group and an Oriental assemblage. The relaxed Bayesian clock method indicated that the two rhinolophid clades diverged approximately 35 million years ago and results from Dispersal Vicariance (DIVA) analysis suggest that the horseshoe bats arose in Asia and subsequently dispersed into Europe and Africa.

  16. Phylogenetic Analysis of Shewanella Strains by DNA Relatedness Derived from Whole Genome Microarray DNA-DNA Hybridization and Comparisons with Other Methods

    International Nuclear Information System (INIS)

    Wu, Liyou; Yi, T.Y.; Van Nostrand, Joy; Zhou, Jizhong

    2010-01-01

    Phylogenetic analyses were done for the Shewanella strains isolated from Baltic Sea (38 strains), US DOE Hanford Uranium bioremediation site (Hanford Reach of the Columbia River (HRCR), 11 strains), Pacific Ocean and Hawaiian sediments (8 strains), and strains from other resources (16 strains) with three out group strains, Rhodopseudomonas palustris, Clostridium cellulolyticum, and Thermoanaerobacter ethanolicus X514, using DNA relatedness derived from WCGA-based DNA-DNA hybridizations, sequence similarities of 16S rRNA gene and gyrB gene, and sequence similarities of 6 loci of Shewanella genome selected from a shared gene list of the Shewanella strains with whole genome sequenced based on the average nucleotide identity of them (ANI). The phylogenetic trees based on 16S rRNA and gyrB gene sequences, and DNA relatedness derived from WCGA hybridizations of the tested Shewanella strains share exactly the same sub-clusters with very few exceptions, in which the strains were basically grouped by species. However, the phylogenetic analysis based on DNA relatedness derived from WCGA hybridizations dramatically increased the differentiation resolution at species and strains level within Shewanella genus. When the tree based on DNA relatedness derived from WCGA hybridizations was compared to the tree based on the combined sequences of the selected functional genes (6 loci), we found that the resolutions of both methods are similar, but the clustering of the tree based on DNA relatedness derived from WMGA hybridizations was clearer. These results indicate that WCGA-based DNA-DNA hybridization is an idea alternative of conventional DNA-DNA hybridization methods and it is superior to the phylogenetics methods based on sequence similarities of single genes. Detailed analysis is being performed for the re-classification of the strains examined.

  17. Phylogenetic Analysis of Shewanella Strains by DNA Relatedness Derived from Whole Genome Microarray DNA-DNA Hybridization and Comparison with Other Methods

    Energy Technology Data Exchange (ETDEWEB)

    Wu, Liyou; Yi, T. Y.; Van Nostrand, Joy; Zhou, Jizhong

    2010-05-17

    Phylogenetic analyses were done for the Shewanella strains isolated from Baltic Sea (38 strains), US DOE Hanford Uranium bioremediation site [Hanford Reach of the Columbia River (HRCR), 11 strains], Pacific Ocean and Hawaiian sediments (8 strains), and strains from other resources (16 strains) with three out group strains, Rhodopseudomonas palustris, Clostridium cellulolyticum, and Thermoanaerobacter ethanolicus X514, using DNA relatedness derived from WCGA-based DNA-DNA hybridizations, sequence similarities of 16S rRNA gene and gyrB gene, and sequence similarities of 6 loci of Shewanella genome selected from a shared gene list of the Shewanella strains with whole genome sequenced based on the average nucleotide identity of them (ANI). The phylogenetic trees based on 16S rRNA and gyrB gene sequences, and DNA relatedness derived from WCGA hybridizations of the tested Shewanella strains share exactly the same sub-clusters with very few exceptions, in which the strains were basically grouped by species. However, the phylogenetic analysis based on DNA relatedness derived from WCGA hybridizations dramatically increased the differentiation resolution at species and strains level within Shewanella genus. When the tree based on DNA relatedness derived from WCGA hybridizations was compared to the tree based on the combined sequences of the selected functional genes (6 loci), we found that the resolutions of both methods are similar, but the clustering of the tree based on DNA relatedness derived from WMGA hybridizations was clearer. These results indicate that WCGA-based DNA-DNA hybridization is an idea alternative of conventional DNA-DNA hybridization methods and it is superior to the phylogenetics methods based on sequence similarities of single genes. Detailed analysis is being performed for the re-classification of the strains examined.

  18. An analysis pipeline for the inference of protein-protein interaction networks

    Energy Technology Data Exchange (ETDEWEB)

    Taylor, Ronald C.; Singhal, Mudita; Daly, Don S.; Gilmore, Jason M.; Cannon, William R.; Domico, Kelly O.; White, Amanda M.; Auberry, Deanna L.; Auberry, Kenneth J.; Hooker, Brian S.; Hurst, G. B.; McDermott, Jason E.; McDonald, W. H.; Pelletier, Dale A.; Schmoyer, Denise A.; Wiley, H. S.

    2009-12-01

    An analysis pipeline has been created for deployment of a novel algorithm, the Bayesian Estimator of Protein-Protein Association Probabilities (BEPro), for use in the reconstruction of protein-protein interaction networks. We have combined the Software Environment for BIological Network Inference (SEBINI), an interactive environment for the deployment and testing of network inference algorithms that use high-throughput data, and the Collective Analysis of Biological Interaction Networks (CABIN), software that allows integration and analysis of protein-protein interaction and gene-to-gene regulatory evidence obtained from multiple sources, to allow interactions computed by BEPro to be stored, visualized, and further analyzed. Incorporating BEPro into SEBINI and automatically feeding the resulting inferred network into CABIN, we have created a structured workflow for protein-protein network inference and supplemental analysis from sets of mass spectrometry bait-prey experiment data. SEBINI demo site: https://www.emsl.pnl.gov /SEBINI/ Contact: ronald.taylor@pnl.gov. BEPro is available at http://www.pnl.gov/statistics/BEPro3/index.htm. Contact: ds.daly@pnl.gov. CABIN is available at http://www.sysbio.org/dataresources/cabin.stm. Contact: mudita.singhal@pnl.gov.

  19. Phylogenetic analysis of local-scale tree soil associations in a lowland moist tropical forest.

    Directory of Open Access Journals (Sweden)

    Laura A Schreeg

    Full Text Available BACKGROUND: Local plant-soil associations are commonly studied at the species-level, while associations at the level of nodes within a phylogeny have been less well explored. Understanding associations within a phylogenetic context, however, can improve our ability to make predictions across systems and can advance our understanding of the role of evolutionary history in structuring communities. METHODOLOGY/PRINCIPAL FINDINGS: Here we quantified evolutionary signal in plant-soil associations using a DNA sequence-based community phylogeny and several soil variables (e.g., extractable phosphorus, aluminum and manganese, pH, and slope as a proxy for soil water. We used published plant distributional data from the 50-ha plot on Barro Colorado Island (BCI, Republic of Panamá. Our results suggest some groups of closely related species do share similar soil associations. Most notably, the node shared by Myrtaceae and Vochysiaceae was associated with high levels of aluminum, a potentially toxic element. The node shared by Apocynaceae was associated with high extractable phosphorus, a nutrient that could be limiting on a taxon specific level. The node shared by the large group of Laurales and Magnoliales was associated with both low extractable phosphorus and with steeper slope. Despite significant node-specific associations, this study detected little to no phylogeny-wide signal. We consider the majority of the 'traits' (i.e., soil variables evaluated to fall within the category of ecological traits. We suggest that, given this category of traits, phylogeny-wide signal might not be expected while node-specific signals can still indicate phylogenetic structure with respect to the variable of interest. CONCLUSIONS: Within the BCI forest dynamics plot, distributions of some plant taxa are associated with local-scale differences in soil variables when evaluated at individual nodes within the phylogenetic tree, but they are not detectable by phylogeny

  20. Phenotypic Microdiversity and Phylogenetic Signal Analysis of Traits Related to Social Interaction in Bacillus spp. from Sediment Communities.

    Science.gov (United States)

    Rodríguez-Torres, María Dolores; Islas-Robles, África; Gómez-Lunar, Zulema; Delaye, Luis; Hernández-González, Ismael; Souza, Valeria; Travisano, Michael; Olmedo-Álvarez, Gabriela

    2017-01-01

    Understanding the relationship between phylogeny and predicted traits is important to uncover the dimension of the predictive power of a microbial composition approach. Numerous works have addressed the taxonomic composition of bacteria in communities, but little is known about trait heterogeneity in closely related bacteria that co-occur in communities. We evaluated a sample of 467 isolates from the Churince water system of the Cuatro Cienegas Basin (CCB), enriched for Bacillus spp. The 16S rRNA gene revealed a random distribution of taxonomic groups within this genus among 11 sampling sites. A subsample of 141 Bacillus spp. isolates from sediment, with seven well-represented species was chosen to evaluate the heterogeneity and the phylogenetic signal of phenotypic traits that are known to diverge within small clades, such as substrate utilization, and traits that are conserved deep in the lineage, such as prototrophy, swarming and biofilm formation. We were especially interested in evaluating social traits, such as swarming and biofilm formation, for which cooperation is needed to accomplish a multicellular behavior and for which there is little information from natural communities. The phylogenetic distribution of traits, evaluated by the Purvis and Fritz's D statistics approached a Brownian model of evolution. Analysis of the phylogenetic relatedness of the clusters of members sharing the trait using consenTRAIT algorithm, revealed more clustering and deeper phylogenetic signal for prototrophy, biofilm and swimming compared to the data obtained for substrate utilization. The explanation to the observed Brownian evolution of social traits could be either loss due to complete dispensability or to compensated trait loss due to the availability of public goods. Since many of the evaluated traits can be considered to be collective action traits, such as swarming, motility and biofilm formation, the observed microdiversity within taxonomic groups might be explained

  1. phylo-node: A molecular phylogenetic toolkit using Node.js.

    Science.gov (United States)

    O'Halloran, Damien M

    2017-01-01

    Node.js is an open-source and cross-platform environment that provides a JavaScript codebase for back-end server-side applications. JavaScript has been used to develop very fast and user-friendly front-end tools for bioinformatic and phylogenetic analyses. However, no such toolkits are available using Node.js to conduct comprehensive molecular phylogenetic analysis. To address this problem, I have developed, phylo-node, which was developed using Node.js and provides a stable and scalable toolkit that allows the user to perform diverse molecular and phylogenetic tasks. phylo-node can execute the analysis and process the resulting outputs from a suite of software options that provides tools for read processing and genome alignment, sequence retrieval, multiple sequence alignment, primer design, evolutionary modeling, and phylogeny reconstruction. Furthermore, phylo-node enables the user to deploy server dependent applications, and also provides simple integration and interoperation with other Node modules and languages using Node inheritance patterns, and a customized piping module to support the production of diverse pipelines. phylo-node is open-source and freely available to all users without sign-up or login requirements. All source code and user guidelines are openly available at the GitHub repository: https://github.com/dohalloran/phylo-node.

  2. Nonbinary Tree-Based Phylogenetic Networks

    NARCIS (Netherlands)

    Jetten, L.; van Iersel, L.J.J.

    2018-01-01

    Rooted phylogenetic networks are used to describe evolutionary histories that contain non-treelike evolutionary events such as hybridization and horizontal gene transfer. In some cases, such histories can be described by a phylogenetic base-tree with additional linking arcs, which can for example

  3. Characterization of the complete mitochondrial genome of Marshallagia marshalli and phylogenetic implications for the superfamily Trichostrongyloidea.

    Science.gov (United States)

    Sun, Miao-Miao; Han, Liang; Zhang, Fu-Kai; Zhou, Dong-Hui; Wang, Shu-Qing; Ma, Jun; Zhu, Xing-Quan; Liu, Guo-Hua

    2018-01-01

    Marshallagia marshalli (Nematoda: Trichostrongylidae) infection can lead to serious parasitic gastroenteritis in sheep, goat, and wild ruminant, causing significant socioeconomic losses worldwide. Up to now, the study concerning the molecular biology of M. marshalli is limited. Herein, we sequenced the complete mitochondrial (mt) genome of M. marshalli and examined its phylogenetic relationship with selected members of the superfamily Trichostrongyloidea using Bayesian inference (BI) based on concatenated mt amino acid sequence datasets. The complete mt genome sequence of M. marshalli is 13,891 bp, including 12 protein-coding genes, 22 transfer RNA genes, and 2 ribosomal RNA genes. All protein-coding genes are transcribed in the same direction. Phylogenetic analyses based on concatenated amino acid sequences of the 12 protein-coding genes supported the monophylies of the families Haemonchidae, Molineidae, and Dictyocaulidae with strong statistical support, but rejected the monophyly of the family Trichostrongylidae. The determination of the complete mt genome sequence of M. marshalli provides novel genetic markers for studying the systematics, population genetics, and molecular epidemiology of M. marshalli and its congeners.

  4. Phylogenetic Analysis and Prevalence of Human Papillomavirus (HPV in Women with Several Cervical Pathologies

    Directory of Open Access Journals (Sweden)

    Gülçin Alp Avcı

    2013-09-01

    Full Text Available Objective: To determinate the prevalence of HPV types in patients with cervical cancers in our legion by Real time PCR and DNA sequence analysis and to make phylogenetic analysis was aimed in this study. Material and methods: From January to October 2010, cervical swap samples of 77 patients directed to colposcopy were included in the study. HPV DNA and HPV type 16 were detected by Real Time polymerase chain reaction using the L1 region. Real Time PCR amplifications of MY09/11 products were done by GP5+/GP6+ primers and Cyanine-5 labeled HPV DNA and HPV type 16 specific probe. HPV types determinate by GP5+/GP6+. Phylogenetic analysis of sequences was calculated by Kimura’s two parameters method. Statistically analyses were by using Pearson chi-square and odss ratio tests. Results: Forty seven samples (prevalence; 61% of total seventy seven cervical samples detected as HPV DNA positive. While HPV type 16; 52%, HPV type 16+11; 4%, HPV type 16+6; 1% and non-typing HPV DNA 4% of seventy seven samples determining, 39% of samples observed as negative HPV. Participated in the study population, HPV DNA positive individuals are among 34-56 years. Most HPV DNA positivity rate of 80.0% was between the ages of 31-40. 52.2% of HPV DNA positivity between the ages of 41-50 to fall, but again, 83.3% between the ages of 51-60 to a second peak was determined that increased. 60.0% of 20 ASC-H cases, 63.8% of 36 ASC-US cases, 100% 9 of HSIL cases and 25.0% of 12 LSIL cases were positive for HPV DNA. Conclusion: The investigation of the distribution of HPV genotypes in women with cervical cancer and precancerous lesions in our region is important. Early diagnosis of HPV by using improved technological assays, play a key role to prevent the turn precancerous lesions into invasive cancers.

  5. Phylogenetic analysis of Monascus and new species from honey, pollen and nests of stingless bees

    DEFF Research Database (Denmark)

    Barbosa, R. N.; Leong, Su-lin L.; Vinnere-Pettersson, O.

    2017-01-01

    on this polyphasic approach, the genus Monascus is resolved in nine species, including three new species associated with stingless bees (M. flavipigmentosus sp. nov., M. mellicola sp. nov., M. recifensis sp. nov., M. argentinensis, M. floridanus, M. lunisporas, M. pallens, M. purpureus, M. ruber), and split in two...... new sections (section Floridani sect. nov., section Rubri sect. nov.). Phylogenetic analysis showed that the xerophile Monascus eremophilus does not belong in Monascus and monophyly in Monascus is restored with the transfer of M. eremophilus to Penicillium (P. eremophilum comb. nov.). A list...

  6. Fast phylogenetic DNA barcoding

    DEFF Research Database (Denmark)

    Terkelsen, Kasper Munch; Boomsma, Wouter Krogh; Willerslev, Eske

    2008-01-01

    We present a heuristic approach to the DNA assignment problem based on phylogenetic inferences using constrained neighbour joining and non-parametric bootstrapping. We show that this method performs as well as the more computationally intensive full Bayesian approach in an analysis of 500 insect...... DNA sequences obtained from GenBank. We also analyse a previously published dataset of environmental DNA sequences from soil from New Zealand and Siberia, and use these data to illustrate the fact that statistical approaches to the DNA assignment problem allow for more appropriate criteria...... for determining the taxonomic level at which a particular DNA sequence can be assigned....

  7. Identification, characterization and phylogenetic analysis of antifungal Trichoderma from tomato rhizosphere.

    Science.gov (United States)

    Rai, Shalini; Kashyap, Prem Lal; Kumar, Sudheer; Srivastava, Alok Kumar; Ramteke, Pramod W

    2016-01-01

    The use of Trichoderma isolates with efficient antagonistic activity represents a potentially effective and alternative disease management strategy to replace health hazardous chemical control. In this context, twenty isolates were obtained from tomato rhizosphere and evaluated by their antagonistic activity against four fungal pathogens ( Fusarium oxysporum f. sp. lycopersici , Alternaria alternata , Colletotrichum gloeosporoides and Rhizoctonia solani ). The production of extracellular cell wall degrading enzymes of tested isolates was also measured. All the isolates significantly reduced the mycelial growth of tested pathogens but the amount of growth reduction varied significantly as well. There was a positive correlation between the antagonistic capacity of Trichoderma isolates towards fungal pathogens and their lytic enzyme production. The Trichoderma isolates were initially sorted according to morphology and based on the translation elongation factor 1-α gene sequence similarity, the isolates were designated as Trichoderma harzianum , T. koningii , T. asperellum , T. virens and T. viride . PCA analysis explained 31.53, 61.95, 62.22 and 60.25% genetic variation among Trichoderma isolates based on RAPD, REP-, ERIC- and BOX element analysis, respectively. ERG - 1 gene, encoding a squalene epoxidase has been used for the first time for diversity analysis of antagonistic Trichoderma from tomato rhizosphere. Phylogenetic analysis of ERG -1 gene sequences revealed close relatedness of ERG -1sequences with earlier reported sequences of Hypocrea lixii , T. arundinaceum and T. reesei. However, ERG -1 gene also showed heterogeneity among some antagonistic isolates and indicated the possibility of occurrence of squalene epoxidase driven triterpene biosynthesis as an alternative biocontrol mechanism in Trichoderma species.

  8. Different relationships between temporal phylogenetic turnover and phylogenetic similarity and in two forests were detected by a new null model.

    Science.gov (United States)

    Huang, Jian-Xiong; Zhang, Jian; Shen, Yong; Lian, Ju-yu; Cao, Hong-lin; Ye, Wan-hui; Wu, Lin-fang; Bin, Yue

    2014-01-01

    Ecologists have been monitoring community dynamics with the purpose of understanding the rates and causes of community change. However, there is a lack of monitoring of community dynamics from the perspective of phylogeny. We attempted to understand temporal phylogenetic turnover in a 50 ha tropical forest (Barro Colorado Island, BCI) and a 20 ha subtropical forest (Dinghushan in southern China, DHS). To obtain temporal phylogenetic turnover under random conditions, two null models were used. The first shuffled names of species that are widely used in community phylogenetic analyses. The second simulated demographic processes with careful consideration on the variation in dispersal ability among species and the variations in mortality both among species and among size classes. With the two models, we tested the relationships between temporal phylogenetic turnover and phylogenetic similarity at different spatial scales in the two forests. Results were more consistent with previous findings using the second null model suggesting that the second null model is more appropriate for our purposes. With the second null model, a significantly positive relationship was detected between phylogenetic turnover and phylogenetic similarity in BCI at a 10 m×10 m scale, potentially indicating phylogenetic density dependence. This relationship in DHS was significantly negative at three of five spatial scales. This could indicate abiotic filtering processes for community assembly. Using variation partitioning, we found phylogenetic similarity contributed to variation in temporal phylogenetic turnover in the DHS plot but not in BCI plot. The mechanisms for community assembly in BCI and DHS vary from phylogenetic perspective. Only the second null model detected this difference indicating the importance of choosing a proper null model.

  9. Phylogenetic reconstruction at the species and intraspecies levels in the genus Pisum (L.) (peas) using a histone H1 gene.

    Science.gov (United States)

    Zaytseva, Olga O; Bogdanova, Vera S; Kosterin, Oleg E

    2012-08-10

    A phylogenetic analysis of the genus Pisum (peas), embracing diverse wild and cultivated forms, which evoke problems with species delimitation, was carried out based on a gene coding for histone H1, a protein that has a long and variable functional C-terminal domain. Phylogenetic trees were reconstructed on the basis of the coding sequence of the gene His5 of H1 subtype 5 in 65 pea accessions. Early separation of a clear-cut wild species Pisum fulvum is well supported, while cultivated species Pisum abyssinicum appears as a small branch within Pisum sativum. Another robust branch within P. sativum includes some wild and almost all cultivated representatives of P. sativum. Other wild representatives form diverse but rather subtle branches. In a subset of accessions, PsbA-trnH chloroplast intergenic spacer was also analysed and found less informative than His5. A number of accessions of cultivated peas from remote regions have a His5 allele of identical sequence, encoding an electrophoretically slow protein product, which earlier attracted attention as likely positively selected in harsh climate conditions. In PsbA-trnH, a 8bp deletion was found, which marks cultivated representatives of P. sativum. Copyright © 2012 Elsevier B.V. All rights reserved.

  10. Phylogenetic analysis of partial RNA-polymerase blocks II and III of Rabies virus isolated from the main rabies reservoirs in Brazil.

    Science.gov (United States)

    Carnieli, Pedro; de Novaes Oliveira, Rafael; de Oliveira Fahl, Willian; de Carvalho Ruthner Batista, Helena Beatriz; Scheffer, Karin Corrêa; Iamamoto, Keila; Castilho, Juliana Galera

    2012-08-01

    This study describes the results of the sequencing and analysis of segments of Blocks II and III of the RNA polymerase L gene of Rabies virus isolates from different reservoir species of Brazil. The phylogenetic relations of the virus were determined and a variety of species-specific nucleotides were found in the analyzed areas, but the majority of these mutations were found to be synonymous. However, an analysis of the putative amino acid sequences were shown to have some characteristic mutations between some reservoir species of Brazil, indicating that there was positive selection in the RNA polymerase L gene of Rabies virus. On comparing the putative viral sequences obtained from the Brazilian isolates and other Lyssavirus, it was determined that amino acid mutations occurred in low-restriction areas. This study of the L gene of Rabies virus is the first to be conducted with samples of virus isolates from Brazil, and the results obtained will help in the determination of the phylogenetic relations of the virus.

  11. TCS: a web server for multiple sequence alignment evaluation and phylogenetic reconstruction.

    Science.gov (United States)

    Chang, Jia-Ming; Di Tommaso, Paolo; Lefort, Vincent; Gascuel, Olivier; Notredame, Cedric

    2015-07-01

    This article introduces the Transitive Consistency Score (TCS) web server; a service making it possible to estimate the local reliability of protein multiple sequence alignments (MSAs) using the TCS index. The evaluation can be used to identify the aligned positions most likely to contain structurally analogous residues and also most likely to support an accurate phylogenetic reconstruction. The TCS scoring scheme has been shown to be accurate predictor of structural alignment correctness among commonly used methods. It has also been shown to outperform common filtering schemes like Gblocks or trimAl when doing MSA post-processing prior to phylogenetic tree reconstruction. The web server is available from http://tcoffee.crg.cat/tcs. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. Undergraduate Students’ Difficulties in Reading and Constructing Phylogenetic Tree

    Science.gov (United States)

    Sa'adah, S.; Tapilouw, F. S.; Hidayat, T.

    2017-02-01

    Representation is a very important communication tool to communicate scientific concepts. Biologists produce phylogenetic representation to express their understanding of evolutionary relationships. The phylogenetic tree is visual representation depict a hypothesis about the evolutionary relationship and widely used in the biological sciences. Phylogenetic tree currently growing for many disciplines in biology. Consequently, learning about phylogenetic tree become an important part of biological education and an interesting area for biology education research. However, research showed many students often struggle with interpreting the information that phylogenetic trees depict. The purpose of this study was to investigate undergraduate students’ difficulties in reading and constructing a phylogenetic tree. The method of this study is a descriptive method. In this study, we used questionnaires, interviews, multiple choice and open-ended questions, reflective journals and observations. The findings showed students experiencing difficulties, especially in constructing a phylogenetic tree. The students’ responds indicated that main reasons for difficulties in constructing a phylogenetic tree are difficult to placing taxa in a phylogenetic tree based on the data provided so that the phylogenetic tree constructed does not describe the actual evolutionary relationship (incorrect relatedness). Students also have difficulties in determining the sister group, character synapomorphy, autapomorphy from data provided (character table) and comparing among phylogenetic tree. According to them building the phylogenetic tree is more difficult than reading the phylogenetic tree. Finding this studies provide information to undergraduate instructor and students to overcome learning difficulties of reading and constructing phylogenetic tree.

  13. Phylogenetic analysis of the MS4A and TMEM176 gene families.

    Directory of Open Access Journals (Sweden)

    Jonathan Zuccolo

    2010-02-01

    Full Text Available The MS4A gene family in humans includes CD20 (MS4A1, FcRbeta (MS4A2, Htm4 (MS4A3, and at least 13 other syntenic genes encoding membrane proteins, most having characteristic tetraspanning topology. Expression of MS4A genes is variable in tissues throughout the body; however, several are limited to cells in the hematopoietic system where they have known roles in immune cell functions. Genes in the small TMEM176 group share significant sequence similarity with MS4A genes and there is evidence of immune function of at least one of the encoded proteins. In this study, we examined the evolutionary history of the MS4A/TMEM176 families as well as tissue expression of the phylogenetically earliest members, in order to investigate their possible origins in immune cells.Orthologs of human MS4A genes were found only in mammals; however, MS4A gene homologs were found in most jawed vertebrates. TMEM176 genes were found only in mammals and bony fish. Several unusual MS4A genes having 2 or more tandem MS4A sequences were identified in the chicken (Gallus gallus and early mammals (opossum, Monodelphis domestica and platypus, Ornithorhyncus anatinus. A large number of highly conserved MS4A and TMEM176 genes was found in zebrafish (Danio rerio. The most primitive organism identified to have MS4A genes was spiny dogfish (Squalus acanthus. Tissue expression of MS4A genes in S. acanthias and D. rerio showed no evidence of expression restricted to the hematopoietic system.Our findings suggest that MS4A genes first appeared in cartilaginous fish with expression outside of the immune system, and have since diversified in many species into their modern forms with expression and function in both immune and nonimmune cells.

  14. Genetic variation and phylogenetic relationship analysis of Jatropha curcas L. inferred from nrDNA ITS sequences.

    Science.gov (United States)

    Guo, Guo-Ye; Chen, Fang; Shi, Xiao-Dong; Tian, Yin-Shuai; Yu, Mao-Qun; Han, Xue-Qin; Yuan, Li-Chun; Zhang, Ying

    2016-01-01

    Genetic variation and phylogenetic relationships among 102 Jatropha curcas accessions from Asia, Africa, and the Americas were assessed using the internal transcribed spacer region of nuclear ribosomal DNA (nrDNA ITS). The average G+C content (65.04%) was considerably higher than the A+T (34.96%) content. The estimated genetic diversity revealed moderate genetic variation. The pairwise genetic divergences (GD) between haplotypes were evaluated and ranged from 0.000 to 0.017, suggesting a higher level of genetic differentiation in Mexican accessions than those of other regions. Phylogenetic relationships and intraspecific divergence were inferred by Bayesian inference (BI), maximum parsimony (MP), and median joining (MJ) network analysis and were generally resolved. The J. curcas accessions were consistently divided into three lineages, groups A, B, and C, which demonstrated distant geographical isolation and genetic divergence between American accessions and those from other regions. The MJ network analysis confirmed that Central America was the possible center of origin. The putative migration route suggested that J. curcas was distributed from Mexico or Brazil, via Cape Verde and then split into two routes. One route was dispersed to Spain, then migrated to China, eventually spreading to southeastern Asia, while the other route was dispersed to Africa, via Madagascar and migrated to China, later spreading to southeastern Asia. Copyright © 2016 Académie des sciences. Published by Elsevier SAS. All rights reserved.

  15. Classification and evolutionary analysis of the basic helix-loop-helix gene family in the green anole lizard, Anolis carolinensis.

    Science.gov (United States)

    Liu, Ake; Wang, Yong; Zhang, Debao; Wang, Xuhua; Song, Huifang; Dang, Chunwang; Yao, Qin; Chen, Keping

    2013-08-01

    Helix-loop-helix (bHLH) proteins play essential regulatory roles in a variety of biological processes. These highly conserved proteins form a large transcription factor superfamily, and are commonly identified in large numbers within animal, plant, and fungal genomes. The bHLH domain has been well studied in many animal species, but has not yet been characterized in non-avian reptiles. In this study, we identified 102 putative bHLH genes in the genome of the green anole lizard, Anolis carolinensis. Based on phylogenetic analysis, these genes were classified into 43 families, with 43, 24, 16, 3, 10, and 3 members assigned into groups A, B, C, D, E, and F, respectively, and 3 members categorized as "orphans". Within-group evolutionary relationships inferred from the phylogenetic analysis were consistent with highly conserved patterns observed for introns and additional domains. Results from phylogenetic analysis of the H/E(spl) family suggest that genome and tandem gene duplications have contributed to this family's expansion. Our classification and evolutionary analysis has provided insights into the evolutionary diversification of animal bHLH genes, and should aid future studies on bHLH protein regulation of key growth and developmental processes.

  16. Phylogenetic structure in tropical hummingbird communities

    DEFF Research Database (Denmark)

    Graham, Catherine H; Parra, Juan L; Rahbek, Carsten

    2009-01-01

    How biotic interactions, current and historical environment, and biogeographic barriers determine community structure is a fundamental question in ecology and evolution, especially in diverse tropical regions. To evaluate patterns of local and regional diversity, we quantified the phylogenetic...... composition of 189 hummingbird communities in Ecuador. We assessed how species and phylogenetic composition changed along environmental gradients and across biogeographic barriers. We show that humid, low-elevation communities are phylogenetically overdispersed (coexistence of distant relatives), a pattern...... that is consistent with the idea that competition influences the local composition of hummingbirds. At higher elevations communities are phylogenetically clustered (coexistence of close relatives), consistent with the expectation of environmental filtering, which may result from the challenge of sustaining...

  17. Constructing phylogenetic trees using interacting pathways.

    Science.gov (United States)

    Wan, Peng; Che, Dongsheng

    2013-01-01

    Phylogenetic trees are used to represent evolutionary relationships among biological species or organisms. The construction of phylogenetic trees is based on the similarities or differences of their physical or genetic features. Traditional approaches of constructing phylogenetic trees mainly focus on physical features. The recent advancement of high-throughput technologies has led to accumulation of huge amounts of biological data, which in turn changed the way of biological studies in various aspects. In this paper, we report our approach of building phylogenetic trees using the information of interacting pathways. We have applied hierarchical clustering on two domains of organisms-eukaryotes and prokaryotes. Our preliminary results have shown the effectiveness of using the interacting pathways in revealing evolutionary relationships.

  18. Phylogenetic analysis of feline immunodeficiency virus strains from naturally infected cats in Belgium and The Netherlands.

    Science.gov (United States)

    Roukaerts, Inge D M; Theuns, Sebastiaan; Taffin, Elien R L; Daminet, Sylvie; Nauwynck, Hans J

    2015-01-22

    Feline immunodeficiency virus (FIV) is a major pathogen in feline populations worldwide, with seroprevalences up to 26%. Virus strains circulating in domestic cats are subdivided into different phylogenetic clades (A-E), based on the genetic diversity of the V3-V4 region of the env gene. In this report, a phylogenetic analysis of the V3-V4 env region, and a variable region in the gag gene was made for 36 FIV strains isolated in Belgium and The Netherlands. All newly generated gag sequences clustered together with previously known clade A FIV viruses, confirming the dominance of clade A viruses in Northern Europe. The same was true for the obtained env sequences, with only one sample of an unknown env subtype. Overall, the genetic diversity of FIV strains sequenced in this report was low. This indicates a relatively recent introduction of FIV in Belgium and The Netherlands. However, the sample with an unknown env subtype indicates that new introductions of FIV from unknown origin do occur and this will likely increase genetic variability in time. Copyright © 2014 Elsevier B.V. All rights reserved.

  19. Inferring Phylogenetic Networks Using PhyloNet.

    Science.gov (United States)

    Wen, Dingqiao; Yu, Yun; Zhu, Jiafan; Nakhleh, Luay

    2018-07-01

    PhyloNet was released in 2008 as a software package for representing and analyzing phylogenetic networks. At the time of its release, the main functionalities in PhyloNet consisted of measures for comparing network topologies and a single heuristic for reconciling gene trees with a species tree. Since then, PhyloNet has grown significantly. The software package now includes a wide array of methods for inferring phylogenetic networks from data sets of unlinked loci while accounting for both reticulation (e.g., hybridization) and incomplete lineage sorting. In particular, PhyloNet now allows for maximum parsimony, maximum likelihood, and Bayesian inference of phylogenetic networks from gene tree estimates. Furthermore, Bayesian inference directly from sequence data (sequence alignments or biallelic markers) is implemented. Maximum parsimony is based on an extension of the "minimizing deep coalescences" criterion to phylogenetic networks, whereas maximum likelihood and Bayesian inference are based on the multispecies network coalescent. All methods allow for multiple individuals per species. As computing the likelihood of a phylogenetic network is computationally hard, PhyloNet allows for evaluation and inference of networks using a pseudolikelihood measure. PhyloNet summarizes the results of the various analyzes and generates phylogenetic networks in the extended Newick format that is readily viewable by existing visualization software.

  20. Prevalence, complete genome sequencing and phylogenetic analysis of porcine deltacoronavirus in South Korea, 2014-2016.

    Science.gov (United States)

    Jang, G; Lee, K-K; Kim, S-H; Lee, C

    2017-10-01

    Porcine deltacoronavirus (PDCoV) is a newly emerged enterotropic swine coronavirus that causes enteritis and diarrhoea in piglets. Here, a nested reverse transcription (RT)-PCR approach for the detection of PDCoV was developed to identify and characterize aetiologic agent(s) associated with diarrhoeal diseases in piglets in South Korea. A PCR-based method was applied to investigate the presence of PDCoV in 683 diarrhoeic samples collected from 449 commercial pig farms in South Korea from January 2014 to December 2016. The molecular-based survey indicated a relatively high prevalence of PDCoV (19.03%) in South Korea. Among those, the monoinfection of PDCoV (9.66%) and co-infection of PDCoV (6.30%) with porcine epidemic diarrhoea (PEDV) were predominant in diarrhoeal samples. The full-length genomes or the complete spike genes of the most recent strains identified in 2016 (KNU16-07, KNU16-08 and KNU16-11) were sequenced and analysed to characterize PDCoV currently prevalent in South Korea. We found a single insertion-deletion signature and dozens of genetic changes in the spike (S) genes of the KNU16 isolates. Phylogenetic analysis based on the entire genome and spike protein sequences of these strains indicated that they are most closely related to other Korean isolates grouped with the US strains. However, Korean PDCoV strains formed different branches within the same cluster, implying continuous evolution in the field. Our data will advance the understanding of the molecular epidemiology and evolutionary characteristics of PDCoV circulating in South Korea. © 2017 Blackwell Verlag GmbH.

  1. 16S ribosomal RNA sequence analysis for determination of phylogenetic relationship among methylotrophs.

    Science.gov (United States)

    Tsuji, K; Tsien, H C; Hanson, R S; DePalma, S R; Scholtz, R; LaRoche, S

    1990-01-01

    16S ribosomal RNAs (rRNA) of 12 methylotrophic bacteria have been almost completely sequenced to establish their phylogenetic relationships. Methylotrophs that are physiologically related are phylogenetically diverse and are scattered among the purple eubacteria (class Proteobacteria). Group I methylotrophs can be classified in the beta- and the gamma-subdivisions and group II methylotrophs in the alpha-subdivision of the purple eubacteria, respectively. Pink-pigmented facultative and non-pigmented obligate group II methylotrophs form two distinctly separate branches within the alpha-subdivision. The secondary structures of the 16S rRNA sequences of 'Methylocystis parvus' strain OBBP, 'Methylosinus trichosporium' strain OB3b, 'Methylosporovibrio methanica' strain 81Z and Hyphomicrobium sp. strain DM2 are similar, and these non-pigmented obligate group II methylotrophs form one tight cluster in the alpha-subdivision. The pink-pigmented facultative methylotrophs, Methylobacterium extorquens strain AM1, Methylobacterium sp. strain DM4 and Methylobacterium organophilum strain XX form another cluster within the alpha-subdivision. Although similar in phenotypic characteristics, Methylobacterium organophilum strain XX and Methylobacterium extorquens strain AM1 are clearly distinguishable by their 16S rRNA sequences. The group I methylotrophs, Methylophilus methylotrophus strain AS1 and methylotrophic species DM11, which do not utilize methane, are similar in 16S rRNA sequence to bacteria in the beta-subdivision. The methane-utilizing, obligate group I methanotrophs, Methylococcus capsulatus strain BATH and Methylomonas methanica, are placed in the gamma-subdivision. The results demonstrate that it is possible to distinguish and classify the methylotrophic bacteria using 16S rRNA sequence analysis.

  2. Phylogenetic classification of bony fishes.

    Science.gov (United States)

    Betancur-R, Ricardo; Wiley, Edward O; Arratia, Gloria; Acero, Arturo; Bailly, Nicolas; Miya, Masaki; Lecointre, Guillaume; Ortí, Guillermo

    2017-07-06

    Fish classifications, as those of most other taxonomic groups, are being transformed drastically as new molecular phylogenies provide support for natural groups that were unanticipated by previous studies. A brief review of the main criteria used by ichthyologists to define their classifications during the last 50 years, however, reveals slow progress towards using an explicit phylogenetic framework. Instead, the trend has been to rely, in varying degrees, on deep-rooted anatomical concepts and authority, often mixing taxa with explicit phylogenetic support with arbitrary groupings. Two leading sources in ichthyology frequently used for fish classifications (JS Nelson's volumes of Fishes of the World and W. Eschmeyer's Catalog of Fishes) fail to adopt a global phylogenetic framework despite much recent progress made towards the resolution of the fish Tree of Life. The first explicit phylogenetic classification of bony fishes was published in 2013, based on a comprehensive molecular phylogeny ( www.deepfin.org ). We here update the first version of that classification by incorporating the most recent phylogenetic results. The updated classification presented here is based on phylogenies inferred using molecular and genomic data for nearly 2000 fishes. A total of 72 orders (and 79 suborders) are recognized in this version, compared with 66 orders in version 1. The phylogeny resolves placement of 410 families, or ~80% of the total of 514 families of bony fishes currently recognized. The ordinal status of 30 percomorph families included in this study, however, remains uncertain (incertae sedis in the series Carangaria, Ovalentaria, or Eupercaria). Comments to support taxonomic decisions and comparisons with conflicting taxonomic groups proposed by others are presented. We also highlight cases were morphological support exist for the groups being classified. This version of the phylogenetic classification of bony fishes is substantially improved, providing resolution

  3. Phylogenetic reconstruction of endophytic fungal isolates using internal transcribed spacer 2 (ITS2) region.

    Science.gov (United States)

    GokulRaj, Kathamuthu; Sundaresan, Natesan; Ganeshan, Enthai Jagan; Rajapriya, Pandi; Muthumary, Johnpaul; Sridhar, Jayavel; Pandi, Mohan

    2014-01-01

    Endophytic fungi are inhabitants of plants, living most part of their lifecycle asymptomatically which mainly confer protection and ecological advantages to the host plant. In this present study, 48 endophytic fungi were isolated from the leaves of three medicinal plants and characterized based on ITS2 sequence - secondary structure analysis. ITS2 secondary structures were elucidated with minimum free energy method (MFOLD version 3.1) and consensus structure of each genus was generated by 4SALE. ProfDistS was used to generate ITS2 sequence structure based phylogenetic tree respectively. Our elucidated isolates were belonging to Ascomycetes family, representing 5 orders and 6 genera. Colletotrichum/Glomerella spp., Diaporthae/Phomopsis spp., and Alternaria spp., were predominantly observed while Cochliobolus sp., Cladosporium sp., and Emericella sp., were represented by singletons. The constructed phylogenetic tree has well resolved monophyletic groups with >50% bootstrap value support. Secondary structures based fungal systematics improves not only the stability; it also increases the precision of phylogenetic inference. Above ITS2 based phylogenetic analysis was performed for our 48 isolates along with sequences of known ex-types taken from GenBank which confirms the efficiency of the proposed method. Further, we propose it as superlative marker for reconstructing phylogenetic relationships at different taxonomic levels due to their lesser length.

  4. Cloning, Phylogenetic Analysis and 3D Modeling of a Putative Lysosomal Acid Lipase from the Camel, Camelus dromedarius

    Directory of Open Access Journals (Sweden)

    Farid Shokry Ataya

    2012-08-01

    Full Text Available Acid lipase belongs to a family of enzymes that is mainly present in lysosomes of different organs and the stomach. It is characterized by its capacity to withstand acidic conditions while maintaining high lipolytic activity. We cloned for the first time the full coding sequence of camel’s lysosomal acid lipase, cLIPA using RT-PCR technique (Genbank accession numbers JF803951 and AEG75815, for the nucleotide and aminoacid sequences respectively. The cDNA sequencing revealed an open reading frame of 1,197 nucleotides that encodes a protein of 399 aminoacids which was similar to that from other related mammalian species. Bioinformatic analysis was used to determine the aminoacid sequence, 3D structure and phylogeny of cLIPA. Bioinformatics analysis suggested the molecular weight of the translated protein to be 45.57 kDa, which could be decreased to 43.16 kDa after the removal of a signal peptide comprising the first 21 aminoacids. The deduced cLIPA sequences exhibited high identity with Equus caballus (86%, Numascus leucogenys (85%, Homo sapiens (84%, Sus scrofa (84%, Bos taurus (82% and Ovis aries (81%. cLIPA shows high aminoacid sequence identity with human and dog-gastric lipases (58%, and 59% respectively which makes it relevant to build a 3D structure model for cLIPA. The comparison confirms the presence of the catalytic triad and the oxyanion hole in cLIPA. Phylogenetic analysis revealed that camel cLIPA is grouped with monkey, human, pig, cow and goat. The level of expression of cLIPA in five camel tissues was examined using Real Time-PCR. The highest level of cLIPA transcript was found in the camel testis (162%, followed by spleen (129%, liver (100%, kidney (20.5% and lung (17.4%.

  5. Inferring 'weak spots' in phylogenetic trees: application to mosasauroid nomenclature.

    Science.gov (United States)

    Madzia, Daniel; Cau, Andrea

    2017-01-01

    Mosasauroid squamates represented the apex predators within the Late Cretaceous marine and occasionally also freshwater ecosystems. Proper understanding of the origin of their ecological adaptations or paleobiogeographic dispersals requires adequate knowledge of their phylogeny. The studies assessing the position of mosasauroids on the squamate evolutionary tree and their origins have long given conflicting results. The phylogenetic relationships within Mosasauroidea, however, have experienced only little changes throughout the last decades. Considering the substantial improvements in the development of phylogenetic methodology that have undergone in recent years, resulting, among others, in numerous alterations in the phylogenetic hypotheses of other fossil amniotes, we test the robustness in our understanding of mosasauroid beginnings and their evolutionary history. We re-examined a data set that results from modifications assembled in the course of the last 20 years and performed multiple parsimony analyses and Bayesian tip-dating analysis. Following the inferred topologies and the 'weak spots' in the phylogeny of mosasauroids, we revise the nomenclature of the 'traditionally' recognized mosasauroid clades, to acknowledge the overall weakness among branches and the alternative topologies suggested previously, and discuss several factors that might have an impact on the differing phylogenetic hypotheses and their statistical support.

  6. Phylogenetic and functional analysis of metagenome sequence from high-temperature archaeal habitats demonstrate linkages between metabolic potential and geochemistry

    Directory of Open Access Journals (Sweden)

    William P. Inskeep

    2013-05-01

    Full Text Available Geothermal habitats in Yellowstone National Park (YNP provide an unparalled opportunity to understand the environmental factors that control the distribution of archaea in thermal habitats. Here we describe, analyze and synthesize metagenomic and geochemical data collected from seven high-temperature sites that contain microbial communities dominated by archaea relative to bacteria. The specific objectives of the study were to use metagenome sequencing to determine the structure and functional capacity of thermophilic archaeal-dominated microbial communities across a pH range from 2.5 to 6.4 and to discuss specific examples where the metabolic potential correlated with measured environmental parameters and geochemical processes occurring in situ. Random shotgun metagenome sequence (~40-45 Mbase Sanger sequencing per site was obtained from environmental DNA extracted from high-temperature sediments and/or microbial mats and subjected to numerous phylogenetic and functional analyses. Analysis of individual sequences (e.g., MEGAN and G+C content and assemblies from each habitat type revealed the presence of dominant archaeal populations in all environments, 10 of whose genomes were largely reconstructed from the sequence data. Analysis of protein family occurrence, particularly of those involved in energy conservation, electron transport and autotrophic metabolism, revealed significant differences in metabolic strategies across sites consistent with differences in major geochemical attributes (e.g., sulfide, oxygen, pH. These observations provide an ecological basis for understanding the distribution of indigenous archaeal lineages across high temperature systems of YNP.

  7. PAL: an object-oriented programming library for molecular evolution and phylogenetics.

    Science.gov (United States)

    Drummond, A; Strimmer, K

    2001-07-01

    Phylogenetic Analysis Library (PAL) is a collection of Java classes for use in molecular evolution and phylogenetics. PAL provides a modular environment for the rapid construction of both special-purpose and general analysis programs. PAL version 1.1 consists of 145 public classes or interfaces in 13 packages, including classes for models of character evolution, maximum-likelihood estimation, and the coalescent, with a total of more than 27000 lines of code. The PAL project is set up as a collaborative project to facilitate contributions from other researchers. AVAILIABILTY: The program is free and is available at http://www.pal-project.org. It requires Java 1.1 or later. PAL is licensed under the GNU General Public License.

  8. Multilocus sequence typing and phylogenetic analysis of Propionibacterium acnes

    DEFF Research Database (Denmark)

    Kilian, Mogens; Scholz, Christian F. P.; Lomholt, Hans B.

    2012-01-01

    Propionibacterium acnes is a commensal of human skin but is also implicated in the pathogenesis of acne vulgaris, in biofilm-associated infections of medical devices and endophthalmitis, and in infections of bone and dental root canals. Recent studies associate P. acnes with prostate cancer...... schemes were compared with reference to a phylogenetic tree based on 78 P. acnes genomes and their gene contents. Further support for a basically clonal population structure of P. acnes and a scenario of the global spread of epidemic clones of P. acnes was obtained. Compared to the Belfast scheme...

  9. Phylogenetic analysis of subgenus vigna species using nuclear ribosomal RNA ITS: evidence of hybridization among Vigna unguiculata subspecies.

    Science.gov (United States)

    Vijaykumar, Archana; Saini, Ajay; Jawali, Narendra

    2010-01-01

    Molecular phylogeny among species belonging to subgenus Vigna (genus Vigna) was inferred based on internal transcribed spacer (ITS) sequences of 18S-5.8S-26S ribosomal RNA gene unit. Analysis showed a total of 356 polymorphic sites of which approximately 80% were parsimony informative. Phylogenetic reconstruction by neighbor joining and maximum parsimony methods placed the 57 Vigna accessions (belonging to 15 species) into 5 major clades. Five species viz. Vigna heterophylla, Vigna pubigera, Vigna parkeri, Vigna laurentii, and Vigna gracilis whose position in the subgenus was previously not known were placed in the section Vigna. A single accession (Vigna unguiculata ssp. tenuis, NI 1637) harbored 2 intragenomic ITS variants, indicative of 2 different types of ribosomal DNA (rDNA) repeat units. ITS variant type-I was close to ITS from V. unguiculata ssp. pubescens, whereas type-II was close to V. unguiculata ssp. tenuis. Transcript analysis clearly demonstrates that in accession NI 1637, rDNA repeat units with only type-II ITS variants are transcriptionally active. Evidence from sequence analysis (of 5.8S, ITS1, and ITS2) and secondary structure analysis (of ITS1 and ITS2) indicates that the type-I ITS variant probably does not belong to the pseudogenic rDNA repeat units. The results from phylogenetic and transcript analysis suggest that the rDNA units with the type-I ITS may have introgressed as a result of hybridization (between ssp. tenuis and ssp. pubescens); however, it has been epigenetically silenced. The results also demonstrate differential evolution of ITS sequence among wild and cultivated forms of V. unguiculata.

  10. Integration of Phenotypic Metadata and Protein Similarity in Archaea Using a Spectral Bipartitioning Approach

    Energy Technology Data Exchange (ETDEWEB)

    Hooper, Sean D.; Anderson, Iain J; Pati, Amrita; Dalevi, Daniel; Mavromatis, Konstantinos; Kyrpides, Nikos C

    2009-01-01

    In order to simplify and meaningfully categorize large sets of protein sequence data, it is commonplace to cluster proteins based on the similarity of those sequences. However, it quickly becomes clear that the sequence flexibility allowed a given protein varies significantly among different protein families. The degree to which sequences are conserved not only differs for each protein family, but also is affected by the phylogenetic divergence of the source organisms. Clustering techniques that use similarity thresholds for protein families do not always allow for these variations and thus cannot be confidently used for applications such as automated annotation and phylogenetic profiling. In this work, we applied a spectral bipartitioning technique to all proteins from 53 archaeal genomes. Comparisons between different taxonomic levels allowed us to study the effects of phylogenetic distances on cluster structure. Likewise, by associating functional annotations and phenotypic metadata with each protein, we could compare our protein similarity clusters with both protein function and associated phenotype. Our clusters can be analyzed graphically and interactively online.

  11. Genetic variation and phylogenetic relationships of the ectomycorrhizal Floccularia luteovirens on the Qinghai-Tibet Plateau.

    Science.gov (United States)

    Xing, Rui; Gao, Qing-Bo; Zhang, Fa-Qi; Fu, Peng-Cheng; Wang, Jiu-Li; Yan, Hui-Ying; Chen, Shi-Long

    2017-08-01

    Floccularia luteovirens, as an ectomycorrhizal fungus, is widely distributed in the Qinghai-Tibet Plateau. As an edible fungus, it is famous for its unique flavor. Former studies mainly focus on the chemical composition and genetic structure of this species. However, the phylogenetic relationship between genotypes remains unknown. In this study, the genetic variation and phylogenetic relationship between the genotypes of F. luteovirens in Qinghai-Tibet Plateau was estimated through the analysis on two protein-coding genes (rpb1 and ef-1α) from 398 individuals collected from 24 wild populations. The sample covered the entire range of this species during all the growth seasons from 2011 to 2015. 13 genotypes were detected and moderate genetic diversity was revealed. Based on the results of network analysis, the maximum likelihood (ML), maximum parsimony (MP), and Bayesian inference (BI) analyses, the genotypes H-1, H-4, H-6, H-8, H-10, and H-11 were grouped into one clade. Additionally, a relatively higher genotype diversity (average h value is 0.722) and unique genotypes in the northeast edge of Qinghai- Tibet plateau have been found, combined with the results of mismatch analysis and neutrality tests indicated that Southeast Qinghai-Tibet plateau was a refuge for F. luteovirens during the historical geological or climatic events (uplifting of the Qinghai-Tibet Plateau or Last Glacial Maximum). Furthermore, the present distribution of the species on the Qinghai-Tibet plateau has resulted from the recent population expansion. Our findings provide a foundation for the future study of the evolutionary history and the speciation of this species.

  12. The Princeton Protein Orthology Database (P-POD): a comparative genomics analysis tool for biologists.

    OpenAIRE

    Sven Heinicke; Michael S Livstone; Charles Lu; Rose Oughtred; Fan Kang; Samuel V Angiuoli; Owen White; David Botstein; Kara Dolinski

    2007-01-01

    Many biological databases that provide comparative genomics information and tools are now available on the internet. While certainly quite useful, to our knowledge none of the existing databases combine results from multiple comparative genomics methods with manually curated information from the literature. Here we describe the Princeton Protein Orthology Database (P-POD, http://ortholog.princeton.edu), a user-friendly database system that allows users to find and visualize the phylogenetic r...

  13. First evidence of Leishmania infection in European brown hare (Lepus europaeus) in Greece: GIS analysis and phylogenetic position within the Leishmania spp.

    Science.gov (United States)

    Tsokana, C N; Sokos, C; Giannakopoulos, A; Mamuris, Z; Birtsas, P; Papaspyropoulos, K; Valiakos, G; Spyrou, V; Lefkaditis, M; Chatzopoulos, D C; Kantere, M; Manolakou, K; Touloudi, A; Burriel, A Rodi; Ferroglio, E; Hadjichristodoulou, C; Billinis, C

    2016-01-01

    Although the existence of a sylvatic transmission cycle of Leishmania spp., independent from the domestic cycle, has been proposed, data are scarce on Leishmania infection in wild mammals in Greece. In this study, we aimed to investigate the presence of Leishmania infection in the European brown hare in Greece, to infer the phylogenetic position of the Leishmania parasites detected in hares in Greece, and to identify any possible correlation between Leishmania infection in hares with environmental parameters, using the geographical information system (GIS). Spleen samples from 166 hares were tested by internal transcribed spacer-1 (ITS-1)-nested PCR for the detection of Leishmania DNA. Phylogenetic analysis was performed on Leishmania sequences from hares in Greece in conjunction with Leishmania sequences from dogs in Greece and 46 Leishmania sequences retrieved from GenBank. The Leishmania DNA prevalence in hares was found to be 23.49 % (95 % confidence interval (CI) 17.27-30.69). The phylogenetic analysis confirmed that the Leishmania sequences from hares in Greece belong in the Leishmania donovani complex. The widespread Leishmania infection in hares should be taken into consideration because under specific circumstances, this species can act as a reservoir host. This study suggests that the role of wild animals, including hares, in the epidemiology of Leishmania spp. in Greece deserves further elucidation.

  14. Mitochondrial genome of Pteronotus personatus (Chiroptera: Mormoopidae): comparison with selected bats and phylogenetic considerations.

    Science.gov (United States)

    López-Wilchis, Ricardo; Del Río-Portilla, Miguel Ángel; Guevara-Chumacero, Luis Manuel

    2017-02-01

    We described the complete mitochondrial genome (mitogenome) of the Wagner's mustached bat, Pteronotus personatus, a species belonging to the family Mormoopidae, and compared it with other published mitogenomes of bats (Chiroptera). The mitogenome of P. personatus was 16,570 bp long and contained a typically conserved structure including 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and one control region (D-loop). Most of the genes were encoded on the H-strand, except for eight tRNA and the ND6 genes. The order of protein-coding and rRNA genes was highly conserved in all mitogenomes. All protein-coding genes started with an ATG codon, except for ND2, ND3, and ND5, which initiated with ATA, and terminated with the typical stop codon TAA/TAG or the codon AGA. Phylogenetic trees constructed using Maximum Parsimony, Maximum Likelihood, and Bayesian inference methods showed an identical topology and indicated the monophyly of different families of bats (Mormoopidae, Phyllostomidae, Vespertilionidae, Rhinolophidae, and Pteropopidae) and the existence of two major clades corresponding to the suborders Yangochiroptera and Yinpterochiroptera. The mitogenome sequence provided here will be useful for further phylogenetic analyses and population genetic studies in mormoopid bats.

  15. Undergraduate Students’ Initial Ability in Understanding Phylogenetic Tree

    Science.gov (United States)

    Sa'adah, S.; Hidayat, T.; Sudargo, Fransisca

    2017-04-01

    The Phylogenetic tree is a visual representation depicts a hypothesis about the evolutionary relationship among taxa. Evolutionary experts use this representation to evaluate the evidence for evolution. The phylogenetic tree is currently growing for many disciplines in biology. Consequently, learning about the phylogenetic tree has become an important part of biological education and an interesting area of biology education research. Skill to understanding and reasoning of the phylogenetic tree, (called tree thinking) is an important skill for biology students. However, research showed many students have difficulty in interpreting, constructing, and comparing among the phylogenetic tree, as well as experiencing a misconception in the understanding of the phylogenetic tree. Students are often not taught how to reason about evolutionary relationship depicted in the diagram. Students are also not provided with information about the underlying theory and process of phylogenetic. This study aims to investigate the initial ability of undergraduate students in understanding and reasoning of the phylogenetic tree. The research method is the descriptive method. Students are given multiple choice questions and an essay that representative by tree thinking elements. Each correct answer made percentages. Each student is also given questionnaires. The results showed that the undergraduate students’ initial ability in understanding and reasoning phylogenetic tree is low. Many students are not able to answer questions about the phylogenetic tree. Only 19 % undergraduate student who answered correctly on indicator evaluate the evolutionary relationship among taxa, 25% undergraduate student who answered correctly on indicator applying concepts of the clade, 17% undergraduate student who answered correctly on indicator determines the character evolution, and only a few undergraduate student who can construct the phylogenetic tree.

  16. Accurate reconstruction of insertion-deletion histories by statistical phylogenetics.

    Directory of Open Access Journals (Sweden)

    Oscar Westesson

    Full Text Available The Multiple Sequence Alignment (MSA is a computational abstraction that represents a partial summary either of indel history, or of structural similarity. Taking the former view (indel history, it is possible to use formal automata theory to generalize the phylogenetic likelihood framework for finite substitution models (Dayhoff's probability matrices and Felsenstein's pruning algorithm to arbitrary-length sequences. In this paper, we report results of a simulation-based benchmark of several methods for reconstruction of indel history. The methods tested include a relatively new algorithm for statistical marginalization of MSAs that sums over a stochastically-sampled ensemble of the most probable evolutionary histories. For mammalian evolutionary parameters on several different trees, the single most likely history sampled by our algorithm appears less biased than histories reconstructed by other MSA methods. The algorithm can also be used for alignment-free inference, where the MSA is explicitly summed out of the analysis. As an illustration of our method, we discuss reconstruction of the evolutionary histories of human protein-coding genes.

  17. Improved Maximum Parsimony Models for Phylogenetic Networks.

    Science.gov (United States)

    Van Iersel, Leo; Jones, Mark; Scornavacca, Celine

    2018-05-01

    Phylogenetic networks are well suited to represent evolutionary histories comprising reticulate evolution. Several methods aiming at reconstructing explicit phylogenetic networks have been developed in the last two decades. In this article, we propose a new definition of maximum parsimony for phylogenetic networks that permits to model biological scenarios that cannot be modeled by the definitions currently present in the literature (namely, the "hardwired" and "softwired" parsimony). Building on this new definition, we provide several algorithmic results that lay the foundations for new parsimony-based methods for phylogenetic network reconstruction.

  18. Transmembrane START domain proteins: in silico identification, characterization and expression analysis under stress conditions in chickpea (Cicer arietinum L.).

    Science.gov (United States)

    Satheesh, Viswanathan; Chidambaranathan, Parameswaran; Jagannadham, Prasanth Tejkumar; Kumar, Vajinder; Jain, Pradeep K; Chinnusamy, Viswanathan; Bhat, Shripad R; Srinivasan, R

    2016-01-01

    Steroidogenic acute regulatory related transfer (StART) proteins that are involved in transport of lipid molecules, play a myriad of functions in insects, mammals and plants. These proteins consist of a modular START domain of approximately 200 amino acids which binds and transfers the lipids. In the present study we have performed a genome-wide search for all START domain proteins in chickpea. The search identified 36 chickpea genes belonging to the START domain family. Through a phylogenetic tree reconstructed with Arabidopsis, rice, chickpea, and soybean START proteins, we were able to identify four transmembrane START (TM-START) proteins in chickpea. These four proteins are homologous to the highly conserved mammalian phosphatidylcholine transfer proteins. Multiple sequence alignment of all the transmembrane containing START proteins from Arabidopsis, rice, chickpea, and soybean revealed that the amino acid residues to which phosphatidylcholine binds in mammals, is also conserved in all these plant species, implying an important functional role and a very similar mode of action of all these proteins across dicots and monocots. This study characterizes a few of the not so well studied transmembrane START superfamily genes that may be involved in stress signaling. Expression analysis in various tissues showed that these genes are predominantly expressed in flowers and roots of chickpea. Three of the chickpea TM-START genes showed induced expression in response to drought, salt, wound and heat stress, suggesting their role in stress response.

  19. Identification and phylogenetic analysis of Tityus pachyurus and Tityus obscurus novel putative Na+-channel scorpion toxins.

    Directory of Open Access Journals (Sweden)

    Jimmy A Guerrero-Vargas

    Full Text Available Colombia and Brazil are affected by severe cases of scorpionism. In Colombia the most dangerous accidents are caused by Tityus pachyurus that is widely distributed around this country. In the Brazilian Amazonian region scorpion stings are a common event caused by Tityus obscurus. The main objective of this work was to perform the molecular cloning of the putative Na(+-channel scorpion toxins (NaScTxs from T. pachyurus and T. obscurus venom glands and to analyze their phylogenetic relationship with other known NaScTxs from Tityus species.cDNA libraries from venom glands of these two species were constructed and five nucleotide sequences from T. pachyurus were identified as putative modulators of Na(+-channels, and were named Tpa4, Tpa5, Tpa6, Tpa7 and Tpa8; the latter being the first anti-insect excitatory β-class NaScTx in Tityus scorpion venom to be described. Fifteen sequences from T. obscurus were identified as putative NaScTxs, among which three had been previously described, and the others were named To4 to To15. The peptides Tpa4, Tpa5, Tpa6, To6, To7, To9, To10 and To14 are closely related to the α-class NaScTxs, whereas Tpa7, Tpa8, To4, To8, To12 and To15 sequences are more related to the β-class NaScTxs. To5 is possibly an arthropod specific toxin. To11 and To13 share sequence similarities with both α and β NaScTxs. By means of phylogenetic analysis using the Maximum Parsimony method and the known NaScTxs from Tityus species, these toxins were clustered into 14 distinct groups.This communication describes new putative NaScTxs from T. pachyurus and T. obscurus and their phylogenetic analysis. The results indicate clear geographic separation between scorpions of Tityus genus inhabiting the Amazonian and Mountain Andes regions and those distributed over the Southern of the Amazonian rainforest. Based on the consensus sequences for the different clusters, a new nomenclature for the NaScTxs is proposed.

  20. A phylogenetic study of SPBP and RAI1: evolutionary conservation of chromatin binding modules.

    Directory of Open Access Journals (Sweden)

    Sagar Darvekar

    Full Text Available Our genome is assembled into and array of highly dynamic nucleosome structures allowing spatial and temporal access to DNA. The nucleosomes are subject to a wide array of post-translational modifications, altering the DNA-histone interaction and serving as docking sites for proteins exhibiting effector or "reader" modules. The nuclear proteins SPBP and RAI1 are composed of several putative "reader" modules which may have ability to recognise a set of histone modification marks. Here we have performed a phylogenetic study of their putative reader modules, the C-terminal ePHD/ADD like domain, a novel nucleosome binding region and an AT-hook motif. Interactions studies in vitro and in yeast cells suggested that despite the extraordinary long loop region in their ePHD/ADD-like chromatin binding domains, the C-terminal region of both proteins seem to adopt a cross-braced topology of zinc finger interactions similar to other structurally determined ePHD/ADD structures. Both their ePHD/ADD-like domain and their novel nucleosome binding domain are highly conserved in vertebrate evolution, and construction of a phylogenetic tree displayed two well supported clusters representing SPBP and RAI1, respectively. Their genome and domain organisation suggest that SPBP and RAI1 have occurred from a gene duplication event. The phylogenetic tree suggests that this duplication has happened early in vertebrate evolution, since only one gene was identified in insects and lancelet. Finally, experimental data confirm that the conserved novel nucleosome binding region of RAI1 has the ability to bind the nucleosome core and histones. However, an adjacent conserved AT-hook motif as identified in SPBP is not present in RAI1, and deletion of the novel nucleosome binding region of RAI1 did not significantly affect its nuclear localisation.