accurate phylogenetic breakpoint: Topics by WorldWideScience.org

Sample records for accurate phylogenetic breakpoint

An evolutionary model-based algorithm for accurate phylogenetic breakpoint mapping and subtype prediction in HIV-1.

Directory of Open Access Journals (Sweden)

Sergei L Kosakovsky Pond

2009-11-01

Full Text Available Genetically diverse pathogens (such as Human Immunodeficiency virus type 1, HIV-1 are frequently stratified into phylogenetically or immunologically defined subtypes for classification purposes. Computational identification of such subtypes is helpful in surveillance, epidemiological analysis and detection of novel variants, e.g., circulating recombinant forms in HIV-1. A number of conceptually and technically different techniques have been proposed for determining the subtype of a query sequence, but there is not a universally optimal approach. We present a model-based phylogenetic method for automatically subtyping an HIV-1 (or other viral or bacterial sequence, mapping the location of breakpoints and assigning parental sequences in recombinant strains as well as computing confidence levels for the inferred quantities. Our Subtype Classification Using Evolutionary ALgorithms (SCUEAL procedure is shown to perform very well in a variety of simulation scenarios, runs in parallel when multiple sequences are being screened, and matches or exceeds the performance of existing approaches on typical empirical cases. We applied SCUEAL to all available polymerase (pol sequences from two large databases, the Stanford Drug Resistance database and the UK HIV Drug Resistance Database. Comparing with subtypes which had previously been assigned revealed that a minor but substantial (approximately 5% fraction of pure subtype sequences may in fact be within- or inter-subtype recombinants. A free implementation of SCUEAL is provided as a module for the HyPhy package and the Datamonkey web server. Our method is especially useful when an accurate automatic classification of an unknown strain is desired, and is positioned to complement and extend faster but less accurate methods. Given the increasingly frequent use of HIV subtype information in studies focusing on the effect of subtype on treatment, clinical outcome, pathogenicity and vaccine design, the importance
DB2: a probabilistic approach for accurate detection of tandem duplication breakpoints using paired-end reads.

Science.gov (United States)

Yavaş, Gökhan; Koyutürk, Mehmet; Gould, Meetha P; McMahon, Sarah; LaFramboise, Thomas

2014-03-05

With the advent of paired-end high throughput sequencing, it is now possible to identify various types of structural variation on a genome-wide scale. Although many methods have been proposed for structural variation detection, most do not provide precise boundaries for identified variants. In this paper, we propose a new method, Distribution Based detection of Duplication Boundaries (DB2), for accurate detection of tandem duplication breakpoints, an important class of structural variation, with high precision and recall. Our computational experiments on simulated data show that DB2 outperforms state-of-the-art methods in terms of finding breakpoints of tandem duplications, with a higher positive predictive value (precision) in calling the duplications' presence. In particular, DB2's prediction of tandem duplications is correct 99% of the time even for very noisy data, while narrowing down the space of possible breakpoints within a margin of 15 to 20 bps on the average. Most of the existing methods provide boundaries in ranges that extend to hundreds of bases with lower precision values. Our method is also highly robust to varying properties of the sequencing library and to the sizes of the tandem duplications, as shown by its stable precision, recall and mean boundary mismatch performance. We demonstrate our method's efficacy using both simulated paired-end reads, and those generated from a melanoma sample and two ovarian cancer samples. Newly discovered tandem duplications are validated using PCR and Sanger sequencing. Our method, DB2, uses discordantly aligned reads, taking into account the distribution of fragment length to predict tandem duplications along with their breakpoints on a donor genome. The proposed method fine tunes the breakpoint calls by applying a novel probabilistic framework that incorporates the empirical fragment length distribution to score each feasible breakpoint. DB2 is implemented in Java programming language and is freely available
Accurate phylogenetic tree reconstruction from quartets: a heuristic approach.

Science.gov (United States)

Reaz, Rezwana; Bayzid, Md Shamsuzzoha; Rahman, M Sohel

2014-01-01

Supertree methods construct trees on a set of taxa (species) combining many smaller trees on the overlapping subsets of the entire set of taxa. A 'quartet' is an unrooted tree over 4 taxa, hence the quartet-based supertree methods combine many 4-taxon unrooted trees into a single and coherent tree over the complete set of taxa. Quartet-based phylogeny reconstruction methods have been receiving considerable attentions in the recent years. An accurate and efficient quartet-based method might be competitive with the current best phylogenetic tree reconstruction methods (such as maximum likelihood or Bayesian MCMC analyses), without being as computationally intensive. In this paper, we present a novel and highly accurate quartet-based phylogenetic tree reconstruction method. We performed an extensive experimental study to evaluate the accuracy and scalability of our approach on both simulated and biological datasets.
DNA Probe Pooling for Rapid Delineation of Chromosomal Breakpoints

Energy Technology Data Exchange (ETDEWEB)

Lu, Chun-Mei; Kwan, Johnson; Baumgartner, Adolf; Weier, Jingly F.; Wang, Mei; Escudero, Tomas; Munne' , Santiago; Zitzelsberger, Horst F.; Weier, Heinz-Ulrich

2009-01-30

Structural chromosome aberrations are hallmarks of many human genetic diseases. The precise mapping of translocation breakpoints in tumors is important for identification of genes with altered levels of expression, prediction of tumor progression, therapy response, or length of disease-free survival as well as the preparation of probes for detection of tumor cells in peripheral blood. Similarly, in vitro fertilization (IVF) and preimplantation genetic diagnosis (PGD) for carriers of balanced, reciprocal translocations benefit from accurate breakpoint maps in the preparation of patient-specific DNA probes followed by a selection of normal or balanced oocytes or embryos. We expedited the process of breakpoint mapping and preparation of case-specific probes by utilizing physically mapped bacterial artificial chromosome (BAC) clones. Historically, breakpoint mapping is based on the definition of the smallest interval between proximal and distal probes. Thus, many of the DNA probes prepared for multi-clone and multi-color mapping experiments do not generate additional information. Our pooling protocol described here with examples from thyroid cancer research and PGD accelerates the delineation of translocation breakpoints without sacrificing resolution. The turnaround time from clone selection to mapping results using tumor or IVF patient samples can be as short as three to four days.
Accurate phylogenetic classification of DNA fragments based onsequence composition

Energy Technology Data Exchange (ETDEWEB)

McHardy, Alice C.; Garcia Martin, Hector; Tsirigos, Aristotelis; Hugenholtz, Philip; Rigoutsos, Isidore

2006-05-01

Metagenome studies have retrieved vast amounts of sequenceout of a variety of environments, leading to novel discoveries and greatinsights into the uncultured microbial world. Except for very simplecommunities, diversity makes sequence assembly and analysis a verychallenging problem. To understand the structure a 5 nd function ofmicrobial communities, a taxonomic characterization of the obtainedsequence fragments is highly desirable, yet currently limited mostly tothose sequences that contain phylogenetic marker genes. We show that forclades at the rank of domain down to genus, sequence composition allowsthe very accurate phylogenetic 10 characterization of genomic sequence.We developed a composition-based classifier, PhyloPythia, for de novophylogenetic sequence characterization and have trained it on adata setof 340 genomes. By extensive evaluation experiments we show that themethodis accurate across all taxonomic ranks considered, even forsequences that originate fromnovel organisms and are as short as 1kb.Application to two metagenome datasets 15 obtained from samples ofphosphorus-removing sludge showed that the method allows the accurateclassification at genus level of most sequence fragments from thedominant populations, while at the same time correctly characterizingeven larger parts of the samples at higher taxonomic levels.
Exploring the relationship between sequence similarity and accurate phylogenetic trees.

Science.gov (United States)

Cantarel, Brandi L; Morrison, Hilary G; Pearson, William

2006-11-01

We have characterized the relationship between accurate phylogenetic reconstruction and sequence similarity, testing whether high levels of sequence similarity can consistently produce accurate evolutionary trees. We generated protein families with known phylogenies using a modified version of the PAML/EVOLVER program that produces insertions and deletions as well as substitutions. Protein families were evolved over a range of 100-400 point accepted mutations; at these distances 63% of the families shared significant sequence similarity. Protein families were evolved using balanced and unbalanced trees, with ancient or recent radiations. In families sharing statistically significant similarity, about 60% of multiple sequence alignments were 95% identical to true alignments. To compare recovered topologies with true topologies, we used a score that reflects the fraction of clades that were correctly clustered. As expected, the accuracy of the phylogenies was greatest in the least divergent families. About 88% of phylogenies clustered over 80% of clades in families that shared significant sequence similarity, using Bayesian, parsimony, distance, and maximum likelihood methods. However, for protein families with short ancient branches (ancient radiation), only 30% of the most divergent (but statistically significant) families produced accurate phylogenies, and only about 70% of the second most highly conserved families, with median expectation values better than 10(-60), produced accurate trees. These values represent upper bounds on expected tree accuracy for sequences with a simple divergence history; proteins from 700 Giardia families, with a similar range of sequence similarities but considerably more gaps, produced much less accurate trees. For our simulated insertions and deletions, correct multiple sequence alignments did not perform much better than those produced by T-COFFEE, and including sequences with expressed sequence tag-like sequencing errors did not
Distinct retroelement classes define evolutionary breakpoints demarcating sites of evolutionary novelty

Science.gov (United States)

Longo, Mark S; Carone, Dawn M; Green, Eric D; O'Neill, Michael J; O'Neill, Rachel J

2009-01-01

Background Large-scale genome rearrangements brought about by chromosome breaks underlie numerous inherited diseases, initiate or promote many cancers and are also associated with karyotype diversification during species evolution. Recent research has shown that these breakpoints are nonrandomly distributed throughout the mammalian genome and many, termed "evolutionary breakpoints" (EB), are specific genomic locations that are "reused" during karyotypic evolution. When the phylogenetic trajectory of orthologous chromosome segments is considered, many of these EB are coincident with ancient centromere activity as well as new centromere formation. While EB have been characterized as repeat-rich regions, it has not been determined whether specific sequences have been retained during evolution that would indicate previous centromere activity or a propensity for new centromere formation. Likewise, the conservation of specific sequence motifs or classes at EBs among divergent mammalian taxa has not been determined. Results To define conserved sequence features of EBs associated with centromere evolution, we performed comparative sequence analysis of more than 4.8 Mb within the tammar wallaby, Macropus eugenii, derived from centromeric regions (CEN), euchromatic regions (EU), and an evolutionary breakpoint (EB) that has undergone convergent breakpoint reuse and past centromere activity in marsupials. We found a dramatic enrichment for long interspersed nucleotide elements (LINE1s) and endogenous retroviruses (ERVs) and a depletion of short interspersed nucleotide elements (SINEs) shared between CEN and EBs. We analyzed the orthologous human EB (14q32.33), known to be associated with translocations in many cancers including multiple myelomas and plasma cell leukemias, and found a conserved distribution of similar repetitive elements. Conclusion Our data indicate that EBs tracked within the class Mammalia harbor sequence features retained since the divergence of marsupials
Distinct retroelement classes define evolutionary breakpoints demarcating sites of evolutionary novelty

Directory of Open Access Journals (Sweden)

Green Eric D

2009-07-01

Full Text Available Abstract Background Large-scale genome rearrangements brought about by chromosome breaks underlie numerous inherited diseases, initiate or promote many cancers and are also associated with karyotype diversification during species evolution. Recent research has shown that these breakpoints are nonrandomly distributed throughout the mammalian genome and many, termed "evolutionary breakpoints" (EB, are specific genomic locations that are "reused" during karyotypic evolution. When the phylogenetic trajectory of orthologous chromosome segments is considered, many of these EB are coincident with ancient centromere activity as well as new centromere formation. While EB have been characterized as repeat-rich regions, it has not been determined whether specific sequences have been retained during evolution that would indicate previous centromere activity or a propensity for new centromere formation. Likewise, the conservation of specific sequence motifs or classes at EBs among divergent mammalian taxa has not been determined. Results To define conserved sequence features of EBs associated with centromere evolution, we performed comparative sequence analysis of more than 4.8 Mb within the tammar wallaby, Macropus eugenii, derived from centromeric regions (CEN, euchromatic regions (EU, and an evolutionary breakpoint (EB that has undergone convergent breakpoint reuse and past centromere activity in marsupials. We found a dramatic enrichment for long interspersed nucleotide elements (LINE1s and endogenous retroviruses (ERVs and a depletion of short interspersed nucleotide elements (SINEs shared between CEN and EBs. We analyzed the orthologous human EB (14q32.33, known to be associated with translocations in many cancers including multiple myelomas and plasma cell leukemias, and found a conserved distribution of similar repetitive elements. Conclusion Our data indicate that EBs tracked within the class Mammalia harbor sequence features retained since the
Precise detection of rearrangement breakpoints in mammalian chromosomes

Directory of Open Access Journals (Sweden)

Gautier Christian

2008-06-01

Full Text Available Abstract Background Genomes undergo large structural changes that alter their organisation. The chromosomal regions affected by these rearrangements are called breakpoints, while those which have not been rearranged are called synteny blocks. We developed a method to precisely delimit rearrangement breakpoints on a genome by comparison with the genome of a related species. Contrary to current methods which search for synteny blocks and simply return what remains in the genome as breakpoints, we propose to go further and to investigate the breakpoints themselves in order to refine them. Results Given some reliable and non overlapping synteny blocks, the core of the method consists in refining the regions that are not contained in them. By aligning each breakpoint sequence against its specific orthologous sequences in the other species, we can look for weak similarities inside the breakpoint, thus extending the synteny blocks and narrowing the breakpoints. The identification of the narrowed breakpoints relies on a segmentation algorithm and is statistically assessed. Since this method requires as input synteny blocks with some properties which, though they appear natural, are not verified by current methods for detecting such blocks, we further give a formal definition and provide an algorithm to compute them. The whole method is applied to delimit breakpoints on the human genome when compared to the mouse and dog genomes. Among the 355 human-mouse and 240 human-dog breakpoints, 168 and 146 respectively span less than 50 Kb. We compared the resulting breakpoints with some publicly available ones and show that we achieve a better resolution. Furthermore, we suggest that breakpoints are rarely reduced to a point, and instead consist in often large regions that can be distinguished from the sequences around in terms of segmental duplications, similarity with related species, and transposable elements. Conclusion Our method leads to smaller
Reconstruction of phylogenetic trees of prokaryotes using maximal common intervals.

Science.gov (United States)

Heydari, Mahdi; Marashi, Sayed-Amir; Tusserkani, Ruzbeh; Sadeghi, Mehdi

2014-10-01

One of the fundamental problems in bioinformatics is phylogenetic tree reconstruction, which can be used for classifying living organisms into different taxonomic clades. The classical approach to this problem is based on a marker such as 16S ribosomal RNA. Since evolutionary events like genomic rearrangements are not included in reconstructions of phylogenetic trees based on single genes, much effort has been made to find other characteristics for phylogenetic reconstruction in recent years. With the increasing availability of completely sequenced genomes, gene order can be considered as a new solution for this problem. In the present work, we applied maximal common intervals (MCIs) in two or more genomes to infer their distance and to reconstruct their evolutionary relationship. Additionally, measures based on uncommon segments (UCS's), i.e., those genomic segments which are not detected as part of any of the MCIs, are also used for phylogenetic tree reconstruction. We applied these two types of measures for reconstructing the phylogenetic tree of 63 prokaryotes with known COG (clusters of orthologous groups) families. Similarity between the MCI-based (resp. UCS-based) reconstructed phylogenetic trees and the phylogenetic tree obtained from NCBI taxonomy browser is as high as 93.1% (resp. 94.9%). We show that in the case of this diverse dataset of prokaryotes, tree reconstruction based on MCI and UCS outperforms most of the currently available methods based on gene orders, including breakpoint distance and DCJ. We additionally tested our new measures on a dataset of 13 closely-related bacteria from the genus Prochlorococcus. In this case, distances like rearrangement distance, breakpoint distance and DCJ proved to be useful, while our new measures are still appropriate for phylogenetic reconstruction. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
YAHA: fast and flexible long-read alignment with optimal breakpoint detection.

Science.gov (United States)

Faust, Gregory G; Hall, Ira M

2012-10-01

With improved short-read assembly algorithms and the recent development of long-read sequencers, split mapping will soon be the preferred method for structural variant (SV) detection. Yet, current alignment tools are not well suited for this. We present YAHA, a fast and flexible hash-based aligner. YAHA is as fast and accurate as BWA-SW at finding the single best alignment per query and is dramatically faster and more sensitive than both SSAHA2 and MegaBLAST at finding all possible alignments. Unlike other aligners that report all, or one, alignment per query, or that use simple heuristics to select alignments, YAHA uses a directed acyclic graph to find the optimal set of alignments that cover a query using a biologically relevant breakpoint penalty. YAHA can also report multiple mappings per defined segment of the query. We show that YAHA detects more breakpoints in less time than BWA-SW across all SV classes, and especially excels at complex SVs comprising multiple breakpoints. YAHA is currently supported on 64-bit Linux systems. Binaries and sample data are freely available for download from http://faculty.virginia.edu/irahall/YAHA. imh4y@virginia.edu.
Kalman Filter Track Fits and Track Breakpoint Analysis

CERN Document Server

Astier, Pierre; Cousins, R D; Letessier-Selvon, A A; Popov, B A; Vinogradova, T G; Astier, Pierre; Cardini, Alessandro; Cousins, Robert D.; Letessier-Selvon, Antoine; Popov, Boris A.; Vinogradova, Tatiana

2000-01-01

We give an overview of track fitting using the Kalman filter method in the NOMAD detector at CERN, and emphasize how the wealth of by-product information can be used to analyze track breakpoints (discontinuities in track parameters caused by scattering, decay, etc.). After reviewing how this information has been previously exploited by others, we describe extensions which add power to breakpoint detection and characterization. We show how complete fits to the entire track, with breakpoint parameters added, can be easily obtained from the information from unbroken fits. Tests inspired by the Fisher F-test can then be used to judge breakpoints. Signed quantities (such as change in momentum at the breakpoint) can supplement unsigned quantities such as the various chisquares. We illustrate the method with electrons from real data, and with Monte Carlo simulations of pion decays.
Fast and accurate phylogenetic reconstruction from high-resolution whole-genome data and a novel robustness estimator.

Science.gov (United States)

Lin, Y; Rajan, V; Moret, B M E

2011-09-01

The rapid accumulation of whole-genome data has renewed interest in the study of genomic rearrangements. Comparative genomics, evolutionary biology, and cancer research all require models and algorithms to elucidate the mechanisms, history, and consequences of these rearrangements. However, even simple models lead to NP-hard problems, particularly in the area of phylogenetic analysis. Current approaches are limited to small collections of genomes and low-resolution data (typically a few hundred syntenic blocks). Moreover, whereas phylogenetic analyses from sequence data are deemed incomplete unless bootstrapping scores (a measure of confidence) are given for each tree edge, no equivalent to bootstrapping exists for rearrangement-based phylogenetic analysis. We describe a fast and accurate algorithm for rearrangement analysis that scales up, in both time and accuracy, to modern high-resolution genomic data. We also describe a novel approach to estimate the robustness of results-an equivalent to the bootstrapping analysis used in sequence-based phylogenetic reconstruction. We present the results of extensive testing on both simulated and real data showing that our algorithm returns very accurate results, while scaling linearly with the size of the genomes and cubically with their number. We also present extensive experimental results showing that our approach to robustness testing provides excellent estimates of confidence, which, moreover, can be tuned to trade off thresholds between false positives and false negatives. Together, these two novel approaches enable us to attack heretofore intractable problems, such as phylogenetic inference for high-resolution vertebrate genomes, as we demonstrate on a set of six vertebrate genomes with 8,380 syntenic blocks. A copy of the software is available on demand.
SATe-II: very fast and accurate simultaneous estimation of multiple sequence alignments and phylogenetic trees.

Science.gov (United States)

Liu, Kevin; Warnow, Tandy J; Holder, Mark T; Nelesen, Serita M; Yu, Jiaye; Stamatakis, Alexandros P; Linder, C Randal

2012-01-01

Highly accurate estimation of phylogenetic trees for large data sets is difficult, in part because multiple sequence alignments must be accurate for phylogeny estimation methods to be accurate. Coestimation of alignments and trees has been attempted but currently only SATé estimates reasonably accurate trees and alignments for large data sets in practical time frames (Liu K., Raghavan S., Nelesen S., Linder C.R., Warnow T. 2009b. Rapid and accurate large-scale coestimation of sequence alignments and phylogenetic trees. Science. 324:1561-1564). Here, we present a modification to the original SATé algorithm that improves upon SATé (which we now call SATé-I) in terms of speed and of phylogenetic and alignment accuracy. SATé-II uses a different divide-and-conquer strategy than SATé-I and so produces smaller more closely related subsets than SATé-I; as a result, SATé-II produces more accurate alignments and trees, can analyze larger data sets, and runs more efficiently than SATé-I. Generally, SATé is a metamethod that takes an existing multiple sequence alignment method as an input parameter and boosts the quality of that alignment method. SATé-II-boosted alignment methods are significantly more accurate than their unboosted versions, and trees based upon these improved alignments are more accurate than trees based upon the original alignments. Because SATé-I used maximum likelihood (ML) methods that treat gaps as missing data to estimate trees and because we found a correlation between the quality of tree/alignment pairs and ML scores, we explored the degree to which SATé's performance depends on using ML with gaps treated as missing data to determine the best tree/alignment pair. We present two lines of evidence that using ML with gaps treated as missing data to optimize the alignment and tree produces very poor results. First, we show that the optimization problem where a set of unaligned DNA sequences is given and the output is the tree and alignment of
Fast detection of deletion breakpoints using quantitative PCR

Directory of Open Access Journals (Sweden)

Gulshara Abildinova

2016-01-01

Full Text Available Abstract The routine detection of large and medium copy number variants (CNVs is well established. Hemizygotic deletions or duplications in the large Duchenne muscular dystrophy DMD gene responsible for Duchenne and Becker muscular dystrophies are routinely identified using multiple ligation probe amplification and array-based comparative genomic hybridization. These methods only map deleted or duplicated exons, without providing the exact location of breakpoints. Commonly used methods for the detection of CNV breakpoints include long-range PCR and primer walking, their success being limited by the deletion size, GC content and presence of DNA repeats. Here, we present a strategy for detecting the breakpoints of medium and large CNVs regardless of their size. The hemizygous deletion of exons 45-50 in the DMD gene and the large autosomal heterozygous PARK2 deletion were used to demonstrate the workflow that relies on real-time quantitative PCR to narrow down the deletion region and Sanger sequencing for breakpoint confirmation. The strategy is fast, reliable and cost-efficient, making it amenable to widespread use in genetic laboratories.
Susceptibility Breakpoint for Enrofloxacin against Swine Salmonella spp.

Science.gov (United States)

Hao, Haihong; Pan, Huafang; Ahmad, Ijaz; Cheng, Guyue; Wang, Yulian; Dai, Menghong; Tao, Yanfei; Chen, Dongmei; Peng, Dapeng; Liu, Zhenli

2013-01-01

Susceptibility breakpoints are crucial for prudent use of antimicrobials. This study has developed the first susceptibility breakpoint (MIC ≤ 0.25 μg/ml) for enrofloxacin against swine Salmonella spp. based on wild-type cutoff (COWT) and pharmacokinetic-pharmacodynamic (PK-PD) cutoff (COPD) values, consequently providing a criterion for susceptibility testing and clinical usage of enrofloxacin. PMID:23784134
A Mixture Model and a Hidden Markov Model to Simultaneously Detect Recombination Breakpoints and Reconstruct Phylogenies

Directory of Open Access Journals (Sweden)

Bastien Boussau

2009-06-01

Full Text Available Homologous recombination is a pervasive biological process that affects sequences in all living organisms and viruses. In the presence of recombination, the evolutionary history of an alignment of homologous sequences cannot be properly depicted by a single bifurcating tree: some sites have evolved along a specific phylogenetic tree, others have followed another path. Methods available to analyse recombination in sequences usually involve an analysis of the alignment through sliding-windows, or are particularly demanding in computational resources, and are often limited to nucleotide sequences. In this article, we propose and implement a Mixture Model on trees and a phylogenetic Hidden Markov Model to reveal recombination breakpoints while searching for the various evolutionary histories that are present in an alignment known to have undergone homologous recombination. These models are sufficiently efficient to be applied to dozens of sequences on a single desktop computer, and can handle equivalently nucleotide or protein sequences. We estimate their accuracy on simulated sequences and test them on real data.
A Mixture Model and a Hidden Markov Model to Simultaneously Detect Recombination Breakpoints and Reconstruct Phylogenies

Directory of Open Access Journals (Sweden)

Bastien Boussau

2009-01-01

Full Text Available Homologous recombination is a pervasive biological process that affects sequences in all living organisms and viruses. In the presence of recombination, the evolutionary history of an alignment of homologous sequences cannot be properly depicted by a single bifurcating tree: some sites have evolved along a specific phylogenetic tree, others have followed another path. Methods available to analyse recombination in sequences usually involve an analysis of the alignment through sliding-windows, or are particularly demanding in computational resources, and are often limited to nucleotide sequences. In this article, we propose and implement a Mixture Model on trees and a phylogenetic Hidden Markov Model to reveal recombination breakpoints while searching for the various evolutionary histories that are present in an alignment known to have undergone homologous recombination. These models are sufficiently efficient to be applied to dozens of sequences on a single desktop computer, and can handle equivalently nucleotide or protein sequences. We estimate their accuracy on simulated sequences and test them on real data.
Aluminum break-point contacts

NARCIS (Netherlands)

Heinemann, Martina; Groot, R.A. de

1997-01-01

Ab initio molecular dynamics is used to study the contribution of a single Al atom to an aluminum breakpoint contact during the final stages of breaking and the initial stages of the formation of such a contact. A hysteresis effect is found in excellent agreement with experiment and the form of the
Molecular population genetics of inversion breakpoint regions in Drosophila pseudoobscura.

Science.gov (United States)

Wallace, Andre G; Detweiler, Don; Schaeffer, Stephen W

2013-07-08

Paracentric inversions in populations can have a profound effect on the pattern and organization of nucleotide variability along a chromosome. Regions near inversion breakpoints are expected to have greater levels of differentiation because of reduced genetic exchange between different gene arrangements whereas central regions in the inverted segments are predicted to have lower levels of nucleotide differentiation due to greater levels of genetic flux among different karyotypes. We used the inversion polymorphism on the third chromosome of Drosophila pseudoobscura to test these predictions with an analysis of nucleotide diversity of 18 genetic markers near and away from inversion breakpoints. We tested hypotheses about how the presence of different chromosomal arrangements affects the pattern and organization of nucleotide variation. Overall, markers in the distal segment of the chromosome had greater levels of nucleotide heterozygosity than markers within the proximal segment of the chromosome. In addition, our results rejected the hypothesis that the breakpoints of derived inversions will have lower levels of nucleotide variability than breakpoints of ancestral inversions, even when strains with gene conversion events were removed. High levels of linkage disequilibrium were observed within all 11 breakpoint regions as well as between the ends of most proximal and distal breakpoints. The central region of the chromosome had the greatest levels of linkage disequilibrium compared with the proximal and distal regions because this is the region that experiences the highest level of recombination suppression. These data do not fully support the idea that genetic exchange is the sole force that influences genetic variation on inverted chromosomes.

Rapid mapping of chromosomal breakpoints: from blood to BAC in 20 days.

Energy Technology Data Exchange (ETDEWEB)

Lu, Chun-Mei; Kwan, Johnson; Weier, Jingly F.; Baumgartner, Aldof; Wang, Mei; Escudero, Tomas; Munne, Santiago; Weier, Heinz-Ulrich

2009-02-25

Structural chromosome aberrations and associated segmental or chromosomal aneusomies are major causes of reproductive failure in humans. Despite the fact that carriers of reciprocal balanced translocation often have no other clinical symptoms or disease, impaired chromosome homologue pairing in meiosis and karyokinesis errors lead to over-representation of translocations carriers in the infertile population and in recurrent pregnancy loss patients. At present, clinicians have no means to select healthy germ cells or balanced zygotes in vivo, but in vitro fertilization (IVF) followed by preimplantation genetic diagnosis (PGD) offers translocation carriers a chance to select balanced or normal embryos for transfer. Although a combination of telomeric and centromeric probes can differentiate embryos that are unbalanced from normal or unbalanced ones, a seemingly random position of breakpoints in these IVF-patients poses a serious obstacle to differentiating between normal and balanced embryos, which for most translocation couples, is desirable. Using a carrier with reciprocal translocation t(4;13) as an example, we describe our state-of-the-art approach to the preparation of patient-specific DNA probes that span or 'extent' the breakpoints. With the techniques and resources described here, most breakpoints can be accurately mapped in a matter of days using carrier lymphocytes, and a few extra days are allowed for PGD-probe optimization. The optimized probes will then be suitable for interphase cell analysis, a prerequisite for PGD since blastomeres are biopsied from normally growing day 3 - embryos regardless of their position in the mitotic cell cycle. Furthermore, routine application of these rapid methods should make PGD even more affordable for translocation carriers enrolled in IVF programs.
A molecular perspective on a complex polymorphic inversion system with cytological evidence of multiply reused breakpoints.

Science.gov (United States)

Orengo, D J; Puerma, E; Papaceit, M; Segarra, C; Aguadé, M

2015-06-01

Genome sequence comparison across the Drosophila genus revealed that some fixed inversion breakpoints had been multiply reused at this long timescale. Cytological studies of Drosophila inversion polymorphism had previously shown that, also at this shorter timescale, some breakpoints had been multiply reused. The paucity of molecularly characterized polymorphic inversion breakpoints has so far precluded contrasting whether cytologically shared breakpoints of these relatively young inversions are actually reused at the molecular level. The E chromosome of Drosophila subobscura stands out because it presents several inversion complexes. This is the case of the E1+2+9+3 arrangement that originated from the ancestral Est arrangement through the sequential accumulation of four inversions (E1, E2, E9 and E3) sharing some breakpoints. We recently identified the breakpoints of inversions E1 and E2, which allowed establishing reuse at the molecular level of the cytologically shared breakpoint of these inversions. Here, we identified and sequenced the breakpoints of inversions E9 and E3, because they share breakpoints at sections 58D and 64C with those of inversions E1 and E2. This has allowed establishing that E9 and E3 originated through the staggered-break mechanism. Most importantly, sequence comparison has revealed the multiple reuse at the molecular level of the proximal breakpoint (section 58D), which would have been used at least by inversions E2, E9 and E3. In contrast, the distal breakpoint (section 64C) might have been only reused once by inversions E1 and E2, because the distal E3 breakpoint is displaced >70 kb from the other breakpoint limits.
The analysis of distribution of the chromosome aberration breakpoints from medical diagnostic X-ray workers

International Nuclear Information System (INIS)

Wang Qin; Li Jin; Tang Weisheng; Wang Zhiquan

2003-01-01

Objective: To analyze the distribution of the chromosome aberration breakpoints from medical diagnostic x-ray workers. Methods: The breakpoints of lymphocyte chromosomes are analyzed using G-banding. Results: There are 146 breakpoints among 3545 metaphase in 37 cases of X-ray workers. There are statistically significant differences between observed values and expected values (χ 2 =42.82, df=23, P 0.05). Conclusion: The chromosome aberration breakpoints of medical diagnostic X-ray workers are non-random. The observed values of breakpoint numbers are higher than those of the expected values in 7 and 14 chromosomes (P<0.05)
Monitoring Forest Dynamics in the Andean Amazon: The Applicability of Breakpoint Detection Methods Using Landsat Time-Series and Genetic Algorithms

Directory of Open Access Journals (Sweden)

Fabián Santos

2017-01-01

-scale projects. In exceptional cases when data quality and quantity were adequate, we recommend the pre-processing approaches, noise reduction algorithms and breakpoint detection algorithms procedures that can enhance results. Finally, we include recommendations for achieving a faster and more accurate calibration of complex functions applied to remote sensing using genetic algorithms.
Structure and population genetics of the breakpoints of a polymorphic inversion in Drosophila subobscura.

Science.gov (United States)

Papaceit, Montserrat; Segarra, Carmen; Aguadé, Montserrat

2013-01-01

Drosophila subobscura is a paleartic species of the obscura group with a rich chromosomal polymorphism. To further our understanding on the origin of inversions and on how they regain variation, we have identified and sequenced the two breakpoints of a polymorphic inversion of D. subobscura--inversion 3 of the O chromosome--in a population sample. The breakpoints could be identified as two rather short fragments (∼300 bp and 60 bp long) with no similarity to any known transposable element family or repetitive sequence. The presence of the ∼300-bp fragment at the two breakpoints of inverted chromosomes implies its duplication, an indication of the inversion origin via staggered double-strand breaks. Present results and previous findings support that the mode of origin of inversions is neither related to the inversion age nor species-group specific. The breakpoint regions do not consistently exhibit the lower level of variation within and stronger genetic differentiation between arrangements than more internal regions that would be expected, even in moderately small inversions, if gene conversion were greatly restricted at inversion breakpoints. Comparison of the proximal breakpoint region in species of the obscura group shows that this breakpoint lies in a small high-turnover fragment within a long collinear region (∼300 kb). © 2012 The Author(s). Evolution© 2012 The Society for the Study of Evolution.
Recurrence of Chromosome Rearrangements and Reuse of DNA Breakpoints in the Evolution of the Triticeae Genomes

Directory of Open Access Journals (Sweden)

Wanlong Li

2016-12-01

Full Text Available Chromosomal rearrangements (CRs play important roles in karyotype diversity and speciation. While many CR breakpoints have been characterized at the sequence level in yeast, insects, and primates, little is known about the structure of evolutionary CR breakpoints in plant genomes, which are much more dynamic in genome size and sequence organization. Here, we report identification of breakpoints of a translocation between chromosome arms 4L and 5L of Triticeae, which is fixed in several species, including diploid wheat and rye, by comparative mapping and analysis of the draft genome and chromosome survey sequences of the Triticeae species. The wheat translocation joined the ends of breakpoints downstream of a WD40 gene on 4AL and a gene of the PMEI family on 5AL. A basic helix-loop-helix transcription factor gene in 5AL junction was significantly restructured. Rye and wheat share the same position for the 4L breakpoint, but the 5L breakpoint positions are not identical, although very close in these two species, indicating the recurrence of 4L/5L translocations in the Triticeae. Although barley does not carry the translocation, collinearity across the breakpoints was violated by putative inversions and/or transpositions. Alignment with model grass genomes indicated that the translocation breakpoints coincided with ancient inversion junctions in the Triticeae ancestor. Our results show that the 4L/5L translocation breakpoints represent two CR hotspots reused during Triticeae evolution, and support breakpoint reuse as a widespread mechanism in all eukaryotes. The mechanisms of the recurrent translocation and its role in Triticeae evolution are also discussed.
Heterogeneity of chromosome 22 breakpoint in Philadelphia-positive (Ph+) acute lymphocytic leukemia

International Nuclear Information System (INIS)

Erikson, J.; Griffin, C.A.; Ar-Rushdi, A.

1986-01-01

In chronic myelogenous leukemias (CML) with the t(9;22)(q34;q11) chromosome translocation the breakpoints on chromosome 22 occur within a 5.8-kilobase segment of DNA referred to as breakpoint cluster region (bcr). The same cytogenetically indinstinguishable translocation occurs in approximately 10% of patients with acute lymphocytic leukemias (ALL). In this study the authors have investigated the chromosome breakpoints in several cases of ALL carrying the t(9;22) translocation. In three of five cases of ALL they found that the bcr region was not involved in the chromosome rearrangement and that the 22q11 chromosome breakpoints were proximal (5') to the bcr region at band 22q11. In addition, they observed normal size bcr and c-alb transcripts in an ALL cell line carrying the t(9;22) translocation. They conclude, therefore, that if c-alb is inappropriately expressed in ALL cells without bcr rearrangements, the genetic mechanism of activation must be different from that reported for CML
Segmental Duplication, Microinversion, and Gene Loss Associated with a Complex Inversion Breakpoint Region in Drosophila

Science.gov (United States)

Calvete, Oriol; González, Josefa; Betrán, Esther; Ruiz, Alfredo

2012-01-01

Chromosomal inversions are usually portrayed as simple two-breakpoint rearrangements changing gene order but not gene number or structure. However, increasing evidence suggests that inversion breakpoints may often have a complex structure and entail gene duplications with potential functional consequences. Here, we used a combination of different techniques to investigate the breakpoint structure and the functional consequences of a complex rearrangement fixed in Drosophila buzzatii and comprising two tandemly arranged inversions sharing the middle breakpoint: 2m and 2n. By comparing the sequence in the breakpoint regions between D. buzzatii (inverted chromosome) and D. mojavensis (noninverted chromosome), we corroborate the breakpoint reuse at the molecular level and infer that inversion 2m was associated with a duplication of a ∼13 kb segment and likely generated by staggered breaks plus repair by nonhomologous end joining. The duplicated segment contained the gene CG4673, involved in nuclear transport, and its two nested genes CG5071 and CG5079. Interestingly, we found that other than the inversion and the associated duplication, both breakpoints suffered additional rearrangements, that is, the proximal breakpoint experienced a microinversion event associated at both ends with a 121-bp long duplication that contains a promoter. As a consequence of all these different rearrangements, CG5079 has been lost from the genome, CG5071 is now a single copy nonnested gene, and CG4673 has a transcript ∼9 kb shorter and seems to have acquired a more complex gene regulation. Our results illustrate the complex effects of chromosomal rearrangements and highlight the need of complementing genomic approaches with detailed sequence-level and functional analyses of breakpoint regions if we are to fully understand genome structure, function, and evolutionary dynamics. PMID:22328714
Major Chromosomal Breakpoint Intervals in Breast Cancer Co-Localize with Differentially Methylated Regions

Energy Technology Data Exchange (ETDEWEB)

Eric Tang, Man-Hung [Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (United States); Department of Oncology, Clinical Sciences, Lund University, Lund (Sweden); Varadan, Vinay; Kamalakaran, Sitharthan [Philips Research North America, Briarcliff Manor, NY (United States); Zhang, Michael Q. [Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (United States); The University of Texas at Dallas, Richardson, TX (United States); Tsinghua University, Beijing (China); Dimitrova, Nevenka, E-mail: nevenka.dimitrova@philips.com [Philips Research North America, Briarcliff Manor, NY (United States); Hicks, James, E-mail: hicks@cshl.edu [Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (United States)

2012-12-27

Solid tumors exhibit chromosomal rearrangements resulting in gain or loss of multiple chromosomal loci (copy number variation, or CNV), and translocations that occasionally result in the creation of novel chimeric genes. In the case of breast cancer, although most individual tumors each have unique CNV landscape, the breakpoints, as measured over large datasets, appear to be non-randomly distributed in the genome. Breakpoints show a significant regional concentration at genomic loci spanning perhaps several megabases. The proximal cause of these breakpoint concentrations is a subject of speculation, but is, as yet, largely unknown. To shed light on this issue, we have performed a bio-statistical analysis on our previously published data for a set of 119 breast tumors and normal controls (Wiedswang et al., 2003), where each sample has both high-resolution CNV and methylation data. The method examined the distribution of closeness of breakpoint regions with differentially methylated regions (DMR), coupled with additional genomic parameters, such as repeat elements and designated “fragile sites” in the reference genome. Through this analysis, we have identified a set of 93 regional loci called breakpoint enriched DMR (BEDMRs) characterized by altered DNA methylation in cancer compared to normal cells that are associated with frequent breakpoint concentrations within a distance of 1 Mb. BEDMR loci are further associated with local hypomethylation (66%), concentrations of the Alu SINE repeats within 3 Mb (35% of the cases), and tend to occur near a number of cancer related genes such as the protocadherins, AKT1, DUB3, GAB2. Furthermore, BEDMRs seem to deregulate members of the histone gene family and chromatin remodeling factors, e.g., JMJD1B, which might affect the chromatin structure and disrupt coordinate signaling and repair. From this analysis we propose that preference for chromosomal breakpoints is related to genome structure coupled with alterations in DNA
Distribution of X-ray-induced chromosome breakpoints in Down syndrome lymphocytes

International Nuclear Information System (INIS)

Shafik, H.M.; Au, W.W.; Whorton, E.B. Jr.; Legator, M.S.

1990-01-01

Down syndrome (DS) individuals are known to be predisposed to develop leukemia and their lymphocytes are highly sensitive to the induction of chromosome aberrations by X-rays. A study was conducted to identify the chromosome breakpoints and to evaluate whether site specificity for chromosome breakage and rearrangement may exist which may explain the predisposition phenomenon. DS lymphocytes at the G1 phase of the cell cycle were irradiated with 300, 450, and 600 rad of X-rays. Cells were harvested after 3 days in culture and 193 G-banded karyotypes were analyzed to identify the induced chromosome abnormalities. Out of 273 breakpoints identified, 122 were involved in the formation of stable chromosome rearrangements and 151 in the formation of unstable abnormalities. The Poisson analysis of these breakpoints demonstrated that 16 chromosome bands located in chromosomes 1, 3, 7, 12, 17, 19 and X were preferentially involved in breakage and rearrangement (P less than 0.05). These 16 bands are also found to be locations of cancer breakpoints, oncogenes, or fragile sites. Many abnormal cells were observed to carry stable chromosome rearrangements only. Therefore, these cells are presumed to be compatible with survival and to be initiated in the transformation process. We propose that similar stable and site-specific chromosome rearrangements may exist in proliferating cells in DS individuals after exposure to clastogens and that this abnormality predisposes them to develop leukemia
Investigating the role of X chromosome breakpoints in premature ovarian failure

Directory of Open Access Journals (Sweden)

Baronchelli Simona

2012-07-01

Full Text Available Abstract The importance of the genetic factor in the aetiology of premature ovarian failure (POF is emphasized by the high percentage of familial cases and X chromosome abnormalities account for 10% of chromosomal aberrations. In this study, we report the detailed analysis of 4 chromosomal abnormalities involving the X chromosome and associated with POF that were detected during a screening of 269 affected women. Conventional and molecular cytogenetics were valuable tools for locating the breakpoint regions and thus the following karyotypes were defined: 46,X,der(Xt(X;19(p21.1;q13.42mat, 46,X,t(X;2(q21.33;q14.3dn, 46,X,der(Xt(X;Y(q26.2;q11.223mat and 46,X,t(X;13(q13.3;q31dn. A bioinformatic analysis of the breakpoint regions identified putative candidate genes for ovarian failure near the breakpoint regions on the X chromosome or on autosomes that were involved in the translocation event. HS6ST1, HS6ST2 and MATER genes were identified and their functions and a literature review revealed an interesting connection to the POF phenotype. Moreover, the 19q13.32 locus is associated with the age of onset of the natural menopause. These results support the position effect of the breakpoint on flanking genes, and cytogenetic techniques, in combination with bioinformatic analysis, may help to improve what is known about this puzzling disorder and its diagnostic potential.
Tentative minimum inhibitory concentration and zone diameter breakpoints for moxifloxacin using BSAC criteria.

Science.gov (United States)

Andrews, J M; Ashby, J P; Jevons, G M; Wise, R

1999-12-01

Tentative MIC and zone diameter breakpoints were determined for moxifloxacin using BSAC criteria. An MIC breakpoint of or = 20 mm for Enterobacteriaceae and staphylococci, 18 mm for the respiratory pathogens (Streptococcus pneumoniae, Haemophilus influenzae and Moraxella catarrhalis) and 15 mm for enterococci. For Pseudomonas aeruginosa with a 5 microg disc, three bands are suggested for interpretation, that of > or = 25 mm (sensitive), 18-24 mm (intermediate) and < or = 17 mm (resistant).
Identification of chromosome 7 inversion breakpoints in an autistic family narrows candidate region for autism susceptibility.

Science.gov (United States)

Cukier, Holly N; Skaar, David A; Rayner-Evans, Melissa Y; Konidari, Ioanna; Whitehead, Patrice L; Jaworski, James M; Cuccaro, Michael L; Pericak-Vance, Margaret A; Gilbert, John R

2009-10-01

Chromosomal breaks and rearrangements have been observed in conjunction with autism and autistic spectrum disorders. A chromosomal inversion has been previously reported in autistic siblings, spanning the region from approximately 7q22.1 to 7q31. This family is distinguished by having multiple individuals with autism and associated disabilities. The region containing the inversion has been strongly implicated in autism by multiple linkage studies, and has been particularly associated with language defects in autism as well as in other disorders with language components. Mapping of the inversion breakpoints by FISH has localized the inversion to the region spanning approximately 99-108.75 Mb of chromosome 7. The proximal breakpoint has the potential to disrupt either the coding sequence or regulatory regions of a number of cytochrome P450 genes while the distal region falls in a relative gene desert. Copy number variant analysis of the breakpoint regions detected no duplication or deletion that could clearly be associated with disease status. Association analysis in our autism data set using single nucleotide polymorphisms located near the breakpoints showed no significant association with proximal breakpoint markers, but has identified markers near the distal breakpoint ( approximately 108-110 Mb) with significant associations to autism. The chromosomal abnormality in this family strengthens the case for an autism susceptibility gene in the chromosome 7q22-31 region and targets a candidate region for further investigation.
A new approach to assess COPD by identifying lung function break-points

Directory of Open Access Journals (Sweden)

Eriksson G

2015-10-01

Full Text Available Göran Eriksson,1,* Linnea Jarenbäck,1,* Stefan Peterson,2 Jaro Ankerst,1 Leif Bjermer,1 Ellen Tufvesson11Respiratory Medicine and Allergology, Department of Clinical Sciences, Lund University, 2Regional Cancer Center South, Skåne University Hospital, Lund, Sweden*These authors contributed equally to this workPurpose: COPD is a progressive disease, which can take different routes, leading to great heterogeneity. The aim of the post-hoc analysis reported here was to perform continuous analyses of advanced lung function measurements, using linear and nonlinear regressions.Patients and methods: Fifty-one COPD patients with mild to very severe disease (Global Initiative for Chronic Obstructive Lung Disease [GOLD] Stages I–IV and 41 healthy smokers were investigated post-bronchodilation by flow-volume spirometry, body plethysmography, diffusion capacity testing, and impulse oscillometry. The relationship between COPD severity, based on forced expiratory volume in 1 second (FEV1, and different lung function parameters was analyzed by flexible nonparametric method, linear regression, and segmented linear regression with break-points.Results: Most lung function parameters were nonlinear in relation to spirometric severity. Parameters related to volume (residual volume, functional residual capacity, total lung capacity, diffusion capacity [diffusion capacity of the lung for carbon monoxide], diffusion capacity of the lung for carbon monoxide/alveolar volume and reactance (reactance area and reactance at 5Hz were segmented with break-points at 60%–70% of FEV1. FEV1/forced vital capacity (FVC and resonance frequency had break-points around 80% of FEV1, while many resistance parameters had break-points below 40%. The slopes in percent predicted differed; resistance at 5 Hz minus resistance at 20 Hz had a linear slope change of -5.3 per unit FEV1, while residual volume had no slope change above and -3.3 change per unit FEV1 below its break-point of 61
Effects of Phylogenetic Tree Style on Student Comprehension

Science.gov (United States)

Dees, Jonathan Andrew

Phylogenetic trees are powerful tools of evolutionary biology that have become prominent across the life sciences. Consequently, learning to interpret and reason from phylogenetic trees is now an essential component of biology education. However, students often struggle to understand these diagrams, even after explicit instruction. One factor that has been observed to affect student understanding of phylogenetic trees is style (i.e., diagonal or bracket). The goal of this dissertation research was to systematically explore effects of style on student interpretations and construction of phylogenetic trees in the context of an introductory biology course. Before instruction, students were significantly more accurate with bracket phylogenetic trees for a variety of interpretation and construction tasks. Explicit instruction that balanced the use of diagonal and bracket phylogenetic trees mitigated some, but not all, style effects. After instruction, students were significantly more accurate for interpretation tasks involving taxa relatedness and construction exercises when using the bracket style. Based on this dissertation research and prior studies on style effects, I advocate for introductory biology instructors to use only the bracket style. Future research should examine causes of style effects and variables other than style to inform the development of research-based instruction that best supports student understanding of phylogenetic trees.
Reconstructing phylogenetic networks using maximum parsimony.

Science.gov (United States)

Nakhleh, Luay; Jin, Guohua; Zhao, Fengmei; Mellor-Crummey, John

2005-01-01

Phylogenies - the evolutionary histories of groups of organisms - are one of the most widely used tools throughout the life sciences, as well as objects of research within systematics, evolutionary biology, epidemiology, etc. Almost every tool devised to date to reconstruct phylogenies produces trees; yet it is widely understood and accepted that trees oversimplify the evolutionary histories of many groups of organims, most prominently bacteria (because of horizontal gene transfer) and plants (because of hybrid speciation). Various methods and criteria have been introduced for phylogenetic tree reconstruction. Parsimony is one of the most widely used and studied criteria, and various accurate and efficient heuristics for reconstructing trees based on parsimony have been devised. Jotun Hein suggested a straightforward extension of the parsimony criterion to phylogenetic networks. In this paper we formalize this concept, and provide the first experimental study of the quality of parsimony as a criterion for constructing and evaluating phylogenetic networks. Our results show that, when extended to phylogenetic networks, the parsimony criterion produces promising results. In a great majority of the cases in our experiments, the parsimony criterion accurately predicts the numbers and placements of non-tree events.
Clinical relevance of the breakpoint sites within the M-BCR in 50 patients from Argentina with chronic myeloid leukemia.

Science.gov (United States)

Giere, I A; Larripa, I B

1996-08-01

Fifty patients from Argentina with chronic myeloid leukemia (CML) were studied in order to characterize the breakpoint site within the major breakpoint cluster region (M-BCR) and its relationship with the duration of the chronic phase (CP). The DNA digestion with the restriction enzymes: Bgl II, BAM HI and Hind III and hybridization with the 1.2Kb Hind III-Bgl II bcr probe showed that 56% of cases had the breakpoint in 5'M-bcr region and the remaining 44% in 3'M-bcr region. The duration of chronic phase from diagnosis to the onset of the blast crisis (BC) was correlated with the location of the breakpoint within the M-bcr and no statistical differences were observed between the 5' and the 3' groups. These data indicate that the breakpoint site within the bcr gene is not a prognostic indicator of the duration of CP of the disease.
European gene mapping project (EUROGEM) : Breakpoint panels for human chromosomes based on the CEPH reference families

NARCIS (Netherlands)

Attwood, J; Bryant, SP; Bains, R; Povey, R; Povey, S; Rebello, M; Kapsetaki, M; Moschonas, NK; Grzeschik, KH; Otto, M; Dixon, M; Sudworth, HE; Kooy, RF; Wright, A; Teague, P; Terrenato, L; Vergnaud, G; Monfouilloux, S; Weissenbach, J; Alibert, O; Dib, C; Faure, S; Bakker, E; Pearson, NM; Vossen, RHAM; Gal, A; MuellerMyhsok, B; Cann, HM; Spurr, NK

Meiotic breakpoint panels for human chromosomes 2, 3, 4, 5, 6, 7, 8, 9, 10, 13, 14, 15, 17; 18, 20 and X were constructed from genotypes from the CEPH reference families. Each recombinant chromosome included has a breakpoint well-supported with reference to defined quantitative criteria. The panels
Gene alterations at Drosophila inversion breakpoints provide prima facie evidence for natural selection as an explanation for rapid chromosomal evolution.

Science.gov (United States)

Guillén, Yolanda; Ruiz, Alfredo

2012-02-01

Chromosomal inversions have been pervasive during the evolution of the genus Drosophila, but there is significant variation between lineages in the rate of rearrangement fixation. D. mojavensis, an ecological specialist adapted to a cactophilic niche under extreme desert conditions, is a chromosomally derived species with ten fixed inversions, five of them not present in any other species. In order to explore the causes of the rapid chromosomal evolution in D. mojavensis, we identified and characterized all breakpoints of seven inversions fixed in chromosome 2, the most dynamic one. One of the inversions presents unequivocal evidence for its generation by ectopic recombination between transposon copies and another two harbor inverted duplications of non-repetitive DNA at the two breakpoints and were likely generated by staggered single-strand breaks and repair by non-homologous end joining. Four out of 14 breakpoints lay in the intergenic region between preexisting duplicated genes, suggesting an adaptive advantage of separating previously tightly linked duplicates. Four out of 14 breakpoints are associated with transposed genes, suggesting these breakpoints are fragile regions. Finally two inversions contain novel genes at their breakpoints and another three show alterations of genes at breakpoints with potential adaptive significance. D. mojavensis chromosomal inversions were generated by multiple mechanisms, an observation that does not provide support for increased mutation rate as explanation for rapid chromosomal evolution. On the other hand, we have found a number of gene alterations at the breakpoints with putative adaptive consequences that directly point to natural selection as the cause of D. mojavensis rapid chromosomal evolution.
Gene alterations at Drosophila inversion breakpoints provide prima facie evidence for natural selection as an explanation for rapid chromosomal evolution

Directory of Open Access Journals (Sweden)

Guillén Yolanda

2012-02-01

Full Text Available Abstract Background Chromosomal inversions have been pervasive during the evolution of the genus Drosophila, but there is significant variation between lineages in the rate of rearrangement fixation. D. mojavensis, an ecological specialist adapted to a cactophilic niche under extreme desert conditions, is a chromosomally derived species with ten fixed inversions, five of them not present in any other species. Results In order to explore the causes of the rapid chromosomal evolution in D. mojavensis, we identified and characterized all breakpoints of seven inversions fixed in chromosome 2, the most dynamic one. One of the inversions presents unequivocal evidence for its generation by ectopic recombination between transposon copies and another two harbor inverted duplications of non-repetitive DNA at the two breakpoints and were likely generated by staggered single-strand breaks and repair by non-homologous end joining. Four out of 14 breakpoints lay in the intergenic region between preexisting duplicated genes, suggesting an adaptive advantage of separating previously tightly linked duplicates. Four out of 14 breakpoints are associated with transposed genes, suggesting these breakpoints are fragile regions. Finally two inversions contain novel genes at their breakpoints and another three show alterations of genes at breakpoints with potential adaptive significance. Conclusions D. mojavensis chromosomal inversions were generated by multiple mechanisms, an observation that does not provide support for increased mutation rate as explanation for rapid chromosomal evolution. On the other hand, we have found a number of gene alterations at the breakpoints with putative adaptive consequences that directly point to natural selection as the cause of D. mojavensis rapid chromosomal evolution.

Gene alterations at Drosophila inversion breakpoints provide prima facie evidence for natural selection as an explanation for rapid chromosomal evolution

Science.gov (United States)

2012-01-01

Background Chromosomal inversions have been pervasive during the evolution of the genus Drosophila, but there is significant variation between lineages in the rate of rearrangement fixation. D. mojavensis, an ecological specialist adapted to a cactophilic niche under extreme desert conditions, is a chromosomally derived species with ten fixed inversions, five of them not present in any other species. Results In order to explore the causes of the rapid chromosomal evolution in D. mojavensis, we identified and characterized all breakpoints of seven inversions fixed in chromosome 2, the most dynamic one. One of the inversions presents unequivocal evidence for its generation by ectopic recombination between transposon copies and another two harbor inverted duplications of non-repetitive DNA at the two breakpoints and were likely generated by staggered single-strand breaks and repair by non-homologous end joining. Four out of 14 breakpoints lay in the intergenic region between preexisting duplicated genes, suggesting an adaptive advantage of separating previously tightly linked duplicates. Four out of 14 breakpoints are associated with transposed genes, suggesting these breakpoints are fragile regions. Finally two inversions contain novel genes at their breakpoints and another three show alterations of genes at breakpoints with potential adaptive significance. Conclusions D. mojavensis chromosomal inversions were generated by multiple mechanisms, an observation that does not provide support for increased mutation rate as explanation for rapid chromosomal evolution. On the other hand, we have found a number of gene alterations at the breakpoints with putative adaptive consequences that directly point to natural selection as the cause of D. mojavensis rapid chromosomal evolution. PMID:22296923
Phylogenetically Acquired Representations and Evolutionary Algorithms.

OpenAIRE

Wozniak , Adrianna

2006-01-01

First, we explain why Genetic Algorithms (GAs), inspired by the Modern Synthesis, do not accurately model biological evolution, being rather an artificial version of artificial, rather than natural selection. Being focused on optimisation, we propose two improvements of GAs, with the aim to successfully generate adapted, desired behaviour. The first one concerns phylogenetic grounding of meaning, a way to avoid the Symbol Grounding Problem. We give a definition of Phylogenetically Acquired Re...
Using ESTs for phylogenomics: Can one accurately infer a phylogenetic tree from a gappy alignment?

Directory of Open Access Journals (Sweden)

Hartmann Stefanie

2008-03-01

Full Text Available Abstract Background While full genome sequences are still only available for a handful of taxa, large collections of partial gene sequences are available for many more. The alignment of partial gene sequences results in a multiple sequence alignment containing large gaps that are arranged in a staggered pattern. The consequences of this pattern of missing data on the accuracy of phylogenetic analysis are not well understood. We conducted a simulation study to determine the accuracy of phylogenetic trees obtained from gappy alignments using three commonly used phylogenetic reconstruction methods (Neighbor Joining, Maximum Parsimony, and Maximum Likelihood and studied ways to improve the accuracy of trees obtained from such datasets. Results We found that the pattern of gappiness in multiple sequence alignments derived from partial gene sequences substantially compromised phylogenetic accuracy even in the absence of alignment error. The decline in accuracy was beyond what would be expected based on the amount of missing data. The decline was particularly dramatic for Neighbor Joining and Maximum Parsimony, where the majority of gappy alignments contained 25% to 40% incorrect quartets. To improve the accuracy of the trees obtained from a gappy multiple sequence alignment, we examined two approaches. In the first approach, alignment masking, potentially problematic columns and input sequences are excluded from from the dataset. Even in the absence of alignment error, masking improved phylogenetic accuracy up to 100-fold. However, masking retained, on average, only 83% of the input sequences. In the second approach, alignment subdivision, the missing data is statistically modelled in order to retain as many sequences as possible in the phylogenetic analysis. Subdivision resulted in more modest improvements to alignment accuracy, but succeeded in including almost all of the input sequences. Conclusion These results demonstrate that partial gene
Using ESTs for phylogenomics: can one accurately infer a phylogenetic tree from a gappy alignment?

Science.gov (United States)

Hartmann, Stefanie; Vision, Todd J

2008-03-26

While full genome sequences are still only available for a handful of taxa, large collections of partial gene sequences are available for many more. The alignment of partial gene sequences results in a multiple sequence alignment containing large gaps that are arranged in a staggered pattern. The consequences of this pattern of missing data on the accuracy of phylogenetic analysis are not well understood. We conducted a simulation study to determine the accuracy of phylogenetic trees obtained from gappy alignments using three commonly used phylogenetic reconstruction methods (Neighbor Joining, Maximum Parsimony, and Maximum Likelihood) and studied ways to improve the accuracy of trees obtained from such datasets. We found that the pattern of gappiness in multiple sequence alignments derived from partial gene sequences substantially compromised phylogenetic accuracy even in the absence of alignment error. The decline in accuracy was beyond what would be expected based on the amount of missing data. The decline was particularly dramatic for Neighbor Joining and Maximum Parsimony, where the majority of gappy alignments contained 25% to 40% incorrect quartets. To improve the accuracy of the trees obtained from a gappy multiple sequence alignment, we examined two approaches. In the first approach, alignment masking, potentially problematic columns and input sequences are excluded from from the dataset. Even in the absence of alignment error, masking improved phylogenetic accuracy up to 100-fold. However, masking retained, on average, only 83% of the input sequences. In the second approach, alignment subdivision, the missing data is statistically modelled in order to retain as many sequences as possible in the phylogenetic analysis. Subdivision resulted in more modest improvements to alignment accuracy, but succeeded in including almost all of the input sequences. These results demonstrate that partial gene sequences and gappy multiple sequence alignments can pose a
SRBreak: A read-depth and split-read framework to identify breakpoints of different events inside simple copy-number variable regions

Directory of Open Access Journals (Sweden)

HOANG T NGUYEN

2016-09-01

Full Text Available Copy-number variation (CNV has been associated with increased risk of complex diseases. High throughput sequencing (HTS technologies facilitate the detection of copy-number variable regions (CNVRs and their breakpoints. This helps in understanding genome structures of genomes as well as their evolution process. Various approaches have been proposed for detecting CNV breakpoints, but currently it is still challenging for tools based on a single analysis method to identify breakpoints of CNVs. It has been shown, however, that pipelines which integrate multiple approaches are able to report more reliable breakpoints. Here, based on HTS data, we have developed a pipeline to identify approximate breakpoints (±10 bp relating to different ancestral events within a specific CNVR. The pipeline combines read-depth and split-read information to infer breakpoints, using information from multiple samples to allow an imputation approach to be taken. The main steps involve using a normal mixture model to cluster samples into different groups, followed by simple kernel-based approaches to maximise information obtained from read-depth and split-read approaches, after which common breakpoints of groups are inferred. The pipeline uses split-read information directly from CIGAR strings of BAM files, without using a re-alignment step. On simulated data sets, it was able to report breakpoints for very low-coverage samples including those for which only single-end reads were available. When applied to three loci from existing human resequencing data sets (NEGR1, LCE3, IRGM the pipeline obtained good concordance with results from the 1000 Genomes Project (92%, 100% and 82%, respectively.The package is available at https://github.com/hoangtn/SRBreak, and also as a docker-based application at https://registry.hub.docker.com/u/hoangtn/srbreak/.
Community Phylogenetics: Assessing Tree Reconstruction Methods and the Utility of DNA Barcodes

Science.gov (United States)

Boyle, Elizabeth E.; Adamowicz, Sarah J.

2015-01-01

Studies examining phylogenetic community structure have become increasingly prevalent, yet little attention has been given to the influence of the input phylogeny on metrics that describe phylogenetic patterns of co-occurrence. Here, we examine the influence of branch length, tree reconstruction method, and amount of sequence data on measures of phylogenetic community structure, as well as the phylogenetic signal (Pagel’s λ) in morphological traits, using Trichoptera larval communities from Churchill, Manitoba, Canada. We find that model-based tree reconstruction methods and the use of a backbone family-level phylogeny improve estimations of phylogenetic community structure. In addition, trees built using the barcode region of cytochrome c oxidase subunit I (COI) alone accurately predict metrics of phylogenetic community structure obtained from a multi-gene phylogeny. Input tree did not alter overall conclusions drawn for phylogenetic signal, as significant phylogenetic structure was detected in two body size traits across input trees. As the discipline of community phylogenetics continues to expand, it is important to investigate the best approaches to accurately estimate patterns. Our results suggest that emerging large datasets of DNA barcode sequences provide a vast resource for studying the structure of biological communities. PMID:26110886
Sequence analysis of the breakpoint regions of an X;5 translocation in a female with Duchenne muscular dystrophy

Energy Technology Data Exchange (ETDEWEB)

Bakel, I. van; Holt, S.; Craig, I. [Univ. of Oxford (United Kingdom)] [and others

1995-08-01

X;autosome translocations in females with Duchenne muscular dystrophy (DMD) provide an opportunity to study the mechanisms responsible for chromosomal rearrangements that occur in the germ line. We describe here a detailed molecular analysis of the translocation breakpoints of an X;autosome reciprocal translocation, t(X;5) (p21;q31.1), in a female with DMD. Cosmid clones that contained the X-chromosome breakpoint region were identified, and subclones that hybridized to the translocation junction fragment in restriction digests of the patient`s DNA were isolated and sequenced. Primers designed from the X-chromosomal sequence were used to obtain the junction fragments on the der(X) and the der(5) by inverse PCR. The resultant clones were also cloned and sequenced, and this information used to isolate the chromosome 5 breakpoint region. Comparison of the DNA sequences of the junction fragments with those of the breakpoint regions on chromosomes X and 5 revealed that the translocation arose by nonhomologous recombination with an imprecise reciprocal exchange. Four and six base pairs of unknown origin are inserted at the exchange points of the der(X) and der(5), respectively, and three nucleotides are deleted from the X-chromosome sequence. Two features were found that may have played a role in the generation of the translocation. These were (1) a repeat motif with an internal homopyrimidine stretch 10 bp upstream from the X-chromosome breakpoint and (2) a 9-bp sequence of 78% homology located near the breakpoints on chromosomes 5 and X. 32 refs., 4 figs., 2 tabs.
First report on an X-linked hypohidrotic ectodermal dysplasia family with X chromosome inversion: Breakpoint mapping reveals the pathogenic mechanism and preimplantation genetics diagnosis achieves an unaffected birth.

Science.gov (United States)

Wu, Tonghua; Yin, Biao; Zhu, Yuanchang; Li, Guangui; Ye, Lijun; Liang, Desheng; Zeng, Yong

2017-12-01

To investigate the etiology of X-linked hypohidrotic ectodermal dysplasia (XLHED) in a family with an inversion of the X chromosome [inv(X)(p21q13)] and to achieve a healthy birth following preimplantation genetic diagnosis (PGD). Next generation sequencing (NGS) and Sanger sequencing analysis were carried out to define the inversion breakpoint. Multiple displacement amplification, amplification of breakpoint junction fragments, Sanger sequencing of exon 1 of ED1, haplotyping of informative short tandem repeat markers and gender determination were performed for PGD. NGS data of the proband sample revealed that the size of the possible inverted fragment was over 42Mb, spanning from position 26, 814, 206 to position 69, 231, 915 on the X chromosome. The breakpoints were confirmed by Sanger sequencing. A total of 5 blastocyst embryos underwent trophectoderm biopsy. Two embryos were diagnosed as carriers and three were unaffected. Two unaffected blastocysts were transferred and a singleton pregnancy was achieved. Following confirmation by prenatal diagnosis, a healthy baby was delivered. This is the first report of an XLHED family with inv(X). ED1 is disrupted by the X chromosome inversion in this XLHED family and embryos with the X chromosomal abnormality can be accurately identified by means of PGD. Copyright © 2017. Published by Elsevier B.V.
Mutation analysis in Duchenne and Becker muscular dystrophy patients from Bulgaria shows a peculiar distribution of breakpoints by intron

Energy Technology Data Exchange (ETDEWEB)

Todorova, A.; Bronzova, J.; Kremensky, I. [Univ. Hospital of Obstetrics and Gynecology, Sofia (Bulgaria)] [and others

1996-10-02

For the first time in Bulgaria, a deletion/duplication screening was performed on a group of 84 unrelated Duchenne/Becker muscular dystrophy patients, and the breakpoint distribution in the dystrophin gene was analyzed. Intragenic deletions were detected in 67.8% of patients, and intragenic duplications in 2.4%. A peculiar distribution of deletion breakpoints was found. Only 13.2% of the deletion breakpoints fell in the {open_quotes}classical{close_quotes} hot spot in intron 44, whereas the majority (> 54%) were located within the segment encompassing introns 45-51, which includes intron 50, the richest in breakpoints (16%) in the Bulgarian sample. Comparison with data from Greece and Turkey points at the probable existence of a deletion hot spot within intron 50, which might be a characteristic of populations of the Balkan region. 17 refs., 2 figs.
Sequence characterisation of deletion breakpoints in the dystrophin gene by PCR

Energy Technology Data Exchange (ETDEWEB)

Abbs, S.; Sandhu, S.; Bobrow, M. [Guy`s Hospital, London (United Kingdom)

1994-09-01

Partial deletions of the dystrophin gene account for 65% of cases of Duchenne muscular dystrophy. A high proportion of these structural changes are generated by new mutational events, and lie predominantly within two `hotspot` regions, yet the underlying reasons for this are not known. We are characterizing and sequencing the regions surrounding deletion breakpoints in order to: (i) investigate the mechanisms of deletion mutation, and (ii) enable the design of PCR assays to specifically amplify mutant and normal sequences, allowing us to search for the presence of somatic mosaicism in appropriate family members. Using this approach we have been able to demonstrate the presence of somatic mosaicism in a maternal grandfather of a DMD-affected male, deleted for exons 49-50. Three deletions, namely of exons 48-49, 49-50, and 50, have been characterized using a PCR approach that avoids any cloning procedures. Breakpoints were initially localized to within regions of a few kilobases using Southern blot restriction analyses with exon-specific probes and PCR amplification of exonic and intronic loci. Sequencing was performed directly on PCR products: (i) mutant sequences were obtained from long-range or inverse-PCR across the deletion junction fragments, and (ii) normal sequences were obtained from the products of standard PCR, vectorette PCR, or inverse-PCR performed on YACs. Further characterization of intronic sequences will allow us to amplify and sequence across other deletion breakpoints and increase our knowledge of the mechanisms of mutation in the dystophin gene.
SVA retrotransposon insertion-associated deletion represents a novel mutational mechanism underlying large genomic copy number changes with non-recurrent breakpoints

Science.gov (United States)

2014-01-01

Background Genomic disorders are caused by copy number changes that may exhibit recurrent breakpoints processed by nonallelic homologous recombination. However, region-specific disease-associated copy number changes have also been observed which exhibit non-recurrent breakpoints. The mechanisms underlying these non-recurrent copy number changes have not yet been fully elucidated. Results We analyze large NF1 deletions with non-recurrent breakpoints as a model to investigate the full spectrum of causative mechanisms, and observe that they are mediated by various DNA double strand break repair mechanisms, as well as aberrant replication. Further, two of the 17 NF1 deletions with non-recurrent breakpoints, identified in unrelated patients, occur in association with the concomitant insertion of SINE/variable number of tandem repeats/Alu (SVA) retrotransposons at the deletion breakpoints. The respective breakpoints are refractory to analysis by standard breakpoint-spanning PCRs and are only identified by means of optimized PCR protocols designed to amplify across GC-rich sequences. The SVA elements are integrated within SUZ12P intron 8 in both patients, and were mediated by target-primed reverse transcription of SVA mRNA intermediates derived from retrotranspositionally active source elements. Both SVA insertions occurred during early postzygotic development and are uniquely associated with large deletions of 1 Mb and 867 kb, respectively, at the insertion sites. Conclusions Since active SVA elements are abundant in the human genome and the retrotranspositional activity of many SVA source elements is high, SVA insertion-associated large genomic deletions encompassing many hundreds of kilobases could constitute a novel and as yet under-appreciated mechanism underlying large-scale copy number changes in the human genome. PMID:24958239
Ant-Based Phylogenetic Reconstruction (ABPR: A new distance algorithm for phylogenetic estimation based on ant colony optimization

Directory of Open Access Journals (Sweden)

Karla Vittori

2008-12-01

Full Text Available We propose a new distance algorithm for phylogenetic estimation based on Ant Colony Optimization (ACO, named Ant-Based Phylogenetic Reconstruction (ABPR. ABPR joins two taxa iteratively based on evolutionary distance among sequences, while also accounting for the quality of the phylogenetic tree built according to the total length of the tree. Similar to optimization algorithms for phylogenetic estimation, the algorithm allows exploration of a larger set of nearly optimal solutions. We applied the algorithm to four empirical data sets of mitochondrial DNA ranging from 12 to 186 sequences, and from 898 to 16,608 base pairs, and covering taxonomic levels from populations to orders. We show that ABPR performs better than the commonly used Neighbor-Joining algorithm, except when sequences are too closely related (e.g., population-level sequences. The phylogenetic relationships recovered at and above species level by ABPR agree with conventional views. However, like other algorithms of phylogenetic estimation, the proposed algorithm failed to recover expected relationships when distances are too similar or when rates of evolution are very variable, leading to the problem of long-branch attraction. ABPR, as well as other ACO-based algorithms, is emerging as a fast and accurate alternative method of phylogenetic estimation for large data sets.
Quartet-net: a quartet-based method to reconstruct phylogenetic networks.

Science.gov (United States)

Yang, Jialiang; Grünewald, Stefan; Wan, Xiu-Feng

2013-05-01

Phylogenetic networks can model reticulate evolutionary events such as hybridization, recombination, and horizontal gene transfer. However, reconstructing such networks is not trivial. Popular character-based methods are computationally inefficient, whereas distance-based methods cannot guarantee reconstruction accuracy because pairwise genetic distances only reflect partial information about a reticulate phylogeny. To balance accuracy and computational efficiency, here we introduce a quartet-based method to construct a phylogenetic network from a multiple sequence alignment. Unlike distances that only reflect the relationship between a pair of taxa, quartets contain information on the relationships among four taxa; these quartets provide adequate capacity to infer a more accurate phylogenetic network. In applications to simulated and biological data sets, we demonstrate that this novel method is robust and effective in reconstructing reticulate evolutionary events and it has the potential to infer more accurate phylogenetic distances than other conventional phylogenetic network construction methods such as Neighbor-Joining, Neighbor-Net, and Split Decomposition. This method can be used in constructing phylogenetic networks from simple evolutionary events involving a few reticulate events to complex evolutionary histories involving a large number of reticulate events. A software called "Quartet-Net" is implemented and available at http://sysbio.cvm.msstate.edu/QuartetNet/.
On Unrooted and Root-Uncertain Variants of Several Well-Known Phylogenetic Network Problems

NARCIS (Netherlands)

van Iersel, L.J.J.; Kelk, Steven; Stougie, Leen; Boes, Olivier

2017-01-01

The hybridization number problem requires us to embed a set of binary rooted phylogenetic trees into a binary rooted phylogenetic network such that the number of nodes with indegree two is minimized. However, from a biological point of view accurately inferring the root location in a phylogenetic
En Route towards European Clinical breakpoints for veterinary antimicrobial susceptibility testing

NARCIS (Netherlands)

Toutain, Pierre Louis; Bousquet-Mélou, Alain; Damborg, Peter; Ferran, Aude A.; Mevius, Dik; Pelligand, Ludovic; Veldman, Kees T.; Lees, Peter

2017-01-01

VetCAST is the EUCAST sub-committee for Veterinary Antimicrobial Susceptibility Testing. Its remit is to define clinical breakpoints (CBPs) for antimicrobial drugs (AMDs) used in veterinary medicine in Europe. This position paper outlines the procedures and reviews scientific options to solve
SVA retrotransposon insertion-associated deletion represents a novel mutational mechanism underlying large genomic copy number changes with non-recurrent breakpoints

NARCIS (Netherlands)

J. Vogt (Julia); K. Bengesser (Kathrin); K.B.M. Claes (Kathleen B.M.); K. Wimmer (Katharina); V.-F. Mautner (Victor-Felix); R. van Minkelen (Rick); E. Legius (Eric); H. Brems (Hilde); M. Upadhyaya (Meena); J. Högel (Josef); C. Lazaro (Conxi); T. Rosenbaum (Thorsten); S. Bammert (Simone); L. Messiaen (Ludwine); D.N. Cooper (David); H. Kehrer-Sawatzki (Hildegard)

2014-01-01

textabstractBackground: Genomic disorders are caused by copy number changes that may exhibit recurrent breakpoints processed by nonallelic homologous recombination. However, region-specific disease-associated copy number changes have also been observed which exhibit non-recurrent breakpoints. The
Antimicrobial breakpoint estimation accounting for variability in pharmacokinetics

Directory of Open Access Journals (Sweden)

Nekka Fahima

2009-06-01

Full Text Available Abstract Background Pharmacokinetic and pharmacodynamic (PK/PD indices are increasingly being used in the microbiological field to assess the efficacy of a dosing regimen. In contrast to methods using MIC, PK/PD-based methods reflect in vivo conditions and are more predictive of efficacy. Unfortunately, they entail the use of one PK-derived value such as AUC or Cmax and may thus lead to biased efficiency information when the variability is large. The aim of the present work was to evaluate the efficacy of a treatment by adjusting classical breakpoint estimation methods to the situation of variable PK profiles. Methods and results We propose a logical generalisation of the usual AUC methods by introducing the concept of "efficiency" for a PK profile, which involves the efficacy function as a weight. We formulated these methods for both classes of concentration- and time-dependent antibiotics. Using drug models and in silico approaches, we provide a theoretical basis for characterizing the efficiency of a PK profile under in vivo conditions. We also used the particular case of variable drug intake to assess the effect of the variable PK profiles generated and to analyse the implications for breakpoint estimation. Conclusion Compared to traditional methods, our weighted AUC approach gives a more powerful PK/PD link and reveals, through examples, interesting issues about the uniqueness of therapeutic outcome indices and antibiotic resistance problems.
Phylogenetic relationships and the occurrence of interspecific recombination between beet chlorosis virus (BChV) and Beet mild yellowing virus (BMYV).

Science.gov (United States)

Kozlowska-Makulska, Anna; Hasiow-Jaroszewska, Beata; Szyndel, Marek S; Herrbach, Etienne; Bouzoubaa, Salah; Lemaire, Olivier; Beuve, Monique

2015-02-01

Samples containing two viruses belonging to the genus Polerovirus, beet chlorosis virus (BChV) and beet mild yellowing virus (BMYV), were collected from French and Polish sugar beet fields. The molecular properties of 24 isolates of BChV and BMYV were investigated, and their genetic diversity was examined in the coat protein (CP)- and P0-encoding genes. For the first time, we have demonstrated that beet polerovirus populations include recombinants between BChV and BMYV containing breakpoints within the CP gene. Moreover, a partial correlation between geographic origin and phylogenetic clustering was observed for BMYV isolates.
Breakpoints in annual rainfall trends in Córdoba, Argentina

Science.gov (United States)

de la Casa, Antonio; Nasello, Olga

2010-03-01

Long-term rainfall variability in the Province of Córdoba, Argentina is studied. The methodology used was developed by Tomé and Miranda (2004), and the most notable breakpoints in the time series were determined in order to identify sudden transitions from one period to another with a different linear trend sign. All the rain gauges operated by the "Servicio Meteorológico Nacional" (SMN) of Argentina in Córdoba Province, in the period 1930-2006, were analyzed. One of the stations studied, Córdoba Observatorio, has reliable rainfall data since 1873. In this case, the 1925-2006 period and the 1873-2006 period were studied to analyze the influence of series length in terms of the piecewise linear trends produced. Analyzing only one breakpoint in all the series, a trend change is observed from negative to positive in the 1950s in the north area of the region, while in the other areas the opposite change occurs in the 1970s. The residual sum of squares obtained with the partial trend method is compared to that produced by the traditional method. This comparison shows how the multiple trend method enables regional changes to be determined for a given climatological variable.
SoftSearch: integration of multiple sequence features to identify breakpoints of structural variations.

Directory of Open Access Journals (Sweden)

Steven N Hart

Full Text Available BACKGROUND: Structural variation (SV represents a significant, yet poorly understood contribution to an individual's genetic makeup. Advanced next-generation sequencing technologies are widely used to discover such variations, but there is no single detection tool that is considered a community standard. In an attempt to fulfil this need, we developed an algorithm, SoftSearch, for discovering structural variant breakpoints in Illumina paired-end next-generation sequencing data. SoftSearch combines multiple strategies for detecting SV including split-read, discordant read-pair, and unmated pairs. Co-localized split-reads and discordant read pairs are used to refine the breakpoints. RESULTS: We developed and validated SoftSearch using real and synthetic datasets. SoftSearch's key features are 1 not requiring secondary (or exhaustive primary alignment, 2 portability into established sequencing workflows, and 3 is applicable to any DNA-sequencing experiment (e.g. whole genome, exome, custom capture, etc.. SoftSearch identifies breakpoints from a small number of soft-clipped bases from split reads and a few discordant read-pairs which on their own would not be sufficient to make an SV call. CONCLUSIONS: We show that SoftSearch can identify more true SVs by combining multiple sequence features. SoftSearch was able to call clinically relevant SVs in the BRCA2 gene not reported by other tools while offering significantly improved overall performance.

On the Quirks of Maximum Parsimony and Likelihood on Phylogenetic Networks

OpenAIRE

Bryant, Christopher; Fischer, Mareike; Linz, Simone; Semple, Charles

2015-01-01

Maximum parsimony is one of the most frequently-discussed tree reconstruction methods in phylogenetic estimation. However, in recent years it has become more and more apparent that phylogenetic trees are often not sufficient to describe evolution accurately. For instance, processes like hybridization or lateral gene transfer that are commonplace in many groups of organisms and result in mosaic patterns of relationships cannot be represented by a single phylogenetic tree. This is why phylogene...
constNJ: an algorithm to reconstruct sets of phylogenetic trees satisfying pairwise topological constraints.

Science.gov (United States)

Matsen, Frederick A

2010-06-01

This article introduces constNJ (constrained neighbor-joining), an algorithm for phylogenetic reconstruction of sets of trees with constrained pairwise rooted subtree-prune-regraft (rSPR) distance. We are motivated by the problem of constructing sets of trees that must fit into a recombination, hybridization, or similar network. Rather than first finding a set of trees that are optimal according to a phylogenetic criterion (e.g., likelihood or parsimony) and then attempting to fit them into a network, constNJ estimates the trees while enforcing specified rSPR distance constraints. The primary input for constNJ is a collection of distance matrices derived from sequence blocks which are assumed to have evolved in a tree-like manner, such as blocks of an alignment which do not contain any recombination breakpoints. The other input is a set of rSPR constraint inequalities for any set of pairs of trees. constNJ is consistent and a strict generalization of the neighbor-joining algorithm; it uses the new notion of maximum agreement partitions (MAPs) to assure that the resulting trees satisfy the given rSPR distance constraints.
A DNA probe combination for improved detection of MLL/11q23 breakpoints by double-color interphase-FISH in acute leukemias.

NARCIS (Netherlands)

Bergh, A. von; Emanuel, B.; Zelderen-Bhola, S. van; Smetsers, A.F.C.M.; Soest, R. van; Stul, M.; Vranckx, H.; Schuuring, E.; Hagemeijer, A.; Kluin, P.

2000-01-01

Reciprocal translocations involving the MLL gene on chromosome band 11q23 have been observed in both acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL). In AML, identification of MLL breakpoints is an important prognostic factor. Breakpoints are clustered in an 8 kb DNA fragment
Fine mapping of the EDA gene: A translocation breakpoint is associated with a CpG island that is transcribed

Energy Technology Data Exchange (ETDEWEB)

Srivastava, A.K.; Schlessinger, D. [Washington Univ. School of Medicine, St. Louis, MO (United States); Montonen, O. [Univ. of Helsinki (Finland)] [and others

1996-01-01

In order to identify the gene for human X-linked anhidrotic ectodermal dysplasia (EDA), a translocation breakpoint in a female with t(X;1)(q13.1;p36.3) and EDA (patient AK) was finely mapped. The EDA region contains five groups of rare-cutter restriction sites that define CpG islands. The two more centromeric of these islands are associated with transcripts of 3.5 kb and 1.8 kb. The third CpG island maps within <1 kb of the translocation breakpoint in patient AK, as indicated by a genomic rearrangement, and {approximately}100 kb centromeric from another previously mapped translocation breakpoint (patient AnLy). Northern analysis with a probe from this CpG island detected an {approximately}6-kb mRNA in several fetal tissues tested. An extended YAC contig of 1,200 kb with an average of fivefold coverage was constructed. The two most telomeric CpG islands map 350 kb telomeric of the two translocations. Taken together, the results suggest that the CpG island just proximal of the AK translocation breakpoint lies at the 5{prime} end of a candidate gene for EDA. 26 refs., 4 figs., 1 tab.
Novel exon-exon breakpoint in CIC-DUX4 fusion sarcoma identified by anchored multiplex PCR (Archer FusionPlex Sarcoma Panel).

Science.gov (United States)

Loke, Benjamin Nathanael; Lee, Victor Kwan Min; Sudhanshi, Jain; Wong, Meng Kang; Kuick, Chik Hong; Puhaindran, Mark; Chang, Kenneth Tou En

2017-08-01

We describe the clinical and pathological features and novel genetic findings of a case of CIC-DUX4 sarcoma occurring in the thigh of a 35-year-old man. Fusion gene detection using a next-generation sequencing-based anchored multiplex PCR technique (Archer FusionPlex Sarcoma Panel) was used to identify the novel fusion breakpoints of this CIC-DUX4 sarcoma using formalin-fixed and paraffin-embedded tumour material. This CIC-DUX4 sarcoma has a novel fusion breakpoint between exon 20 of the CIC gene and exon 1 of the DUX4 gene. This case report describes an additional case of CIC-DUX4 sarcoma with a novel fusion breakpoint, and demonstrates the value of this next-generation sequencing-based anchored multiplex PCR technique (Archer FusionPlex Sarcoma Panel) in both diagnosis for patient care and in identification of a novel fusion breakpoint in this tumour type. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
Accurate reconstruction of insertion-deletion histories by statistical phylogenetics.

Directory of Open Access Journals (Sweden)

Oscar Westesson

Full Text Available The Multiple Sequence Alignment (MSA is a computational abstraction that represents a partial summary either of indel history, or of structural similarity. Taking the former view (indel history, it is possible to use formal automata theory to generalize the phylogenetic likelihood framework for finite substitution models (Dayhoff's probability matrices and Felsenstein's pruning algorithm to arbitrary-length sequences. In this paper, we report results of a simulation-based benchmark of several methods for reconstruction of indel history. The methods tested include a relatively new algorithm for statistical marginalization of MSAs that sums over a stochastically-sampled ensemble of the most probable evolutionary histories. For mammalian evolutionary parameters on several different trees, the single most likely history sampled by our algorithm appears less biased than histories reconstructed by other MSA methods. The algorithm can also be used for alignment-free inference, where the MSA is explicitly summed out of the analysis. As an illustration of our method, we discuss reconstruction of the evolutionary histories of human protein-coding genes.
In vitro antibacterial activity of doripenem against clinical isolates from French teaching hospitals: proposition of zone diameter breakpoints.

Science.gov (United States)

Lascols, C; Legrand, P; Mérens, A; Leclercq, R; Armand-Lefevre, L; Drugeon, H B; Kitzis, M D; Muller-Serieys, C; Reverdy, M E; Roussel-Delvallez, M; Moubareck, C; Lemire, A; Miara, A; Gjoklaj, M; Soussy, C-J

2011-04-01

The aims of the study were to determine the in vitro activity of doripenem, a new carbapenem, against a large number of bacterial pathogens and to propose zone diameter breakpoints for clinical categorization in France according to the European Committee on Antimicrobial Susceptibility Testing (EUCAST) minimum inhibitory concentration (MIC) breakpoints. The MICs of doripenem were determined by the broth microdilution method against 1,547 clinical isolates from eight French hospitals. The disk diffusion test was performed (10-μg discs) according to the Comité de l'Antibiogramme de la Société Française de Microbiologie (CASFM) method. The MIC(50/90) (mg/L) values were as follows: methicillin-susceptible Staphylococcus aureus (MSSA) (0.03/0.25), methicillin-resistant Staphylococcus aureus (MRSA) (1/2), methicillin-susceptible coagulase-negative staphylococci (MSCoNS) (0.03/0.12), methicillin-resistant coagulase-negative staphylococci (MRCoNS) (2/8), Streptococcus pneumoniae (0.016/0.25), viridans group streptococci (0.016/2), β-hemolytic streptococci (≤0.008/≤0.008), Enterococcus faecalis (2/4), Enterococcus faecium (128/>128), Enterobacteriaceae (0.06/0.25), Pseudomonas aeruginosa (0.5/8), Acinetobacter baumannii (0.25/2), Haemophilus influenzae (0.12/0.25), and Moraxella catarrhalis (0.03/0.06). According to the regression curve, the zone diameter breakpoints were 24 and 19 mm for MICs of 1 and 4 mg/L, respectively. This study confirms the potent in vitro activity of doripenem against Pseudomonas aeruginosa, Acinetobacter, Enterobacteriaceae, MSSA, MSCoNS, and respiratory pathogens. According to the EUCAST MIC breakpoints (mg/L) ≤1/>4 for Enterobacteriaceae, Pseudomonas aeruginosa, and Acinetobacter, and ≤1/>1 for streptococci, pneumococci, and Haemophilus, the zone diameter breakpoints could be (mm) ≥24/<19 and ≥24/<24, respectively.
Optimum temperature of a northern population of Arctic charr (Salvelinus alpinus) using heart rate Arrhenius breakpoint analysis

DEFF Research Database (Denmark)

Hansen, Aslak Kappel; Byriel, David Bille; R. Jensen, Mads

2017-01-01

± 0.4). The Q10 breakpoint occurred at an average of 7.1 °C ± 0.3. There was no significant difference between the breakpoint temperature found using Q10 and Arrhenius [two-sample t test, df = 16; p > 0.1]. The highest fHmax was found at 12.8 °C ± 1.0 reaching an average of 61.8 BPM ± 3.1. Arrhythmia...
DendroBLAST: approximate phylogenetic trees in the absence of multiple sequence alignments.

Science.gov (United States)

Kelly, Steven; Maini, Philip K

2013-01-01

The rapidly growing availability of genome information has created considerable demand for both fast and accurate phylogenetic inference algorithms. We present a novel method called DendroBLAST for reconstructing phylogenetic dendrograms/trees from protein sequences using BLAST. This method differs from other methods by incorporating a simple model of sequence evolution to test the effect of introducing sequence changes on the reliability of the bipartitions in the inferred tree. Using realistic simulated sequence data we demonstrate that this method produces phylogenetic trees that are more accurate than other commonly-used distance based methods though not as accurate as maximum likelihood methods from good quality multiple sequence alignments. In addition to tests on simulated data, we use DendroBLAST to generate input trees for a supertree reconstruction of the phylogeny of the Archaea. This independent analysis produces an approximate phylogeny of the Archaea that has both high precision and recall when compared to previously published analysis of the same dataset using conventional methods. Taken together these results demonstrate that approximate phylogenetic trees can be produced in the absence of multiple sequence alignments, and we propose that these trees will provide a platform for improving and informing downstream bioinformatic analysis. A web implementation of the DendroBLAST method is freely available for use at http://www.dendroblast.com/.
DendroBLAST: approximate phylogenetic trees in the absence of multiple sequence alignments.

Directory of Open Access Journals (Sweden)

Steven Kelly

Full Text Available The rapidly growing availability of genome information has created considerable demand for both fast and accurate phylogenetic inference algorithms. We present a novel method called DendroBLAST for reconstructing phylogenetic dendrograms/trees from protein sequences using BLAST. This method differs from other methods by incorporating a simple model of sequence evolution to test the effect of introducing sequence changes on the reliability of the bipartitions in the inferred tree. Using realistic simulated sequence data we demonstrate that this method produces phylogenetic trees that are more accurate than other commonly-used distance based methods though not as accurate as maximum likelihood methods from good quality multiple sequence alignments. In addition to tests on simulated data, we use DendroBLAST to generate input trees for a supertree reconstruction of the phylogeny of the Archaea. This independent analysis produces an approximate phylogeny of the Archaea that has both high precision and recall when compared to previously published analysis of the same dataset using conventional methods. Taken together these results demonstrate that approximate phylogenetic trees can be produced in the absence of multiple sequence alignments, and we propose that these trees will provide a platform for improving and informing downstream bioinformatic analysis. A web implementation of the DendroBLAST method is freely available for use at http://www.dendroblast.com/.
Translocation and gross deletion breakpoints in human inherited disease and cancer II: Potential involvement of repetitive sequence elements in secondary structure formation between DNA ends.

Science.gov (United States)

Chuzhanova, Nadia; Abeysinghe, Shaun S; Krawczak, Michael; Cooper, David N

2003-09-01

Translocations and gross deletions are responsible for a significant proportion of both cancer and inherited disease. Although such gene rearrangements are nonuniformly distributed in the human genome, the underlying mutational mechanisms remain unclear. We have studied the potential involvement of various types of repetitive sequence elements in the formation of secondary structure intermediates between the single-stranded DNA ends that recombine during rearrangements. Complexity analysis was used to assess the potential of these ends to form secondary structures, the maximum decrease in complexity consequent to a gross rearrangement being used as an indicator of the type of repeat and the specific DNA ends involved. A total of 175 pairs of deletion/translocation breakpoint junction sequences available from the Gross Rearrangement Breakpoint Database [GRaBD; www.uwcm.ac.uk/uwcm/mg/grabd/grabd.html] were analyzed. Potential secondary structure was noted between the 5' flanking sequence of the first breakpoint and the 3' flanking sequence of the second breakpoint in 49% of rearrangements and between the 5' flanking sequence of the second breakpoint and the 3' flanking sequence of the first breakpoint in 36% of rearrangements. Inverted repeats, inversions of inverted repeats, and symmetric elements were found in association with gross rearrangements at approximately the same frequency. However, inverted repeats and inversions of inverted repeats accounted for the vast majority (83%) of deletions plus small insertions, symmetric elements for one-half of all antigen receptor-mediated translocations, while direct repeats appear only to be involved in mediating simple deletions. These findings extend our understanding of illegitimate recombination by highlighting the importance of secondary structure formation between single-stranded DNA ends at breakpoint junctions. Copyright 2003 Wiley-Liss, Inc.
Sequencing and characterisation of rearrangements in three S. pastorianus strains reveals the presence of chimeric genes and gives evidence of breakpoint reuse.

Directory of Open Access Journals (Sweden)

Sarah K Hewitt

Full Text Available Gross chromosomal rearrangements have the potential to be evolutionarily advantageous to an adapting organism. The generation of a hybrid species increases opportunity for recombination by bringing together two homologous genomes. We sought to define the location of genomic rearrangements in three strains of Saccharomyces pastorianus, a natural lager-brewing yeast hybrid of Saccharomyces cerevisiae and Saccharomyces eubayanus, using whole genome shotgun sequencing. Each strain of S. pastorianus has lost species-specific portions of its genome and has undergone extensive recombination, producing chimeric chromosomes. We predicted 30 breakpoints that we confirmed at the single nucleotide level by designing species-specific primers that flank each breakpoint, and then sequencing the PCR product. These rearrangements are the result of recombination between areas of homology between the two subgenomes, rather than repetitive elements such as transposons or tRNAs. Interestingly, 28/30 S. cerevisiae-S. eubayanus recombination breakpoints are located within genic regions, generating chimeric genes. Furthermore we show evidence for the reuse of two breakpoints, located in HSP82 and KEM1, in strains of proposed independent origin.
Phylogenetic diversity and biodiversity indices on phylogenetic networks.

Science.gov (United States)

Wicke, Kristina; Fischer, Mareike

2018-04-01

In biodiversity conservation it is often necessary to prioritize the species to conserve. Existing approaches to prioritization, e.g. the Fair Proportion Index and the Shapley Value, are based on phylogenetic trees and rank species according to their contribution to overall phylogenetic diversity. However, in many cases evolution is not treelike and thus, phylogenetic networks have been developed as a generalization of phylogenetic trees, allowing for the representation of non-treelike evolutionary events, such as hybridization. Here, we extend the concepts of phylogenetic diversity and phylogenetic diversity indices from phylogenetic trees to phylogenetic networks. On the one hand, we consider the treelike content of a phylogenetic network, e.g. the (multi)set of phylogenetic trees displayed by a network and the so-called lowest stable ancestor tree associated with it. On the other hand, we derive the phylogenetic diversity of subsets of taxa and biodiversity indices directly from the internal structure of the network. We consider both approaches that are independent of so-called inheritance probabilities as well as approaches that explicitly incorporate these probabilities. Furthermore, we introduce our software package NetDiversity, which is implemented in Perl and allows for the calculation of all generalized measures of phylogenetic diversity and generalized phylogenetic diversity indices established in this note that are independent of inheritance probabilities. We apply our methods to a phylogenetic network representing the evolutionary relationships among swordtails and platyfishes (Xiphophorus: Poeciliidae), a group of species characterized by widespread hybridization. Copyright © 2018 Elsevier Inc. All rights reserved.
GeneBreak: detection of recurrent DNA copy number aberration-associated chromosomal breakpoints within genes [version 2; referees: 2 approved

Directory of Open Access Journals (Sweden)

Evert van den Broek

2017-07-01

Full Text Available Development of cancer is driven by somatic alterations, including numerical and structural chromosomal aberrations. Currently, several computational methods are available and are widely applied to detect numerical copy number aberrations (CNAs of chromosomal segments in tumor genomes. However, there is lack of computational methods that systematically detect structural chromosomal aberrations by virtue of the genomic location of CNA-associated chromosomal breaks and identify genes that appear non-randomly affected by chromosomal breakpoints across (large series of tumor samples. ‘GeneBreak’ is developed to systematically identify genes recurrently affected by the genomic location of chromosomal CNA-associated breaks by a genome-wide approach, which can be applied to DNA copy number data obtained by array-Comparative Genomic Hybridization (CGH or by (low-pass whole genome sequencing (WGS. First, ‘GeneBreak’ collects the genomic locations of chromosomal CNA-associated breaks that were previously pinpointed by the segmentation algorithm that was applied to obtain CNA profiles. Next, a tailored annotation approach for breakpoint-to-gene mapping is implemented. Finally, dedicated cohort-based statistics is incorporated with correction for covariates that influence the probability to be a breakpoint gene. In addition, multiple testing correction is integrated to reveal recurrent breakpoint events. This easy-to-use algorithm, ‘GeneBreak’, is implemented in R (www.cran.r-project.org and is available from Bioconductor (www.bioconductor.org/packages/release/bioc/html/GeneBreak.html.
The clinical impact of chromosomal rearrangements with breakpoints upstream of the SOX9 gene: two novel de novo balanced translocations associated with acampomelic campomelic dysplasia.

Science.gov (United States)

Fonseca, Ana Carolina S; Bonaldi, Adriano; Bertola, Débora R; Kim, Chong A; Otto, Paulo A; Vianna-Morgante, Angela M

2013-05-07

The association of balanced rearrangements with breakpoints near SOX9 [SRY (sex determining region Y)-box 9] with skeletal abnormalities has been ascribed to the presumptive altering of SOX9 expression by the direct disruption of regulatory elements, their separation from SOX9 or the effect of juxtaposed sequences. We report on two sporadic apparently balanced translocations, t(7;17)(p13;q24) and t(17;20)(q24.3;q11.2), whose carriers have skeletal abnormalities that led to the diagnosis of acampomelic campomelic dysplasia (ACD; MIM 114290). No pathogenic chromosomal imbalances were detected by a-CGH. The chromosome 17 breakpoints were mapped, respectively, 917-855 kb and 601-585 kb upstream of the SOX9 gene. A distal cluster of balanced rearrangements breakpoints on chromosome 17 associated with SOX9-related skeletal disorders has been mapped to a segment 932-789 kb upstream of SOX9. In this cluster, the breakpoint of the herein described t(17;20) is the most telomeric to SOX9, thus allowing the redefining of the telomeric boundary of the distal breakpoint cluster region related to skeletal disorders to 601-585 kb upstream of SOX9. Although both patients have skeletal abnormalities, the t(7;17) carrier presents with relatively mild clinical features, whereas the t(17;20) was detected in a boy with severe broncheomalacia, depending on mechanical ventilation. Balanced and unbalanced rearrangements associated with disorders of sex determination led to the mapping of a regulatory region of SOX9 function on testicular differentiation to a 517-595 kb interval upstream of SOX9, in addition to TESCO (Testis-specific enhancer of SOX9 core). As the carrier of t(17;20) has an XY sex-chromosome constitution and normal male development for his age, the segment of chromosome 17 distal to the translocation breakpoint should contain the regulatory elements for normal testis development. These two novel translocations illustrate the clinical variability in carriers of balanced
Cloning of the anhidrotic ectodermal dysplasia gene: Identification of cDNAs associated with CpG islands mapped near translocation breakpoint in two female patients

Energy Technology Data Exchange (ETDEWEB)

Srivastava, A.K.; Schlessinger, D. [Washington Univ. School of Medicine, St. Louis, MO (United States); Kere, J. [Univ. of Helsinki (Finland)] [and others

1994-09-01

The gene for the X chromosomal developmental disorder anhidrotic ectodermal dysplasia (EDA) has been mapped to Xq12-q13 by linkage analysis and is expressed in a few females with chromosomal translocations involving band Xq12-q13. A yeast artificial chromosome (YAC) contig (2.0 Mb) spanning two translocation breakpoints has been assembled by sequence-tagged site (STS)-based chromosomal walking. The two translocation breakpoints (X:autosome translocations from the affected female patients) have been mapped less than 60 kb apart within a YAC contig. Unique probes and intragenic STSs (mapped between the two translocations) have been developed and a somatic cell hybrid carrying the translocated X chromosome from the AK patient has been analyzed by isolating unique probes that span the breakpoint. Several STSs made from intragenic sequences have been found to be conserved in mouse, hamster and monkey, but we have detected no mRNAs in a number of tissues tested. However, a probe and STS developed from the DNA spanning the AK breakpoint is conserved in mouse, hamster and monkey, and we have detected expressed sequences in skin cells and cDNA libraries. In addition, unique sequences have been obtained from two CpG islands in the region that maps proximal to the breakpoints. cDNAs containing these sequences are being studied as candidates for the gene affected in the etiology of EDA.
Phylogenetic Inference of HIV Transmission Clusters

Directory of Open Access Journals (Sweden)

Vlad Novitsky

2017-10-01

Full Text Available Better understanding the structure and dynamics of HIV transmission networks is essential for designing the most efficient interventions to prevent new HIV transmissions, and ultimately for gaining control of the HIV epidemic. The inference of phylogenetic relationships and the interpretation of results rely on the definition of the HIV transmission cluster. The definition of the HIV cluster is complex and dependent on multiple factors, including the design of sampling, accuracy of sequencing, precision of sequence alignment, evolutionary models, the phylogenetic method of inference, and specified thresholds for cluster support. While the majority of studies focus on clusters, non-clustered cases could also be highly informative. A new dimension in the analysis of the global and local HIV epidemics is the concept of phylogenetically distinct HIV sub-epidemics. The identification of active HIV sub-epidemics reveals spreading viral lineages and may help in the design of targeted interventions.HIVclustering can also be affected by sampling density. Obtaining a proper sampling density may increase statistical power and reduce sampling bias, so sampling density should be taken into account in study design and in interpretation of phylogenetic results. Finally, recent advances in long-range genotyping may enable more accurate inference of HIV transmission networks. If performed in real time, it could both inform public-health strategies and be clinically relevant (e.g., drug-resistance testing.
On the quirks of maximum parsimony and likelihood on phylogenetic networks.

Science.gov (United States)

Bryant, Christopher; Fischer, Mareike; Linz, Simone; Semple, Charles

2017-03-21

Maximum parsimony is one of the most frequently-discussed tree reconstruction methods in phylogenetic estimation. However, in recent years it has become more and more apparent that phylogenetic trees are often not sufficient to describe evolution accurately. For instance, processes like hybridization or lateral gene transfer that are commonplace in many groups of organisms and result in mosaic patterns of relationships cannot be represented by a single phylogenetic tree. This is why phylogenetic networks, which can display such events, are becoming of more and more interest in phylogenetic research. It is therefore necessary to extend concepts like maximum parsimony from phylogenetic trees to networks. Several suggestions for possible extensions can be found in recent literature, for instance the softwired and the hardwired parsimony concepts. In this paper, we analyze the so-called big parsimony problem under these two concepts, i.e. we investigate maximum parsimonious networks and analyze their properties. In particular, we show that finding a softwired maximum parsimony network is possible in polynomial time. We also show that the set of maximum parsimony networks for the hardwired definition always contains at least one phylogenetic tree. Lastly, we investigate some parallels of parsimony to different likelihood concepts on phylogenetic networks. Copyright © 2017 Elsevier Ltd. All rights reserved.
Chromosome breakage in Prader-Willi and Angelman syndrome deletions may involve recombination between a repeat at the proximal and distal breakpoints

Energy Technology Data Exchange (ETDEWEB)

Amos-Landgraf J.; Nicholls, R.D. [Case Western Reserve Univ., Cleveland, OH (United States); Gottlieb, W. [Univ. of Florida, Gainesville, FL (United States)] [and others

1994-09-01

Prader-Willi (PWS) and Angelman (AS) syndromes most commonly arise from large deletions of 15q11-q13. Deletions in PWS are paternal in origin, while those in AS are maternal in origin, clearly demonstrating genomic imprinting in these clinically distinct neurobehavioural disorders. In at least 90% of PWS and AS deletion patients, the same 4 Mb region within 15q11-q13 is deleted with breakpoints clustering in single YAC clones at the proximal and distal ends. To study the mechanism of chromosome breakage in PWS and AS, we have previously isolated 25 independent clones from these three YACs using Alu-vector PCR. Four clones were selected that appear to detect a low copy repeat that is located in the proximal and distal breakpoint regions of chromosome 15q11-q13. Three clones detect the same 4 HindIII bands in genomic DNA, all from 15q11-q13, with differing intensities for the probes located at the proximal or distal breakpoints region, respectively. This suggests that these probes detect related members of a low-copy repeat at either location. Moreover, the 254RL2 probe detects a novel HindIII band in two unrelated PWS deletion patients, suggesting that this may represent a breakpoint fragment, with recombination occurring within a similar interval in both patients. A fourth clone, 318RL3 detects 5 bands in HindIII-digested genomic DNA, all from 15q11-q13. This YAC endclone itself is not deleted in PWS and AS deletion patients, as seen by an invariant strong band. Two other strong bands are variably intact or deleted in different PWS or AS deletion patients, suggesting a relationship of this sequence to the breakpoints. Moreover, PCR using 318RL3 primers from the distal 93C9 YAC led to the isolation of a related clone with 96% identity, demonstrating the existence of a low-copy repeat with members close to the proximal and distal breakpoints. Taken together, our data suggest a complex, low-copy repeat with members at both the proximal and distal boundaries.
Carbapenem Breakpoints for Acinetobacter baumannii Group: Supporting Clinical Outcome Data from Patients with Bacteremia.

Science.gov (United States)

Lee, Yi-Tzu; Chiang, Mei-Chun; Kuo, Shu-Chen; Wang, Yung-Chih; Lee, I-Hsin; Chen, Te-Li; Yang, Ya-Sung

2016-01-01

The carbapenem breakpoints set by different organizations for Acinetobacter are discordant, but supporting clinical data are lacking. This study aimed to provide the first clinical outcome data to support the carbapenem breakpoints for Acinetobacter baumannii (Ab) group in patients with bacteremia. This study included 117 adults who received carbapenems for treatment of Ab group bacteremia in Taipei Veterans General Hospital over an 8-year period. We analyzed 30-day mortality rates among patient groups acquiring isolates with different carbapenem minimal inhibitory concentrations (MICs). The carbapenem MIC breakpoint derived from classification and regression tree (CART) analysis to delineate the risk of 30-day mortality was between MICs of ≤ 4 mg/L and ≥ 8 mg/L. Mortality rate was higher in patients acquiring isolates with carbapenem MIC ≥ 8 mg/L than ≤ 4 mg/L, by bivariate (54.9% [28/51] vs 25.8% [17/66]; P = 0.003) and survival analysis (P = 0.001 by log-rank test). Multivariate analysis using logistic regression and Cox regression models including severity of illness indices demonstrated that treating patients with Ab group bacteremia caused by isolates with a carbapenem MIC ≥ 8 mg/L with carbapenem was an independent predictor of 30-day mortality (odds ratio, 5.125; 95% confidence interval [CI], 1.946-13.498; P = 0.001, and hazard ratio, 2.630; 95% CI, 1.431-4.834; P = 0.002, respectively). The clinical outcome data confirmed that isolates with MIC ≤ 4 mg/L were susceptible to carbapenem, and those with MIC ≥ 8 mg/L were resistant in patients with Ab group bacteremia.

Interphase FISH detection of BCL2 rearrangement in follicular lymphoma using breakpoint-flanking probes

NARCIS (Netherlands)

Vaandrager, J W; Schuuring, E; Raap, T; Philippo, K; Kleiverda, K; Kluin, P

Rearrangement of the BCL2 gene is an important parameter for the differential diagnosis of non-Hodgkin lymphomas. Although a relatively large proportion of breakpoints is clustered, many are missed by standard PCR. A FISH assay is therefore desired. Up to now, a lack of probes flanking the BCL2 gene
Unrealistic phylogenetic trees may improve phylogenetic footprinting.

Science.gov (United States)

Nettling, Martin; Treutler, Hendrik; Cerquides, Jesus; Grosse, Ivo

2017-06-01

The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approaches based on phylogenetic footprinting become increasingly attractive. Phylogenetic footprinting requires phylogenetic trees with attached substitution probabilities for quantifying the evolution of binding sites, but these trees and substitution probabilities are typically not known and cannot be estimated easily. Here, we investigate the influence of phylogenetic trees with different substitution probabilities on the classification performance of phylogenetic footprinting using synthetic and real data. For synthetic data we find that the classification performance is highest when the substitution probability used for phylogenetic footprinting is similar to that used for data generation. For real data, however, we typically find that the classification performance of phylogenetic footprinting surprisingly increases with increasing substitution probabilities and is often highest for unrealistically high substitution probabilities close to one. This finding suggests that choosing realistic model assumptions might not always yield optimal predictions in general and that choosing unrealistically high substitution probabilities close to one might actually improve the classification performance of phylogenetic footprinting. The proposed PF is implemented in JAVA and can be downloaded from https://github.com/mgledi/PhyFoo. : martin.nettling@informatik.uni-halle.de. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
[Comparison of microdilution and disk diffusion methods for the detection of fluconazole and voriconazole susceptibility against clinical Candida glabrata isolates and determination of changing susceptibility with new CLSI breakpoints].

Science.gov (United States)

Hazırolan, Gülşen; Sarıbaş, Zeynep; Arıkan Akdağlı, Sevtap

2016-07-01

Candida albicans is the most frequently isolated species as the causative agent of Candida infections. However, in recent years, the isolation rate of non-albicans Candida species have increased. In many centers, Candida glabrata is one of the commonly isolated non-albicans species of C.glabrata infections which are difficult-to-treat due to decreased susceptibility to fluconazole and cross-resistance to other azoles. The aims of this study were to determine the in vitro susceptibility profiles of clinical C.glabrata isolates against fluconazole and voriconazole by microdilution and disk diffusion methods and to evaluate the results with both the previous (CLSI) and current species-specific CLSI (Clinical and Laboratory Standards Institute) clinical breakpoints. A total of 70 C.glabrata strains isolated from clinical samples were included in the study. The identification of the isolates was performed by morphologic examination on cornmeal Tween 80 agar and assimilation profiles obtained by using ID32C (BioMérieux, France). Broth microdilution and disk diffusion methods were performed according to CLSI M27-A3 and CLSI M44-A2 documents, respectively. The results were evaluated according to CLSI M27-A3 and M44-A2 documents and new vs. species-specific CLSI breakpoints. By using both previous and new CLSI breakpoints, broth microdilution test results showed that voriconazole has greater in vitro activity than fluconazole against C.glabrata isolates. For the two drugs tested, very major error was not observed with disk diffusion method when microdilution method was considered as the reference method. Since "susceptible" category no more exists for fluconazole vs. C.glabrata, the isolates that were interpreted as susceptible by previous breakpoints were evaluated as susceptible-dose dependent by current CLSI breakpoints. Since species-specific breakpoints remain yet undetermined for voriconazole, comparative analysis was not possible for this agent. The results obtained
Phylogenetic trees

OpenAIRE

Baños, Hector; Bushek, Nathaniel; Davidson, Ruth; Gross, Elizabeth; Harris, Pamela E.; Krone, Robert; Long, Colby; Stewart, Allen; Walker, Robert

2016-01-01

We introduce the package PhylogeneticTrees for Macaulay2 which allows users to compute phylogenetic invariants for group-based tree models. We provide some background information on phylogenetic algebraic geometry and show how the package PhylogeneticTrees can be used to calculate a generating set for a phylogenetic ideal as well as a lower bound for its dimension. Finally, we show how methods within the package can be used to compute a generating set for the join of any two ideals.
Cloning of the chromosome translocation breakpoint junction of the t(14;19) in chronic lymphocytic leukemia

International Nuclear Information System (INIS)

McKeithan, T.W.; Rowley, J.D.; Shows, T.B.; Diaz, M.O.

1987-01-01

The authors' laboratory has reported that t(14;19)(q32;q13.1) is a recurring translocation in the neoplastic cells of patients with chronic lymphocytic leukemia. In the present study, they have analyzed the leukemic cells from one such patient with probes from the immunoglobulin heavy-chain locus, which is present on band q32 of chromosome 14. Using a probe for the α constant-region gene segments, they detected a rearranged band by Southern blot analysis. This rearranged band was cloned and mapped. A subclone free of repetitive sequences was shown to be from chromosome 19 by analysis of human-mouse somatic cell hybrids, confirming that the rearranged band contains the translocation breakpoint junction. This probe may be used to identify a gene on chromosome 19 adjacent to the breakpoint that can contribute to the malignant development of B lymphocytes
On Computing Breakpoint Distances for Genomes with Duplicate Genes.

Science.gov (United States)

Shao, Mingfu; Moret, Bernard M E

2017-06-01

A fundamental problem in comparative genomics is to compute the distance between two genomes in terms of its higher level organization (given by genes or syntenic blocks). For two genomes without duplicate genes, we can easily define (and almost always efficiently compute) a variety of distance measures, but the problem is NP-hard under most models when genomes contain duplicate genes. To tackle duplicate genes, three formulations (exemplar, maximum matching, and any matching) have been proposed, all of which aim to build a matching between homologous genes so as to minimize some distance measure. Of the many distance measures, the breakpoint distance (the number of nonconserved adjacencies) was the first one to be studied and remains of significant interest because of its simplicity and model-free property. The three breakpoint distance problems corresponding to the three formulations have been widely studied. Although we provided last year a solution for the exemplar problem that runs very fast on full genomes, computing optimal solutions for the other two problems has remained challenging. In this article, we describe very fast, exact algorithms for these two problems. Our algorithms rely on a compact integer-linear program that we further simplify by developing an algorithm to remove variables, based on new results on the structure of adjacencies and matchings. Through extensive experiments using both simulations and biological data sets, we show that our algorithms run very fast (in seconds) on mammalian genomes and scale well beyond. We also apply these algorithms (as well as the classic orthology tool MSOAR) to create orthology assignment, then compare their quality in terms of both accuracy and coverage. We find that our algorithm for the "any matching" formulation significantly outperforms other methods in terms of accuracy while achieving nearly maximum coverage.
Putative cruciform DNA structures at BCL6 breakpoint region may explain BCL6 translocation in diffuse large B-Cell lymphoma

International Nuclear Information System (INIS)

Bhatelia, Khyati D.; Nambiar, Mridula; Choudhary, Bibha; Raghvan, Sathees C.

2010-01-01

Cancer is a disease characterized by uncontrolled proliferation of cells, caused by genetic alterations such as chromosomal translocations, which are present in almost all hematological malignancies. Diffuse Large B-cell Lymphoma (DLBL) is the most common non-Hodgkin's lymphoma, comprising 40-50% of all lymphomas both in India and worldwide, and is characterized by BCL6 chromosomal translocation. However, the mechanism of this translocation is completely unknown. By mapping of translocation breakpoints from patients, we have identified three breakpoint cluster regions at 5' UTR of BCL6 gene. Bioinformatics analysis of cluster II, which possesses majority of breakpoints, this region may form cruciform DNA structures. Gel mobility shift assays using oligomeric DNA from the region suggested that a portion of cluster II folded into hairpin structures. Mutations to the wild type sequences disrupted hairpin formation. Circular dichroism studies on BCL6 oligomers resulted in a spectra containing two overlapping peaks at 265 nm and 285 nm, confirming hairpin structure. Further, the structure was destroyed upon heating, and reformed when appropriate conditions were provided. P1 nuclease assay in conjunction with KMnO 4 probing suggested that the structure possessed an eight nucleotide double-stranded stem and a nine nucleotide loop. To further understand the mechanism of BCL6 translocation in vivo, human cells were transfected with episomes harboring cluster II region and the results obtained will be discussed. Hence, our results suggest the formation of a putative cruciform DNA structure at BCL6 breakpoint region and that may facilitate breakage at BCL6 gene explaining chromosomal translocations in DLBL. (author)
Visualizing phylogenetic tree landscapes.

Science.gov (United States)

Wilgenbusch, James C; Huang, Wen; Gallivan, Kyle A

2017-02-02

Genomic-scale sequence alignments are increasingly used to infer phylogenies in order to better understand the processes and patterns of evolution. Different partitions within these new alignments (e.g., genes, codon positions, and structural features) often favor hundreds if not thousands of competing phylogenies. Summarizing and comparing phylogenies obtained from multi-source data sets using current consensus tree methods discards valuable information and can disguise potential methodological problems. Discovery of efficient and accurate dimensionality reduction methods used to display at once in 2- or 3- dimensions the relationship among these competing phylogenies will help practitioners diagnose the limits of current evolutionary models and potential problems with phylogenetic reconstruction methods when analyzing large multi-source data sets. We introduce several dimensionality reduction methods to visualize in 2- and 3-dimensions the relationship among competing phylogenies obtained from gene partitions found in three mid- to large-size mitochondrial genome alignments. We test the performance of these dimensionality reduction methods by applying several goodness-of-fit measures. The intrinsic dimensionality of each data set is also estimated to determine whether projections in 2- and 3-dimensions can be expected to reveal meaningful relationships among trees from different data partitions. Several new approaches to aid in the comparison of different phylogenetic landscapes are presented. Curvilinear Components Analysis (CCA) and a stochastic gradient decent (SGD) optimization method give the best representation of the original tree-to-tree distance matrix for each of the three- mitochondrial genome alignments and greatly outperformed the method currently used to visualize tree landscapes. The CCA + SGD method converged at least as fast as previously applied methods for visualizing tree landscapes. We demonstrate for all three mtDNA alignments that 3D
TCS: a web server for multiple sequence alignment evaluation and phylogenetic reconstruction.

Science.gov (United States)

Chang, Jia-Ming; Di Tommaso, Paolo; Lefort, Vincent; Gascuel, Olivier; Notredame, Cedric

2015-07-01

This article introduces the Transitive Consistency Score (TCS) web server; a service making it possible to estimate the local reliability of protein multiple sequence alignments (MSAs) using the TCS index. The evaluation can be used to identify the aligned positions most likely to contain structurally analogous residues and also most likely to support an accurate phylogenetic reconstruction. The TCS scoring scheme has been shown to be accurate predictor of structural alignment correctness among commonly used methods. It has also been shown to outperform common filtering schemes like Gblocks or trimAl when doing MSA post-processing prior to phylogenetic tree reconstruction. The web server is available from http://tcoffee.crg.cat/tcs. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Spatial phylogenetics of the vascular flora of Chile.

Science.gov (United States)

Scherson, Rosa A; Thornhill, Andrew H; Urbina-Casanova, Rafael; Freyman, William A; Pliscoff, Patricio A; Mishler, Brent D

2017-07-01

Current geographic patterns of biodiversity are a consequence of the evolutionary history of the lineages that comprise them. This study was aimed at exploring how evolutionary features of the vascular flora of Chile are distributed across the landscape. Using a phylogeny at the genus level for 87% of the Chilean vascular flora, and a geographic database of sample localities, we calculated phylogenetic diversity (PD), phylogenetic endemism (PE), relative PD (RPD), and relative PE (RPE). Categorical Analyses of Neo- and Paleo-Endemism (CANAPE) were also performed, using a spatial randomization to assess statistical significance. A cluster analysis using range-weighted phylogenetic turnover was used to compare among grid cells, and with known Chilean bioclimates. PD patterns were concordant with known centers of high taxon richness and the Chilean biodiversity hotspot. In addition, several other interesting areas of concentration of evolutionary history were revealed as potential conservation targets. The south of the country shows areas of significantly high RPD and a concentration of paleo-endemism, and the north shows areas of significantly low PD and RPD, and a concentration of neo-endemism. Range-weighted phylogenetic turnover shows high congruence with the main macrobioclimates of Chile. Even though the study was done at the genus level, the outcome provides an accurate outline of phylogenetic patterns that can be filled in as more fine-scaled information becomes available. Copyright © 2017 Elsevier Inc. All rights reserved.
Robustness of ancestral sequence reconstruction to phylogenetic uncertainty.

Science.gov (United States)

Hanson-Smith, Victor; Kolaczkowski, Bryan; Thornton, Joseph W

2010-09-01

Ancestral sequence reconstruction (ASR) is widely used to formulate and test hypotheses about the sequences, functions, and structures of ancient genes. Ancestral sequences are usually inferred from an alignment of extant sequences using a maximum likelihood (ML) phylogenetic algorithm, which calculates the most likely ancestral sequence assuming a probabilistic model of sequence evolution and a specific phylogeny--typically the tree with the ML. The true phylogeny is seldom known with certainty, however. ML methods ignore this uncertainty, whereas Bayesian methods incorporate it by integrating the likelihood of each ancestral state over a distribution of possible trees. It is not known whether Bayesian approaches to phylogenetic uncertainty improve the accuracy of inferred ancestral sequences. Here, we use simulation-based experiments under both simplified and empirically derived conditions to compare the accuracy of ASR carried out using ML and Bayesian approaches. We show that incorporating phylogenetic uncertainty by integrating over topologies very rarely changes the inferred ancestral state and does not improve the accuracy of the reconstructed ancestral sequence. Ancestral state reconstructions are robust to uncertainty about the underlying tree because the conditions that produce phylogenetic uncertainty also make the ancestral state identical across plausible trees; conversely, the conditions under which different phylogenies yield different inferred ancestral states produce little or no ambiguity about the true phylogeny. Our results suggest that ML can produce accurate ASRs, even in the face of phylogenetic uncertainty. Using Bayesian integration to incorporate this uncertainty is neither necessary nor beneficial.
Incompletely resolved phylogenetic trees inflate estimates of phylogenetic conservatism.

Science.gov (United States)

Davies, T Jonathan; Kraft, Nathan J B; Salamin, Nicolas; Wolkovich, Elizabeth M

2012-02-01

The tendency for more closely related species to share similar traits and ecological strategies can be explained by their longer shared evolutionary histories and represents phylogenetic conservatism. How strongly species traits co-vary with phylogeny can significantly impact how we analyze cross-species data and can influence our interpretation of assembly rules in the rapidly expanding field of community phylogenetics. Phylogenetic conservatism is typically quantified by analyzing the distribution of species values on the phylogenetic tree that connects them. Many phylogenetic approaches, however, assume a completely sampled phylogeny: while we have good estimates of deeper phylogenetic relationships for many species-rich groups, such as birds and flowering plants, we often lack information on more recent interspecific relationships (i.e., within a genus). A common solution has been to represent these relationships as polytomies on trees using taxonomy as a guide. Here we show that such trees can dramatically inflate estimates of phylogenetic conservatism quantified using S. P. Blomberg et al.'s K statistic. Using simulations, we show that even randomly generated traits can appear to be phylogenetically conserved on poorly resolved trees. We provide a simple rarefaction-based solution that can reliably retrieve unbiased estimates of K, and we illustrate our method using data on first flowering times from Thoreau's woods (Concord, Massachusetts, USA).
Heterogeneous breakpoints in patients with acute lymphoblastic leukemia and the dic(9;20)(p11-13;q11) show recurrent involvement of genes at 20q11.21.

Science.gov (United States)

An, Qian; Wright, Sarah L; Moorman, Anthony V; Parker, Helen; Griffiths, Mike; Ross, Fiona M; Davies, Teresa; Harrison, Christine J; Strefford, Jon C

2009-08-01

The dic(9;20)(p11-13;q11) is a recurrent chromosomal abnormality in patients with acute lymphoblastic leukemia. Although it results in loss of material from 9p and 20q, the molecular targets on both chromosomes have not been fully elucidated. From an initial cohort of 58 with acute lymphoblastic leukemia patients with this translocation, breakpoint mapping with fluorescence in situ hybridization on 26 of them revealed breakpoint heterogeneity of both chromosomes. PAX5 has been proposed to be the target gene on 9p, while for 20q, FISH analysis implicated the involvement of the ASXL1 gene, either by a breakpoint within (n=4) or centromeric (deletion, n=12) of the gene. Molecular copy-number counting, long-distance inverse PCR and direct sequence analysis identified six dic(9;20) breakpoint sequences. In addition to the three previously reported: PAX5-ASXL1, PAX5-C20ORF112 and PAX5-KIF3B; we identified three new ones in this study: sequences 3' of PAX5 disrupting ASXL1, and ZCCHC7 disrupted by sequences 3' of FRG1B and LOC1499503. This study provides insight into the breakpoint complexity underlying dicentric chromosomal formation in acute lymphoblastic leukemia and highlights putative target gene loci.
Quartet-based methods to reconstruct phylogenetic networks.

Science.gov (United States)

Yang, Jialiang; Grünewald, Stefan; Xu, Yifei; Wan, Xiu-Feng

2014-02-20

Phylogenetic networks are employed to visualize evolutionary relationships among a group of nucleotide sequences, genes or species when reticulate events like hybridization, recombination, reassortant and horizontal gene transfer are believed to be involved. In comparison to traditional distance-based methods, quartet-based methods consider more information in the reconstruction process and thus have the potential to be more accurate. We introduce QuartetSuite, which includes a set of new quartet-based methods, namely QuartetS, QuartetA, and QuartetM, to reconstruct phylogenetic networks from nucleotide sequences. We tested their performances and compared them with other popular methods on two simulated nucleotide sequence data sets: one generated from a tree topology and the other from a complicated evolutionary history containing three reticulate events. We further validated these methods to two real data sets: a bacterial data set consisting of seven concatenated genes of 36 bacterial species and an influenza data set related to recently emerging H7N9 low pathogenic avian influenza viruses in China. QuartetS, QuartetA, and QuartetM have the potential to accurately reconstruct evolutionary scenarios from simple branching trees to complicated networks containing many reticulate events. These methods could provide insights into the understanding of complicated biological evolutionary processes such as bacterial taxonomy and reassortant of influenza viruses.
Ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses.

Science.gov (United States)

Fouquier, Jennifer; Rideout, Jai Ram; Bolyen, Evan; Chase, John; Shiffer, Arron; McDonald, Daniel; Knight, Rob; Caporaso, J Gregory; Kelley, Scott T

2016-02-24

Fungi play critical roles in many ecosystems, cause serious diseases in plants and animals, and pose significant threats to human health and structural integrity problems in built environments. While most fungal diversity remains unknown, the development of PCR primers for the internal transcribed spacer (ITS) combined with next-generation sequencing has substantially improved our ability to profile fungal microbial diversity. Although the high sequence variability in the ITS region facilitates more accurate species identification, it also makes multiple sequence alignment and phylogenetic analysis unreliable across evolutionarily distant fungi because the sequences are hard to align accurately. To address this issue, we created ghost-tree, a bioinformatics tool that integrates sequence data from two genetic markers into a single phylogenetic tree that can be used for diversity analyses. Our approach starts with a "foundation" phylogeny based on one genetic marker whose sequences can be aligned across organisms spanning divergent taxonomic groups (e.g., fungal families). Then, "extension" phylogenies are built for more closely related organisms (e.g., fungal species or strains) using a second more rapidly evolving genetic marker. These smaller phylogenies are then grafted onto the foundation tree by mapping taxonomic names such that each corresponding foundation-tree tip would branch into its new "extension tree" child. We applied ghost-tree to graft fungal extension phylogenies derived from ITS sequences onto a foundation phylogeny derived from fungal 18S sequences. Our analysis of simulated and real fungal ITS data sets found that phylogenetic distances between fungal communities computed using ghost-tree phylogenies explained significantly more variance than non-phylogenetic distances. The phylogenetic metrics also improved our ability to distinguish small differences (effect sizes) between microbial communities, though results were similar to non-phylogenetic
Conformation of phylogenetic relationship of Penaeidae shrimp based on morphometric and molecular investigations.

Science.gov (United States)

Rajakumaran, P; Vaseeharan, B; Jayakumar, R; Chidambara, R

2014-01-01

Understanding of accurate phylogenetic relationship among Penaeidae shrimp is important for academic and fisheries industry. The Morphometric and Randomly amplified polymorphic DNA (RAPD) analysis was used to make the phylogenetic relationsip among 13 Penaeidae shrimp. For morphometric analysis forty variables and total lengths of shrimp were measured for each species, and removed the effect of size variation. The size normalized values obtained was subjected to UPGMA (Unweighted Pair-Group Method with Arithmetic Mean) cluster analysis. For RAPD analysis, the four primers showed reliable differentiation between species, and used correlation coefficient between the DNA banding patterns of 13 Penaeidae species to construct UPGMA dendrogram. Phylogenetic relationship from morphometric and molecular analysis for Penaeidae species found to be congruent. We concluded that as the results from morphometry investigations concur with molecular one, phylogenetic relationship obtained for the studied Penaeidae are considered to be reliable.
EM for phylogenetic topology reconstruction on nonhomogeneous data.

Science.gov (United States)

Ibáñez-Marcelo, Esther; Casanellas, Marta

2014-06-17

The reconstruction of the phylogenetic tree topology of four taxa is, still nowadays, one of the main challenges in phylogenetics. Its difficulties lie in considering not too restrictive evolutionary models, and correctly dealing with the long-branch attraction problem. The correct reconstruction of 4-taxon trees is crucial for making quartet-based methods work and being able to recover large phylogenies. We adapt the well known expectation-maximization algorithm to evolutionary Markov models on phylogenetic 4-taxon trees. We then use this algorithm to estimate the substitution parameters, compute the corresponding likelihood, and to infer the most likely quartet. In this paper we consider an expectation-maximization method for maximizing the likelihood of (time nonhomogeneous) evolutionary Markov models on trees. We study its success on reconstructing 4-taxon topologies and its performance as input method in quartet-based phylogenetic reconstruction methods such as QFIT and QuartetSuite. Our results show that the method proposed here outperforms neighbor-joining and the usual (time-homogeneous continuous-time) maximum likelihood methods on 4-leaved trees with among-lineage instantaneous rate heterogeneity, and perform similarly to usual continuous-time maximum-likelihood when data satisfies the assumptions of both methods. The method presented in this paper is well suited for reconstructing the topology of any number of taxa via quartet-based methods and is highly accurate, specially regarding largely divergent trees and time nonhomogeneous data.
En Route towards European Clinical Breakpoints for Veterinary Antimicrobial Susceptibility Testing: A Position Paper Explaining the VetCAST Approach

Science.gov (United States)

Toutain, Pierre-Louis; Bousquet-Mélou, Alain; Damborg, Peter; Ferran, Aude A.; Mevius, Dik; Pelligand, Ludovic; Veldman, Kees T.; Lees, Peter

2017-01-01

VetCAST is the EUCAST sub-committee for Veterinary Antimicrobial Susceptibility Testing. Its remit is to define clinical breakpoints (CBPs) for antimicrobial drugs (AMDs) used in veterinary medicine in Europe. This position paper outlines the procedures and reviews scientific options to solve challenges for the determination of specific CBPs for animal species, drug substances and disease conditions. VetCAST will adopt EUCAST approaches: the initial step will be data assessment; then procedures for decisions on the CBP; and finally the release of recommendations for CBP implementation. The principal challenges anticipated by VetCAST are those associated with the differing modalities of AMD administration, including mass medication, specific long-acting product formulations or local administration. Specific challenges comprise mastitis treatment in dairy cattle, the range of species and within species breed considerations and several other variable factors not relevant to human medicine. Each CBP will be based on consideration of: (i) an epidemiological cut-off value (ECOFF) – the highest MIC that defines the upper end of the wild-type MIC distribution; (ii) a PK/PD breakpoint obtained from pre-clinical pharmacokinetic data [this PK/PD break-point is the highest possible MIC for which a given percentage of animals in the target population achieves a critical value for the selected PK/PD index (fAUC/MIC or fT > MIC)] and (iii) when possible, a clinical cut-off, that is the relationship between MIC and clinical cure. For the latter, VetCAST acknowledges the paucity of such data in veterinary medicine. When a CBP cannot be established, VetCAST will recommend use of ECOFF as surrogate. For decision steps, VetCAST will follow EUCAST procedures involving transparency, consensus and independence. VetCAST will ensure freely available dissemination of information, concerning standards, guidelines, ECOFF, PK/PD breakpoints, CBPs and other relevant information for AST
En Route towards European Clinical Breakpoints for Veterinary Antimicrobial Susceptibility Testing: A Position Paper Explaining the VetCAST Approach.

Science.gov (United States)

Toutain, Pierre-Louis; Bousquet-Mélou, Alain; Damborg, Peter; Ferran, Aude A; Mevius, Dik; Pelligand, Ludovic; Veldman, Kees T; Lees, Peter

2017-01-01

VetCAST is the EUCAST sub-committee for Veterinary Antimicrobial Susceptibility Testing. Its remit is to define clinical breakpoints (CBPs) for antimicrobial drugs (AMDs) used in veterinary medicine in Europe. This position paper outlines the procedures and reviews scientific options to solve challenges for the determination of specific CBPs for animal species, drug substances and disease conditions. VetCAST will adopt EUCAST approaches: the initial step will be data assessment; then procedures for decisions on the CBP; and finally the release of recommendations for CBP implementation. The principal challenges anticipated by VetCAST are those associated with the differing modalities of AMD administration, including mass medication, specific long-acting product formulations or local administration. Specific challenges comprise mastitis treatment in dairy cattle, the range of species and within species breed considerations and several other variable factors not relevant to human medicine. Each CBP will be based on consideration of: (i) an epidemiological cut-off value (ECOFF) - the highest MIC that defines the upper end of the wild-type MIC distribution; (ii) a PK/PD breakpoint obtained from pre-clinical pharmacokinetic data [this PK/PD break-point is the highest possible MIC for which a given percentage of animals in the target population achieves a critical value for the selected PK/PD index ( f AUC/MIC or f T > MIC)] and (iii) when possible, a clinical cut-off, that is the relationship between MIC and clinical cure. For the latter, VetCAST acknowledges the paucity of such data in veterinary medicine. When a CBP cannot be established, VetCAST will recommend use of ECOFF as surrogate. For decision steps, VetCAST will follow EUCAST procedures involving transparency, consensus and independence. VetCAST will ensure freely available dissemination of information, concerning standards, guidelines, ECOFF, PK/PD breakpoints, CBPs and other relevant information for AST
En Route towards European Clinical Breakpoints for Veterinary Antimicrobial Susceptibility Testing: A Position Paper Explaining the VetCAST Approach

Directory of Open Access Journals (Sweden)

Pierre-Louis Toutain

2017-12-01

Full Text Available VetCAST is the EUCAST sub-committee for Veterinary Antimicrobial Susceptibility Testing. Its remit is to define clinical breakpoints (CBPs for antimicrobial drugs (AMDs used in veterinary medicine in Europe. This position paper outlines the procedures and reviews scientific options to solve challenges for the determination of specific CBPs for animal species, drug substances and disease conditions. VetCAST will adopt EUCAST approaches: the initial step will be data assessment; then procedures for decisions on the CBP; and finally the release of recommendations for CBP implementation. The principal challenges anticipated by VetCAST are those associated with the differing modalities of AMD administration, including mass medication, specific long-acting product formulations or local administration. Specific challenges comprise mastitis treatment in dairy cattle, the range of species and within species breed considerations and several other variable factors not relevant to human medicine. Each CBP will be based on consideration of: (i an epidemiological cut-off value (ECOFF – the highest MIC that defines the upper end of the wild-type MIC distribution; (ii a PK/PD breakpoint obtained from pre-clinical pharmacokinetic data [this PK/PD break-point is the highest possible MIC for which a given percentage of animals in the target population achieves a critical value for the selected PK/PD index (fAUC/MIC or fT > MIC] and (iii when possible, a clinical cut-off, that is the relationship between MIC and clinical cure. For the latter, VetCAST acknowledges the paucity of such data in veterinary medicine. When a CBP cannot be established, VetCAST will recommend use of ECOFF as surrogate. For decision steps, VetCAST will follow EUCAST procedures involving transparency, consensus and independence. VetCAST will ensure freely available dissemination of information, concerning standards, guidelines, ECOFF, PK/PD breakpoints, CBPs and other relevant information

Phylogenetic comparative methods on phylogenetic networks with reticulations.

Science.gov (United States)

Bastide, Paul; Solís-Lemus, Claudia; Kriebel, Ricardo; Sparks, K William; Ané, Cécile

2018-04-25

The goal of Phylogenetic Comparative Methods (PCMs) is to study the distribution of quantitative traits among related species. The observed traits are often seen as the result of a Brownian Motion (BM) along the branches of a phylogenetic tree. Reticulation events such as hybridization, gene flow or horizontal gene transfer, can substantially affect a species' traits, but are not modeled by a tree. Phylogenetic networks have been designed to represent reticulate evolution. As they become available for downstream analyses, new models of trait evolution are needed, applicable to networks. One natural extension of the BM is to use a weighted average model for the trait of a hybrid, at a reticulation point. We develop here an efficient recursive algorithm to compute the phylogenetic variance matrix of a trait on a network, in only one preorder traversal of the network. We then extend the standard PCM tools to this new framework, including phylogenetic regression with covariates (or phylogenetic ANOVA), ancestral trait reconstruction, and Pagel's λ test of phylogenetic signal. The trait of a hybrid is sometimes outside of the range of its two parents, for instance because of hybrid vigor or hybrid depression. These two phenomena are rather commonly observed in present-day hybrids. Transgressive evolution can be modeled as a shift in the trait value following a reticulation point. We develop a general framework to handle such shifts, and take advantage of the phylogenetic regression view of the problem to design statistical tests for ancestral transgressive evolution in the evolutionary history of a group of species. We study the power of these tests in several scenarios, and show that recent events have indeed the strongest impact on the trait distribution of present-day taxa. We apply those methods to a dataset of Xiphophorus fishes, to confirm and complete previous analysis in this group. All the methods developed here are available in the Julia package PhyloNetworks.
Genomic instability in rat: Breakpoints induced by ionising radiation and interstitial telomeric-like sequences

International Nuclear Information System (INIS)

Camats, Nuria; Ruiz-Herrera, Aurora; Parrilla, Juan Jose; Acien, Maribel; Paya, Pilar; Giulotto, Elena; Egozcue, Josep; Garcia, Francisca; Garcia, Montserrat

2006-01-01

The Norwegian rat (Rattus norvegicus) is the most widely studied experimental species in biomedical research although little is known about its chromosomal structure. The characterisation of possible unstable regions of the karyotype of this species would contribute to the better understanding of its genomic architecture. The cytogenetic effects of ionising radiation have been widely used for the study of genomic instability, and the importance of interstitial telomeric-like sequences (ITSs) in instability of the genome has also been reported in previous studies in vertebrates. In order to describe the unstable chromosomal regions of R. norvegicus, the distribution of breakpoints induced by X-irradiation and ITSs in its karyotype were analysed in this work. For the X-irradiation analysis, 52 foetuses (from 14 irradiated rats) were studied, 4803 metaphases were analysed, and a total of 456 breakpoints induced by X-rays were detected, located in 114 chromosomal bands, with 25 of them significantly affected by X-irradiation (hot spots). For the analysis of ITSs, three foetuses (from three rats) were studied, 305 metaphases were analysed and 121 ITSs were detected, widely distributed in the karyotype of this species. Seventy-six percent of all hot spots analysed in this study were co-localised with ITSs
Genomic instability in rat: Breakpoints induced by ionising radiation and interstitial telomeric-like sequences

Energy Technology Data Exchange (ETDEWEB)

Camats, Nuria [Institut de Biotecnologia i Biomedicina (IBB), Universitat Autonoma de Barcelona, 08193 Barcelona (Spain); Departament de Biologia Cel.lular, Fisiologia i Immunologia Universitat Autonoma de Barcelona, 08193 Barcelona (Spain); Ruiz-Herrera, Aurora [Departament de Biologia Cel.lular, Fisiologia i Immunologia Universitat Autonoma de Barcelona, 08193 Barcelona (Spain); Parrilla, Juan Jose [Servicio de Ginecologia y Obstetricia, Hospital Universitario Virgen de la Arrixaca, Ctra, Madrid-Cartagena, s/n, El Palmar, 30120 Murcia (Spain); Acien, Maribel [Servicio de Ginecologia y Obstetricia, Hospital Universitario Virgen de la Arrixaca, Ctra, Madrid-Cartagena, s/n, El Palmar, 30120 Murcia (Spain); Paya, Pilar [Servicio de Ginecologia y Obstetricia, Hospital Universitario Virgen de la Arrixaca, Ctra, Madrid-Cartagena, s/n, El Palmar, 30120 Murcia (Spain); Giulotto, Elena [Dipartimento di Genetica e Microbiologia Adriano Buzzati Traverso, Universita degli Studi di Pavia, 27100 Pavia (Italy); Egozcue, Josep [Departament de Biologia Cel.lular, Fisiologia i Immunologia Universitat Autonoma de Barcelona, 08193 Barcelona (Spain); Garcia, Francisca [Institut de Biotecnologia i Biomedicina (IBB), Universitat Autonoma de Barcelona, 08193 Barcelona (Spain); Garcia, Montserrat [Institut de Biotecnologia i Biomedicina (IBB), Universitat Autonoma de Barcelona, 08193 Barcelona (Spain) and Departament de Biologia Cellular, Fisiologia i Immunologia Universitat Autonoma de Barcelona, 08193 Barcelona (Spain)]. E-mail: Montserrat.Garcia.Caldes@uab.es

2006-03-20

The Norwegian rat (Rattus norvegicus) is the most widely studied experimental species in biomedical research although little is known about its chromosomal structure. The characterisation of possible unstable regions of the karyotype of this species would contribute to the better understanding of its genomic architecture. The cytogenetic effects of ionising radiation have been widely used for the study of genomic instability, and the importance of interstitial telomeric-like sequences (ITSs) in instability of the genome has also been reported in previous studies in vertebrates. In order to describe the unstable chromosomal regions of R. norvegicus, the distribution of breakpoints induced by X-irradiation and ITSs in its karyotype were analysed in this work. For the X-irradiation analysis, 52 foetuses (from 14 irradiated rats) were studied, 4803 metaphases were analysed, and a total of 456 breakpoints induced by X-rays were detected, located in 114 chromosomal bands, with 25 of them significantly affected by X-irradiation (hot spots). For the analysis of ITSs, three foetuses (from three rats) were studied, 305 metaphases were analysed and 121 ITSs were detected, widely distributed in the karyotype of this species. Seventy-six percent of all hot spots analysed in this study were co-localised with ITSs.
Phylogenetic turnover during subtropical forest succession across environmental and phylogenetic scales.

Science.gov (United States)

Purschke, Oliver; Michalski, Stefan G; Bruelheide, Helge; Durka, Walter

2017-12-01

Although spatial and temporal patterns of phylogenetic community structure during succession are inherently interlinked and assembly processes vary with environmental and phylogenetic scales, successional studies of community assembly have yet to integrate spatial and temporal components of community structure, while accounting for scaling issues. To gain insight into the processes that generate biodiversity after disturbance, we combine analyses of spatial and temporal phylogenetic turnover across phylogenetic scales, accounting for covariation with environmental differences. We compared phylogenetic turnover, at the species- and individual-level, within and between five successional stages, representing woody plant communities in a subtropical forest chronosequence. We decomposed turnover at different phylogenetic depths and assessed its covariation with between-plot abiotic differences. Phylogenetic turnover between stages was low relative to species turnover and was not explained by abiotic differences. However, within the late-successional stages, there was high presence-/absence-based turnover (clustering) that occurred deep in the phylogeny and covaried with environmental differentiation. Our results support a deterministic model of community assembly where (i) phylogenetic composition is constrained through successional time, but (ii) toward late succession, species sorting into preferred habitats according to niche traits that are conserved deep in phylogeny, becomes increasingly important.
Phylogenetic signal dissection identifies the root of starfishes.

Directory of Open Access Journals (Sweden)

Roberto Feuda

Full Text Available Relationships within the class Asteroidea have remained controversial for almost 100 years and, despite many attempts to resolve this problem using molecular data, no consensus has yet emerged. Using two nuclear genes and a taxon sampling covering the major asteroid clades we show that non-phylogenetic signal created by three factors--Long Branch Attraction, compositional heterogeneity and the use of poorly fitting models of evolution--have confounded accurate estimation of phylogenetic relationships. To overcome the effect of this non-phylogenetic signal we analyse the data using non-homogeneous models, site stripping and the creation of subpartitions aimed to reduce or amplify the systematic error, and calculate Bayes Factor support for a selection of previously suggested topological arrangements of asteroid orders. We show that most of the previous alternative hypotheses are not supported in the most reliable data partitions, including the previously suggested placement of either Forcipulatida or Paxillosida as sister group to the other major branches. The best-supported solution places Velatida as the sister group to other asteroids, and the implications of this finding for the morphological evolution of asteroids are presented.
Molecular cloning of the papillary renal cell carcinoma-associated translocation (X;1)(p11;q21) breakpoint

NARCIS (Netherlands)

Weterman, MAJ; Janssen, [No Value; Janssen, HAP; vandenBerg, E; Fisher, SE; Craig, [No Value; vanKessel, AG

1996-01-01

A combination of Southern blot analysis on a panel of tumor-derived somatic cell hybrids and fluorescence in situ hybridization techniques was used to map YACs, cosmids and DNA markers from the Xp11.2 region relative to the X chromosome breakpoint of the renal cell carcinoma-associated
An Improved Binary Differential Evolution Algorithm to Infer Tumor Phylogenetic Trees.

Science.gov (United States)

Liang, Ying; Liao, Bo; Zhu, Wen

2017-01-01

Tumourigenesis is a mutation accumulation process, which is likely to start with a mutated founder cell. The evolutionary nature of tumor development makes phylogenetic models suitable for inferring tumor evolution through genetic variation data. Copy number variation (CNV) is the major genetic marker of the genome with more genes, disease loci, and functional elements involved. Fluorescence in situ hybridization (FISH) accurately measures multiple gene copy number of hundreds of single cells. We propose an improved binary differential evolution algorithm, BDEP, to infer tumor phylogenetic tree based on FISH platform. The topology analysis of tumor progression tree shows that the pathway of tumor subcell expansion varies greatly during different stages of tumor formation. And the classification experiment shows that tree-based features are better than data-based features in distinguishing tumor. The constructed phylogenetic trees have great performance in characterizing tumor development process, which outperforms other similar algorithms.
Bayesian phylogenetic estimation of fossil ages.

Science.gov (United States)

Drummond, Alexei J; Stadler, Tanja

2016-07-19

Recent advances have allowed for both morphological fossil evidence and molecular sequences to be integrated into a single combined inference of divergence dates under the rule of Bayesian probability. In particular, the fossilized birth-death tree prior and the Lewis-Mk model of discrete morphological evolution allow for the estimation of both divergence times and phylogenetic relationships between fossil and extant taxa. We exploit this statistical framework to investigate the internal consistency of these models by producing phylogenetic estimates of the age of each fossil in turn, within two rich and well-characterized datasets of fossil and extant species (penguins and canids). We find that the estimation accuracy of fossil ages is generally high with credible intervals seldom excluding the true age and median relative error in the two datasets of 5.7% and 13.2%, respectively. The median relative standard error (RSD) was 9.2% and 7.2%, respectively, suggesting good precision, although with some outliers. In fact, in the two datasets we analyse, the phylogenetic estimate of fossil age is on average less than 2 Myr from the mid-point age of the geological strata from which it was excavated. The high level of internal consistency found in our analyses suggests that the Bayesian statistical model employed is an adequate fit for both the geological and morphological data, and provides evidence from real data that the framework used can accurately model the evolution of discrete morphological traits coded from fossil and extant taxa. We anticipate that this approach will have diverse applications beyond divergence time dating, including dating fossils that are temporally unconstrained, testing of the 'morphological clock', and for uncovering potential model misspecification and/or data errors when controversial phylogenetic hypotheses are obtained based on combined divergence dating analyses.This article is part of the themed issue 'Dating species divergences using
Over half of breakpoints in gene pairs involved in cancer-specific recurrent translocations are mapped to human chromosomal fragile sites

Directory of Open Access Journals (Sweden)

Pierce Levi CT

2009-01-01

Full Text Available Abstract Background Gene rearrangements such as chromosomal translocations have been shown to contribute to cancer development. Human chromosomal fragile sites are regions of the genome especially prone to breakage, and have been implicated in various chromosome abnormalities found in cancer. However, there has been no comprehensive and quantitative examination of the location of fragile sites in relation to all chromosomal aberrations. Results Using up-to-date databases containing all cancer-specific recurrent translocations, we have examined 444 unique pairs of genes involved in these translocations to determine the correlation of translocation breakpoints and fragile sites in the gene pairs. We found that over half (52% of translocation breakpoints in at least one gene of these gene pairs are mapped to fragile sites. Among these, we examined the DNA sequences within and flanking three randomly selected pairs of translocation-prone genes, and found that they exhibit characteristic features of fragile DNA, with frequent AT-rich flexibility islands and the potential of forming highly stable secondary structures. Conclusion Our study is the first to examine gene pairs involved in all recurrent chromosomal translocations observed in tumor cells, and to correlate the location of more than half of breakpoints to positions of known fragile sites. These results provide strong evidence to support a causative role for fragile sites in the generation of cancer-specific chromosomal rearrangements.
Molecular breakpoint cloning and gene expression studies of a novel translocation t(4;15(q27;q11.2 associated with Prader-Willi syndrome

Directory of Open Access Journals (Sweden)

Slater Howard R

2005-05-01

Full Text Available Abstract Background Prader-Willi syndrome (MIM #176270; PWS is caused by lack of the paternally-derived copies, or their expression, of multiple genes in a 4 Mb region on chromosome 15q11.2. Known mechanisms include large deletions, maternal uniparental disomy or mutations involving the imprinting center. De novo balanced reciprocal translocations in 5 reported individuals had breakpoints clustering in SNRPN intron 2 or exon 20/intron 20. To further dissect the PWS phenotype and define the minimal critical region for PWS features, we have studied a 22 year old male with a milder PWS phenotype and a de novo translocation t(4;15(q27;q11.2. Methods We used metaphase FISH to narrow the breakpoint region and molecular analyses to map the breakpoints on both chromosomes at the nucleotide level. The expression of genes on chromosome 15 on both sides of the breakpoint was determined by RT-PCR analyses. Results Pertinent clinical features include neonatal hypotonia with feeding difficulties, hypogonadism, short stature, late-onset obesity, learning difficulties, abnormal social behavior and marked tolerance to pain, as well as sticky saliva and narcolepsy. Relative macrocephaly and facial features are not typical for PWS. The translocation breakpoints were identified within SNRPN intron 17 and intron 10 of a spliced non-coding transcript in band 4q27. LINE and SINE sequences at the exchange points may have contributed to the translocation event. By RT-PCR of lymphoblasts and fibroblasts, we find that upstream SNURF/SNRPN exons and snoRNAs HBII-437 and HBII-13 are expressed, but the downstream snoRNAs PWCR1/HBII-85 and HBII-438A/B snoRNAs are not. Conclusion As part of the PWCR1/HBII-85 snoRNA cluster is highly conserved between human and mice, while no copy of HBII-438 has been found in mouse, we conclude that PWCR1/HBII-85 snoRNAs is likely to play a major role in the PWS- phenotype.
Revised Ciprofloxacin Breakpoints for Salmonella: Is it Time to Write an Obituary?

Science.gov (United States)

Girish, Revathy; Kumar, Anil; Khan, Sadia; Dinesh, Kavitha R; Karim, Shamsul

2013-11-01

To determine the minimum inhibitory concentration of ciprofloxacin among 50 blood stream isolates of Salmonella enterica. A total of 50 consecutive isolates of Salmonella enterica were tested for susceptibility to antimicrobials using the Kirby Bauer disk diffusion method. Minimum inhibitory concentrations were determined using Hi-Comb strips. All results were interpreted according to the CLSI guidelines. Of the 50 isolates 70%were Salmonella Typhi, 4% Salmonella paratyphi A, 2% Salmonella paratyphi B and the remaining 10% were identified only as Salmonella species. Using the CLSI 2011 breakpoints for disc diffusion, 86% (43/50) were resistant to nalidixic acid(NA), 22% (11/50) to ciprofloxacin, 12% to azithromycin, 6% to cotrimoxazole, 4% to ampicillin and 1% to chloramphenicol. The MIC50 and MIC90 of ciprofloxacin for S.Typhi were 0.181 μg/mL and 5.06 μg/mL respectively. While the same for S. paratyphi A was 0.212μg/mL and 0.228μg/mL respectively. None of the isolates were multi drug resistant and all were susceptible to ceftriaxone. Using the CLSI 2012 revised ciprofloxacin breakpoints for disc diffusion (>31mm) & MIC (<0.06 μg/mL), 90% (45/50) of these isolates were found to be resistant. MIC's of ciprofloxacin should be reported for all salmonella isolates and should be used to guide treatment. Blindly following western guidelines for a disease which is highly endemic in the subcontinent will spell the death knell of a cheap and effective drug in our armamentarium. Therefore it will be too premature to declare that "the concept of using ciprofloxacin in typhoid fever is dead!"
Automatic selection of reference taxa for protein-protein interaction prediction with phylogenetic profiling

DEFF Research Database (Denmark)

Simonsen, Martin; Maetschke, S.R.; Ragan, M.A.

2012-01-01

Motivation: Phylogenetic profiling methods can achieve good accuracy in predicting protein–protein interactions, especially in prokaryotes. Recent studies have shown that the choice of reference taxa (RT) is critical for accurate prediction, but with more than 2500 fully sequenced taxa publicly......: We present three novel methods for automating the selection of RT, using machine learning based on known protein–protein interaction networks. One of these methods in particular, Tree-Based Search, yields greatly improved prediction accuracies. We further show that different methods for constituting...... phylogenetic profiles often require very different RT sets to support high prediction accuracy....
BIMLR: a method for constructing rooted phylogenetic networks from rooted phylogenetic trees.

Science.gov (United States)

Wang, Juan; Guo, Maozu; Xing, Linlin; Che, Kai; Liu, Xiaoyan; Wang, Chunyu

2013-09-15

Rooted phylogenetic trees constructed from different datasets (e.g. from different genes) are often conflicting with one another, i.e. they cannot be integrated into a single phylogenetic tree. Phylogenetic networks have become an important tool in molecular evolution, and rooted phylogenetic networks are able to represent conflicting rooted phylogenetic trees. Hence, the development of appropriate methods to compute rooted phylogenetic networks from rooted phylogenetic trees has attracted considerable research interest of late. The CASS algorithm proposed by van Iersel et al. is able to construct much simpler networks than other available methods, but it is extremely slow, and the networks it constructs are dependent on the order of the input data. Here, we introduce an improved CASS algorithm, BIMLR. We show that BIMLR is faster than CASS and less dependent on the input data order. Moreover, BIMLR is able to construct much simpler networks than almost all other methods. BIMLR is available at http://nclab.hit.edu.cn/wangjuan/BIMLR/. © 2013 Elsevier B.V. All rights reserved.
Characterization of IGH locus breakpoints in multiple myeloma indicates a subset of translocations appear to occur in pregerminal center B cells.

Science.gov (United States)

Walker, Brian A; Wardell, Christopher P; Johnson, David C; Kaiser, Martin F; Begum, Dil B; Dahir, Nasrin B; Ross, Fiona M; Davies, Faith E; Gonzalez, David; Morgan, Gareth J

2013-04-25

Translocations in myeloma are thought to occur solely in mature B cells in the germinal center through class switch recombination (CSR). We used a targeted captured technique followed by massively parallel sequencing to determine the exact breakpoints in both the immunoglobulin heavy chain (IGH) locus and the partner chromosome in 61 presentation multiple myeloma samples. The majority of samples (62%) have a breakpoint within the switch regions upstream of the IGH constant genes and are generated through CSR in a mature B cell. However, the proportion of CSR translocations is not consistent between cytogenetic subgroups. We find that 100% of t(4;14) are CSR-mediated; however, 21% of t(11;14) and 25% of t(14;20) are generated through DH-JH recombination activation gene-mediated mechanisms, indicating they occur earlier in B-cell development at the pro-B-cell stage in the bone marrow. These 2 groups also generate translocations through receptor revision, as determined by the breakpoints and mutation status of the segments used in 10% and 50% of t(11;14) and t(14;20) samples, respectively. The study indicates that in a significant number of cases the translocation-based etiological events underlying myeloma may arise at the pro-B-cell hematological progenitor cell level, much earlier in B-cell development than was previously thought.
Sequence Capture and Phylogenetic Utility of Genomic Ultraconserved Elements Obtained from Pinned Insect Specimens.

Directory of Open Access Journals (Sweden)

Bonnie B Blaimer

Full Text Available Obtaining sequence data from historical museum specimens has been a growing research interest, invigorated by next-generation sequencing methods that allow inputs of highly degraded DNA. We applied a target enrichment and next-generation sequencing protocol to generate ultraconserved elements (UCEs from 51 large carpenter bee specimens (genus Xylocopa, representing 25 species with specimen ages ranging from 2-121 years. We measured the correlation between specimen age and DNA yield (pre- and post-library preparation DNA concentration and several UCE sequence capture statistics (raw read count, UCE reads on target, UCE mean contig length and UCE locus count with linear regression models. We performed piecewise regression to test for specific breakpoints in the relationship of specimen age and DNA yield and sequence capture variables. Additionally, we compared UCE data from newer and older specimens of the same species and reconstructed their phylogeny in order to confirm the validity of our data. We recovered 6-972 UCE loci from samples with pre-library DNA concentrations ranging from 0.06-9.8 ng/μL. All investigated DNA yield and sequence capture variables were significantly but only moderately negatively correlated with specimen age. Specimens of age 20 years or less had significantly higher pre- and post-library concentrations, UCE contig lengths, and locus counts compared to specimens older than 20 years. We found breakpoints in our data indicating a decrease of the initial detrimental effect of specimen age on pre- and post-library DNA concentration and UCE contig length starting around 21-39 years after preservation. Our phylogenetic results confirmed the integrity of our data, giving preliminary insights into relationships within Xylocopa. We consider the effect of additional factors not measured in this study on our age-related sequence capture results, such as DNA fragmentation and preservation method, and discuss the promise of the UCE
Searching for genes for cleft lip and/or palate based on breakpoint analysis of a balanced translocation t(9;17)(q32;q12).

Science.gov (United States)

Machida, Junichiro; Félix, Têmis M; Murray, Jeffrey C; Yoshiura, Koh-ichiro; Tanemura, Mitsuyo; Kamamoto, Munefumi; Shimozato, Kazuo; Sonta, Shin-ichi; Ono, Takao

2009-09-01

Identification of the breakpoints of disease-associated chromosome rearrangements can provide informative clues to a positional cloning approach for genes responsible for inherited diseases. Recently, we found a three-generation Japanese family segregating balanced chromosome translocation t(9;17)(q32;q12). One of the subjects had cleft lip and palate. We examined whether regions near the breakpoint could be associated with cleft lip and/or palate. We determined the breakpoints involved in the translocation by fluorescence in situ hybridization analysis and subsequent long-range polymerase chain reaction. In order to study the role of these disrupted regions in nonsyndromic cleft lip and/or palate, we performed mutation analysis and a haplotype-based transmission disequilibrium test using tagging single-nucleotide polymorphisms in the flanking regions of the breakpoints in white and Filipino nonsyndromic cleft lip and/or palate populations. Sequence analysis demonstrated that two genes, SLC31A1 (solute carrier family 31 member 1) on chromosome 9 and CCL2 (chemokine ligand 2) on chromosome 17, were rearranged with the breaks occurring within their introns. It is interesting that SLC31A1 lies closed to BSPRY (B-box and SPRY domain), which is a candidate for involvement with cleft lip and/or palate. Some of the variants in BSPRY and CCL2 showed significant p values in the cleft lip and/or palate population compared with the control population. There was also statistically significant evidence of transmission distortion for haplotypes on both chromosomes 9 and 17. The data support previous reports that genes on chromosomal regions of 9q and 17q play an important role in facial development.
Phylogenetic turnover during subtropical forest succession across environmental and phylogenetic scales

OpenAIRE

Purschke, Oliver; Michalski, Stefan G.; Bruelheide, Helge; Durka, Walter

2017-01-01

Abstract Although spatial and temporal patterns of phylogenetic community structure during succession are inherently interlinked and assembly processes vary with environmental and phylogenetic scales, successional studies of community assembly have yet to integrate spatial and temporal components of community structure, while accounting for scaling issues. To gain insight into the processes that generate biodiversity after disturbance, we combine analyses of spatial and temporal phylogenetic ...
A high-resolution comparative map between pig chromosome 17 and human chromosomes 4, 8, and 20: Identification of synteny breakpoints

DEFF Research Database (Denmark)

Lahbib-Mansais, Yvette; Karlskov-Mortensen, Peter; Mompart, Florence

2005-01-01

We report on the construction of a high-resolution comparative map of porcine chromosome 17 (SSC17) focusing on evolutionary breakpoints with human chromosomes. The comparative map shows high homology with human chromosome 20 but suggests more limited homologies with other human chromosomes. SSC1...
Detection of three common translocation breakpoints in non-Hodgkin's lymphomas by fluorescence in situ hybridization on routine paraffin-embedded tissue sections

NARCIS (Netherlands)

Haralambieva, E; Kleiverda, K; Mason, DY; Schuuring, E; Kluin, PM

2002-01-01

Non-random chromosomal translocations are specifically involved in the pathogenesis of many non-Hodgkin's lymphomas and have clinical implications as diagnostic and/or prognostic markers. Their detection is often impaired by technical problems, including the distribution of the breakpoints over
The origin, global distribution, and functional impact of the human 8p23 inversion polymorphism.

Science.gov (United States)

Salm, Maximilian P A; Horswell, Stuart D; Hutchison, Claire E; Speedy, Helen E; Yang, Xia; Liang, Liming; Schadt, Eric E; Cookson, William O; Wierzbicki, Anthony S; Naoumova, Rossi P; Shoulders, Carol C

2012-06-01

Genomic inversions are an increasingly recognized source of genetic variation. However, a lack of reliable high-throughput genotyping assays for these structures has precluded a full understanding of an inversion's phylogenetic, phenotypic, and population genetic properties. We characterize these properties for one of the largest polymorphic inversions in man (the ∼4.5-Mb 8p23.1 inversion), a structure that encompasses numerous signals of natural selection and disease association. We developed and validated a flexible bioinformatics tool that utilizes SNP data to enable accurate, high-throughput genotyping of the 8p23.1 inversion. This tool was applied retrospectively to diverse genome-wide data sets, revealing significant population stratification that largely follows a clinal "serial founder effect" distribution model. Phylogenetic analyses establish the inversion's ancestral origin within the Homo lineage, indicating that 8p23.1 inversion has occurred independently in the Pan lineage. The human inversion breakpoint was localized to an inverted pair of human endogenous retrovirus elements within the large, flanking low-copy repeats; experimental validation of this breakpoint confirmed these elements as the likely intermediary substrates that sponsored inversion formation. In five data sets, mRNA levels of disease-associated genes were robustly associated with inversion genotype. Moreover, a haplotype associated with systemic lupus erythematosus was restricted to the derived inversion state. We conclude that the 8p23.1 inversion is an evolutionarily dynamic structure that can now be accommodated into the understanding of human genetic and phenotypic diversity.

Untangling hybrid phylogenetic signals: horizontal gene transfer and artifacts of phylogenetic reconstruction.

Science.gov (United States)

Beiko, Robert G; Ragan, Mark A

2009-01-01

Phylogenomic methods can be used to investigate the tangled evolutionary relationships among genomes. Building 'all the trees of all the genes' can potentially identify common pathways of horizontal gene transfer (HGT) among taxa at varying levels of phylogenetic depth. Phylogenetic affinities can be aggregated and merged with the information about genetic linkage and biochemical function to examine hypotheses of adaptive evolution via HGT. Additionally, the use of many genetic data sets increases the power of statistical tests for phylogenetic artifacts. However, large-scale phylogenetic analyses pose several challenges, including the necessary abandonment of manual validation techniques, the need to translate inferred phylogenetic discordance into inferred HGT events, and the challenges involved in aggregating results from search-based inference methods. In this chapter we describe a tree search procedure to recover the most parsimonious pathways of HGT, and examine some of the assumptions that are made by this method.
Breakpoint of an inversion of chromosome 14 in a T-cell leukemia: sequences downstream of the immunoglobulin heavy chain locus are implicated in tumorigenesis

International Nuclear Information System (INIS)

Baer, R.; Heppell, A.; Taylor, A.M.R.; Rabbitts, P.H.; Boullier, B.; Rabbitts, T.H.

1987-01-01

T-cell tumors are characterized by inversions or translocations of chromosome 14. The breakpoints of these karyotypic abnormalities occur in chromosome bands 14q11 and 14q32 - the same bands in which the T-cell receptor (TCR) α-chain and immunoglobulin heavy chain genes have been mapped, respectively. Patients with ataxia-telangiectasia are particularly prone to development of T-cell chronic lymphocytic leukemia with such chromosomal abnormalities. The authors describe DNA rearrangements of the TCR α-chain gene in an ataxia-telangiectasia-associated leukemia containing both a normal and an inverted chromosome 14. The normal chromosome 14 has undergone a productive join of TCR α-chain variable (V/sub α/) and joining (J/sub α/) gene segments. The other allele of the TCR α-chain gene features a DNA rearrangement, about 50 kilobases from the TCR α-chain constant (C/sub α/) gene, that represents the breakpoint of the chromosome 14 inversion; this breakpoint is comprised of a TCR J/sub α/) segment (from 14q11) fused to sequences derived from 14q32 but on the centromeric side of C/sub μ/. These results imply that 14q32 sequences located at an undetermined distance downstream of immunoglobulin C/sub μ/ locus can contribute to the development of T-cell tumors
Evolutionary history of the third chromosome gene arrangements of Drosophila pseudoobscura inferred from inversion breakpoints.

Science.gov (United States)

Wallace, Andre G; Detweiler, Don; Schaeffer, Stephen W

2011-08-01

The third chromosome of Drosophila pseudoobscura is polymorphic for numerous gene arrangements that form classical clines in North America. The polytene salivary chromosomes isolated from natural populations revealed changes in gene order that allowed the different gene arrangements to be linked together by paracentric inversions representing one of the first cases where genetic data were used to construct a phylogeny. Although the inversion phylogeny can be used to determine the relationships among the gene arrangements, the cytogenetic data are unable to infer the ancestral arrangement or the age of the different chromosome types. These are both important properties if one is to infer the evolutionary forces responsible for the spread and maintenance of the chromosomes. Here, we employ the nucleotide sequences of 18 regions distributed across the third chromosome in 80-100 D. pseudoobscura strains to test whether five gene arrangements are of unique or multiple origin, what the ancestral arrangement was, and what are the ages of the different arrangements. Each strain carried one of six commonly found gene arrangements and the sequences were used to infer their evolutionary relationships. Breakpoint regions in the center of the chromosome supported monophyly of the gene arrangements, whereas regions at the ends of the chromosome gave phylogenies that provided less support for monophyly of the chromosomes either because the individual markers did not have enough phylogenetically informative sites or genetic exchange scrambled information among the gene arrangements. A data set where the genetic markers were concatenated strongly supported a unique origin of the different gene arrangements. The inversion polymorphism of D. pseudoobscura is estimated to be about a million years old. We have also shown that the generated phylogeny is consistent with the cytological phylogeny of this species. In addition, the data presented here support hypothetical as the ancestral
Susceptibility breakpoints and target values for therapeutic drug monitoring of voriconazole and Aspergillus fumigatus in an in vitro pharmacokinetic/pharmacodynamic model

NARCIS (Netherlands)

Siopi, M.; Mavridou, E.; Mouton, J.W.; Verweij, P.E.; Zerva, L.; Meletiadis, J.

2014-01-01

BACKGROUND: Although voriconazole reached the bedside 10 years ago and became the standard care in the treatment of invasive aspergillosis, reliable clinical breakpoints are still in high demand. Moreover, this has increased due to the recent emergence of azole resistance. METHODS: Four clinical
Tourette syndrome in a pedigree with a 7;18 translocation: Identification of a YAC spanning the translocation breakpoint at 18q22.3

Energy Technology Data Exchange (ETDEWEB)

Boghosian-Sell, L.; Overhauser, J. [Thomas Jefferson Univ., Philadelphia, PA (United States); Comings, D.E. [City of Hope Medical Center, Duarte, CA (United States)

1996-11-01

Tourette syndrome is a neuropsychiatric disorder characterized by the presence of multiple, involuntary motor and vocal tics. Associated pathologies include attention deficit disorder and obsessive-compulsive disorder (OCD). Extensive linkage analysis based on an autosomal dominant mode of transmission with reduced penetrance has failed to show linkage with polymorphic markers, suggesting either locus heterogeneity or a polygenic origin for Tourette syndrome. An individual diagnosed with Tourette syndrome has been described carrying a constitutional chromosome translocation. Other family members carrying the translocation exhibit features seen in Tourette syndrome including motor tics, vocal tics, and OCD. Since the disruption of specific genes by a chromosomal rearrangement can elicit a particular phenotype, we have undertaken the physical mapping of the 7;18 translocation such that genes mapping at the site of the breakpoint can be identified and evaluated for a possible involvement in Tourette syndrome. Using somatic cell hybrids retaining either the der(7) or the der(18), a more precise localization of the breakpoints on chromosomes 7 and 18 have been determined. Furthermore, physical mapping has identified two YAC clones that span the translocation breakpoint on chromosome 18 as determined by FISH. These YAC clones will be useful for the eventual identification of genes that map to chromosomes 7 and 18 at the site of the translocation. 41 refs., 3 figs., 1 tab.
Nonbinary Tree-Based Phylogenetic Networks.

Science.gov (United States)

Jetten, Laura; van Iersel, Leo

2018-01-01

Rooted phylogenetic networks are used to describe evolutionary histories that contain non-treelike evolutionary events such as hybridization and horizontal gene transfer. In some cases, such histories can be described by a phylogenetic base-tree with additional linking arcs, which can, for example, represent gene transfer events. Such phylogenetic networks are called tree-based. Here, we consider two possible generalizations of this concept to nonbinary networks, which we call tree-based and strictly-tree-based nonbinary phylogenetic networks. We give simple graph-theoretic characterizations of tree-based and strictly-tree-based nonbinary phylogenetic networks. Moreover, we show for each of these two classes that it can be decided in polynomial time whether a given network is contained in the class. Our approach also provides a new view on tree-based binary phylogenetic networks. Finally, we discuss two examples of nonbinary phylogenetic networks in biology and show how our results can be applied to them.
Sequence exploration reveals information bias among molecular markers used in phylogenetic reconstruction for Colletotrichum species.

Science.gov (United States)

Rampersad, Sephra N; Hosein, Fazeeda N; Carrington, Christine Vf

2014-01-01

The Colletotrichum gloeosporioides species complex is among the most destructive fungal plant pathogens in the world, however, identification of isolates of quarantine importance to the intra-specific level is confounded by a number of factors that affect phylogenetic reconstruction. Information bias and quality parameters were investigated to determine whether nucleotide sequence alignments and phylogenetic trees accurately reflect the genetic diversity and phylogenetic relatedness of individuals. Sequence exploration of GAPDH, ACT, TUB2 and ITS markers indicated that the query sequences had different patterns of nucleotide substitution but were without evidence of base substitution saturation. Regions of high entropy were much more dispersed in the ACT and GAPDH marker alignments than for the ITS and TUB2 markers. A discernible bimodal gap in the genetic distance frequency histograms was produced for the ACT and GAPDH markers which indicated successful separation of intra- and inter-specific sequences in the data set. Overall, analyses indicated clear differences in the ability of these markers to phylogenetically separate individuals to the intra-specific level which coincided with information bias.
Phylogenetic analyses of Vitis (Vitaceae) based on complete chloroplast genome sequences: effects of taxon sampling and phylogenetic methods on resolving relationships among rosids.

Science.gov (United States)

Jansen, Robert K; Kaittanis, Charalambos; Saski, Christopher; Lee, Seung-Bum; Tomkins, Jeffrey; Alverson, Andrew J; Daniell, Henry

2006-04-09

The Vitaceae (grape) is an economically important family of angiosperms whose phylogenetic placement is currently unresolved. Recent phylogenetic analyses based on one to several genes have suggested several alternative placements of this family, including sister to Caryophyllales, asterids, Saxifragales, Dilleniaceae or to rest of rosids, though support for these different results has been weak. There has been a recent interest in using complete chloroplast genome sequences for resolving phylogenetic relationships among angiosperms. These studies have clarified relationships among several major lineages but they have also emphasized the importance of taxon sampling and the effects of different phylogenetic methods for obtaining accurate phylogenies. We sequenced the complete chloroplast genome of Vitis vinifera and used these data to assess relationships among 27 angiosperms, including nine taxa of rosids. The Vitis vinifera chloroplast genome is 160,928 bp in length, including a pair of inverted repeats of 26,358 bp that are separated by small and large single copy regions of 19,065 bp and 89,147 bp, respectively. The gene content and order of Vitis is identical to many other unrearranged angiosperm chloroplast genomes, including tobacco. Phylogenetic analyses using maximum parsimony and maximum likelihood were performed on DNA sequences of 61 protein-coding genes for two datasets with 28 or 29 taxa, including eight or nine taxa from four of the seven currently recognized major clades of rosids. Parsimony and likelihood phylogenies of both data sets provide strong support for the placement of Vitaceae as sister to the remaining rosids. However, the position of the Myrtales and support for the monophyly of the eurosid I clade differs between the two data sets and the two methods of analysis. In parsimony analyses, the inclusion of Gossypium is necessary to obtain trees that support the monophyly of the eurosid I clade. However, maximum likelihood analyses place
Phylogenetic analyses of Vitis (Vitaceae based on complete chloroplast genome sequences: effects of taxon sampling and phylogenetic methods on resolving relationships among rosids

Directory of Open Access Journals (Sweden)

Alverson Andrew J

2006-04-01

Full Text Available Abstract Background The Vitaceae (grape is an economically important family of angiosperms whose phylogenetic placement is currently unresolved. Recent phylogenetic analyses based on one to several genes have suggested several alternative placements of this family, including sister to Caryophyllales, asterids, Saxifragales, Dilleniaceae or to rest of rosids, though support for these different results has been weak. There has been a recent interest in using complete chloroplast genome sequences for resolving phylogenetic relationships among angiosperms. These studies have clarified relationships among several major lineages but they have also emphasized the importance of taxon sampling and the effects of different phylogenetic methods for obtaining accurate phylogenies. We sequenced the complete chloroplast genome of Vitis vinifera and used these data to assess relationships among 27 angiosperms, including nine taxa of rosids. Results The Vitis vinifera chloroplast genome is 160,928 bp in length, including a pair of inverted repeats of 26,358 bp that are separated by small and large single copy regions of 19,065 bp and 89,147 bp, respectively. The gene content and order of Vitis is identical to many other unrearranged angiosperm chloroplast genomes, including tobacco. Phylogenetic analyses using maximum parsimony and maximum likelihood were performed on DNA sequences of 61 protein-coding genes for two datasets with 28 or 29 taxa, including eight or nine taxa from four of the seven currently recognized major clades of rosids. Parsimony and likelihood phylogenies of both data sets provide strong support for the placement of Vitaceae as sister to the remaining rosids. However, the position of the Myrtales and support for the monophyly of the eurosid I clade differs between the two data sets and the two methods of analysis. In parsimony analyses, the inclusion of Gossypium is necessary to obtain trees that support the monophyly of the eurosid I clade
In vitro antibacterial activity of ceftobiprole against clinical isolates from French teaching hospitals: proposition of zone diameter breakpoints.

Science.gov (United States)

Lascols, C; Legrand, P; Mérens, A; Leclercq, R; Muller-Serieys, C; Drugeon, H B; Kitzis, M D; Reverdy, M E; Roussel-Delvallez, M; Moubareck, C; Brémont, S; Miara, A; Gjoklaj, M; Soussy, C-J

2011-03-01

The aims of this study were to determine the in vitro activity profile of ceftobiprole, a pyrrolidinone cephalosporin, against a large number of bacterial pathogens and to propose zone diameter breakpoints for clinical categorisation according to the European Committee on Antimicrobial Susceptibility Testing (EUCAST) minimum inhibitory concentration (MIC) breakpoints. MICs of ceftobiprole were determined by broth microdilution against 1548 clinical isolates collected in eight French hospitals. Disk diffusion testing was performed using 30 μg disks according to the method of the Comité de l'Antibiogramme de la Société Française de Microbiologie (CA-SFM). The in vitro activity of ceftobiprole, expressed by MIC(50/90) (MICs for 50% and 90% of the organisms, respectively) (mg/L), was as follows: meticillin-susceptible Staphylococcus aureus, 0.25/0.5; meticillin-resistant S. aureus (MRSA), 1/2; meticillin-susceptible coagulase-negative staphylococci (CoNS), 0.12/0.5; meticillin-resistant CoNS, 1/2; penicillin-susceptible Streptococcus pneumoniae, ≤ 0.008/0.03; penicillin-resistant S. pneumoniae, 0.12/0.5; viridans group streptococci, 0.03/0.12; β-haemolytic streptococci, ≤ 0.008/0.016; Enterococcus faecalis, 0.25/1; Enterococcus faecium, 64/128; Enterobacteriaceae, 0.06/32; Pseudomonas aeruginosa, 4/16; Acinetobacter baumannii, 0.5/64; Haemophilus influenzae, 0.03/0.12; and Moraxella catarrhalis, 0.25/0.5. According to the regression curve, zone diameter breakpoints could be 28, 26, 24 and 22 mm for MICs of 0.5, 1, 2 and 4 mg/L respectively. In conclusion, this study confirms the potent in vitro activity of ceftobiprole against many Gram-positive bacteria, including MRSA but not E. faecium, whilst maintaining a Gram-negative spectrum similar to the advanced-generation cephalosporins such as cefepime. Thus ceftobiprole appears to be well suited for the empirical treatment of a variety of healthcare-associated infections. Copyright © 2011 Elsevier B.V. and the
Polytomy identification in microbial phylogenetic reconstruction

Directory of Open Access Journals (Sweden)

Lin Guan

2011-12-01

Full Text Available Abstract Background A phylogenetic tree, showing ancestral relations among organisms, is commonly represented as a rooted tree with sets of bifurcating branches (dichotomies for simplicity, although polytomies (multifurcating branches may reflect more accurate evolutionary relationships. To represent the true evolutionary relationships, it is important to systematically identify the polytomies from a bifurcating tree and generate a taxonomy-compatible multifurcating tree. For this purpose we propose a novel approach, "PolyPhy", which would classify a set of bifurcating branches of a phylogenetic tree into a set of branches with dichotomies and polytomies by considering genome distances among genomes and tree topological properties. Results PolyPhy employs a machine learning technique, BLR (Bayesian logistic regression classifier, to identify possible bifurcating subtrees as polytomies from the trees resulted from ComPhy. Other than considering genome-scale distances between all pairs of species, PolyPhy also takes into account different properties of tree topology between dichotomy and polytomy, such as long-branch retraction and short-branch contraction, and quantifies these properties into comparable rates among different sub-branches. We extract three tree topological features, 'LR' (Leaf rate, 'IntraR' (Intra-subset branch rate and 'InterR' (Inter-subset branch rate, all of which are calculated from bifurcating tree branch sets for classification. We have achieved F-measure (balanced measure between precision and recall of 81% with about 0.9 area under the curve (AUC of ROC. Conclusions PolyPhy is a fast and robust method to identify polytomies from phylogenetic trees based on genome-wide inference of evolutionary relationships among genomes. The software package and test data can be downloaded from http://digbio.missouri.edu/ComPhy/phyloTreeBiNonBi-1.0.zip.
Transforming phylogenetic networks: Moving beyond tree space

OpenAIRE

Huber, Katharina T.; Moulton, Vincent; Wu, Taoyang

2016-01-01

Phylogenetic networks are a generalization of phylogenetic trees that are used to represent reticulate evolution. Unrooted phylogenetic networks form a special class of such networks, which naturally generalize unrooted phylogenetic trees. In this paper we define two operations on unrooted phylogenetic networks, one of which is a generalization of the well-known nearest-neighbor interchange (NNI) operation on phylogenetic trees. We show that any unrooted phylogenetic network can be transforme...
Analysis of HIV-1 intersubtype recombination breakpoints suggests region with high pairing probability may be a more fundamental factor than sequence similarity affecting HIV-1 recombination.

Science.gov (United States)

Jia, Lei; Li, Lin; Gui, Tao; Liu, Siyang; Li, Hanping; Han, Jingwan; Guo, Wei; Liu, Yongjian; Li, Jingyun

2016-09-21

With increasing data on HIV-1, a more relevant molecular model describing mechanism details of HIV-1 genetic recombination usually requires upgrades. Currently an incomplete structural understanding of the copy choice mechanism along with several other issues in the field that lack elucidation led us to perform an analysis of the correlation between breakpoint distributions and (1) the probability of base pairing, and (2) intersubtype genetic similarity to further explore structural mechanisms. Near full length sequences of URFs from Asia, Europe, and Africa (one sequence/patient), and representative sequences of worldwide CRFs were retrieved from the Los Alamos HIV database. Their recombination patterns were analyzed by jpHMM in detail. Then the relationships between breakpoint distributions and (1) the probability of base pairing, and (2) intersubtype genetic similarities were investigated. Pearson correlation test showed that all URF groups and the CRF group exhibit the same breakpoint distribution pattern. Additionally, the Wilcoxon two-sample test indicated a significant and inexplicable limitation of recombination in regions with high pairing probability. These regions have been found to be strongly conserved across distinct biological states (i.e., strong intersubtype similarity), and genetic similarity has been determined to be a very important factor promoting recombination. Thus, the results revealed an unexpected disagreement between intersubtype similarity and breakpoint distribution, which were further confirmed by genetic similarity analysis. Our analysis reveals a critical conflict between results from natural HIV-1 isolates and those from HIV-1-based assay vectors in which genetic similarity has been shown to be a very critical factor promoting recombination. These results indicate the region with high-pairing probabilities may be a more fundamental factor affecting HIV-1 recombination than sequence similarity in natural HIV-1 infections. Our
The Independent Evolution Method Is Not a Viable Phylogenetic Comparative Method.

Directory of Open Access Journals (Sweden)

Randi H Griffin

Full Text Available Phylogenetic comparative methods (PCMs use data on species traits and phylogenetic relationships to shed light on evolutionary questions. Recently, Smaers and Vinicius suggested a new PCM, Independent Evolution (IE, which purportedly employs a novel model of evolution based on Felsenstein's Adaptive Peak Model. The authors found that IE improves upon previous PCMs by producing more accurate estimates of ancestral states, as well as separate estimates of evolutionary rates for each branch of a phylogenetic tree. Here, we document substantial theoretical and computational issues with IE. When data are simulated under a simple Brownian motion model of evolution, IE produces severely biased estimates of ancestral states and changes along individual branches. We show that these branch-specific changes are essentially ancestor-descendant or "directional" contrasts, and draw parallels between IE and previous PCMs such as "minimum evolution". Additionally, while comparisons of branch-specific changes between variables have been interpreted as reflecting the relative strength of selection on those traits, we demonstrate through simulations that regressing IE estimated branch-specific changes against one another gives a biased estimate of the scaling relationship between these variables, and provides no advantages or insights beyond established PCMs such as phylogenetically independent contrasts. In light of our findings, we discuss the results of previous papers that employed IE. We conclude that Independent Evolution is not a viable PCM, and should not be used in comparative analyses.
Targeted next-generation sequencing at copy-number breakpoints for personalized analysis of rearranged ends in solid tumors.

Directory of Open Access Journals (Sweden)

Hyun-Kyoung Kim

Full Text Available BACKGROUND: The concept of the utilization of rearranged ends for development of personalized biomarkers has attracted much attention owing to its clinical applicability. Although targeted next-generation sequencing (NGS for recurrent rearrangements has been successful in hematologic malignancies, its application to solid tumors is problematic due to the paucity of recurrent translocations. However, copy-number breakpoints (CNBs, which are abundant in solid tumors, can be utilized for identification of rearranged ends. METHOD: As a proof of concept, we performed targeted next-generation sequencing at copy-number breakpoints (TNGS-CNB in nine colon cancer cases including seven primary cancers and two cell lines, COLO205 and SW620. For deduction of CNBs, we developed a novel competitive single-nucleotide polymorphism (cSNP microarray method entailing CNB-region refinement by competitor DNA. RESULT: Using TNGS-CNB, 19 specific rearrangements out of 91 CNBs (20.9% were identified, and two polymerase chain reaction (PCR-amplifiable rearrangements were obtained in six cases (66.7%. And significantly, TNGS-CNB, with its high positive identification rate (82.6% of PCR-amplifiable rearrangements at candidate sites (19/23, just from filtering of aligned sequences, requires little effort for validation. CONCLUSION: Our results indicate that TNGS-CNB, with its utility for identification of rearrangements in solid tumors, can be successfully applied in the clinical laboratory for cancer-relapse and therapy-response monitoring.
Targeted next-generation sequencing at copy-number breakpoints for personalized analysis of rearranged ends in solid tumors.

Science.gov (United States)

Kim, Hyun-Kyoung; Park, Won Cheol; Lee, Kwang Man; Hwang, Hai-Li; Park, Seong-Yeol; Sorn, Sungbin; Chandra, Vishal; Kim, Kwang Gi; Yoon, Woong-Bae; Bae, Joon Seol; Shin, Hyoung Doo; Shin, Jong-Yeon; Seoh, Ju-Young; Kim, Jong-Il; Hong, Kyeong-Man

2014-01-01

The concept of the utilization of rearranged ends for development of personalized biomarkers has attracted much attention owing to its clinical applicability. Although targeted next-generation sequencing (NGS) for recurrent rearrangements has been successful in hematologic malignancies, its application to solid tumors is problematic due to the paucity of recurrent translocations. However, copy-number breakpoints (CNBs), which are abundant in solid tumors, can be utilized for identification of rearranged ends. As a proof of concept, we performed targeted next-generation sequencing at copy-number breakpoints (TNGS-CNB) in nine colon cancer cases including seven primary cancers and two cell lines, COLO205 and SW620. For deduction of CNBs, we developed a novel competitive single-nucleotide polymorphism (cSNP) microarray method entailing CNB-region refinement by competitor DNA. Using TNGS-CNB, 19 specific rearrangements out of 91 CNBs (20.9%) were identified, and two polymerase chain reaction (PCR)-amplifiable rearrangements were obtained in six cases (66.7%). And significantly, TNGS-CNB, with its high positive identification rate (82.6%) of PCR-amplifiable rearrangements at candidate sites (19/23), just from filtering of aligned sequences, requires little effort for validation. Our results indicate that TNGS-CNB, with its utility for identification of rearrangements in solid tumors, can be successfully applied in the clinical laboratory for cancer-relapse and therapy-response monitoring.
On Nakhleh's metric for reduced phylogenetic networks

OpenAIRE

Cardona, Gabriel; Llabrés, Mercè; Rosselló, Francesc; Valiente Feruglio, Gabriel Alejandro

2009-01-01

We prove that Nakhleh’s metric for reduced phylogenetic networks is also a metric on the classes of tree-child phylogenetic networks, semibinary tree-sibling time consistent phylogenetic networks, and multilabeled phylogenetic trees. We also prove that it separates distinguishable phylogenetic networks. In this way, it becomes the strongest dissimilarity measure for phylogenetic networks available so far. Furthermore, we propose a generalization of that metric that separates arbitrary phyl...
Phylogenetic Analysis Using Protein Mass Spectrometry.

Science.gov (United States)

Ma, Shiyong; Downard, Kevin M; Wong, Jason W H

2017-01-01

Through advances in molecular biology, comparative analysis of DNA sequences is currently the cornerstone in the study of molecular evolution and phylogenetics. Nevertheless, protein mass spectrometry offers some unique opportunities to enable phylogenetic analyses in organisms where DNA may be difficult or costly to obtain. To date, the methods of phylogenetic analysis using protein mass spectrometry can be classified into three categories: (1) de novo protein sequencing followed by classical phylogenetic reconstruction, (2) direct phylogenetic reconstruction using proteolytic peptide mass maps, and (3) mapping of mass spectral data onto classical phylogenetic trees. In this chapter, we provide a brief description of the three methods and the protocol for each method along with relevant tools and algorithms.
Transforming phylogenetic networks: Moving beyond tree space.

Science.gov (United States)

Huber, Katharina T; Moulton, Vincent; Wu, Taoyang

2016-09-07

Phylogenetic networks are a generalization of phylogenetic trees that are used to represent reticulate evolution. Unrooted phylogenetic networks form a special class of such networks, which naturally generalize unrooted phylogenetic trees. In this paper we define two operations on unrooted phylogenetic networks, one of which is a generalization of the well-known nearest-neighbor interchange (NNI) operation on phylogenetic trees. We show that any unrooted phylogenetic network can be transformed into any other such network using only these operations. This generalizes the well-known fact that any phylogenetic tree can be transformed into any other such tree using only NNI operations. It also allows us to define a generalization of tree space and to define some new metrics on unrooted phylogenetic networks. To prove our main results, we employ some fascinating new connections between phylogenetic networks and cubic graphs that we have recently discovered. Our results should be useful in developing new strategies to search for optimal phylogenetic networks, a topic that has recently generated some interest in the literature, as well as for providing new ways to compare networks. Copyright © 2016 Elsevier Ltd. All rights reserved.
Attempt to validate breakpoint MIC values estimated from pharmacokinetic data obtained during oxolinic acid therapy of winter ulcer disease in Atlantic salmon ( Salmo salar )

DEFF Research Database (Denmark)

Coyne, R.; Bergh, Ø.; Samuelsen, O.

2004-01-01

Concentrations of oxolinic acid (OXA) were measured in the plasma, muscle, liver, and kidney of 48 Atlantic salmons (Salmo salar) 1 day after the end of an oral administration. OXA was administered over a period of 13 days to control an outbreak of winter ulcer disease in a commercial marine farm...... administration of OXA. A numerical description of the concentration of the antimicrobial agent achieved in therapy is necessary to determine the resistance or sensitivity of the bacteria involved in the infection. The degree of fish-to-fish variation in the concentrations of OXA, both within the healthy fish...... a useful parameter for describing the concentrations of agents achieved during therapy. The plasma data from this investigation were used to estimate clinically relevant breakpoint minimum inhibitory concentration (MIC) values. The validity of these breakpoint values was discussed with reference...

Functional and phylogenetic ecology in R

CERN Document Server

Swenson, Nathan G

2014-01-01

Functional and Phylogenetic Ecology in R is designed to teach readers to use R for phylogenetic and functional trait analyses. Over the past decade, a dizzying array of tools and methods were generated to incorporate phylogenetic and functional information into traditional ecological analyses. Increasingly these tools are implemented in R, thus greatly expanding their impact. Researchers getting started in R can use this volume as a step-by-step entryway into phylogenetic and functional analyses for ecology in R. More advanced users will be able to use this volume as a quick reference to understand particular analyses. The volume begins with an introduction to the R environment and handling relevant data in R. Chapters then cover phylogenetic and functional metrics of biodiversity; null modeling and randomizations for phylogenetic and functional trait analyses; integrating phylogenetic and functional trait information; and interfacing the R environment with a popular C-based program. This book presents a uni...
A format for phylogenetic placements.

Directory of Open Access Journals (Sweden)

Frederick A Matsen

Full Text Available We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g., short reads into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format, which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement.
Fast and accurate methods for phylogenomic analyses

Directory of Open Access Journals (Sweden)

Warnow Tandy

2011-10-01

Full Text Available Abstract Background Species phylogenies are not estimated directly, but rather through phylogenetic analyses of different gene datasets. However, true gene trees can differ from the true species tree (and hence from one another due to biological processes such as horizontal gene transfer, incomplete lineage sorting, and gene duplication and loss, so that no single gene tree is a reliable estimate of the species tree. Several methods have been developed to estimate species trees from estimated gene trees, differing according to the specific algorithmic technique used and the biological model used to explain differences between species and gene trees. Relatively little is known about the relative performance of these methods. Results We report on a study evaluating several different methods for estimating species trees from sequence datasets, simulating sequence evolution under a complex model including indels (insertions and deletions, substitutions, and incomplete lineage sorting. The most important finding of our study is that some fast and simple methods are nearly as accurate as the most accurate methods, which employ sophisticated statistical methods and are computationally quite intensive. We also observe that methods that explicitly consider errors in the estimated gene trees produce more accurate trees than methods that assume the estimated gene trees are correct. Conclusions Our study shows that highly accurate estimations of species trees are achievable, even when gene trees differ from each other and from the species tree, and that these estimations can be obtained using fairly simple and computationally tractable methods.
Phylogenetic inertia and Darwin's higher law.

Science.gov (United States)

Shanahan, Timothy

2011-03-01

The concept of 'phylogenetic inertia' is routinely deployed in evolutionary biology as an alternative to natural selection for explaining the persistence of characteristics that appear sub-optimal from an adaptationist perspective. However, in many of these contexts the precise meaning of 'phylogenetic inertia' and its relationship to selection are far from clear. After tracing the history of the concept of 'inertia' in evolutionary biology, I argue that treating phylogenetic inertia and natural selection as alternative explanations is mistaken because phylogenetic inertia is, from a Darwinian point of view, simply an expected effect of selection. Although Darwin did not discuss 'phylogenetic inertia,' he did assert the explanatory priority of selection over descent. An analysis of 'phylogenetic inertia' provides a perspective from which to assess Darwin's view. Copyright © 2010 Elsevier Ltd. All rights reserved.
The transposition distance for phylogenetic trees

OpenAIRE

Rossello, Francesc; Valiente, Gabriel

2006-01-01

The search for similarity and dissimilarity measures on phylogenetic trees has been motivated by the computation of consensus trees, the search by similarity in phylogenetic databases, and the assessment of clustering results in bioinformatics. The transposition distance for fully resolved phylogenetic trees is a recent addition to the extensive collection of available metrics for comparing phylogenetic trees. In this paper, we generalize the transposition distance from fully resolved to arbi...
Phylogenetic Trees From Sequences

Science.gov (United States)

Ryvkin, Paul; Wang, Li-San

In this chapter, we review important concepts and approaches for phylogeny reconstruction from sequence data.We first cover some basic definitions and properties of phylogenetics, and briefly explain how scientists model sequence evolution and measure sequence divergence. We then discuss three major approaches for phylogenetic reconstruction: distance-based phylogenetic reconstruction, maximum parsimony, and maximum likelihood. In the third part of the chapter, we review how multiple phylogenies are compared by consensus methods and how to assess confidence using bootstrapping. At the end of the chapter are two sections that list popular software packages and additional reading.
The phylogenetic likelihood library.

Science.gov (United States)

Flouri, T; Izquierdo-Carrasco, F; Darriba, D; Aberer, A J; Nguyen, L-T; Minh, B Q; Von Haeseler, A; Stamatakis, A

2015-03-01

We introduce the Phylogenetic Likelihood Library (PLL), a highly optimized application programming interface for developing likelihood-based phylogenetic inference and postanalysis software. The PLL implements appropriate data structures and functions that allow users to quickly implement common, error-prone, and labor-intensive tasks, such as likelihood calculations, model parameter as well as branch length optimization, and tree space exploration. The highly optimized and parallelized implementation of the phylogenetic likelihood function and a thorough documentation provide a framework for rapid development of scalable parallel phylogenetic software. By example of two likelihood-based phylogenetic codes we show that the PLL improves the sequential performance of current software by a factor of 2-10 while requiring only 1 month of programming time for integration. We show that, when numerical scaling for preventing floating point underflow is enabled, the double precision likelihood calculations in the PLL are up to 1.9 times faster than those in BEAGLE. On an empirical DNA dataset with 2000 taxa the AVX version of PLL is 4 times faster than BEAGLE (scaling enabled and required). The PLL is available at http://www.libpll.org under the GNU General Public License (GPL). © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Locating a tree in a phylogenetic network

NARCIS (Netherlands)

Iersel, van L.J.J.; Semple, C.; Steel, M.A.

2010-01-01

Phylogenetic trees and networks are leaf-labelled graphs that are used to describe evolutionary histories of species. The Tree Containment problem asks whether a given phylogenetic tree is embedded in a given phylogenetic network. Given a phylogenetic network and a cluster of species, the Cluster
A recurrent deletion syndrome at chromosome bands 2p11.2-2p12 flanked by segmental duplications at the breakpoints and including REEP1.

Science.gov (United States)

Stevens, Servi J C; Blom, Eveline W; Siegelaer, Ingrid T J; Smeets, Eric E J G L

2015-04-01

We identified an identical and recurrent 9.4-Mbp deletion at chromosome bands 2p11.2-2p12, which occurred de novo in two unrelated patients. It is flanked at the distal and proximal breakpoints by two homologous segmental duplications consisting of low copy repeat (LCR) blocks in direct orientation, which have >99% sequence identity. Despite the fact that the deletion was almost 10 Mbp in size, the patients showed a relatively mild clinical phenotype, that is, mild-to-moderate intellectual disability, a happy disposition, speech delay and delayed motor development. Their phenotype matches with that of previously described patients. The 2p11.2-2p12 deletion includes the REEP1 gene that is associated with spastic paraplegia and phenotypic features related to this are apparent in most 2p11.2-2p12 deletion patients, but not in all. Other hemizygous genes that may contribute to the clinical phenotype include LRRTM1 and CTNNA2. We propose a recurrent but rare 2p11.2-2p12 deletion syndrome based on (1) the identical, non-random localisation of the de novo deletion breakpoints in two unrelated patients and a patient from literature, (2) the patients' phenotypic similarity and their phenotypic overlap with other 2p deletions and (3) the presence of highly identical LCR blocks flanking both breakpoints, consistent with a non-allelic homologous recombination (NAHR)-mediated rearrangement.
Comparative evolutionary diversity and phylogenetic structure across multiple forest dynamics plots: a mega-phylogeny approach

Directory of Open Access Journals (Sweden)

David Lee Erickson

2014-11-01

Full Text Available Forest dynamics plots, which now span longitudes, latitudes, and habitat types across the globe, offer unparalleled insights into the ecological and evolutionary processes that determine how species are assembled into communities. Understanding phylogenetic relationships among species in a community has become an important component of assessing assembly processes. However, the application of evolutionary information to questions in community ecology has been limited in large part by the lack of accurate estimates of phylogenetic relationships among individual species found within communities, and is particularly limiting in comparisons between communities. Therefore, streamlining and maximizing the information content of these community phylogenies is a priority. To test the viability and advantage of a multi-community phylogeny, we constructed a multi-plot mega-phylogeny of 1,347 species of trees across 15 forest dynamics plots in the ForestGEO network using DNA barcode sequence data (rbcL, matK and psbA-trnH and compared community phylogenies for each individual plot with respect to support for topology and branch lengths, which affect evolutionary inference of community processes. The levels of taxonomic differentiation across the phylogeny were examined by quantifying the frequency of resolved nodes throughout. In addition, three phylogenetic distance metrics that are commonly used to infer assembly processes were estimated for each plot (Phylogenetic Distance [PD], Mean Phylogenetic Distance [MPD], and Mean Nearest Taxon Distance [MNTD]. Lastly, we examine the partitioning of phylogenetic diversity among community plots through quantification of inter-community MPD and MNTD. Overall, evolutionary relationships were highly resolved across the DNA barcode-based mega-phylogeny, and phylogenetic resolution for each community plot was improved when estimated within the context of the mega-phylogeny. Likewise, when compared with phylogenies for
Molecular Phylogenetics: Concepts for a Newcomer.

Science.gov (United States)

Ajawatanawong, Pravech

Molecular phylogenetics is the study of evolutionary relationships among organisms using molecular sequence data. The aim of this review is to introduce the important terminology and general concepts of tree reconstruction to biologists who lack a strong background in the field of molecular evolution. Some modern phylogenetic programs are easy to use because of their user-friendly interfaces, but understanding the phylogenetic algorithms and substitution models, which are based on advanced statistics, is still important for the analysis and interpretation without a guide. Briefly, there are five general steps in carrying out a phylogenetic analysis: (1) sequence data preparation, (2) sequence alignment, (3) choosing a phylogenetic reconstruction method, (4) identification of the best tree, and (5) evaluating the tree. Concepts in this review enable biologists to grasp the basic ideas behind phylogenetic analysis and also help provide a sound basis for discussions with expert phylogeneticists.
Effects of clinical breakpoint changes in CLSI guidelines 2010/2011 and EUCAST guidelines 2011 on antibiotic susceptibility test reporting of Gram-negative bacilli.

Science.gov (United States)

Hombach, Michael; Bloemberg, Guido V; Böttger, Erik C

2012-03-01

The aim of this study was to analyse the effects of clinical breakpoint changes in CLSI 2010 and 2011 guidelines and EUCAST 2011 guidelines on antibiotic susceptibility testing (AST) reports. In total, 3713 non-duplicate clinical isolates of Enterobacteriaceae, Pseudomonas aeruginosa, Stenotrophomonas maltophilia and Acinetobacter baumannii were analysed. Inhibition zone diameters were determined for β-lactams, carbapenems, fluoroquinolones, aminoglycosides and trimethoprim/sulfamethoxazole. CLSI 2009-11 and EUCAST 2011 clinical breakpoints were applied. Changes in resistance as defined per the guidelines affected individual species and drug classes differently. The cefepime resistance rate in Escherichia coli and Enterobacter cloacae increased from 2.1% and 1.3% to 8.2% and 6.9%, respectively, applying CLSI 2009-11 versus EUCAST 2011 guidelines. Ertapenem resistance rates in E. cloacae increased from 2.6% with CLSI 2009 to 7.2% for CLSI 2010 and 2011, and to 10.1% when applying EUCAST 2011. Cefepime and meropenem resistance rates in P. aeruginosa increased from 12.2% and 20.6% to 19.8% and 27.7%, respectively, comparing CLSI 2009-11 with EUCAST 2011. Tobramycin and gentamicin resistance rates in A. baumannii increased from 15.9% and 25.4% to 34.9% and 44.4% applying CLSI 2009-11 versus EUCAST 2011. Higher resistance rates reported due to breakpoint changes in CLSI and EUCAST guidelines will result in increasing numbers of Gram-negative bacilli reported as multidrug resistant. AST reports classifying amoxicillin/clavulanic acid, cefepime or carbapenem resistance will lead clinicians to use alternative agents. Upon implementation of the EUCAST guidelines, laboratories should be aware of the implications of modified drug susceptibility testing reports on antibiotic prescription policies.
Locating a tree in a phylogenetic network

OpenAIRE

van Iersel, Leo; Semple, Charles; Steel, Mike

2010-01-01

Phylogenetic trees and networks are leaf-labelled graphs that are used to describe evolutionary histories of species. The Tree Containment problem asks whether a given phylogenetic tree is embedded in a given phylogenetic network. Given a phylogenetic network and a cluster of species, the Cluster Containment problem asks whether the given cluster is a cluster of some phylogenetic tree embedded in the network. Both problems are known to be NP-complete in general. In this article, we consider t...
Nonbinary tree-based phylogenetic networks

OpenAIRE

Jetten, Laura; van Iersel, Leo

2016-01-01

Rooted phylogenetic networks are used to describe evolutionary histories that contain non-treelike evolutionary events such as hybridization and horizontal gene transfer. In some cases, such histories can be described by a phylogenetic base-tree with additional linking arcs, which can for example represent gene transfer events. Such phylogenetic networks are called tree-based. Here, we consider two possible generalizations of this concept to nonbinary networks, which we call tree-based and st...
Encoding phylogenetic trees in terms of weighted quartets.

Science.gov (United States)

Grünewald, Stefan; Huber, Katharina T; Moulton, Vincent; Semple, Charles

2008-04-01

One of the main problems in phylogenetics is to develop systematic methods for constructing evolutionary or phylogenetic trees. For a set of species X, an edge-weighted phylogenetic X-tree or phylogenetic tree is a (graph theoretical) tree with leaf set X and no degree 2 vertices, together with a map assigning a non-negative length to each edge of the tree. Within phylogenetics, several methods have been proposed for constructing such trees that work by trying to piece together quartet trees on X, i.e. phylogenetic trees each having four leaves in X. Hence, it is of interest to characterise when a collection of quartet trees corresponds to a (unique) phylogenetic tree. Recently, Dress and Erdös provided such a characterisation for binary phylogenetic trees, that is, phylogenetic trees all of whose internal vertices have degree 3. Here we provide a new characterisation for arbitrary phylogenetic trees.
Analysis of t(9;17)(q33.2;q25.3) chromosomal breakpoint regions and genetic association reveals novel candidate genes for bipolar disorder

DEFF Research Database (Denmark)

Rajkumar, Anto P; Christensen, Jane H; Mattheisen, Manuel

2015-01-01

,856) data. Genetic associations between these disorders and single nucleotide polymorphisms within these breakpoint regions were analysed by BioQ, FORGE, and RegulomeDB programmes. RESULTS: Four protein-coding genes [coding for (endonuclease V (ENDOV), neuronal pentraxin I (NPTX1), ring finger protein 213...
Translocation breakpoint at 7q31 associated with tics: further evidence for IMMP2L as a candidate gene for Tourette syndrome.

Science.gov (United States)

Patel, Chirag; Cooper-Charles, Lisa; McMullan, Dominic J; Walker, Judith M; Davison, Val; Morton, Jenny

2011-06-01

Gilles de la Tourette syndrome is a complex neuropsychiatric disorder with a strong genetic basis. We identified a male patient with Tourette syndrome-like tics and an apparently balanced de novo translocation [46,XY,t(2;7)(p24.2;q31)]. Further analysis using array comparative genomic hybridisation (CGH) revealed a cryptic deletion at 7q31.1-7q31.2. Breakpoints disrupting this region have been reported in one isolated and one familial case of Tourette syndrome. In our case, IMMP2L, a gene coding for a human homologue of the yeast inner mitochondrial membrane peptidase subunit 2, was disrupted by the breakpoint on 7q31.1, with deletion of exons 1-3 of the gene. The IMMP2L gene has previously been proposed as a candidate gene for Tourette syndrome, and our case provides further evidence of its possible role in the pathogenesis. The deleted region (7q31.1-7q31.2) of 7.2 Mb of genomic DNA also encompasses numerous genes, including FOXP2, associated with verbal dyspraxia, and the CFTR gene.
Global patterns of amphibian phylogenetic diversity

DEFF Research Database (Denmark)

Fritz, Susanne; Rahbek, Carsten

2012-01-01

Aim Phylogenetic diversity can provide insight into how evolutionary processes may have shaped contemporary patterns of species richness. Here, we aim to test for the influence of phylogenetic history on global patterns of amphibian species richness, and to identify areas where macroevolutionary...... processes such as diversification and dispersal have left strong signatures on contemporary species richness. Location Global; equal-area grid cells of approximately 10,000 km2. Methods We generated an amphibian global supertree (6111 species) and repeated analyses with the largest available molecular...... phylogeny (2792 species). We combined each tree with global species distributions to map four indices of phylogenetic diversity. To investigate congruence between global spatial patterns of amphibian species richness and phylogenetic diversity, we selected Faith’s phylogenetic diversity (PD) index...
Tree-Based Unrooted Phylogenetic Networks.

Science.gov (United States)

Francis, A; Huber, K T; Moulton, V

2018-02-01

Phylogenetic networks are a generalization of phylogenetic trees that are used to represent non-tree-like evolutionary histories that arise in organisms such as plants and bacteria, or uncertainty in evolutionary histories. An unrooted phylogenetic network on a non-empty, finite set X of taxa, or network, is a connected, simple graph in which every vertex has degree 1 or 3 and whose leaf set is X. It is called a phylogenetic tree if the underlying graph is a tree. In this paper we consider properties of tree-based networks, that is, networks that can be constructed by adding edges into a phylogenetic tree. We show that although they have some properties in common with their rooted analogues which have recently drawn much attention in the literature, they have some striking differences in terms of both their structural and computational properties. We expect that our results could eventually have applications to, for example, detecting horizontal gene transfer or hybridization which are important factors in the evolution of many organisms.
Nodal distances for rooted phylogenetic trees.

Science.gov (United States)

Cardona, Gabriel; Llabrés, Mercè; Rosselló, Francesc; Valiente, Gabriel

2010-08-01

Dissimilarity measures for (possibly weighted) phylogenetic trees based on the comparison of their vectors of path lengths between pairs of taxa, have been present in the systematics literature since the early seventies. For rooted phylogenetic trees, however, these vectors can only separate non-weighted binary trees, and therefore these dissimilarity measures are metrics only on this class of rooted phylogenetic trees. In this paper we overcome this problem, by splitting in a suitable way each path length between two taxa into two lengths. We prove that the resulting splitted path lengths matrices single out arbitrary rooted phylogenetic trees with nested taxa and arcs weighted in the set of positive real numbers. This allows the definition of metrics on this general class of rooted phylogenetic trees by comparing these matrices through metrics in spaces M(n)(R) of real-valued n x n matrices. We conclude this paper by establishing some basic facts about the metrics for non-weighted phylogenetic trees defined in this way using L(p) metrics on M(n)(R), with p [epsilon] R(>0).

Evaluation of phylogenetic reconstruction methods using bacterial whole genomes: a simulation based study [version 1; referees: 1 approved, 2 approved with reservations

Directory of Open Access Journals (Sweden)

John A. Lees

2018-03-01

Full Text Available Background: Phylogenetic reconstruction is a necessary first step in many analyses which use whole genome sequence data from bacterial populations. There are many available methods to infer phylogenies, and these have various advantages and disadvantages, but few unbiased comparisons of the range of approaches have been made. Methods: We simulated data from a defined “true tree” using a realistic evolutionary model. We built phylogenies from this data using a range of methods, and compared reconstructed trees to the true tree using two measures, noting the computational time needed for different phylogenetic reconstructions. We also used real data from Streptococcus pneumoniae alignments to compare individual core gene trees to a core genome tree. Results: We found that, as expected, maximum likelihood trees from good quality alignments were the most accurate, but also the most computationally intensive. Using less accurate phylogenetic reconstruction methods, we were able to obtain results of comparable accuracy; we found that approximate results can rapidly be obtained using genetic distance based methods. In real data we found that highly conserved core genes, such as those involved in translation, gave an inaccurate tree topology, whereas genes involved in recombination events gave inaccurate branch lengths. We also show a tree-of-trees, relating the results of different phylogenetic reconstructions to each other. Conclusions: We recommend three approaches, depending on requirements for accuracy and computational time. Quicker approaches that do not perform full maximum likelihood optimisation may be useful for many analyses requiring a phylogeny, as generating a high quality input alignment is likely to be the major limiting factor of accurate tree topology. We have publicly released our simulated data and code to enable further comparisons.
Species trees for the tree swallows (Genus Tachycineta): an alternative phylogenetic hypothesis to the mitochondrial gene tree.

Science.gov (United States)

Dor, Roi; Carling, Matthew D; Lovette, Irby J; Sheldon, Frederick H; Winkler, David W

2012-10-01

The New World swallow genus Tachycineta comprises nine species that collectively have a wide geographic distribution and remarkable variation both within- and among-species in ecologically important traits. Existing phylogenetic hypotheses for Tachycineta are based on mitochondrial DNA sequences, thus they provide estimates of a single gene tree. In this study we sequenced multiple individuals from each species at 16 nuclear intron loci. We used gene concatenated approaches (Bayesian and maximum likelihood) as well as coalescent-based species tree inference to reconstruct phylogenetic relationships of the genus. We examined the concordance and conflict between the nuclear and mitochondrial trees and between concatenated and coalescent-based inferences. Our results provide an alternative phylogenetic hypothesis to the existing mitochondrial DNA estimate of phylogeny. This new hypothesis provides a more accurate framework in which to explore trait evolution and examine the evolution of the mitochondrial genome in this group. Copyright © 2012 Elsevier Inc. All rights reserved.
Characterization of a branch of the phylogenetic tree

International Nuclear Information System (INIS)

Samuel, Stuart A.; Weng, Gezhi

2003-04-01

We use a combination of analytic models and computer simulations to gain insight into the dynamics of evolution. Our results suggest that certain interesting phenomena should eventually emerge from the fossil record. For example, there should be a 'tortoise and hare effect': Those genera with the smallest species death rate are likely to survive much longer than genera with large species birth and death rates. A complete characterization of the behavior of a branch of the phylogenetic tree corresponding to a genus and accurate mathematical representations of the various stages are obtained. We apply our results to address certain controversial issues that have arisen in paleontology such as the importance of punctuated equilibrium and whether unique Cambrian phyla have survived to the present
On Tree-Based Phylogenetic Networks.

Science.gov (United States)

Zhang, Louxin

2016-07-01

A large class of phylogenetic networks can be obtained from trees by the addition of horizontal edges between the tree edges. These networks are called tree-based networks. We present a simple necessary and sufficient condition for tree-based networks and prove that a universal tree-based network exists for any number of taxa that contains as its base every phylogenetic tree on the same set of taxa. This answers two problems posted by Francis and Steel recently. A byproduct is a computer program for generating random binary phylogenetic networks under the uniform distribution model.
Functional & phylogenetic diversity of copepod communities

Science.gov (United States)

Benedetti, F.; Ayata, S. D.; Blanco-Bercial, L.; Cornils, A.; Guilhaumon, F.

2016-02-01

The diversity of natural communities is classically estimated through species identification (taxonomic diversity) but can also be estimated from the ecological functions performed by the species (functional diversity), or from the phylogenetic relationships among them (phylogenetic diversity). Estimating functional diversity requires the definition of specific functional traits, i.e., phenotypic characteristics that impact fitness and are relevant to ecosystem functioning. Estimating phylogenetic diversity requires the description of phylogenetic relationships, for instance by using molecular tools. In the present study, we focused on the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. First, we implemented a specific trait database for the most commonly-sampled and abundant copepod species of the Mediterranean Sea. Our database includes 191 species, described by seven traits encompassing diverse ecological functions: minimal and maximal body length, trophic group, feeding type, spawning strategy, diel vertical migration and vertical habitat. Clustering analysis in the functional trait space revealed that Mediterranean copepods can be gathered into groups that have different ecological roles. Second, we reconstructed a phylogenetic tree using the available sequences of 18S rRNA. Our tree included 154 of the analyzed Mediterranean copepod species. We used these two datasets to describe the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. The replacement component (turn-over) and the species richness difference component (nestedness) of the beta diversity indices were identified. Finally, by comparing various and complementary aspects of plankton diversity (taxonomic, functional, and phylogenetic diversity) we were able to gain a better understanding of the relationships among the zooplankton community, biodiversity, ecosystem function, and environmental forcing.
treespace: Statistical exploration of landscapes of phylogenetic trees.

Science.gov (United States)

Jombart, Thibaut; Kendall, Michelle; Almagro-Garcia, Jacob; Colijn, Caroline

2017-11-01

The increasing availability of large genomic data sets as well as the advent of Bayesian phylogenetics facilitates the investigation of phylogenetic incongruence, which can result in the impossibility of representing phylogenetic relationships using a single tree. While sometimes considered as a nuisance, phylogenetic incongruence can also reflect meaningful biological processes as well as relevant statistical uncertainty, both of which can yield valuable insights in evolutionary studies. We introduce a new tool for investigating phylogenetic incongruence through the exploration of phylogenetic tree landscapes. Our approach, implemented in the R package treespace, combines tree metrics and multivariate analysis to provide low-dimensional representations of the topological variability in a set of trees, which can be used for identifying clusters of similar trees and group-specific consensus phylogenies. treespace also provides a user-friendly web interface for interactive data analysis and is integrated alongside existing standards for phylogenetics. It fills a gap in the current phylogenetics toolbox in R and will facilitate the investigation of phylogenetic results. © 2017 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
Phylogenetic diversity and relationships among species of genus ...

African Journals Online (AJOL)

Fifty six Nicotiana species were used to construct phylogenetic trees and to asses the genetic relationships between them. Genetic distances estimated from RAPD analysis was used to construct phylogenetic trees using Phylogenetic Inference Package (PHYLIP). Since phylogenetic relationships estimated for closely ...
Nonbinary Tree-Based Phylogenetic Networks

NARCIS (Netherlands)

Jetten, L.; van Iersel, L.J.J.

2018-01-01

Rooted phylogenetic networks are used to describe evolutionary histories that contain non-treelike evolutionary events such as hybridization and horizontal gene transfer. In some cases, such histories can be described by a phylogenetic base-tree with additional linking arcs, which can for example
Phylogenetic inference with weighted codon evolutionary distances.

Science.gov (United States)

Criscuolo, Alexis; Michel, Christian J

2009-04-01

We develop a new approach to estimate a matrix of pairwise evolutionary distances from a codon-based alignment based on a codon evolutionary model. The method first computes a standard distance matrix for each of the three codon positions. Then these three distance matrices are weighted according to an estimate of the global evolutionary rate of each codon position and averaged into a unique distance matrix. Using a large set of both real and simulated codon-based alignments of nucleotide sequences, we show that this approach leads to distance matrices that have a significantly better treelikeness compared to those obtained by standard nucleotide evolutionary distances. We also propose an alternative weighting to eliminate the part of the noise often associated with some codon positions, particularly the third position, which is known to induce a fast evolutionary rate. Simulation results show that fast distance-based tree reconstruction algorithms on distance matrices based on this codon position weighting can lead to phylogenetic trees that are at least as accurate as, if not better, than those inferred by maximum likelihood. Finally, a well-known multigene dataset composed of eight yeast species and 106 codon-based alignments is reanalyzed and shows that our codon evolutionary distances allow building a phylogenetic tree which is similar to those obtained by non-distance-based methods (e.g., maximum parsimony and maximum likelihood) and also significantly improved compared to standard nucleotide evolutionary distance estimates.
Phylogenetic molecular function annotation

International Nuclear Information System (INIS)

Engelhardt, Barbara E; Jordan, Michael I; Repo, Susanna T; Brenner, Steven E

2009-01-01

It is now easier to discover thousands of protein sequences in a new microbial genome than it is to biochemically characterize the specific activity of a single protein of unknown function. The molecular functions of protein sequences have typically been predicted using homology-based computational methods, which rely on the principle that homologous proteins share a similar function. However, some protein families include groups of proteins with different molecular functions. A phylogenetic approach for predicting molecular function (sometimes called 'phylogenomics') is an effective means to predict protein molecular function. These methods incorporate functional evidence from all members of a family that have functional characterizations using the evolutionary history of the protein family to make robust predictions for the uncharacterized proteins. However, they are often difficult to apply on a genome-wide scale because of the time-consuming step of reconstructing the phylogenies of each protein to be annotated. Our automated approach for function annotation using phylogeny, the SIFTER (Statistical Inference of Function Through Evolutionary Relationships) methodology, uses a statistical graphical model to compute the probabilities of molecular functions for unannotated proteins. Our benchmark tests showed that SIFTER provides accurate functional predictions on various protein families, outperforming other available methods.
Ultrafast Approximation for Phylogenetic Bootstrap

NARCIS (Netherlands)

Bui Quang Minh, [No Value; Nguyen, Thi; von Haeseler, Arndt

Nonparametric bootstrap has been a widely used tool in phylogenetic analysis to assess the clade support of phylogenetic trees. However, with the rapidly growing amount of data, this task remains a computational bottleneck. Recently, approximation methods such as the RAxML rapid bootstrap (RBS) and
Phylogenetic Framework and Molecular Signatures for the Main Clades of the Phylum Actinobacteria

Science.gov (United States)

Gao, Beile

2012-01-01

Summary: The phylum Actinobacteria harbors many important human pathogens and also provides one of the richest sources of natural products, including numerous antibiotics and other compounds of biotechnological interest. Thus, a reliable phylogeny of this large phylum and the means to accurately identify its different constituent groups are of much interest. Detailed phylogenetic and comparative analyses of >150 actinobacterial genomes reported here form the basis for achieving these objectives. In phylogenetic trees based upon 35 conserved proteins, most of the main groups of Actinobacteria as well as a number of their superageneric clades are resolved. We also describe large numbers of molecular markers consisting of conserved signature indels in protein sequences and whole proteins that are specific for either all Actinobacteria or their different clades (viz., orders, families, genera, and subgenera) at various taxonomic levels. These signatures independently support the existence of different phylogenetic clades, and based upon them, it is now possible to delimit the phylum Actinobacteria (excluding Coriobacteriia) and most of its major groups in clear molecular terms. The species distribution patterns of these markers also provide important information regarding the interrelationships among different main orders of Actinobacteria. The identified molecular markers, in addition to enabling the development of a stable and reliable phylogenetic framework for this phylum, also provide novel and powerful means for the identification of different groups of Actinobacteria in diverse environments. Genetic and biochemical studies on these Actinobacteria-specific markers should lead to the discovery of novel biochemical and/or other properties that are unique to different groups of Actinobacteria. PMID:22390973
Different relationships between temporal phylogenetic turnover and phylogenetic similarity and in two forests were detected by a new null model.

Science.gov (United States)

Huang, Jian-Xiong; Zhang, Jian; Shen, Yong; Lian, Ju-yu; Cao, Hong-lin; Ye, Wan-hui; Wu, Lin-fang; Bin, Yue

2014-01-01

Ecologists have been monitoring community dynamics with the purpose of understanding the rates and causes of community change. However, there is a lack of monitoring of community dynamics from the perspective of phylogeny. We attempted to understand temporal phylogenetic turnover in a 50 ha tropical forest (Barro Colorado Island, BCI) and a 20 ha subtropical forest (Dinghushan in southern China, DHS). To obtain temporal phylogenetic turnover under random conditions, two null models were used. The first shuffled names of species that are widely used in community phylogenetic analyses. The second simulated demographic processes with careful consideration on the variation in dispersal ability among species and the variations in mortality both among species and among size classes. With the two models, we tested the relationships between temporal phylogenetic turnover and phylogenetic similarity at different spatial scales in the two forests. Results were more consistent with previous findings using the second null model suggesting that the second null model is more appropriate for our purposes. With the second null model, a significantly positive relationship was detected between phylogenetic turnover and phylogenetic similarity in BCI at a 10 m×10 m scale, potentially indicating phylogenetic density dependence. This relationship in DHS was significantly negative at three of five spatial scales. This could indicate abiotic filtering processes for community assembly. Using variation partitioning, we found phylogenetic similarity contributed to variation in temporal phylogenetic turnover in the DHS plot but not in BCI plot. The mechanisms for community assembly in BCI and DHS vary from phylogenetic perspective. Only the second null model detected this difference indicating the importance of choosing a proper null model.
Rooting phylogenetic trees under the coalescent model using site pattern probabilities.

Science.gov (United States)

Tian, Yuan; Kubatko, Laura

2017-12-19

Phylogenetic tree inference is a fundamental tool to estimate ancestor-descendant relationships among different species. In phylogenetic studies, identification of the root - the most recent common ancestor of all sampled organisms - is essential for complete understanding of the evolutionary relationships. Rooted trees benefit most downstream application of phylogenies such as species classification or study of adaptation. Often, trees can be rooted by using outgroups, which are species that are known to be more distantly related to the sampled organisms than any other species in the phylogeny. However, outgroups are not always available in evolutionary research. In this study, we develop a new method for rooting species tree under the coalescent model, by developing a series of hypothesis tests for rooting quartet phylogenies using site pattern probabilities. The power of this method is examined by simulation studies and by application to an empirical North American rattlesnake data set. The method shows high accuracy across the simulation conditions considered, and performs well for the rattlesnake data. Thus, it provides a computationally efficient way to accurately root species-level phylogenies that incorporates the coalescent process. The method is robust to variation in substitution model, but is sensitive to the assumption of a molecular clock. Our study establishes a computationally practical method for rooting species trees that is more efficient than traditional methods. The method will benefit numerous evolutionary studies that require rooting a phylogenetic tree without having to specify outgroups.
Undergraduate Students’ Difficulties in Reading and Constructing Phylogenetic Tree

Science.gov (United States)

Sa'adah, S.; Tapilouw, F. S.; Hidayat, T.

2017-02-01

Representation is a very important communication tool to communicate scientific concepts. Biologists produce phylogenetic representation to express their understanding of evolutionary relationships. The phylogenetic tree is visual representation depict a hypothesis about the evolutionary relationship and widely used in the biological sciences. Phylogenetic tree currently growing for many disciplines in biology. Consequently, learning about phylogenetic tree become an important part of biological education and an interesting area for biology education research. However, research showed many students often struggle with interpreting the information that phylogenetic trees depict. The purpose of this study was to investigate undergraduate students’ difficulties in reading and constructing a phylogenetic tree. The method of this study is a descriptive method. In this study, we used questionnaires, interviews, multiple choice and open-ended questions, reflective journals and observations. The findings showed students experiencing difficulties, especially in constructing a phylogenetic tree. The students’ responds indicated that main reasons for difficulties in constructing a phylogenetic tree are difficult to placing taxa in a phylogenetic tree based on the data provided so that the phylogenetic tree constructed does not describe the actual evolutionary relationship (incorrect relatedness). Students also have difficulties in determining the sister group, character synapomorphy, autapomorphy from data provided (character table) and comparing among phylogenetic tree. According to them building the phylogenetic tree is more difficult than reading the phylogenetic tree. Finding this studies provide information to undergraduate instructor and students to overcome learning difficulties of reading and constructing phylogenetic tree.
Phylogenetic structure in tropical hummingbird communities

DEFF Research Database (Denmark)

Graham, Catherine H; Parra, Juan L; Rahbek, Carsten

2009-01-01

How biotic interactions, current and historical environment, and biogeographic barriers determine community structure is a fundamental question in ecology and evolution, especially in diverse tropical regions. To evaluate patterns of local and regional diversity, we quantified the phylogenetic...... composition of 189 hummingbird communities in Ecuador. We assessed how species and phylogenetic composition changed along environmental gradients and across biogeographic barriers. We show that humid, low-elevation communities are phylogenetically overdispersed (coexistence of distant relatives), a pattern...... that is consistent with the idea that competition influences the local composition of hummingbirds. At higher elevations communities are phylogenetically clustered (coexistence of close relatives), consistent with the expectation of environmental filtering, which may result from the challenge of sustaining...
Constructing phylogenetic trees using interacting pathways.

Science.gov (United States)

Wan, Peng; Che, Dongsheng

2013-01-01

Phylogenetic trees are used to represent evolutionary relationships among biological species or organisms. The construction of phylogenetic trees is based on the similarities or differences of their physical or genetic features. Traditional approaches of constructing phylogenetic trees mainly focus on physical features. The recent advancement of high-throughput technologies has led to accumulation of huge amounts of biological data, which in turn changed the way of biological studies in various aspects. In this paper, we report our approach of building phylogenetic trees using the information of interacting pathways. We have applied hierarchical clustering on two domains of organisms-eukaryotes and prokaryotes. Our preliminary results have shown the effectiveness of using the interacting pathways in revealing evolutionary relationships.
Inferring Phylogenetic Networks Using PhyloNet.

Science.gov (United States)

Wen, Dingqiao; Yu, Yun; Zhu, Jiafan; Nakhleh, Luay

2018-07-01

PhyloNet was released in 2008 as a software package for representing and analyzing phylogenetic networks. At the time of its release, the main functionalities in PhyloNet consisted of measures for comparing network topologies and a single heuristic for reconciling gene trees with a species tree. Since then, PhyloNet has grown significantly. The software package now includes a wide array of methods for inferring phylogenetic networks from data sets of unlinked loci while accounting for both reticulation (e.g., hybridization) and incomplete lineage sorting. In particular, PhyloNet now allows for maximum parsimony, maximum likelihood, and Bayesian inference of phylogenetic networks from gene tree estimates. Furthermore, Bayesian inference directly from sequence data (sequence alignments or biallelic markers) is implemented. Maximum parsimony is based on an extension of the "minimizing deep coalescences" criterion to phylogenetic networks, whereas maximum likelihood and Bayesian inference are based on the multispecies network coalescent. All methods allow for multiple individuals per species. As computing the likelihood of a phylogenetic network is computationally hard, PhyloNet allows for evaluation and inference of networks using a pseudolikelihood measure. PhyloNet summarizes the results of the various analyzes and generates phylogenetic networks in the extended Newick format that is readily viewable by existing visualization software.
Phylogenetic classification of bony fishes.

Science.gov (United States)

Betancur-R, Ricardo; Wiley, Edward O; Arratia, Gloria; Acero, Arturo; Bailly, Nicolas; Miya, Masaki; Lecointre, Guillaume; Ortí, Guillermo

2017-07-06

Fish classifications, as those of most other taxonomic groups, are being transformed drastically as new molecular phylogenies provide support for natural groups that were unanticipated by previous studies. A brief review of the main criteria used by ichthyologists to define their classifications during the last 50 years, however, reveals slow progress towards using an explicit phylogenetic framework. Instead, the trend has been to rely, in varying degrees, on deep-rooted anatomical concepts and authority, often mixing taxa with explicit phylogenetic support with arbitrary groupings. Two leading sources in ichthyology frequently used for fish classifications (JS Nelson's volumes of Fishes of the World and W. Eschmeyer's Catalog of Fishes) fail to adopt a global phylogenetic framework despite much recent progress made towards the resolution of the fish Tree of Life. The first explicit phylogenetic classification of bony fishes was published in 2013, based on a comprehensive molecular phylogeny ( www.deepfin.org ). We here update the first version of that classification by incorporating the most recent phylogenetic results. The updated classification presented here is based on phylogenies inferred using molecular and genomic data for nearly 2000 fishes. A total of 72 orders (and 79 suborders) are recognized in this version, compared with 66 orders in version 1. The phylogeny resolves placement of 410 families, or ~80% of the total of 514 families of bony fishes currently recognized. The ordinal status of 30 percomorph families included in this study, however, remains uncertain (incertae sedis in the series Carangaria, Ovalentaria, or Eupercaria). Comments to support taxonomic decisions and comparisons with conflicting taxonomic groups proposed by others are presented. We also highlight cases were morphological support exist for the groups being classified. This version of the phylogenetic classification of bony fishes is substantially improved, providing resolution
Use of M-FISH analysis of α-particle-induced chromosome aberrations for the assessment of chromosomal breakpoint distribution and complex aberration formation

International Nuclear Information System (INIS)

Anderson, R.M.; Sumption, N.D.; Papworth, D.G.; Goodhead, D.T.

2003-01-01

Double strand breaks (dsb) of varying complexity are an important class of damage induced after exposure to ionising radiation and are considered to be the critical lesion for the formation of radiation-induced chromosome aberrations. Assuming the basic principles of the 'Breakage and Reunion' theory, dsb represent 'breakage' and aberrations are produced from the illegitimate repair (reunion) of the resulting dsb free-'ends'. Numerous questions relate to this process, in particular, (1) do chromosomal breakpoint 'hot-spots' that represent sensitive sites for breakage and/or regions of preferential repair/mis-repair, exist? (2) Considering that individual chromosomes and chromosome regions occupy discrete territories in the interphase nucleus, could rearrangements between specific chromosomes reflect domain organisation at the time of damage? (3) Assuming the topological constraints imposed on chromatin are not dramatically influenced by the presence of dsb, then how do multiple 'ends' from different chromosomes proximally associate for mis-repair as complex chromosome aberrations? To address these questions, we have analysed the chromosome aberrations induced in peripheral blood lymphocytes after exposure to 0.5 Gy α -particles (mean of 1 α -particle/cell) using the technique of M-FISH. This technique 'paints' all the human chromosomes (excluding homologues) uniquely, allowing chromosomal mis-repair to be visualised as differential colour-junctions and in addition, enhanced DAPI banding enables gross breakpoint assignation of these colour junctions. To test for non-randomness, we are comparing the frequency of occurrence of breakpoints obtained up to now with the F98 glioma model our knowledbased on chromosome length. Similarly, the involvement of each chromosome relative to other chromosomes within individual rearrangements can be determined by assuming the volume of chromosome domains is also proportional to their length. The current data to be presented will

The space of ultrametric phylogenetic trees.

Science.gov (United States)

Gavryushkin, Alex; Drummond, Alexei J

2016-08-21

The reliability of a phylogenetic inference method from genomic sequence data is ensured by its statistical consistency. Bayesian inference methods produce a sample of phylogenetic trees from the posterior distribution given sequence data. Hence the question of statistical consistency of such methods is equivalent to the consistency of the summary of the sample. More generally, statistical consistency is ensured by the tree space used to analyse the sample. In this paper, we consider two standard parameterisations of phylogenetic time-trees used in evolutionary models: inter-coalescent interval lengths and absolute times of divergence events. For each of these parameterisations we introduce a natural metric space on ultrametric phylogenetic trees. We compare the introduced spaces with existing models of tree space and formulate several formal requirements that a metric space on phylogenetic trees must possess in order to be a satisfactory space for statistical analysis, and justify them. We show that only a few known constructions of the space of phylogenetic trees satisfy these requirements. However, our results suggest that these basic requirements are not enough to distinguish between the two metric spaces we introduce and that the choice between metric spaces requires additional properties to be considered. Particularly, that the summary tree minimising the square distance to the trees from the sample might be different for different parameterisations. This suggests that further fundamental insight is needed into the problem of statistical consistency of phylogenetic inference methods. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Detecting taxonomic and phylogenetic signals in equid cheek teeth: towards new palaeontological and archaeological proxies

Science.gov (United States)

Mohaseb, A.; Peigné, S.; Debue, K.; Orlando, L.; Mashkour, M.

2017-01-01

The Plio–Pleistocene evolution of Equus and the subsequent domestication of horses and donkeys remains poorly understood, due to the lack of phenotypic markers capable of tracing this evolutionary process in the palaeontological/archaeological record. Using images from 345 specimens, encompassing 15 extant taxa of equids, we quantified the occlusal enamel folding pattern in four mandibular cheek teeth with a single geometric morphometric protocol. We initially investigated the protocol accuracy by assigning each tooth to its correct anatomical position and taxonomic group. We then contrasted the phylogenetic signal present in each tooth shape with an exome-wide phylogeny from 10 extant equine species. We estimated the strength of the phylogenetic signal using a Brownian motion model of evolution with multivariate K statistic, and mapped the dental shape along the molecular phylogeny using an approach based on squared-change parsimony. We found clear evidence for the relevance of dental phenotypes to accurately discriminate all modern members of the genus Equus and capture their phylogenetic relationships. These results are valuable for both palaeontologists and zooarchaeologists exploring the spatial and temporal dynamics of the evolutionary history of the horse family, up to the latest domestication trajectories of horses and donkeys. PMID:28484618
phangorn: phylogenetic analysis in R.

Science.gov (United States)

Schliep, Klaus Peter

2011-02-15

phangorn is a package for phylogenetic reconstruction and analysis in the R language. Previously it was only possible to estimate phylogenetic trees with distance methods in R. phangorn, now offers the possibility of reconstructing phylogenies with distance based methods, maximum parsimony or maximum likelihood (ML) and performing Hadamard conjugation. Extending the general ML framework, this package provides the possibility of estimating mixture and partition models. Furthermore, phangorn offers several functions for comparing trees, phylogenetic models or splits, simulating character data and performing congruence analyses. phangorn can be obtained through the CRAN homepage http://cran.r-project.org/web/packages/phangorn/index.html. phangorn is licensed under GPL 2.
Undergraduate Students’ Initial Ability in Understanding Phylogenetic Tree

Science.gov (United States)

Sa'adah, S.; Hidayat, T.; Sudargo, Fransisca

2017-04-01

The Phylogenetic tree is a visual representation depicts a hypothesis about the evolutionary relationship among taxa. Evolutionary experts use this representation to evaluate the evidence for evolution. The phylogenetic tree is currently growing for many disciplines in biology. Consequently, learning about the phylogenetic tree has become an important part of biological education and an interesting area of biology education research. Skill to understanding and reasoning of the phylogenetic tree, (called tree thinking) is an important skill for biology students. However, research showed many students have difficulty in interpreting, constructing, and comparing among the phylogenetic tree, as well as experiencing a misconception in the understanding of the phylogenetic tree. Students are often not taught how to reason about evolutionary relationship depicted in the diagram. Students are also not provided with information about the underlying theory and process of phylogenetic. This study aims to investigate the initial ability of undergraduate students in understanding and reasoning of the phylogenetic tree. The research method is the descriptive method. Students are given multiple choice questions and an essay that representative by tree thinking elements. Each correct answer made percentages. Each student is also given questionnaires. The results showed that the undergraduate students’ initial ability in understanding and reasoning phylogenetic tree is low. Many students are not able to answer questions about the phylogenetic tree. Only 19 % undergraduate student who answered correctly on indicator evaluate the evolutionary relationship among taxa, 25% undergraduate student who answered correctly on indicator applying concepts of the clade, 17% undergraduate student who answered correctly on indicator determines the character evolution, and only a few undergraduate student who can construct the phylogenetic tree.
Improved Maximum Parsimony Models for Phylogenetic Networks.

Science.gov (United States)

Van Iersel, Leo; Jones, Mark; Scornavacca, Celine

2018-05-01

Phylogenetic networks are well suited to represent evolutionary histories comprising reticulate evolution. Several methods aiming at reconstructing explicit phylogenetic networks have been developed in the last two decades. In this article, we propose a new definition of maximum parsimony for phylogenetic networks that permits to model biological scenarios that cannot be modeled by the definitions currently present in the literature (namely, the "hardwired" and "softwired" parsimony). Building on this new definition, we provide several algorithmic results that lay the foundations for new parsimony-based methods for phylogenetic network reconstruction.
An improved model for whole genome phylogenetic analysis by Fourier transform.

Science.gov (United States)

Yin, Changchuan; Yau, Stephen S-T

2015-10-07

DNA sequence similarity comparison is one of the major steps in computational phylogenetic studies. The sequence comparison of closely related DNA sequences and genomes is usually performed by multiple sequence alignments (MSA). While the MSA method is accurate for some types of sequences, it may produce incorrect results when DNA sequences undergone rearrangements as in many bacterial and viral genomes. It is also limited by its computational complexity for comparing large volumes of data. Previously, we proposed an alignment-free method that exploits the full information contents of DNA sequences by Discrete Fourier Transform (DFT), but still with some limitations. Here, we present a significantly improved method for the similarity comparison of DNA sequences by DFT. In this method, we map DNA sequences into 2-dimensional (2D) numerical sequences and then apply DFT to transform the 2D numerical sequences into frequency domain. In the 2D mapping, the nucleotide composition of a DNA sequence is a determinant factor and the 2D mapping reduces the nucleotide composition bias in distance measure, and thus improving the similarity measure of DNA sequences. To compare the DFT power spectra of DNA sequences with different lengths, we propose an improved even scaling algorithm to extend shorter DFT power spectra to the longest length of the underlying sequences. After the DFT power spectra are evenly scaled, the spectra are in the same dimensionality of the Fourier frequency space, then the Euclidean distances of full Fourier power spectra of the DNA sequences are used as the dissimilarity metrics. The improved DFT method, with increased computational performance by 2D numerical representation, can be applicable to any DNA sequences of different length ranges. We assess the accuracy of the improved DFT similarity measure in hierarchical clustering of different DNA sequences including simulated and real datasets. The method yields accurate and reliable phylogenetic trees
Fast and accurate phylogeny reconstruction using filtered spaced-word matches

Science.gov (United States)

Sohrabi-Jahromi, Salma; Morgenstern, Burkhard

2017-01-01

Abstract Motivation: Word-based or ‘alignment-free’ algorithms are increasingly used for phylogeny reconstruction and genome comparison, since they are much faster than traditional approaches that are based on full sequence alignments. Existing alignment-free programs, however, are less accurate than alignment-based methods. Results: We propose Filtered Spaced Word Matches (FSWM), a fast alignment-free approach to estimate phylogenetic distances between large genomic sequences. For a pre-defined binary pattern of match and don’t-care positions, FSWM rapidly identifies spaced word-matches between input sequences, i.e. gap-free local alignments with matching nucleotides at the match positions and with mismatches allowed at the don’t-care positions. We then estimate the number of nucleotide substitutions per site by considering the nucleotides aligned at the don’t-care positions of the identified spaced-word matches. To reduce the noise from spurious random matches, we use a filtering procedure where we discard all spaced-word matches for which the overall similarity between the aligned segments is below a threshold. We show that our approach can accurately estimate substitution frequencies even for distantly related sequences that cannot be analyzed with existing alignment-free methods; phylogenetic trees constructed with FSWM distances are of high quality. A program run on a pair of eukaryotic genomes of a few hundred Mb each takes a few minutes. Availability and Implementation: The program source code for FSWM including a documentation, as well as the software that we used to generate artificial genome sequences are freely available at http://fswm.gobics.de/ Contact: chris.leimeister@stud.uni-goettingen.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28073754
Evaluation of CLSI M44-A2 Disk Diffusion and Associated Breakpoint Testing of Caspofungin and Micafungin Using a Well-Characterized Panel of Wild-Type and fks Hot Spot Mutant Candida Isolates▿

Science.gov (United States)

Arendrup, Maiken Cavling; Park, Steven; Brown, Steven; Pfaller, Michael; Perlin, David S.

2011-01-01

Disk diffusion testing has recently been standardized by the CLSI, and susceptibility breakpoints have been established for several antifungal compounds. For caspofungin, 5-μg disks are approved, and for micafungin, 10-μg disks are under evaluation. We evaluated the performances of caspofungin and micafungin disk testing using a panel of Candida isolates with and without known FKS echinocandin resistance mechanisms. Disk diffusion and microdilution assays were performed strictly according to CLSI documents M44-A2 and M27-A3. Eighty-nine clinical Candida isolates were included: Candida albicans (20 isolates/10 mutants), C. glabrata (19 isolates/10 mutants), C. dubliniensis (2 isolates/1 mutant), C. krusei (16 isolates/3 mutants), C. parapsilosis (14 isolates/0 mutants), and C. tropicalis (18 isolates/4 mutants). Quality control strains were C. parapsilosis ATCC 22019 and C. krusei ATCC 6258. The correlations between zone diameters and MIC results were good for both compounds, with identical susceptibility classifications for 93.3% of the isolates by applying the current CLSI breakpoints. However, the numbers of fks hot spot mutant isolates misclassified as being susceptible (S) (very major errors [VMEs]) were high (61% for caspofungin [S, ≥11 mm] and 93% for micafungin [S, ≥14 mm]). Changing the disk diffusion breakpoint to S at ≥22 mm significantly improved the discrimination. For caspofungin, 1 VME was detected (a C. tropicalis isolate with an F76S substitution) (3.5%), and for micafungin, 10 VMEs were detected, the majority of which were for C. glabrata (8/10). The broadest separation between zone diameter ranges for wild-type (WT) and mutant isolates was seen for caspofungin (6 to 12 mm versus −4 to 7 mm). In conclusion, caspofungin disk diffusion testing with a modified breakpoint led to excellent separation between WT and mutant isolates for all Candida species. PMID:21357293
Investigation of Breakpoint and Trend of Daily Air Temperature Range for Mashhad, Iran

Directory of Open Access Journals (Sweden)

shideh shams

2017-01-01

same temperatures. Third, a revision of internal consistence was done, verifying that daily Tmax always exceeds daily Tmin. Fourth, the temporal coherency was tested by checking if consecutive temperature records differ by more than 8 degrees. The homogeneity of the series was tested by means of the Standard Normal Homogeneity test, the Buishand range and the Pettitt tests, on yearly, seasonal and monthly time scales. Breakpoint can be detected by means of these methods. In addition, Von Neumann ratio test was used to explore the series’ randomness. Having investigated data’s randomness in this study, series’ trend was determined by the Kendal-Tau test. Furthermore, the slope of the series’ trend was calculated using the Sen’s slope method. Results Discussion: Results indicated a decreasing trend in DTR during last 60 years (1951-2010 in Mashhad climatological station. Moreover, the results revealed that the slope of yearly DTR was decreasing (-0.029 ⁰C per year, which indicates that minimum air temperature values raise more maximum air temperature values. A breakpoint was detected during 1985. During 1951-1985, the average amount of DTR was 14.6⁰C, while this parameter reduced to 12.9⁰C for the period 1985-2010. The Kendall-Tau test was used to obtain the significance of trend during 1951-2010, 1951-1985 and 1985-2010. The results showed that during 1951-2010, DTR significantly reduced at a rate of 0.29oC per decade. However, between 1951 and 1985, DTR trend increased at a rate of 0.61oC per decade, while DTR trend between 1985 and 2010 reduced at a rate of 0.19 ⁰C per decade, which was not significant (P-value=5%. In the seasonal DTR series, the highest trend’s slope was calculated for the summer data (-0.43 ⁰C in a decade, while the lowest one accrued in spring (-0.15⁰C in a decade. From 1951 to 1985, DTR had an increasing trend, due to minimum air temperature’s downward trend. But from the late 1980 to 2010, as it was expected, downward
A program for verification of phylogenetic network models.

Science.gov (United States)

Gunawan, Andreas D M; Lu, Bingxin; Zhang, Louxin

2016-09-01

Genetic material is transferred in a non-reproductive manner across species more frequently than commonly thought, particularly in the bacteria kingdom. On one hand, extant genomes are thus more properly considered as a fusion product of both reproductive and non-reproductive genetic transfers. This has motivated researchers to adopt phylogenetic networks to study genome evolution. On the other hand, a gene's evolution is usually tree-like and has been studied for over half a century. Accordingly, the relationships between phylogenetic trees and networks are the basis for the reconstruction and verification of phylogenetic networks. One important problem in verifying a network model is determining whether or not certain existing phylogenetic trees are displayed in a phylogenetic network. This problem is formally called the tree containment problem. It is NP-complete even for binary phylogenetic networks. We design an exponential time but efficient method for determining whether or not a phylogenetic tree is displayed in an arbitrary phylogenetic network. It is developed on the basis of the so-called reticulation-visible property of phylogenetic networks. A C-program is available for download on http://www.math.nus.edu.sg/∼matzlx/tcp_package matzlx@nus.edu.sg Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Fast Construction of Near Parsimonious Hybridization Networks for Multiple Phylogenetic Trees.

Science.gov (United States)

Mirzaei, Sajad; Wu, Yufeng

2016-01-01

Hybridization networks represent plausible evolutionary histories of species that are affected by reticulate evolutionary processes. An established computational problem on hybridization networks is constructing the most parsimonious hybridization network such that each of the given phylogenetic trees (called gene trees) is "displayed" in the network. There have been several previous approaches, including an exact method and several heuristics, for this NP-hard problem. However, the exact method is only applicable to a limited range of data, and heuristic methods can be less accurate and also slow sometimes. In this paper, we develop a new algorithm for constructing near parsimonious networks for multiple binary gene trees. This method is more efficient for large numbers of gene trees than previous heuristics. This new method also produces more parsimonious results on many simulated datasets as well as a real biological dataset than a previous method. We also show that our method produces topologically more accurate networks for many datasets.
Rearrangement moves on rooted phylogenetic networks.

Science.gov (United States)

Gambette, Philippe; van Iersel, Leo; Jones, Mark; Lafond, Manuel; Pardi, Fabio; Scornavacca, Celine

2017-08-01

Phylogenetic tree reconstruction is usually done by local search heuristics that explore the space of the possible tree topologies via simple rearrangements of their structure. Tree rearrangement heuristics have been used in combination with practically all optimization criteria in use, from maximum likelihood and parsimony to distance-based principles, and in a Bayesian context. Their basic components are rearrangement moves that specify all possible ways of generating alternative phylogenies from a given one, and whose fundamental property is to be able to transform, by repeated application, any phylogeny into any other phylogeny. Despite their long tradition in tree-based phylogenetics, very little research has gone into studying similar rearrangement operations for phylogenetic network-that is, phylogenies explicitly representing scenarios that include reticulate events such as hybridization, horizontal gene transfer, population admixture, and recombination. To fill this gap, we propose "horizontal" moves that ensure that every network of a certain complexity can be reached from any other network of the same complexity, and "vertical" moves that ensure reachability between networks of different complexities. When applied to phylogenetic trees, our horizontal moves-named rNNI and rSPR-reduce to the best-known moves on rooted phylogenetic trees, nearest-neighbor interchange and rooted subtree pruning and regrafting. Besides a number of reachability results-separating the contributions of horizontal and vertical moves-we prove that rNNI moves are local versions of rSPR moves, and provide bounds on the sizes of the rNNI neighborhoods. The paper focuses on the most biologically meaningful versions of phylogenetic networks, where edges are oriented and reticulation events clearly identified. Moreover, our rearrangement moves are robust to the fact that networks with higher complexity usually allow a better fit with the data. Our goal is to provide a solid basis for
Rearrangement moves on rooted phylogenetic networks.

Directory of Open Access Journals (Sweden)

Philippe Gambette

2017-08-01

Full Text Available Phylogenetic tree reconstruction is usually done by local search heuristics that explore the space of the possible tree topologies via simple rearrangements of their structure. Tree rearrangement heuristics have been used in combination with practically all optimization criteria in use, from maximum likelihood and parsimony to distance-based principles, and in a Bayesian context. Their basic components are rearrangement moves that specify all possible ways of generating alternative phylogenies from a given one, and whose fundamental property is to be able to transform, by repeated application, any phylogeny into any other phylogeny. Despite their long tradition in tree-based phylogenetics, very little research has gone into studying similar rearrangement operations for phylogenetic network-that is, phylogenies explicitly representing scenarios that include reticulate events such as hybridization, horizontal gene transfer, population admixture, and recombination. To fill this gap, we propose "horizontal" moves that ensure that every network of a certain complexity can be reached from any other network of the same complexity, and "vertical" moves that ensure reachability between networks of different complexities. When applied to phylogenetic trees, our horizontal moves-named rNNI and rSPR-reduce to the best-known moves on rooted phylogenetic trees, nearest-neighbor interchange and rooted subtree pruning and regrafting. Besides a number of reachability results-separating the contributions of horizontal and vertical moves-we prove that rNNI moves are local versions of rSPR moves, and provide bounds on the sizes of the rNNI neighborhoods. The paper focuses on the most biologically meaningful versions of phylogenetic networks, where edges are oriented and reticulation events clearly identified. Moreover, our rearrangement moves are robust to the fact that networks with higher complexity usually allow a better fit with the data. Our goal is to provide
Phylogenetic tests of distribution patterns in South Asia: towards

Indian Academy of Sciences (India)

The last four decades have seen an increasing integration of phylogenetics and biogeography. However, a dearth of phylogenetic studies has precluded such biogeographic analyses in South Asia until recently. Noting the increase in phylogenetic research and interest in phylogenetic biogeography in the region, we ...
Fourier transform inequalities for phylogenetic trees.

Science.gov (United States)

Matsen, Frederick A

2009-01-01

Phylogenetic invariants are not the only constraints on site-pattern frequency vectors for phylogenetic trees. A mutation matrix, by its definition, is the exponential of a matrix with non-negative off-diagonal entries; this positivity requirement implies non-trivial constraints on the site-pattern frequency vectors. We call these additional constraints "edge-parameter inequalities". In this paper, we first motivate the edge-parameter inequalities by considering a pathological site-pattern frequency vector corresponding to a quartet tree with a negative internal edge. This site-pattern frequency vector nevertheless satisfies all of the constraints described up to now in the literature. We next describe two complete sets of edge-parameter inequalities for the group-based models; these constraints are square-free monomial inequalities in the Fourier transformed coordinates. These inequalities, along with the phylogenetic invariants, form a complete description of the set of site-pattern frequency vectors corresponding to bona fide trees. Said in mathematical language, this paper explicitly presents two finite lists of inequalities in Fourier coordinates of the form "monomial < or = 1", each list characterizing the phylogenetically relevant semialgebraic subsets of the phylogenetic varieties.
Phylogenetic search through partial tree mixing

Science.gov (United States)

2012-01-01

Background Recent advances in sequencing technology have created large data sets upon which phylogenetic inference can be performed. Current research is limited by the prohibitive time necessary to perform tree search on a reasonable number of individuals. This research develops new phylogenetic algorithms that can operate on tens of thousands of species in a reasonable amount of time through several innovative search techniques. Results When compared to popular phylogenetic search algorithms, better trees are found much more quickly for large data sets. These algorithms are incorporated in the PSODA application available at http://dna.cs.byu.edu/psoda Conclusions The use of Partial Tree Mixing in a partition based tree space allows the algorithm to quickly converge on near optimal tree regions. These regions can then be searched in a methodical way to determine the overall optimal phylogenetic solution. PMID:23320449
TreeFam: a curated database of phylogenetic trees of animal gene families

DEFF Research Database (Denmark)

Li, Heng; Coghlan, Avril; Ruan, Jue

2006-01-01

TreeFam is a database of phylogenetic trees of gene families found in animals. It aims to develop a curated resource that presents the accurate evolutionary history of all animal gene families, as well as reliable ortholog and paralog assignments. Curated families are being added progressively......, based on seed alignments and trees in a similar fashion to Pfam. Release 1.1 of TreeFam contains curated trees for 690 families and automatically generated trees for another 11 646 families. These represent over 128 000 genes from nine fully sequenced animal genomes and over 45 000 other animal proteins...
Phylogenetic Signal in AFLP Data Sets

NARCIS (Netherlands)

Koopman, W.J.M.

2005-01-01

AFLP markers provide a potential source of phylogenetic information for molecular systematic studies. However, there are properties of restriction fragment data that limit phylogenetic interpretation of AFLPs. These are (a) possible nonindependence of fragments, (b) problems of homology assignment
How does cognition evolve? Phylogenetic comparative psychology

Science.gov (United States)

Matthews, Luke J.; Hare, Brian A.; Nunn, Charles L.; Anderson, Rindy C.; Aureli, Filippo; Brannon, Elizabeth M.; Call, Josep; Drea, Christine M.; Emery, Nathan J.; Haun, Daniel B. M.; Herrmann, Esther; Jacobs, Lucia F.; Platt, Michael L.; Rosati, Alexandra G.; Sandel, Aaron A.; Schroepfer, Kara K.; Seed, Amanda M.; Tan, Jingzhi; van Schaik, Carel P.; Wobber, Victoria

2014-01-01

Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution. PMID:21927850
How does cognition evolve? Phylogenetic comparative psychology.

Science.gov (United States)

MacLean, Evan L; Matthews, Luke J; Hare, Brian A; Nunn, Charles L; Anderson, Rindy C; Aureli, Filippo; Brannon, Elizabeth M; Call, Josep; Drea, Christine M; Emery, Nathan J; Haun, Daniel B M; Herrmann, Esther; Jacobs, Lucia F; Platt, Michael L; Rosati, Alexandra G; Sandel, Aaron A; Schroepfer, Kara K; Seed, Amanda M; Tan, Jingzhi; van Schaik, Carel P; Wobber, Victoria

2012-03-01

Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution.

Phylogenetic reconstruction methods: an overview.

Science.gov (United States)

De Bruyn, Alexandre; Martin, Darren P; Lefeuvre, Pierre

2014-01-01

Initially designed to infer evolutionary relationships based on morphological and physiological characters, phylogenetic reconstruction methods have greatly benefited from recent developments in molecular biology and sequencing technologies with a number of powerful methods having been developed specifically to infer phylogenies from macromolecular data. This chapter, while presenting an overview of basic concepts and methods used in phylogenetic reconstruction, is primarily intended as a simplified step-by-step guide to the construction of phylogenetic trees from nucleotide sequences using fairly up-to-date maximum likelihood methods implemented in freely available computer programs. While the analysis of chloroplast sequences from various Vanilla species is used as an illustrative example, the techniques covered here are relevant to the comparative analysis of homologous sequences datasets sampled from any group of organisms.
Barcoding and Phylogenetic Inferences in Nine Mugilid Species (Pisces, Mugiliformes

Directory of Open Access Journals (Sweden)

Neonila Polyakova

2013-10-01

Full Text Available Accurate identification of fish and fish products, from eggs to adults, is important in many areas. Grey mullets of the family Mugilidae are distributed worldwide and inhabit marine, estuarine, and freshwater environments in all tropical and temperate regions. Various Mugilid species are commercially important species in fishery and aquaculture of many countries. For the present study we have chosen two Mugilid genes with different phylogenetic signals: relatively variable mitochondrial cytochrome oxidase subunit I (COI and conservative nuclear rhodopsin (RHO. We examined their diversity within and among 9 Mugilid species belonging to 4 genera, many of which have been examined from multiple specimens, with the goal of determining whether DNA barcoding can achieve unambiguous species recognition of Mugilid species. The data obtained showed that information based on COI sequences was diagnostic not only for species-level identification but also for recognition of intraspecific units, e.g., allopatric populations of circumtropical Mugil cephalus, or even native and acclimatized specimens of Chelon haematocheila. All RHO sequences appeared strictly species specific. Based on the data obtained, we conclude that COI, as well as RHO sequencing can be used to unambiguously identify fish species. Topologies of phylogeny based on RHO and COI sequences coincided with each other, while together they had a good phylogenetic signal.
Comparative evolutionary diversity and phylogenetic structure across multiple forest dynamics plots: a mega-phylogeny approach

Science.gov (United States)

Erickson, David L.; Jones, Frank A.; Swenson, Nathan G.; Pei, Nancai; Bourg, Norman A.; Chen, Wenna; Davies, Stuart J.; Ge, Xue-jun; Hao, Zhanqing; Howe, Robert W.; Huang, Chun-Lin; Larson, Andrew J.; Lum, Shawn K. Y.; Lutz, James A.; Ma, Keping; Meegaskumbura, Madhava; Mi, Xiangcheng; Parker, John D.; Fang-Sun, I.; Wright, S. Joseph; Wolf, Amy T.; Ye, W.; Xing, Dingliang; Zimmerman, Jess K.; Kress, W. John

2014-01-01

Forest dynamics plots, which now span longitudes, latitudes, and habitat types across the globe, offer unparalleled insights into the ecological and evolutionary processes that determine how species are assembled into communities. Understanding phylogenetic relationships among species in a community has become an important component of assessing assembly processes. However, the application of evolutionary information to questions in community ecology has been limited in large part by the lack of accurate estimates of phylogenetic relationships among individual species found within communities, and is particularly limiting in comparisons between communities. Therefore, streamlining and maximizing the information content of these community phylogenies is a priority. To test the viability and advantage of a multi-community phylogeny, we constructed a multi-plot mega-phylogeny of 1347 species of trees across 15 forest dynamics plots in the ForestGEO network using DNA barcode sequence data (rbcL, matK, and psbA-trnH) and compared community phylogenies for each individual plot with respect to support for topology and branch lengths, which affect evolutionary inference of community processes. The levels of taxonomic differentiation across the phylogeny were examined by quantifying the frequency of resolved nodes throughout. In addition, three phylogenetic distance (PD) metrics that are commonly used to infer assembly processes were estimated for each plot [PD, Mean Phylogenetic Distance (MPD), and Mean Nearest Taxon Distance (MNTD)]. Lastly, we examine the partitioning of phylogenetic diversity among community plots through quantification of inter-community MPD and MNTD. Overall, evolutionary relationships were highly resolved across the DNA barcode-based mega-phylogeny, and phylogenetic resolution for each community plot was improved when estimated within the context of the mega-phylogeny. Likewise, when compared with phylogenies for individual plots, estimates of
Detecting exact breakpoints of deletions with diversity in hepatitis B viral genomic DNA from next-generation sequencing data.

Science.gov (United States)

Cheng, Ji-Hong; Liu, Wen-Chun; Chang, Ting-Tsung; Hsieh, Sun-Yuan; Tseng, Vincent S

2017-10-01

Many studies have suggested that deletions of Hepatitis B Viral (HBV) are associated with the development of progressive liver diseases, even ultimately resulting in hepatocellular carcinoma (HCC). Among the methods for detecting deletions from next-generation sequencing (NGS) data, few methods considered the characteristics of virus, such as high evolution rates and high divergence among the different HBV genomes. Sequencing high divergence HBV genome sequences using the NGS technology outputs millions of reads. Thus, detecting exact breakpoints of deletions from these big and complex data incurs very high computational cost. We proposed a novel analytical method named VirDelect (Virus Deletion Detect), which uses split read alignment base to detect exact breakpoint and diversity variable to consider high divergence in single-end reads data, such that the computational cost can be reduced without losing accuracy. We use four simulated reads datasets and two real pair-end reads datasets of HBV genome sequence to verify VirDelect accuracy by score functions. The experimental results show that VirDelect outperforms the state-of-the-art method Pindel in terms of accuracy score for all simulated datasets and VirDelect had only two base errors even in real datasets. VirDelect is also shown to deliver high accuracy in analyzing the single-end read data as well as pair-end data. VirDelect can serve as an effective and efficient bioinformatics tool for physiologists with high accuracy and efficient performance and applicable to further analysis with characteristics similar to HBV on genome length and high divergence. The software program of VirDelect can be downloaded at https://sourceforge.net/projects/virdelect/. Copyright © 2017. Published by Elsevier Inc.
Nucleotide diversity and phylogenetic relationships among ...

Indian Academy of Sciences (India)

NIRAJ SINGH

for phylogenetic analysis of Gladiolus and related taxa using combined datasets from chloroplast genome. The psbA–trnH ... phylogenetic relationships among cultivars could be useful for hybridization programmes for further improvement of the crop. [Singh N. ... breeding in nature, and exhibited diverse pollination mech-.
A Comprehensive Strategy for Accurate Mutation Detection of the Highly Homologous PMS2.

Science.gov (United States)

Li, Jianli; Dai, Hongzheng; Feng, Yanming; Tang, Jia; Chen, Stella; Tian, Xia; Gorman, Elizabeth; Schmitt, Eric S; Hansen, Terah A A; Wang, Jing; Plon, Sharon E; Zhang, Victor Wei; Wong, Lee-Jun C

2015-09-01

Germline mutations in the DNA mismatch repair gene PMS2 underlie the cancer susceptibility syndrome, Lynch syndrome. However, accurate molecular testing of PMS2 is complicated by a large number of highly homologous sequences. To establish a comprehensive approach for mutation detection of PMS2, we have designed a strategy combining targeted capture next-generation sequencing (NGS), multiplex ligation-dependent probe amplification, and long-range PCR followed by NGS to simultaneously detect point mutations and copy number changes of PMS2. Exonic deletions (E2 to E9, E5 to E9, E8, E10, E14, and E1 to E15), duplications (E11 to E12), and a nonsense mutation, p.S22*, were identified. Traditional multiplex ligation-dependent probe amplification and Sanger sequencing approaches cannot differentiate the origin of the exonic deletions in the 3' region when PMS2 and PMS2CL share identical sequences as a result of gene conversion. Our approach allows unambiguous identification of mutations in the active gene with a straightforward long-range-PCR/NGS method. Breakpoint analysis of multiple samples revealed that recurrent exon 14 deletions are mediated by homologous Alu sequences. Our comprehensive approach provides a reliable tool for accurate molecular analysis of genes containing multiple copies of highly homologous sequences and should improve PMS2 molecular analysis for patients with Lynch syndrome. Copyright © 2015 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Phylogenetic Position of Barbus lacerta Heckel, 1843

Directory of Open Access Journals (Sweden)

Mustafa Korkmaz

2015-11-01

As a result, five clades come out from phylogenetic reconstruction and in phylogenetic tree Barbus lacerta determined to be sister group of Barbus macedonicus, Barbus oligolepis and Barbus plebejus complex.
The phylogenetics of succession can guide restoration

DEFF Research Database (Denmark)

Shooner, Stephanie; Chisholm, Chelsea Lee; Davies, T. Jonathan

2015-01-01

Phylogenetic tools have increasingly been used in community ecology to describe the evolutionary relationships among co-occurring species. In studies of succession, such tools may allow us to identify the evolutionary lineages most suited for particular stages of succession and habitat...... rehabilitation. However, to date, these two applications have been largely separate. Here, we suggest that information on phylogenetic community structure might help to inform community restoration strategies following major disturbance. Our study examined phylogenetic patterns of succession based...... for species sorting along abiotic gradients (slope and aspect) on the mine sites that had been abandoned for the longest. Synthesis and applications. Understanding the trajectory of succession is critical for restoration efforts. Our results suggest that early colonizers represent a phylogenetically random...
Bayesian models for comparative analysis integrating phylogenetic uncertainty

Directory of Open Access Journals (Sweden)

Villemereuil Pierre de

2012-06-01

Full Text Available Abstract Background Uncertainty in comparative analyses can come from at least two sources: a phylogenetic uncertainty in the tree topology or branch lengths, and b uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow and inflated significance in hypothesis testing (e.g. p-values will be too small. Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. Methods We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. Results We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Conclusions Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. We show that BUGS is a useful, flexible
Bayesian models for comparative analysis integrating phylogenetic uncertainty

Science.gov (United States)

2012-01-01

Background Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. Methods We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. Results We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Conclusions Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. We show that BUGS is a useful, flexible general purpose tool for
DNA Sequences Proximal to Human Mitochondrial DNA Deletion Breakpoints Prevalent in Human Disease Form G-quadruplexes, a Class of DNA Structures Inefficiently Unwound by the Mitochondrial Replicative Twinkle Helicase*

Science.gov (United States)

Bharti, Sanjay Kumar; Sommers, Joshua A.; Zhou, Jun; Kaplan, Daniel L.; Spelbrink, Johannes N.; Mergny, Jean-Louis; Brosh, Robert M.

2014-01-01

Mitochondrial DNA deletions are prominent in human genetic disorders, cancer, and aging. It is thought that stalling of the mitochondrial replication machinery during DNA synthesis is a prominent source of mitochondrial genome instability; however, the precise molecular determinants of defective mitochondrial replication are not well understood. In this work, we performed a computational analysis of the human mitochondrial genome using the “Pattern Finder” G-quadruplex (G4) predictor algorithm to assess whether G4-forming sequences reside in close proximity (within 20 base pairs) to known mitochondrial DNA deletion breakpoints. We then used this information to map G4P sequences with deletions characteristic of representative mitochondrial genetic disorders and also those identified in various cancers and aging. Circular dichroism and UV spectral analysis demonstrated that mitochondrial G-rich sequences near deletion breakpoints prevalent in human disease form G-quadruplex DNA structures. A biochemical analysis of purified recombinant human Twinkle protein (gene product of c10orf2) showed that the mitochondrial replicative helicase inefficiently unwinds well characterized intermolecular and intramolecular G-quadruplex DNA substrates, as well as a unimolecular G4 substrate derived from a mitochondrial sequence that nests a deletion breakpoint described in human renal cell carcinoma. Although G4 has been implicated in the initiation of mitochondrial DNA replication, our current findings suggest that mitochondrial G-quadruplexes are also likely to be a source of instability for the mitochondrial genome by perturbing the normal progression of the mitochondrial replication machinery, including DNA unwinding by Twinkle helicase. PMID:25193669
Nucleotide diversity and phylogenetic relationships among ...

Indian Academy of Sciences (India)

Navya

2 attached at the base of tree as the diverging Iridaceae relative's lineage. Present study revealed that psbA-trnH region are useful in addressing questions of phylogenetic relationships among the Gladiolus cultivars, as these intergenic spacers are more variable and have more phylogenetically informative sites than the ...
Monte Carlo estimation of total variation distance of Markov chains on large spaces, with application to phylogenetics.

Science.gov (United States)

Herbei, Radu; Kubatko, Laura

2013-03-26

Markov chains are widely used for modeling in many areas of molecular biology and genetics. As the complexity of such models advances, it becomes increasingly important to assess the rate at which a Markov chain converges to its stationary distribution in order to carry out accurate inference. A common measure of convergence to the stationary distribution is the total variation distance, but this measure can be difficult to compute when the state space of the chain is large. We propose a Monte Carlo method to estimate the total variation distance that can be applied in this situation, and we demonstrate how the method can be efficiently implemented by taking advantage of GPU computing techniques. We apply the method to two Markov chains on the space of phylogenetic trees, and discuss the implications of our findings for the development of algorithms for phylogenetic inference.
Point estimates in phylogenetic reconstructions

OpenAIRE

Benner, Philipp; Bacak, Miroslav; Bourguignon, Pierre-Yves

2013-01-01

Motivation: The construction of statistics for summarizing posterior samples returned by a Bayesian phylogenetic study has so far been hindered by the poor geometric insights available into the space of phylogenetic trees, and ad hoc methods such as the derivation of a consensus tree makeup for the ill-definition of the usual concepts of posterior mean, while bootstrap methods mitigate the absence of a sound concept of variance. Yielding satisfactory results with sufficiently concentrated pos...
Open Reading Frame Phylogenetic Analysis on the Cloud

Directory of Open Access Journals (Sweden)

Che-Lun Hung

2013-01-01

Full Text Available Phylogenetic analysis has become essential in researching the evolutionary relationships between viruses. These relationships are depicted on phylogenetic trees, in which viruses are grouped based on sequence similarity. Viral evolutionary relationships are identified from open reading frames rather than from complete sequences. Recently, cloud computing has become popular for developing internet-based bioinformatics tools. Biocloud is an efficient, scalable, and robust bioinformatics computing service. In this paper, we propose a cloud-based open reading frame phylogenetic analysis service. The proposed service integrates the Hadoop framework, virtualization technology, and phylogenetic analysis methods to provide a high-availability, large-scale bioservice. In a case study, we analyze the phylogenetic relationships among Norovirus. Evolutionary relationships are elucidated by aligning different open reading frame sequences. The proposed platform correctly identifies the evolutionary relationships between members of Norovirus.
Characterization of Escherichia coli Phylogenetic Groups ...

African Journals Online (AJOL)

Background: Escherichia coli strains mainly fall into four phylogenetic groups (A, B1, B2, and D) and that virulent extra‑intestinal strains mainly belong to groups B2 and D. Aim: The aim was to determine the association between phylogenetic groups of E. coli causing extraintestinal infections (ExPEC) regarding the site of ...
N-nitrosodimethylamine (NDMA) formation potential of amine-based water treatment polymers: Effects of in situ chloramination, breakpoint chlorination, and pre-oxidation.

Science.gov (United States)

Park, Sang Hyuck; Padhye, Lokesh P; Wang, Pei; Cho, Min; Kim, Jae-Hong; Huang, Ching-Hua

2015-01-23

Recent studies show that cationic amine-based water treatment polymers may be important precursors that contribute to formation of the probable human carcinogen N-nitrosodimethylamine (NDMA) during water treatment and disinfection. To better understand how water treatment parameters affect NDMA formation from the polymers, the effects of in situ chloramination, breakpoint chlorination, and pre-oxidation on the NDMA formation from the polymers were investigated. NDMA formation potential (NDMA-FP) as well as dimethylamine (DMA) residual concentration were measured from poly(epichlorohydrin dimethylamine) (polyamine) and poly(diallyldimethylammonium chloride) (polyDADMAC) solutions upon reactions with oxidants including free chlorine, chlorine dioxide, ozone, and monochloramine under different treatment conditions. The results supported that dichloramine (NHCl2) formation was the critical factor affecting NDMA formation from the polymers during in situ chloramination. The highest NDMA formation from the polymers occurred near the breakpoint of chlorination. Polymer chain breakdown and transformation of the released DMA and other intermediates were important factors affecting NDMA formation from the polymers in pre-oxidation followed by post-chloramination. Pre-oxidation generally reduced NDMA-FP of the polymers; however, the treatments involving pre-ozonation increased polyDADMAC's NDMA-FP and DMA release. The strategies for reducing NDMA formation from the polymers may include the avoidance of the conditions favorable to NHCl2 formation and the avoidance of polymer exposure to strong oxidants such as ozone. Copyright © 2014 Elsevier B.V. All rights reserved.
A Practical Algorithm for Reconstructing Level-1 Phylogenetic Networks

NARCIS (Netherlands)

K.T. Huber; L.J.J. van Iersel (Leo); S.M. Kelk (Steven); R. Suchecki

2010-01-01

htmlabstractRecently much attention has been devoted to the construction of phylogenetic networks which generalize phylogenetic trees in order to accommodate complex evolutionary processes. Here we present an efficient, practical algorithm for reconstructing level-1 phylogenetic networks - a type of
A practical algorithm for reconstructing level-1 phylogenetic networks

NARCIS (Netherlands)

Huber, K.T.; Iersel, van L.J.J.; Kelk, S.M.; Suchecki, R.

2011-01-01

Recently, much attention has been devoted to the construction of phylogenetic networks which generalize phylogenetic trees in order to accommodate complex evolutionary processes. Here, we present an efficient, practical algorithm for reconstructing level-1 phylogenetic networks-a type of network
Folding and unfolding phylogenetic trees and networks.

Science.gov (United States)

Huber, Katharina T; Moulton, Vincent; Steel, Mike; Wu, Taoyang

2016-12-01

Phylogenetic networks are rooted, labelled directed acyclic graphswhich are commonly used to represent reticulate evolution. There is a close relationship between phylogenetic networks and multi-labelled trees (MUL-trees). Indeed, any phylogenetic network N can be "unfolded" to obtain a MUL-tree U(N) and, conversely, a MUL-tree T can in certain circumstances be "folded" to obtain aphylogenetic network F(T) that exhibits T. In this paper, we study properties of the operations U and F in more detail. In particular, we introduce the class of stable networks, phylogenetic networks N for which F(U(N)) is isomorphic to N, characterise such networks, and show that they are related to the well-known class of tree-sibling networks. We also explore how the concept of displaying a tree in a network N can be related to displaying the tree in the MUL-tree U(N). To do this, we develop aphylogenetic analogue of graph fibrations. This allows us to view U(N) as the analogue of the universal cover of a digraph, and to establish a close connection between displaying trees in U(N) and reconciling phylogenetic trees with networks.

Topological variation in single-gene phylogenetic trees

OpenAIRE

Castresana, Jose

2007-01-01

A recent large-scale phylogenomic study has shown the great degree of topological variation that can be found among eukaryotic phylogenetic trees constructed from single genes, highlighting the problems that can be associated with gene sampling in phylogenetic studies.
Sequence comparison and phylogenetic analysis of core gene of ...

African Journals Online (AJOL)

Phylogenetic analysis suggests that our sequences are clustered with sequences reported from Japan. This is the first phylogenetic analysis of HCV core gene from Pakistani population. Our sequences and sequences from Japan are grouped into same cluster in the phylogenetic tree. Sequence comparison and ...
PhyDesign: an online application for profiling phylogenetic informativeness

Directory of Open Access Journals (Sweden)

Townsend Jeffrey P

2011-05-01

Full Text Available Abstract Background The rapid increase in number of sequenced genomes for species across of the tree of life is revealing a diverse suite of orthologous genes that could potentially be employed to inform molecular phylogenetic studies that encompass broader taxonomic sampling. Optimal usage of this diversity of loci requires user-friendly tools to facilitate widespread cost-effective locus prioritization for phylogenetic sampling. The Townsend (2007 phylogenetic informativeness provides a unique empirical metric for guiding marker selection. However, no software or automated methodology to evaluate sequence alignments and estimate the phylogenetic informativeness metric has been available. Results Here, we present PhyDesign, a platform-independent online application that implements the Townsend (2007 phylogenetic informativeness analysis, providing a quantitative prediction of the utility of loci to solve specific phylogenetic questions. An easy-to-use interface facilitates uploading of alignments and ultrametric trees to calculate and depict profiles of informativeness over specified time ranges, and provides rankings of locus prioritization for epochs of interest. Conclusions By providing these profiles, PhyDesign facilitates locus prioritization increasing the efficiency of sequencing for phylogenetic purposes compared to traditional studies with more laborious and low capacity screening methods, as well as increasing the accuracy of phylogenetic studies. Together with a manual and sample files, the application is freely accessible at http://phydesign.townsend.yale.edu.
The Diaporthe sojae species complex: Phylogenetic re-assessment of pathogens associated with soybean, cucurbits and other field crops.

Science.gov (United States)

Udayanga, Dhanushka; Castlebury, Lisa A; Rossman, Amy Y; Chukeatirote, Ekachai; Hyde, Kevin D

2015-05-01

Phytopathogenic species of Diaporthe are associated with a number of soybean diseases including seed decay, pod and stem blight and stem canker and lead to considerable crop production losses worldwide. Accurate morphological identification of the species that cause these diseases has been difficult. In this study, we determined the phylogenetic relationships and species boundaries of Diaporthe longicolla, Diaporthe phaseolorum, Diaporthe sojae and closely related taxa. Species boundaries for this complex were determined based on combined phylogenetic analysis of five gene regions: partial sequences of calmodulin (CAL), beta-tubulin (TUB), histone-3 (HIS), translation elongation factor 1-α (EF1-α), and the nuclear ribosomal internal transcribed spacers (ITS). Phylogenetic analyses revealed that this large complex of taxa is comprised of soybean pathogens as well as species associated with herbaceous field crops and weeds. Diaporthe arctii, Diaporthe batatas, D. phaseolorum and D. sojae are epitypified. The seed decay pathogen D. longicolla was determined to be distinct from D. sojae. D. phaseolorum, originally associated with stem and leaf blight of Lima bean, was not found to be associated with soybean. A new species, Diaporthe ueckerae on Cucumis melo, is introduced with description and illustrations. Published by Elsevier Ltd.
Phylogenetic tree construction using trinucleotide usage profile (TUP).

Science.gov (United States)

Chen, Si; Deng, Lih-Yuan; Bowman, Dale; Shiau, Jyh-Jen Horng; Wong, Tit-Yee; Madahian, Behrouz; Lu, Henry Horng-Shing

2016-10-06

It has been a challenging task to build a genome-wide phylogenetic tree for a large group of species containing a large number of genes with long nucleotides sequences. The most popular method, called feature frequency profile (FFP-k), finds the frequency distribution for all words of certain length k over the whole genome sequence using (overlapping) windows of the same length. For a satisfactory result, the recommended word length (k) ranges from 6 to 15 and it may not be a multiple of 3 (codon length). The total number of possible words needed for FFP-k can range from 4 6 =4096 to 4 15 . We propose a simple improvement over the popular FFP method using only a typical word length of 3. A new method, called Trinucleotide Usage Profile (TUP), is proposed based only on the (relative) frequency distribution using non-overlapping windows of length 3. The total number of possible words needed for TUP is 4 3 =64, which is much less than the total count for the recommended optimal "resolution" for FFP. To build a phylogenetic tree, we propose first representing each of the species by a TUP vector and then using an appropriate distance measure between pairs of the TUP vectors for the tree construction. In particular, we propose summarizing a DNA sequence by a matrix of three rows corresponding to three reading frames, recording the frequency distribution of the non-overlapping words of length 3 in each of the reading frame. We also provide a numerical measure for comparing trees constructed with various methods. Compared to the FFP method, our empirical study showed that the proposed TUP method is more capable of building phylogenetic trees with a stronger biological support. We further provide some justifications on this from the information theory viewpoint. Unlike the FFP method, the TUP method takes the advantage that the starting of the first reading frame is (usually) known. Without this information, the FFP method could only rely on the frequency distribution of
Predicting rates of interspecific interaction from phylogenetic trees.

Science.gov (United States)

Nuismer, Scott L; Harmon, Luke J

2015-01-01

Integrating phylogenetic information can potentially improve our ability to explain species' traits, patterns of community assembly, the network structure of communities, and ecosystem function. In this study, we use mathematical models to explore the ecological and evolutionary factors that modulate the explanatory power of phylogenetic information for communities of species that interact within a single trophic level. We find that phylogenetic relationships among species can influence trait evolution and rates of interaction among species, but only under particular models of species interaction. For example, when interactions within communities are mediated by a mechanism of phenotype matching, phylogenetic trees make specific predictions about trait evolution and rates of interaction. In contrast, if interactions within a community depend on a mechanism of phenotype differences, phylogenetic information has little, if any, predictive power for trait evolution and interaction rate. Together, these results make clear and testable predictions for when and how evolutionary history is expected to influence contemporary rates of species interaction. © 2014 John Wiley & Sons Ltd/CNRS.
Phylogenetic Structure of Foliar Spectral Traits in Tropical Forest Canopies

Directory of Open Access Journals (Sweden)

Kelly M. McManus

2016-02-01

Full Text Available The Spectranomics approach to tropical forest remote sensing has established a link between foliar reflectance spectra and the phylogenetic composition of tropical canopy tree communities vis-à-vis the taxonomic organization of biochemical trait variation. However, a direct relationship between phylogenetic affiliation and foliar reflectance spectra of species has not been established. We sought to develop this relationship by quantifying the extent to which underlying patterns of phylogenetic structure drive interspecific variation among foliar reflectance spectra within three Neotropical canopy tree communities with varying levels of soil fertility. We interpreted the resulting spectral patterns of phylogenetic signal in the context of foliar biochemical traits that may contribute to the spectral-phylogenetic link. We utilized a multi-model ensemble to elucidate trait-spectral relationships, and quantified phylogenetic signal for spectral wavelengths and traits using Pagel’s lambda statistic. Foliar reflectance spectra showed evidence of phylogenetic influence primarily within the visible and shortwave infrared spectral regions. These regions were also selected by the multi-model ensemble as those most important to the quantitative prediction of several foliar biochemical traits. Patterns of phylogenetic organization of spectra and traits varied across sites and with soil fertility, indicative of the complex interactions between the environmental and phylogenetic controls underlying patterns of biodiversity.
Molecular identification and phylogenetic analysis of Wuchereria bancrofti from human blood samples in Egypt.

Science.gov (United States)

Abdel-Shafi, Iman R; Shoieb, Eman Y; Attia, Samar S; Rubio, José M; Ta-Tang, Thuy-Huong; El-Badry, Ayman A

2017-03-01

Lymphatic filariasis (LF) is a serious vector-borne health problem, and Wuchereria bancrofti (W.b) is the major cause of LF worldwide and is focally endemic in Egypt. Identification of filarial infection using traditional morphologic and immunological criteria can be difficult and lead to misdiagnosis. The aim of the present study was molecular detection of W.b in residents in endemic areas in Egypt, sequence variance analysis, and phylogenetic analysis of W.b DNA. Collected blood samples from residents in filariasis endemic areas in five governorates were subjected to semi-nested PCR targeting repeated DNA sequence, for detection of W.b DNA. PCR products were sequenced; subsequently, a phylogenetic analysis of the obtained sequences was performed. Out of 300 blood samples, W.b DNA was identified in 48 (16%). Sequencing analysis confirmed PCR results identifying only W.b species. Sequence alignment and phylogenetic analysis indicated genetically distinct clusters of W.b among the study population. Study results demonstrated that the semi-nested PCR proved to be an effective diagnostic tool for accurate and rapid detection of W.b infections in nano-epidemics and is applicable for samples collected in the daytime as well as the night time. PCR products sequencing and phylogenitic analysis revealed three different nucleotide sequences variants. Further genetic studies of W.b in Egypt and other endemic areas are needed to distinguish related strains and the various ecological as well as drug effects exerted on them to support W.b elimination.
Maximizing the phylogenetic diversity of seed banks.

Science.gov (United States)

Griffiths, Kate E; Balding, Sharon T; Dickie, John B; Lewis, Gwilym P; Pearce, Tim R; Grenyer, Richard

2015-04-01

Ex situ conservation efforts such as those of zoos, botanical gardens, and seed banks will form a vital complement to in situ conservation actions over the coming decades. It is therefore necessary to pay the same attention to the biological diversity represented in ex situ conservation facilities as is often paid to protected-area networks. Building the phylogenetic diversity of ex situ collections will strengthen our capacity to respond to biodiversity loss. Since 2000, the Millennium Seed Bank Partnership has banked seed from 14% of the world's plant species. We assessed the taxonomic, geographic, and phylogenetic diversity of the Millennium Seed Bank collection of legumes (Leguminosae). We compared the collection with all known legume genera, their known geographic range (at country and regional levels), and a genus-level phylogeny of the legume family constructed for this study. Over half the phylogenetic diversity of legumes at the genus level was represented in the Millennium Seed Bank. However, pragmatic prioritization of species of economic importance and endangerment has led to the banking of a less-than-optimal phylogenetic diversity and prioritization of range-restricted species risks an underdispersed collection. The current state of the phylogenetic diversity of legumes in the Millennium Seed Bank could be substantially improved through the strategic banking of relatively few additional taxa. Our method draws on tools that are widely applied to in situ conservation planning, and it can be used to evaluate and improve the phylogenetic diversity of ex situ collections. © 2014 Society for Conservation Biology.
Rapid and accurate pyrosequencing of angiosperm plastid genomes

Science.gov (United States)

Moore, Michael J; Dhingra, Amit; Soltis, Pamela S; Shaw, Regina; Farmerie, William G; Folta, Kevin M; Soltis, Douglas E

2006-01-01

Background Plastid genome sequence information is vital to several disciplines in plant biology, including phylogenetics and molecular biology. The past five years have witnessed a dramatic increase in the number of completely sequenced plastid genomes, fuelled largely by advances in conventional Sanger sequencing technology. Here we report a further significant reduction in time and cost for plastid genome sequencing through the successful use of a newly available pyrosequencing platform, the Genome Sequencer 20 (GS 20) System (454 Life Sciences Corporation), to rapidly and accurately sequence the whole plastid genomes of the basal eudicot angiosperms Nandina domestica (Berberidaceae) and Platanus occidentalis (Platanaceae). Results More than 99.75% of each plastid genome was simultaneously obtained during two GS 20 sequence runs, to an average depth of coverage of 24.6× in Nandina and 17.3× in Platanus. The Nandina and Platanus plastid genomes shared essentially identical gene complements and possessed the typical angiosperm plastid structure and gene arrangement. To assess the accuracy of the GS 20 sequence, over 45 kilobases of sequence were generated for each genome using conventional sequencing. Overall error rates of 0.043% and 0.031% were observed in GS 20 sequence for Nandina and Platanus, respectively. More than 97% of all observed errors were associated with homopolymer runs, with ~60% of all errors associated with homopolymer runs of 5 or more nucleotides and ~50% of all errors associated with regions of extensive homopolymer runs. No substitution errors were present in either genome. Error rates were generally higher in the single-copy and noncoding regions of both plastid genomes relative to the inverted repeat and coding regions. Conclusion Highly accurate and essentially complete sequence information was obtained for the Nandina and Platanus plastid genomes using the GS 20 System. More importantly, the high accuracy observed in the GS 20 plastid
Rapid and accurate pyrosequencing of angiosperm plastid genomes

Directory of Open Access Journals (Sweden)

Farmerie William G

2006-08-01

Full Text Available Abstract Background Plastid genome sequence information is vital to several disciplines in plant biology, including phylogenetics and molecular biology. The past five years have witnessed a dramatic increase in the number of completely sequenced plastid genomes, fuelled largely by advances in conventional Sanger sequencing technology. Here we report a further significant reduction in time and cost for plastid genome sequencing through the successful use of a newly available pyrosequencing platform, the Genome Sequencer 20 (GS 20 System (454 Life Sciences Corporation, to rapidly and accurately sequence the whole plastid genomes of the basal eudicot angiosperms Nandina domestica (Berberidaceae and Platanus occidentalis (Platanaceae. Results More than 99.75% of each plastid genome was simultaneously obtained during two GS 20 sequence runs, to an average depth of coverage of 24.6× in Nandina and 17.3× in Platanus. The Nandina and Platanus plastid genomes shared essentially identical gene complements and possessed the typical angiosperm plastid structure and gene arrangement. To assess the accuracy of the GS 20 sequence, over 45 kilobases of sequence were generated for each genome using conventional sequencing. Overall error rates of 0.043% and 0.031% were observed in GS 20 sequence for Nandina and Platanus, respectively. More than 97% of all observed errors were associated with homopolymer runs, with ~60% of all errors associated with homopolymer runs of 5 or more nucleotides and ~50% of all errors associated with regions of extensive homopolymer runs. No substitution errors were present in either genome. Error rates were generally higher in the single-copy and noncoding regions of both plastid genomes relative to the inverted repeat and coding regions. Conclusion Highly accurate and essentially complete sequence information was obtained for the Nandina and Platanus plastid genomes using the GS 20 System. More importantly, the high accuracy
Mapping Phylogenetic Trees to Reveal Distinct Patterns of Evolution.

Science.gov (United States)

Kendall, Michelle; Colijn, Caroline

2016-10-01

Evolutionary relationships are frequently described by phylogenetic trees, but a central barrier in many fields is the difficulty of interpreting data containing conflicting phylogenetic signals. We present a metric-based method for comparing trees which extracts distinct alternative evolutionary relationships embedded in data. We demonstrate detection and resolution of phylogenetic uncertainty in a recent study of anole lizards, leading to alternate hypotheses about their evolutionary relationships. We use our approach to compare trees derived from different genes of Ebolavirus and find that the VP30 gene has a distinct phylogenetic signature composed of three alternatives that differ in the deep branching structure. phylogenetics, evolution, tree metrics, genetics, sequencing. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Molecular phylogenetics and species delimitation of leaf-toed geckos (Phyllodactylidae: Phyllodactylus) throughout the Mexican tropical dry forest.

Science.gov (United States)

Blair, Christopher; Méndez de la Cruz, Fausto R; Law, Christopher; Murphy, Robert W

2015-03-01

Methods and approaches for accurate species delimitation continue to be a highly controversial subject in the systematics community. Inaccurate assessment of species' limits precludes accurate inference of historical evolutionary processes. Recent evidence suggests that multilocus coalescent methods show promise in delimiting species in cryptic clades. We combine multilocus sequence data with coalescence-based phylogenetics in a hypothesis-testing framework to assess species limits and elucidate the timing of diversification in leaf-toed geckos (Phyllodactylus) of Mexico's dry forests. Tropical deciduous forests (TDF) of the Neotropics are among the planet's most diverse ecosystems. However, in comparison to moist tropical forests, little is known about the mode and tempo of biotic evolution throughout this threatened biome. We find increased speciation and substantial, cryptic molecular diversity originating following the formation of Mexican TDF 30-20million years ago due to orogenesis of the Sierra Madre Occidental and Mexican Volcanic Belt. Phylogenetic results suggest that the Mexican Volcanic Belt, the Rio Fuerte, and Isthmus of Tehuantepec may be important biogeographic barriers. Single- and multilocus coalescent analyses suggest that nearly every sampling locality may be a distinct species. These results suggest unprecedented levels of diversity, a complex evolutionary history, and that the formation and expansion of TDF vegetation in the Miocene may have influenced subsequent cladogenesis of leaf-toed geckos throughout western Mexico. Copyright © 2015 Elsevier Inc. All rights reserved.
Molecular Phylogenetics: Mathematical Framework and Unsolved Problems

Science.gov (United States)

Xia, Xuhua

Phylogenetic relationship is essential in dating evolutionary events, reconstructing ancestral genes, predicting sites that are important to natural selection, and, ultimately, understanding genomic evolution. Three categories of phylogenetic methods are currently used: the distance-based, the maximum parsimony, and the maximum likelihood method. Here, I present the mathematical framework of these methods and their rationales, provide computational details for each of them, illustrate analytically and numerically the potential biases inherent in these methods, and outline computational challenges and unresolved problems. This is followed by a brief discussion of the Bayesian approach that has been recently used in molecular phylogenetics.
Phylogenetic community structure: temporal variation in fish assemblage

OpenAIRE

Santorelli, Sergio; Magnusson, William; Ferreira, Efrem; Caramaschi, Erica; Zuanon, Jansen; Amadio, Sidnéia

2014-01-01

Hypotheses about phylogenetic relationships among species allow inferences about the mechanisms that affect species coexistence. Nevertheless, most studies assume that phylogenetic patterns identified are stable over time. We used data on monthly samples of fish from a single lake over 10 years to show that the structure in phylogenetic assemblages varies over time and conclusions depend heavily on the time scale investigated. The data set was organized in guild structures and temporal scales...
Anchoring quartet-based phylogenetic distances and applications to species tree reconstruction.

Science.gov (United States)

Sayyari, Erfan; Mirarab, Siavash

2016-11-11

Inferring species trees from gene trees using the coalescent-based summary methods has been the subject of much attention, yet new scalable and accurate methods are needed. We introduce DISTIQUE, a new statistically consistent summary method for inferring species trees from gene trees under the coalescent model. We generalize our results to arbitrary phylogenetic inference problems; we show that two arbitrarily chosen leaves, called anchors, can be used to estimate relative distances between all other pairs of leaves by inferring relevant quartet trees. This results in a family of distance-based tree inference methods, with running times ranging between quadratic to quartic in the number of leaves. We show in simulated studies that DISTIQUE has comparable accuracy to leading coalescent-based summary methods and reduced running times.
PCR detection of a Maell polymorphism in the human major breakpoint cluster region (BCR)

Energy Technology Data Exchange (ETDEWEB)

McClure, J.S.; Litz, C.E. (Medical School, Minneapolis, MN (United States))

1991-09-25

Nested primer pairs flanking the second intron of the breakpoint cluster region were constructed from the published cDNA sequence. The outer primer pair 5{prime}BCR Exon 2 (5{prime}-GTT TCA GAA GCT TCT CCC TG-3{prime}) and 3{prime}BCR Exon 3 (5{prime}-ACT CTG CTT AAA TCC AGT GG-3{prime}), amplified a fragment of genomic DNA approximately 810 bp in length. The inner primer pair, 3{prime}BCR Exon 2(5{prime}-CGC TGA CCA TCA ATA AGG AA-3{prime}) and 5{prime}BCR Exon 3 (5{prime}-AGA AAC CCA TAG AGC CCC GG-3{prime}), amplified a fragment approximately 730 bp in length. Double stranded DNA amplified with the outer primer pair was subjected to asymmetric PCR using the inner primer pair. Sequencing reactions were performed using the Sequenase dideoxy sequencing kit with S{sup 35}-dATP. Sequences in homozygotes revealed either an A or a G 85 bp 5{prime} of the BCR BamHI site. Heterozygotes demonstrated both bands at the corresponding position.
Virulence, serotype and phylogenetic groups of diarrhoeagenic ...

African Journals Online (AJOL)

Dr DADIE Thomas

2014-02-17

Feb 17, 2014 ... The virulence, serotype and phylogenetic traits of diarrhoeagenic Escherichia coli were detected in 502 strains isolated during digestive infections. Molecular detection of the target virulence genes, rfb gene of operon O and phylogenetic grouping genes Chua, yjaA and TSPE4.C2 was performed.
["Long-branch Attraction" artifact in phylogenetic reconstruction].

Science.gov (United States)

Li, Yi-Wei; Yu, Li; Zhang, Ya-Ping

2007-06-01

Phylogenetic reconstruction among various organisms not only helps understand their evolutionary history but also reveal several fundamental evolutionary questions. Understanding of the evolutionary relationships among organisms establishes the foundation for the investigations of other biological disciplines. However, almost all the widely used phylogenetic methods have limitations which fail to eliminate systematic errors effectively, preventing the reconstruction of true organismal relationships. "Long-branch Attraction" (LBA) artifact is one of the most disturbing factors in phylogenetic reconstruction. In this review, the conception and analytic method as well as the avoidance strategy of LBA were summarized. In addition, several typical examples were provided. The approach to avoid and resolve LBA artifact has been discussed.
Long-branch attraction bias and inconsistency in Bayesian phylogenetics.

Science.gov (United States)

Kolaczkowski, Bryan; Thornton, Joseph W

2009-12-09

Bayesian inference (BI) of phylogenetic relationships uses the same probabilistic models of evolution as its precursor maximum likelihood (ML), so BI has generally been assumed to share ML's desirable statistical properties, such as largely unbiased inference of topology given an accurate model and increasingly reliable inferences as the amount of data increases. Here we show that BI, unlike ML, is biased in favor of topologies that group long branches together, even when the true model and prior distributions of evolutionary parameters over a group of phylogenies are known. Using experimental simulation studies and numerical and mathematical analyses, we show that this bias becomes more severe as more data are analyzed, causing BI to infer an incorrect tree as the maximum a posteriori phylogeny with asymptotically high support as sequence length approaches infinity. BI's long branch attraction bias is relatively weak when the true model is simple but becomes pronounced when sequence sites evolve heterogeneously, even when this complexity is incorporated in the model. This bias--which is apparent under both controlled simulation conditions and in analyses of empirical sequence data--also makes BI less efficient and less robust to the use of an incorrect evolutionary model than ML. Surprisingly, BI's bias is caused by one of the method's stated advantages--that it incorporates uncertainty about branch lengths by integrating over a distribution of possible values instead of estimating them from the data, as ML does. Our findings suggest that trees inferred using BI should be interpreted with caution and that ML may be a more reliable framework for modern phylogenetic analysis.

Molecular characterization and phylogenetic analysis of Sugarcane yellow leaf virus isolates from China.

Science.gov (United States)

Gao, San-Ji; Lin, Yi-Hua; Pan, Yong-Bao; Damaj, Mona B; Wang, Qin-Nan; Mirkov, T Erik; Chen, Ru-Kai

2012-10-01

Sugarcane yellow leaf virus (SCYLV) (genus Polerovirus, family Luteoviridae), the causal agent of sugarcane yellow leaf disease (YLD), was first detected in China in 2006. To assess the distribution of SCYLV in the major sugarcane-growing Chinese provinces, leaf samples from 22 sugarcane clones (Saccharum spp. hybrid) showing YLD symptoms were collected and analyzed for infection by the virus using reverse transcription PCR (RT-PCR), quantitative RT-PCR, and immunological assays. A complete genomic sequence (5,879 nt) of the Chinese SCYLV isolate CHN-FJ1 and partial genomic sequences (2,915 nt) of 13 other Chinese SCYLV isolates from this study were amplified, cloned, and sequenced. The genomic sequence of the CHN-FJ1 isolate was found to share a high identity (98.4-99.1 %) with those of the Brazilian (BRA) genotype isolates and a low identity (86.5-86.9 %) with those of the CHN1 and Cuban (CUB) genotype isolates. The genetic diversity of these 14 Chinese SCYLV isolates was assessed along with that of 29 SCYLV isolates of worldwide origin reported in the GenBank database, based on the full or partial genomic sequence. Phylogenetic analysis demonstrated that all the 14 Chinese SCYLV isolates clustered into one large group with the BRA genotype and 12 other reported SCYLV isolates. In addition, five reported Chinese SCYLV isolates were grouped with the Peruvian (PER), CHN1 and CUB genotypes. We therefore speculated that at least four SCYLV genotypes, BRA, PER, CHN1, and CUB, are associated with YLD in China. Interestingly, a 39-nt deletion was detected in the sequence of the CHN-GD3 isolate, in the middle of the ORF1 region adjacent to the overlap between ORF1 and ORF2. This location is known to be one of the recombination breakpoints in the Luteoviridae family.
A program to compute the soft Robinson-Foulds distance between phylogenetic networks.

Science.gov (United States)

Lu, Bingxin; Zhang, Louxin; Leong, Hon Wai

2017-03-14

Over the past two decades, phylogenetic networks have been studied to model reticulate evolutionary events. The relationships among phylogenetic networks, phylogenetic trees and clusters serve as the basis for reconstruction and comparison of phylogenetic networks. To understand these relationships, two problems are raised: the tree containment problem, which asks whether a phylogenetic tree is displayed in a phylogenetic network, and the cluster containment problem, which asks whether a cluster is represented at a node in a phylogenetic network. Both the problems are NP-complete. A fast exponential-time algorithm for the cluster containment problem on arbitrary networks is developed and implemented in C. The resulting program is further extended into a computer program for fast computation of the Soft Robinson-Foulds distance between phylogenetic networks. Two computer programs are developed for facilitating reconstruction and validation of phylogenetic network models in evolutionary and comparative genomics. Our simulation tests indicated that they are fast enough for use in practice. Additionally, the distribution of the Soft Robinson-Foulds distance between phylogenetic networks is demonstrated to be unlikely normal by our simulation data.
Molecular Phylogenetic: Organism Taxonomy Method Based on Evolution History

Directory of Open Access Journals (Sweden)

N.L.P Indi Dharmayanti

2011-03-01

Full Text Available Phylogenetic is described as taxonomy classification of an organism based on its evolution history namely its phylogeny and as a part of systematic science that has objective to determine phylogeny of organism according to its characteristic. Phylogenetic analysis from amino acid and protein usually became important area in sequence analysis. Phylogenetic analysis can be used to follow the rapid change of a species such as virus. The phylogenetic evolution tree is a two dimensional of a species graphic that shows relationship among organisms or particularly among their gene sequences. The sequence separation are referred as taxa (singular taxon that is defined as phylogenetically distinct units on the tree. The tree consists of outer branches or leaves that represents taxa and nodes and branch represent correlation among taxa. When the nucleotide sequence from two different organism are similar, they were inferred to be descended from common ancestor. There were three methods which were used in phylogenetic, namely (1 Maximum parsimony, (2 Distance, and (3 Maximum likehoood. Those methods generally are applied to construct the evolutionary tree or the best tree for determine sequence variation in group. Every method is usually used for different analysis and data.
Heterotachy and long-branch attraction in phylogenetics

Directory of Open Access Journals (Sweden)

Rodrigue Nicolas

2005-10-01

Full Text Available Abstract Background Probabilistic methods have progressively supplanted the Maximum Parsimony (MP method for inferring phylogenetic trees. One of the major reasons for this shift was that MP is much more sensitive to the Long Branch Attraction (LBA artefact than is Maximum Likelihood (ML. However, recent work by Kolaczkowski and Thornton suggested, on the basis of simulations, that MP is less sensitive than ML to tree reconstruction artefacts generated by heterotachy, a phenomenon that corresponds to shifts in site-specific evolutionary rates over time. These results led these authors to recommend that the results of ML and MP analyses should be both reported and interpreted with the same caution. This specific conclusion revived the debate on the choice of the most accurate phylogenetic method for analysing real data in which various types of heterogeneities occur. However, variation of evolutionary rates across species was not explicitly incorporated in the original study of Kolaczkowski and Thornton, and in most of the subsequent heterotachous simulations published to date, where all terminal branch lengths were kept equal, an assumption that is biologically unrealistic. Results In this report, we performed more realistic simulations to evaluate the relative performance of MP and ML methods when two kinds of heterogeneities are considered: (i within-site rate variation (heterotachy, and (ii rate variation across lineages. Using a similar protocol as Kolaczkowski and Thornton to generate heterotachous datasets, we found that heterotachy, which constitutes a serious violation of existing models, decreases the accuracy of ML whatever the level of rate variation across lineages. In contrast, the accuracy of MP can either increase or decrease when the level of heterotachy increases, depending on the relative branch lengths. This result demonstrates that MP is not insensitive to heterotachy, contrary to the report of Kolaczkowski and Thornton
The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses

Science.gov (United States)

Zhang, Yanjun; Du, Liuwen; Liu, Ao; Chen, Jianjun; Wu, Li; Hu, Weiming; Zhang, Wei; Kim, Kyunghee; Lee, Sang-Choon; Yang, Tae-Jin; Wang, Ying

2016-01-01

Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, as well as the difference of simple sequence repeats (SSR) and repeat sequence were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes showed accordance with the updated system of the genus on the whole, but reminded that the evolutionary relationships and the divisions of the genus need further investigation applying more evidences. The availability of these cp genomes provided valuable genetic information for accurately identifying species, taxonomy and phylogenetic resolution and evolution of Epimedium, and assist in exploration and utilization of Epimedium plants. PMID:27014326
The complete chloroplast genome sequences of five Epimedium species: lights into phylogenetic and taxonomic analyses

Directory of Open Access Journals (Sweden)

Yanjun eZhang

2016-03-01

Full Text Available Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR region and the single-copy (SC boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, as well as the difference of simple sequence repeats (SSR and repeat sequence were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes showed accordance with the updated system of the genus on the whole, but reminded that the evolutionary relationships and the divisions of the genus need further investigation applying more evidences. The availability of these cp genomes provided valuable genetic information for accurately identifying species, taxonomy and phylogenetic resolution and evolution of Epimedium, and assist in exploration and utilization of Epimedium plants.
Phylogenetic signal from rearrangements in 18 Anopheles species by joint scaffolding extant and ancestral genomes.

Science.gov (United States)

Anselmetti, Yoann; Duchemin, Wandrille; Tannier, Eric; Chauve, Cedric; Bérard, Sèverine

2018-05-09

Genomes rearrangements carry valuable information for phylogenetic inference or the elucidation of molecular mechanisms of adaptation. However, the detection of genome rearrangements is often hampered by current deficiencies in data and methods: Genomes obtained from short sequence reads have generally very fragmented assemblies, and comparing multiple gene orders generally leads to computationally intractable algorithmic questions. We present a computational method, ADSEQ, which, by combining ancestral gene order reconstruction, comparative scaffolding and de novo scaffolding methods, overcomes these two caveats. ADSEQ provides simultaneously improved assemblies and ancestral genomes, with statistical supports on all local features. Compared to previous comparative methods, it runs in polynomial time, it samples solutions in a probabilistic space, and it can handle a significantly larger gene complement from the considered extant genomes, with complex histories including gene duplications and losses. We use ADSEQ to provide improved assemblies and a genome history made of duplications, losses, gene translocations, rearrangements, of 18 complete Anopheles genomes, including several important malaria vectors. We also provide additional support for a differentiated mode of evolution of the sex chromosome and of the autosomes in these mosquito genomes. We demonstrate the method's ability to improve extant assemblies accurately through a procedure simulating realistic assembly fragmentation. We study a debated issue regarding the phylogeny of the Gambiae complex group of Anopheles genomes in the light of the evolution of chromosomal rearrangements, suggesting that the phylogenetic signal they carry can differ from the phylogenetic signal carried by gene sequences, more prone to introgression.
YBYRÁ facilitates comparison of large phylogenetic trees.

Science.gov (United States)

Machado, Denis Jacob

2015-07-01

The number and size of tree topologies that are being compared by phylogenetic systematists is increasing due to technological advancements in high-throughput DNA sequencing. However, we still lack tools to facilitate comparison among phylogenetic trees with a large number of terminals. The "YBYRÁ" project integrates software solutions for data analysis in phylogenetics. It comprises tools for (1) topological distance calculation based on the number of shared splits or clades, (2) sensitivity analysis and automatic generation of sensitivity plots and (3) clade diagnoses based on different categories of synapomorphies. YBYRÁ also provides (4) an original framework to facilitate the search for potential rogue taxa based on how much they affect average matching split distances (using MSdist). YBYRÁ facilitates comparison of large phylogenetic trees and outperforms competing software in terms of usability and time efficiency, specially for large data sets. The programs that comprises this toolkit are written in Python, hence they do not require installation and have minimum dependencies. The entire project is available under an open-source licence at http://www.ib.usp.br/grant/anfibios/researchSoftware.html .
Disentangling the phylogenetic and ecological components of spider phenotypic variation.

Science.gov (United States)

Gonçalves-Souza, Thiago; Diniz-Filho, José Alexandre Felizola; Romero, Gustavo Quevedo

2014-01-01

An understanding of how the degree of phylogenetic relatedness influences the ecological similarity among species is crucial to inferring the mechanisms governing the assembly of communities. We evaluated the relative importance of spider phylogenetic relationships and ecological niche (plant morphological variables) to the variation in spider body size and shape by comparing spiders at different scales: (i) between bromeliads and dicot plants (i.e., habitat scale) and (ii) among bromeliads with distinct architectural features (i.e., microhabitat scale). We partitioned the interspecific variation in body size and shape into phylogenetic (that express trait values as expected by phylogenetic relationships among species) and ecological components (that express trait values independent of phylogenetic relationships). At the habitat scale, bromeliad spiders were larger and flatter than spiders associated with the surrounding dicots. At this scale, plant morphology sorted out close related spiders. Our results showed that spider flatness is phylogenetically clustered at the habitat scale, whereas it is phylogenetically overdispersed at the microhabitat scale, although phylogenic signal is present in both scales. Taken together, these results suggest that whereas at the habitat scale selective colonization affect spider body size and shape, at fine scales both selective colonization and adaptive evolution determine spider body shape. By partitioning the phylogenetic and ecological components of phenotypic variation, we were able to disentangle the evolutionary history of distinct spider traits and show that plant architecture plays a role in the evolution of spider body size and shape. We also discussed the relevance in considering multiple scales when studying phylogenetic community structure.
Phylogenetic system and zoogeography of the Plecoptera.

Science.gov (United States)

Zwick, P

2000-01-01

Information about the phylogenetic relationships of Plecoptera is summarized. The few characters supporting monophyly of the order are outlined. Several characters of possible significance for the search for the closest relatives of the stoneflies are discussed, but the sister-group of the order remains unknown. Numerous characters supporting the presently recognized phylogenetic system of Plecoptera are presented, alternative classifications are discussed, and suggestions for future studies are made. Notes on zoogeography are appended. The order as such is old (Permian fossils), but phylogenetic relationships and global distribution patterns suggest that evolution of the extant suborders started with the breakup of Pangaea. There is evidence of extensive recent speciation in all parts of the world.
A response to Yu et al. "A forward-backward fragment assembling algorithm for the identification of genomic amplification and deletion breakpoints using high-density single nucleotide polymorphism (SNP) array", BMC Bioinformatics 2007, 8: 145.

Science.gov (United States)

Rueda, Oscar M; Diaz-Uriarte, Ramon

2007-10-16

Yu et al. (BMC Bioinformatics 2007,8: 145+) have recently compared the performance of several methods for the detection of genomic amplification and deletion breakpoints using data from high-density single nucleotide polymorphism arrays. One of the methods compared is our non-homogenous Hidden Markov Model approach. Our approach uses Markov Chain Monte Carlo for inference, but Yu et al. ran the sampler for a severely insufficient number of iterations for a Markov Chain Monte Carlo-based method. Moreover, they did not use the appropriate reference level for the non-altered state. We rerun the analysis in Yu et al. using appropriate settings for both the Markov Chain Monte Carlo iterations and the reference level. Additionally, to show how easy it is to obtain answers to additional specific questions, we have added a new analysis targeted specifically to the detection of breakpoints. The reanalysis shows that the performance of our method is comparable to that of the other methods analyzed. In addition, we can provide probabilities of a given spot being a breakpoint, something unique among the methods examined. Markov Chain Monte Carlo methods require using a sufficient number of iterations before they can be assumed to yield samples from the distribution of interest. Running our method with too small a number of iterations cannot be representative of its performance. Moreover, our analysis shows how our original approach can be easily adapted to answer specific additional questions (e.g., identify edges).
Cophenetic metrics for phylogenetic trees, after Sokal and Rohlf.

Science.gov (United States)

Cardona, Gabriel; Mir, Arnau; Rosselló, Francesc; Rotger, Lucía; Sánchez, David

2013-01-16

Phylogenetic tree comparison metrics are an important tool in the study of evolution, and hence the definition of such metrics is an interesting problem in phylogenetics. In a paper in Taxon fifty years ago, Sokal and Rohlf proposed to measure quantitatively the difference between a pair of phylogenetic trees by first encoding them by means of their half-matrices of cophenetic values, and then comparing these matrices. This idea has been used several times since then to define dissimilarity measures between phylogenetic trees but, to our knowledge, no proper metric on weighted phylogenetic trees with nested taxa based on this idea has been formally defined and studied yet. Actually, the cophenetic values of pairs of different taxa alone are not enough to single out phylogenetic trees with weighted arcs or nested taxa. For every (rooted) phylogenetic tree T, let its cophenetic vectorφ(T) consist of all pairs of cophenetic values between pairs of taxa in T and all depths of taxa in T. It turns out that these cophenetic vectors single out weighted phylogenetic trees with nested taxa. We then define a family of cophenetic metrics dφ,p by comparing these cophenetic vectors by means of Lp norms, and we study, either analytically or numerically, some of their basic properties: neighbors, diameter, distribution, and their rank correlation with each other and with other metrics. The cophenetic metrics can be safely used on weighted phylogenetic trees with nested taxa and no restriction on degrees, and they can be computed in O(n2) time, where n stands for the number of taxa. The metrics dφ,1 and dφ,2 have positive skewed distributions, and they show a low rank correlation with the Robinson-Foulds metric and the nodal metrics, and a very high correlation with each other and with the splitted nodal metrics. The diameter of dφ,p, for p⩾1 , is in O(n(p+2)/p), and thus for low p they are more discriminative, having a wider range of values.
The best of both worlds: Phylogenetic eigenvector regression and mapping

Directory of Open Access Journals (Sweden)

José Alexandre Felizola Diniz Filho

2015-09-01

Full Text Available Eigenfunction analyses have been widely used to model patterns of autocorrelation in time, space and phylogeny. In a phylogenetic context, Diniz-Filho et al. (1998 proposed what they called Phylogenetic Eigenvector Regression (PVR, in which pairwise phylogenetic distances among species are submitted to a Principal Coordinate Analysis, and eigenvectors are then used as explanatory variables in regression, correlation or ANOVAs. More recently, a new approach called Phylogenetic Eigenvector Mapping (PEM was proposed, with the main advantage of explicitly incorporating a model-based warping in phylogenetic distance in which an Ornstein-Uhlenbeck (O-U process is fitted to data before eigenvector extraction. Here we compared PVR and PEM in respect to estimated phylogenetic signal, correlated evolution under alternative evolutionary models and phylogenetic imputation, using simulated data. Despite similarity between the two approaches, PEM has a slightly higher prediction ability and is more general than the original PVR. Even so, in a conceptual sense, PEM may provide a technique in the best of both worlds, combining the flexibility of data-driven and empirical eigenfunction analyses and the sounding insights provided by evolutionary models well known in comparative analyses.
Phylogenetic Analyses of Armillaria Reveal at Least 15 Phylogenetic Lineages in China, Seven of Which Are Associated with Cultivated Gastrodia elata.

Directory of Open Access Journals (Sweden)

Ting Guo

Full Text Available Fungal species of Armillaria, which can act as plant pathogens and/or symbionts of the Chinese traditional medicinal herb Gastrodia elata ("Tianma", are ecologically and economically important and have consequently attracted the attention of mycologists. However, their taxonomy has been highly dependent on morphological characterization and mating tests. In this study, we phylogenetically analyzed Chinese Armillaria samples using the sequences of the internal transcribed spacer region, translation elongation factor-1 alpha gene and beta-tubulin gene. Our data revealed at least 15 phylogenetic lineages of Armillaria from China, of which seven were newly discovered and two were recorded from China for the first time. Fourteen Chinese biological species of Armillaria, which were previously defined based on mating tests, could be assigned to the 15 phylogenetic lineages identified herein. Seven of the 15 phylogenetic lineages were found to be disjunctively distributed in different continents of the Northern Hemisphere, while eight were revealed to be endemic to certain continents. In addition, we found that seven phylogenetic lineages of Armillaria were used for the cultivation of Tianma, only two of which had been recorded to be associated with Tianma previously. We also illustrated that G. elata f. glauca ("Brown Tianma" and G. elata f. elata ("Red Tianma", two cultivars of Tianma grown in different regions of China, form symbiotic relationships with different phylogenetic lineages of Armillaria. These findings should aid the development of Tianma cultivation in China.
Phylogenetic relationships within and among Brassica species from ...

African Journals Online (AJOL)

Consequently, two potentially susceptible B. napus accessions were identified. The high polymorphic information content (PIC) and number of phylogenetically informative bands established RAPD as a useful tool for phylogenetic reconstruction, quantification of genetic diversity for conservation, cultivar classification and ...
Applying species-tree analyses to deep phylogenetic histories: challenges and potential suggested from a survey of empirical phylogenetic studies.

Science.gov (United States)

Lanier, Hayley C; Knowles, L Lacey

2015-02-01

Coalescent-based methods for species-tree estimation are becoming a dominant approach for reconstructing species histories from multi-locus data, with most of the studies examining these methodologies focused on recently diverged species. However, deeper phylogenies, such as the datasets that comprise many Tree of Life (ToL) studies, also exhibit gene-tree discordance. This discord may also arise from the stochastic sorting of gene lineages during the speciation process (i.e., reflecting the random coalescence of gene lineages in ancestral populations). It remains unknown whether guidelines regarding methodologies and numbers of loci established by simulation studies at shallow tree depths translate into accurate species relationships for deeper phylogenetic histories. We address this knowledge gap and specifically identify the challenges and limitations of species-tree methods that account for coalescent variance for deeper phylogenies. Using simulated data with characteristics informed by empirical studies, we evaluate both the accuracy of estimated species trees and the characteristics associated with recalcitrant nodes, with a specific focus on whether coalescent variance is generally responsible for the lack of resolution. By determining the proportion of coalescent genealogies that support a particular node, we demonstrate that (1) species-tree methods account for coalescent variance at deep nodes and (2) mutational variance - not gene-tree discord arising from the coalescent - posed the primary challenge for accurate reconstruction across the tree. For example, many nodes were accurately resolved despite predicted discord from the random coalescence of gene lineages and nodes with poor support were distributed across a range of depths (i.e., they were not restricted to a particular recent divergences). Given their broad taxonomic scope and large sampling of taxa, deep level phylogenies pose several potential methodological complications including
Visualising very large phylogenetic trees in three dimensional hyperbolic space

Directory of Open Access Journals (Sweden)

Liberles David A

2004-04-01

Full Text Available Abstract Background Common existing phylogenetic tree visualisation tools are not able to display readable trees with more than a few thousand nodes. These existing methodologies are based in two dimensional space. Results We introduce the idea of visualising phylogenetic trees in three dimensional hyperbolic space with the Walrus graph visualisation tool and have developed a conversion tool that enables the conversion of standard phylogenetic tree formats to Walrus' format. With Walrus, it becomes possible to visualise and navigate phylogenetic trees with more than 100,000 nodes. Conclusion Walrus enables desktop visualisation of very large phylogenetic trees in 3 dimensional hyperbolic space. This application is potentially useful for visualisation of the tree of life and for functional genomics derivatives, like The Adaptive Evolution Database (TAED.
Climate-driven extinctions shape the phylogenetic structure of temperate tree floras.

Science.gov (United States)

Eiserhardt, Wolf L; Borchsenius, Finn; Plum, Christoffer M; Ordonez, Alejandro; Svenning, Jens-Christian

2015-03-01

When taxa go extinct, unique evolutionary history is lost. If extinction is selective, and the intrinsic vulnerabilities of taxa show phylogenetic signal, more evolutionary history may be lost than expected under random extinction. Under what conditions this occurs is insufficiently known. We show that late Cenozoic climate change induced phylogenetically selective regional extinction of northern temperate trees because of phylogenetic signal in cold tolerance, leading to significantly and substantially larger than random losses of phylogenetic diversity (PD). The surviving floras in regions that experienced stronger extinction are phylogenetically more clustered, indicating that non-random losses of PD are of increasing concern with increasing extinction severity. Using simulations, we show that a simple threshold model of survival given a physiological trait with phylogenetic signal reproduces our findings. Our results send a strong warning that we may expect future assemblages to be phylogenetically and possibly functionally depauperate if anthropogenic climate change affects taxa similarly. © 2015 John Wiley & Sons Ltd/CNRS.
Increased phylogenetic resolution using target enrichment in Rubus

Science.gov (United States)

Phylogenetic analyses in Rubus L. have been challenging due to polyploidy, hybridization, and apomixis within the genus. Wide morphological diversity occurs within and between species, contributing to challenges at lower and higher systematic levels. Phylogenetic inferences to date have been based o...
Phylogenetic relationships of African sunbird-like warblers: Moho ...

African Journals Online (AJOL)

Phylogenetic relationships of African sunbird-like warblers: Moho ( Hypergerus atriceps ), Green Hylia ( Hylia prasina ) and Tit-hylia ( Pholidornis rushiae ) ... different points in avian evolution reduces the phylogenetic signal in molecular sequence data, making difficult the reconstruction of relationships among taxa resulting ...

Interpreting the universal phylogenetic tree

Science.gov (United States)

Woese, C. R.

2000-01-01

The universal phylogenetic tree not only spans all extant life, but its root and earliest branchings represent stages in the evolutionary process before modern cell types had come into being. The evolution of the cell is an interplay between vertically derived and horizontally acquired variation. Primitive cellular entities were necessarily simpler and more modular in design than are modern cells. Consequently, horizontal gene transfer early on was pervasive, dominating the evolutionary dynamic. The root of the universal phylogenetic tree represents the first stage in cellular evolution when the evolving cell became sufficiently integrated and stable to the erosive effects of horizontal gene transfer that true organismal lineages could exist.
The performance of the Congruence Among Distance Matrices (CADM) test in phylogenetic analysis

Science.gov (United States)

2011-01-01

Background CADM is a statistical test used to estimate the level of Congruence Among Distance Matrices. It has been shown in previous studies to have a correct rate of type I error and good power when applied to dissimilarity matrices and to ultrametric distance matrices. Contrary to most other tests of incongruence used in phylogenetic analysis, the null hypothesis of the CADM test assumes complete incongruence of the phylogenetic trees instead of congruence. In this study, we performed computer simulations to assess the type I error rate and power of the test. It was applied to additive distance matrices representing phylogenies and to genetic distance matrices obtained from nucleotide sequences of different lengths that were simulated on randomly generated trees of varying sizes, and under different evolutionary conditions. Results Our results showed that the test has an accurate type I error rate and good power. As expected, power increased with the number of objects (i.e., taxa), the number of partially or completely congruent matrices and the level of congruence among distance matrices. Conclusions Based on our results, we suggest that CADM is an excellent candidate to test for congruence and, when present, to estimate its level in phylogenomic studies where numerous genes are analysed simultaneously. PMID:21388552
The performance of the Congruence Among Distance Matrices (CADM test in phylogenetic analysis

Directory of Open Access Journals (Sweden)

Lapointe François-Joseph

2011-03-01

Full Text Available Abstract Background CADM is a statistical test used to estimate the level of Congruence Among Distance Matrices. It has been shown in previous studies to have a correct rate of type I error and good power when applied to dissimilarity matrices and to ultrametric distance matrices. Contrary to most other tests of incongruence used in phylogenetic analysis, the null hypothesis of the CADM test assumes complete incongruence of the phylogenetic trees instead of congruence. In this study, we performed computer simulations to assess the type I error rate and power of the test. It was applied to additive distance matrices representing phylogenies and to genetic distance matrices obtained from nucleotide sequences of different lengths that were simulated on randomly generated trees of varying sizes, and under different evolutionary conditions. Results Our results showed that the test has an accurate type I error rate and good power. As expected, power increased with the number of objects (i.e., taxa, the number of partially or completely congruent matrices and the level of congruence among distance matrices. Conclusions Based on our results, we suggest that CADM is an excellent candidate to test for congruence and, when present, to estimate its level in phylogenomic studies where numerous genes are analysed simultaneously.
Utilization of complete chloroplast genomes for phylogenetic studies

NARCIS (Netherlands)

Ramlee, Shairul Izan Binti

2016-01-01

Chloroplast DNA sequence polymorphisms are a primary source of data in many plant phylogenetic studies. The chloroplast genome is relatively conserved in its evolution making it an ideal molecule to retain phylogenetic signals. The chloroplast genome is also largely, but not completely, free from
Estimating phylogenetic trees from genome-scale data.

Science.gov (United States)

Liu, Liang; Xi, Zhenxiang; Wu, Shaoyuan; Davis, Charles C; Edwards, Scott V

2015-12-01

The heterogeneity of signals in the genomes of diverse organisms poses challenges for traditional phylogenetic analysis. Phylogenetic methods known as "species tree" methods have been proposed to directly address one important source of gene tree heterogeneity, namely the incomplete lineage sorting that occurs when evolving lineages radiate rapidly, resulting in a diversity of gene trees from a single underlying species tree. Here we review theory and empirical examples that help clarify conflicts between species tree and concatenation methods, and misconceptions in the literature about the performance of species tree methods. Considering concatenation as a special case of the multispecies coalescent model helps explain differences in the behavior of the two methods on phylogenomic data sets. Recent work suggests that species tree methods are more robust than concatenation approaches to some of the classic challenges of phylogenetic analysis, including rapidly evolving sites in DNA sequences and long-branch attraction. We show that approaches, such as binning, designed to augment the signal in species tree analyses can distort the distribution of gene trees and are inconsistent. Computationally efficient species tree methods incorporating biological realism are a key to phylogenetic analysis of whole-genome data. © 2015 New York Academy of Sciences.
New weighting methods for phylogenetic tree reconstruction using multiple loci.

Science.gov (United States)

Misawa, Kazuharu; Tajima, Fumio

2012-08-01

Efficient determination of evolutionary distances is important for the correct reconstruction of phylogenetic trees. The performance of the pooled distance required for reconstructing a phylogenetic tree can be improved by applying large weights to appropriate distances for reconstructing phylogenetic trees and small weights to inappropriate distances. We developed two weighting methods, the modified Tajima-Takezaki method and the modified least-squares method, for reconstructing phylogenetic trees from multiple loci. By computer simulations, we found that both of the new methods were more efficient in reconstructing correct topologies than the no-weight method. Hence, we reconstructed hominoid phylogenetic trees from mitochondrial DNA using our new methods, and found that the levels of bootstrap support were significantly increased by the modified Tajima-Takezaki and by the modified least-squares method.
On the information content of discrete phylogenetic characters.

Science.gov (United States)

Bordewich, Magnus; Deutschmann, Ina Maria; Fischer, Mareike; Kasbohm, Elisa; Semple, Charles; Steel, Mike

2017-12-16

Phylogenetic inference aims to reconstruct the evolutionary relationships of different species based on genetic (or other) data. Discrete characters are a particular type of data, which contain information on how the species should be grouped together. However, it has long been known that some characters contain more information than others. For instance, a character that assigns the same state to each species groups all of them together and so provides no insight into the relationships of the species considered. At the other extreme, a character that assigns a different state to each species also conveys no phylogenetic signal. In this manuscript, we study a natural combinatorial measure of the information content of an individual character and analyse properties of characters that provide the maximum phylogenetic information, particularly, the number of states such a character uses and how the different states have to be distributed among the species or taxa of the phylogenetic tree.
DendroPy: a Python library for phylogenetic computing.

Science.gov (United States)

Sukumaran, Jeet; Holder, Mark T

2010-06-15

DendroPy is a cross-platform library for the Python programming language that provides for object-oriented reading, writing, simulation and manipulation of phylogenetic data, with an emphasis on phylogenetic tree operations. DendroPy uses a splits-hash mapping to perform rapid calculations of tree distances, similarities and shape under various metrics. It contains rich simulation routines to generate trees under a number of different phylogenetic and coalescent models. DendroPy's data simulation and manipulation facilities, in conjunction with its support of a broad range of phylogenetic data formats (NEXUS, Newick, PHYLIP, FASTA, NeXML, etc.), allow it to serve a useful role in various phyloinformatics and phylogeographic pipelines. The stable release of the library is available for download and automated installation through the Python Package Index site (http://pypi.python.org/pypi/DendroPy), while the active development source code repository is available to the public from GitHub (http://github.com/jeetsukumaran/DendroPy).
The complete chloroplast genome sequence of Ampelopsis: gene organization, comparative analysis and phylogenetic relationships to other angiosperms

Directory of Open Access Journals (Sweden)

Gurusamy eRaman

2016-03-01

Full Text Available Ampelopsis brevipedunculata is an economically important plant that belongs to the Vitaceae family of angiosperms. The phylogenetic placement of Vitaceae is still unresolved. Recent phylogenetic studies suggested that it should be placed in various alternative families including Caryophyllaceae, asteraceae, Saxifragaceae, Dilleniaceae, or with the rest of the rosid families. However, these analyses provided weak supportive results because they were based on only one of several genes. Accordingly, complete chloroplast genome sequences are required to resolve the phylogenetic relationships among angiosperms. Recent phylogenetic analyses based on the complete chloroplast genome sequence suggested strong support for the position of Vitaceae as the earliest diverging lineage of rosids and placed it as a sister to the remaining rosids. These studies also revealed relationships among several major lineages of angiosperms; however, they highlighted the significance of taxon sampling for obtaining accurate phylogenies. In the present study, we sequenced the complete chloroplast genome of A. brevipedunculata and used these data to assess the relationships among 32 angiosperms, including 18 taxa of rosids. The Ampelopsis chloroplast genome is 161,090 bp in length, and includes a pair of inverted repeats of 26,394 bp that are separated by small and large single copy regions of 19,036 bp and 89,266 bp, respectively. The gene content and order of Ampelopsis is identical to many other unrearranged angiosperm chloroplast genomes, including Vitis and tobacco. A phylogenetic tree constructed based on 70 protein-coding genes of 33 angiosperms showed that both Saxifragales and Vitaceae diverged from the rosid clade and formed two clades with 100% bootstrap value. The position of the Vitaceae is sister to Saxifragales, and both are the basal and earliest diverging lineages. Moreover, Saxifragales forms a sister clade to Vitaceae of rosids. Overall, the results of
Long-branch attraction bias and inconsistency in Bayesian phylogenetics.

Directory of Open Access Journals (Sweden)

Bryan Kolaczkowski

Full Text Available Bayesian inference (BI of phylogenetic relationships uses the same probabilistic models of evolution as its precursor maximum likelihood (ML, so BI has generally been assumed to share ML's desirable statistical properties, such as largely unbiased inference of topology given an accurate model and increasingly reliable inferences as the amount of data increases. Here we show that BI, unlike ML, is biased in favor of topologies that group long branches together, even when the true model and prior distributions of evolutionary parameters over a group of phylogenies are known. Using experimental simulation studies and numerical and mathematical analyses, we show that this bias becomes more severe as more data are analyzed, causing BI to infer an incorrect tree as the maximum a posteriori phylogeny with asymptotically high support as sequence length approaches infinity. BI's long branch attraction bias is relatively weak when the true model is simple but becomes pronounced when sequence sites evolve heterogeneously, even when this complexity is incorporated in the model. This bias--which is apparent under both controlled simulation conditions and in analyses of empirical sequence data--also makes BI less efficient and less robust to the use of an incorrect evolutionary model than ML. Surprisingly, BI's bias is caused by one of the method's stated advantages--that it incorporates uncertainty about branch lengths by integrating over a distribution of possible values instead of estimating them from the data, as ML does. Our findings suggest that trees inferred using BI should be interpreted with caution and that ML may be a more reliable framework for modern phylogenetic analysis.
Consequences of recombination on traditional phylogenetic analysis

DEFF Research Database (Denmark)

Schierup, M H; Hein, J

2000-01-01

We investigate the shape of a phylogenetic tree reconstructed from sequences evolving under the coalescent with recombination. The motivation is that evolutionary inferences are often made from phylogenetic trees reconstructed from population data even though recombination may well occur (mt......DNA or viral sequences) or does occur (nuclear sequences). We investigate the size and direction of biases when a single tree is reconstructed ignoring recombination. Standard software (PHYLIP) was used to construct the best phylogenetic tree from sequences simulated under the coalescent with recombination....... With recombination present, the length of terminal branches and the total branch length are larger, and the time to the most recent common ancestor smaller, than for a tree reconstructed from sequences evolving with no recombination. The effects are pronounced even for small levels of recombination that may...
Phylogenetic relationships of the lancelets of the genus ...

African Journals Online (AJOL)

phylogenetic relationships of the Branchiostoma lancelets from South (Xiamen) and North (Qingdao and Rizhao) China, and phylogenetic trees constructed also included the existing data from Japanese waters. The genetic distances of the lancelets between South and North China averaged 0.19, 0.21, and 0.17 based on ...
Estimation of rates-across-sites distributions in phylogenetic substitution models.

Science.gov (United States)

Susko, Edward; Field, Chris; Blouin, Christian; Roger, Andrew J

2003-10-01

Previous work has shown that it is often essential to account for the variation in rates at different sites in phylogenetic models in order to avoid phylogenetic artifacts such as long branch attraction. In most current models, the gamma distribution is used for the rates-across-sites distributions and is implemented as an equal-probability discrete gamma. In this article, we introduce discrete distribution estimates with large numbers of equally spaced rate categories allowing us to investigate the appropriateness of the gamma model. With large numbers of rate categories, these discrete estimates are flexible enough to approximate the shape of almost any distribution. Likelihood ratio statistical tests and a nonparametric bootstrap confidence-bound estimation procedure based on the discrete estimates are presented that can be used to test the fit of a parametric family. We applied the methodology to several different protein data sets, and found that although the gamma model often provides a good parametric model for this type of data, rate estimates from an equal-probability discrete gamma model with a small number of categories will tend to underestimate the largest rates. In cases when the gamma model assumption is in doubt, rate estimates coming from the discrete rate distribution estimate with a large number of rate categories provide a robust alternative to gamma estimates. An alternative implementation of the gamma distribution is proposed that, for equal numbers of rate categories, is computationally more efficient during optimization than the standard gamma implementation and can provide more accurate estimates of site rates.
A phylogenetic Kalman filter for ancestral trait reconstruction using molecular data.

Science.gov (United States)

Lartillot, Nicolas

2014-02-15

Correlation between life history or ecological traits and genomic features such as nucleotide or amino acid composition can be used for reconstructing the evolutionary history of the traits of interest along phylogenies. Thus far, however, such ancestral reconstructions have been done using simple linear regression approaches that do not account for phylogenetic inertia. These reconstructions could instead be seen as a genuine comparative regression problem, such as formalized by classical generalized least-square comparative methods, in which the trait of interest and the molecular predictor are represented as correlated Brownian characters coevolving along the phylogeny. Here, a Bayesian sampler is introduced, representing an alternative and more efficient algorithmic solution to this comparative regression problem, compared with currently existing generalized least-square approaches. Technically, ancestral trait reconstruction based on a molecular predictor is shown to be formally equivalent to a phylogenetic Kalman filter problem, for which backward and forward recursions are developed and implemented in the context of a Markov chain Monte Carlo sampler. The comparative regression method results in more accurate reconstructions and a more faithful representation of uncertainty, compared with simple linear regression. Application to the reconstruction of the evolution of optimal growth temperature in Archaea, using GC composition in ribosomal RNA stems and amino acid composition of a sample of protein-coding genes, confirms previous findings, in particular, pointing to a hyperthermophilic ancestor for the kingdom. The program is freely available at www.phylobayes.org.
Use of Whole-Genus Genome Sequence Data To Develop a Multilocus Sequence Typing Tool That Accurately Identifies Yersinia Isolates to the Species and Subspecies Levels

Science.gov (United States)

Hall, Miquette; Chattaway, Marie A.; Reuter, Sandra; Savin, Cyril; Strauch, Eckhard; Carniel, Elisabeth; Connor, Thomas; Van Damme, Inge; Rajakaruna, Lakshani; Rajendram, Dunstan; Jenkins, Claire; Thomson, Nicholas R.

2014-01-01

The genus Yersinia is a large and diverse bacterial genus consisting of human-pathogenic species, a fish-pathogenic species, and a large number of environmental species. Recently, the phylogenetic and population structure of the entire genus was elucidated through the genome sequence data of 241 strains encompassing every known species in the genus. Here we report the mining of this enormous data set to create a multilocus sequence typing-based scheme that can identify Yersinia strains to the species level to a level of resolution equal to that for whole-genome sequencing. Our assay is designed to be able to accurately subtype the important human-pathogenic species Yersinia enterocolitica to whole-genome resolution levels. We also report the validation of the scheme on 386 strains from reference laboratory collections across Europe. We propose that the scheme is an important molecular typing system to allow accurate and reproducible identification of Yersinia isolates to the species level, a process often inconsistent in nonspecialist laboratories. Additionally, our assay is the most phylogenetically informative typing scheme available for Y. enterocolitica. PMID:25339391
Enumerating all maximal frequent subtrees in collections of phylogenetic trees.

Science.gov (United States)

Deepak, Akshay; Fernández-Baca, David

2014-01-01

A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees.
Comparative phylogenetic analysis of intergenic spacers and small ...

African Journals Online (AJOL)

The phylogenetic analysis of test isolates included assessment of variation in sequences and length of IGS and SSU-rRNA genes with reference to 16 different microsporidian sequences. The results proved that IGS sequences have more variation than SSU-rRNA gene sequences. Analysis of phylogenetic trees reveal that ...
Phylogenetic comparative methods complement discriminant function analysis in ecomorphology.

Science.gov (United States)

Barr, W Andrew; Scott, Robert S

2014-04-01

In ecomorphology, Discriminant Function Analysis (DFA) has been used as evidence for the presence of functional links between morphometric variables and ecological categories. Here we conduct simulations of characters containing phylogenetic signal to explore the performance of DFA under a variety of conditions. Characters were simulated using a phylogeny of extant antelope species from known habitats. Characters were modeled with no biomechanical relationship to the habitat category; the only sources of variation were body mass, phylogenetic signal, or random "noise." DFA on the discriminability of habitat categories was performed using subsets of the simulated characters, and Phylogenetic Generalized Least Squares (PGLS) was performed for each character. Analyses were repeated with randomized habitat assignments. When simulated characters lacked phylogenetic signal and/or habitat assignments were random, ecomorphology. Copyright © 2013 Wiley Periodicals, Inc.
Maximum Parsimony on Phylogenetic networks

Science.gov (United States)

2012-01-01

Background Phylogenetic networks are generalizations of phylogenetic trees, that are used to model evolutionary events in various contexts. Several different methods and criteria have been introduced for reconstructing phylogenetic trees. Maximum Parsimony is a character-based approach that infers a phylogenetic tree by minimizing the total number of evolutionary steps required to explain a given set of data assigned on the leaves. Exact solutions for optimizing parsimony scores on phylogenetic trees have been introduced in the past. Results In this paper, we define the parsimony score on networks as the sum of the substitution costs along all the edges of the network; and show that certain well-known algorithms that calculate the optimum parsimony score on trees, such as Sankoff and Fitch algorithms extend naturally for networks, barring conflicting assignments at the reticulate vertices. We provide heuristics for finding the optimum parsimony scores on networks. Our algorithms can be applied for any cost matrix that may contain unequal substitution costs of transforming between different characters along different edges of the network. We analyzed this for experimental data on 10 leaves or fewer with at most 2 reticulations and found that for almost all networks, the bounds returned by the heuristics matched with the exhaustively determined optimum parsimony scores. Conclusion The parsimony score we define here does not directly reflect the cost of the best tree in the network that displays the evolution of the character. However, when searching for the most parsimonious network that describes a collection of characters, it becomes necessary to add additional cost considerations to prefer simpler structures, such as trees over networks. The parsimony score on a network that we describe here takes into account the substitution costs along the additional edges incident on each reticulate vertex, in addition to the substitution costs along the other edges which are
Maximum parsimony, substitution model, and probability phylogenetic trees.

Science.gov (United States)

Weng, J F; Thomas, D A; Mareels, I

2011-01-01

The problem of inferring phylogenies (phylogenetic trees) is one of the main problems in computational biology. There are three main methods for inferring phylogenies-Maximum Parsimony (MP), Distance Matrix (DM) and Maximum Likelihood (ML), of which the MP method is the most well-studied and popular method. In the MP method the optimization criterion is the number of substitutions of the nucleotides computed by the differences in the investigated nucleotide sequences. However, the MP method is often criticized as it only counts the substitutions observable at the current time and all the unobservable substitutions that really occur in the evolutionary history are omitted. In order to take into account the unobservable substitutions, some substitution models have been established and they are now widely used in the DM and ML methods but these substitution models cannot be used within the classical MP method. Recently the authors proposed a probability representation model for phylogenetic trees and the reconstructed trees in this model are called probability phylogenetic trees. One of the advantages of the probability representation model is that it can include a substitution model to infer phylogenetic trees based on the MP principle. In this paper we explain how to use a substitution model in the reconstruction of probability phylogenetic trees and show the advantage of this approach with examples.

Towards an integrated phylogenetic classification of the Tremellomycetes.

Science.gov (United States)

Liu, X-Z; Wang, Q-M; Göker, M; Groenewald, M; Kachalkin, A V; Lumbsch, H T; Millanes, A M; Wedin, M; Yurkov, A M; Boekhout, T; Bai, F-Y

2015-06-01

Families and genera assigned to Tremellomycetes have been mainly circumscribed by morphology and for the yeasts also by biochemical and physiological characteristics. This phenotype-based classification is largely in conflict with molecular phylogenetic analyses. Here a phylogenetic classification framework for the Tremellomycetes is proposed based on the results of phylogenetic analyses from a seven-genes dataset covering the majority of tremellomycetous yeasts and closely related filamentous taxa. Circumscriptions of the taxonomic units at the order, family and genus levels recognised were quantitatively assessed using the phylogenetic rank boundary optimisation (PRBO) and modified general mixed Yule coalescent (GMYC) tests. In addition, a comprehensive phylogenetic analysis on an expanded LSU rRNA (D1/D2 domains) gene sequence dataset covering as many as available teleomorphic and filamentous taxa within Tremellomycetes was performed to investigate the relationships between yeasts and filamentous taxa and to examine the stability of undersampled clades. Based on the results inferred from molecular data and morphological and physiochemical features, we propose an updated classification for the Tremellomycetes. We accept five orders, 17 families and 54 genera, including seven new families and 18 new genera. In addition, seven families and 17 genera are emended and one new species name and 185 new combinations are proposed. We propose to use the term pro tempore or pro tem. in abbreviation to indicate the species names that are temporarily maintained.
Beyond trend analysis: How a modified breakpoint analysis enhances knowledge of agricultural production after Zimbabwe's fast track land reform

Science.gov (United States)

Hentze, Konrad; Thonfeld, Frank; Menz, Gunter

2017-10-01

In the discourse on land reform assessments, a significant lack of spatial and time-series data has been identified, especially with respect to Zimbabwe's ;Fast-Track Land Reform Programme; (FTLRP). At the same time, interest persists among land use change scientists to evaluate causes of land use change and therefore to increase the explanatory power of remote sensing products. This study recognizes these demands and aims to provide input on both levels: Evaluating the potential of satellite remote sensing time-series to answer questions which evolved after intensive land redistribution efforts in Zimbabwe; and investigating how time-series analysis of Normalized Difference Vegetation Index (NDVI) can be enhanced to provide information on land reform induced land use change. To achieve this, two time-series methods are applied to MODIS NDVI data: Seasonal Trend Analysis (STA) and Breakpoint Analysis for Additive Season and Trend (BFAST). In our first analysis, a link of agricultural productivity trends to different land tenure regimes shows that regional clustering of trends is more dominant than a relationship between tenure and trend with a slightly negative slope for all regimes. We demonstrate that clusters of strong negative and positive productivity trends are results of changing irrigation patterns. To locate emerging and fallow irrigation schemes in semi-arid Zimbabwe, a new multi-method approach is developed which allows to map changes from bimodal seasonal phenological patterns to unimodal and vice versa. With an enhanced breakpoint analysis through the combination of STA and BFAST, we are able to provide a technique that can be applied on large scale to map status and development of highly productive cropping systems, which are key for food production, national export and local employment. We therefore conclude that the combination of existing and accessible time-series analysis methods: is able to achieve both: overcoming demonstrated limitations of
An efficient and extensible approach for compressing phylogenetic trees

KAUST Repository

Matthews, Suzanne J; Williams, Tiffani L

2011-01-01

Background: Biologists require new algorithms to efficiently compress and store their large collections of phylogenetic trees. Our previous work showed that TreeZip is a promising approach for compressing phylogenetic trees. In this paper, we extend
Fire modifies the phylogenetic structure of soil bacterial co-occurrence networks.

Science.gov (United States)

Pérez-Valera, Eduardo; Goberna, Marta; Faust, Karoline; Raes, Jeroen; García, Carlos; Verdú, Miguel

2017-01-01

Fire alters ecosystems by changing the composition and community structure of soil microbes. The phylogenetic structure of a community provides clues about its main assembling mechanisms. While environmental filtering tends to reduce the community phylogenetic diversity by selecting for functionally (and hence phylogenetically) similar species, processes like competitive exclusion by limiting similarity tend to increase it by preventing the coexistence of functionally (and phylogenetically) similar species. We used co-occurrence networks to detect co-presence (bacteria that co-occur) or exclusion (bacteria that do not co-occur) links indicative of the ecological interactions structuring the community. We propose that inspecting the phylogenetic structure of co-presence or exclusion links allows to detect the main processes simultaneously assembling the community. We monitored a soil bacterial community after an experimental fire and found that fire altered its composition, richness and phylogenetic diversity. Both co-presence and exclusion links were more phylogenetically related than expected by chance. We interpret such a phylogenetic clustering in co-presence links as a result of environmental filtering, while that in exclusion links reflects competitive exclusion by limiting similarity. This suggests that environmental filtering and limiting similarity operate simultaneously to assemble soil bacterial communities, widening the traditional view that only environmental filtering structures bacterial communities. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
Conus pennaceus : a phylogenetic analysis of the Mozambican ...

African Journals Online (AJOL)

The genus Conus has over 500 species and is the most species-rich taxon of marine invertebrates. Based on mitochondrial DNA, this study focuses on the phylogenetics of Conus, particularly the pennaceus complex collected along the Mozambican coast. Phylogenetic trees based on both the 16S and the 12S ribosomal ...
Identification of Four Distinct Phylogenetic Groups in Flavobacterium columnare With Fish Host Associations

Directory of Open Access Journals (Sweden)

Benjamin R. LaFrentz

2018-03-01

Full Text Available Columnaris disease, caused by the Gram-negative bacterium Flavobacterium columnare, is one of the most prevalent fish diseases worldwide. An exceptionally high level of genetic diversity among isolates of F. columnare has long been recognized, whereby six established genomovars have been described to date. However, little has been done to quantify or characterize this diversity further in a systematic fashion. The objective of this research was to perform phylogenetic analyses of 16S rRNA and housekeeping gene sequences to decipher the genetic diversity of F. columnare. Fifty isolates and/or genomes of F. columnare, originating from diverse years, geographic locations, fish hosts, and representative of the six genomovars were analyzed in this study. A multilocus phylogenetic analysis (MLPA of the 16S rRNA and six housekeeping genes supported four distinct F. columnare genetic groups. There were associations between genomovar and genetic group, but these relationships were imperfect indicating that genomovar assignment does not accurately reflect F. columnare genetic diversity. To expand the dataset, an additional 90 16S rRNA gene sequences were retrieved from GenBank and a phylogenetic analysis of this larger dataset also supported the establishment of four genetic groups. Examination of isolate historical data indicated biological relevance to the identified genetic diversity, with some genetic groups isolated preferentially from specific fish species or families. It is proposed that F. columnare isolates be assigned to the four genetic groups defined in this study rather than genomovar in order to facilitate a standard nomenclature across the scientific community. An increased understanding of which genetic groups are most prevalent in different regions and/or aquaculture industries may allow for the development of improved targeted control and treatment measures for columnaris disease.
Trophic phylogenetics: evolutionary influences on body size, feeding, and species associations in grassland arthropods.

Science.gov (United States)

Lind, Eric M; Vincent, John B; Weiblen, George D; Cavender-Bares, Jeannine; Borer, Elizabeth T

2015-04-01

Contemporary animal-plant interactions such as herbivory are widely understood to be shaped by evolutionary history. Yet questions remain about the role of plant phylogenetic diversity in generating and maintaining herbivore diversity, and whether evolutionary relatedness of producers might predict the composition of consumer communities. We tested for evidence of evolutionary associations among arthropods and the plants on which they were found, using phylogenetic analysis of naturally occurring arthropod assemblages sampled from a plant-diversity manipulation experiment. Considering phylogenetic relationships among more than 900 arthropod consumer taxa and 29 plant species in the experiment, we addressed several interrelated questions. First, our results support the hypothesis that arthropod functional traits such as body size and trophic role are phylogenetically conserved in community ecological samples. Second, herbivores tended to cooccur with closer phylogenetic relatives than would be expected at random, whereas predators and parasitoids did not show phylogenetic association patterns. Consumer specialization, as measured by association through time with monocultures of particular host plant species, showed significant phylogenetic signal, although the. strength of this association varied among plant species. Polycultures of phylogenetically dissimilar plant species supported more phylogenetically dissimilar consumer communities than did phylogenetically similar polycultures. Finally, we separated the effects of plant species richness and relatedness in predicting the phylogenetic distribution of the arthropod assemblages in this experiment. The phylogenetic diversity of plant communities predicted the phylogenetic diversity of herbivore communities even after accounting for plant species richness. The phylogenetic diversity of secondary consumers differed by guild, with predator phylogenetic diversity responding to herbivore relatedness, while parasitoid
Enumerating all maximal frequent subtrees in collections of phylogenetic trees

Science.gov (United States)

2014-01-01

Background A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. Results We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Conclusions Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees. PMID:25061474
Phylogenetic fields through time: temporal dynamics of geographical co-occurrence and phylogenetic structure within species ranges.

Science.gov (United States)

Villalobos, Fabricio; Carotenuto, Francesco; Raia, Pasquale; Diniz-Filho, José Alexandre F

2016-04-05

Species co-occur with different sets of other species across their geographical distribution, which can be either closely or distantly related. Such co-occurrence patterns and their phylogenetic structure within individual species ranges represent what we call the species phylogenetic fields (PFs). These PFs allow investigation of the role of historical processes--speciation, extinction and dispersal--in shaping species co-occurrence patterns, in both extinct and extant species. Here, we investigate PFs of large mammalian species during the last 3 Myr, and how these correlate with trends in diversification rates. Using the fossil record, we evaluate species' distributional and co-occurrence patterns along with their phylogenetic structure. We apply a novel Bayesian framework on fossil occurrences to estimate diversification rates through time. Our findings highlight the effect of evolutionary processes and past climatic changes on species' distributions and co-occurrences. From the Late Pliocene to the Recent, mammal species seem to have responded in an individualistic manner to climate changes and diversification dynamics, co-occurring with different sets of species from different lineages across their geographical ranges. These findings stress the difficulty of forecasting potential effects of future climate changes on biodiversity. © 2016 The Author(s).
Orthology prediction at scalable resolution by phylogenetic tree analysis

NARCIS (Netherlands)

Heijden, R.T.J.M. van der; Snel, B.; Noort, V. van; Huynen, M.A.

2007-01-01

BACKGROUND: Orthology is one of the cornerstones of gene function prediction. Dividing the phylogenetic relations between genes into either orthologs or paralogs is however an oversimplification. Already in two-species gene-phylogenies, the complicated, non-transitive nature of phylogenetic
Applying phylogenetic analysis to viral livestock diseases: moving beyond molecular typing.

Science.gov (United States)

Olvera, Alex; Busquets, Núria; Cortey, Marti; de Deus, Nilsa; Ganges, Llilianne; Núñez, José Ignacio; Peralta, Bibiana; Toskano, Jennifer; Dolz, Roser

2010-05-01

Changes in livestock production systems in recent years have altered the presentation of many diseases resulting in the need for more sophisticated control measures. At the same time, new molecular assays have been developed to support the diagnosis of animal viral disease. Nucleotide sequences generated by these diagnostic techniques can be used in phylogenetic analysis to infer phenotypes by sequence homology and to perform molecular epidemiology studies. In this review, some key elements of phylogenetic analysis are highlighted, such as the selection of the appropriate neutral phylogenetic marker, the proper phylogenetic method and different techniques to test the reliability of the resulting tree. Examples are given of current and future applications of phylogenetic reconstructions in viral livestock diseases. Copyright 2009 Elsevier Ltd. All rights reserved.
PhyloBayes MPI: phylogenetic reconstruction with infinite mixtures of profiles in a parallel environment.

Science.gov (United States)

Lartillot, Nicolas; Rodrigue, Nicolas; Stubbs, Daniel; Richer, Jacques

2013-07-01

Modeling across site variation of the substitution process is increasingly recognized as important for obtaining more accurate phylogenetic reconstructions. Both finite and infinite mixture models have been proposed and have been shown to significantly improve on classical single-matrix models. Compared with their finite counterparts, infinite mixtures have a greater expressivity. However, they are computationally more challenging. This has resulted in practical compromises in the design of infinite mixture models. In particular, a fast but simplified version of a Dirichlet process model over equilibrium frequency profiles implemented in PhyloBayes has often been used in recent phylogenomics studies, while more refined model structures, more realistic and empirically more fit, have been practically out of reach. We introduce a message passing interface version of PhyloBayes, implementing the Dirichlet process mixture models as well as more classical empirical matrices and finite mixtures. The parallelization is made efficient thanks to the combination of two algorithmic strategies: a partial Gibbs sampling update of the tree topology and the use of a truncated stick-breaking representation for the Dirichlet process prior. The implementation shows close to linear gains in computational speed for up to 64 cores, thus allowing faster phylogenetic reconstruction under complex mixture models. PhyloBayes MPI is freely available from our website www.phylobayes.org.
Phylogenetic analysis of Common Garter Snake (Thamnophis sirtalis) stomach contents detects cryptic range of a secretive salamander (Ensatina eschscholtzii oregonensis) Herpetological Conservation and Biology 5(3):395–402

Science.gov (United States)

Sean B. Reilly; Andrew D Gottsho; Justin M. Garwood; Bryan. Jennings

2010-01-01

Given the current global amphibian decline, it is crucial to obtain accurate and current information regarding species distributions. Secretive amphibians such as plethodontid salamanders can be difficult to detect in many cases, especially in remote, high elevation areas. We used molecular phylogenetic analyses to identify three partially digested salamanders palped...
Phylogenetic congruence between subtropical trees and their associated fungi

NARCIS (Netherlands)

Liu, Xubing; Liang, Minxia; Etienne, Rampal S.; Gilbert, Gregory S; Yu, Shixiao

2016-01-01

Recent studies have detected phylogenetic signals in pathogen-host networks for both soil-borne and leaf-infecting fungi, suggesting that pathogenic fungi may track or coevolve with their preferred hosts. However, a phylogenetically concordant relationship between multiple hosts and multiple fungi
One tree to link them all: a phylogenetic dataset for the European tetrapoda.

Science.gov (United States)

Roquet, Cristina; Lavergne, Sébastien; Thuiller, Wilfried

2014-08-08

Since the ever-increasing availability of phylogenetic informative data, the last decade has seen an upsurge of ecological studies incorporating information on evolutionary relationships among species. However, detailed species-level phylogenies are still lacking for many large groups and regions, which are necessary for comprehensive large-scale eco-phylogenetic analyses. Here, we provide a dataset of 100 dated phylogenetic trees for all European tetrapods based on a mixture of supermatrix and supertree approaches. Phylogenetic inference was performed separately for each of the main Tetrapoda groups of Europe except mammals (i.e. amphibians, birds, squamates and turtles) by means of maximum likelihood (ML) analyses of supermatrix applying a tree constraint at the family (amphibians and squamates) or order (birds and turtles) levels based on consensus knowledge. For each group, we inferred 100 ML trees to be able to provide a phylogenetic dataset that accounts for phylogenetic uncertainty, and assessed node support with bootstrap analyses. Each tree was dated using penalized-likelihood and fossil calibration. The trees obtained were well-supported by existing knowledge and previous phylogenetic studies. For mammals, we modified the most complete supertree dataset available on the literature to include a recent update of the Carnivora clade. As a final step, we merged the phylogenetic trees of all groups to obtain a set of 100 phylogenetic trees for all European Tetrapoda species for which data was available (91%). We provide this phylogenetic dataset (100 chronograms) for the purpose of comparative analyses, macro-ecological or community ecology studies aiming to incorporate phylogenetic information while accounting for phylogenetic uncertainty.
Assessing the relationships between phylogenetic and functional singularities in sharks (Chondrichthyes).

Science.gov (United States)

Cachera, Marie; Le Loc'h, François

2017-08-01

The relationships between diversity and ecosystem functioning have become a major focus of science. A crucial issue is to estimate functional diversity, as it is intended to impact ecosystem dynamics and stability. However, depending on the ecosystem, it may be challenging or even impossible to directly measure ecological functions and thus functional diversity. Phylogenetic diversity was recently under consideration as a proxy for functional diversity. Phylogenetic diversity is indeed supposed to match functional diversity if functions are conservative traits along evolution. However, in case of adaptive radiation and/or evolutive convergence, a mismatch may appear between species phylogenetic and functional singularities. Using highly threatened taxa, sharks, this study aimed to explore the relationships between phylogenetic and functional diversities and singularities. Different statistical computations were used in order to test both methodological issue (phylogenetic reconstruction) and overall a theoretical questioning: the predictive power of phylogeny for function diversity. Despite these several methodological approaches, a mismatch between phylogeny and function was highlighted. This mismatch revealed that (i) functions are apparently nonconservative in shark species, and (ii) phylogenetic singularity is not a proxy for functional singularity. Functions appeared to be not conservative along the evolution of sharks, raising the conservational challenge to identify and protect both phylogenetic and functional singular species. Facing the current rate of species loss, it is indeed of major importance to target phylogenetically singular species to protect genetic diversity and also functionally singular species in order to maintain particular functions within ecosystem.
Species divergence and phylogenetic variation of ecophysiological traits in lianas and trees.

Science.gov (United States)

Rios, Rodrigo S; Salgado-Luarte, Cristian; Gianoli, Ernesto

2014-01-01

The climbing habit is an evolutionary key innovation in plants because it is associated with enhanced clade diversification. We tested whether patterns of species divergence and variation of three ecophysiological traits that are fundamental for plant adaptation to light environments (maximum photosynthetic rate [A(max)], dark respiration rate [R(d)], and specific leaf area [SLA]) are consistent with this key innovation. Using data reported from four tropical forests and three temperate forests, we compared phylogenetic distance among species as well as the evolutionary rate, phylogenetic distance and phylogenetic signal of those traits in lianas and trees. Estimates of evolutionary rates showed that R(d) evolved faster in lianas, while SLA evolved faster in trees. The mean phylogenetic distance was 1.2 times greater among liana species than among tree species. Likewise, estimates of phylogenetic distance indicated that lianas were less related than by chance alone (phylogenetic evenness across 63 species), and trees were more related than expected by chance (phylogenetic clustering across 71 species). Lianas showed evenness for R(d), while trees showed phylogenetic clustering for this trait. In contrast, for SLA, lianas exhibited phylogenetic clustering and trees showed phylogenetic evenness. Lianas and trees showed patterns of ecophysiological trait variation among species that were independent of phylogenetic relatedness. We found support for the expected pattern of greater species divergence in lianas, but did not find consistent patterns regarding ecophysiological trait evolution and divergence. R(d) followed the species-level pattern, i.e., greater divergence/evolution in lianas compared to trees, while the opposite occurred for SLA and no pattern was detected for A(max). R(d) may have driven lianas' divergence across forest environments, and might contribute to diversification in climber clades.
Load Balancing Issues with Constructing Phylogenetic Trees using Neighbour-Joining Algorithm

International Nuclear Information System (INIS)

Al Mamun, S M

2012-01-01

Phylogenetic tree construction is one of the most important and interesting problems in bioinformatics. Constructing an efficient phylogenetic tree has always been a research issue. It needs to consider both the correctness and the speed of the tree construction. In this paper, we implemented the neighbour-joining algorithm, using Message Passing Interface (MPI) for constructing the phylogenetic tree. Performance is efficacious, comparing to the best sequential algorithm. From this paper, it would be clear to the researchers that how load balance can make a great effect for constructing phylogenetic trees using neighbour-joining algorithm.
Efficient Detection of Repeating Sites to Accelerate Phylogenetic Likelihood Calculations.

Science.gov (United States)

Kobert, K; Stamatakis, A; Flouri, T

2017-03-01

The phylogenetic likelihood function (PLF) is the major computational bottleneck in several applications of evolutionary biology such as phylogenetic inference, species delimitation, model selection, and divergence times estimation. Given the alignment, a tree and the evolutionary model parameters, the likelihood function computes the conditional likelihood vectors for every node of the tree. Vector entries for which all input data are identical result in redundant likelihood operations which, in turn, yield identical conditional values. Such operations can be omitted for improving run-time and, using appropriate data structures, reducing memory usage. We present a fast, novel method for identifying and omitting such redundant operations in phylogenetic likelihood calculations, and assess the performance improvement and memory savings attained by our method. Using empirical and simulated data sets, we show that a prototype implementation of our method yields up to 12-fold speedups and uses up to 78% less memory than one of the fastest and most highly tuned implementations of the PLF currently available. Our method is generic and can seamlessly be integrated into any phylogenetic likelihood implementation. [Algorithms; maximum likelihood; phylogenetic likelihood function; phylogenetics]. © The Author(s) 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Mycorrhizae support oaks growing in a phylogenetically distant neighbourhood

NARCIS (Netherlands)

Yguel, B.; Courty, P.E.; Jactel, H.; Pan, X.; Butenschoen, O.; Murray, P.J.; Prinzing, A.

2014-01-01

Host-plants may rarely leave their ancestral niche and in which case they tend to be surrounded by phylogenetically distant neighbours. Phylogenetically isolated host-plants might share few mutualists with their neighbours and might suffer from a decrease in mutualist support. In addition host

Maximum likelihood phylogenetic reconstruction from high-resolution whole-genome data and a tree of 68 eukaryotes.

Science.gov (United States)

Lin, Yu; Hu, Fei; Tang, Jijun; Moret, Bernard M E

2013-01-01

The rapid accumulation of whole-genome data has renewed interest in the study of the evolution of genomic architecture, under such events as rearrangements, duplications, losses. Comparative genomics, evolutionary biology, and cancer research all require tools to elucidate the mechanisms, history, and consequences of those evolutionary events, while phylogenetics could use whole-genome data to enhance its picture of the Tree of Life. Current approaches in the area of phylogenetic analysis are limited to very small collections of closely related genomes using low-resolution data (typically a few hundred syntenic blocks); moreover, these approaches typically do not include duplication and loss events. We describe a maximum likelihood (ML) approach for phylogenetic analysis that takes into account genome rearrangements as well as duplications, insertions, and losses. Our approach can handle high-resolution genomes (with 40,000 or more markers) and can use in the same analysis genomes with very different numbers of markers. Because our approach uses a standard ML reconstruction program (RAxML), it scales up to large trees. We present the results of extensive testing on both simulated and real data showing that our approach returns very accurate results very quickly. In particular, we analyze a dataset of 68 high-resolution eukaryotic genomes, with from 3,000 to 42,000 genes, from the eGOB database; the analysis, including bootstrapping, takes just 3 hours on a desktop system and returns a tree in agreement with all well supported branches, while also suggesting resolutions for some disputed placements.
[PK/PD breakpoints and clinical/bacteriological effects of cefcapene pivoxil fine granules for children at free drug concentrations in pediatric patients with respiratory infection].

Science.gov (United States)

Toyonaga, Yoshikiyo; Iwai, Naoichi; Motohiro, Takashi; Sunakawa, Keisuke; Fujii, Ryochi

2008-06-01

A post-marketing clinical study was previously conducted in pediatric patients with respiratory infection to evaluate the pharmacokinetics, efficacy and safety of cefcapene pivoxil (CFPN-PI) fine granules for children. Based on the results from this study, we evaluated PK/PD breakpoints and clinical/bacteriological effects of CFPN-PI at free drug concentrations in pediatric patients with respiratory infection to determine an effective and safe dosage regimen of CFPN-PI. The following results were obtained from 61 pediatric patients evaluated in our research. 1) The response rate of pediatric respiratory infection to CFPN-PI was 100% for laryngopharyngitis, 84.6% for acute bronchitis, 100% for tonsillitis, 100% for pneumonia and 95.8% for all. 2) The bacteriological response (eradication rate of Haemophilus influenzae, Streptococcus pyogenes, Moraxella catarrhalis, Streptococcus pneumoniae, etc.) of pediatric respiratory infection to CFPN-PI was 87.5% for laryngopharyngitis, 66.7% for acute bronchitis, 75.0% for tonsillitis, 63.6% for pneumonia and 73.8% for all. 3) The blood concentration simulation demonstrated that the PK/PD breakpoint exceeding the time above MIC (TAM) of 40% after administration of CFPN-PI 3 mg/kg three times daily was 0.27 microg/mL. 4) The pediatric patients with respiratory infection were stratified by the TAM (%) of CFPN-PI into 40% to 100% (TAM > or = 40% group) and 0% to 40% (TAM or = 40% group, and 88.9% and 62.5% in the TAM or = 40% group than in the TAM < 40% group, although the between-group difference was not statistically significant.
Whole Genome Phylogenetic Tree Reconstruction using Colored de Bruijn Graphs

OpenAIRE

Lyman, Cole

2017-01-01

We present kleuren, a novel assembly-free method to reconstruct phylogenetic trees using the Colored de Bruijn Graph. kleuren works by constructing the Colored de Bruijn Graph and then traversing it, finding bubble structures in the graph that provide phylogenetic signal. The bubbles are then aligned and concatenated to form a supermatrix, from which a phylogenetic tree is inferred. We introduce the algorithm that kleuren uses to accomplish this task, and show its performance on reconstructin...
Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction.

Science.gov (United States)

Mai, Uyen; Sayyari, Erfan; Mirarab, Siavash

2017-01-01

Phylogenetic trees inferred using commonly-used models of sequence evolution are unrooted, but the root position matters both for interpretation and downstream applications. This issue has been long recognized; however, whether the potential for discordance between the species tree and gene trees impacts methods of rooting a phylogenetic tree has not been extensively studied. In this paper, we introduce a new method of rooting a tree based on its branch length distribution; our method, which minimizes the variance of root to tip distances, is inspired by the traditional midpoint rerooting and is justified when deviations from the strict molecular clock are random. Like midpoint rerooting, the method can be implemented in a linear time algorithm. In extensive simulations that consider discordance between gene trees and the species tree, we show that the new method is more accurate than midpoint rerooting, but its relative accuracy compared to using outgroups to root gene trees depends on the size of the dataset and levels of deviations from the strict clock. We show high levels of error for all methods of rooting estimated gene trees due to factors that include effects of gene tree discordance, deviations from the clock, and gene tree estimation error. Our simulations, however, did not reveal significant differences between two equivalent methods for species tree estimation that use rooted and unrooted input, namely, STAR and NJst. Nevertheless, our results point to limitations of existing scalable rooting methods.
Distance-Based Phylogenetic Methods Around a Polytomy.

Science.gov (United States)

Davidson, Ruth; Sullivant, Seth

2014-01-01

Distance-based phylogenetic algorithms attempt to solve the NP-hard least-squares phylogeny problem by mapping an arbitrary dissimilarity map representing biological data to a tree metric. The set of all dissimilarity maps is a Euclidean space properly containing the space of all tree metrics as a polyhedral fan. Outputs of distance-based tree reconstruction algorithms such as UPGMA and neighbor-joining are points in the maximal cones in the fan. Tree metrics with polytomies lie at the intersections of maximal cones. A phylogenetic algorithm divides the space of all dissimilarity maps into regions based upon which combinatorial tree is reconstructed by the algorithm. Comparison of phylogenetic methods can be done by comparing the geometry of these regions. We use polyhedral geometry to compare the local nature of the subdivisions induced by least-squares phylogeny, UPGMA, and neighbor-joining when the true tree has a single polytomy with exactly four neighbors. Our results suggest that in some circumstances, UPGMA and neighbor-joining poorly match least-squares phylogeny.
The power and pitfalls of HIV phylogenetics in public health.

Science.gov (United States)

Brooks, James I; Sandstrom, Paul A

2013-07-25

Phylogenetics is the application of comparative studies of genetic sequences in order to infer evolutionary relationships among organisms. This tool can be used as a form of molecular epidemiology to enhance traditional population-level communicable disease surveillance. Phylogenetic study has resulted in new paradigms being created in the field of communicable diseases and this commentary aims to provide the reader with an explanation of how phylogenetics can be used in tracking infectious diseases. Special emphasis will be placed upon the application of phylogenetics as a tool to help elucidate HIV transmission patterns and the limitations to these methods when applied to forensic analysis. Understanding infectious disease epidemiology in order to prevent new transmissions is the sine qua non of public health. However, with increasing epidemiological resolution, there may be an associated potential loss of privacy to the individual. It is within this context that we aim to promote the discussion on how to use phylogenetics to achieve important public health goals, while at the same time protecting the rights of the individual.
Phylogenetic Pattern, Evolutionary Processes and Species Delimitation in the Genus Echinococcus.

Science.gov (United States)

Lymbery, A J

2017-01-01

An accurate and stable alpha taxonomy requires a clear conception of what constitutes a species and agreed criteria for delimiting different species. An evolutionary or general lineage concept defines a species as a single lineage of organisms with a common evolutionary trajectory, distinguishable from other such lineages. Delimiting evolutionary species is a two-step process. In the first step, phylogenetic reconstruction identifies putative species as groups of organisms that are monophyletic (share a common ancestor) and exclusive (more closely related to each other than to organisms outside the group). The second step is to assess whether members of the group possess genetic exchangeability (where cohesion is maintained by gene flow among populations) or ecological exchangeability (where cohesion is maintained because populations occupy the same ecological niche). Recent taxonomic reviews have recognized nine species within the genus Echinococcus. Phylogenetic reconstructions of the relationships between these putative species using mtDNA and nuclear gene sequences show that for the most part these nine species are monophyletic, although there are important incongruences that need to be resolved. Applying the criteria of genetic and ecological exchangeability suggests that seven of the currently recognized species represent evolutionarily distinct lineages. The species status of Echinococcus canadensis and Echinococcus ortleppi could not be confirmed. Coalescent-based analyses represent a promising approach to species delimitation in these closely related taxa. It seems likely, from a comparison of sister species groups, that speciation in the genus has been driven by geographic isolation, but biogeographic scenarios are largely speculative and require further testing. Copyright © 2017 Elsevier Ltd. All rights reserved.
A Universal Phylogenetic Tree.

Science.gov (United States)

Offner, Susan

2001-01-01

Presents a universal phylogenetic tree suitable for use in high school and college-level biology classrooms. Illustrates the antiquity of life and that all life is related, even if it dates back 3.5 billion years. Reflects important evolutionary relationships and provides an exciting way to learn about the history of life. (SAH)
The idic(X)(q13) in myeloid malignancies: breakpoint clustering in segmental duplications and association with TET2 mutations

DEFF Research Database (Denmark)

Paulsson, Kajsa; Haferlach, Claudia; Fonatsch, Christa

2010-01-01

Myelodysplastic syndromes and acute myeloid leukemia with an isodicentric X chromosome [idic(X)(q13)] occur in elderly women and frequently display ringed sideroblasts. Because of the rarity of idic(X)(q13), little is known about its formation, whether a fusion gene is generated, and patterns...... of additional aberrations. We here present an SNP array study of 14 idic(X)-positive myeloid malignancies, collected through an international collaborative effort. The breakpoints clustered in two regions of segmental duplications and were not in a gene, making dosage effects from the concurrent gain of Xpter......-q13 and loss of Xq13-qter, rather than a fusion gene, the most likely pathogenetic outcome. Methylation analysis revealed involvement of the inactive X chromosomes in five cases and of the active in two. The ABCB7 gene, mutated in X-linked sideroblastic anemia and spinocerebellar ataxia...
Evolution of the brain and phylogenetic development of Mrican ...

African Journals Online (AJOL)

Evolution of the brain and phylogenetic development of Mrican Bovidae. Henriette Oboussier. Zoological Institute and Museum, University of Hamburg. Evidence drawn from the study of 270 brains of 54 species and subspecies of African Bovidae makes it possible to base phylogenetic relationships on the similarities in the ...
Orthology prediction at scalable resolution by phylogenetic tree analysis

Directory of Open Access Journals (Sweden)

Huynen Martijn A

2007-03-01

Full Text Available Abstract Background Orthology is one of the cornerstones of gene function prediction. Dividing the phylogenetic relations between genes into either orthologs or paralogs is however an oversimplification. Already in two-species gene-phylogenies, the complicated, non-transitive nature of phylogenetic relations results in inparalogs and outparalogs. For situations with more than two species we lack semantics to specifically describe the phylogenetic relations, let alone to exploit them. Published procedures to extract orthologous groups from phylogenetic trees do not allow identification of orthology at various levels of resolution, nor do they document the relations between the orthologous groups. Results We introduce "levels of orthology" to describe the multi-level nature of gene relations. This is implemented in a program LOFT (Levels of Orthology From Trees that assigns hierarchical orthology numbers to genes based on a phylogenetic tree. To decide upon speciation and gene duplication events in a tree LOFT can be instructed either to perform classical species-tree reconciliation or to use the species overlap between partitions in the tree. The hierarchical orthology numbers assigned by LOFT effectively summarize the phylogenetic relations between genes. The resulting high-resolution orthologous groups are depicted in colour, facilitating visual inspection of (large trees. A benchmark for orthology prediction, that takes into account the varying levels of orthology between genes, shows that the phylogeny-based high-resolution orthology assignments made by LOFT are reliable. Conclusion The "levels of orthology" concept offers high resolution, reliable orthology, while preserving the relations between orthologous groups. A Windows as well as a preliminary Java version of LOFT is available from the LOFT website http://www.cmbi.ru.nl/LOFT.
The macroecology of phylogenetically structured hummingbird-plant networks

DEFF Research Database (Denmark)

González, Ana M. Martín; Dalsgaard, Bo; Nogues, David Bravo

2015-01-01

Aim To investigate the association between hummingbird–plant network structure and species richness, phylogenetic signal on species' interaction pattern, insularity and historical and current climate. Location Fifty-four communities along a c. 10,000 km latitudinal gradient across the Americas (39...... approach, we examined the influence of species richness, phylogenetic signal, insularity and current and historical climate conditions on network structure (null-model-corrected specialization and modularity). Results Phylogenetically related species, especially plants, showed a tendency to interact...... with a similar array of mutualistic partners. The spatial variation in network structure exhibited a constant association with species phylogeny (R2 = 0.18–0.19); however, network structure showed the strongest association with species richness and environmental factors (R2 = 0.20–0.44 and R2 = 0...
Pharmacokinetic-pharmacodynamic (PK-PD) modeling and the rational selection of dosage regimes for the prudent use of antimicrobial drugs.

Science.gov (United States)

Papich, Mark G

2014-07-16

One of the strategies to decrease inappropriate antimicrobial use in veterinary medicine is to apply pharmacokinetic-pharmacodynamic (PK-PD) principles to dosing regimens. If antimicrobials are used appropriately by applying these principles to attain targets for area-under-the-curve to MIC ratio (AUC/MIC), peak concentration to MIC ratio (CMAX/MIC), and time above MIC (T>MIC), more effective antibiotic therapy is possible, thus avoiding ineffective administration. Another mechanism whereby inappropriate antibiotic administration can be avoided is to use accurate Interpretive Criteria established by the Clinical Laboratory Standards Institute (CLSI) for breakpoint selection. Inaccurate breakpoints will encourage antibiotic administration that is likely to be ineffective. For newly approved antimicrobials, three criteria are used for determining breakpoints: PK-PD criteria, MIC distributions, and clinical response. For older (often generic drugs) evaluated by the CLSI, recent clinical data may not be available and breakpoints are derived from PK-PD principles, wild-type distributions, and Monte Carlo simulations. It is the goal of the CLSI subcommittee that these revised breakpoints will encourage more effective antimicrobial use and avoid unnecessary antimicrobial administration. Copyright © 2014 Elsevier B.V. All rights reserved.
TREEFINDER: a powerful graphical analysis environment for molecular phylogenetics

Directory of Open Access Journals (Sweden)

von Haeseler Arndt

2004-06-01

Full Text Available Abstract Background Most analysis programs for inferring molecular phylogenies are difficult to use, in particular for researchers with little programming experience. Results TREEFINDER is an easy-to-use integrative platform-independent analysis environment for molecular phylogenetics. In this paper the main features of TREEFINDER (version of April 2004 are described. TREEFINDER is written in ANSI C and Java and implements powerful statistical approaches for inferring gene tree and related analyzes. In addition, it provides a user-friendly graphical interface and a phylogenetic programming language. Conclusions TREEFINDER is a versatile framework for analyzing phylogenetic data across different platforms that is suited both for exploratory as well as advanced studies.
Antimicrobial susceptibility of gram-negative pathogens isolated from patients with complicated intra-abdominal infections in South African hospitals (SMART Study 2004-2009): impact of the new carbapenem breakpoints.

Science.gov (United States)

Brink, Adrian J; Botha, Roelof F; Poswa, Xoliswa; Senekal, Marthinus; Badal, Robert E; Grolman, David C; Richards, Guy A; Feldman, Charles; Boffard, Kenneth D; Veller, Martin; Joubert, Ivan; Pretorius, Jan

2012-02-01

The Study for Monitoring Antimicrobial Resistance Trends (SMART) follows trends in resistance among aerobic and facultative anaerobic gram-negative bacilli (GNB) isolated from complicated intra-abdominal infections (cIAIs) in patients around the world. During 2004-2009, three centralized clinical microbiology laboratories serving 59 private hospitals in three large South African cities collected 1,218 GNB from complicated intra-abdominal infections (cIAIs) and tested them for susceptibility to 12 antibiotics according to the 2011 Clinical Laboratory Standards Institute (CLSI) guidelines. Enterobacteriaceae comprised 83.7% of the isolates. Escherichia coli was the species isolated most commonly (46.4%), and 7.6% of these were extended-spectrum β-lactamase (ESBL)-positive. The highest ESBL rate was documented for Klebsiella pneumoniae (41.2%). Overall, ertapenem was the antibiotic most active against susceptible species for which it has breakpoints (94.6%) followed by amikacin (91.9%), piperacillin-tazobactam (89.3%), and imipenem-cilastatin (87.1%), whereas rates of resistance to ceftriaxone, cefotaxime, ciprofloxacin, and levofloxacin were documented to be 29.7%, 28.7%, 22.5%, and 21.1%, respectively. Multi-drug resistance (MDR), defined as resistance to three or more antibiotic classes, was significantly more common in K. pneumoniae (27.9%) than in E. coli (4.9%; p<0.0001) or Proteus mirabilis (4.1%; p<0.05). Applying the new CLSI breakpoints for carbapenems, susceptibility to ertapenem was reduced significantly in ESBL-positive E. coli compared with ESBL-negative isolates (91% vs. 98%; p<0.05), but this did not apply to imipenem-cilastatin (95% vs. 99%; p=0.0928). A large disparity between imipenem-cilastatin and ertapenem susceptibility in P. mirabilis and Morganella morganii was documented (24% vs. 96% and 15% vs. 92%, respectively), as most isolates of these two species had imipenem-cilastatin minimum inhibitory concentrations in the 2-4 mcg/mL range, which
Phylogenetic trees in bioinformatics

Energy Technology Data Exchange (ETDEWEB)

Burr, Tom L [Los Alamos National Laboratory

2008-01-01

Genetic data is often used to infer evolutionary relationships among a collection of viruses, bacteria, animal or plant species, or other operational taxonomic units (OTU). A phylogenetic tree depicts such relationships and provides a visual representation of the estimated branching order of the OTUs. Tree estimation is unique for several reasons, including: the types of data used to represent each OTU; the use ofprobabilistic nucleotide substitution models; the inference goals involving both tree topology and branch length, and the huge number of possible trees for a given sample of a very modest number of OTUs, which implies that fmding the best tree(s) to describe the genetic data for each OTU is computationally demanding. Bioinformatics is too large a field to review here. We focus on that aspect of bioinformatics that includes study of similarities in genetic data from multiple OTUs. Although research questions are diverse, a common underlying challenge is to estimate the evolutionary history of the OTUs. Therefore, this paper reviews the role of phylogenetic tree estimation in bioinformatics, available methods and software, and identifies areas for additional research and development.
Phylogenetic congruence between subtropical trees and their associated fungi.

Science.gov (United States)

Liu, Xubing; Liang, Minxia; Etienne, Rampal S; Gilbert, Gregory S; Yu, Shixiao

2016-12-01

Recent studies have detected phylogenetic signals in pathogen-host networks for both soil-borne and leaf-infecting fungi, suggesting that pathogenic fungi may track or coevolve with their preferred hosts. However, a phylogenetically concordant relationship between multiple hosts and multiple fungi in has rarely been investigated. Using next-generation high-throughput DNA sequencing techniques, we analyzed fungal taxa associated with diseased leaves, rotten seeds, and infected seedlings of subtropical trees. We compared the topologies of the phylogenetic trees of the soil and foliar fungi based on the internal transcribed spacer (ITS) region with the phylogeny of host tree species based on matK , rbcL , atpB, and 5.8S genes. We identified 37 foliar and 103 soil pathogenic fungi belonging to the Ascomycota and Basidiomycota phyla and detected significantly nonrandom host-fungus combinations, which clustered on both the fungus phylogeny and the host phylogeny. The explicit evidence of congruent phylogenies between tree hosts and their potential fungal pathogens suggests either diffuse coevolution among the plant-fungal interaction networks or that the distribution of fungal species tracked spatially associated hosts with phylogenetically conserved traits and habitat preferences. Phylogenetic conservatism in plant-fungal interactions within a local community promotes host and parasite specificity, which is integral to the important role of fungi in promoting species coexistence and maintaining biodiversity of forest communities.
Turnover of plant lineages shapes herbivore phylogenetic beta diversity along ecological gradients.

Science.gov (United States)

Pellissier, Loïc; Ndiribe, Charlotte; Dubuis, Anne; Pradervand, Jean-Nicolas; Salamin, Nicolas; Guisan, Antoine; Rasmann, Sergio

2013-05-01

Understanding drivers of biodiversity patterns is of prime importance in this era of severe environmental crisis. More diverse plant communities have been postulated to represent a larger functional trait-space, more likely to sustain a diverse assembly of herbivore species. Here, we expand this hypothesis to integrate environmental, functional and phylogenetic variation of plant communities as factors explaining the diversity of lepidopteran assemblages along elevation gradients in the Swiss Western Alps. According to expectations, we found that the association between butterflies and their host plants is highly phylogenetically structured. Multiple regression analyses showed the combined effect of climate, functional traits and phylogenetic diversity in structuring butterfly communities. Furthermore, we provide the first evidence that plant phylogenetic beta diversity is the major driver explaining butterfly phylogenetic beta diversity. Along ecological gradients, the bottom up control of herbivore diversity is thus driven by phylogenetically structured turnover of plant traits as well as environmental variables. © 2013 Blackwell Publishing Ltd/CNRS.
Isolation and molecular analysis of inv dup(15) and construction of a physical map of a common breakpoint in order to elucidate their mechanism of formation.

Science.gov (United States)

Wandstrat, A E; Schwartz, S

2000-11-01

An inverted duplication of chromosome 15 [inv dup(15)] is the most common supernumerary marker chromosome, comprising approximately 50% of all chromosomes in this class. Structurally, the inv dup(15) is a mirror image with the central axis defining a distal break within either the heterochromatic alpha-satellite array or along the euchromatin in the long (q) arm of the chromosome. There are several types of inv dup(15), classified by the amount of euchromatic material present. Generally, they are bisatellited, pseudodicentric and have a breakpoint in 15q11-q14. A suggested mechanism of formation of inv dup(15) involves illegitimate recombination between homologous chromosomes followed by nondisjunction and centromere inactivation. The proximal portion of chromosome 15 contains several low-copy repeat sequence families and it has been hypothesized that errors in pairing among these repeats may result in structural rearrangements of this chromosome including the inv dup(15). To test this hypothesis and to determine the mechanism of formation, the inv dup(15) from four cases was isolated in somatic cell hybrids and polymerase chain reaction microsatellite markers were used to determine the origin of exchange. Two appeared to result from interchromosomal and two from intrachromosomal exchange, one of which occurred post-recombination. In addition, a detailed physical map of the breakpoint region in the largest inv dup(15) was constructed placing eight new sequence-tagged sites and ten new bacterial artificial chromosome markers in the region.
Efficient FPT Algorithms for (Strict) Compatibility of Unrooted Phylogenetic Trees.

Science.gov (United States)

Baste, Julien; Paul, Christophe; Sau, Ignasi; Scornavacca, Celine

2017-04-01

In phylogenetics, a central problem is to infer the evolutionary relationships between a set of species X; these relationships are often depicted via a phylogenetic tree-a tree having its leaves labeled bijectively by elements of X and without degree-2 nodes-called the "species tree." One common approach for reconstructing a species tree consists in first constructing several phylogenetic trees from primary data (e.g., DNA sequences originating from some species in X), and then constructing a single phylogenetic tree maximizing the "concordance" with the input trees. The obtained tree is our estimation of the species tree and, when the input trees are defined on overlapping-but not identical-sets of labels, is called "supertree." In this paper, we focus on two problems that are central when combining phylogenetic trees into a supertree: the compatibility and the strict compatibility problems for unrooted phylogenetic trees. These problems are strongly related, respectively, to the notions of "containing as a minor" and "containing as a topological minor" in the graph community. Both problems are known to be fixed parameter tractable in the number of input trees k, by using their expressibility in monadic second-order logic and a reduction to graphs of bounded treewidth. Motivated by the fact that the dependency on k of these algorithms is prohibitively large, we give the first explicit dynamic programming algorithms for solving these problems, both running in time [Formula: see text], where n is the total size of the input.

PhyloSift: phylogenetic analysis of genomes and metagenomes.

Science.gov (United States)

Darling, Aaron E; Jospin, Guillaume; Lowe, Eric; Matsen, Frederick A; Bik, Holly M; Eisen, Jonathan A

2014-01-01

Like all organisms on the planet, environmental microbes are subject to the forces of molecular evolution. Metagenomic sequencing provides a means to access the DNA sequence of uncultured microbes. By combining DNA sequencing of microbial communities with evolutionary modeling and phylogenetic analysis we might obtain new insights into microbiology and also provide a basis for practical tools such as forensic pathogen detection. In this work we present an approach to leverage phylogenetic analysis of metagenomic sequence data to conduct several types of analysis. First, we present a method to conduct phylogeny-driven Bayesian hypothesis tests for the presence of an organism in a sample. Second, we present a means to compare community structure across a collection of many samples and develop direct associations between the abundance of certain organisms and sample metadata. Third, we apply new tools to analyze the phylogenetic diversity of microbial communities and again demonstrate how this can be associated to sample metadata. These analyses are implemented in an open source software pipeline called PhyloSift. As a pipeline, PhyloSift incorporates several other programs including LAST, HMMER, and pplacer to automate phylogenetic analysis of protein coding and RNA sequences in metagenomic datasets generated by modern sequencing platforms (e.g., Illumina, 454).
PhyloSift: phylogenetic analysis of genomes and metagenomes

Directory of Open Access Journals (Sweden)

Aaron E. Darling

2014-01-01

Full Text Available Like all organisms on the planet, environmental microbes are subject to the forces of molecular evolution. Metagenomic sequencing provides a means to access the DNA sequence of uncultured microbes. By combining DNA sequencing of microbial communities with evolutionary modeling and phylogenetic analysis we might obtain new insights into microbiology and also provide a basis for practical tools such as forensic pathogen detection.In this work we present an approach to leverage phylogenetic analysis of metagenomic sequence data to conduct several types of analysis. First, we present a method to conduct phylogeny-driven Bayesian hypothesis tests for the presence of an organism in a sample. Second, we present a means to compare community structure across a collection of many samples and develop direct associations between the abundance of certain organisms and sample metadata. Third, we apply new tools to analyze the phylogenetic diversity of microbial communities and again demonstrate how this can be associated to sample metadata.These analyses are implemented in an open source software pipeline called PhyloSift. As a pipeline, PhyloSift incorporates several other programs including LAST, HMMER, and pplacer to automate phylogenetic analysis of protein coding and RNA sequences in metagenomic datasets generated by modern sequencing platforms (e.g., Illumina, 454.
Statistical assignment of DNA sequences using Bayesian phylogenetics

DEFF Research Database (Denmark)

Terkelsen, Kasper Munch; Boomsma, Wouter Krogh; Huelsenbeck, John P.

2008-01-01

We provide a new automated statistical method for DNA barcoding based on a Bayesian phylogenetic analysis. The method is based on automated database sequence retrieval, alignment, and phylogenetic analysis using a custom-built program for Bayesian phylogenetic analysis. We show on real data...... that the method outperforms Blast searches as a measure of confidence and can help eliminate 80% of all false assignment based on best Blast hit. However, the most important advance of the method is that it provides statistically meaningful measures of confidence. We apply the method to a re......-analysis of previously published ancient DNA data and show that, with high statistical confidence, most of the published sequences are in fact of Neanderthal origin. However, there are several cases of chimeric sequences that are comprised of a combination of both Neanderthal and modern human DNA....
Continental scale patterns and predictors of fern richness and phylogenetic diversity

Directory of Open Access Journals (Sweden)

Nathalie eNagalingum

2015-04-01

Full Text Available Because ferns have a wide range of habitat preferences and are widely distributed, they are an ideal group for understanding how diversity is distributed. Here we examine fern diversity on a broad-scale using standard and corrected richness measures as well as phylogenetic indices; in addition we determine the environmental predictors of each diversity metric. Using the combined records of Australian herbaria, a dataset of over 60,000 records was obtained for 89 genera to infer richness. A phylogenetic tree of all the genera was constructed and combined with the herbarium records to obtain phylogenetic diversity patterns. A hotspot of both taxic and phylogenetic diversity occurs in the Wet Tropics of northeastern Australia. Although considerable diversity is distributed along the eastern coast, some important regions of diversity are identified only after sample-standardization of richness and through the phylogenetic metric. Of all of the metrics, annual precipitation was identified as the most explanatory variable, in part, in agreement with global and regional fern studies. Precipitation was combined with a different variable for each different metric. For corrected richness, precipitation is combined with temperature seasonality, while correlation of phylogenetic diversity to precipitation plus radiation indicates support for the species-energy hypothesis. Significantly high and significantly low phylogenetic diversity were found in geographically separate areas. These areas are correlated with different climatic conditions such as seasonality in precipitation. The use of phylogenetic metrics identifies additional areas of significant diversity, some of which have not been revealed using traditional taxonomic analyses, suggesting that different ecological and evolutionary processes have operated over the continent. Our study demonstrates that it is possible and vital to incorporate evolutionary metrics when inferring biodiversity hotspots
Fast algorithms for computing phylogenetic divergence time.

Science.gov (United States)

Crosby, Ralph W; Williams, Tiffani L

2017-12-06

The inference of species divergence time is a key step in most phylogenetic studies. Methods have been available for the last ten years to perform the inference, but the performance of the methods does not yet scale well to studies with hundreds of taxa and thousands of DNA base pairs. For example a study of 349 primate taxa was estimated to require over 9 months of processing time. In this work, we present a new algorithm, AncestralAge, that significantly improves the performance of the divergence time process. As part of AncestralAge, we demonstrate a new method for the computation of phylogenetic likelihood and our experiments show a 90% improvement in likelihood computation time on the aforementioned dataset of 349 primates taxa with over 60,000 DNA base pairs. Additionally, we show that our new method for the computation of the Bayesian prior on node ages reduces the running time for this computation on the 349 taxa dataset by 99%. Through the use of these new algorithms we open up the ability to perform divergence time inference on large phylogenetic studies.
Phylogenetics of neotropical Platymiscium (Leguminosae

DEFF Research Database (Denmark)

Saslis-Lagoudakis, C. Haris; Chase, Mark W; Robinson, Daniel N

2008-01-01

Platymiscium is a neotropical legume genus of forest trees in the Pterocarpus clade of the pantropical "dalbergioid" clade. It comprises 19 species (29 taxa), distributed from Mexico to southern Brazil. This study presents a molecular phylogenetic analysis of Platymiscium and allies inferred from...
Phylogenetic relationships among Maloideae species

Science.gov (United States)

The Maloideae is a highly diverse sub-family of the Rosaceae containing several agronomically important species (Malus sp. and Pyrus sp.) and their wild relatives. Previous phylogenetic work within the group has revealed extensive intergeneric hybridization and polyploidization. In order to develop...
Effect of site-specific heterogeneous evolution on phylogenetic reconstruction: a simple evaluation.

Science.gov (United States)

Cheng, Qiqun; Su, Zhixi; Zhong, Yang; Gu, Xun

2009-07-15

Recent studies have shown that heterogeneous evolution may mislead phylogenetic analysis, which has been neglected for a long time. We evaluate the effect of heterogeneous evolution on phylogenetic analysis, using 18 fish mitogenomic coding sequences as an example. Using the software DIVERGE, we identify 198 amino acid sites that have experienced heterogeneous evolution. After removing these sites, the rest of sites are shown to be virtually homogeneous in the evolutionary rate. There are some differences between phylogenetic trees built with heterogeneous sites ("before tree") and without heterogeneous sites ("after tree"). Our study demonstrates that for phylogenetic reconstruction, an effective approach is to identify and remove sites with heterogeneous evolution, and suggests that researchers can use the software DIVERGE to remove the influence of heterogeneous evolution before reconstructing phylogenetic trees.
Phylogenetic versus functional signals in the evolution of form-function relationships in terrestrial vision.

Science.gov (United States)

Motani, Ryosuke; Schmitz, Lars

2011-08-01

Phylogeny is deeply pertinent to evolutionary studies. Traits that perform a body function are expected to be strongly influenced by physical "requirements" of the function. We investigated if such traits exhibit phylogenetic signals, and, if so, how phylogenetic noises bias quantification of form-function relationships. A form-function system that is strongly influenced by physics, namely the relationship between eye morphology and visual optics in amniotes, was used. We quantified the correlation between form (i.e., eye morphology) and function (i.e., ocular optics) while varying the level of phylogenetic bias removal through adjusting Pagel's λ. Ocular soft-tissue dimensions exhibited the highest correlation with ocular optics when 1% of phylogenetic bias expected from Brownian motion was removed (i.e., λ= 0.01); the value for hard-tissue data were 8%. A small degree of phylogenetic bias therefore exists in morphology despite of the stringent functional constraints. We also devised a phylogenetically informed discriminant analysis and recorded the effects of phylogenetic bias on this method using the same data. Use of proper λ values during phylogenetic bias removal improved misidentification rates in resulting classifications when prior probabilities were assumed to be equal. Even a small degree of phylogenetic bias affected the classification resulting from phylogenetically informed discriminant analysis. © 2011 The Author(s). Evolution© 2011 The Society for the Study of Evolution.
SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data.

Science.gov (United States)

Lee, Tae-Ho; Guo, Hui; Wang, Xiyin; Kim, Changsoo; Paterson, Andrew H

2014-02-26

Phylogenetic trees are widely used for genetic and evolutionary studies in various organisms. Advanced sequencing technology has dramatically enriched data available for constructing phylogenetic trees based on single nucleotide polymorphisms (SNPs). However, massive SNP data makes it difficult to perform reliable analysis, and there has been no ready-to-use pipeline to generate phylogenetic trees from these data. We developed a new pipeline, SNPhylo, to construct phylogenetic trees based on large SNP datasets. The pipeline may enable users to construct a phylogenetic tree from three representative SNP data file formats. In addition, in order to increase reliability of a tree, the pipeline has steps such as removing low quality data and considering linkage disequilibrium. A maximum likelihood method for the inference of phylogeny is also adopted in generation of a tree in our pipeline. Using SNPhylo, users can easily produce a reliable phylogenetic tree from a large SNP data file. Thus, this pipeline can help a researcher focus more on interpretation of the results of analysis of voluminous data sets, rather than manipulations necessary to accomplish the analysis.
BLAST-EXPLORER helps you building datasets for phylogenetic analysis

Directory of Open Access Journals (Sweden)

Claverie Jean-Michel

2010-01-01

Full Text Available Abstract Background The right sampling of homologous sequences for phylogenetic or molecular evolution analyses is a crucial step, the quality of which can have a significant impact on the final interpretation of the study. There is no single way for constructing datasets suitable for phylogenetic analysis, because this task intimately depends on the scientific question we want to address, Moreover, database mining softwares such as BLAST which are routinely used for searching homologous sequences are not specifically optimized for this task. Results To fill this gap, we designed BLAST-Explorer, an original and friendly web-based application that combines a BLAST search with a suite of tools that allows interactive, phylogenetic-oriented exploration of the BLAST results and flexible selection of homologous sequences among the BLAST hits. Once the selection of the BLAST hits is done using BLAST-Explorer, the corresponding sequence can be imported locally for external analysis or passed to the phylogenetic tree reconstruction pipelines available on the Phylogeny.fr platform. Conclusions BLAST-Explorer provides a simple, intuitive and interactive graphical representation of the BLAST results and allows selection and retrieving of the BLAST hit sequences based a wide range of criterions. Although BLAST-Explorer primarily aims at helping the construction of sequence datasets for further phylogenetic study, it can also be used as a standard BLAST server with enriched output. BLAST-Explorer is available at http://www.phylogeny.fr
Phylogenetic impoverishment of Amazonian tree communities in an experimentally fragmented forest landscape.

Science.gov (United States)

Santos, Bráulio A; Tabarelli, Marcelo; Melo, Felipe P L; Camargo, José L C; Andrade, Ana; Laurance, Susan G; Laurance, William F

2014-01-01

Amazonian rainforests sustain some of the richest tree communities on Earth, but their ecological and evolutionary responses to human threats remain poorly known. We used one of the largest experimental datasets currently available on tree dynamics in fragmented tropical forests and a recent phylogeny of angiosperms to test whether tree communities have lost phylogenetic diversity since their isolation about two decades previously. Our findings revealed an overall trend toward phylogenetic impoverishment across the experimentally fragmented landscape, irrespective of whether tree communities were in 1-ha, 10-ha, or 100-ha forest fragments, near forest edges, or in continuous forest. The magnitude of the phylogenetic diversity loss was low (phylogenetic diversity, we observed a significant decrease of 50% in phylogenetic dispersion since forest isolation, irrespective of plot location. Analyses based on tree genera that have significantly increased (28 genera) or declined (31 genera) in abundance and basal area in the landscape revealed that increasing genera are more phylogenetically related than decreasing ones. Also, the loss of phylogenetic diversity was greater in tree communities where increasing genera proliferated and decreasing genera reduced their importance values, suggesting that this taxonomic replacement is partially underlying the phylogenetic impoverishment at the landscape scale. This finding has clear implications for the current debate about the role human-modified landscapes play in sustaining biodiversity persistence and key ecosystem services, such as carbon storage. Although the generalization of our findings to other fragmented tropical forests is uncertain, it could negatively affect ecosystem productivity and stability and have broader impacts on coevolved organisms.
Phylogenetic rooting using minimal ancestor deviation.

Science.gov (United States)

Tria, Fernando Domingues Kümmel; Landan, Giddy; Dagan, Tal

2017-06-19

Ancestor-descendent relations play a cardinal role in evolutionary theory. Those relations are determined by rooting phylogenetic trees. Existing rooting methods are hampered by evolutionary rate heterogeneity or the unavailability of auxiliary phylogenetic information. Here we present a rooting approach, the minimal ancestor deviation (MAD) method, which accommodates heterotachy by using all pairwise topological and metric information in unrooted trees. We demonstrate the performance of the method, in comparison to existing rooting methods, by the analysis of phylogenies from eukaryotes and prokaryotes. MAD correctly recovers the known root of eukaryotes and uncovers evidence for the origin of cyanobacteria in the ocean. MAD is more robust and consistent than existing methods, provides measures of the root inference quality and is applicable to any tree with branch lengths.
Molecular identification and phylogenetic study of Demodex caprae.

Science.gov (United States)

Zhao, Ya-E; Cheng, Juan; Hu, Li; Ma, Jun-Xian

2014-10-01

The DNA barcode has been widely used in species identification and phylogenetic analysis since 2003, but there have been no reports in Demodex. In this study, to obtain an appropriate DNA barcode for Demodex, molecular identification of Demodex caprae based on mitochondrial cox1 was conducted. Firstly, individual adults and eggs of D. caprae were obtained for genomic DNA (gDNA) extraction; Secondly, mitochondrial cox1 fragment was amplified, cloned, and sequenced; Thirdly, cox1 fragments of D. caprae were aligned with those of other Demodex retrieved from GenBank; Finally, the intra- and inter-specific divergences were computed and the phylogenetic trees were reconstructed to analyze phylogenetic relationship in Demodex. Results obtained from seven 429-bp fragments of D. caprae showed that sequence identities were above 99.1% among three adults and four eggs. The intraspecific divergences in D. caprae, Demodex folliculorum, Demodex brevis, and Demodex canis were 0.0-0.9, 0.5-0.9, 0.0-0.2, and 0.0-0.5%, respectively, while the interspecific divergences between D. caprae and D. folliculorum, D. canis, and D. brevis were 20.3-20.9, 21.8-23.0, and 25.0-25.3, respectively. The interspecific divergences were 10 times higher than intraspecific ones, indicating considerable barcoding gap. Furthermore, the phylogenetic trees showed that four Demodex species gathered separately, representing independent species; and Demodex folliculorum gathered with canine Demodex, D. caprae, and D. brevis in sequence. In conclusion, the selected 429-bp mitochondrial cox1 gene is an appropriate DNA barcode for molecular classification, identification, and phylogenetic analysis of Demodex. D. caprae is an independent species and D. folliculorum is closer to D. canis than to D. caprae or D. brevis.
Reconstructible phylogenetic networks: do not distinguish the indistinguishable.

Science.gov (United States)

Pardi, Fabio; Scornavacca, Celine

2015-04-01

Phylogenetic networks represent the evolution of organisms that have undergone reticulate events, such as recombination, hybrid speciation or lateral gene transfer. An important way to interpret a phylogenetic network is in terms of the trees it displays, which represent all the possible histories of the characters carried by the organisms in the network. Interestingly, however, different networks may display exactly the same set of trees, an observation that poses a problem for network reconstruction: from the perspective of many inference methods such networks are "indistinguishable". This is true for all methods that evaluate a phylogenetic network solely on the basis of how well the displayed trees fit the available data, including all methods based on input data consisting of clades, triples, quartets, or trees with any number of taxa, and also sequence-based approaches such as popular formalisations of maximum parsimony and maximum likelihood for networks. This identifiability problem is partially solved by accounting for branch lengths, although this merely reduces the frequency of the problem. Here we propose that network inference methods should only attempt to reconstruct what they can uniquely identify. To this end, we introduce a novel definition of what constitutes a uniquely reconstructible network. For any given set of indistinguishable networks, we define a canonical network that, under mild assumptions, is unique and thus representative of the entire set. Given data that underwent reticulate evolution, only the canonical form of the underlying phylogenetic network can be uniquely reconstructed. While on the methodological side this will imply a drastic reduction of the solution space in network inference, for the study of reticulate evolution this is a fundamental limitation that will require an important change of perspective when interpreting phylogenetic networks.
Reconstructible phylogenetic networks: do not distinguish the indistinguishable.

Directory of Open Access Journals (Sweden)

Fabio Pardi

2015-04-01

Full Text Available Phylogenetic networks represent the evolution of organisms that have undergone reticulate events, such as recombination, hybrid speciation or lateral gene transfer. An important way to interpret a phylogenetic network is in terms of the trees it displays, which represent all the possible histories of the characters carried by the organisms in the network. Interestingly, however, different networks may display exactly the same set of trees, an observation that poses a problem for network reconstruction: from the perspective of many inference methods such networks are "indistinguishable". This is true for all methods that evaluate a phylogenetic network solely on the basis of how well the displayed trees fit the available data, including all methods based on input data consisting of clades, triples, quartets, or trees with any number of taxa, and also sequence-based approaches such as popular formalisations of maximum parsimony and maximum likelihood for networks. This identifiability problem is partially solved by accounting for branch lengths, although this merely reduces the frequency of the problem. Here we propose that network inference methods should only attempt to reconstruct what they can uniquely identify. To this end, we introduce a novel definition of what constitutes a uniquely reconstructible network. For any given set of indistinguishable networks, we define a canonical network that, under mild assumptions, is unique and thus representative of the entire set. Given data that underwent reticulate evolution, only the canonical form of the underlying phylogenetic network can be uniquely reconstructed. While on the methodological side this will imply a drastic reduction of the solution space in network inference, for the study of reticulate evolution this is a fundamental limitation that will require an important change of perspective when interpreting phylogenetic networks.
Edge-related loss of tree phylogenetic diversity in the severely fragmented Brazilian Atlantic forest.

Science.gov (United States)

Santos, Bráulio A; Arroyo-Rodríguez, Víctor; Moreno, Claudia E; Tabarelli, Marcelo

2010-09-08

Deforestation and forest fragmentation are known major causes of nonrandom extinction, but there is no information about their impact on the phylogenetic diversity of the remaining species assemblages. Using a large vegetation dataset from an old hyper-fragmented landscape in the Brazilian Atlantic rainforest we assess whether the local extirpation of tree species and functional impoverishment of tree assemblages reduce the phylogenetic diversity of the remaining tree assemblages. We detected a significant loss of tree phylogenetic diversity in forest edges, but not in core areas of small (phylogenetic distance between any two randomly chosen individuals from forest edges; an increase of 17% in the average phylogenetic distance to closest non-conspecific relative for each individual in forest edges; and to the potential manifestation of late edge effects in the core areas of small forest remnants. We found no evidence supporting fragmentation-induced phylogenetic clustering or evenness. This could be explained by the low phylogenetic conservatism of key life-history traits corresponding to vulnerable species. Edge effects must be reduced to effectively protect tree phylogenetic diversity in the severely fragmented Brazilian Atlantic forest.
A taxonomic and phylogenetic re-appraisal of the genus Curvularia

Science.gov (United States)

Species of Curvularia are important plant and human pathogens worldwide. In this study, the genus Curvularia is re-assessed based on molecular phylogenetic analysis and morphological observations of available isolates and specimens. A multi-gene phylogenetic tree inferred from ITS, TEF and GPDH gene...
Student Interpretations of Phylogenetic Trees in an Introductory Biology Course

Science.gov (United States)

Dees, Jonathan; Momsen, Jennifer L.; Niemi, Jarad; Montplaisir, Lisa

2014-01-01

Phylogenetic trees are widely used visual representations in the biological sciences and the most important visual representations in evolutionary biology. Therefore, phylogenetic trees have also become an important component of biology education. We sought to characterize reasoning used by introductory biology students in interpreting taxa…
A new algorithm to construct phylogenetic networks from trees.

Science.gov (United States)

Wang, J

2014-03-06

Developing appropriate methods for constructing phylogenetic networks from tree sets is an important problem, and much research is currently being undertaken in this area. BIMLR is an algorithm that constructs phylogenetic networks from tree sets. The algorithm can construct a much simpler network than other available methods. Here, we introduce an improved version of the BIMLR algorithm, QuickCass. QuickCass changes the selection strategy of the labels of leaves below the reticulate nodes, i.e., the nodes with an indegree of at least 2 in BIMLR. We show that QuickCass can construct simpler phylogenetic networks than BIMLR. Furthermore, we show that QuickCass is a polynomial-time algorithm when the output network that is constructed by QuickCass is binary.

Hal: an automated pipeline for phylogenetic analyses of genomic data.

Science.gov (United States)

Robbertse, Barbara; Yoder, Ryan J; Boyd, Alex; Reeves, John; Spatafora, Joseph W

2011-02-07

The rapid increase in genomic and genome-scale data is resulting in unprecedented levels of discrete sequence data available for phylogenetic analyses. Major analytical impasses exist, however, prior to analyzing these data with existing phylogenetic software. Obstacles include the management of large data sets without standardized naming conventions, identification and filtering of orthologous clusters of proteins or genes, and the assembly of alignments of orthologous sequence data into individual and concatenated super alignments. Here we report the production of an automated pipeline, Hal that produces multiple alignments and trees from genomic data. These alignments can be produced by a choice of four alignment programs and analyzed by a variety of phylogenetic programs. In short, the Hal pipeline connects the programs BLASTP, MCL, user specified alignment programs, GBlocks, ProtTest and user specified phylogenetic programs to produce species trees. The script is available at sourceforge (http://sourceforge.net/projects/bio-hal/). The results from an example analysis of Kingdom Fungi are briefly discussed.
Testing for Polytomies in Phylogenetic Species Trees Using Quartet Frequencies.

Science.gov (United States)

Sayyari, Erfan; Mirarab, Siavash

2018-02-28

Phylogenetic species trees typically represent the speciation history as a bifurcating tree. Speciation events that simultaneously create more than two descendants, thereby creating polytomies in the phylogeny, are possible. Moreover, the inability to resolve relationships is often shown as a (soft) polytomy. Both types of polytomies have been traditionally studied in the context of gene tree reconstruction from sequence data. However, polytomies in the species tree cannot be detected or ruled out without considering gene tree discordance. In this paper, we describe a statistical test based on properties of the multi-species coalescent model to test the null hypothesis that a branch in an estimated species tree should be replaced by a polytomy. On both simulated and biological datasets, we show that the null hypothesis is rejected for all but the shortest branches, and in most cases, it is retained for true polytomies. The test, available as part of the Accurate Species TRee ALgorithm (ASTRAL) package, can help systematists decide whether their datasets are sufficient to resolve specific relationships of interest.
Testing for Polytomies in Phylogenetic Species Trees Using Quartet Frequencies

Science.gov (United States)

Sayyari, Erfan

2018-01-01

Phylogenetic species trees typically represent the speciation history as a bifurcating tree. Speciation events that simultaneously create more than two descendants, thereby creating polytomies in the phylogeny, are possible. Moreover, the inability to resolve relationships is often shown as a (soft) polytomy. Both types of polytomies have been traditionally studied in the context of gene tree reconstruction from sequence data. However, polytomies in the species tree cannot be detected or ruled out without considering gene tree discordance. In this paper, we describe a statistical test based on properties of the multi-species coalescent model to test the null hypothesis that a branch in an estimated species tree should be replaced by a polytomy. On both simulated and biological datasets, we show that the null hypothesis is rejected for all but the shortest branches, and in most cases, it is retained for true polytomies. The test, available as part of the Accurate Species TRee ALgorithm (ASTRAL) package, can help systematists decide whether their datasets are sufficient to resolve specific relationships of interest. PMID:29495636
Coalescent methods for estimating phylogenetic trees.

Science.gov (United States)

Liu, Liang; Yu, Lili; Kubatko, Laura; Pearl, Dennis K; Edwards, Scott V

2009-10-01

We review recent models to estimate phylogenetic trees under the multispecies coalescent. Although the distinction between gene trees and species trees has come to the fore of phylogenetics, only recently have methods been developed that explicitly estimate species trees. Of the several factors that can cause gene tree heterogeneity and discordance with the species tree, deep coalescence due to random genetic drift in branches of the species tree has been modeled most thoroughly. Bayesian approaches to estimating species trees utilizes two likelihood functions, one of which has been widely used in traditional phylogenetics and involves the model of nucleotide substitution, and the second of which is less familiar to phylogeneticists and involves the probability distribution of gene trees given a species tree. Other recent parametric and nonparametric methods for estimating species trees involve parsimony criteria, summary statistics, supertree and consensus methods. Species tree approaches are an appropriate goal for systematics, appear to work well in some cases where concatenation can be misleading, and suggest that sampling many independent loci will be paramount. Such methods can also be challenging to implement because of the complexity of the models and computational time. In addition, further elaboration of the simplest of coalescent models will be required to incorporate commonly known issues such as deviation from the molecular clock, gene flow and other genetic forces.
SuperTRI: A new approach based on branch support analyses of multiple independent data sets for assessing reliability of phylogenetic inferences.

Science.gov (United States)

Ropiquet, Anne; Li, Blaise; Hassanin, Alexandre

2009-09-01

Supermatrix and supertree are two methods for constructing a phylogenetic tree by using multiple data sets. However, these methods are not a panacea, as conflicting signals between data sets can lead to misinterpret the evolutionary history of taxa. In particular, the supermatrix approach is expected to be misleading if the species-tree signal is not dominant after the combination of the data sets. Moreover, most current supertree methods suffer from two limitations: (i) they ignore or misinterpret secondary (non-dominant) phylogenetic signals of the different data sets; and (ii) the logical basis of node robustness measures is unclear. To overcome these limitations, we propose a new approach, called SuperTRI, which is based on the branch support analyses of the independent data sets, and where the reliability of the nodes is assessed using three measures: the supertree Bootstrap percentage and two other values calculated from the separate analyses: the mean branch support (mean Bootstrap percentage or mean posterior probability) and the reproducibility index. The SuperTRI approach is tested on a data matrix including seven genes for 82 taxa of the family Bovidae (Mammalia, Ruminantia), and the results are compared to those found with the supermatrix approach. The phylogenetic analyses of the supermatrix and independent data sets were done using four methods of tree reconstruction: Bayesian inference, maximum likelihood, and unweighted and weighted maximum parsimony. The results indicate, firstly, that the SuperTRI approach shows less sensitivity to the four phylogenetic methods, secondly, that it is more accurate to interpret the relationships among taxa, and thirdly, that interesting conclusions on introgression and radiation can be drawn from the comparisons between SuperTRI and supermatrix analyses.
Phylogenetic stratigraphy in the Guerrero Negro hypersaline microbial mat.

Science.gov (United States)

Harris, J Kirk; Caporaso, J Gregory; Walker, Jeffrey J; Spear, John R; Gold, Nicholas J; Robertson, Charles E; Hugenholtz, Philip; Goodrich, Julia; McDonald, Daniel; Knights, Dan; Marshall, Paul; Tufo, Henry; Knight, Rob; Pace, Norman R

2013-01-01

The microbial mats of Guerrero Negro (GN), Baja California Sur, Mexico historically were considered a simple environment, dominated by cyanobacteria and sulfate-reducing bacteria. Culture-independent rRNA community profiling instead revealed these microbial mats as among the most phylogenetically diverse environments known. A preliminary molecular survey of the GN mat based on only ∼1500 small subunit rRNA gene sequences discovered several new phylum-level groups in the bacterial phylogenetic domain and many previously undetected lower-level taxa. We determined an additional ∼119,000 nearly full-length sequences and 28,000 >200 nucleotide 454 reads from a 10-layer depth profile of the GN mat. With this unprecedented coverage of long sequences from one environment, we confirm the mat is phylogenetically stratified, presumably corresponding to light and geochemical gradients throughout the depth of the mat. Previous shotgun metagenomic data from the same depth profile show the same stratified pattern and suggest that metagenome properties may be predictable from rRNA gene sequences. We verify previously identified novel lineages and identify new phylogenetic diversity at lower taxonomic levels, for example, thousands of operational taxonomic units at the family-genus levels differ considerably from known sequences. The new sequences populate parts of the bacterial phylogenetic tree that previously were poorly described, but indicate that any comprehensive survey of GN diversity has only begun. Finally, we show that taxonomic conclusions are generally congruent between Sanger and 454 sequencing technologies, with the taxonomic resolution achieved dependent on the abundance of reference sequences in the relevant region of the rRNA tree of life.
New substitution models for rooting phylogenetic trees.

Science.gov (United States)

Williams, Tom A; Heaps, Sarah E; Cherlin, Svetlana; Nye, Tom M W; Boys, Richard J; Embley, T Martin

2015-09-26

The root of a phylogenetic tree is fundamental to its biological interpretation, but standard substitution models do not provide any information on its position. Here, we describe two recently developed models that relax the usual assumptions of stationarity and reversibility, thereby facilitating root inference without the need for an outgroup. We compare the performance of these models on a classic test case for phylogenetic methods, before considering two highly topical questions in evolutionary biology: the deep structure of the tree of life and the root of the archaeal radiation. We show that all three alignments contain meaningful rooting information that can be harnessed by these new models, thus complementing and extending previous work based on outgroup rooting. In particular, our analyses exclude the root of the tree of life from the eukaryotes or Archaea, placing it on the bacterial stem or within the Bacteria. They also exclude the root of the archaeal radiation from several major clades, consistent with analyses using other rooting methods. Overall, our results demonstrate the utility of non-reversible and non-stationary models for rooting phylogenetic trees, and identify areas where further progress can be made. © 2015 The Authors.
Nucleotide diversity and phylogenetic relationships among ...

Indian Academy of Sciences (India)

2017-03-03

Mar 3, 2017 ... 2Department of Botany, D. S. B. Campus, Kumaun University, Nainital 263 001, India ... Rana T. S. 2017 Nucleotide diversity and phylogenetic relationships ... Anderson and Park 1989). ..... Edgewood Press, Edgewood, USA.
Marine turtle mitogenome phylogenetics and evolution

DEFF Research Database (Denmark)

Duchene, Sebastián; Frey, Amy; Alfaro-Núñez, Luis Alonso

2012-01-01

The sea turtles are a group of cretaceous origin containing seven recognized living species: leatherback, hawksbill, Kemp's ridley, olive ridley, loggerhead, green, and flatback. The leatherback is the single member of the Dermochelidae family, whereas all other sea turtles belong in Cheloniidae...... distributions, shedding light on complex migration patterns and possible geographic or climatic events as driving forces of sea-turtle distribution. We have sequenced complete mitogenomes for all sea-turtle species, including samples from their geographic range extremes, and performed phylogenetic analyses...... to assess sea-turtle evolution with a large molecular dataset. We found variation in the length of the ATP8 gene and a highly variable site in ND4 near a proton translocation channel in the resulting protein. Complete mitogenomes show strong support and resolution for phylogenetic relationships among all...
Inferring phylogenetic trees from the knowledge of rare evolutionary events.

Science.gov (United States)

Hellmuth, Marc; Hernandez-Rosales, Maribel; Long, Yangjing; Stadler, Peter F

2018-06-01

Rare events have played an increasing role in molecular phylogenetics as potentially homoplasy-poor characters. In this contribution we analyze the phylogenetic information content from a combinatorial point of view by considering the binary relation on the set of taxa defined by the existence of a single event separating two taxa. We show that the graph-representation of this relation must be a tree. Moreover, we characterize completely the relationship between the tree of such relations and the underlying phylogenetic tree. With directed operations such as tandem-duplication-random-loss events in mind we demonstrate how non-symmetric information constrains the position of the root in the partially reconstructed phylogeny.
A RAD-based phylogenetics for Orestias fishes from Lake Titicaca.

Science.gov (United States)

Takahashi, Tetsumi; Moreno, Edmundo

2015-12-01

The fish genus Orestias is endemic to the Andes highlands, and Lake Titicaca is the centre of the species diversity of the genus. Previous phylogenetic studies based on a single locus of mitochondrial and nuclear DNA strongly support the monophyly of a group composed of many of species endemic to the Lake Titicaca basin (the Lake Titicaca radiation), but the relationships among the species in the radiation remain unclear. Recently, restriction site-associated DNA (RAD) sequencing, which can produce a vast number of short sequences from various loci of nuclear DNA, has emerged as a useful way to resolve complex phylogenetic problems. To propose a new phylogenetic hypothesis of Orestias fishes of the Lake Titicaca radiation, we conducted a cluster analysis based on morphological similarities among fish samples and a molecular phylogenetic analysis based on RAD sequencing. From a morphological cluster analysis, we recognised four species groups in the radiation, and three of the four groups were resolved as monophyletic groups in maximum-likelihood trees based on RAD sequencing data. The other morphology-based group was not resolved as a monophyletic group in molecular phylogenies, and some members of the group were diverged from its sister group close to the root of the Lake Titicaca radiation. The evolution of these fishes is discussed from the phylogenetic relationships. Copyright © 2015 Elsevier Inc. All rights reserved.
Some limitations of public sequence data for phylogenetic inference (in plants).

Science.gov (United States)

Hinchliff, Cody E; Smith, Stephen Andrew

2014-01-01

The GenBank database contains essentially all of the nucleotide sequence data generated for published molecular systematic studies, but for the majority of taxa these data remain sparse. GenBank has value for phylogenetic methods that leverage data-mining and rapidly improving computational methods, but the limits imposed by the sparse structure of the data are not well understood. Here we present a tree representing 13,093 land plant genera--an estimated 80% of extant plant diversity--to illustrate the potential of public sequence data for broad phylogenetic inference in plants, and we explore the limits to inference imposed by the structure of these data using theoretical foundations from phylogenetic data decisiveness. We find that despite very high levels of missing data (over 96%), the present data retain the potential to inform over 86.3% of all possible phylogenetic relationships. Most of these relationships, however, are informed by small amounts of data--approximately half are informed by fewer than four loci, and more than 99% are informed by fewer than fifteen. We also apply an information theoretic measure of branch support to assess the strength of phylogenetic signal in the data, revealing many poorly supported branches concentrated near the tips of the tree, where data are sparse and the limiting effects of this sparseness are stronger. We argue that limits to phylogenetic inference and signal imposed by low data coverage may pose significant challenges for comprehensive phylogenetic inference at the species level. Computational requirements provide additional limits for large reconstructions, but these may be overcome by methodological advances, whereas insufficient data coverage can only be remedied by additional sampling effort. We conclude that public databases have exceptional value for modern systematics and evolutionary biology, and that a continued emphasis on expanding taxonomic and genomic coverage will play a critical role in developing
Plant Biodiversity Drivers in Brazilian Campos Rupestres: Insights from Phylogenetic Structure.

Science.gov (United States)

Zappi, Daniela C; Moro, Marcelo F; Meagher, Thomas R; Nic Lughadha, Eimear

2017-01-01

Old, climate-buffered infertile landscapes (Ocbils) have attracted increasing levels of interest in recent years because of their exceptionally diverse plant communities. Brazil's campos rupestres (rupestrian grasslands) are home to almost 15% of Brazil's native flora in less than 0.8% of Brazil's territory: an ideal study system for exploring variation in floristic diversity and phylogenetic structure in sites differing in geology and phytophysiognomy. We found significant differences in floristic diversity and phylogenetic structure across a range of study sites encompassing open vegetation and forest on quartzite (FQ) and on ironstone substrates, commonly termed canga . Substrate and physiognomy were key in structuring floristic diversity in the Espinhaço and physiognomy was more important than substrate in structuring phylogenetic diversity, with neither substrate nor its interaction with physiognomy accounting for significant variation in phylogenetic structure. Phylogenetic clustering was significant in open vegetation on both canga and quartzite, reflecting the potential role of environmental filtering in these exposed montane communities adapted to multiple environmental stressors. In forest communities, phylogenetic clustering was significant only at relatively deep nodes of the phylogeny in FQ while no significant phylogenetic clustering was detected across forest on canga (FC), which may be attributable to proximity to the megadiverse Atlantic forest biome and/or comparatively benign environmental conditions in FC with relatively deep, nutrient-rich soils and access to edaphic water reliable in comparison to those for open vegetation on canga and open or forest communities on quartzite. Clades representing relatively old lineages are significantly over-represented in campos rupestres on quartzite, consistent with the Gondwanan Heritage Hypothesis of Ocbil theory. In contrast, forested sites on canga are recognized as Yodfels. To be effective
Plant Biodiversity Drivers in Brazilian Campos Rupestres: Insights from Phylogenetic Structure

Directory of Open Access Journals (Sweden)

Daniela C. Zappi

2017-12-01

Full Text Available Old, climate-buffered infertile landscapes (Ocbils have attracted increasing levels of interest in recent years because of their exceptionally diverse plant communities. Brazil’s campos rupestres (rupestrian grasslands are home to almost 15% of Brazil’s native flora in less than 0.8% of Brazil’s territory: an ideal study system for exploring variation in floristic diversity and phylogenetic structure in sites differing in geology and phytophysiognomy. We found significant differences in floristic diversity and phylogenetic structure across a range of study sites encompassing open vegetation and forest on quartzite (FQ and on ironstone substrates, commonly termed canga. Substrate and physiognomy were key in structuring floristic diversity in the Espinhaço and physiognomy was more important than substrate in structuring phylogenetic diversity, with neither substrate nor its interaction with physiognomy accounting for significant variation in phylogenetic structure. Phylogenetic clustering was significant in open vegetation on both canga and quartzite, reflecting the potential role of environmental filtering in these exposed montane communities adapted to multiple environmental stressors. In forest communities, phylogenetic clustering was significant only at relatively deep nodes of the phylogeny in FQ while no significant phylogenetic clustering was detected across forest on canga (FC, which may be attributable to proximity to the megadiverse Atlantic forest biome and/or comparatively benign environmental conditions in FC with relatively deep, nutrient-rich soils and access to edaphic water reliable in comparison to those for open vegetation on canga and open or forest communities on quartzite. Clades representing relatively old lineages are significantly over-represented in campos rupestres on quartzite, consistent with the Gondwanan Heritage Hypothesis of Ocbil theory. In contrast, forested sites on canga are recognized as Yodfels. To be
Keeping All the PIECES: Phylogenetically Informed Ex Situ Conservation of Endangered Species.

Science.gov (United States)

Larkin, Daniel J; Jacobi, Sarah K; Hipp, Andrew L; Kramer, Andrea T

2016-01-01

Ex situ conservation in germplasm and living collections is a major focus of global plant conservation strategies. Prioritizing species for ex situ collection is a necessary component of this effort for which sound strategies are needed. Phylogenetic considerations can play an important role in prioritization. Collections that are more phylogenetically diverse are likely to encompass more ecological and trait variation, and thus provide stronger conservation insurance and richer resources for future restoration efforts. However, phylogenetic criteria need to be weighed against other, potentially competing objectives. We used ex situ collection and threat rank data for North American angiosperms to investigate gaps in ex situ coverage and phylogenetic diversity of collections and to develop a flexible framework for prioritizing species across multiple objectives. We found that ex situ coverage of 18,766 North American angiosperm taxa was low with respect to the most vulnerable taxa: just 43% of vulnerable to critically imperiled taxa were in ex situ collections, far short of a year-2020 goal of 75%. In addition, species held in ex situ collections were phylogenetically clustered (P species been drawn at random. These patterns support incorporating phylogenetic considerations into ex situ prioritization in a manner balanced with other criteria, such as vulnerability. To meet this need, we present the 'PIECES' index (Phylogenetically Informed Ex situ Conservation of Endangered Species). PIECES integrates phylogenetic considerations into a flexible framework for prioritizing species across competing objectives using multi-criteria decision analysis. Applying PIECES to prioritizing ex situ conservation of North American angiosperms, we show strong return on investment across multiple objectives, some of which are negatively correlated with each other. A spreadsheet-based decision support tool for North American angiosperms is provided; this tool can be customized to
GenNon-h: Generating multiple sequence alignments on nonhomogeneous phylogenetic trees

Directory of Open Access Journals (Sweden)

Kedzierska Anna M

2012-08-01

Full Text Available Abstract Background A number of software packages are available to generate DNA multiple sequence alignments (MSAs evolved under continuous-time Markov processes on phylogenetic trees. On the other hand, methods of simulating the DNA MSA directly from the transition matrices do not exist. Moreover, existing software restricts to the time-reversible models and it is not optimized to generate nonhomogeneous data (i.e. placing distinct substitution rates at different lineages. Results We present the first package designed to generate MSAs evolving under discrete-time Markov processes on phylogenetic trees, directly from probability substitution matrices. Based on the input model and a phylogenetic tree in the Newick format (with branch lengths measured as the expected number of substitutions per site, the algorithm produces DNA alignments of desired length. GenNon-h is publicly available for download. Conclusion The software presented here is an efficient tool to generate DNA MSAs on a given phylogenetic tree. GenNon-h provides the user with the nonstationary or nonhomogeneous phylogenetic data that is well suited for testing complex biological hypotheses, exploring the limits of the reconstruction algorithms and their robustness to such models.
A Multi-Criterion Evolutionary Approach Applied to Phylogenetic Reconstruction

OpenAIRE

Cancino, W.; Delbem, A.C.B.

2010-01-01

In this paper, we proposed an MOEA approach, called PhyloMOEA which solves the phylogenetic inference problem using maximum parsimony and maximum likelihood criteria. The PhyloMOEA's development was motivated by several studies in the literature (Huelsenbeck, 1995; Jin & Nei, 1990; Kuhner & Felsenstein, 1994; Tateno et al., 1994), which point out that various phylogenetic inference methods lead to inconsistent solutions. Techniques using parsimony and likelihood criteria yield to different tr...
Taxonomic colouring of phylogenetic trees of protein sequences

Directory of Open Access Journals (Sweden)

Andrade-Navarro Miguel A

2006-02-01

Full Text Available Abstract Background Phylogenetic analyses of protein families are used to define the evolutionary relationships between homologous proteins. The interpretation of protein-sequence phylogenetic trees requires the examination of the taxonomic properties of the species associated to those sequences. However, there is no online tool to facilitate this interpretation, for example, by automatically attaching taxonomic information to the nodes of a tree, or by interactively colouring the branches of a tree according to any combination of taxonomic divisions. This is especially problematic if the tree contains on the order of hundreds of sequences, which, given the accelerated increase in the size of the protein sequence databases, is a situation that is becoming common. Results We have developed PhyloView, a web based tool for colouring phylogenetic trees upon arbitrary taxonomic properties of the species represented in a protein sequence phylogenetic tree. Provided that the tree contains SwissProt, SpTrembl, or GenBank protein identifiers, the tool retrieves the taxonomic information from the corresponding database. A colour picker displays a summary of the findings and allows the user to associate colours to the leaves of the tree according to any number of taxonomic partitions. Then, the colours are propagated to the branches of the tree. Conclusion PhyloView can be used at http://www.ogic.ca/projects/phyloview/. A tutorial, the software with documentation, and GPL licensed source code, can be accessed at the same web address.
Phylogenetic systematics of the genus Echinococcus (Cestoda: Taeniidae).

Science.gov (United States)

Nakao, Minoru; Lavikainen, Antti; Yanagida, Tetsuya; Ito, Akira

2013-11-01

Echinococcosis is a serious helminthic zoonosis in humans, livestock and wildlife. The pathogenic organisms are members of the genus Echinococcus (Cestoda: Taeniidae). Life cycles of Echinococcus spp. are consistently dependent on predator-prey association between two obligate mammalian hosts. Carnivores (canids and felids) serve as definitive hosts for adult tapeworms and their herbivore prey (ungulates, rodents and lagomorphs) as intermediate hosts for metacestode larvae. Humans are involved as an accidental host for metacestode infections. The metacestodes develop in various internal organs, particularly in liver and lungs. Each metacestode of Echinococcus spp. has an organotropism and a characteristic form known as an unilocular (cystic), alveolar or polycystic hydatid. Recent molecular phylogenetic studies have demonstrated that the type species, Echinococcus granulosus, causing cystic echinococcosis is a cryptic species complex. Therefore, the orthodox taxonomy of Echinococcus established from morphological criteria has been revised from the standpoint of phylogenetic systematics. Nine valid species including newly resurrected taxa are recognised as a result of the revision. This review summarises the recent advances in the phylogenetic systematics of Echinococcus, together with the historical backgrounds and molecular epidemiological aspects of each species. A new phylogenetic tree inferred from the mitochondrial genomes of all valid Echinococcus spp. is also presented. The taxonomic nomenclature for Echinococcus oligarthrus is shown to be incorrect and this name should be replaced with Echinococcus oligarthra. Copyright © 2013 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
A reconstruction problem for a class of phylogenetic networks with lateral gene transfers.

Science.gov (United States)

Cardona, Gabriel; Pons, Joan Carles; Rosselló, Francesc

2015-01-01

Lateral, or Horizontal, Gene Transfers are a type of asymmetric evolutionary events where genetic material is transferred from one species to another. In this paper we consider LGT networks, a general model of phylogenetic networks with lateral gene transfers which consist, roughly, of a principal rooted tree with its leaves labelled on a set of taxa, and a set of extra secondary arcs between nodes in this tree representing lateral gene transfers. An LGT network gives rise in a natural way to a principal phylogenetic subtree and a set of secondary phylogenetic subtrees, which, roughly, represent, respectively, the main line of evolution of most genes and the secondary lines of evolution through lateral gene transfers. We introduce a set of simple conditions on an LGT network that guarantee that its principal and secondary phylogenetic subtrees are pairwise different and that these subtrees determine, up to isomorphism, the LGT network. We then give an algorithm that, given a set of pairwise different phylogenetic trees [Formula: see text] on the same set of taxa, outputs, when it exists, the LGT network that satisfies these conditions and such that its principal phylogenetic tree is [Formula: see text] and its secondary phylogenetic trees are [Formula: see text].

Computing the Skewness of the Phylogenetic Mean Pairwise Distance in Linear Time

DEFF Research Database (Denmark)

Tsirogiannis, Constantinos; Sandel, Brody Steven

2014-01-01

The phylogenetic Mean Pairwise Distance (MPD) is one of the most popular measures for computing the phylogenetic distance between a given group of species. More specifically, for a phylogenetic tree and for a set of species R represented by a subset of the leaf nodes of , the MPD of R is equal...... to the average cost of all possible simple paths in that connect pairs of nodes in R. Among other phylogenetic measures, the MPD is used as a tool for deciding if the species of a given group R are closely related. To do this, it is important to compute not only the value of the MPD for this group but also...
The Fair Proportion is a Shapley Value on phylogenetic networks too

OpenAIRE

Coronado, Tomás M.; Riera, Gabriel; Rosselló, Francesc

2018-01-01

The Fair Proportion of a species in a phylogenetic tree is a very simple measure that has been used to assess its value relative to the overall phylogenetic diversity represented by the tree. It has recently been proved by Fuchs and Jin to be equal to the Shapley Value of the coallitional game that sends each subset of species to its rooted Phylogenetic Diversity in the tree. We prove in this paper that this result extends to the natural translations of the Fair Proportion and the rooted Phyl...
PhyLIS: a simple GNU/Linux distribution for phylogenetics and phyloinformatics.

Science.gov (United States)

Thomson, Robert C

2009-07-30

PhyLIS is a free GNU/Linux distribution that is designed to provide a simple, standardized platform for phylogenetic and phyloinformatic analysis. The operating system incorporates most commonly used phylogenetic software, which has been pre-compiled and pre-configured, allowing for straightforward application of phylogenetic methods and development of phyloinformatic pipelines in a stable Linux environment. The software is distributed as a live CD and can be installed directly or run from the CD without making changes to the computer. PhyLIS is available for free at http://www.eve.ucdavis.edu/rcthomson/phylis/.
Edge-related loss of tree phylogenetic diversity in the severely fragmented Brazilian Atlantic forest.

Directory of Open Access Journals (Sweden)

Bráulio A Santos

Full Text Available Deforestation and forest fragmentation are known major causes of nonrandom extinction, but there is no information about their impact on the phylogenetic diversity of the remaining species assemblages. Using a large vegetation dataset from an old hyper-fragmented landscape in the Brazilian Atlantic rainforest we assess whether the local extirpation of tree species and functional impoverishment of tree assemblages reduce the phylogenetic diversity of the remaining tree assemblages. We detected a significant loss of tree phylogenetic diversity in forest edges, but not in core areas of small (<80 ha forest fragments. This was attributed to a reduction of 11% in the average phylogenetic distance between any two randomly chosen individuals from forest edges; an increase of 17% in the average phylogenetic distance to closest non-conspecific relative for each individual in forest edges; and to the potential manifestation of late edge effects in the core areas of small forest remnants. We found no evidence supporting fragmentation-induced phylogenetic clustering or evenness. This could be explained by the low phylogenetic conservatism of key life-history traits corresponding to vulnerable species. Edge effects must be reduced to effectively protect tree phylogenetic diversity in the severely fragmented Brazilian Atlantic forest.
Phylogenetic reconstruction of orthology, paralogy, and conserved synteny for dog and human.

Science.gov (United States)

Goodstadt, Leo; Ponting, Chris P

2006-09-29

Accurate predictions of orthology and paralogy relationships are necessary to infer human molecular function from experiments in model organisms. Previous genome-scale approaches to predicting these relationships have been limited by their use of protein similarity and their failure to take into account multiple splicing events and gene prediction errors. We have developed PhyOP, a new phylogenetic orthology prediction pipeline based on synonymous rate estimates, which accurately predicts orthology and paralogy relationships for transcripts, genes, exons, or genomic segments between closely related genomes. We were able to identify orthologue relationships to human genes for 93% of all dog genes from Ensembl. Among 1:1 orthologues, the alignments covered a median of 97.4% of protein sequences, and 92% of orthologues shared essentially identical gene structures. PhyOP accurately recapitulated genomic maps of conserved synteny. Benchmarking against predictions from Ensembl and Inparanoid showed that PhyOP is more accurate, especially in its predictions of paralogy. Nearly half (46%) of PhyOP paralogy predictions are unique. Using PhyOP to investigate orthologues and paralogues in the human and dog genomes, we found that the human assembly contains 3-fold more gene duplications than the dog. Species-specific duplicate genes, or "in-paralogues," are generally shorter and have fewer exons than 1:1 orthologues, which is consistent with selective constraints and mutation biases based on the sizes of duplicated genes. In-paralogues have experienced elevated amino acid and synonymous nucleotide substitution rates. Duplicates possess similar biological functions for either the dog or human lineages. Having accounted for 2,954 likely pseudogenes and gene fragments, and after separating 346 erroneously merged genes, we estimated that the human genome encodes a minimum of 19,700 protein-coding genes, similar to the gene count of nematode worms. PhyOP is a fast and robust
Phylogenetic reconstruction of orthology, paralogy, and conserved synteny for dog and human.

Directory of Open Access Journals (Sweden)

Leo Goodstadt

2006-09-01

Full Text Available Accurate predictions of orthology and paralogy relationships are necessary to infer human molecular function from experiments in model organisms. Previous genome-scale approaches to predicting these relationships have been limited by their use of protein similarity and their failure to take into account multiple splicing events and gene prediction errors. We have developed PhyOP, a new phylogenetic orthology prediction pipeline based on synonymous rate estimates, which accurately predicts orthology and paralogy relationships for transcripts, genes, exons, or genomic segments between closely related genomes. We were able to identify orthologue relationships to human genes for 93% of all dog genes from Ensembl. Among 1:1 orthologues, the alignments covered a median of 97.4% of protein sequences, and 92% of orthologues shared essentially identical gene structures. PhyOP accurately recapitulated genomic maps of conserved synteny. Benchmarking against predictions from Ensembl and Inparanoid showed that PhyOP is more accurate, especially in its predictions of paralogy. Nearly half (46% of PhyOP paralogy predictions are unique. Using PhyOP to investigate orthologues and paralogues in the human and dog genomes, we found that the human assembly contains 3-fold more gene duplications than the dog. Species-specific duplicate genes, or "in-paralogues," are generally shorter and have fewer exons than 1:1 orthologues, which is consistent with selective constraints and mutation biases based on the sizes of duplicated genes. In-paralogues have experienced elevated amino acid and synonymous nucleotide substitution rates. Duplicates possess similar biological functions for either the dog or human lineages. Having accounted for 2,954 likely pseudogenes and gene fragments, and after separating 346 erroneously merged genes, we estimated that the human genome encodes a minimum of 19,700 protein-coding genes, similar to the gene count of nematode worms. PhyOP is a
Phylogenetic representativeness: a new method for evaluating taxon sampling in evolutionary studies

Directory of Open Access Journals (Sweden)

Passamonti Marco

2010-04-01

Full Text Available Abstract Background Taxon sampling is a major concern in phylogenetic studies. Incomplete, biased, or improper taxon sampling can lead to misleading results in reconstructing evolutionary relationships. Several theoretical methods are available to optimize taxon choice in phylogenetic analyses. However, most involve some knowledge about the genetic relationships of the group of interest (i.e., the ingroup, or even a well-established phylogeny itself; these data are not always available in general phylogenetic applications. Results We propose a new method to assess taxon sampling developing Clarke and Warwick statistics. This method aims to measure the "phylogenetic representativeness" of a given sample or set of samples and it is based entirely on the pre-existing available taxonomy of the ingroup, which is commonly known to investigators. Moreover, our method also accounts for instability and discordance in taxonomies. A Python-based script suite, called PhyRe, has been developed to implement all analyses we describe in this paper. Conclusions We show that this method is sensitive and allows direct discrimination between representative and unrepresentative samples. It is also informative about the addition of taxa to improve taxonomic coverage of the ingroup. Provided that the investigators' expertise is mandatory in this field, phylogenetic representativeness makes up an objective touchstone in planning phylogenetic studies.
The ethnobotany of psychoactive plant use: a phylogenetic perspective

Directory of Open Access Journals (Sweden)

Nashmiah Aid Alrashedy

2016-10-01

Full Text Available Psychoactive plants contain chemicals that presumably evolved as allelochemicals but target certain neuronal receptors when consumed by humans, altering perception, emotion and cognition. These plants have been used since ancient times as medicines and in the context of religious rituals for their various psychoactive effects (e.g., as hallucinogens, stimulants, sedatives. The ubiquity of psychoactive plants in various cultures motivates investigation of the commonalities among these plants, in which a phylogenetic framework may be insightful. A phylogeny of culturally diverse psychoactive plant taxa was constructed with their psychotropic effects and affected neurotransmitter systems mapped on the phylogeny. The phylogenetic distribution shows multiple evolutionary origins of psychoactive families. The plant families Myristicaceae (e.g., nutmeg, Papaveraceae (opium poppy, Cactaceae (peyote, Convolvulaceae (morning glory, Solanaceae (tobacco, Lamiaceae (mints, Apocynaceae (dogbane have a disproportionate number of psychoactive genera with various indigenous groups using geographically disparate members of these plant families for the same psychoactive effect, an example of cultural convergence. Pharmacological traits related to hallucinogenic and sedative potential are phylogenetically conserved within families. Unrelated families that exert similar psychoactive effects also modulate similar neurotransmitter systems (i.e., mechanistic convergence. However, pharmacological mechanisms for stimulant effects were varied even within families suggesting that stimulant chemicals may be more evolutionarily labile than those associated with hallucinogenic and sedative effects. Chemically similar psychoactive chemicals may also exist in phylogenetically unrelated lineages, suggesting convergent evolution or differential gene regulation of a common metabolic pathway. Our study has shown that phylogenetic analysis of traditionally used psychoactive plants
Phylogenetic constrains on mycorrhizal specificity in eight Dendrobium (Orchidaceae) species.

Science.gov (United States)

Xing, Xiaoke; Ma, Xueting; Men, Jinxin; Chen, Yanhong; Guo, Shunxing

2017-05-01

Plant phylogeny constrains orchid mycorrhizal (OrM) fungal community composition in some orchids. Here, we investigated the structures of the OrM fungal communities of eight Dendrobium species in one niche to determine whether similarities in the OrM fungal communities correlated with the phylogeny of the host plants and whether the Dendrobium-OrM fungal interactions are phylogenetically conserved. A phylogeny based on DNA data was constructed for the eight coexisting Dendrobium species, and the OrM fungal communities were characterized by their roots. There were 31 different fungal lineages associated with the eight Dendrobium species. In total, 82.98% of the identified associations belonging to Tulasnellaceae, and a smaller proportion involved members of the unknown Basidiomycota (9.67%). Community analyses revealed that phylogenetically related Dendrobium tended to interact with a similar set of Tulasnellaceae fungi. The interactions between Dendrobium and Tulasnellaceae fungi were significantly influenced by the phylogenetic relationships among the Dendrobium species. Our results provide evidence that the mycorrhizal specificity in the eight coexisting Dendrobium species was phylogenetically conserved.
Dimensional Reduction for the General Markov Model on Phylogenetic Trees.

Science.gov (United States)

Sumner, Jeremy G

2017-03-01

We present a method of dimensional reduction for the general Markov model of sequence evolution on a phylogenetic tree. We show that taking certain linear combinations of the associated random variables (site pattern counts) reduces the dimensionality of the model from exponential in the number of extant taxa, to quadratic in the number of taxa, while retaining the ability to statistically identify phylogenetic divergence events. A key feature is the identification of an invariant subspace which depends only bilinearly on the model parameters, in contrast to the usual multi-linear dependence in the full space. We discuss potential applications including the computation of split (edge) weights on phylogenetic trees from observed sequence data.
The Drosophila bipectinata species complex: phylogenetic ...

Indian Academy of Sciences (India)

PARUL BANERJEE

c Indian Academy of Sciences. RESEARCH ARTICLE. The Drosophila bipectinata species complex: phylogenetic relationship among different members based on chromosomal variations. PARUL BANERJEE and BASHISTH N. SINGH. ∗. Genetics Laboratory, Department of Zoology, Banaras Hindu University, Varanasi ...
Elongation Factor-1α Accurately Reconstructs Relationships Amongst Psyllid Families (Hemiptera: Psylloidea), with Possible Diagnostic Implications.

Science.gov (United States)

Martoni, Francesco; Bulman, Simon R; Pitman, Andrew; Armstrong, Karen F

2017-12-05

The superfamily Psylloidea (Hemiptera: Sternorrhyncha) lacks a robust multigene phylogeny. This impedes our understanding of the evolution of this group of insects and, consequently, an accurate identification of individuals, of their plant host associations, and their roles as vectors of economically important plant pathogens. The conserved nuclear gene elongation factor-1 alpha (EF-1α) has been valuable as a higher-level phylogenetic marker in insects and it has also been widely used to investigate the evolution of intron/exon structure. To explore evolutionary relationships among Psylloidea, polymerase chain reaction amplification and nucleotide sequencing of a 250-bp EF-1α gene fragment was applied to psyllids belonging to five different families. Introns were detected in three individuals belonging to two families. The nine genera belonging to the family Aphalaridae all lacked introns, highlighting the possibility of using intron presence/absence as a diagnostic tool at a family level. When paired with cytochrome oxidase I gene sequences, the 250 bp EF-1α sequence appeared to be a very promising higher-level phylogenetic marker for psyllids. © The Author(s) 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Ecological and phylogenetic influences on maxillary dentition in snakes

Directory of Open Access Journals (Sweden)

Kate Jackson

2010-12-01

Full Text Available The maxillary dentition of snakes was used as a system with which to investigate the relative importance of the interacting forces of ecological selective pressures and phylogenetic constraints indetermining morphology. The maxillary morphology of three groups of snakes having different diets, with each group comprising two distinct lineages — boids and colubroids — was examined. Our results suggest that dietary selective pressures may be more significantthan phylogenetic history in shaping maxillary morphology.
Two results on expected values of imbalance indices of phylogenetic trees

OpenAIRE

Mir, Arnau; Rossello, Francesc

2012-01-01

We compute an explicit formula for the expected value of the Colless index of a phylogenetic tree generated under the Yule model, and an explicit formula for the expected value of the Sackin index of a phylogenetic tree generated under the uniform model.
On the Shapley Value of Unrooted Phylogenetic Trees.

Science.gov (United States)

Wicke, Kristina; Fischer, Mareike

2018-01-17

The Shapley value, a solution concept from cooperative game theory, has recently been considered for both unrooted and rooted phylogenetic trees. Here, we focus on the Shapley value of unrooted trees and first revisit the so-called split counts of a phylogenetic tree and the Shapley transformation matrix that allows for the calculation of the Shapley value from the edge lengths of a tree. We show that non-isomorphic trees may have permutation-equivalent Shapley transformation matrices and permutation-equivalent null spaces. This implies that estimating the split counts associated with a tree or the Shapley values of its leaves does not suffice to reconstruct the correct tree topology. We then turn to the use of the Shapley value as a prioritization criterion in biodiversity conservation and compare it to a greedy solution concept. Here, we show that for certain phylogenetic trees, the Shapley value may fail as a prioritization criterion, meaning that the diversity spanned by the top k species (ranked by their Shapley values) cannot approximate the total diversity of all n species.
Rooting the tree of life: the phylogenetic jury is still out.

Science.gov (United States)

Gouy, Richard; Baurain, Denis; Philippe, Hervé

2015-09-26

This article aims to shed light on difficulties in rooting the tree of life (ToL) and to explore the (sociological) reasons underlying the limited interest in accurately addressing this fundamental issue. First, we briefly review the difficulties plaguing phylogenetic inference and the ways to improve the modelling of the substitution process, which is highly heterogeneous, both across sites and over time. We further observe that enriched taxon samplings, better gene samplings and clever data removal strategies have led to numerous revisions of the ToL, and that these improved shallow phylogenies nearly always relocate simple organisms higher in the ToL provided that long-branch attraction artefacts are kept at bay. Then, we note that, despite the flood of genomic data available since 2000, there has been a surprisingly low interest in inferring the root of the ToL. Furthermore, the rare studies dealing with this question were almost always based on methods dating from the 1990s that have been shown to be inaccurate for much more shallow issues! This leads us to argue that the current consensus about a bacterial root for the ToL can be traced back to the prejudice of Aristotle's Great Chain of Beings, in which simple organisms are ancestors of more complex life forms. Finally, we demonstrate that even the best models cannot yet handle the complexity of the evolutionary process encountered both at shallow depth, when the outgroup is too distant, and at the level of the inter-domain relationships. Altogether, we conclude that the commonly accepted bacterial root is still unproven and that the root of the ToL should be revisited using phylogenomic supermatrices to ensure that new evidence for eukaryogenesis, such as the recently described Lokiarcheota, is interpreted in a sound phylogenetic framework. © 2015 The Author(s).
Evaluating Fast Maximum Likelihood-Based Phylogenetic Programs Using Empirical Phylogenomic Data Sets

Science.gov (United States)

Zhou, Xiaofan; Shen, Xing-Xing; Hittinger, Chris Todd

2018-01-01

Abstract The sizes of the data matrices assembled to resolve branches of the tree of life have increased dramatically, motivating the development of programs for fast, yet accurate, inference. For example, several different fast programs have been developed in the very popular maximum likelihood framework, including RAxML/ExaML, PhyML, IQ-TREE, and FastTree. Although these programs are widely used, a systematic evaluation and comparison of their performance using empirical genome-scale data matrices has so far been lacking. To address this question, we evaluated these four programs on 19 empirical phylogenomic data sets with hundreds to thousands of genes and up to 200 taxa with respect to likelihood maximization, tree topology, and computational speed. For single-gene tree inference, we found that the more exhaustive and slower strategies (ten searches per alignment) outperformed faster strategies (one tree search per alignment) using RAxML, PhyML, or IQ-TREE. Interestingly, single-gene trees inferred by the three programs yielded comparable coalescent-based species tree estimations. For concatenation-based species tree inference, IQ-TREE consistently achieved the best-observed likelihoods for all data sets, and RAxML/ExaML was a close second. In contrast, PhyML often failed to complete concatenation-based analyses, whereas FastTree was the fastest but generated lower likelihood values and more dissimilar tree topologies in both types of analyses. Finally, data matrix properties, such as the number of taxa and the strength of phylogenetic signal, sometimes substantially influenced the programs’ relative performance. Our results provide real-world gene and species tree phylogenetic inference benchmarks to inform the design and execution of large-scale phylogenomic data analyses. PMID:29177474
Reconstruction of phylogenetic relationships in dermatomycete genus Trichophyton Malmsten 1848 based on ribosomal internal transcribed spacer region, partial 28S rRNA and beta-tubulin genes sequences.

Science.gov (United States)

Pchelin, Ivan M; Zlatogursky, Vasily V; Rudneva, Mariya V; Chilina, Galina A; Rezaei-Matehkolaei, Ali; Lavnikevich, Dmitry M; Vasilyeva, Natalya V; Taraskina, Anastasia E

2016-09-01

Trichophyton spp. are important causative agents of superficial mycoses. The phylogeny of the genus and accurate strain identification, based on the ribosomal ITS region sequencing, are still under development. The present work is aimed at (i) inferring the genus phylogeny from partial ITS, LSU and BT2 sequences (ii) description of ribosomal ITS region polymorphism in 15 strains of Trichophyton interdigitale. We performed DNA sequence-based species identification and phylogenetic analysis on 48 strains belonging to the genus Trichophyton. Phylogenetic relationships were inferred by maximum likelihood and Bayesian methods on concatenated ITS, LSU and BT2 sequences. Ribosomal ITS region polymorphisms were assessed directly on the alignment. By phylogenetic reconstruction, we reveal major anthropophilic and zoophilic species clusters in the genus Trichophyton. We describe several sequences of the ITS region of T. interdigitale, which do not fit in the traditional polymorphism scheme and propose emendations in this scheme for discrimination between ITS sequence types in T. interdigitale. The new polymorphism scheme will allow inclusion of a wider spectrum of isolates while retaining its explanatory power. This scheme was also found to be partially congruent with NTS typing technique. © 2016 Blackwell Verlag GmbH.
PhyLIS: A Simple GNU/Linux Distribution for Phylogenetics and Phyloinformatics

Directory of Open Access Journals (Sweden)

Robert C. Thomson

2009-01-01

Full Text Available PhyLIS is a free GNU/Linux distribution that is designed to provide a simple, standardized platform for phylogenetic and phyloinformatic analysis. The operating system incorporates most commonly used phylogenetic software, which has been pre-compiled and pre-configured, allowing for straightforward application of phylogenetic methods and development of phyloinformatic pipelines in a stable Linux environment. The software is distributed as a live CD and can be installed directly or run from the CD without making changes to the computer. PhyLIS is available for free at http://www.eve.ucdavis.edu/rcthomson/phylis/.
Phenotypic diversity and phylogenetic relationship between the ...

African Journals Online (AJOL)

Phenotypic diversity and phylogenetic relationship between the Bakosi/Baweri and other pig breeds ( Sus scrofa Domesticus ) in the humid forest with monomodal rainfall agro-ecological zone of Cameroon.

Plant traits determine the phylogenetic structure of arbuscular mycorrhizal fungal communities.

Science.gov (United States)

López-García, Álvaro; Varela-Cervero, Sara; Vasar, Martti; Öpik, Maarja; Barea, José M; Azcón-Aguilar, Concepción

2017-12-01

Functional diversity in ecosystems has traditionally been studied using aboveground plant traits. Despite the known effect of plant traits on the microbial community composition, their effects on the microbial functional diversity are only starting to be assessed. In this study, the phylogenetic structure of arbuscular mycorrhizal (AM) fungal communities associated with plant species differing in life cycle and growth form, that is, plant life forms, was determined to unravel the effect of plant traits on the functional diversity of this fungal group. The results of the 454 pyrosequencing showed that the AM fungal community composition differed across plant life forms and this effect was dependent on the soil collection date. Plants with ruderal characteristics tended to associate with phylogenetically clustered AM fungal communities. By contrast, plants with resource-conservative traits associated with phylogenetically overdispersed AM fungal communities. Additionally, the soil collected in different seasons yielded AM fungal communities with different phylogenetic dispersion. In summary, we found that the phylogenetic structure, and hence the functional diversity, of AM fungal communities is dependent on plant traits. This finding adds value to the use of plant traits for the evaluation of belowground ecosystem diversity, functions and processes. © 2017 John Wiley & Sons Ltd.
Host specialization and phylogenetic diversity of Corynespora cassiicola.

Science.gov (United States)

Dixon, L J; Schlub, R L; Pernezny, K; Datnoff, L E

2009-09-01

The fungus Corynespora cassiicola is primarily found in the tropics and subtropics, and is widely diverse in substrate utilization and host association. Isolate characterization within C. cassiicola was undertaken to investigate how genetic diversity correlates with host specificity, growth rate, and geographic distribution. C. cassiicola isolates were collected from 68 different plant species in American Samoa, Brazil, Malaysia, and Micronesia, and Florida, Mississippi, and Tennessee within the United States. Phylogenetic analyses using four loci were performed with 143 Corynespora spp. isolates, including outgroup taxa obtained from culture collections: C. citricola, C. melongenae, C. olivacea, C. proliferata, C. sesamum, and C. smithii. Phylogenetic trees were congruent from the ribosomal DNA internal transcribed spacer region, two random hypervariable loci (caa5 and ga4), and the actin-encoding locus act1, indicating a lack of recombination within the species and asexual propagation. Fifty isolates were tested for pathogenicity on eight known C. cassiicola crop hosts: basil, bean, cowpea, cucumber, papaya, soybean, sweet potato, and tomato. Pathogenicity profiles ranged from one to four hosts, with cucumber appearing in 14 of the 16 profiles. Bootstrap analyses and Bayesian posterior probability values identified six statistically significant phylogenetic lineages. The six phylogenetic lineages correlated with host of origin, pathogenicity, and growth rate but not with geographic location. Common fungal genotypes were widely distributed geographically, indicating long-distance and global dispersal of clonal lineages. This research reveals an abundance of previously unrecognized genetic diversity within the species and provides evidence for host specialization on papaya.
Efficient parsimony-based methods for phylogenetic network reconstruction.

Science.gov (United States)

Jin, Guohua; Nakhleh, Luay; Snir, Sagi; Tuller, Tamir

2007-01-15

Phylogenies--the evolutionary histories of groups of organisms-play a major role in representing relationships among biological entities. Although many biological processes can be effectively modeled as tree-like relationships, others, such as hybrid speciation and horizontal gene transfer (HGT), result in networks, rather than trees, of relationships. Hybrid speciation is a significant evolutionary mechanism in plants, fish and other groups of species. HGT plays a major role in bacterial genome diversification and is a significant mechanism by which bacteria develop resistance to antibiotics. Maximum parsimony is one of the most commonly used criteria for phylogenetic tree inference. Roughly speaking, inference based on this criterion seeks the tree that minimizes the amount of evolution. In 1990, Jotun Hein proposed using this criterion for inferring the evolution of sequences subject to recombination. Preliminary results on small synthetic datasets. Nakhleh et al. (2005) demonstrated the criterion's application to phylogenetic network reconstruction in general and HGT detection in particular. However, the naive algorithms used by the authors are inapplicable to large datasets due to their demanding computational requirements. Further, no rigorous theoretical analysis of computing the criterion was given, nor was it tested on biological data. In the present work we prove that the problem of scoring the parsimony of a phylogenetic network is NP-hard and provide an improved fixed parameter tractable algorithm for it. Further, we devise efficient heuristics for parsimony-based reconstruction of phylogenetic networks. We test our methods on both synthetic and biological data (rbcL gene in bacteria) and obtain very promising results.
Genome-wide comparative analysis of phylogenetic trees: the prokaryotic forest of life.

Science.gov (United States)

Puigbò, Pere; Wolf, Yuri I; Koonin, Eugene V

2012-01-01

Genome-wide comparison of phylogenetic trees is becoming an increasingly common approach in evolutionary genomics, and a variety of approaches for such comparison have been developed. In this article, we present several methods for comparative analysis of large numbers of phylogenetic trees. To compare phylogenetic trees taking into account the bootstrap support for each internal branch, the Boot-Split Distance (BSD) method is introduced as an extension of the previously developed Split Distance method for tree comparison. The BSD method implements the straightforward idea that comparison of phylogenetic trees can be made more robust by treating tree splits differentially depending on the bootstrap support. Approaches are also introduced for detecting tree-like and net-like evolutionary trends in the phylogenetic Forest of Life (FOL), i.e., the entirety of the phylogenetic trees for conserved genes of prokaryotes. The principal method employed for this purpose includes mapping quartets of species onto trees to calculate the support of each quartet topology and so to quantify the tree and net contributions to the distances between species. We describe the application of these methods to analyze the FOL and the results obtained with these methods. These results support the concept of the Tree of Life (TOL) as a central evolutionary trend in the FOL as opposed to the traditional view of the TOL as a "species tree."
An efficient and extensible approach for compressing phylogenetic trees.

Science.gov (United States)

Matthews, Suzanne J; Williams, Tiffani L

2011-10-18

Biologists require new algorithms to efficiently compress and store their large collections of phylogenetic trees. Our previous work showed that TreeZip is a promising approach for compressing phylogenetic trees. In this paper, we extend our TreeZip algorithm by handling trees with weighted branches. Furthermore, by using the compressed TreeZip file as input, we have designed an extensible decompressor that can extract subcollections of trees, compute majority and strict consensus trees, and merge tree collections using set operations such as union, intersection, and set difference. On unweighted phylogenetic trees, TreeZip is able to compress Newick files in excess of 98%. On weighted phylogenetic trees, TreeZip is able to compress a Newick file by at least 73%. TreeZip can be combined with 7zip with little overhead, allowing space savings in excess of 99% (unweighted) and 92%(weighted). Unlike TreeZip, 7zip is not immune to branch rotations, and performs worse as the level of variability in the Newick string representation increases. Finally, since the TreeZip compressed text (TRZ) file contains all the semantic information in a collection of trees, we can easily filter and decompress a subset of trees of interest (such as the set of unique trees), or build the resulting consensus tree in a matter of seconds. We also show the ease of which set operations can be performed on TRZ files, at speeds quicker than those performed on Newick or 7zip compressed Newick files, and without loss of space savings. TreeZip is an efficient approach for compressing large collections of phylogenetic trees. The semantic and compact nature of the TRZ file allow it to be operated upon directly and quickly, without a need to decompress the original Newick file. We believe that TreeZip will be vital for compressing and archiving trees in the biological community.
Reconstruction of certain phylogenetic networks from their tree-average distances.

Science.gov (United States)

Willson, Stephen J

2013-10-01

Trees are commonly utilized to describe the evolutionary history of a collection of biological species, in which case the trees are called phylogenetic trees. Often these are reconstructed from data by making use of distances between extant species corresponding to the leaves of the tree. Because of increased recognition of the possibility of hybridization events, more attention is being given to the use of phylogenetic networks that are not necessarily trees. This paper describes the reconstruction of certain such networks from the tree-average distances between the leaves. For a certain class of phylogenetic networks, a polynomial-time method is presented to reconstruct the network from the tree-average distances. The method is proved to work if there is a single reticulation cycle.
Molecular characterization and phylogenetic relationships among ...

African Journals Online (AJOL)

Molecular characterization and phylogenetic relationships among and within species of Phalaenopsis (Epidendroideae: Orchidaceae) based on RAPD analysis. ... Ph. parishii, Ph. labbi nepal, Ph. speciosa, Ph. lobbi yellow, Ph. venosa, Ph. hieroglyphica, and Ph. maculata; the third group consisted of Ph. minho princess, ...
Phylogenetic mixtures and linear invariants for equal input models.

Science.gov (United States)

Casanellas, Marta; Steel, Mike

2017-04-01

The reconstruction of phylogenetic trees from molecular sequence data relies on modelling site substitutions by a Markov process, or a mixture of such processes. In general, allowing mixed processes can result in different tree topologies becoming indistinguishable from the data, even for infinitely long sequences. However, when the underlying Markov process supports linear phylogenetic invariants, then provided these are sufficiently informative, the identifiability of the tree topology can be restored. In this paper, we investigate a class of processes that support linear invariants once the stationary distribution is fixed, the 'equal input model'. This model generalizes the 'Felsenstein 1981' model (and thereby the Jukes-Cantor model) from four states to an arbitrary number of states (finite or infinite), and it can also be described by a 'random cluster' process. We describe the structure and dimension of the vector spaces of phylogenetic mixtures and of linear invariants for any fixed phylogenetic tree (and for all trees-the so called 'model invariants'), on any number n of leaves. We also provide a precise description of the space of mixtures and linear invariants for the special case of [Formula: see text] leaves. By combining techniques from discrete random processes and (multi-) linear algebra, our results build on a classic result that was first established by James Lake (Mol Biol Evol 4:167-191, 1987).
Inferring 'weak spots' in phylogenetic trees: application to mosasauroid nomenclature.

Science.gov (United States)

Madzia, Daniel; Cau, Andrea

2017-01-01

Mosasauroid squamates represented the apex predators within the Late Cretaceous marine and occasionally also freshwater ecosystems. Proper understanding of the origin of their ecological adaptations or paleobiogeographic dispersals requires adequate knowledge of their phylogeny. The studies assessing the position of mosasauroids on the squamate evolutionary tree and their origins have long given conflicting results. The phylogenetic relationships within Mosasauroidea, however, have experienced only little changes throughout the last decades. Considering the substantial improvements in the development of phylogenetic methodology that have undergone in recent years, resulting, among others, in numerous alterations in the phylogenetic hypotheses of other fossil amniotes, we test the robustness in our understanding of mosasauroid beginnings and their evolutionary history. We re-examined a data set that results from modifications assembled in the course of the last 20 years and performed multiple parsimony analyses and Bayesian tip-dating analysis. Following the inferred topologies and the 'weak spots' in the phylogeny of mosasauroids, we revise the nomenclature of the 'traditionally' recognized mosasauroid clades, to acknowledge the overall weakness among branches and the alternative topologies suggested previously, and discuss several factors that might have an impact on the differing phylogenetic hypotheses and their statistical support.
Incorporating phylogenetic information for the definition of floristic districts in hyperdiverse Amazon forests: Implications for conservation.

Science.gov (United States)

Guevara Andino, Juan Ernesto; Pitman, Nigel C A; Ter Steege, Hans; Mogollón, Hugo; Ceron, Carlos; Palacios, Walter; Oleas, Nora; Fine, Paul V A

2017-11-01

Using complementary metrics to evaluate phylogenetic diversity can facilitate the delimitation of floristic units and conservation priority areas. In this study, we describe the spatial patterns of phylogenetic alpha and beta diversity, phylogenetic endemism, and evolutionary distinctiveness of the hyperdiverse Ecuador Amazon forests and define priority areas for conservation. We established a network of 62 one-hectare plots in terra firme forests of Ecuadorian Amazon. In these plots, we tagged, collected, and identified every single adult tree with dbh ≥10 cm. These data were combined with a regional community phylogenetic tree to calculate different phylogenetic diversity (PD) metrics in order to create spatial models. We used Loess regression to estimate the spatial variation of taxonomic and phylogenetic beta diversity as well as phylogenetic endemism and evolutionary distinctiveness. We found evidence for the definition of three floristic districts in the Ecuadorian Amazon, supported by both taxonomic and phylogenetic diversity data. Areas with high levels of phylogenetic endemism and evolutionary distinctiveness in Ecuadorian Amazon forests are unprotected. Furthermore, these areas are severely threatened by proposed plans of oil and mining extraction at large scales and should be prioritized in conservation planning for this region.
Molecular phylogenetics of mastodon and Tyrannosaurus rex.

Science.gov (United States)

Organ, Chris L; Schweitzer, Mary H; Zheng, Wenxia; Freimark, Lisa M; Cantley, Lewis C; Asara, John M

2008-04-25

We report a molecular phylogeny for a nonavian dinosaur, extending our knowledge of trait evolution within nonavian dinosaurs into the macromolecular level of biological organization. Fragments of collagen alpha1(I) and alpha2(I) proteins extracted from fossil bones of Tyrannosaurus rex and Mammut americanum (mastodon) were analyzed with a variety of phylogenetic methods. Despite missing sequence data, the mastodon groups with elephant and the T. rex groups with birds, consistent with predictions based on genetic and morphological data for mastodon and on morphological data for T. rex. Our findings suggest that molecular data from long-extinct organisms may have the potential for resolving relationships at critical areas in the vertebrate evolutionary tree that have, so far, been phylogenetically intractable.
A guide to phylogenetic metrics for conservation, community ecology and macroecology

Science.gov (United States)

Cadotte, Marc W.; Carvalho, Silvia B.; Davies, T. Jonathan; Ferrier, Simon; Fritz, Susanne A.; Grenyer, Rich; Helmus, Matthew R.; Jin, Lanna S.; Mooers, Arne O.; Pavoine, Sandrine; Purschke, Oliver; Redding, David W.; Rosauer, Dan F.; Winter, Marten; Mazel, Florent

2016-01-01

ABSTRACT The use of phylogenies in ecology is increasingly common and has broadened our understanding of biological diversity. Ecological sub‐disciplines, particularly conservation, community ecology and macroecology, all recognize the value of evolutionary relationships but the resulting development of phylogenetic approaches has led to a proliferation of phylogenetic diversity metrics. The use of many metrics across the sub‐disciplines hampers potential meta‐analyses, syntheses, and generalizations of existing results. Further, there is no guide for selecting the appropriate metric for a given question, and different metrics are frequently used to address similar questions. To improve the choice, application, and interpretation of phylo‐diversity metrics, we organize existing metrics by expanding on a unifying framework for phylogenetic information. Generally, questions about phylogenetic relationships within or between assemblages tend to ask three types of question: how much; how different; or how regular? We show that these questions reflect three dimensions of a phylogenetic tree: richness, divergence, and regularity. We classify 70 existing phylo‐diversity metrics based on their mathematical form within these three dimensions and identify ‘anchor’ representatives: for α‐diversity metrics these are PD (Faith's phylogenetic diversity), MPD (mean pairwise distance), and VPD (variation of pairwise distances). By analysing mathematical formulae and using simulations, we use this framework to identify metrics that mix dimensions, and we provide a guide to choosing and using the most appropriate metrics. We show that metric choice requires connecting the research question with the correct dimension of the framework and that there are logical approaches to selecting and interpreting metrics. The guide outlined herein will help researchers navigate the current jungle of indices. PMID:26785932
The Drosophila bipectinata species complex: phylogenetic ...

Indian Academy of Sciences (India)

[Banerjee P. and Singh B. N. 2017 The Drosophila bipectinata species complex: phylogenetic relationship among different members based on chromosomal variations. J. Genet. 96, 97–107]. Introduction ..... loops touch the chromocenter and in our microphotograph. (depicting both the arms) too, the involvement of chromo-.
Multiple sequence alignment accuracy and phylogenetic inference.

Science.gov (United States)

Ogden, T Heath; Rosenberg, Michael S

2006-04-01

Phylogenies are often thought to be more dependent upon the specifics of the sequence alignment rather than on the method of reconstruction. Simulation of sequences containing insertion and deletion events was performed in order to determine the role that alignment accuracy plays during phylogenetic inference. Data sets were simulated for pectinate, balanced, and random tree shapes under different conditions (ultrametric equal branch length, ultrametric random branch length, nonultrametric random branch length). Comparisons between hypothesized alignments and true alignments enabled determination of two measures of alignment accuracy, that of the total data set and that of individual branches. In general, our results indicate that as alignment error increases, topological accuracy decreases. This trend was much more pronounced for data sets derived from more pectinate topologies. In contrast, for balanced, ultrametric, equal branch length tree shapes, alignment inaccuracy had little average effect on tree reconstruction. These conclusions are based on average trends of many analyses under different conditions, and any one specific analysis, independent of the alignment accuracy, may recover very accurate or inaccurate topologies. Maximum likelihood and Bayesian, in general, outperformed neighbor joining and maximum parsimony in terms of tree reconstruction accuracy. Results also indicated that as the length of the branch and of the neighboring branches increase, alignment accuracy decreases, and the length of the neighboring branches is the major factor in topological accuracy. Thus, multiple-sequence alignment can be an important factor in downstream effects on topological reconstruction.
Nucleotide and amino acid sequences of a coat protein of an Ukrainian isolate of Potato virus Y: comparison with homologous sequences of other isolates and phylogenetic analysis

Directory of Open Access Journals (Sweden)

Budzanivska I. G.

2014-03-01

Full Text Available Aim. Identification of the widespread Ukrainian isolate(s of PVY (Potato virus Y in different potato cultivars and subsequent phylogenetic analysis of detected PVY isolates based on NA and AA sequences of coat protein. Methods. ELISA, RT-PCR, DNA sequencing and phylogenetic analysis. Results. PVY has been identified serologically in potato cultivars of Ukrainian selection. In this work we have optimized a method for total RNA extraction from potato samples and offered a sensitive and specific PCR-based test system of own design for diagnostics of the Ukrainian PVY isolates. Part of the CP gene of the Ukrainian PVY isolate has been sequenced and analyzed phylogenetically. It is demonstrated that the Ukrainian isolate of Potato virus Y (CP gene has a higher percentage of homology with the recombinant isolates (strains of this pathogen (approx. 98.8– 99.8 % of homology for both nucleotide and translated amino acid sequences of the CP gene. The Ukrainian isolate of PVY is positioned in the separate cluster together with the isolates found in Syria, Japan and Iran; these isolates possibly have common origin. The Ukrainian PVY isolate is confirmed to be recombinant. Conclusions. This work underlines the need and provides the means for accurate monitoring of Potato virus Y in the agroecosystems of Ukraine. Most importantly, the phylogenetic analysis demonstrated the recombinant nature of this PVY isolate which has been attributed to the strain group O, subclade N:O.
LifePrint: a novel k-tuple distance method for construction of phylogenetic trees

Directory of Open Access Journals (Sweden)

Fabián Reyes-Prieto

2011-01-01

substitution.Conclusion: Our LPS9 method generates more accurate phylogenetic reconstructions than the previously proposed 5-tuples strategy. LPS9-reconstructed trees show higher bootstrap proportion values than distance trees derived from the 5-tuple method.Keywords: phylogeny, sequence alignment, similarity search, tuple, viroid
Evaluating the relationship between evolutionary divergence and phylogenetic accuracy in AFLP data sets.

Science.gov (United States)

García-Pereira, María Jesús; Caballero, Armando; Quesada, Humberto

2010-05-01

Using in silico amplified fragment length polymorphism (AFLP) fingerprints, we explore the relationship between sequence similarity and phylogeny accuracy to test when, in terms of genetic divergence, the quality of AFLP data becomes too low to be informative for a reliable phylogenetic reconstruction. We generated DNA sequences with known phylogenies using balanced and unbalanced trees with recent, uniform and ancient radiations, and average branch lengths (from the most internal node to the tip) ranging from 0.02 to 0.4 substitutions per site. The resulting sequences were used to emulate the AFLP procedure. Trees were estimated by maximum parsimony (MP), neighbor-joining (NJ), and minimum evolution (ME) methods from both DNA sequences and virtual AFLP fingerprints. The estimated trees were compared with the reference trees using a score that measures overall differences in both topology and relative branch length. As expected, the accuracy of AFLP-based phylogenies decreased dramatically in the more divergent data sets. Above a divergence of approximately 0.05, AFLP-based phylogenies were largely inaccurate irrespective of the distinct topology, radiation model, or phylogenetic method used. This value represents an upper bound of expected tree accuracy for data sets with a simple divergence history; AFLP data sets with a similar divergence but with unbalanced topologies and short ancestral branches produced much less accurate trees. The lack of homology of AFLP bands quickly increases with divergence and reaches its maximum value (100%) at a divergence of only 0.4. Low guanine-cytosine (GC) contents increase the number of nonhomologous bands in AFLP data sets and lead to less reliable trees. However, the effect of the lack of band homology on tree accuracy is surprisingly small relative to the negative impact due to the low information content of AFLP characters. Tree-building methods based on genetic distance displayed similar trends and outperformed parsimony
Disentangling environmental and spatial effects on phylogenetic structure of angiosperm tree communities in China.

Science.gov (United States)

Qian, Hong; Chen, Shengbin; Zhang, Jin-Long

2017-07-17

Niche-based and neutrality-based theories are two major classes of theories explaining the assembly mechanisms of local communities. Both theories have been frequently used to explain species diversity and composition in local communities but their relative importance remains unclear. Here, we analyzed 57 assemblages of angiosperm trees in 0.1-ha forest plots across China to examine the effects of environmental heterogeneity (relevant to niche-based processes) and spatial contingency (relevant to neutrality-based processes) on phylogenetic structure of angiosperm tree assemblages distributed across a wide range of environment and space. Phylogenetic structure was quantified with six phylogenetic metrics (i.e., phylogenetic diversity, mean pairwise distance, mean nearest taxon distance, and the standardized effect sizes of these three metrics), which emphasize on different depths of evolutionary histories and account for different degrees of species richness effects. Our results showed that the variation in phylogenetic metrics explained independently by environmental variables was on average much greater than that explained independently by spatial structure, and the vast majority of the variation in phylogenetic metrics was explained by spatially structured environmental variables. We conclude that niche-based processes have played a more important role than neutrality-based processes in driving phylogenetic structure of angiosperm tree species in forest communities in China.
Ulysses: accurate detection of low-frequency structural variations in large insert-size sequencing libraries.

Science.gov (United States)

Gillet-Markowska, Alexandre; Richard, Hugues; Fischer, Gilles; Lafontaine, Ingrid

2015-03-15

The detection of structural variations (SVs) in short-range Paired-End (PE) libraries remains challenging because SV breakpoints can involve large dispersed repeated sequences, or carry inherent complexity, hardly resolvable with classical PE sequencing data. In contrast, large insert-size sequencing libraries (Mate-Pair libraries) provide higher physical coverage of the genome and give access to repeat-containing regions. They can thus theoretically overcome previous limitations as they are becoming routinely accessible. Nevertheless, broad insert size distributions and high rates of chimerical sequences are usually associated to this type of libraries, which makes the accurate annotation of SV challenging. Here, we present Ulysses, a tool that achieves drastically higher detection accuracy than existing tools, both on simulated and real mate-pair sequencing datasets from the 1000 Human Genome project. Ulysses achieves high specificity over the complete spectrum of variants by assessing, in a principled manner, the statistical significance of each possible variant (duplications, deletions, translocations, insertions and inversions) against an explicit model for the generation of experimental noise. This statistical model proves particularly useful for the detection of low frequency variants. SV detection performed on a large insert Mate-Pair library from a breast cancer sample revealed a high level of somatic duplications in the tumor and, to a lesser extent, in the blood sample as well. Altogether, these results show that Ulysses is a valuable tool for the characterization of somatic mosaicism in human tissues and in cancer genomes. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Phylogenetic signals in the climatic niches of the world's amphibians

DEFF Research Database (Denmark)

Hof, Christian; Rahbek, Carsten; Araújo, Miguel B.

2010-01-01

amphibian orders and across biogeographical regions. To our knowledge, this is the first study providing a comprehensive analysis of the phylogenetic signal in species climatic niches for an entire clade across the world. Even though our results do not provide a strong test of the niche conservatism......The question of whether closely related species share similar ecological requirements has attracted increasing attention, because of its importance for understanding global diversity gradients and the impacts of climate change on species distributions. In fact, the assumption that related species...... are also ecologically similar has often been made, although the prevalence of such a phylogenetic signal in ecological niches remains heavily debated. Here, we provide a global analysis of phylogenetic niche relatedness for the world's amphibians. In particular, we assess which proportion of the variance...

Nitrogen addition, not initial phylogenetic diversity, increases litter decomposition by fungal communities

Directory of Open Access Journals (Sweden)

Anthony Stuart Amend

2015-02-01

Full Text Available Fungi play a critical role in the degradation of organic matter. Because different combinations of fungi result in different rates of decomposition, determining how climate change will affect microbial composition and function is fundamental to predicting future environments. Fungal response to global change is patterned by genetic relatedness, resulting in communities with comparatively low phylogenetic diversity. This may have important implications for the functional capacity of disturbed communities if lineages sensitive to disturbance also contain unique traits important for litter decomposition. Here we tested the relationship between phylogenetic diversity and decomposition rates. Leaf litter fungi were isolated from the field and deployed in microcosms as mock communities along a gradient of initial phylogenetic diversity, while species richness was held constant. Replicate communities were subject to nitrogen fertilization comparable to anthropogenic deposition levels. Carbon mineralization rates were measured over the course of sixty-six days. We found that nitrogen fertilization increased cumulative respiration by 24.8%, and that differences in respiration between fertilized and ambient communities diminished over the course of the experiment. Initial phylogenetic diversity failed to predict respiration rates or their change in response to nitrogen fertilization, and there was no correlation between community similarity and respiration rates. Last, we detected no phylogenetic signal in the contributions of individual isolates to respiration rates. Our results suggest that the degree to which phylogenetic diversity predicts ecosystem function will depend on environmental context.
Analyzing Phylogenetic Trees with Timed and Probabilistic Model Checking: The Lactose Persistence Case Study.

Science.gov (United States)

Requeno, José Ignacio; Colom, José Manuel

2014-12-01

Model checking is a generic verification technique that allows the phylogeneticist to focus on models and specifications instead of on implementation issues. Phylogenetic trees are considered as transition systems over which we interrogate phylogenetic questions written as formulas of temporal logic. Nonetheless, standard logics become insufficient for certain practices of phylogenetic analysis since they do not allow the inclusion of explicit time and probabilities. The aim of this paper is to extend the application of model checking techniques beyond qualitative phylogenetic properties and adapt the existing logical extensions and tools to the field of phylogeny. The introduction of time and probabilities in phylogenetic specifications is motivated by the study of a real example: the analysis of the ratio of lactose intolerance in some populations and the date of appearance of this phenotype.
treeman: an R package for efficient and intuitive manipulation of phylogenetic trees.

Science.gov (United States)

Bennett, Dominic J; Sutton, Mark D; Turvey, Samuel T

2017-01-07

Phylogenetic trees are hierarchical structures used for representing the inter-relationships between biological entities. They are the most common tool for representing evolution and are essential to a range of fields across the life sciences. The manipulation of phylogenetic trees-in terms of adding or removing tips-is often performed by researchers not just for reasons of management but also for performing simulations in order to understand the processes of evolution. Despite this, the most common programming language among biologists, R, has few class structures well suited to these tasks. We present an R package that contains a new class, called TreeMan, for representing the phylogenetic tree. This class has a list structure allowing phylogenetic trees to be manipulated more efficiently. Computational running times are reduced because of the ready ability to vectorise and parallelise methods. Development is also improved due to fewer lines of code being required for performing manipulation processes. We present three use cases-pinning missing taxa to a supertree, simulating evolution with a tree-growth model and detecting significant phylogenetic turnover-that demonstrate the new package's speed and simplicity.
Phylogenetic congruence and ecological coherence in terrestrial Thaumarchaeota.

Science.gov (United States)

Oton, Eduard Vico; Quince, Christopher; Nicol, Graeme W; Prosser, James I; Gubry-Rangin, Cécile

2016-01-01

Thaumarchaeota form a ubiquitously distributed archaeal phylum, comprising both the ammonia-oxidising archaea (AOA) and other archaeal groups in which ammonia oxidation has not been demonstrated (including Group 1.1c and Group 1.3). The ecology of AOA in terrestrial environments has been extensively studied using either a functional gene, encoding ammonia monooxygenase subunit A (amoA) or 16S ribosomal RNA (rRNA) genes, which show phylogenetic coherence with respect to soil pH. To test phylogenetic congruence between these two markers and to determine ecological coherence in all Thaumarchaeota, we performed high-throughput sequencing of 16S rRNA and amoA genes in 46 UK soils presenting 29 available contextual soil characteristics. Adaptation to pH and organic matter content reflected strong ecological coherence at various levels of taxonomic resolution for Thaumarchaeota (AOA and non-AOA), whereas nitrogen, total mineralisable nitrogen and zinc concentration were also important factors associated with AOA thaumarchaeotal community distribution. Other significant associations with environmental factors were also detected for amoA and 16S rRNA genes, reflecting different diversity characteristics between these two markers. Nonetheless, there was significant statistical congruence between the markers at fine phylogenetic resolution, supporting the hypothesis of low horizontal gene transfer between Thaumarchaeota. Group 1.1c Thaumarchaeota were also widely distributed, with two clusters predominating, particularly in environments with higher moisture content and organic matter, whereas a similar ecological pattern was observed for Group 1.3 Thaumarchaeota. The ecological and phylogenetic congruence identified is fundamental to understand better the life strategies, evolutionary history and ecosystem function of the Thaumarchaeota.
Detection of Horizontal Gene Transfers from Phylogenetic Comparisons

Science.gov (United States)

Pylro, Victor Satler; Vespoli, Luciano de Souza; Duarte, Gabriela Frois; Yotoko, Karla Suemy Clemente

2012-01-01

Bacterial phylogenies have become one of the most important challenges for microbial ecology. This field started in the mid-1970s with the aim of using the sequence of the small subunit ribosomal RNA (16S) tool to infer bacterial phylogenies. Phylogenetic hypotheses based on other sequences usually give conflicting topologies that reveal different evolutionary histories, which in some cases may be the result of horizontal gene transfer events. Currently, one of the major goals of molecular biology is to understand the role that horizontal gene transfer plays in species adaptation and evolution. In this work, we compared the phylogenetic tree based on 16S with the tree based on dszC, a gene involved in the cleavage of carbon-sulfur bonds. Bacteria of several genera perform this survival task when living in environments lacking free mineral sulfur. The biochemical pathway of the desulphurization process was extensively studied due to its economic importance, since this step is expensive and indispensable in fuel production. Our results clearly show that horizontal gene transfer events could be detected using common phylogenetic methods with gene sequences obtained from public sequence databases. PMID:22675653
AFLPMax: a user-friendly application for computing the optimal number of amplified fragment length polymorphism markers needed in phylogenetic reconstruction.

Science.gov (United States)

García-Pereira, M J; Quesada, H; Caballero, A; Carvajal-Rodríguez, A

2012-05-01

Amplified fragment length polymorphisms (AFLPs) are widely used for phylogenetic inference especially in non-model species. Frequently, trees obtained with other nuclear or mitochondrial markers or with morphological information need additional resolution, increased branch support, or independent data sources (i.e. unlinked loci). In such cases, the use of AFLPs is a quick and cheap option. Computer simulation has shown that dominant AFLP markers lead to less accurate tree topologies than bi-allelic codominant markers such as SNPs, but this difference becomes negligible for shallow trees when using AFLP data sets that include a sufficiently large number of characters. Thus, determining how many AFLP characters are required to recover a given phylogeny is a key issue regarding the appropriateness of AFLPs for phylogenetic reconstruction. Here, we present a user-friendly, java-based graphical interface, AFLPMax, which executes an automatic pipeline of different programs providing the user with the optimal number of AFLP characters needed to recover a given phylogeny with high accuracy and support. Executables for Windows, linux and MacOS X operating systems, source code and user manual are available from: http://webs.uvigo.es/acraaj/AFLPMax.htm. © 2012 Blackwell Publishing Ltd.
Resolving ambiguity in the phylogenetic relationship of genotypes A, B, and C of hepatitis B virus

Science.gov (United States)

2013-01-01

Background Hepatitis B virus (HBV) is an important infectious agent that causes widespread concern because billions of people are infected by at least 8 different HBV genotypes worldwide. However, reconstruction of the phylogenetic relationship between HBV genotypes is difficult. Specifically, the phylogenetic relationships among genotypes A, B, and C are not clear from previous studies because of the confounding effects of genotype recombination. In order to clarify the evolutionary relationships, a rigorous approach is required that can effectively explore genetic sequences with recombination. Result In the present study, phylogenetic relationship of the HBV genotypes was reconstructed using a consensus phylogeny of phylogenetic trees of HBV genome segments. Reliability of the reconstructed phylogeny was extensively evaluated in agreements of local phylogenies of genome segments. The reconstructed phylogenetic tree revealed that HBV genotypes B and C had a closer phylogenetic relationship than genotypes A and B or A and C. Evaluations showed the consensus method was capable to reconstruct reliable phylogenetic relationship in the presence of recombinants. Conclusion The consensus method implemented in this study provides an alternative approach for reconstructing reliable phylogenetic relationships for viruses with possible genetic recombination. Our approach revealed the phylogenetic relationships of genotypes A, B, and C of HBV. PMID:23758960
Phylogenetic and chemical diversity of MAR4 streptomycete lineage

Directory of Open Access Journals (Sweden)

Marisa Paulino

2014-06-01

To date, phylogenetic characterization of 6 representative isolates, based on partial sequence of gene encoding 16S rRNA, confirm that these strains belong to the specie Streptomyces aculeolatus. Figure 2. Neighbour-joining phylogenetic tree created from 6 partial 16S rRNA gene sequence from Streptomyces aculeolatus strains cultured from Madeira Archipelago, based on 1000 bootstrap replicates. BLAST matches (deposited in GenBank are included with species and strain name followed by accession number. Verrucosispora maris and Micromonospora aurantiaca were used as outgroups.
A paternally transmitted complex chromosomal rearrangement (CCR) involving chromosomes 2, 6, and 18 includes eight breakpoints and five insertional translocations (ITs) through three generations.

Science.gov (United States)

Gruchy, Nicolas; Barreau, Morgane; Kessler, Ketty; Gourdier, Dominique; Leporrier, Nathalie

2010-01-01

Complex chromosomal rearrangements (CCRs) are uncommon and mainly occur de novo. We report here on a familial CCR involving chromosomes 2, 6, and 18. The propositus is a boy first referred because of growth delays, hypotonia, and facial anomalies, suggestive of deletion 18q syndrome. However, a cytogenetic family study disclosed a balanced CCR in three generations, which was detailed by FISH using BAC clones, and consisted of eight breakpoints with five insertional translocations (ITs). The propositus had a cryptic 18q deletion and a 6p duplication. Paternal transmission of this CCR was observed through three generations without meiotic recombination. Our investigation allowed us to provide porosities counseling and management of prenatal diagnosis for propositus cousin who carries this particular CCR.
Phylogenetically-informed priorities for amphibian conservation.

Science.gov (United States)

Isaac, Nick J B; Redding, David W; Meredith, Helen M; Safi, Kamran

2012-01-01

The amphibian decline and extinction crisis demands urgent action to prevent further large numbers of species extinctions. Lists of priority species for conservation, based on a combination of species' threat status and unique contribution to phylogenetic diversity, are one tool for the direction and catalyzation of conservation action. We describe the construction of a near-complete species-level phylogeny of 5713 amphibian species, which we use to create a list of evolutionarily distinct and globally endangered species (EDGE list) for the entire class Amphibia. We present sensitivity analyses to test the robustness of our priority list to uncertainty in species' phylogenetic position and threat status. We find that both sources of uncertainty have only minor impacts on our 'top 100' list of priority species, indicating the robustness of the approach. By contrast, our analyses suggest that a large number of Data Deficient species are likely to be high priorities for conservation action from the perspective of their contribution to the evolutionary history.
Phylogenetically-informed priorities for amphibian conservation.

Directory of Open Access Journals (Sweden)

Nick J B Isaac

Full Text Available The amphibian decline and extinction crisis demands urgent action to prevent further large numbers of species extinctions. Lists of priority species for conservation, based on a combination of species' threat status and unique contribution to phylogenetic diversity, are one tool for the direction and catalyzation of conservation action. We describe the construction of a near-complete species-level phylogeny of 5713 amphibian species, which we use to create a list of evolutionarily distinct and globally endangered species (EDGE list for the entire class Amphibia. We present sensitivity analyses to test the robustness of our priority list to uncertainty in species' phylogenetic position and threat status. We find that both sources of uncertainty have only minor impacts on our 'top 100' list of priority species, indicating the robustness of the approach. By contrast, our analyses suggest that a large number of Data Deficient species are likely to be high priorities for conservation action from the perspective of their contribution to the evolutionary history.
Ecological traits influence the phylogenetic structure of bird species co-occurrences worldwide

NARCIS (Netherlands)

Barnagaud, J.-Y.; Kissling, W.D.; Sandel, B.; Eiserhardt, W.L.; Şekercioğlu, Ç.H.; Enquist, B.J.; Tsirogiannis, C.; Svenning, J.-C.

2014-01-01

The extent to which species’ ecological and phylogenetic relatedness shape their co-occurrence patterns at large spatial scales remains poorly understood. By quantifying phylogenetic assemblage structure within geographic ranges of >8000 bird species, we show that global co-occurrence patterns are
On the need for phylogenetic ‘corrections’ in functional trait-based approaches

NARCIS (Netherlands)

de Bello, F.; Berg, M.P.; Dias, A.T.C.; Diniz-Filho, J.A.F.; Götzenberger, L.; Hortal, J.; Ladle, R.J.; Lepš, J.

2015-01-01

There is considerable uncertainty about if, and when, phylogenetic information is needed to answer various ecological questions about trait-based ecological studies. It has been recommended that both functional and phylogenetic information should be combined, and some researchers have even suggested
Phylogenetic Analysis of Phytophthora Species Based on Mitochondrial and Nuclear DNA Sequences

NARCIS (Netherlands)

Kroon, L.P.N.M.; Bakker, F.T.; Bosch, van den G.B.M.; Bonants, P.J.M.; Flier, W.G.

2004-01-01

A molecular phylogenetic analysis of the genus Phytophthora was performed, 113 isolates from 48 Phytophthora species were included in this analysis. Phylogenetic analyses were performed on regions of mitochondrial (cytochrome c oxidase subunit 1; NADH dehydrogenase subunit 1) and nuclear gene
Phylogenetic paleobiogeography of Late Ordovician Laurentian brachiopods

Directory of Open Access Journals (Sweden)

Jennifer E. Bauer

2014-12-01

Full Text Available Phylogenetic biogeographic analysis of four brachiopod genera was used to uncover large-scale geologic drivers of Late Ordovician biogeographic differentiation in Laurentia. Previously generated phylogenetic hypotheses were converted into area cladograms, ancestral geographic ranges were optimized and speciation events characterized as via dispersal or vicariance, when possible. Area relationships were reconstructed using Lieberman-modified Brooks Parsimony Analysis. The resulting area cladograms indicate tectonic and oceanographic changes were the primary geologic drivers of biogeographic patterns within the focal taxa. The Taconic tectophase contributed to the separation of the Appalachian and Central basins as well as the two midcontinent basins, whereas sea level rise following the Boda Event promoted interbasinal dispersal. Three migration pathways into the Cincinnati Basin were recognized, which supports the multiple pathway hypothesis for the Richmondian Invasion.
Translocations and deletions with breakpoint on 21q are nonrandomly associated with treatment-related acute nonlymphocytic leukemia and preleukemia

International Nuclear Information System (INIS)

Keldsen, N.; Philip, P.; Pedersen-Bjergaard, J.

1987-01-01

Six of 70 (8.6%) consecutive cases with therapy-related acute nonlymphocytic leukemia (ANLL) or preleukemia had a translocation or deletion with a breakpoint on 21q. Such aberrations were seen in only one of 200 (0.5%) consecutive cases of de novo ANLL examined at our laboratory. The figures reflect a 17.1-fold increased incidence of 21q aberrations in therapy-related ANLL or preleukemia, compared with ANLL de novo. The difference is highly significant (p = 0.003). The increased incidence of 21q aberrations in therapy-related myelodysplastic syndromes was confirmed by literature studies. Band 21q22 was most often involved. Cases with t(8;21), which is strongly associated with the M2 variant of ANLL, or cases with i(21q), which is supposedly due to a centromeric misdivision, were not included in the count. It is concluded that the 21q aberrations are associated with treatment-related ANLL or preleukemia with at least the same degree of specificity as aberrations of number5 and number7. 61 references
Model checking software for phylogenetic trees using distribution and database methods

Directory of Open Access Journals (Sweden)

Requeno José Ignacio

2013-12-01

Full Text Available Model checking, a generic and formal paradigm stemming from computer science based on temporal logics, has been proposed for the study of biological properties that emerge from the labeling of the states defined over the phylogenetic tree. This strategy allows us to use generic software tools already present in the industry. However, the performance of traditional model checking is penalized when scaling the system for large phylogenies. To this end, two strategies are presented here. The first one consists of partitioning the phylogenetic tree into a set of subgraphs each one representing a subproblem to be verified so as to speed up the computation time and distribute the memory consumption. The second strategy is based on uncoupling the information associated to each state of the phylogenetic tree (mainly, the DNA sequence and exporting it to an external tool for the management of large information systems. The integration of all these approaches outperforms the results of monolithic model checking and helps us to execute the verification of properties in a real phylogenetic tree.
IcyTree: rapid browser-based visualization for phylogenetic trees and networks.

Science.gov (United States)

Vaughan, Timothy G

2017-08-01

IcyTree is an easy-to-use application which can be used to visualize a wide variety of phylogenetic trees and networks. While numerous phylogenetic tree viewers exist already, IcyTree distinguishes itself by being a purely online tool, having a responsive user interface, supporting phylogenetic networks (ancestral recombination graphs in particular), and efficiently drawing trees that include information such as ancestral locations or trait values. IcyTree also provides intuitive panning and zooming utilities that make exploring large phylogenetic trees of many thousands of taxa feasible. IcyTree is a web application and can be accessed directly at http://tgvaughan.github.com/icytree . Currently supported web browsers include Mozilla Firefox and Google Chrome. IcyTree is written entirely in client-side JavaScript (no plugin required) and, once loaded, does not require network access to run. IcyTree is free software, and the source code is made available at http://github.com/tgvaughan/icytree under version 3 of the GNU General Public License. tgvaughan@gmail.com. © The Author(s) 2017. Published by Oxford University Press.
Applying a multiobjective metaheuristic inspired by honey bees to phylogenetic inference.

Science.gov (United States)

Santander-Jiménez, Sergio; Vega-Rodríguez, Miguel A

2013-10-01

The development of increasingly popular multiobjective metaheuristics has allowed bioinformaticians to deal with optimization problems in computational biology where multiple objective functions must be taken into account. One of the most relevant research topics that can benefit from these techniques is phylogenetic inference. Throughout the years, different researchers have proposed their own view about the reconstruction of ancestral evolutionary relationships among species. As a result, biologists often report different phylogenetic trees from a same dataset when considering distinct optimality principles. In this work, we detail a multiobjective swarm intelligence approach based on the novel Artificial Bee Colony algorithm for inferring phylogenies. The aim of this paper is to propose a complementary view of phylogenetics according to the maximum parsimony and maximum likelihood criteria, in order to generate a set of phylogenetic trees that represent a compromise between these principles. Experimental results on a variety of nucleotide data sets and statistical studies highlight the relevance of the proposal with regard to other multiobjective algorithms and state-of-the-art biological methods. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Phylogenetic relationship among Kenyan sorghum germplasms ...

African Journals Online (AJOL)

Mr Kiboi

phylogenetic relationships based on 10 DNA fragments at AltSB loci with SbMATE, ORF9 and MITE primers. .... estimate the overall genetic diversity in Kenyan sorghum lines: Cheprot et al. 3529 ..... EARN project and Generation Challenge (GCP), ... genetics and molecular biology of plant aluminum resistance and toxicity.

Morphological characterization and phylogenetic distance among ...

African Journals Online (AJOL)

The genetic diversity was calculated with Nei and Li's index, and the phylogenetic tree (dendrogram) was generated with a neighbor-joining program. The dendrogram indicates the diversity of the genotypes, which are grouped into three distinctive large groups. The largest group includes species from the Mediolobivia and ...
Genomic repeat abundances contain phylogenetic signal

Czech Academy of Sciences Publication Activity Database

Dodsworth, S.; Chase, M.W.; Kelly, L.J.; Leitch, I.J.; Macas, Jiří; Novák, Petr; Piednoël, M.; Weiß-Schneeweiss, H.; Leitch, A.R.

2015-01-01

Roč. 64, č. 1 (2015), s. 112-126 ISSN 1063-5157 R&D Projects: GA ČR GBP501/12/G090 Institutional support: RVO:60077344 Keywords : Repetitive DNA * continuous characters * genomics * next-generation sequencing * phylogenetics Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 8.225, year: 2015
Worst-case optimal approximation algorithms for maximizing triplet consistency within phylogenetic networks

NARCIS (Netherlands)

J. Byrka (Jaroslaw); K.T. Huber; S.M. Kelk (Steven); P. Gawrychowski

2009-01-01

htmlabstractThe study of phylogenetic networks is of great interest to computational evolutionary biology and numerous different types of such structures are known. This article addresses the following question concerning rooted versions of phylogenetic networks. What is the maximum value of pset
Autumn Algorithm-Computation of Hybridization Networks for Realistic Phylogenetic Trees.

Science.gov (United States)

Huson, Daniel H; Linz, Simone

2018-01-01

A minimum hybridization network is a rooted phylogenetic network that displays two given rooted phylogenetic trees using a minimum number of reticulations. Previous mathematical work on their calculation has usually assumed the input trees to be bifurcating, correctly rooted, or that they both contain the same taxa. These assumptions do not hold in biological studies and "realistic" trees have multifurcations, are difficult to root, and rarely contain the same taxa. We present a new algorithm for computing minimum hybridization networks for a given pair of "realistic" rooted phylogenetic trees. We also describe how the algorithm might be used to improve the rooting of the input trees. We introduce the concept of "autumn trees", a nice framework for the formulation of algorithms based on the mathematics of "maximum acyclic agreement forests". While the main computational problem is hard, the run-time depends mainly on how different the given input trees are. In biological studies, where the trees are reasonably similar, our parallel implementation performs well in practice. The algorithm is available in our open source program Dendroscope 3, providing a platform for biologists to explore rooted phylogenetic networks. We demonstrate the utility of the algorithm using several previously studied data sets.
The phylogenetic relationships among infraorders and superfamilies of Diptera based on morphological evidence

DEFF Research Database (Denmark)

Lambkin, Christine L.; Sinclair, Bradley J.; Pape, Thomas

2013-01-01

Members of the megadiverse insect order Diptera (flies) have successfully colonized all continents and nearly all habitats. There are more than 154 000 described fly species, representing 1012% of animal species. Elucidating the phylogenetic relationships of such a large component of global...... biodiversity is challenging, but significant advances have been made in the last few decades. Since Hennig first discussed the monophyly of major groupings, Diptera has attracted much study, but most researchers have used non-numerical qualitative methods to assess morphological data. More recently......, quantitative phylogenetic methods have been used on both morphological and molecular data. All previous quantitative morphological studies addressed narrower phylogenetic problems, often below the suborder or infraorder level. Here we present the first numerical analysis of phylogenetic relationships...
Virulence, serotype and phylogenetic groups of diarrhoeagenic ...

African Journals Online (AJOL)

Dr DADIE Thomas

2014-02-17

Feb 17, 2014 ... Phylogenetic characteristics play an important role for traceability and knowledge of ... amplification reaction was performed by PCR in a 25 μl reaction ...... Hierarchical grouping to optimize an objective function. J. Am. Statist.
Aquatic insect ecophysiological traits reveal phylogenetically based differences in dissolved cadmium susceptibility.

Science.gov (United States)

Buchwalter, David B; Cain, Daniel J; Martin, Caitrin A; Xie, Lingtian; Luoma, Samuel N; Garland, Theodore

2008-06-17

We used a phylogenetically based comparative approach to evaluate the potential for physiological studies to reveal patterns of diversity in traits related to susceptibility to an environmental stressor, the trace metal cadmium (Cd). Physiological traits related to Cd bioaccumulation, compartmentalization, and ultimately susceptibility were measured in 21 aquatic insect species representing the orders Ephemeroptera, Plecoptera, and Trichoptera. We mapped these experimentally derived physiological traits onto a phylogeny and quantified the tendency for related species to be similar (phylogenetic signal). All traits related to Cd bioaccumulation and susceptibility exhibited statistically significant phylogenetic signal, although the signal strength varied among traits. Conventional and phylogenetically based regression models were compared, revealing great variability within orders but consistent, strong differences among insect families. Uptake and elimination rate constants were positively correlated among species, but only when effects of body size and phylogeny were incorporated in the analysis. Together, uptake and elimination rates predicted dramatic Cd bioaccumulation differences among species that agreed with field-based measurements. We discovered a potential tradeoff between the ability to eliminate Cd and the ability to detoxify it across species, particularly mayflies. The best-fit regression models were driven by phylogenetic parameters (especially differences among families) rather than functional traits, suggesting that it may eventually be possible to predict a taxon's physiological performance based on its phylogenetic position, provided adequate physiological information is available for close relatives. There appears to be great potential for evolutionary physiological approaches to augment our understanding of insect responses to environmental stressors in nature.
Measures of phylogenetic differentiation provide robust and complementary insights into microbial communities.

Science.gov (United States)

Parks, Donovan H; Beiko, Robert G

2013-01-01

High-throughput sequencing techniques have made large-scale spatial and temporal surveys of microbial communities routine. Gaining insight into microbial diversity requires methods for effectively analyzing and visualizing these extensive data sets. Phylogenetic β-diversity measures address this challenge by allowing the relationship between large numbers of environmental samples to be explored using standard multivariate analysis techniques. Despite the success and widespread use of phylogenetic β-diversity measures, an extensive comparative analysis of these measures has not been performed. Here, we compare 39 measures of phylogenetic β diversity in order to establish the relative similarity of these measures along with key properties and performance characteristics. While many measures are highly correlated, those commonly used within microbial ecology were found to be distinct from those popular within classical ecology, and from the recently recommended Gower and Canberra measures. Many of the measures are surprisingly robust to different rootings of the gene tree, the choice of similarity threshold used to define operational taxonomic units, and the presence of outlying basal lineages. Measures differ considerably in their sensitivity to rare organisms, and the effectiveness of measures can vary substantially under alternative models of differentiation. Consequently, the depth of sequencing required to reveal underlying patterns of relationships between environmental samples depends on the selected measure. Our results demonstrate that using complementary measures of phylogenetic β diversity can further our understanding of how communities are phylogenetically differentiated. Open-source software implementing the phylogenetic β-diversity measures evaluated in this manuscript is available at http://kiwi.cs.dal.ca/Software/ExpressBetaDiversity.
An efficient and extensible approach for compressing phylogenetic trees

KAUST Repository

Matthews, Suzanne J

2011-01-01

Background: Biologists require new algorithms to efficiently compress and store their large collections of phylogenetic trees. Our previous work showed that TreeZip is a promising approach for compressing phylogenetic trees. In this paper, we extend our TreeZip algorithm by handling trees with weighted branches. Furthermore, by using the compressed TreeZip file as input, we have designed an extensible decompressor that can extract subcollections of trees, compute majority and strict consensus trees, and merge tree collections using set operations such as union, intersection, and set difference.Results: On unweighted phylogenetic trees, TreeZip is able to compress Newick files in excess of 98%. On weighted phylogenetic trees, TreeZip is able to compress a Newick file by at least 73%. TreeZip can be combined with 7zip with little overhead, allowing space savings in excess of 99% (unweighted) and 92%(weighted). Unlike TreeZip, 7zip is not immune to branch rotations, and performs worse as the level of variability in the Newick string representation increases. Finally, since the TreeZip compressed text (TRZ) file contains all the semantic information in a collection of trees, we can easily filter and decompress a subset of trees of interest (such as the set of unique trees), or build the resulting consensus tree in a matter of seconds. We also show the ease of which set operations can be performed on TRZ files, at speeds quicker than those performed on Newick or 7zip compressed Newick files, and without loss of space savings.Conclusions: TreeZip is an efficient approach for compressing large collections of phylogenetic trees. The semantic and compact nature of the TRZ file allow it to be operated upon directly and quickly, without a need to decompress the original Newick file. We believe that TreeZip will be vital for compressing and archiving trees in the biological community. © 2011 Matthews and Williams; licensee BioMed Central Ltd.
The performance of phylogenetic algorithms in estimating haplotype genealogies with migration.

Science.gov (United States)

Salzburger, Walter; Ewing, Greg B; Von Haeseler, Arndt

2011-05-01

Genealogies estimated from haplotypic genetic data play a prominent role in various biological disciplines in general and in phylogenetics, population genetics and phylogeography in particular. Several software packages have specifically been developed for the purpose of reconstructing genealogies from closely related, and hence, highly similar haplotype sequence data. Here, we use simulated data sets to test the performance of traditional phylogenetic algorithms, neighbour-joining, maximum parsimony and maximum likelihood in estimating genealogies from nonrecombining haplotypic genetic data. We demonstrate that these methods are suitable for constructing genealogies from sets of closely related DNA sequences with or without migration. As genealogies based on phylogenetic reconstructions are fully resolved, but not necessarily bifurcating, and without reticulations, these approaches outperform widespread 'network' constructing methods. In our simulations of coalescent scenarios involving panmictic, symmetric and asymmetric migration, we found that phylogenetic reconstruction methods performed well, while the statistical parsimony approach as implemented in TCS performed poorly. Overall, parsimony as implemented in the PHYLIP package performed slightly better than other methods. We further point out that we are not making the case that widespread 'network' constructing methods are bad, but that traditional phylogenetic tree finding methods are applicable to haplotypic data and exhibit reasonable performance with respect to accuracy and robustness. We also discuss some of the problems of converting a tree to a haplotype genealogy, in particular that it is nonunique. © 2011 Blackwell Publishing Ltd.
Conservation threats and the phylogenetic utility of IUCN Red List rankings in Incilius toads.

Science.gov (United States)

Schachat, Sandra R; Mulcahy, Daniel G; Mendelson, Joseph R

2016-02-01

Phylogenetic analysis of extinction threat is an emerging tool in the field of conservation. However, there are problems with the methods and data as commonly used. Phylogenetic sampling usually extends to the level of family or genus, but International Union for Conservation of Nature (IUCN) rankings are available only for individual species, and, although different species within a taxonomic group may have the same IUCN rank, the species may have been ranked as such for different reasons. Therefore, IUCN rank may not reflect evolutionary history and thus may not be appropriate for use in a phylogenetic context. To be used appropriately, threat-risk data should reflect the cause of extinction threat rather than the IUCN threat ranking. In a case study of the toad genus Incilius, with phylogenetic sampling at the species level (so that the resolution of the phylogeny matches character data from the IUCN Red List), we analyzed causes of decline and IUCN threat rankings by calculating metrics of phylogenetic signal (such as Fritz and Purvis' D). We also analyzed the extent to which cause of decline and threat ranking overlap by calculating phylogenetic correlation between these 2 types of character data. Incilius species varied greatly in both threat ranking and cause of decline; this variability would be lost at a coarser taxonomic resolution. We found far more phylogenetic signal, likely correlated with evolutionary history, for causes of decline than for IUCN threat ranking. Individual causes of decline and IUCN threat rankings were largely uncorrelated on the phylogeny. Our results demonstrate the importance of character selection and taxonomic resolution when extinction threat is analyzed in a phylogenetic context. © 2015 Society for Conservation Biology.
Molecular phylogenetics of porcini mushrooms (Boletus section Boletus).

Science.gov (United States)

Dentinger, Bryn T M; Ammirati, Joseph F; Both, Ernst E; Desjardin, Dennis E; Halling, Roy E; Henkel, Terry W; Moreau, Pierre-Arthur; Nagasawa, Eiji; Soytong, Kasem; Taylor, Andy F; Watling, Roy; Moncalvo, Jean-Marc; McLaughlin, David J

2010-12-01

Porcini (Boletus section Boletus: Boletaceae: Boletineae: Boletales) are a conspicuous group of wild, edible mushrooms characterized by fleshy fruiting bodies with a poroid hymenophore that is "stuffed" with white hyphae when young. Their reported distribution is with ectomycorrhizal plants throughout the Northern Hemisphere. Little progress has been made on the systematics of this group using modern molecular phylogenetic tools because sampling has been limited primarily to European species and the genes employed were insufficient to resolve the phylogeny. We examined the evolutionary history of porcini by using a global geographic sampling of most known species, new discoveries from little explored areas, and multiple genes. We used 78 sequences from the fast-evolving nuclear internal transcribed spacers and are able to recognize 18 reciprocally monophyletic species. To address whether or not porcini form a monophyletic group, we compiled a broadly sampled dataset of 41 taxa, including other members of the Boletineae, and used separate and combined phylogenetic analysis of sequences from the nuclear large subunit ribosomal DNA, the largest subunit of RNA polymerase II, and the mitochondrial ATPase subunit six gene. Contrary to previous studies, our separate and combined phylogenetic analyses support the monophyly of porcini. We also report the discovery of two taxa that expand the known distribution of porcini to Australia and Thailand and have ancient phylogenetic connections to the rest of the group. A relaxed molecular clock analysis with these new taxa dates the origin of porcini to between 42 and 54 million years ago, coinciding with the initial diversification of angiosperms, during the Eocene epoch when the climate was warm and humid. These results reveal an unexpected diversity, distribution, and ancient origin of a group of commercially valuable mushrooms that may provide an economic incentive for conservation and support the hypothesis of a tropical
Fast Computations for Measures of Phylogenetic Beta Diversity.

Directory of Open Access Journals (Sweden)

Constantinos Tsirogiannis

Full Text Available For many applications in ecology, it is important to examine the phylogenetic relations between two communities of species. More formally, let [Formula: see text] be a phylogenetic tree and let A and B be two samples of its tips, representing the examined communities. We want to compute a value that expresses the phylogenetic diversity between A and B in [Formula: see text]. There exist several measures that can do this; these are the so-called phylogenetic beta diversity (β-diversity measures. Two popular measures of this kind are the Community Distance (CD and the Common Branch Length (CBL. In most applications, it is not sufficient to compute the value of a beta diversity measure for two communities A and B; we also want to know if this value is relatively large or small compared to all possible pairs of communities in [Formula: see text] that have the same size. To decide this, the ideal approach is to compute a standardised index that involves the mean and the standard deviation of this measure among all pairs of species samples that have the same number of elements as A and B. However, no method exists for computing exactly and efficiently this index for CD and CBL. We present analytical expressions for computing the expectation and the standard deviation of CD and CBL. Based on these expressions, we describe efficient algorithms for computing the standardised indices of the two measures. Using standard algorithmic analysis, we provide guarantees on the theoretical efficiency of our algorithms. We implemented our algorithms and measured their efficiency in practice. Our implementations compute the standardised indices of CD and CBL in less than twenty seconds for a hundred pairs of samples on trees with 7 ⋅ 10(4 tips. Our implementations are available through the R package PhyloMeasures.
Bias in phylogenetic reconstruction of vertebrate rhodopsin sequences.

Science.gov (United States)

Chang, B S; Campbell, D L

2000-08-01

Two spurious nodes were found in phylogenetic analyses of vertebrate rhodopsin sequences in comparison with well-established vertebrate relationships. These spurious reconstructions were well supported in bootstrap analyses and occurred independently of the method of phylogenetic analysis used (parsimony, distance, or likelihood). Use of this data set of vertebrate rhodopsin sequences allowed us to exploit established vertebrate relationships, as well as the considerable amount known about the molecular evolution of this gene, in order to identify important factors contributing to the spurious reconstructions. Simulation studies using parametric bootstrapping indicate that it is unlikely that the spurious nodes in the parsimony analyses are due to long branches or other topological effects. Rather, they appear to be due to base compositional bias at third positions, codon bias, and convergent evolution at nucleotide positions encoding the hydrophobic residues isoleucine, leucine, and valine. LogDet distance methods, as well as maximum-likelihood methods which allow for nonstationary changes in base composition, reduce but do not entirely eliminate support for the spurious resolutions. Inclusion of five additional rhodopsin sequences in the phylogenetic analyses largely corrected one of the spurious reconstructions while leaving the other unaffected. The additional sequences not only were more proximal to the corrected node, but were also found to have intermediate levels of base composition and codon bias as compared with neighboring sequences on the tree. This study shows that the spurious reconstructions can be corrected either by excluding third positions, as well as those encoding the amino acids Ile, Val, and Leu (which may not be ideal, as these sites can contain useful phylogenetic signal for other parts of the tree), or by the addition of sequences that reduce problems associated with convergent evolution.
Local-scale Partitioning of Functional and Phylogenetic Beta Diversity in a Tropical Tree Assemblage.

Science.gov (United States)

Yang, Jie; Swenson, Nathan G; Zhang, Guocheng; Ci, Xiuqin; Cao, Min; Sha, Liqing; Li, Jie; Ferry Slik, J W; Lin, Luxiang

2015-08-03

The relative degree to which stochastic and deterministic processes underpin community assembly is a central problem in ecology. Quantifying local-scale phylogenetic and functional beta diversity may shed new light on this problem. We used species distribution, soil, trait and phylogenetic data to quantify whether environmental distance, geographic distance or their combination are the strongest predictors of phylogenetic and functional beta diversity on local scales in a 20-ha tropical seasonal rainforest dynamics plot in southwest China. The patterns of phylogenetic and functional beta diversity were generally consistent. The phylogenetic and functional dissimilarity between subplots (10 × 10 m, 20 × 20 m, 50 × 50 m and 100 × 100 m) was often higher than that expected by chance. The turnover of lineages and species function within habitats was generally slower than that across habitats. Partitioning the variation in phylogenetic and functional beta diversity showed that environmental distance was generally a better predictor of beta diversity than geographic distance thereby lending relatively more support for deterministic environmental filtering over stochastic processes. Overall, our results highlight that deterministic processes play a stronger role than stochastic processes in structuring community composition in this diverse assemblage of tropical trees.
Phylogenetic networks do not need to be complex: using fewer reticulations to represent conflicting clusters

NARCIS (Netherlands)

Iersel, van L.J.J.; Kelk, S.M.; Rupp, R.; Huson, D.H.

2010-01-01

Phylogenetic trees are widely used to display estimates of how groups of species are evolved. Each phylogenetic tree can be seen as a collection of clusters, subgroups of the species that evolved from a common ancestor. When phylogenetic trees are obtained for several datasets (e.g. for different
Evidence of two distinct phylogenetic lineages of dog rabies virus circulating in Cambodia.

Science.gov (United States)

Mey, Channa; Metlin, Artem; Duong, Veasna; Ong, Sivuth; In, Sotheary; Horwood, Paul F; Reynes, Jean-Marc; Bourhy, Hervé; Tarantola, Arnaud; Buchy, Philippe

2016-03-01

This first extensive retrospective study of the molecular epidemiology of dog rabies in Cambodia included 149 rabies virus (RABV) entire nucleoprotein sequences obtained from 1998-2011. The sequences were analyzed in conjunction with RABVs from other Asian countries. Phylogenetic reconstruction confirmed the South-East Asian phylogenetic clade comprising viruses from Cambodia, Vietnam, Thailand, Laos and Myanmar. The present study represents the first attempt to classify the phylogenetic lineages inside this clade, resulting in the confirmation that all the Cambodian viruses belonged to the South-East Asian (SEA) clade. Three distinct phylogenetic lineages in the region were established with the majority of viruses from Cambodia closely related to viruses from Thailand, Laos and Vietnam, forming the geographically widespread phylogenetic lineage SEA1. A South-East Asian lineage SEA2 comprised two viruses from Cambodia was identified, which shared a common ancestor with RABVs originating from Laos. Viruses from Myanmar formed separate phylogenetic lineages within the major SEA clade. Bayesian molecular clock analysis suggested that the time to most recent common ancestor (TMRCA) of all Cambodian RABVs dated to around 1950. The TMRCA of the Cambodian SEA1 lineage was around 1964 and that of the SEA2 lineage was around 1953. The results identified three phylogenetically distinct and geographically separated lineages inside the earlier identified major SEA clade, covering at least five countries in the region. A greater understanding of the molecular epidemiology of rabies in South-East Asia is an important step to monitor progress on the efforts to control canine rabies in the region. Copyright © 2015 Elsevier B.V. All rights reserved.
Opposing assembly mechanisms in a neotropical dry forest: implications for phylogenetic and functional community ecology.

Science.gov (United States)

Swenson, Nathan G; Enquist, Brian J

2009-08-01

Species diversity is promoted and maintained by ecological and evolutionary processes operating on species attributes through space and time. The degree to which variability in species function regulates distribution and promotes coexistence of species has been debated. Previous work has attempted to quantify the relative importance of species function by using phylogenetic relatedness as a proxy for functional similarity. The key assumption of this approach is that function is phylogenetically conserved. If this assumption is supported, then the phylogenetic dispersion in a community should mirror the functional dispersion. Here we quantify functional trait dispersion along several key axes of tree life-history variation and on multiple spatial scales in a Neotropical dry-forest community. We next compare these results to previously reported patterns of phylogenetic dispersion in this same forest. We find that, at small spatial scales, coexisting species are typically more functionally clustered than expected, but traits related to adult and regeneration niches are overdispersed. This outcome was repeated when the analyses were stratified by size class. Some of the trait dispersion results stand in contrast to the previously reported phylogenetic dispersion results. In order to address this inconsistency we examined the strength of phylogenetic signal in traits at different depths in the phylogeny. We argue that: (1) while phylogenetic relatedness may be a good general multivariate proxy for ecological similarity, it may have a reduced capacity to depict the functional mechanisms behind species coexistence when coexisting species simultaneously converge and diverge in function; and (2) the previously used metric of phylogenetic signal provided erroneous inferences about trait dispersion when married with patterns of phylogenetic dispersion.
A Consistent Phylogenetic Backbone for the Fungi

Science.gov (United States)

Ebersberger, Ingo; de Matos Simoes, Ricardo; Kupczok, Anne; Gube, Matthias; Kothe, Erika; Voigt, Kerstin; von Haeseler, Arndt

2012-01-01

The kingdom of fungi provides model organisms for biotechnology, cell biology, genetics, and life sciences in general. Only when their phylogenetic relationships are stably resolved, can individual results from fungal research be integrated into a holistic picture of biology. However, and despite recent progress, many deep relationships within the fungi remain unclear. Here, we present the first phylogenomic study of an entire eukaryotic kingdom that uses a consistency criterion to strengthen phylogenetic conclusions. We reason that branches (splits) recovered with independent data and different tree reconstruction methods are likely to reflect true evolutionary relationships. Two complementary phylogenomic data sets based on 99 fungal genomes and 109 fungal expressed sequence tag (EST) sets analyzed with four different tree reconstruction methods shed light from different angles on the fungal tree of life. Eleven additional data sets address specifically the phylogenetic position of Blastocladiomycota, Ustilaginomycotina, and Dothideomycetes, respectively. The combined evidence from the resulting trees supports the deep-level stability of the fungal groups toward a comprehensive natural system of the fungi. In addition, our analysis reveals methodologically interesting aspects. Enrichment for EST encoded data—a common practice in phylogenomic analyses—introduces a strong bias toward slowly evolving and functionally correlated genes. Consequently, the generalization of phylogenomic data sets as collections of randomly selected genes cannot be taken for granted. A thorough characterization of the data to assess possible influences on the tree reconstruction should therefore become a standard in phylogenomic analyses. PMID:22114356
Phylogenetic distribution of large-scale genome patchiness

Directory of Open Access Journals (Sweden)

Hackenberg Michael

2008-04-01

Full Text Available Abstract Background The phylogenetic distribution of large-scale genome structure (i.e. mosaic compositional patchiness has been explored mainly by analytical ultracentrifugation of bulk DNA. However, with the availability of large, good-quality chromosome sequences, and the recently developed computational methods to directly analyze patchiness on the genome sequence, an evolutionary comparative analysis can be carried out at the sequence level. Results The local variations in the scaling exponent of the Detrended Fluctuation Analysis are used here to analyze large-scale genome structure and directly uncover the characteristic scales present in genome sequences. Furthermore, through shuffling experiments of selected genome regions, computationally-identified, isochore-like regions were identified as the biological source for the uncovered large-scale genome structure. The phylogenetic distribution of short- and large-scale patchiness was determined in the best-sequenced genome assemblies from eleven eukaryotic genomes: mammals (Homo sapiens, Pan troglodytes, Mus musculus, Rattus norvegicus, and Canis familiaris, birds (Gallus gallus, fishes (Danio rerio, invertebrates (Drosophila melanogaster and Caenorhabditis elegans, plants (Arabidopsis thaliana and yeasts (Saccharomyces cerevisiae. We found large-scale patchiness of genome structure, associated with in silico determined, isochore-like regions, throughout this wide phylogenetic range. Conclusion Large-scale genome structure is detected by directly analyzing DNA sequences in a wide range of eukaryotic chromosome sequences, from human to yeast. In all these genomes, large-scale patchiness can be associated with the isochore-like regions, as directly detected in silico at the sequence level.

Molecular phylogenetics and historical biogeography of Rhinolophus bats.

Science.gov (United States)

Stoffberg, Samantha; Jacobs, David S; Mackie, Iain J; Matthee, Conrad A

2010-01-01

The phylogenetic relationships within the horseshoe bats (genus Rhinolophus) are poorly resolved, particularly at deeper levels within the tree. We present a better-resolved phylogenetic hypothesis for 30 rhinolophid species based on parsimony and Bayesian analyses of the mitochondrial cytochrome b gene and three nuclear introns (TG, THY and PRKC1). Strong support was found for the existence of two geographic clades within the monophyletic Rhinolophidae: an African group and an Oriental assemblage. The relaxed Bayesian clock method indicated that the two rhinolophid clades diverged approximately 35 million years ago and results from Dispersal Vicariance (DIVA) analysis suggest that the horseshoe bats arose in Asia and subsequently dispersed into Europe and Africa.
Sequence homology at the breakpoint and clinical phenotype of mitochondrial DNA deletion syndromes.

Science.gov (United States)

Sadikovic, Bekim; Wang, Jing; El-Hattab, Ayman W; Landsverk, Megan; Douglas, Ganka; Brundage, Ellen K; Craigen, William J; Schmitt, Eric S; Wong, Lee-Jun C

2010-12-20

Mitochondrial DNA (mtDNA) deletions are a common cause of mitochondrial disorders. Large mtDNA deletions can lead to a broad spectrum of clinical features with different age of onset, ranging from mild mitochondrial myopathies (MM), progressive external ophthalmoplegia (PEO), and Kearns-Sayre syndrome (KSS), to severe Pearson syndrome. The aim of this study is to investigate the molecular signatures surrounding the deletion breakpoints and their association with the clinical phenotype and age at onset. MtDNA deletions in 67 patients were characterized using array comparative genomic hybridization (aCGH) followed by PCR-sequencing of the deletion junctions. Sequence homology including both perfect and imperfect short repeats flanking the deletion regions were analyzed and correlated with clinical features and patients' age group. In all age groups, there was a significant increase in sequence homology flanking the deletion compared to mtDNA background. The youngest patient group (deletion distribution in size and locations, with a significantly lower sequence homology flanking the deletion, and the highest percentage of deletion mutant heteroplasmy. The older age groups showed rather discrete pattern of deletions with 44% of all patients over 6 years old carrying the most common 5 kb mtDNA deletion, which was found mostly in muscle specimens (22/41). Only 15% (3/20) of the young patients (deletion, which is usually present in blood rather than muscle. This group of patients predominantly (16 out of 17) exhibit multisystem disorder and/or Pearson syndrome, while older patients had predominantly neuromuscular manifestations including KSS, PEO, and MM. In conclusion, sequence homology at the deletion flanking regions is a consistent feature of mtDNA deletions. Decreased levels of sequence homology and increased levels of deletion mutant heteroplasmy appear to correlate with earlier onset and more severe disease with multisystem involvement.
Hippo pathway phylogenetics predicts monoubiquitylation of Salvador and Merlin/Nf2.

Directory of Open Access Journals (Sweden)

Robert G Wisotzkey

Full Text Available Recently we employed phylogenetics to predict that the cellular interpretation of TGF-β signals is modulated by monoubiquitylation cycles affecting the Smad4 signal transducer/tumor suppressor. This prediction was subsequently validated by experiments in flies, frogs and mammalian cells. Here we apply a phylogenetic approach to the Hippo pathway and predict that two of its signal transducers, Salvador and Merlin/Nf2 (also a tumor suppressor are regulated by monoubiquitylation. This regulatory mechanism does not lead to protein degradation but instead serves as a highly efficient "off/on" switch when the protein is subsequently deubiquitylated. Overall, our study shows that the creative application of phylogenetics can predict new roles for pathway components and new mechanisms for regulating intercellular signaling pathways.
Forensic application of phylogenetic analyses - Exploration of suspected HIV-1 transmission case.

Science.gov (United States)

Siljic, Marina; Salemovic, Dubravka; Cirkovic, Valentina; Pesic-Pavlovic, Ivana; Ranin, Jovan; Todorovic, Marija; Nikolic, Slobodan; Jevtovic, Djordje; Stanojevic, Maja

2017-03-01

Transmission of human immunodeficiency virus (HIV) between individuals may have important legal implications and therefore may come to require forensic investigation based upon phylogenetic analysis. In criminal trials results of phylogenetic analyses have been used as evidence of responsibility for HIV transmission. In Serbia, as in many countries worldwide, exposure and deliberate transmission of HIV are criminalized. We present the results of applying state of the art phylogenetic analyses, based on pol and env genetic sequences, in exploration of suspected HIV transmission among three subjects: a man and two women, with presumed assumption of transmission direction from one woman to a man. Phylogenetic methods included relevant neighbor-joining (NJ), maximum likelihood (ML) and Bayesian methods of phylogenetic trees reconstruction and hypothesis testing, that has been shown to be the most sensitive for the reconstruction of epidemiological links mostly from sexually infected individuals. End-point limiting-dilution PCR (EPLD-PCR) assay, generating the minimum of 10 sequences per genetic region per subject, was performed to assess HIV quasispecies distribution and to explore the direction of HIV transmission between three subjects. Phylogenetic analysis revealed that the viral sequences from the three subjects were more genetically related to each other than to other strains circulating in the same area with the similar epidemiological profile, forming strongly supported transmission chain, which could be in favour of a priori hypothesis of one of the women infecting the man. However, in the EPLD based phylogenetic trees for both pol and env genetic region, viral sequences of one subject (man) were paraphyletic to those of two other subjects (women), implying the direction of transmission opposite to the a priori assumption. The dated tree in our analysis confirmed the clustering pattern of query sequences. Still, in the context of unsampled sequences and
Developing a statistically powerful measure for quartet tree inference using phylogenetic identities and Markov invariants.

Science.gov (United States)

Sumner, Jeremy G; Taylor, Amelia; Holland, Barbara R; Jarvis, Peter D

2017-12-01

Recently there has been renewed interest in phylogenetic inference methods based on phylogenetic invariants, alongside the related Markov invariants. Broadly speaking, both these approaches give rise to polynomial functions of sequence site patterns that, in expectation value, either vanish for particular evolutionary trees (in the case of phylogenetic invariants) or have well understood transformation properties (in the case of Markov invariants). While both approaches have been valued for their intrinsic mathematical interest, it is not clear how they relate to each other, and to what extent they can be used as practical tools for inference of phylogenetic trees. In this paper, by focusing on the special case of binary sequence data and quartets of taxa, we are able to view these two different polynomial-based approaches within a common framework. To motivate the discussion, we present three desirable statistical properties that we argue any invariant-based phylogenetic method should satisfy: (1) sensible behaviour under reordering of input sequences; (2) stability as the taxa evolve independently according to a Markov process; and (3) explicit dependence on the assumption of a continuous-time process. Motivated by these statistical properties, we develop and explore several new phylogenetic inference methods. In particular, we develop a statistically bias-corrected version of the Markov invariants approach which satisfies all three properties. We also extend previous work by showing that the phylogenetic invariants can be implemented in such a way as to satisfy property (3). A simulation study shows that, in comparison to other methods, our new proposed approach based on bias-corrected Markov invariants is extremely powerful for phylogenetic inference. The binary case is of particular theoretical interest as-in this case only-the Markov invariants can be expressed as linear combinations of the phylogenetic invariants. A wider implication of this is that, for
Phylogenetic analysis of hepatitis B virus in pakistan

International Nuclear Information System (INIS)

Baig, S.; Hasnain, N.U.

2008-01-01

To identify the distribution pattern of Hepatitis B Virus (HBV) genotype in a group of patients and to study its phylogenetic divergence. Two hundred and one HBV infected patients were genotyped for this study. All HbsAg positive individuals, either healthy carriers or suffering from conditions such as acute or chronic hepatitis, cirrhosis and hepatocellular carcinoma were included. Hepatitis B patients co-infected with other hepatic viruses were excluded. Hepatitis B virus DNA was extracted from serum, and subjected to a nested PCR, using the primers type-specific for genotype detection. Phylogenetic analysis was performed in the pre-S1 through S genes of HBV. The divergence was studied through 15 sequences of 967bp submitted to the DBJ/EMBL/GenBank databases accessible under accession number EF584640 through EF584654. Out of 201 patients tested, 156 were males and 45 were females. Genotype D was the predominant type found in 128 (64%) patients followed by A in 47 (23%) and mixed A/D in 26 (13%). Phylogenetic analysis confirmed the dominance of genotype D and subtype ayw2. There was dominance of genotype D subtype ayw2. It had a close resemblance with HBV strains that circulate in Iran, India and Japan. (author)
A phylogenetic analysis of the sugar porters in hemiascomycetous yeasts.

Science.gov (United States)

Palma, Margarida; Goffeau, André; Spencer-Martins, Isabel; Baret, Philippe V

2007-01-01

A total of 214 members of the sugar porter (SP) family (TC 2.A.1.1) from eight hemiascomycetous yeasts: Saccharomyces cerevisiae, Candida glabrata, Kluyveromyces lactis, Ashbya (Eremothecium) gossypii, Debaryomyces hansenii, Yarrowia lipolytica, Candida albicans and Pichia stipitis, were identified. The yeast SPs were classified in 13 different phylogenetic clusters. Specific sugar substrates could be allocated to nine phylogenetic clusters, including two novel TC clusters that are specific to fungi, i.e. the glycerol:H(+) symporter (2.A.1.1.38) and the high-affinity glucose transporter (2.A.1.1.39). Four phylogenetic clusters are identified by the preliminary fifth number Z23, Z24, Z25 and Z26 and the substrates of their members remain undetermined. The amplification of the SP clusters across the Hemiascomycetes reflects adaptation to specific carbon and energy sources available in the habitat of each yeast species. (c) 2007 S. Karger AG, Basel.
Phylogenetic Analysis and Molecular Characterization of Xanthium sibiricum Using DNA Barcoding, PCR-RFLP, and Specific Primers.

Science.gov (United States)

Tomasello, Salvatore; Heubl, Günther

2017-07-01

The fruits of Xanthium sibiricum have been widely used in traditional Chinese medicine for the treatment of nasal sinusitis and headaches. The genus Xanthium (cocklebur) is a taxonomically complex genus. Different taxonomic concepts have been proposed, some including several species, others lumping the different taxa in a few extremely polymorphic species. Due to the morphological similarities between species, the correct authentication of X. sibiricum is very difficult. Therefore, we established a polymerase chain reaction-restriction fragment length polymorphism method and diagnostic PCR based on nuclear internal transcribed spacer and chloroplast trnQ-rps16 barcodes to differentiate X. sibirium from related species.Results from the phylogenetic analyses based on sequence information from four marker regions (plastidal psbA-trnH and trnQ-rps16 and nuclear ITS and D35 ) support those taxonomic concepts accepting a reduced number of species, as four to five major clades are revealed in the phylogenetic reconstructions. X. sibiricum , together with some accessions from closely related taxa, is always supported as monophyletic, constituting a well-defined genetic entity. Allele-specific primer pairs for ITS and trnQ-rps16 were designed to amplify diagnostic products from the genomic DNA of X. sibiricum . Specific PCR in combination with digestion using the restriction enzyme Mse I allowed for the identification of X. sibiricum by producing specific restriction patterns. The results demonstrate that the applied techniques provide effective and accurate authentication of X. sibiricum . Georg Thieme Verlag KG Stuttgart · New York.
High-resolution phylogenetic microbial community profiling

Energy Technology Data Exchange (ETDEWEB)

Singer, Esther; Coleman-Derr, Devin; Bowman, Brett; Schwientek, Patrick; Clum, Alicia; Copeland, Alex; Ciobanu, Doina; Cheng, Jan-Fang; Gies, Esther; Hallam, Steve; Tringe, Susannah; Woyke, Tanja

2014-03-17

The representation of bacterial and archaeal genome sequences is strongly biased towards cultivated organisms, which belong to merely four phylogenetic groups. Functional information and inter-phylum level relationships are still largely underexplored for candidate phyla, which are often referred to as microbial dark matter. Furthermore, a large portion of the 16S rRNA gene records in the GenBank database are labeled as environmental samples and unclassified, which is in part due to low read accuracy, potential chimeric sequences produced during PCR amplifications and the low resolution of short amplicons. In order to improve the phylogenetic classification of novel species and advance our knowledge of the ecosystem function of uncultivated microorganisms, high-throughput full length 16S rRNA gene sequencing methodologies with reduced biases are needed. We evaluated the performance of PacBio single-molecule real-time (SMRT) sequencing in high-resolution phylogenetic microbial community profiling. For this purpose, we compared PacBio and Illumina metagenomic shotgun and 16S rRNA gene sequencing of a mock community as well as of an environmental sample from Sakinaw Lake, British Columbia. Sakinaw Lake is known to contain a large age of microbial species from candidate phyla. Sequencing results show that community structure based on PacBio shotgun and 16S rRNA gene sequences is highly similar in both the mock and the environmental communities. Resolution power and community representation accuracy from SMRT sequencing data appeared to be independent of GC content of microbial genomes and was higher when compared to Illumina-based metagenome shotgun and 16S rRNA gene (iTag) sequences, e.g. full-length sequencing resolved all 23 OTUs in the mock community, while iTags did not resolve closely related species. SMRT sequencing hence offers various potential benefits when characterizing uncharted microbial communities.
Evolutionary history of tall fescue morphotypes inferred from molecular phylogenetics of the Lolium-Festuca species complex

Directory of Open Access Journals (Sweden)

Stewart Alan V

2010-10-01

Full Text Available Abstract Background The agriculturally important pasture grass tall fescue (Festuca arundinacea Schreb. syn. Lolium arundinaceum (Schreb. Darbysh. is an outbreeding allohexaploid, that may be more accurately described as a species complex consisting of three major (Continental, Mediterranean and rhizomatous morphotypes. Observation of hybrid infertility in some crossing combinations between morphotypes suggests the possibility of independent origins from different diploid progenitors. This study aims to clarify the evolutionary relationships between each tall fescue morphotype through phylogenetic analysis using two low-copy nuclear genes (encoding plastid acetyl-CoA carboxylase [Acc1] and centroradialis [CEN], the nuclear ribosomal DNA internal transcribed spacer (rDNA ITS and the chloroplast DNA (cpDNA genome-located matK gene. Other taxa within the closely related Lolium-Festuca species complex were also included in the study, to increase understanding of evolutionary processes in a taxonomic group characterised by multiple inter-specific hybridisation events. Results Putative homoeologous sequences from both nuclear genes were obtained from each polyploid species and compared to counterparts from 15 diploid taxa. Phylogenetic reconstruction confirmed F. pratensis and F. arundinacea var. glaucescens as probable progenitors to Continental tall fescue, and these species are also likely to be ancestral to the rhizomatous morphotype. However, these two morphotypes are sufficiently distinct to be located in separate clades based on the ITS-derived data set. All four of the generated data sets suggest independent evolution of the Mediterranean and Continental morphotypes, with minimal affinity between cognate sequence haplotypes. No obvious candidate progenitor species for Mediterranean tall fescues were identified, and only two putative sub-genome-specific haplotypes were identified for this morphotype. Conclusions This study describes the first
Molecular phylogenetic reconstruction of the endemic Asian salamander family Hynobiidae (Amphibia, Caudata).

Science.gov (United States)

Weisrock, David W; Macey, J Robert; Matsui, Masafumi; Mulcahy, Daniel G; Papenfuss, Theodore J

2013-01-01

The salamander family Hynobiidae contains over 50 species and has been the subject of a number of molecular phylogenetic investigations aimed at reconstructing branches across the entire family. In general, studies using the greatest amount of sequence data have used reduced taxon sampling, while the study with the greatest taxon sampling has used a limited sequence data set. Here, we provide insights into the phylogenetic history of the Hynobiidae using both dense taxon sampling and a large mitochondrial DNA sequence data set. We report exclusive new mitochondrial DNA data of 2566 aligned bases (with 151 excluded sites, of included sites 1157 are variable with 957 parsimony informative). This is sampled from two genic regions encoding a 12S-16S region (the 3' end of 12S rRNA, tRNA(VAI), and the 5' end of 16S rRNA), and a ND2-COI region (ND2, tRNA(Trp), tRNA(Ala), tRNA(Asn), the origin for light strand replication--O(L), tRNA(Cys), tRNAT(Tyr), and the 5' end of COI). Analyses using parsimony, Bayesian, and maximum likelihood optimality criteria produce similar phylogenetic trees, with discordant branches generally receiving low levels of branch support. Monophyly of the Hynobiidae is strongly supported across all analyses, as is the sister relationship and deep divergence between the genus Onychodactylus with all remaining hynobiids. Within this latter grouping our phylogenetic results identify six clades that are relatively divergent from one another, but for which there is minimal support for their phylogenetic placement. This includes the genus Batrachuperus, the genus Hynobius, the genus Pachyhynobius, the genus Salamandrella, a clade containing the genera Ranodon and Paradactylodon, and a clade containing the genera Liua and Pseudohynobius. This latter clade receives low bootstrap support in the parsimony analysis, but is consistent across all three analytical methods. Our results also clarify a number of well-supported relationships within the larger
Identifiability of tree-child phylogenetic networks under a probabilistic recombination-mutation model of evolution.

Science.gov (United States)

Francis, Andrew; Moulton, Vincent

2018-06-07

Phylogenetic networks are an extension of phylogenetic trees which are used to represent evolutionary histories in which reticulation events (such as recombination and hybridization) have occurred. A central question for such networks is that of identifiability, which essentially asks under what circumstances can we reliably identify the phylogenetic network that gave rise to the observed data? Recently, identifiability results have appeared for networks relative to a model of sequence evolution that generalizes the standard Markov models used for phylogenetic trees. However, these results are quite limited in terms of the complexity of the networks that are considered. In this paper, by introducing an alternative probabilistic model for evolution along a network that is based on some ground-breaking work by Thatte for pedigrees, we are able to obtain an identifiability result for a much larger class of phylogenetic networks (essentially the class of so-called tree-child networks). To prove our main theorem, we derive some new results for identifying tree-child networks combinatorially, and then adapt some techniques developed by Thatte for pedigrees to show that our combinatorial results imply identifiability in the probabilistic setting. We hope that the introduction of our new model for networks could lead to new approaches to reliably construct phylogenetic networks. Copyright © 2018 Elsevier Ltd. All rights reserved.
A Genome-Scale Investigation of How Sequence, Function, and Tree-Based Gene Properties Influence Phylogenetic Inference.

Science.gov (United States)

Shen, Xing-Xing; Salichos, Leonidas; Rokas, Antonis

2016-09-02

Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based, and 9 tree-based) with each other and with three measures of phylogenetic signal in two assembled data sets of 2,832 yeast and 2,002 mammalian genes. We found that most gene properties, such as evolutionary rate (measured through the percent average of pairwise identity across taxa) and total tree length, were highly correlated with each other. Similarly, several gene properties, such as gene alignment length, Guanine-Cytosine content, and the proportion of tree distance on internal branches divided by relative composition variability (treeness/RCV), were strongly correlated with phylogenetic signal. Analysis of partial correlations between gene properties and phylogenetic signal in which gene evolutionary rate and alignment length were simultaneously controlled, showed similar patterns of correlations, albeit weaker in strength. Examination of the relative importance of each gene property on phylogenetic signal identified gene alignment length, alongside with number of parsimony-informative sites and variable sites, as the most important predictors. Interestingly, the subsets of gene properties that optimally predicted phylogenetic signal differed considerably across our three phylogenetic measures and two data sets; however, gene alignment length and RCV were consistently included as predictors of all three phylogenetic measures in both yeasts and mammals. These results suggest that a handful of sequence-based gene properties are reliable predictors of phylogenetic signal
Phylo_dCor: distance correlation as a novel metric for phylogenetic profiling.

Science.gov (United States)

Sferra, Gabriella; Fratini, Federica; Ponzi, Marta; Pizzi, Elisabetta

2017-09-05

Elaboration of powerful methods to predict functional and/or physical protein-protein interactions from genome sequence is one of the main tasks in the post-genomic era. Phylogenetic profiling allows the prediction of protein-protein interactions at a whole genome level in both Prokaryotes and Eukaryotes. For this reason it is considered one of the most promising methods. Here, we propose an improvement of phylogenetic profiling that enables handling of large genomic datasets and infer global protein-protein interactions. This method uses the distance correlation as a new measure of phylogenetic profile similarity. We constructed robust reference sets and developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation that makes it applicable to large genomic data. Using Saccharomyces cerevisiae and Escherichia coli genome datasets, we showed that Phylo-dCor outperforms phylogenetic profiling methods previously described based on the mutual information and Pearson's correlation as measures of profile similarity. In this work, we constructed and assessed robust reference sets and propose the distance correlation as a measure for comparing phylogenetic profiles. To make it applicable to large genomic data, we developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation. Two R scripts that can be run on a wide range of machines are available upon request.
Is invasion success of Australian trees mediated by their native biogeography, phylogenetic history, or both?

Science.gov (United States)

Miller, Joseph T; Hui, Cang; Thornhill, Andrew; Gallien, Laure; Le Roux, Johannes J; Richardson, David M

2016-12-30

For a plant species to become invasive it has to progress along the introduction-naturalization-invasion (INI) continuum which reflects the joint direction of niche breadth. Identification of traits that correlate with and drive species invasiveness along the continuum is a major focus of invasion biology. If invasiveness is underlain by heritable traits, and if such traits are phylogenetically conserved, then we would expect non-native species with different introduction status (i.e. position along the INI continuum) to show phylogenetic signal. This study uses two clades that contain a large number of invasive tree species from the genera Acacia and Eucalyptus to test whether geographic distribution and a novel phylogenetic conservation method can predict which species have been introduced, became naturalized, and invasive. Our results suggest that no underlying phylogenetic signal underlie the introduction status for both groups of trees, except for introduced acacias. The more invasive acacia clade contains invasive species that have smoother geographic distributions and are more marginal in the phylogenetic network. The less invasive eucalyptus group contains invasive species that are more clustered geographically, more centrally located in the phylogenetic network and have phylogenetic distances between invasive and non-invasive species that are trending toward the mean pairwise distance. This suggests that highly invasive groups may be identified because they have invasive species with smoother and faster expanding native distributions and are located more to the edges of phylogenetic networks than less invasive groups. Published by Oxford University Press on behalf of the Annals of Botany Company.
TreeScaper: Visualizing and Extracting Phylogenetic Signal from Sets of Trees.

Science.gov (United States)

Huang, Wen; Zhou, Guifang; Marchand, Melissa; Ash, Jeremy R; Morris, David; Van Dooren, Paul; Brown, Jeremy M; Gallivan, Kyle A; Wilgenbusch, Jim C

2016-12-01

Modern phylogenomic analyses often result in large collections of phylogenetic trees representing uncertainty in individual gene trees, variation across genes, or both. Extracting phylogenetic signal from these tree sets can be challenging, as they are difficult to visualize, explore, and quantify. To overcome some of these challenges, we have developed TreeScaper, an application for tree set visualization as well as the identification of distinct phylogenetic signals. GUI and command-line versions of TreeScaper and a manual with tutorials can be downloaded from https://github.com/whuang08/TreeScaper/releases TreeScaper is distributed under the GNU General Public License. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Phylogenetic classification of the halichondrids (Porifera, Demospongiae)

NARCIS (Netherlands)

Soest, van R.W.M.; Díaz, Maria Cristina; Pomponi, Shirley A.

1990-01-01

Using a multicharacter approach and numerical cladistic computer programs a phylogenetic analysis is made of a newly defined order Halichondrida (which includes all Halichondrida and parts of the Axinellida sensu Lévi, 1973), with emphasis on the newly defined family Halichondriidae (which includes
A Model of Desired Performance in Phylogenetic Tree Construction for Teaching Evolution.

Science.gov (United States)

Brewer, Steven D.

This research paper examines phylogenetic tree construction-a form of problem solving in biology-by studying the strategies and heuristics used by experts. One result of the research is the development of a model of desired performance for phylogenetic tree construction. A detailed description of the model and the sample problems which illustrate…
galaxie--CGI scripts for sequence identification through automated phylogenetic analysis.

Science.gov (United States)

Nilsson, R Henrik; Larsson, Karl-Henrik; Ursing, Björn M

2004-06-12

The prevalent use of similarity searches like BLAST to identify sequences and species implicitly assumes the reference database to be of extensive sequence sampling. This is often not the case, restraining the correctness of the outcome as a basis for sequence identification. Phylogenetic inference outperforms similarity searches in retrieving correct phylogenies and consequently sequence identities, and a project was initiated to design a freely available script package for sequence identification through automated Web-based phylogenetic analysis. Three CGI scripts were designed to facilitate qualified sequence identification from a Web interface. Query sequences are aligned to pre-made alignments or to alignments made by ClustalW with entries retrieved from a BLAST search. The subsequent phylogenetic analysis is based on the PHYLIP package for inferring neighbor-joining and parsimony trees. The scripts are highly configurable. A service installation and a version for local use are found at http://andromeda.botany.gu.se/galaxiewelcome.html and http://galaxie.cgb.ki.se
Metagenomic species profiling using universal phylogenetic marker genes

DEFF Research Database (Denmark)

Sunagawa, Shinichi; Mende, Daniel R; Zeller, Georg

2013-01-01

To quantify known and unknown microorganisms at species-level resolution using shotgun sequencing data, we developed a method that establishes metagenomic operational taxonomic units (mOTUs) based on single-copy phylogenetic marker genes. Applied to 252 human fecal samples, the method revealed th...... that on average 43% of the species abundance and 58% of the richness cannot be captured by current reference genome-based methods. An implementation of the method is available at http://www.bork.embl.de/software/mOTU/.......To quantify known and unknown microorganisms at species-level resolution using shotgun sequencing data, we developed a method that establishes metagenomic operational taxonomic units (mOTUs) based on single-copy phylogenetic marker genes. Applied to 252 human fecal samples, the method revealed...

Phylogenetic patterns are not proxies of community assembly mechanisms (they are far better)

NARCIS (Netherlands)

Gerhold, Pille; Cahill, J.F.; Winter, Marten; Bartish, I.V.; Prinzing, Andreas

2015-01-01

The subdiscipline of 'community phylogenetics' is rapidly growing and influencing thinking regarding community assembly. In particular, phylogenetic dispersion of co-occurring species within a community is commonly used as a proxy to identify which community assembly processes may have structured
Seed plant phylogenetic diversity and species richness in conservation planning within a global biodiversity hotspot in eastern Asia.

Science.gov (United States)

Li, Rong; Kraft, Nathan J B; Yu, Haiying; Li, Heng

2015-12-01

One of the main goals of conservation biology is to understand the factors shaping variation in biodiversity across the planet. This understanding is critical for conservation planners to be able to develop effective conservation strategies. Although many studies have focused on species richness and the protection of rare and endemic species, less attention has been paid to the protection of the phylogenetic dimension of biodiversity. We explored how phylogenetic diversity, species richness, and phylogenetic community structure vary in seed plant communities along an elevational gradient in a relatively understudied high mountain region, the Dulong Valley, in southeastern Tibet, China. As expected, phylogenetic diversity was well correlated with species richness among the elevational bands and among communities. At the community level, evergreen broad-leaved forests had the highest levels of species richness and phylogenetic diversity. Using null model analyses, we found evidence of nonrandom phylogenetic structure across the region. Evergreen broad-leaved forests were phylogenetically overdispersed, whereas other vegetation types tended to be phylogenetically clustered. We suggest that communities with high species richness or overdispersed phylogenetic structure should be a focus for biodiversity conservation within the Dulong Valley because these areas may help maximize the potential of this flora to respond to future global change. In biodiversity hotspots worldwide, we suggest that the phylogenetic structure of a community may serve as a useful measure of phylogenetic diversity in the context of conservation planning. © 2015 Society for Conservation Biology.
An Interaction with Ewing's Sarcoma Breakpoint Protein EWS Defines a Specific Oncogenic Mechanism of ETS Factors Rearranged in Prostate Cancer.

Science.gov (United States)

Kedage, Vivekananda; Selvaraj, Nagarathinam; Nicholas, Taylor R; Budka, Justin A; Plotnik, Joshua P; Jerde, Travis J; Hollenhorst, Peter C

2016-10-25

More than 50% of prostate tumors have a chromosomal rearrangement resulting in aberrant expression of an oncogenic ETS family transcription factor. However, mechanisms that differentiate the function of oncogenic ETS factors expressed in prostate tumors from non-oncogenic ETS factors expressed in normal prostate are unknown. Here, we find that four oncogenic ETS (ERG, ETV1, ETV4, and ETV5), and no other ETS, interact with the Ewing's sarcoma breakpoint protein, EWS. This EWS interaction was necessary and sufficient for oncogenic ETS functions including gene activation, cell migration, clonogenic survival, and transformation. Significantly, the EWS interacting region of ERG has no homology with that of ETV1, ETV4, and ETV5. Therefore, this finding may explain how divergent ETS factors have a common oncogenic function. Strikingly, EWS is fused to various ETS factors by the chromosome translocations that cause Ewing's sarcoma. Therefore, these findings link oncogenic ETS function in both prostate cancer and Ewing's sarcoma. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
Bacterial phylogenetic reconstruction from whole genomes is robust to recombination but demographic inference is not.

Science.gov (United States)

Hedge, Jessica; Wilson, Daniel J

2014-11-25

Phylogenetic inference in bacterial genomics is fundamental to understanding problems such as population history, antimicrobial resistance, and transmission dynamics. The field has been plagued by an apparent state of contradiction since the distorting effects of recombination on phylogeny were discovered more than a decade ago. Researchers persist with detailed phylogenetic analyses while simultaneously acknowledging that recombination seriously misleads inference of population dynamics and selection. Here we resolve this paradox by showing that phylogenetic tree topologies based on whole genomes robustly reconstruct the clonal frame topology but that branch lengths are badly skewed. Surprisingly, removing recombining sites can exacerbate branch length distortion caused by recombination. Phylogenetic tree reconstruction is a popular approach for understanding the relatedness of bacteria in a population from differences in their genome sequences. However, bacteria frequently exchange regions of their genomes by a process called homologous recombination, which violates a fundamental assumption of phylogenetic methods. Since many researchers continue to use phylogenetics for recombining bacteria, it is important to understand how recombination affects the conclusions drawn from these analyses. We find that whole-genome sequences afford great accuracy in reconstructing evolutionary relationships despite concerns surrounding the presence of recombination, but the branch lengths of the phylogenetic tree are indeed badly distorted. Surprisingly, methods to reduce the impact of recombination on branch lengths can exacerbate the problem. Copyright © 2014 Hedge and Wilson.
Phylogenetics and differentiation of Salmonella Newport lineages by whole genome sequencing.

Directory of Open Access Journals (Sweden)

Guojie Cao

Full Text Available Salmonella Newport has ranked in the top three Salmonella serotypes associated with foodborne outbreaks from 1995 to 2011 in the United States. In the current study, we selected 26 S. Newport strains isolated from diverse sources and geographic locations and then conducted 454 shotgun pyrosequencing procedures to obtain 16-24 × coverage of high quality draft genomes for each strain. Comparative genomic analysis of 28 S. Newport strains (including 2 reference genomes and 15 outgroup genomes identified more than 140,000 informative SNPs. A resulting phylogenetic tree consisted of four sublineages and indicated that S. Newport had a clear geographic structure. Strains from Asia were divergent from those from the Americas. Our findings demonstrated that analysis using whole genome sequencing data resulted in a more accurate picture of phylogeny compared to that using single genes or small sets of genes. We selected loci around the mutS gene of S. Newport to differentiate distinct lineages, including those between invH and mutS genes at the 3' end of Salmonella Pathogenicity Island 1 (SPI-1, ste fimbrial operon, and Clustered, Regularly Interspaced, Short Palindromic Repeats (CRISPR associated-proteins (cas. These genes in the outgroup genomes held high similarity with either S. Newport Lineage II or III at the same loci. S. Newport Lineages II and III have different evolutionary histories in this region and our data demonstrated genetic flow and homologous recombination events around mutS. The findings suggested that S. Newport Lineages II and III diverged early in the serotype evolution and have evolved largely independently. Moreover, we identified genes that could delineate sublineages within the phylogenetic tree and that could be used as potential biomarkers for trace-back investigations during outbreaks. Thus, whole genome sequencing data enabled us to better understand the genetic background of pathogenicity and evolutionary history of S
Tetrapods on the EDGE: Overcoming data limitations to identify phylogenetic conservation priorities

Science.gov (United States)

Gray, Claudia L.; Wearn, Oliver R.; Owen, Nisha R.

2018-01-01

The scale of the ongoing biodiversity crisis requires both effective conservation prioritisation and urgent action. As extinction is non-random across the tree of life, it is important to prioritise threatened species which represent large amounts of evolutionary history. The EDGE metric prioritises species based on their Evolutionary Distinctiveness (ED), which measures the relative contribution of a species to the total evolutionary history of their taxonomic group, and Global Endangerment (GE), or extinction risk. EDGE prioritisations rely on adequate phylogenetic and extinction risk data to generate meaningful priorities for conservation. However, comprehensive phylogenetic trees of large taxonomic groups are extremely rare and, even when available, become quickly out-of-date due to the rapid rate of species descriptions and taxonomic revisions. Thus, it is important that conservationists can use the available data to incorporate evolutionary history into conservation prioritisation. We compared published and new methods to estimate missing ED scores for species absent from a phylogenetic tree whilst simultaneously correcting the ED scores of their close taxonomic relatives. We found that following artificial removal of species from a phylogenetic tree, the new method provided the closest estimates of their “true” ED score, differing from the true ED score by an average of less than 1%, compared to the 31% and 38% difference of the previous methods. The previous methods also substantially under- and over-estimated scores as more species were artificially removed from a phylogenetic tree. We therefore used the new method to estimate ED scores for all tetrapods. From these scores we updated EDGE prioritisation rankings for all tetrapod species with IUCN Red List assessments, including the first EDGE prioritisation for reptiles. Further, we identified criteria to identify robust priority species in an effort to further inform conservation action whilst
Comparison of sequence-based and structure-based phylogenetic ...

Indian Academy of Sciences (India)

Prakash

phylogenetic tree construction methods, has been considered as an equivalent of .... Further detailed analysis described is restricted to the first two groups only. ..... Aspartate-ammonia ligase. Plant virus ..... enzymatic activities?; Trends ...
SWPhylo - A Novel Tool for Phylogenomic Inferences by Comparison of Oligonucleotide Patterns and Integration of Genome-Based and Gene-Based Phylogenetic Trees.

Science.gov (United States)

Yu, Xiaoyu; Reva, Oleg N

2018-01-01

Modern phylogenetic studies may benefit from the analysis of complete genome sequences of various microorganisms. Evolutionary inferences based on genome-scale analysis are believed to be more accurate than the gene-based alternative. However, the computational complexity of current phylogenomic procedures, inappropriateness of standard phylogenetic tools to process genome-wide data, and lack of reliable substitution models which correlates with alignment-free phylogenomic approaches deter microbiologists from using these opportunities. For example, the super-matrix and super-tree approaches of phylogenomics use multiple integrated genomic loci or individual gene-based trees to infer an overall consensus tree. However, these approaches potentially multiply errors of gene annotation and sequence alignment not mentioning the computational complexity and laboriousness of the methods. In this article, we demonstrate that the annotation- and alignment-free comparison of genome-wide tetranucleotide frequencies, termed oligonucleotide usage patterns (OUPs), allowed a fast and reliable inference of phylogenetic trees. These were congruent to the corresponding whole genome super-matrix trees in terms of tree topology when compared with other known approaches including 16S ribosomal RNA and GyrA protein sequence comparison, complete genome-based MAUVE, and CVTree methods. A Web-based program to perform the alignment-free OUP-based phylogenomic inferences was implemented at http://swphylo.bi.up.ac.za/. Applicability of the tool was tested on different taxa from subspecies to intergeneric levels. Distinguishing between closely related taxonomic units may be enforced by providing the program with alignments of marker protein sequences, eg, GyrA.
On the number of vertices of each rank in phylogenetic trees and their generalizations

OpenAIRE

Bóna, Miklós

2015-01-01

We find surprisingly simple formulas for the limiting probability that the rank of a randomly selected vertex in a randomly selected phylogenetic tree or generalized phylogenetic tree is a given integer.
CORE: a phylogenetically-curated 16S rDNA database of the core oral microbiome.

Directory of Open Access Journals (Sweden)

Ann L Griffen

2011-04-01

Full Text Available Comparing bacterial 16S rDNA sequences to GenBank and other large public databases via BLAST often provides results of little use for identification and taxonomic assignment of the organisms of interest. The human microbiome, and in particular the oral microbiome, includes many taxa, and accurate identification of sequence data is essential for studies of these communities. For this purpose, a phylogenetically curated 16S rDNA database of the core oral microbiome, CORE, was developed. The goal was to include a comprehensive and minimally redundant representation of the bacteria that regularly reside in the human oral cavity with computationally robust classification at the level of species and genus. Clades of cultivated and uncultivated taxa were formed based on sequence analyses using multiple criteria, including maximum-likelihood-based topology and bootstrap support, genetic distance, and previous naming. A number of classification inconsistencies for previously named species, especially at the level of genus, were resolved. The performance of the CORE database for identifying clinical sequences was compared to that of three publicly available databases, GenBank nr/nt, RDP and HOMD, using a set of sequencing reads that had not been used in creation of the database. CORE offered improved performance compared to other public databases for identification of human oral bacterial 16S sequences by a number of criteria. In addition, the CORE database and phylogenetic tree provide a framework for measures of community divergence, and the focused size of the database offers advantages of efficiency for BLAST searching of large datasets. The CORE database is available as a searchable interface and for download at http://microbiome.osu.edu.
A Penalized Likelihood Framework For High-Dimensional Phylogenetic Comparative Methods And An Application To New-World Monkeys Brain Evolution.

Science.gov (United States)

Julien, Clavel; Leandro, Aristide; Hélène, Morlon

2018-06-19

Working with high-dimensional phylogenetic comparative datasets is challenging because likelihood-based multivariate methods suffer from low statistical performances as the number of traits p approaches the number of species n and because some computational complications occur when p exceeds n. Alternative phylogenetic comparative methods have recently been proposed to deal with the large p small n scenario but their use and performances are limited. Here we develop a penalized likelihood framework to deal with high-dimensional comparative datasets. We propose various penalizations and methods for selecting the intensity of the penalties. We apply this general framework to the estimation of parameters (the evolutionary trait covariance matrix and parameters of the evolutionary model) and model comparison for the high-dimensional multivariate Brownian (BM), Early-burst (EB), Ornstein-Uhlenbeck (OU) and Pagel's lambda models. We show using simulations that our penalized likelihood approach dramatically improves the estimation of evolutionary trait covariance matrices and model parameters when p approaches n, and allows for their accurate estimation when p equals or exceeds n. In addition, we show that penalized likelihood models can be efficiently compared using Generalized Information Criterion (GIC). We implement these methods, as well as the related estimation of ancestral states and the computation of phylogenetic PCA in the R package RPANDA and mvMORPH. Finally, we illustrate the utility of the new proposed framework by evaluating evolutionary models fit, analyzing integration patterns, and reconstructing evolutionary trajectories for a high-dimensional 3-D dataset of brain shape in the New World monkeys. We find a clear support for an Early-burst model suggesting an early diversification of brain morphology during the ecological radiation of the clade. Penalized likelihood offers an efficient way to deal with high-dimensional multivariate comparative data.
[A phylogenetic analysis of plant communities of Teberda Biosphere Reserve].

Science.gov (United States)

Shulakov, A A; Egorov, A V; Onipchenko, V G

2016-01-01

Phylogenetic analysis of communities is based on the comparison of distances on the phylogenetic tree between species of a community under study and those distances in random samples taken out of local flora. It makes it possible to determine to what extent a community composition is formed by more closely related species (i.e., "clustered") or, on the opposite, it is more even and includes species that are less related with each other. The first case is usually interpreted as a result of strong influence caused by abiotic factors, due to which species with similar ecology, a priori more closely related, would remain: In the second case, biotic factors, such as competition, may come to the fore and lead to forming a community out of distant clades due to divergence of their ecological niches: The aim of this' study Was Ad explore the phylogenetic structure in communities of the northwestern Caucasus at two spatial scales - the scale of area from 4 to 100 m2 and the smaller scale within a community. The list of local flora of the alpine belt has been composed using the database of geobotanic descriptions carried out in Teberda Biosphere Reserve at true altitudes exceeding.1800 m. It includes 585 species of flowering plants belonging to 57 families. Basal groups of flowering plants are.not represented in the list. At the scale of communities of three classes, namely Thlaspietea rotundifolii - commumties formed on screes and pebbles, Calluno-Ulicetea - alpine meadow, and Mulgedio-Aconitetea subalpine meadows, have not demonstrated significant distinction of phylogenetic structure. At intra level, for alpine meadows the larger share of closely related species. (clustered community) is detected. Significantly clustered happen to be those communities developing on rocks (class Asplenietea trichomanis) and alpine (class Juncetea trifidi). At the same time, alpine lichen proved to have even phylogenetic structure at the small scale. Alpine (class Salicetea herbaceae) that
Fast phylogenetic DNA barcoding

DEFF Research Database (Denmark)

Terkelsen, Kasper Munch; Boomsma, Wouter Krogh; Willerslev, Eske

2008-01-01

We present a heuristic approach to the DNA assignment problem based on phylogenetic inferences using constrained neighbour joining and non-parametric bootstrapping. We show that this method performs as well as the more computationally intensive full Bayesian approach in an analysis of 500 insect...... DNA sequences obtained from GenBank. We also analyse a previously published dataset of environmental DNA sequences from soil from New Zealand and Siberia, and use these data to illustrate the fact that statistical approaches to the DNA assignment problem allow for more appropriate criteria...... for determining the taxonomic level at which a particular DNA sequence can be assigned....
BioMatriX: Sequence analysis, structure visualization, phylogenetics ...

African Journals Online (AJOL)

bmx-biomatrix.blogspot.com) developed for biological science community to augment scientific research regarding genomics, proteomics, phylogenetics and linkage analysis in one platform. BioMatriX offers multi-functional services to perform ...
Phylogenetic footprints in organizational behavior

OpenAIRE

Witt, Ulrich; Schwesinger, Georg

2012-01-01

An evolutionary tool kit is applied in this paper to explain how innate social behavior traits evolved in early human groups. These traits were adapted to the particular production requirements of the group in human phylogeny. They shaped the group members' attitudes towards contributing to the group's goals and towards other group members. We argue that these attitudes are still present in modern humans and leave their phylogenetic footprints also in present-day organizational life. We discu...
Path integral formulation and Feynman rules for phylogenetic branching models

Energy Technology Data Exchange (ETDEWEB)

Jarvis, P D; Bashford, J D; Sumner, J G [School of Mathematics and Physics, University of Tasmania, GPO Box 252C, 7001 Hobart, TAS (Australia)

2005-11-04

A dynamical picture of phylogenetic evolution is given in terms of Markov models on a state space, comprising joint probability distributions for character types of taxonomic classes. Phylogenetic branching is a process which augments the number of taxa under consideration, and hence the rank of the underlying joint probability state tensor. We point out the combinatorial necessity for a second-quantized, or Fock space setting, incorporating discrete counting labels for taxa and character types, to allow for a description in the number basis. Rate operators describing both time evolution without branching, and also phylogenetic branching events, are identified. A detailed development of these ideas is given, using standard transcriptions from the microscopic formulation of non-equilibrium reaction-diffusion or birth-death processes. These give the relations between stochastic rate matrices, the matrix elements of the corresponding evolution operators representing them, and the integral kernels needed to implement these as path integrals. The 'free' theory (without branching) is solved, and the correct trilinear 'interaction' terms (representing branching events) are presented. The full model is developed in perturbation theory via the derivation of explicit Feynman rules which establish that the probabilities (pattern frequencies of leaf colourations) arising as matrix elements of the time evolution operator are identical with those computed via the standard analysis. Simple examples (phylogenetic trees with two or three leaves), are discussed in detail. Further implications for the work are briefly considered including the role of time reparametrization covariance.
Path integral formulation and Feynman rules for phylogenetic branching models

International Nuclear Information System (INIS)

Jarvis, P D; Bashford, J D; Sumner, J G

2005-01-01

A dynamical picture of phylogenetic evolution is given in terms of Markov models on a state space, comprising joint probability distributions for character types of taxonomic classes. Phylogenetic branching is a process which augments the number of taxa under consideration, and hence the rank of the underlying joint probability state tensor. We point out the combinatorial necessity for a second-quantized, or Fock space setting, incorporating discrete counting labels for taxa and character types, to allow for a description in the number basis. Rate operators describing both time evolution without branching, and also phylogenetic branching events, are identified. A detailed development of these ideas is given, using standard transcriptions from the microscopic formulation of non-equilibrium reaction-diffusion or birth-death processes. These give the relations between stochastic rate matrices, the matrix elements of the corresponding evolution operators representing them, and the integral kernels needed to implement these as path integrals. The 'free' theory (without branching) is solved, and the correct trilinear 'interaction' terms (representing branching events) are presented. The full model is developed in perturbation theory via the derivation of explicit Feynman rules which establish that the probabilities (pattern frequencies of leaf colourations) arising as matrix elements of the time evolution operator are identical with those computed via the standard analysis. Simple examples (phylogenetic trees with two or three leaves), are discussed in detail. Further implications for the work are briefly considered including the role of time reparametrization covariance
The phylogenetic distribution of extrafloral nectaries in plants.

Science.gov (United States)

Weber, Marjorie G; Keeler, Kathleen H

2013-06-01

Understanding the evolutionary patterns of ecologically relevant traits is a central goal in plant biology. However, for most important traits, we lack the comprehensive understanding of their taxonomic distribution needed to evaluate their evolutionary mode and tempo across the tree of life. Here we evaluate the broad phylogenetic patterns of a common plant-defence trait found across vascular plants: extrafloral nectaries (EFNs), plant glands that secrete nectar and are located outside the flower. EFNs typically defend plants indirectly by attracting invertebrate predators who reduce herbivory. Records of EFNs published over the last 135 years were compiled. After accounting for changes in taxonomy, phylogenetic comparative methods were used to evaluate patterns of EFN evolution, using a phylogeny of over 55 000 species of vascular plants. Using comparisons of parametric and non-parametric models, the true number of species with EFNs likely to exist beyond the current list was estimated. To date, EFNs have been reported in 3941 species representing 745 genera in 108 families, about 1-2 % of vascular plant species and approx. 21 % of families. They are found in 33 of 65 angiosperm orders. Foliar nectaries are known in four of 36 fern families. Extrafloral nectaries are unknown in early angiosperms, magnoliids and gymnosperms. They occur throughout monocotyledons, yet most EFNs are found within eudicots, with the bulk of species with EFNs being rosids. Phylogenetic analyses strongly support the repeated gain and loss of EFNs across plant clades, especially in more derived dicot families, and suggest that EFNs are found in a minimum of 457 independent lineages. However, model selection methods estimate that the number of unreported cases of EFNs may be as high as the number of species already reported. EFNs are widespread and evolutionarily labile traits that have repeatedly evolved a remarkable number of times in vascular plants. Our current understanding of the
[Phylogenetic analysis of closely related Leuconostoc citreum species based on partial housekeeping genes].

Science.gov (United States)

Lv, Qiang; Chen, Ming; Xu, Haiyan; Song, Yuqin; Sun, Zhihong; Dan, Tong; Sun, Tiansong

2013-07-04

Using the 16S rRNA, dnaA, murC and pyrG gene sequences, we identified the phylogenetic relationship among closely related Leuconostoc citreum species. Seven Leu. citreum strains originally isolated from sourdough were characterized by PCR methods to amplify the dnaA, murC and pyrG gene sequences, which were determined to assess the suitability as phylogenetic markers. Then, we estimated the genetic distance and constructed the phylogenetic trees including 16S rRNA and above mentioned three housekeeping genes combining with published corresponding sequences. By comparing the phylogenetic trees, the topology of three housekeeping genes trees were consistent with that of 16S rRNA gene. The homology of closely related Leu. citreum species among dnaA, murC, pyrG and 16S rRNA gene sequences were different, ranged from75.5% to 97.2%, 50.2% to 99.7%, 65.0% to 99.8% and 98.5% 100%, respectively. The phylogenetic relationship of three housekeeping genes sequences were highly consistent with the results of 16S rRNA gene sequence, while the genetic distance of these housekeeping genes were extremely high than 16S rRNA gene. Consequently, the dnaA, murC and pyrG gene are suitable for classification and identification closely related Leu. citreum species.
Mitochondrial DNA sequence-based phylogenetic relationship ...

Indian Academy of Sciences (India)

cophaga ranges from 0.037–0.106 and 0.049–0.207 for COI and ND5 genes, respectively (tables 2 and 3). Analysis of genetic distance on the basis of sequence difference for both the mitochondrial genes shows very little genetic difference. The discrepancy in the phylogenetic trees based on individ- ual genes may be due ...

Potentials and limitations of histone repeat sequences for phylogenetic reconstruction of Sophophora.

Science.gov (United States)

Baldo, A M; Les, D H; Strausbaugh, L D

1999-11-01

Simplified DNA sequence acquisition has provided many new data sets that are useful for phylogenetic reconstruction, including single- and multiple-copy nuclear and organellar genes. Although transcribed regions receive much attention, nontranscribed regions have recently been added to the repertoire of sequences suitable for phylogenetic studies, especially for closely related taxa. We evaluated the efficacy of a small portion of the histone repeat for phylogenetic reconstruction among Drosophila species. Histone repeats in invertebrates offer distinct advantages similar to those of widely used ribosomal repeats. First, the units are tandemly repeated and undergo concerted evolution. Second, histone repeats include both highly conserved coding and variable intergenic regions. This composition facilitates application of "universal" primers spanning potentially informative sites. We examined a small region of the histone repeat, including the intergenic spacer segments of coding regions from the divergently transcribed H2A and H2B histone genes. The spacer (about 230 bp) exists as a mosaic with highly conserved functional motifs interspersed with rapidly diverging regions; the former aid in alignment of the spacer. There are no ambiguities in alignment of coding regions. Coding and noncoding regions were analyzed together and separately for phylogenetic information. Parsimony, distance, and maximum-likelihood methods successfully retrieve the corroborated phylogeny for the taxa examined. This study demonstrates the resolving power of a small histone region which may now be added to the growing collection of phylogenetically useful DNA sequences.
The anatomy, affinity, and phylogenetic significance of Markuelia.

Science.gov (United States)

Dong, Xi-Ping; Donoghue, Philip C J; Cunningham, John A; Liu, Jian-Bo; Cheng, Hong

2005-01-01

The fossil record provides a paucity of data on the development of extinct organisms, particularly for their embryology. The recovery of fossilized embryos heralds new insight into the evolution of development but advances are limited by an almost complete absence of phylogenetic constraint. Markuelia is an exception to this, known from cleavage and pre-hatchling stages as a vermiform and profusely annulated direct-developing bilaterian with terminal circumoral and posterior radial arrays of spines. Phylogenetic analyses have hitherto suggested assignment to stem-Scalidophora (phyla Kinorhyncha, Loricifera, Priapulida). We test this assumption with additional data and through the inclusion of additional taxa. The available evidence supports stem-Scalidophora affinity, leading to the conclusion that scalidophorans, cyclonerualians, and ecdysozoans are primitive direct developers, and the likelihood that scalidophorans are primitively metameric.
Assembly and phylogenetic structure of Neotropical palm communities

DEFF Research Database (Denmark)

Eiserhardt, Wolf L.; Svenning, J.-C.; Balslev, Henrik

Diversity, composition and dynamics of Neotropical palm communities are receiving an increasing amount of attention due to their economic importance, but also because their high species richness and functional diversity render them valuable model systems for overall forest biodiversity. However......, to better understand these palm communities, it is crucial to gain insight into the mechanisms responsible for their assembly. These can be dispersal limitation, environmental filtering, or biotic interactions. If the degree of niche conservatism is known for a group of organisms, patterns of community...... an unspecific assumption of “general niche conservatism”, phylogenetic signal will be analysed for Neotropical palms. Moreover, as an example for evolutionary mechanisms disrupting phylogenetic signal, speciation modes will be examined in selected genera. With the combined results we aim to show the relative...
Incorporating phylogenetic information for the definition of floristic districts in hyper-diverse Amazon forests: implications for conservation

NARCIS (Netherlands)

Guevara, J.E.; Pitman, N.C.A.; ter Steege, H.; Mogollón, H.; Ceron, C.; Palacios, W.; Oleas, N.; Fine, P.V.A.

2017-01-01

Using complementary metrics to evaluate phylogenetic diversity can facilitate the delimitation of floristic units and conservation priority areas. In this study, we describe the spatial patterns of phylogenetic alpha and beta diversity, phylogenetic endemism, and evolutionary distinctiveness of the
Long-term nitrogen addition affects the phylogenetic turnover of soil microbial community responding to moisture pulse.

Science.gov (United States)

Liu, Chi; Yao, Minjie; Stegen, James C; Rui, Junpeng; Li, Jiabao; Li, Xiangzhen

2017-12-13

How press disturbance (long-term) influences the phylogenetic turnover of soil microbial communities responding to pulse disturbances (short-term) is not fully known. Understanding the complex connections between the history of environmental conditions, assembly processes and microbial community dynamics is necessary to predict microbial response to perturbation. We started by investigating phylogenetic spatial turnover (based on DNA) of soil prokaryotic communities after long-term nitrogen (N) deposition and temporal turnover (based on RNA) of communities responding to pulse by conducting short-term rewetting experiments. The results showed that moderate N addition increased ecological stochasticity and phylogenetic diversity. In contrast, high N addition slightly increased homogeneous selection and decreased phylogenetic diversity. Examining the system with higher phylogenetic resolution revealed a moderate contribution of variable selection across the whole N gradient. The moisture pulse experiment showed that high N soils had higher rates of phylogenetic turnover across short phylogenetic distances and significant changes in community compositions through time. Long-term N input history influenced spatial turnover of microbial communities, but the dominant community assembly mechanisms differed across different N deposition gradients. We further revealed an interaction between press and pulse disturbances whereby deterministic processes were particularly important following pulse disturbances in high N soils.
Phylogenetic Origins of Brain Organisers

Directory of Open Access Journals (Sweden)

Ellen Robertshaw

2012-01-01

Full Text Available The regionalisation of the nervous system begins early in embryogenesis, concomitant with the establishment of the anteroposterior (AP and dorsoventral (DV body axes. The molecular mechanisms that drive axis induction appear to be conserved throughout the animal kingdom and may be phylogenetically older than the emergence of bilateral symmetry. As a result of this process, groups of patterning genes that are equally well conserved are expressed at specific AP and DV coordinates of the embryo. In the emerging nervous system of vertebrate embryos, this initial pattern is refined by local signalling centres, secondary organisers, that regulate patterning, proliferation, and axonal pathfinding in adjacent neuroepithelium. The main secondary organisers for the AP neuraxis are the midbrain-hindbrain boundary, zona limitans intrathalamica, and anterior neural ridge and for the DV neuraxis the notochord, floor plate, and roof plate. A search for homologous secondary organisers in nonvertebrate lineages has led to controversy over their phylogenetic origins. Based on a recent study in hemichordates, it has been suggested that the AP secondary organisers evolved at the base of the deuterostome superphylum, earlier than previously thought. According to this view, the lack of signalling centres in some deuterostome lineages is likely to reflect a secondary loss due to adaptive processes. We propose that the relative evolutionary flexibility of secondary organisers has contributed to a broader morphological complexity of nervous systems in different clades.
Data for constructing insect genome content matrices for phylogenetic analysis and functional annotation

Directory of Open Access Journals (Sweden)

Jeffrey Rosenfeld

2016-03-01

Full Text Available Twenty one fully sequenced and well annotated insect genomes were used to construct genome content matrices for phylogenetic analysis and functional annotation of insect genomes. To examine the role of e-value cutoff in ortholog determination we used scaled e-value cutoffs and a single linkage clustering approach.. The present communication includes (1 a list of the genomes used to construct the genome content phylogenetic matrices, (2 a nexus file with the data matrices used in phylogenetic analysis, (3 a nexus file with the Newick trees generated by phylogenetic analysis, (4 an excel file listing the Core (CORE genes and Unique (UNI genes found in five insect groups, and (5 a figure showing a plot of consistency index (CI versus percent of unannotated genes that are apomorphies in the data set for gene losses and gains and bar plots of gains and losses for four consistency index (CI cutoffs.
Phylogenetic affinity of tree shrews to Glires is attributed to fast evolution rate.

Science.gov (United States)

Lin, Jiannan; Chen, Guangfeng; Gu, Liang; Shen, Yuefeng; Zheng, Meizhu; Zheng, Weisheng; Hu, Xinjie; Zhang, Xiaobai; Qiu, Yu; Liu, Xiaoqing; Jiang, Cizhong

2014-02-01

Previous phylogenetic analyses have led to incongruent evolutionary relationships between tree shrews and other suborders of Euarchontoglires. What caused the incongruence remains elusive. In this study, we identified 6845 orthologous genes between seventeen placental mammals. Tree shrews and Primates were monophyletic in the phylogenetic trees derived from the first or/and second codon positions whereas tree shrews and Glires formed a monophyly in the trees derived from the third or all codon positions. The same topology was obtained in the phylogeny inference using the slowly and fast evolving genes, respectively. This incongruence was likely attributed to the fast substitution rate in tree shrews and Glires. Notably, sequence GC content only was not informative to resolve the controversial phylogenetic relationships between tree shrews, Glires, and Primates. Finally, estimation in the confidence of the tree selection strongly supported the phylogenetic affiliation of tree shrews to Primates as a monophyly. Copyright © 2013 Elsevier Inc. All rights reserved.
Evolutionary history determines how plant productivity responds to phylogenetic diversity and species richness

Directory of Open Access Journals (Sweden)

Mark A. Genung

2014-03-01

Full Text Available The relationship between biodiversity and ecosystem function has received a great deal of attention in ecological research and recent results, from re-analyses, suggest that ecosystem function improves with increases in phylogenetic diversity. However, many of these results have been generalized across a range of different species and clades, and plants with different evolutionary histories could display different relationships between biodiversity and ecosystem function. To experimentally test this hypothesis, we manipulated species richness and phylogenetic diversity using 26 species from two subgenera of the genus Eucalyptus (subgenus Eucalyptus and subgenus Symphyomyrtus. We found that plant biomass (a measurement of ecosystem function sometimes, but not always, responded to increases in species richness and phylogenetic diversity. Specifically, Symphyomyrtus plants showed a positive response while no comparable effect was observed for Eucalyptus plants, showing that responses to biodiversity can vary across different phylogenetic groups. Our results show that the impacts of evolutionary history may complicate the relationship between the diversity of plant communities and plant biomass.
Increased competition does not lead to increased phylogenetic overdispersion in a native grassland.

Science.gov (United States)

Bennett, Jonathan A; Lamb, Eric G; Hall, Jocelyn C; Cardinal-McTeague, Warren M; Cahill, James F

2013-09-01

That competition is stronger among closely related species and leads to phylogenetic overdispersion is a common assumption in community ecology. However, tests of this assumption are rare and field-based experiments lacking. We tested the relationship between competition, the degree of relatedness, and overdispersion among plants experimentally and using a field survey in a native grassland. Relatedness did not affect competition, nor was competition associated with phylogenetic overdispersion. Further, there was only weak evidence for increased overdispersion at spatial scales where plants are likely to compete. These results challenge traditional theory, but are consistent with recent theories regarding the mechanisms of plant competition and its potential effect on phylogenetic structure. We suggest that specific conditions related to the form of competition and trait conservatism must be met for competition to cause phylogenetic overdispersion. Consequently, overdispersion as a result of competition is likely to be rare in natural communities. © 2013 John Wiley & Sons Ltd/CNRS.
TCS: a new multiple sequence alignment reliability measure to estimate alignment accuracy and improve phylogenetic tree reconstruction.

Science.gov (United States)

Chang, Jia-Ming; Di Tommaso, Paolo; Notredame, Cedric

2014-06-01

Multiple sequence alignment (MSA) is a key modeling procedure when analyzing biological sequences. Homology and evolutionary modeling are the most common applications of MSAs. Both are known to be sensitive to the underlying MSA accuracy. In this work, we show how this problem can be partly overcome using the transitive consistency score (TCS), an extended version of the T-Coffee scoring scheme. Using this local evaluation function, we show that one can identify the most reliable portions of an MSA, as judged from BAliBASE and PREFAB structure-based reference alignments. We also show how this measure can be used to improve phylogenetic tree reconstruction using both an established simulated data set and a novel empirical yeast data set. For this purpose, we describe a novel lossless alternative to site filtering that involves overweighting the trustworthy columns. Our approach relies on the T-Coffee framework; it uses libraries of pairwise alignments to evaluate any third party MSA. Pairwise projections can be produced using fast or slow methods, thus allowing a trade-off between speed and accuracy. We compared TCS with Heads-or-Tails, GUIDANCE, Gblocks, and trimAl and found it to lead to significantly better estimates of structural accuracy and more accurate phylogenetic trees. The software is available from www.tcoffee.org/Projects/tcs. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Phylogenetic resolution and habitat specificity of members of the Photobacterium phosphoreum species group.

Science.gov (United States)

Ast, Jennifer C; Dunlap, Paul V

2005-10-01

Substantial ambiguity exists regarding the phylogenetic status of facultatively psychrophilic luminous bacteria identified as Photobacterium phosphoreum, a species thought to be widely distributed in the world's oceans and believed to be the specific bioluminescent light-organ symbiont of several deep-sea fishes. Members of the P. phosphoreum species group include luminous and non-luminous strains identified phenotypically from a variety of different habitats as well as phylogenetically defined lineages that appear to be evolutionarily distinct. To resolve this ambiguity and to begin developing a meaningful knowledge of the geographic distributions, habitats and symbiotic relationships of bacteria in the P. phosphoreum species group, we carried out a multilocus, fine-scale phylogenetic analysis based on sequences of the 16S rRNA, gyrB and luxABFE genes of many newly isolated luminous strains from symbiotic and saprophytic habitats, together with previously isolated luminous and non-luminous strains identified as P. phosphoreum from these and other habitats. Parsimony analysis unambiguously resolved three evolutionarily distinct clades, phosphoreum, iliopiscarium and kishitanii. The tight phylogenetic clustering within these clades and the distinct separation between them indicates they are different species, P. phosphoreum, Photobacterium iliopiscarium and the newly recognized 'Photobacterium kishitanii'. Previously reported non-luminous strains, which had been identified phenotypically as P. phosphoreum, resolved unambiguously as P. iliopiscarium, and all examined deep-sea fishes (specimens of families Chlorophthalmidae, Macrouridae, Moridae, Trachichthyidae and Acropomatidae) were found to harbour 'P. kishitanii', not P. phosphoreum, in their light organs. This resolution revealed also that 'P. kishitanii' is cosmopolitan in its geographic distribution. Furthermore, the lack of phylogenetic variation within 'P. kishitanii' indicates that this facultatively
Beyond barcoding: a mitochondrial genomics approach to molecular phylogenetics and diagnostics of blowflies (Diptera: Calliphoridae).

Science.gov (United States)

Nelson, Leigh A; Lambkin, Christine L; Batterham, Philip; Wallman, James F; Dowton, Mark; Whiting, Michael F; Yeates, David K; Cameron, Stephen L

2012-12-15

Members of the Calliphoridae (blowflies) are significant for medical and veterinary management, due to the ability of some species to consume living flesh as larvae, and for forensic investigations due to the ability of others to develop in corpses. Due to the difficulty of accurately identifying larval blowflies to species there is a need for DNA-based diagnostics for this family, however the widely used DNA-barcoding marker, cox1, has been shown to fail for several groups within this family. Additionally, many phylogenetic relationships within the Calliphoridae are still unresolved, particularly deeper level relationships. Sequencing whole mt genomes has been demonstrated both as an effective method for identifying the most informative diagnostic markers and for resolving phylogenetic relationships. Twenty-seven complete, or nearly so, mt genomes were sequenced representing 13 species, seven genera and four calliphorid subfamilies and a member of the related family Tachinidae. PCR and sequencing primers developed for sequencing one calliphorid species could be reused to sequence related species within the same superfamily with success rates ranging from 61% to 100%, demonstrating the speed and efficiency with which an mt genome dataset can be assembled. Comparison of molecular divergences for each of the 13 protein-coding genes and 2 ribosomal RNA genes, at a range of taxonomic scales identified novel targets for developing as diagnostic markers which were 117-200% more variable than the markers which have been used previously in calliphorids. Phylogenetic analysis of whole mt genome sequences resulted in much stronger support for family and subfamily-level relationships. The Calliphoridae are polyphyletic, with the Polleninae more closely related to the Tachinidae, and the Sarcophagidae are the sister group of the remaining calliphorids. Within the Calliphoridae, there was strong support for the monophyly of the Chrysomyinae and Luciliinae and for the sister
On the use of cartographic projections in visualizing phylo-genetic tree space

Directory of Open Access Journals (Sweden)

Clement Mark

2010-06-01

Full Text Available Abstract Phylogenetic analysis is becoming an increasingly important tool for biological research. Applications include epidemiological studies, drug development, and evolutionary analysis. Phylogenetic search is a known NP-Hard problem. The size of the data sets which can be analyzed is limited by the exponential growth in the number of trees that must be considered as the problem size increases. A better understanding of the problem space could lead to better methods, which in turn could lead to the feasible analysis of more data sets. We present a definition of phylogenetic tree space and a visualization of this space that shows significant exploitable structure. This structure can be used to develop search methods capable of handling much larger data sets.
An attempt to reconstruct phylogenetic relationships within Caribbean nummulitids: simulating relationships and tracing character evolution

Science.gov (United States)

Eder, Wolfgang; Ives Torres-Silva, Ana; Hohenegger, Johann

2017-04-01

Phylogenetic analysis and trees based on molecular data are broadly applied and used to infer genetical and biogeographic relationship in recent larger foraminifera. Molecular phylogenetic is intensively used within recent nummulitids, however for fossil representatives these trees are only of minor informational value. Hence, within paleontological studies a phylogenetic approach through morphometric analysis is of much higher value. To tackle phylogenetic relationships within the nummulitid family, a much higher number of morphological character must be measured than are commonly used in biometric studies, where mostly parameters describing embryonic size (e.g., proloculus diameter, deuteroloculus diameter) and/or the marginal spiral (e.g., spiral diagrams, spiral indices) are studied. For this purpose 11 growth-independent and/or growth-invariant characters have been used to describe the morphological variability of equatorial thin sections of seven Carribbean nummulitid taxa (Nummulites striatoreticulatus, N. macgillavry, Palaeonummulites willcoxi, P.floridensis, P. soldadensis, P.trinitatensis and P.ocalanus) and one outgroup taxon (Ranikothalia bermudezi). Using these characters, phylogenetic trees were calculated using a restricted maximum likelihood algorithm (REML), and results are cross-checked by ordination and cluster analysis. Square-change parsimony method has been run to reconstruct ancestral states, as well as to simulate the evolution of the chosen characters along the calculated phylogenetic tree and, independent - contrast analysis was used to estimate confidence intervals. Based on these simulations, phylogenetic tendencies of certain characters proposed for nummulitids (e.g., Cope's rule or nepionic acceleration) can be tested, whether these tendencies are valid for the whole family or only for certain clades. At least, within the Carribean nummulitids, phylogenetic trends along some growth-independent characters of the embryo (e.g., first
Using tree diversity to compare phylogenetic heuristics.

Science.gov (United States)

Sul, Seung-Jin; Matthews, Suzanne; Williams, Tiffani L

2009-04-29

Evolutionary trees are family trees that represent the relationships between a group of organisms. Phylogenetic heuristics are used to search stochastically for the best-scoring trees in tree space. Given that better tree scores are believed to be better approximations of the true phylogeny, traditional evaluation techniques have used tree scores to determine the heuristics that find the best scores in the fastest time. We develop new techniques to evaluate phylogenetic heuristics based on both tree scores and topologies to compare Pauprat and Rec-I-DCM3, two popular Maximum Parsimony search algorithms. Our results show that although Pauprat and Rec-I-DCM3 find the trees with the same best scores, topologically these trees are quite different. Furthermore, the Rec-I-DCM3 trees cluster distinctly from the Pauprat trees. In addition to our heatmap visualizations of using parsimony scores and the Robinson-Foulds distance to compare best-scoring trees found by the two heuristics, we also develop entropy-based methods to show the diversity of the trees found. Overall, Pauprat identifies more diverse trees than Rec-I-DCM3. Overall, our work shows that there is value to comparing heuristics beyond the parsimony scores that they find. Pauprat is a slower heuristic than Rec-I-DCM3. However, our work shows that there is tremendous value in using Pauprat to reconstruct trees-especially since it finds identical scoring but topologically distinct trees. Hence, instead of discounting Pauprat, effort should go in improving its implementation. Ultimately, improved performance measures lead to better phylogenetic heuristics and will result in better approximations of the true evolutionary history of the organisms of interest.
Evaluation of properties over phylogenetic trees using stochastic logics.

Science.gov (United States)

Requeno, José Ignacio; Colom, José Manuel

2016-06-14

Model checking has been recently introduced as an integrated framework for extracting information of the phylogenetic trees using temporal logics as a querying language, an extension of modal logics that imposes restrictions of a boolean formula along a path of events. The phylogenetic tree is considered a transition system modeling the evolution as a sequence of genomic mutations (we understand mutation as different ways that DNA can be changed), while this kind of logics are suitable for traversing it in a strict and exhaustive way. Given a biological property that we desire to inspect over the phylogeny, the verifier returns true if the specification is satisfied or a counterexample that falsifies it. However, this approach has been only considered over qualitative aspects of the phylogeny. In this paper, we repair the limitations of the previous framework for including and handling quantitative information such as explicit time or probability. To this end, we apply current probabilistic continuous-time extensions of model checking to phylogenetics. We reinterpret a catalog of qualitative properties in a numerical way, and we also present new properties that couldn't be analyzed before. For instance, we obtain the likelihood of a tree topology according to a mutation model. As case of study, we analyze several phylogenies in order to obtain the maximum likelihood with the model checking tool PRISM. In addition, we have adapted the software for optimizing the computation of maximum likelihoods. We have shown that probabilistic model checking is a competitive framework for describing and analyzing quantitative properties over phylogenetic trees. This formalism adds soundness and readability to the definition of models and specifications. Besides, the existence of model checking tools hides the underlying technology, omitting the extension, upgrade, debugging and maintenance of a software tool to the biologists. A set of benchmarks justify the feasibility of our
galaxieEST: addressing EST identity through automated phylogenetic analysis.

Science.gov (United States)

Nilsson, R Henrik; Rajashekar, Balaji; Larsson, Karl-Henrik; Ursing, Björn M

2004-07-05

Research involving expressed sequence tags (ESTs) is intricately coupled to the existence of large, well-annotated sequence repositories. Comparatively complete and satisfactory annotated public sequence libraries are, however, available only for a limited range of organisms, rendering the absence of sequences and gene structure information a tangible problem for those working with taxa lacking an EST or genome sequencing project. Paralogous genes belonging to the same gene family but distinguished by derived characteristics are particularly prone to misidentification and erroneous annotation; high but incomplete levels of sequence similarity are typically difficult to interpret and have formed the basis of many unsubstantiated assumptions of orthology. In these cases, a phylogenetic study of the query sequence together with the most similar sequences in the database may be of great value to the identification process. In order to facilitate this laborious procedure, a project to employ automated phylogenetic analysis in the identification of ESTs was initiated. galaxieEST is an open source Perl-CGI script package designed to complement traditional similarity-based identification of EST sequences through employment of automated phylogenetic analysis. It uses a series of BLAST runs as a sieve to retrieve nucleotide and protein sequences for inclusion in neighbour joining and parsimony analyses; the output includes the BLAST output, the results of the phylogenetic analyses, and the corresponding multiple alignments. galaxieEST is available as an on-line web service for identification of fungal ESTs and for download / local installation for use with any organism group at http://galaxie.cgb.ki.se/galaxieEST.html. By addressing sequence relatedness in addition to similarity, galaxieEST provides an integrative view on EST origin and identity, which may prove particularly useful in cases where similarity searches return one or more pertinent, but not full, matches and
BEAGLE: an application programming interface and high-performance computing library for statistical phylogenetics.

Science.gov (United States)

Ayres, Daniel L; Darling, Aaron; Zwickl, Derrick J; Beerli, Peter; Holder, Mark T; Lewis, Paul O; Huelsenbeck, John P; Ronquist, Fredrik; Swofford, David L; Cummings, Michael P; Rambaut, Andrew; Suchard, Marc A

2012-01-01

Phylogenetic inference is fundamental to our understanding of most aspects of the origin and evolution of life, and in recent years, there has been a concentration of interest in statistical approaches such as Bayesian inference and maximum likelihood estimation. Yet, for large data sets and realistic or interesting models of evolution, these approaches remain computationally demanding. High-throughput sequencing can yield data for thousands of taxa, but scaling to such problems using serial computing often necessitates the use of nonstatistical or approximate approaches. The recent emergence of graphics processing units (GPUs) provides an opportunity to leverage their excellent floating-point computational performance to accelerate statistical phylogenetic inference. A specialized library for phylogenetic calculation would allow existing software packages to make more effective use of available computer hardware, including GPUs. Adoption of a common library would also make it easier for other emerging computing architectures, such as field programmable gate arrays, to be used in the future. We present BEAGLE, an application programming interface (API) and library for high-performance statistical phylogenetic inference. The API provides a uniform interface for performing phylogenetic likelihood calculations on a variety of compute hardware platforms. The library includes a set of efficient implementations and can currently exploit hardware including GPUs using NVIDIA CUDA, central processing units (CPUs) with Streaming SIMD Extensions and related processor supplementary instruction sets, and multicore CPUs via OpenMP. To demonstrate the advantages of a common API, we have incorporated the library into several popular phylogenetic software packages. The BEAGLE library is free open source software licensed under the Lesser GPL and available from http://beagle-lib.googlecode.com. An example client program is available as public domain software.
Bayesian nonparametric clustering in phylogenetics: modeling antigenic evolution in influenza.

Science.gov (United States)

Cybis, Gabriela B; Sinsheimer, Janet S; Bedford, Trevor; Rambaut, Andrew; Lemey, Philippe; Suchard, Marc A

2018-01-30

Influenza is responsible for up to 500,000 deaths every year, and antigenic variability represents much of its epidemiological burden. To visualize antigenic differences across many viral strains, antigenic cartography methods use multidimensional scaling on binding assay data to map influenza antigenicity onto a low-dimensional space. Analysis of such assay data ideally leads to natural clustering of influenza strains of similar antigenicity that correlate with sequence evolution. To understand the dynamics of these antigenic groups, we present a framework that jointly models genetic and antigenic evolution by combining multidimensional scaling of binding assay data, Bayesian phylogenetic machinery and nonparametric clustering methods. We propose a phylogenetic Chinese restaurant process that extends the current process to incorporate the phylogenetic dependency structure between strains in the modeling of antigenic clusters. With this method, we are able to use the genetic information to better understand the evolution of antigenicity throughout epidemics, as shown in applications of this model to H1N1 influenza. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.