WorldWideScience

Sample records for haplotype-specific genomic diversity

  1. Evolutionary analysis of two classical MHC class I loci of the medaka fish, Oryzias latipes: haplotype-specific genomic diversity, locus-specific polymorphisms, and interlocus homogenization.

    Science.gov (United States)

    Nonaka, Mayumi I; Nonaka, Masaru

    2010-05-01

    The major histocompatibility complex (MHC) region of the teleost medaka (Oryzias latipes) contains two classical class I loci, UAA and UBA, whereas most lower vertebrates possess or express a single locus. To elucidate the allelic diversification and evolutionary relationships of these loci, we compared the BAC-based complete genomic sequences of the MHC class I region of three medaka strains and the PCR-based cDNA sequences of two more strains and two wild individuals, representing nine haplotypes. These were derived from two geographically distinct medaka populations isolated for four to five million years. Comparison of the genomic sequences showed a marked diversity in the region encompassing UAA and UBA even between the strains derived from the same population, and also showed an ancient divergence of these loci. cDNA analysis indicated that the peptide-binding domains of both UAA and UBA are highly polymorphic and that most of the polymorphisms were established in a locus-specific manner before the divergence of the two populations. Interallelic recombination between exons 2 and 3 encoding these domains was observed. The second intron of the UAA genes contains a highly conserved region with a palindromic sequence, suggesting that this region contributed to the recombination events. In contrast, the alpha3 domain is extremely homogenized not only within each locus but also between UAA and UBA regardless of populations. Two lineages of the transmembrane and cytoplasmic regions are also shared by UAA and UBA, suggesting that these two loci evolved with intimate genetic interaction through gene conversion or unequal crossing over.

  2. Human Genome Diversity workshop 1

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1992-12-31

    The Human Genome Diversity Project (HGD) is an international interdisciplinary program whose goal is to reveal as much as possible about the current state of genetic diversity among humans and the processes that were responsible for that diversity. Classical premolecular techniques have already proved that a significant component of human genetic variability lies within populations rather than among them. New molecular techniques will permit a dramatic increase in the resolving power of genetic analysis at the population level. Recent social changes in many parts of the world threaten the identity of a number of populations that may be extremely important for understanding human evolutionary history. It is therefore urgent to conduct research on human variation in these areas, while there is still time. The plan is to identify the most representative descendants of ancestral human populations worldwide and then to preserve genetic records of these populations. This is a report of the Population Genetics Workshop (Workshop 1), the first of three to be held to plan HGD, which was focused on sampling strategies and analytic methods from population genetics. The topics discussed were sampling and population structure; analysis of populations; drift versus natural selection; modeling migration and population subdivision; and population structure and subdivision.

  3. The Human Genome Diversity Project

    Energy Technology Data Exchange (ETDEWEB)

    Cavalli-Sforza, L. [Stanford Univ., CA (United States)

    1994-12-31

    The Human Genome Diversity Project (HGD Project) is an international anthropology project that seeks to study the genetic richness of the entire human species. This kind of genetic information can add a unique thread to the tapestry knowledge of humanity. Culture, environment, history, and other factors are often more important, but humanity`s genetic heritage, when analyzed with recent technology, brings another type of evidence for understanding species` past and present. The Project will deepen the understanding of this genetic richness and show both humanity`s diversity and its deep and underlying unity. The HGD Project is still largely in its planning stages, seeking the best ways to reach its goals. The continuing discussions of the Project, throughout the world, should improve the plans for the Project and their implementation. The Project is as global as humanity itself; its implementation will require the kinds of partnerships among different nations and cultures that make the involvement of UNESCO and other international organizations particularly appropriate. The author will briefly discuss the Project`s history, describe the Project, set out the core principles of the Project, and demonstrate how the Project will help combat the scourge of racism.

  4. Genomic Diversity in the Genus of Aspergillus

    DEFF Research Database (Denmark)

    Rasmussen, Jane Lind Nybo

    , and scientific model organisms. The phenotypic diversity in this genus is extraordinary and identifying the genetic basis for this diversity has great potential for academia and industry. When the genomic era began for Aspergillus in 2005 with the genome sequences of A. nidulans, A. oryzae and A. fumigatus...

  5. Genomic management of animal genetic diversity

    NARCIS (Netherlands)

    Oldenbroek, Kor

    2017-01-01

    Recently developed genomic tools, like SNP-genotyping and whole genome sequencing, and their analysis, offer great opportunities for the conservation and utilisation of animal genetic diversity, both among and within breeds. These genomic tools can be used to detect potentially valuable rare alleles

  6. Comparative Genomics Reveals High Genomic Diversity in the Genus Photobacterium

    DEFF Research Database (Denmark)

    Machado, Henrique; Gram, Lone

    2017-01-01

    Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand...... the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationships using several analyses (16S rRNA, MLSA, fur, amino-acid usage, ANI), which allowed us to identify two...... misidentified strains. Genome analyses also revealed occurrence of higher and lower GC content clades, correlating with phylogenetic clusters. Pan-and core-genome analysis revealed the conservation of 25% of the genome throughout the genus, with a large and open pan-genome. The major source of genomic diversity...

  7. Ethical aspects of genome diversity research: genome research into cultural diversity or cultural diversity in genome research?

    Science.gov (United States)

    Ilkilic, Ilhan; Paul, Norbert W

    2009-03-01

    The goal of the Human Genome Diversity Project (HGDP) was to reconstruct the history of human evolution and the historical and geographical distribution of populations with the help of scientific research. Through this kind of research, the entire spectrum of genetic diversity to be found in the human species was to be explored with the hope of generating a better understanding of the history of humankind. An important part of this genome diversity research consists in taking blood and tissue samples from indigenous populations. For various reasons, it has not been possible to execute this project in the planned scope and form to date. Nevertheless, genomic diversity research addresses complex issues which prove to be highly relevant from the perspective of research ethics, transcultural medical ethics, and cultural philosophy. In the article at hand, we discuss these ethical issues as illustrated by the HGDP. This investigation focuses on the confrontation of culturally diverse images of humans and their cosmologies within the framework of genome diversity research and the ethical questions it raises. We argue that in addition to complex questions pertaining to research ethics such as informed consent and autonomy of probands, genome diversity research also has a cultural-philosophical, meta-ethical, and phenomenological dimension which must be taken into account in ethical discourses. Acknowledging this fact, we attempt to show the limits of current guidelines used in international genome diversity studies, following this up by a formulation of theses designed to facilitate an appropriate inquiry and ethical evaluation of intercultural dimensions of genome research.

  8. Genomic diversity and evolution of the lyssaviruses.

    Directory of Open Access Journals (Sweden)

    Olivier Delmas

    2008-04-01

    Full Text Available Lyssaviruses are RNA viruses with single-strand, negative-sense genomes responsible for rabies-like diseases in mammals. To date, genomic and evolutionary studies have most often utilized partial genome sequences, particularly of the nucleoprotein and glycoprotein genes, with little consideration of genome-scale evolution. Herein, we report the first genomic and evolutionary analysis using complete genome sequences of all recognised lyssavirus genotypes, including 14 new complete genomes of field isolates from 6 genotypes and one genotype that is completely sequenced for the first time. In doing so we significantly increase the extent of genome sequence data available for these important viruses. Our analysis of these genome sequence data reveals that all lyssaviruses have the same genomic organization. A phylogenetic analysis reveals strong geographical structuring, with the greatest genetic diversity in Africa, and an independent origin for the two known genotypes that infect European bats. We also suggest that multiple genotypes may exist within the diversity of viruses currently classified as 'Lagos Bat'. In sum, we show that rigorous phylogenetic techniques based on full length genome sequence provide the best discriminatory power for genotype classification within the lyssaviruses.

  9. India, Genomic diversity & Disease susceptibility

    Indian Academy of Sciences (India)

    Involved in earlier stages of Immune response protecting us from Diseases, Responsible for kidney and other transplant rejections Inherited from our parents · PowerPoint Presentation · Slide 5 · Slide 6 · Slide 7 · Slide 8 · Slide 9 · Slide 10 · Slide 11 · Slide 12 · Leprosy predisposing genome identified in southern India, was ...

  10. Consequences of genomic diversity in Mycobacterium tuberculosis

    Science.gov (United States)

    Coscolla, Mireia; Gagneux, Sebastien

    2014-01-01

    The causative agent of human tuberculosis, Mycobacterium tuberculosis complex (MTBC), comprises seven phylogenetically distinct lineages associated with different geographical regions. Here we review the latest findings on the nature and amount of genomic diversity within and between MTBC lineages. We then review recent evidence for the effect of this genomic diversity on mycobacterial phenotypes measured experimentally and in clinical settings. We conclude that overall, the most geographically widespread Lineage 2 (includes Beijing) and Lineage 4 (also known as Euro-American) are more virulent than other lineages that are more geographically restricted. This increased virulence is associated with delayed or reduced pro-inflammatory host immune responses, greater severity of disease, and enhanced transmission. Future work should focus on the interaction between MTBC and human genetic diversity, as well as on the environmental factors that modulate these interactions. PMID:25453224

  11. OryzaGenome: Genome Diversity Database of Wild Oryza Species

    KAUST Repository

    Ohyanagi, Hajime

    2015-11-18

    The species in the genus Oryza, encompassing nine genome types and 23 species, are a rich genetic resource and may have applications in deeper genomic analyses aiming to understand the evolution of plant genomes. With the advancement of next-generation sequencing (NGS) technology, a flood of Oryza species reference genomes and genomic variation information has become available in recent years. This genomic information, combined with the comprehensive phenotypic information that we are accumulating in our Oryzabase, can serve as an excellent genotype-phenotype association resource for analyzing rice functional and structural evolution, and the associated diversity of the Oryza genus. Here we integrate our previous and future phenotypic/habitat information and newly determined genotype information into a united repository, named OryzaGenome, providing the variant information with hyperlinks to Oryzabase. The current version of OryzaGenome includes genotype information of 446 O. rufipogon accessions derived by imputation and of 17 accessions derived by imputation-free deep sequencing. Two variant viewers are implemented: SNP Viewer as a conventional genome browser interface and Variant Table as a textbased browser for precise inspection of each variant one by one. Portable VCF (variant call format) file or tabdelimited file download is also available. Following these SNP (single nucleotide polymorphism) data, reference pseudomolecules/ scaffolds/contigs and genome-wide variation information for almost all of the closely and distantly related wild Oryza species from the NIG Wild Rice Collection will be available in future releases. All of the resources can be accessed through http://viewer.shigen.info/oryzagenome/.

  12. Genomic diversity within the haloalkaliphilic genus Thioalkalivibrio.

    Directory of Open Access Journals (Sweden)

    Anne-Catherine Ahn

    Full Text Available Thioalkalivibrio is a genus of obligate chemolithoautotrophic haloalkaliphilic sulfur-oxidizing bacteria. Their habitat are soda lakes which are dual extreme environments with a pH range from 9.5 to 11 and salt concentrations up to saturation. More than 100 strains of this genus have been isolated from various soda lakes all over the world, but only ten species have been effectively described yet. Therefore, the assignment of the remaining strains to either existing or novel species is important and will further elucidate their genomic diversity as well as give a better general understanding of this genus. Recently, the genomes of 76 Thioalkalivibrio strains were sequenced. On these, we applied different methods including (i 16S rRNA gene sequence analysis, (ii Multilocus Sequence Analysis (MLSA based on eight housekeeping genes, (iii Average Nucleotide Identity based on BLAST (ANIb and MUMmer (ANIm, (iv Tetranucleotide frequency correlation coefficients (TETRA, (v digital DNA:DNA hybridization (dDDH as well as (vi nucleotide- and amino acid-based Genome BLAST Distance Phylogeny (GBDP analyses. We detected a high genomic diversity by revealing 15 new "genomic" species and 16 new "genomic" subspecies in addition to the ten already described species. Phylogenetic and phylogenomic analyses showed that the genus is not monophyletic, because four strains were clearly separated from the other Thioalkalivibrio by type strains from other genera. Therefore, it is recommended to classify the latter group as a novel genus. The biogeographic distribution of Thioalkalivibrio suggested that the different "genomic" species can be classified as candidate disjunct or candidate endemic species. This study is a detailed genome-based classification and identification of members within the genus Thioalkalivibrio. However, future phenotypical and chemotaxonomical studies will be needed for a full species description of this genus.

  13. HLA diversity in the 1000 genomes dataset.

    Directory of Open Access Journals (Sweden)

    Pierre-Antoine Gourraud

    Full Text Available The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation by sequencing at a level that should allow the genome-wide detection of most variants with frequencies as low as 1%. However, in the major histocompatibility complex (MHC, only the top 10 most frequent haplotypes are in the 1% frequency range whereas thousands of haplotypes are present at lower frequencies. Given the limitation of both the coverage and the read length of the sequences generated by the 1000 Genomes Project, the highly variable positions that define HLA alleles may be difficult to identify. We used classical Sanger sequencing techniques to type the HLA-A, HLA-B, HLA-C, HLA-DRB1 and HLA-DQB1 genes in the available 1000 Genomes samples and combined the results with the 103,310 variants in the MHC region genotyped by the 1000 Genomes Project. Using pairwise identity-by-descent distances between individuals and principal component analysis, we established the relationship between ancestry and genetic diversity in the MHC region. As expected, both the MHC variants and the HLA phenotype can identify the major ancestry lineage, informed mainly by the most frequent HLA haplotypes. To some extent, regions of the genome with similar genetic or similar recombination rate have similar properties. An MHC-centric analysis underlines departures between the ancestral background of the MHC and the genome-wide picture. Our analysis of linkage disequilibrium (LD decay in these samples suggests that overestimation of pairwise LD occurs due to a limited sampling of the MHC diversity. This collection of HLA-specific MHC variants, available on the dbMHC portal, is a valuable resource for future analyses of the role of MHC in population and disease studies.

  14. Separation of Y-chromosomal haplotypes from male DNA mixtures via multiplex haplotype-specific extraction.

    Science.gov (United States)

    Rothe, Jessica; Nagy, Marion

    2015-11-01

    In forensic analysis, the interpretation of DNA mixtures is the subject of ongoing debate and requires expertise knowledge. Haplotype-specific extraction (HSE) is an alternative method that enables the separation of large chromosome fragments or haplotypes by using magnetic beads in conjunction with allele-specific probes. HSE thus allows physical separation of the components of a DNA mixture. Here, we present the first multiplex HSE separation of a Y-chromosomal haplotype consisting of six Yfiler short tandem repeat markers from a mixture of male DNA. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  15. Genomic diversity of Escherichia isolates from diverse habitats.

    Directory of Open Access Journals (Sweden)

    Seungdae Oh

    Full Text Available Our understanding of the Escherichia genus is heavily biased toward pathogenic or commensal isolates from human or animal hosts. Recent studies have recovered Escherichia isolates that persist, and even grow, outside these hosts. Although the environmental isolates are typically phylogenetically distinct, they are highly related to and phenotypically indistinguishable from their human counterparts, including for the coliform test. To gain insights into the genomic diversity of Escherichia isolates from diverse habitats, including freshwater, soil, animal, and human sources, we carried out comparative DNA-DNA hybridizations using a multi-genome E. coli DNA microarray. The microarray was validated based on hybridizations with selected strains whose genome sequences were available and used to assess the frequency of microarray false positive and negative signals. Our results showed that human fecal isolates share two sets of genes (n>90 that are rarely found among environmental isolates, including genes presumably important for evading host immune mechanisms (e.g., a multi-drug transporter for acids and antimicrobials and adhering to epithelial cells (e.g., hemolysin E and fimbrial-like adhesin protein. These results imply that environmental isolates are characterized by decreased ability to colonize host cells relative to human isolates. Our study also provides gene markers that can distinguish human isolates from those of warm-blooded animal and environmental origins, and thus can be used to more reliably assess fecal contamination in natural ecosystems.

  16. PRDM9 drives evolutionary erosion of hotspots in Mus musculus through haplotype-specific initiation of meiotic recombination.

    Directory of Open Access Journals (Sweden)

    Christopher L Baker

    2015-01-01

    Full Text Available Meiotic recombination generates new genetic variation and assures the proper segregation of chromosomes in gametes. PRDM9, a zinc finger protein with histone methyltransferase activity, initiates meiotic recombination by binding DNA at recombination hotspots and directing the position of DNA double-strand breaks (DSB. The DSB repair mechanism suggests that hotspots should eventually self-destruct, yet genome-wide recombination levels remain constant, a conundrum known as the hotspot paradox. To test if PRDM9 drives this evolutionary erosion, we measured activity of the Prdm9Cst allele in two Mus musculus subspecies, M.m. castaneus, in which Prdm9Cst arose, and M.m. domesticus, into which Prdm9Cst was introduced experimentally. Comparing these two strains, we find that haplotype differences at hotspots lead to qualitative and quantitative changes in PRDM9 binding and activity. Using Mus spretus as an outlier, we found most variants affecting PRDM9Cst binding arose and were fixed in M.m. castaneus, suppressing hotspot activity. Furthermore, M.m. castaneus×M.m. domesticus F1 hybrids exhibit novel hotspots, with large haplotype biases in both PRDM9 binding and chromatin modification. These novel hotspots represent sites of historic evolutionary erosion that become activated in hybrids due to crosstalk between one parent's Prdm9 allele and the opposite parent's chromosome. Together these data support a model where haplotype-specific PRDM9 binding directs biased gene conversion at hotspots, ultimately leading to hotspot erosion.

  17. PRDM9 drives evolutionary erosion of hotspots in Mus musculus through haplotype-specific initiation of meiotic recombination.

    Science.gov (United States)

    Baker, Christopher L; Kajita, Shimpei; Walker, Michael; Saxl, Ruth L; Raghupathy, Narayanan; Choi, Kwangbom; Petkov, Petko M; Paigen, Kenneth

    2015-01-01

    Meiotic recombination generates new genetic variation and assures the proper segregation of chromosomes in gametes. PRDM9, a zinc finger protein with histone methyltransferase activity, initiates meiotic recombination by binding DNA at recombination hotspots and directing the position of DNA double-strand breaks (DSB). The DSB repair mechanism suggests that hotspots should eventually self-destruct, yet genome-wide recombination levels remain constant, a conundrum known as the hotspot paradox. To test if PRDM9 drives this evolutionary erosion, we measured activity of the Prdm9Cst allele in two Mus musculus subspecies, M.m. castaneus, in which Prdm9Cst arose, and M.m. domesticus, into which Prdm9Cst was introduced experimentally. Comparing these two strains, we find that haplotype differences at hotspots lead to qualitative and quantitative changes in PRDM9 binding and activity. Using Mus spretus as an outlier, we found most variants affecting PRDM9Cst binding arose and were fixed in M.m. castaneus, suppressing hotspot activity. Furthermore, M.m. castaneus×M.m. domesticus F1 hybrids exhibit novel hotspots, with large haplotype biases in both PRDM9 binding and chromatin modification. These novel hotspots represent sites of historic evolutionary erosion that become activated in hybrids due to crosstalk between one parent's Prdm9 allele and the opposite parent's chromosome. Together these data support a model where haplotype-specific PRDM9 binding directs biased gene conversion at hotspots, ultimately leading to hotspot erosion.

  18. Genomic landscape of human diversity across Madagascar

    Science.gov (United States)

    Pierron, Denis; Heiske, Margit; Razafindrazaka, Harilanto; Rakoto, Ignace; Rabetokotany, Nelly; Ravololomanga, Bodo; Rakotozafy, Lucien M.-A.; Rakotomalala, Mireille Mialy; Razafiarivony, Michel; Rasoarifetra, Bako; Raharijesy, Miakabola Andriamampianina; Razafindralambo, Lolona; Ramilisonina; Fanony, Fulgence; Lejamble, Sendra; Thomas, Olivier; Mohamed Abdallah, Ahmed; Rocher, Christophe; Arachiche, Amal; Tonaso, Laure; Pereda-loth, Veronica; Schiavinato, Stéphanie; Brucato, Nicolas; Ricaut, Francois-Xavier; Kusuma, Pradiptajati; Sudoyo, Herawati; Ni, Shengyu; Boland, Anne; Deleuze, Jean-Francois; Beaujard, Philippe; Grange, Philippe; Adelaar, Sander; Stoneking, Mark; Rakotoarisoa, Jean-Aimé; Radimilahy, Chantal; Letellier, Thierry

    2017-01-01

    Although situated ∼400 km from the east coast of Africa, Madagascar exhibits cultural, linguistic, and genetic traits from both Southeast Asia and Eastern Africa. The settlement history remains contentious; we therefore used a grid-based approach to sample at high resolution the genomic diversity (including maternal lineages, paternal lineages, and genome-wide data) across 257 villages and 2,704 Malagasy individuals. We find a common Bantu and Austronesian descent for all Malagasy individuals with a limited paternal contribution from Europe and the Middle East. Admixture and demographic growth happened recently, suggesting a rapid settlement of Madagascar during the last millennium. However, the distribution of African and Asian ancestry across the island reveals that the admixture was sex biased and happened heterogeneously across Madagascar, suggesting independent colonization of Madagascar from Africa and Asia rather than settlement by an already admixed population. In addition, there are geographic influences on the present genomic diversity, independent of the admixture, showing that a few centuries is sufficient to produce detectable genetic structure in human populations. PMID:28716916

  19. Integrated genetic and epigenetic analysis identifies haplotype-specific methylation in the FTO type 2 diabetes and obesity susceptibility locus.

    Directory of Open Access Journals (Sweden)

    Christopher G Bell

    Full Text Available Recent multi-dimensional approaches to the study of complex disease have revealed powerful insights into how genetic and epigenetic factors may underlie their aetiopathogenesis. We examined genotype-epigenotype interactions in the context of Type 2 Diabetes (T2D, focussing on known regions of genomic susceptibility. We assayed DNA methylation in 60 females, stratified according to disease susceptibility haplotype using previously identified association loci. CpG methylation was assessed using methylated DNA immunoprecipitation on a targeted array (MeDIP-chip and absolute methylation values were estimated using a Bayesian algorithm (BATMAN. Absolute methylation levels were quantified across LD blocks, and we identified increased DNA methylation on the FTO obesity susceptibility haplotype, tagged by the rs8050136 risk allele A (p = 9.40×10(-4, permutation p = 1.0×10(-3. Further analysis across the 46 kb LD block using sliding windows localised the most significant difference to be within a 7.7 kb region (p = 1.13×10(-7. Sequence level analysis, followed by pyrosequencing validation, revealed that the methylation difference was driven by the co-ordinated phase of CpG-creating SNPs across the risk haplotype. This 7.7 kb region of haplotype-specific methylation (HSM, encapsulates a Highly Conserved Non-Coding Element (HCNE that has previously been validated as a long-range enhancer, supported by the histone H3K4me1 enhancer signature. This study demonstrates that integration of Genome-Wide Association (GWA SNP and epigenomic DNA methylation data can identify potential novel genotype-epigenotype interactions within disease-associated loci, thus providing a novel route to aid unravelling common complex diseases.

  20. Retinal degeneration slow (rds) in mouse results from simple insertion of a t haplotype-specific element into protein-coding exon II

    Energy Technology Data Exchange (ETDEWEB)

    Ma, J.; Norton, J.C.; Allen, A.C.; Burns, J.L.; Travis, G.H. [Univ. of Texas Southwestern Medical Center, Dallas, TX (United States)] [and others

    1995-07-20

    Retinal degeneration slow (rds) is a semidominant mutation of mice that causes dysplasia and degeneration of rod and cone photoreceptors. Mutations in RDS, the human ortholog of the rds gene, are responsible for several inherited retinal dystrophies including a subset of retinitis pigmentosa. The normal rds locus encodes rds/peripherin, an integral membrane glycoprotein present in outer segment discs. Genomic libraries form wildtype and rds/rds mice were screened with an rds cDNA, and phage {lambda} clones that span the normal and mutant loci were mapped. We show that in mice, rds is caused by the insertion into exon II of a 9.2-kb repetitive genomic element that is very similar to the t haplotype-specific element in the H-2 complex. The entire element is included in the RNA products of the mutant locus. We present evidence that rds in mice represents a null allele. 40 refs., 4 figs.

  1. Two Tales of Prokaryotic Genomic Diversity: Escherichia coli and Halophiles

    Directory of Open Access Journals (Sweden)

    Lejla Pašić

    2014-01-01

    Full Text Available Prokaryotes are generally characterized by vast genomic diversity that has been shaped by mutations, horizontal gene transfer, bacteriocins and phage predation. Enormous genetic diversity has developed as a result of stresses imposed in harsh environments and the ability of microorganisms to adapt. Two examples of prokaryotic diversity are presented: on intraspecies level, exemplified by Escherichia coli, and the diversity of the hypersaline environment, with the discussion of food-related health issues and biotechnological potential.

  2. Genetic Competence Drives Genome Diversity in Bacillus subtilis

    Science.gov (United States)

    Chevreux, Bastien; Serra, Cláudia R; Schyns, Ghislain; Henriques, Adriano O

    2018-01-01

    Abstract Prokaryote genomes are the result of a dynamic flux of genes, with increases achieved via horizontal gene transfer and reductions occurring through gene loss. The ecological and selective forces that drive this genomic flexibility vary across species. Bacillus subtilis is a naturally competent bacterium that occupies various environments, including plant-associated, soil, and marine niches, and the gut of both invertebrates and vertebrates. Here, we quantify the genomic diversity of B. subtilis and infer the genome dynamics that explain the high genetic and phenotypic diversity observed. Phylogenomic and comparative genomic analyses of 42 B. subtilis genomes uncover a remarkable genome diversity that translates into a core genome of 1,659 genes and an asymptotic pangenome growth rate of 57 new genes per new genome added. This diversity is due to a large proportion of low-frequency genes that are acquired from closely related species. We find no gene-loss bias among wild isolates, which explains why the cloud genome, 43% of the species pangenome, represents only a small proportion of each genome. We show that B. subtilis can acquire xenologous copies of core genes that propagate laterally among strains within a niche. While not excluding the contributions of other mechanisms, our results strongly suggest a process of gene acquisition that is largely driven by competence, where the long-term maintenance of acquired genes depends on local and global fitness effects. This competence-driven genomic diversity provides B. subtilis with its generalist character, enabling it to occupy a wide range of ecological niches and cycle through them. PMID:29272410

  3. Genome size diversity in orchids: consequences and evolution

    Science.gov (United States)

    Leitch, I. J.; Kahandawala, I.; Suda, J.; Hanson, L.; Ingrouille, M. J.; Chase, M. W.; Fay, M. F.

    2009-01-01

    Background The amount of DNA comprising the genome of an organism (its genome size) varies a remarkable 40 000-fold across eukaryotes, yet most groups are characterized by much narrower ranges (e.g. 14-fold in gymnosperms, 3- to 4-fold in mammals). Angiosperms stand out as one of the most variable groups with genome sizes varying nearly 2000-fold. Nevertheless within angiosperms the majority of families are characterized by genomes which are small and vary little. Species with large genomes are mostly restricted to a few monocots families including Orchidaceae. Scope A survey of the literature revealed that genome size data for Orchidaceae are comparatively rare representing just 327 species. Nevertheless they reveal that Orchidaceae are currently the most variable angiosperm family with genome sizes ranging 168-fold (1C = 0·33–55·4 pg). Analysing the data provided insights into the distribution, evolution and possible consequences to the plant of this genome size diversity. Conclusions Superimposing the data onto the increasingly robust phylogenetic tree of Orchidaceae revealed how different subfamilies were characterized by distinct genome size profiles. Epidendroideae possessed the greatest range of genome sizes, although the majority of species had small genomes. In contrast, the largest genomes were found in subfamilies Cypripedioideae and Vanilloideae. Genome size evolution within this subfamily was analysed as this is the only one with reasonable representation of data. This approach highlighted striking differences in genome size and karyotype evolution between the closely related Cypripedium, Paphiopedilum and Phragmipedium. As to the consequences of genome size diversity, various studies revealed that this has both practical (e.g. application of genetic fingerprinting techniques) and biological consequences (e.g. affecting where and when an orchid may grow) and emphasizes the importance of obtaining further genome size data given the considerable

  4. Genome size diversity in orchids: consequences and evolution.

    Science.gov (United States)

    Leitch, I J; Kahandawala, I; Suda, J; Hanson, L; Ingrouille, M J; Chase, M W; Fay, M F

    2009-08-01

    The amount of DNA comprising the genome of an organism (its genome size) varies a remarkable 40 000-fold across eukaryotes, yet most groups are characterized by much narrower ranges (e.g. 14-fold in gymnosperms, 3- to 4-fold in mammals). Angiosperms stand out as one of the most variable groups with genome sizes varying nearly 2000-fold. Nevertheless within angiosperms the majority of families are characterized by genomes which are small and vary little. Species with large genomes are mostly restricted to a few monocots families including Orchidaceae. A survey of the literature revealed that genome size data for Orchidaceae are comparatively rare representing just 327 species. Nevertheless they reveal that Orchidaceae are currently the most variable angiosperm family with genome sizes ranging 168-fold (1C = 0.33-55.4 pg). Analysing the data provided insights into the distribution, evolution and possible consequences to the plant of this genome size diversity. Superimposing the data onto the increasingly robust phylogenetic tree of Orchidaceae revealed how different subfamilies were characterized by distinct genome size profiles. Epidendroideae possessed the greatest range of genome sizes, although the majority of species had small genomes. In contrast, the largest genomes were found in subfamilies Cypripedioideae and Vanilloideae. Genome size evolution within this subfamily was analysed as this is the only one with reasonable representation of data. This approach highlighted striking differences in genome size and karyotype evolution between the closely related Cypripedium, Paphiopedilum and Phragmipedium. As to the consequences of genome size diversity, various studies revealed that this has both practical (e.g. application of genetic fingerprinting techniques) and biological consequences (e.g. affecting where and when an orchid may grow) and emphasizes the importance of obtaining further genome size data given the considerable phylogenetic gaps which have been

  5. Population genomics diversity of Plasmodium falciparum in malaria ...

    African Journals Online (AJOL)

    There is however little information about the genetic diversity of Plasmodium falciparum in Nigeria. Objective: To determine the population genomic diversity of Plasmodium falciparum in malaria patients attending Okelele Com- munity Healthcare Centre, Okelele, Ilorin, Kwara State. Methods: In this study, 50 Plasmodium ...

  6. Population size changes reshape genomic patterns of diversity

    DEFF Research Database (Denmark)

    Pool, John E; Nielsen, Rasmus

    2007-01-01

    Elucidating the forces responsible for genomic variation is critical for understanding evolution. Under standard conditions, X-linked diversity is expected to be three-quarters the level of autosomal diversity. Empirical data often deviate from this prediction, but the reasons for these departures...... of taxa supports an important role for this effect in accounting for population differences in the ratio of X-linked to autosomal diversity. Consideration of this effect may improve the inference of population history and other evolutionary processes....

  7. Genomic and Genetic Diversity within the Pseudomonas fluorescens Complex.

    Directory of Open Access Journals (Sweden)

    Daniel Garrido-Sanz

    Full Text Available The Pseudomonas fluorescens complex includes Pseudomonas strains that have been taxonomically assigned to more than fifty different species, many of which have been described as plant growth-promoting rhizobacteria (PGPR with potential applications in biocontrol and biofertilization. So far the phylogeny of this complex has been analyzed according to phenotypic traits, 16S rDNA, MLSA and inferred by whole-genome analysis. However, since most of the type strains have not been fully sequenced and new species are frequently described, correlation between taxonomy and phylogenomic analysis is missing. In recent years, the genomes of a large number of strains have been sequenced, showing important genomic heterogeneity and providing information suitable for genomic studies that are important to understand the genomic and genetic diversity shown by strains of this complex. Based on MLSA and several whole-genome sequence-based analyses of 93 sequenced strains, we have divided the P. fluorescens complex into eight phylogenomic groups that agree with previous works based on type strains. Digital DDH (dDDH identified 69 species and 75 subspecies within the 93 genomes. The eight groups corresponded to clustering with a threshold of 31.8% dDDH, in full agreement with our MLSA. The Average Nucleotide Identity (ANI approach showed inconsistencies regarding the assignment to species and to the eight groups. The small core genome of 1,334 CDSs and the large pan-genome of 30,848 CDSs, show the large diversity and genetic heterogeneity of the P. fluorescens complex. However, a low number of strains were enough to explain most of the CDSs diversity at core and strain-specific genomic fractions. Finally, the identification and analysis of group-specific genome and the screening for distinctive characters revealed a phylogenomic distribution of traits among the groups that provided insights into biocontrol and bioremediation applications as well as their role as

  8. Cancer Genomics: Diversity and Disparity Across Ethnicity and Geography.

    Science.gov (United States)

    Tan, Daniel S W; Mok, Tony S K; Rebbeck, Timothy R

    2016-01-01

    Ethnic and geographic differences in cancer incidence, prognosis, and treatment outcomes can be attributed to diversity in the inherited (germline) and somatic genome. Although international large-scale sequencing efforts are beginning to unravel the genomic underpinnings of cancer traits, much remains to be known about the underlying mechanisms and determinants of genomic diversity. Carcinogenesis is a dynamic, complex phenomenon representing the interplay between genetic and environmental factors that results in divergent phenotypes across ethnicities and geography. For example, compared with whites, there is a higher incidence of prostate cancer among Africans and African Americans, and the disease is generally more aggressive and fatal. Genome-wide association studies have identified germline susceptibility loci that may account for differences between the African and non-African patients, but the lack of availability of appropriate cohorts for replication studies and the incomplete understanding of genomic architecture across populations pose major limitations. We further discuss the transformative potential of routine diagnostic evaluation for actionable somatic alterations, using lung cancer as an example, highlighting implications of population disparities, current hurdles in implementation, and the far-reaching potential of clinical genomics in enhancing cancer prevention, diagnosis, and treatment. As we enter the era of precision cancer medicine, a concerted multinational effort is key to addressing population and genomic diversity as well as overcoming barriers and geographical disparities in research and health care delivery. © 2015 by American Society of Clinical Oncology.

  9. Genomic diversity of citrate fermentation in Klebsiella pneumoniae

    Directory of Open Access Journals (Sweden)

    Liu Yen-Ming

    2009-08-01

    Full Text Available Abstract Background It has long been recognized that Klebsiella pneumoniae can grow anaerobically on citrate. Genes responsible for citrate fermentation of K. pneumoniae were known to be located in a 13-kb gene cluster on the chromosome. By whole genome comparison of the available K. pneumoniae sequences (MGH 78578, 342, and NTUH-K2044, however, we discovered that the fermentation gene cluster was present in MGH 78578 and 342, but absent in NTUH-K2044. In the present study, the previously unknown genome diversity of citrate fermentation among K. pneumoniae clinical isolates was investigated. Results Using a genomic microarray containing probe sequences from multiple K. pneumoniae strains, we investigated genetic diversity among K. pneumoniae clinical isolates and found that a genomic region containing the citrate fermentation genes was not universally present in all strains. We confirmed by PCR analysis that the gene cluster was detectable in about half of the strains tested. To demonstrate the metabolic function of the genomic region, anaerobic growth of K. pneumoniae in artificial urine medium (AUM was examined for ten strains with different clinical histories and genomic backgrounds, and the citrate fermentation potential was found correlated with the genomic region. PCR detection of the genomic region yielded high positive rates among a variety of clinical isolates collected from urine, blood, wound infection, and pneumonia. Conserved genetic organizations in the vicinity of the citrate fermentation gene clusters among K. pneumoniae, Salmonella enterica, and Escherichia coli suggest that the13-kb genomic region were not independently acquired. Conclusion Not all, but nearly half of the K. pneumoniae clinical isolates carry the genes responsible for anaerobic growth on citrate. Genomic variation of citrate fermentation genes in K. pneumoniae may contribute to metabolic diversity and adaptation to variable nutrient conditions in different

  10. The genomic and phenotypic diversity of Schizosaccharomyces pombe.

    Science.gov (United States)

    Jeffares, Daniel C; Rallis, Charalampos; Rieux, Adrien; Speed, Doug; Převorovský, Martin; Mourier, Tobias; Marsellach, Francesc X; Iqbal, Zamin; Lau, Winston; Cheng, Tammy M K; Pracana, Rodrigo; Mülleder, Michael; Lawson, Jonathan L D; Chessel, Anatole; Bala, Sendu; Hellenthal, Garrett; O'Fallon, Brendan; Keane, Thomas; Simpson, Jared T; Bischof, Leanne; Tomiczek, Bartlomiej; Bitton, Danny A; Sideri, Theodora; Codlin, Sandra; Hellberg, Josephine E E U; van Trigt, Laurent; Jeffery, Linda; Li, Juan-Juan; Atkinson, Sophie; Thodberg, Malte; Febrer, Melanie; McLay, Kirsten; Drou, Nizar; Brown, William; Hayles, Jacqueline; Carazo Salas, Rafael E; Ralser, Markus; Maniatis, Nikolas; Balding, David J; Balloux, Francois; Durbin, Richard; Bähler, Jürg

    2015-03-01

    Natural variation within species reveals aspects of genome evolution and function. The fission yeast Schizosaccharomyces pombe is an important model for eukaryotic biology, but researchers typically use one standard laboratory strain. To extend the usefulness of this model, we surveyed the genomic and phenotypic variation in 161 natural isolates. We sequenced the genomes of all strains, finding moderate genetic diversity (π = 3 × 10(-3) substitutions/site) and weak global population structure. We estimate that dispersal of S. pombe began during human antiquity (∼340 BCE), and ancestors of these strains reached the Americas at ∼1623 CE. We quantified 74 traits, finding substantial heritable phenotypic diversity. We conducted 223 genome-wide association studies, with 89 traits showing at least one association. The most significant variant for each trait explained 22% of the phenotypic variance on average, with indels having larger effects than SNPs. This analysis represents a rich resource to examine genotype-phenotype relationships in a tractable model.

  11. Castor bean organelle genome sequencing and worldwide genetic diversity analysis.

    Directory of Open Access Journals (Sweden)

    Maximo Rivarola

    Full Text Available Castor bean is an important oil-producing plant in the Euphorbiaceae family. Its high-quality oil contains up to 90% of the unusual fatty acid ricinoleate, which has many industrial and medical applications. Castor bean seeds also contain ricin, a highly toxic Type 2 ribosome-inactivating protein, which has gained relevance in recent years due to biosafety concerns. In order to gain knowledge on global genetic diversity in castor bean and to ultimately help the development of breeding and forensic tools, we carried out an extensive chloroplast sequence diversity analysis. Taking advantage of the recently published genome sequence of castor bean, we assembled the chloroplast and mitochondrion genomes extracting selected reads from the available whole genome shotgun reads. Using the chloroplast reference genome we used the methylation filtration technique to readily obtain draft genome sequences of 7 geographically and genetically diverse castor bean accessions. These sequence data were used to identify single nucleotide polymorphism markers and phylogenetic analysis resulted in the identification of two major clades that were not apparent in previous population genetic studies using genetic markers derived from nuclear DNA. Two distinct sub-clades could be defined within each major clade and large-scale genotyping of castor bean populations worldwide confirmed previously observed low levels of genetic diversity and showed a broad geographic distribution of each sub-clade.

  12. The Global Invertebrate Genomics Alliance (GIGA). 2014. Developing Community Resources to Study Diverse Invertebrate Genomes

    NARCIS (Netherlands)

    Pomponi, S.A.

    2014-01-01

    Over 95% of all metazoan (animal) species comprise the “invertebrates,” but very few genomes from these organisms have been sequenced. We have, therefore, formed a “Global Invertebrate Genomics Alliance” (GIGA). Our intent is to build a collaborative network of diverse scientists to tackle major

  13. Genome diversity in wild grasses under environmental stress

    Science.gov (United States)

    Fitzgerald, Timothy L.; Shapter, Frances M.; McDonald, Stuart; Waters, Daniel L. E.; Chivers, Ian H.; Drenth, Andre; Nevo, Eviatar; Henry, Robert J.

    2011-01-01

    Patterns of diversity distribution in the Isa defense locus in wild-barley populations suggest adaptive selection at this locus. The extent to which environmental selection may act at additional nuclear-encoded defense loci and within the whole chloroplast genome has now been examined by analyses in two grass species. Analysis of genetic diversity in wild barley (Hordeum spontaneum) defense genes revealed much greater variation in biotic stress-related genes than abiotic stress-related genes. Genetic diversity at the Isa defense locus in wild populations of weeping ricegrass [Microlaena stipoides (Labill.) R. Br.], a very distant wild-rice relative, was more diverse in samples from relatively hotter and drier environments, a phenomenon that reflects observations in wild barley populations. Whole-chloroplast genome sequences of bulked weeping ricegrass individuals sourced from contrasting environments showed higher levels of diversity in the drier environment in both coding and noncoding portions of the genome. Increased genetic diversity may be important in allowing plant populations to adapt to greater environmental variation in warmer and drier climatic conditions. PMID:22173638

  14. Evolution and Diversity in Human Herpes Simplex Virus Genomes

    Science.gov (United States)

    Gatherer, Derek; Ochoa, Alejandro; Greenbaum, Benjamin; Dolan, Aidan; Bowden, Rory J.; Enquist, Lynn W.; Legendre, Matthieu; Davison, Andrew J.

    2014-01-01

    Herpes simplex virus 1 (HSV-1) causes a chronic, lifelong infection in >60% of adults. Multiple recent vaccine trials have failed, with viral diversity likely contributing to these failures. To understand HSV-1 diversity better, we comprehensively compared 20 newly sequenced viral genomes from China, Japan, Kenya, and South Korea with six previously sequenced genomes from the United States, Europe, and Japan. In this diverse collection of passaged strains, we found that one-fifth of the newly sequenced members share a gene deletion and one-third exhibit homopolymeric frameshift mutations (HFMs). Individual strains exhibit genotypic and potential phenotypic variation via HFMs, deletions, short sequence repeats, and single-nucleotide polymorphisms, although the protein sequence identity between strains exceeds 90% on average. In the first genome-scale analysis of positive selection in HSV-1, we found signs of selection in specific proteins and residues, including the fusion protein glycoprotein H. We also confirmed previous results suggesting that recombination has occurred with high frequency throughout the HSV-1 genome. Despite this, the HSV-1 strains analyzed clustered by geographic origin during whole-genome distance analysis. These data shed light on likely routes of HSV-1 adaptation to changing environments and will aid in the selection of vaccine antigens that are invariant worldwide. PMID:24227835

  15. Genomic diversity among Basmati rice ( Oryza sativa L) mutants ...

    African Journals Online (AJOL)

    Genomic diversity among Basmati rice ( Oryza sativa L) mutants obtained through 60 Co gamma radiations using AFLP markers. ... In order to obtain new varieties of rice with improved agronomic and grain characteristics, gamma radiation (60Co) has been used to generate novel mutants of the Basmati rice. In this study ...

  16. Comparative Analysis of Genome Diversity in Bullmastiff Dogs.

    Directory of Open Access Journals (Sweden)

    Sally-Anne Mortlock

    Full Text Available Management and preservation of genomic diversity in dog breeds is a major objective for maintaining health. The present study was undertaken to characterise genomic diversity in Bullmastiff dogs using both genealogical and molecular analysis. Genealogical analysis of diversity was conducted using a database consisting of 16,378 Bullmastiff pedigrees from year 1980 to 2013. Additionally, a total of 188 Bullmastiff dogs were genotyped using the 170,000 SNP Illumina CanineHD Beadchip. Genealogical parameters revealed a mean inbreeding coefficient of 0.047; 142 total founders (f; an effective number of founders (fe of 79; an effective number of ancestors (fa of 62; and an effective population size of the reference population of 41. Genetic diversity and the degree of genome-wide homogeneity within the breed were also investigated using molecular data. Multiple-locus heterozygosity (MLH was equal to 0.206; runs of homozygosity (ROH as proportion of the genome, averaged 16.44%; effective population size was 29.1, with an average inbreeding coefficient of 0.035, all estimated using SNP Data. Fine-scale population structure was analysed using NETVIEW, a population analysis pipeline. Visualisation of the high definition network captured relationships among individuals within and between subpopulations. Effects of unequal founder use, and ancestral inbreeding and selection, were evident. While current levels of Bullmastiff heterozygosity, inbreeding and homozygosity are not unusual, a relatively small effective population size indicates that a breeding strategy to reduce the inbreeding rate may be beneficial.

  17. Comparative Analysis of Genome Diversity in Bullmastiff Dogs.

    Science.gov (United States)

    Mortlock, Sally-Anne; Khatkar, Mehar S; Williamson, Peter

    2016-01-01

    Management and preservation of genomic diversity in dog breeds is a major objective for maintaining health. The present study was undertaken to characterise genomic diversity in Bullmastiff dogs using both genealogical and molecular analysis. Genealogical analysis of diversity was conducted using a database consisting of 16,378 Bullmastiff pedigrees from year 1980 to 2013. Additionally, a total of 188 Bullmastiff dogs were genotyped using the 170,000 SNP Illumina CanineHD Beadchip. Genealogical parameters revealed a mean inbreeding coefficient of 0.047; 142 total founders (f); an effective number of founders (fe) of 79; an effective number of ancestors (fa) of 62; and an effective population size of the reference population of 41. Genetic diversity and the degree of genome-wide homogeneity within the breed were also investigated using molecular data. Multiple-locus heterozygosity (MLH) was equal to 0.206; runs of homozygosity (ROH) as proportion of the genome, averaged 16.44%; effective population size was 29.1, with an average inbreeding coefficient of 0.035, all estimated using SNP Data. Fine-scale population structure was analysed using NETVIEW, a population analysis pipeline. Visualisation of the high definition network captured relationships among individuals within and between subpopulations. Effects of unequal founder use, and ancestral inbreeding and selection, were evident. While current levels of Bullmastiff heterozygosity, inbreeding and homozygosity are not unusual, a relatively small effective population size indicates that a breeding strategy to reduce the inbreeding rate may be beneficial.

  18. Natural Product Biosynthetic Diversity and Comparative Genomics of the Cyanobacteria.

    Science.gov (United States)

    Dittmann, Elke; Gugger, Muriel; Sivonen, Kaarina; Fewer, David P

    2015-10-01

    Cyanobacteria are an ancient lineage of slow-growing photosynthetic bacteria and a prolific source of natural products with intricate chemical structures and potent biological activities. The bulk of these natural products are known from just a handful of genera. Recent efforts have elucidated the mechanisms underpinning the biosynthesis of a diverse array of natural products from cyanobacteria. Many of the biosynthetic mechanisms are unique to cyanobacteria or rarely described from other organisms. Advances in genome sequence technology have precipitated a deluge of genome sequences for cyanobacteria. This makes it possible to link known natural products to biosynthetic gene clusters but also accelerates the discovery of new natural products through genome mining. These studies demonstrate that cyanobacteria encode a huge variety of cryptic gene clusters for the production of natural products, and the known chemical diversity is likely to be just a fraction of the true biosynthetic capabilities of this fascinating and ancient group of organisms. Copyright © 2015. Published by Elsevier Ltd.

  19. Lampreys as Diverse Model Organisms in the Genomics Era

    Science.gov (United States)

    McCauley, David W.; Docker, Margaret F.; Whyard, Steve; Li, Weiming

    2015-01-01

    Lampreys, one of the two surviving groups of ancient vertebrates, have become important models for study in diverse fields of biology. Lampreys (of which there are approximately 40 species) are being studied, for example, (a) to control pest sea lamprey in the North American Great Lakes and to restore declining populations of native species elsewhere; (b) in biomedical research, focusing particularly on the regenerative capability of lampreys; and (c) by developmental biologists studying the evolution of key vertebrate characters. Although a lack of genetic resources has hindered research on the mechanisms regulating many aspects of lamprey life history and development, formerly intractable questions are now amenable to investigation following the recent publication of the sea lamprey genome. Here, we provide an overview of the ways in which genomic tools are currently being deployed to tackle diverse research questions and suggest several areas that may benefit from the availability of the sea lamprey genome. PMID:26951616

  20. Genomic Analysis of Two Phylogenetically DistinctNitrospiraSpecies Reveals Their Genomic Plasticity and Functional Diversity.

    Science.gov (United States)

    Ushiki, Norisuke; Fujitani, Hirotsugu; Shimada, Yu; Morohoshi, Tomohiro; Sekiguchi, Yuji; Tsuneda, Satoshi

    2017-01-01

    The genus Nitrospira represents a dominant group of nitrite-oxidizing bacteria in natural and engineered ecosystems. This genus is phylogenetically divided into six lineages, for which vast phylogenetic and functional diversity has been revealed by recent molecular ecophysiological analyses. However, the genetic basis underlying these phenotypic differences remains largely unknown because of the lack of genome sequences representing their diversity. To gain a more comprehensive understanding of Nitrospira , we performed genomic comparisons between two Nitrospira strains (ND1 and NJ1 belonging to lineages I and II, respectively) previously isolated from activated sludge. In addition, the genomes of these strains were systematically compared with previously reported six Nitrospira genomes to reveal their similarity and presence/absence of several functional genes/operons. Comparisons of Nitrospira genomes indicated that their genomic diversity reflects phenotypic differences and versatile nitrogen metabolisms. Although most genes involved in key metabolic pathways were conserved between strains ND1 and NJ1, assimilatory nitrite reduction pathways of the two Nitrospira strains were different. In addition, the genomes of both strains contain a phylogenetically different urease locus and we confirmed their ureolytic activity. During gene annotation of strain NJ1, we found a gene cluster encoding a quorum-sensing system. From the enriched supernatant of strain NJ1, we successfully identified seven types of acyl-homoserine lactones with a range of C10-C14. In addition, the genome of strain NJ1 lacks genes relevant to flagella and the clustered regularly interspaced short palindromic repeat (CRISPR)-Cas (CRISPR-associated genes) systems, whereas most nitrifying bacteria including strain ND1 possess these genomic elements. These findings enhance our understanding of genomic plasticity and functional diversity among members of the genus Nitrospira .

  1. Across language families: Genome diversity mirrors linguistic variation within Europe.

    Science.gov (United States)

    Longobardi, Giuseppe; Ghirotto, Silvia; Guardiano, Cristina; Tassi, Francesca; Benazzo, Andrea; Ceolin, Andrea; Barbujani, Guido

    2015-08-01

    The notion that patterns of linguistic and biological variation may cast light on each other and on population histories dates back to Darwin's times; yet, turning this intuition into a proper research program has met with serious methodological difficulties, especially affecting language comparisons. This article takes advantage of two new tools of comparative linguistics: a refined list of Indo-European cognate words, and a novel method of language comparison estimating linguistic diversity from a universal inventory of grammatical polymorphisms, and hence enabling comparison even across different families. We corroborated the method and used it to compare patterns of linguistic and genomic variation in Europe. Two sets of linguistic distances, lexical and syntactic, were inferred from these data and compared with measures of geographic and genomic distance through a series of matrix correlation tests. Linguistic and genomic trees were also estimated and compared. A method (Treemix) was used to infer migration episodes after the main population splits. We observed significant correlations between genomic and linguistic diversity, the latter inferred from data on both Indo-European and non-Indo-European languages. Contrary to previous observations, on the European scale, language proved a better predictor of genomic differences than geography. Inferred episodes of genetic admixture following the main population splits found convincing correlates also in the linguistic realm. These results pave the ground for previously unfeasible cross-disciplinary analyses at the worldwide scale, encompassing populations of distant language families. © 2015 Wiley Periodicals, Inc.

  2. Report of the second Human Genome Diversity workshop

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1992-12-31

    The Second Human Genome Diversity Workshop was successfully held at Penn State University from October 29--31, 1992. The Workshop was essentially organized around 7 groups, each comprising approximately 10 participants, representing the sampling issues in different regions of the world. These groups worked independently, using a common format provided by the organizers; this was adjusted as needed by the individual groups. The Workshop began with a presentation of the mandate to the participants, and of the procedures to be followed during the workshop. Dr. Feldman presented a summary of the results from the First Workshop. He and the other organizers also presented brief comments giving their perspective on the objectives of the Second Workshop. Dr. Julia Bodmer discussed the study of European genetic diversity, especially in the context of the HLA experience there, and of plans to extend such studies in the coming years. She also discussed surveys of world HLA laboratories in regard to resources related to Human Genome Diversity. Dr. Mark Weiss discussed the relevance of nonhuman primate studies for understanding how demographic processes, such as mate exchange between local groups, affected the local dispersion of genetic variation. Primate population geneticists have some relevant experience in interpreting variation at this local level, in particular, with various DNA fingerprinting methods. This experience may be relevant to the Human Genome Diversity Project, in terms of practical and statistical issues.

  3. Impact of Clostridium botulinum genomic diversity on food safety.

    Science.gov (United States)

    Peck, Michael W; van Vliet, Arnoud Hm

    2016-08-01

    The deadly botulinum neurotoxin formed by Clostridium botulinum is the causative agent of foodborne botulism. The increasing availability of C. botulinum genome sequences is starting to allow the genomic diversity of C. botulinum Groups I and II and their neurotoxins to be characterised. This information will impact on microbiological food safety through improved surveillance and tracing/tracking during outbreaks, and a better characterisation of C. botulinum Groups I and II, including the risk presented, and new insights into their biology, food chain transmission, and evolution.

  4. Transposable elements: genome innovation, chromosome diversity, and centromere conflict.

    Science.gov (United States)

    Klein, Savannah J; O'Neill, Rachel J

    2018-01-13

    Although it was nearly 70 years ago when transposable elements (TEs) were first discovered "jumping" from one genomic location to another, TEs are now recognized as contributors to genomic innovations as well as genome instability across a wide variety of species. In this review, we illustrate the ways in which active TEs, specifically retroelements, can create novel chromosome rearrangements and impact gene expression, leading to disease in some cases and species-specific diversity in others. We explore the ways in which eukaryotic genomes have evolved defense mechanisms to temper TE activity and the ways in which TEs continue to influence genome structure despite being rendered transpositionally inactive. Finally, we focus on the role of TEs in the establishment, maintenance, and stabilization of critical, yet rapidly evolving, chromosome features: eukaryotic centromeres. Across centromeres, specific types of TEs participate in genomic conflict, a balancing act wherein they are actively inserting into centromeric domains yet are harnessed for the recruitment of centromeric histones and potentially new centromere formation.

  5. Genomic analyses provide new insights into apple evolution, domestication and genetic diversity

    Science.gov (United States)

    Human selection has reshaped crop genomes. Here we report an apple genome variation map generated through genome sequencing of 117 diverse accessions. A comprehensive model of apple speciation and domestication along the Silk Road was proposed based on evidence from diverse genomic analyses. Cultiva...

  6. A genomic scale map of genetic diversity in Trypanosoma cruzi

    Directory of Open Access Journals (Sweden)

    Ackermann Alejandro A

    2012-12-01

    Full Text Available Abstract Background Trypanosoma cruzi, the causal agent of Chagas Disease, affects more than 16 million people in Latin America. The clinical outcome of the disease results from a complex interplay between environmental factors and the genetic background of both the human host and the parasite. However, knowledge of the genetic diversity of the parasite, is currently limited to a number of highly studied loci. The availability of a number of genomes from different evolutionary lineages of T. cruzi provides an unprecedented opportunity to look at the genetic diversity of the parasite at a genomic scale. Results Using a bioinformatic strategy, we have clustered T. cruzi sequence data available in the public domain and obtained multiple sequence alignments in which one or two alleles from the reference CL-Brener were included. These data covers 4 major evolutionary lineages (DTUs: TcI, TcII, TcIII, and the hybrid TcVI. Using these set of alignments we have identified 288,957 high quality single nucleotide polymorphisms and 1,480 indels. In a reduced re-sequencing study we were able to validate ~ 97% of high-quality SNPs identified in 47 loci. Analysis of how these changes affect encoded protein products showed a 0.77 ratio of synonymous to non-synonymous changes in the T. cruzi genome. We observed 113 changes that introduce or remove a stop codon, some causing significant functional changes, and a number of tri-allelic and tetra-allelic SNPs that could be exploited in strain typing assays. Based on an analysis of the observed nucleotide diversity we show that the T. cruzi genome contains a core set of genes that are under apparent purifying selection. Interestingly, orthologs of known druggable targets show statistically significant lower nucleotide diversity values. Conclusions This study provides the first look at the genetic diversity of T. cruzi at a genomic scale. The analysis covers an estimated ~ 60% of the genetic diversity present in the

  7. Correlation exploration of metabolic and genomic diversity in rice

    Directory of Open Access Journals (Sweden)

    Shinozaki Kazuo

    2009-12-01

    Full Text Available Abstract Background It is essential to elucidate the relationship between metabolic and genomic diversity to understand the genetic regulatory networks associated with the changing metabolo-phenotype among natural variation and/or populations. Recent innovations in metabolomics technologies allow us to grasp the comprehensive features of the metabolome. Metabolite quantitative trait analysis is a key approach for the identification of genetic loci involved in metabolite variation using segregated populations. Although several attempts have been made to find correlative relationships between genetic and metabolic diversity among natural populations in various organisms, it is still unclear whether it is possible to discover such correlations between each metabolite and the polymorphisms found at each chromosomal location. To assess the correlative relationship between the metabolic and genomic diversity found in rice accessions, we compared the distance matrices for these two "omics" patterns in the rice accessions. Results We selected 18 accessions from the world rice collection based on their population structure. To determine the genomic diversity of the rice genome, we genotyped 128 restriction fragment length polymorphism (RFLP markers to calculate the genetic distance among the accessions. To identify the variations in the metabolic fingerprint, a soluble extract from the seed grain of each accession was analyzed with one dimensional 1H-nuclear magnetic resonance (NMR. We found no correlation between global metabolic diversity and the phylogenetic relationships among the rice accessions (rs = 0.14 by analyzing the distance matrices (calculated from the pattern of the metabolic fingerprint in the 4.29- to 0.71-ppm 1H chemical shift and the genetic distance on the basis of the RFLP markers. However, local correlation analysis between the distance matrices (derived from each 0.04-ppm integral region of the 1H chemical shift against genetic

  8. Demographic history, selection and functional diversity of the canine genome.

    Science.gov (United States)

    Ostrander, Elaine A; Wayne, Robert K; Freedman, Adam H; Davis, Brian W

    2017-12-01

    The domestic dog represents one of the most dramatic long-term evolutionary experiments undertaken by humans. From a large wolf-like progenitor, unparalleled diversity in phenotype and behaviour has developed in dogs, providing a model for understanding the developmental and genomic mechanisms of diversification. We discuss pattern and process in domestication, beginning with general findings about early domestication and problems in documenting selection at the genomic level. Furthermore, we summarize genotype-phenotype studies based first on single nucleotide polymorphism (SNP) genotyping and then with whole-genome data and show how an understanding of evolution informs topics as different as human history, adaptive and deleterious variation, morphological development, ageing, cancer and behaviour.

  9. Genomic Diversity of Phages Infecting Probiotic Strains of Lactobacillus paracasei

    Science.gov (United States)

    Rousseau, Geneviève M.; Capra, María L.; Quiberoni, Andrea; Tremblay, Denise M.; Labrie, Simon J.

    2015-01-01

    Strains of the Lactobacillus casei group have been extensively studied because some are used as probiotics in foods. Conversely, their phages have received much less attention. We analyzed the complete genome sequences of five L. paracasei temperate phages: CL1, CL2, iLp84, iLp1308, and iA2. Only phage iA2 could not replicate in an indicator strain. The genome lengths ranged from 34,155 bp (iA2) to 39,474 bp (CL1). Phages iA2 and iLp1308 (34,176 bp) possess the smallest genomes reported, thus far, for phages of the L. casei group. The GC contents of the five phage genomes ranged from 44.8 to 45.6%. As observed with many other phages, their genomes were organized as follows: genes coding for DNA packaging, morphogenesis, lysis, lysogeny, and replication. Phages CL1, CL2, and iLp1308 are highly related to each other. Phage iLp84 was also related to these three phages, but the similarities were limited to gene products involved in DNA packaging and structural proteins. Genomic fragments of phages CL1, CL2, iLp1308, and iLp84 were found in several genomes of L. casei strains. Prophage iA2 is unrelated to these four phages, but almost all of its genome was found in at least four L. casei strains. Overall, these phages are distinct from previously characterized Lactobacillus phages. Our results highlight the diversity of L. casei phages and indicate frequent DNA exchanges between phages and their hosts. PMID:26475105

  10. Diversity and genomics of Antarctic marine micro-organisms.

    Science.gov (United States)

    Murray, Alison E; Grzymski, Joseph J

    2007-12-29

    Marine bacterioplanktons are thought to play a vital role in Southern Ocean ecology and ecosystem function, as they do in other ocean systems. However, our understanding of phylogenetic diversity, genome-enabled capabilities and specific adaptations to this persistently cold environment is limited. Bacterioplankton community composition shifts significantly over the annual cycle as sea ice melts and phytoplankton bloom. Microbial diversity in sea ice is better known than that of the plankton, where culture collections do not appear to represent organisms detected with molecular surveys. Broad phylogenetic groupings of Antarctic bacterioplankton such as the marine group I Crenarchaeota, alpha-Proteobacteria (Roseobacter-related and SAR-11 clusters), gamma-Proteobacteria (both cultivated and uncultivated groups) and Bacteriodetes-affiliated organisms in Southern Ocean waters are in common with other ocean systems. Antarctic SSU rRNA gene phylotypes are typically affiliated with other polar sequences. Some species such as Polaribacter irgensii and currently uncultivated gamma-Proteobacteria (Ant4D3 and Ant10A4) may flourish in Antarctic waters, though further studies are needed to address diversity on a larger scale. Insights from initial genomics studies on both cultivated organisms and genomes accessed through shotgun cloning of environmental samples suggest that there are many unique features of these organisms that facilitate survival in high-latitude, persistently cold environments.

  11. The Global Invertebrate Genomics Alliance (GIGA): Developing Community Resources to Study Diverse Invertebrate Genomes

    Science.gov (United States)

    2014-01-01

    Over 95% of all metazoan (animal) species comprise the “invertebrates,” but very few genomes from these organisms have been sequenced. We have, therefore, formed a “Global Invertebrate Genomics Alliance” (GIGA). Our intent is to build a collaborative network of diverse scientists to tackle major challenges (e.g., species selection, sample collection and storage, sequence assembly, annotation, analytical tools) associated with genome/transcriptome sequencing across a large taxonomic spectrum. We aim to promote standards that will facilitate comparative approaches to invertebrate genomics and collaborations across the international scientific community. Candidate study taxa include species from Porifera, Ctenophora, Cnidaria, Placozoa, Mollusca, Arthropoda, Echinodermata, Annelida, Bryozoa, and Platyhelminthes, among others. GIGA will target 7000 noninsect/nonnematode species, with an emphasis on marine taxa because of the unrivaled phyletic diversity in the oceans. Priorities for selecting invertebrates for sequencing will include, but are not restricted to, their phylogenetic placement; relevance to organismal, ecological, and conservation research; and their importance to fisheries and human health. We highlight benefits of sequencing both whole genomes (DNA) and transcriptomes and also suggest policies for genomic-level data access and sharing based on transparency and inclusiveness. The GIGA Web site (http://giga.nova.edu) has been launched to facilitate this collaborative venture. PMID:24336862

  12. The Global Invertebrate Genomics Alliance (GIGA): developing community resources to study diverse invertebrate genomes.

    Science.gov (United States)

    Bracken-Grissom, Heather; Collins, Allen G; Collins, Timothy; Crandall, Keith; Distel, Daniel; Dunn, Casey; Giribet, Gonzalo; Haddock, Steven; Knowlton, Nancy; Martindale, Mark; Medina, Mónica; Messing, Charles; O'Brien, Stephen J; Paulay, Gustav; Putnam, Nicolas; Ravasi, Timothy; Rouse, Greg W; Ryan, Joseph F; Schulze, Anja; Wörheide, Gert; Adamska, Maja; Bailly, Xavier; Breinholt, Jesse; Browne, William E; Diaz, M Christina; Evans, Nathaniel; Flot, Jean-François; Fogarty, Nicole; Johnston, Matthew; Kamel, Bishoy; Kawahara, Akito Y; Laberge, Tammy; Lavrov, Dennis; Michonneau, François; Moroz, Leonid L; Oakley, Todd; Osborne, Karen; Pomponi, Shirley A; Rhodes, Adelaide; Santos, Scott R; Satoh, Nori; Thacker, Robert W; Van de Peer, Yves; Voolstra, Christian R; Welch, David Mark; Winston, Judith; Zhou, Xin

    2014-01-01

    Over 95% of all metazoan (animal) species comprise the "invertebrates," but very few genomes from these organisms have been sequenced. We have, therefore, formed a "Global Invertebrate Genomics Alliance" (GIGA). Our intent is to build a collaborative network of diverse scientists to tackle major challenges (e.g., species selection, sample collection and storage, sequence assembly, annotation, analytical tools) associated with genome/transcriptome sequencing across a large taxonomic spectrum. We aim to promote standards that will facilitate comparative approaches to invertebrate genomics and collaborations across the international scientific community. Candidate study taxa include species from Porifera, Ctenophora, Cnidaria, Placozoa, Mollusca, Arthropoda, Echinodermata, Annelida, Bryozoa, and Platyhelminthes, among others. GIGA will target 7000 noninsect/nonnematode species, with an emphasis on marine taxa because of the unrivaled phyletic diversity in the oceans. Priorities for selecting invertebrates for sequencing will include, but are not restricted to, their phylogenetic placement; relevance to organismal, ecological, and conservation research; and their importance to fisheries and human health. We highlight benefits of sequencing both whole genomes (DNA) and transcriptomes and also suggest policies for genomic-level data access and sharing based on transparency and inclusiveness. The GIGA Web site (http://giga.nova.edu) has been launched to facilitate this collaborative venture.

  13. The Global Invertebrate Genomics Alliance (GIGA): Developing Community Resources to Study Diverse Invertebrate Genomes

    KAUST Repository

    Bracken-Grissom, Heather

    2013-12-12

    Over 95% of all metazoan (animal) species comprise the invertebrates, but very few genomes from these organisms have been sequenced. We have, therefore, formed a Global Invertebrate Genomics Alliance (GIGA). Our intent is to build a collaborative network of diverse scientists to tackle major challenges (e.g., species selection, sample collection and storage, sequence assembly, annotation, analytical tools) associated with genome/transcriptome sequencing across a large taxonomic spectrum. We aim to promote standards that will facilitate comparative approaches to invertebrate genomics and collaborations across the international scientific community. Candidate study taxa include species from Porifera, Ctenophora, Cnidaria, Placozoa, Mollusca, Arthropoda, Echinodermata, Annelida, Bryozoa, and Platyhelminthes, among others. GIGA will target 7000 noninsect/nonnematode species, with an emphasis on marine taxa because of the unrivaled phyletic diversity in the oceans. Priorities for selecting invertebrates for sequencing will include, but are not restricted to, their phylogenetic placement; relevance to organismal, ecological, and conservation research; and their importance to fisheries and human health. We highlight benefits of sequencing both whole genomes (DNA) and transcriptomes and also suggest policies for genomic-level data access and sharing based on transparency and inclusiveness. The GIGA Web site () has been launched to facilitate this collaborative venture.

  14. Diversity of Pseudomonas Genomes, Including Populus-Associated Isolates, as Revealed by Comparative Genome Analysis

    Science.gov (United States)

    Jun, Se-Ran; Wassenaar, Trudy M.; Nookaew, Intawat; Hauser, Loren; Wanchai, Visanu; Land, Miriam; Timm, Collin M.; Lu, Tse-Yuan S.; Schadt, Christopher W.; Doktycz, Mitchel J.; Pelletier, Dale A.

    2015-01-01

    The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches, including the rhizosphere and endosphere of many plants. Their diversity influences the phylogenetic diversity and heterogeneity of these communities. On the basis of average amino acid identity, comparative genome analysis of >1,000 Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides (eastern cottonwood) trees resulted in consistent and robust genomic clusters with phylogenetic homogeneity. All Pseudomonas aeruginosa genomes clustered together, and these were clearly distinct from other Pseudomonas species groups on the basis of pangenome and core genome analyses. In contrast, the genomes of Pseudomonas fluorescens were organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. Most of our 21 Populus-associated isolates formed three distinct subgroups within the major P. fluorescens group, supported by pathway profile analysis, while two isolates were more closely related to Pseudomonas chlororaphis and Pseudomonas putida. Genes specific to Populus-associated subgroups were identified. Genes specific to subgroup 1 include several sensory systems that act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor. Genes specific to subgroup 2 contain hypothetical genes, and genes specific to subgroup 3 were annotated with hydrolase activity. This study justifies the need to sequence multiple isolates, especially from P. fluorescens, which displays the most genetic variation, in order to study functional capabilities from a pangenomic perspective. This information will prove useful when choosing Pseudomonas strains for use to promote growth and increase disease resistance in plants. PMID:26519390

  15. Genomic diversity of drug-resistant Mycobacterium tuberculosis isolates in Lisbon Portugal: Towards tuberculosis genomic epidemiology

    KAUST Repository

    Perdigão, João

    2015-03-01

    Multidrug- (MDR) and extensively drug-resistant (XDR) tuberculosis (TB) present a challenge to disease control and elimination goals. Lisbon, Portugal, has a high TB incidence rate and unusual and successful XDR-TB strains that have been found in circulation for almost two decades. For the last 20. years, a continued circulation of two phylogenetic clades, Lisboa3 and Q1, which are highly associated with MDR and XDR, have been observed. In recent years, these strains have been well characterized regarding the molecular basis of drug resistance and have been inclusively subjected to whole genome sequencing (WGS). Researchers have been studying the genomic diversity of strains circulating in Lisbon and its genomic determinants through cutting-edge next generation sequencing. An enormous amount of whole genome sequence data are now available for the most prevalent and clinically relevant strains circulating in Lisbon.It is the persistence, prevalence and rapid evolution towards drug resistance that has prompted researchers to investigate the properties of these strains at the genomic level and in the future at a global transcriptomic level. Seventy Mycobacterium tuberculosis (MTB) isolates, mostly recovered in Lisbon, were genotyped by 24-. loci Mycobacterial Interspersed Repetitive Unit - Variable Number of Tandem Repeats (MIRU-VNTR) and the genomes sequenced using a next generation sequencing platform - Illumina HiSeq 2000.The genotyping data revealed three major clusters associated with MDR-TB (Lisboa3-A, Lisboa3-B and Q1), two of which are associated with XDR-TB (Lisboa3-B and Q1), whilst the genomic data contributed to elucidating the phylogenetic positioning of circulating MDR-TB strains, showing a high predominance of a single SNP cluster group 5. Furthermore, a genome-wide phylogeny analysis from these strains, together with 19 publicly available genomes of MTB clinical isolates, revealed two major clades responsible for MDR/XDR-TB in the region: Lisboa3 and Q

  16. Comparative genomics reveals diversity among xanthomonads infecting tomato and pepper

    LENUS (Irish Health Repository)

    Potnis, Neha

    2011-03-11

    Abstract Background Bacterial spot of tomato and pepper is caused by four Xanthomonas species and is a major plant disease in warm humid climates. The four species are distinct from each other based on physiological and molecular characteristics. The genome sequence of strain 85-10, a member of one of the species, Xanthomonas euvesicatoria (Xcv) has been previously reported. To determine the relationship of the four species at the genome level and to investigate the molecular basis of their virulence and differing host ranges, draft genomic sequences of members of the other three species were determined and compared to strain 85-10. Results We sequenced the genomes of X. vesicatoria (Xv) strain 1111 (ATCC 35937), X. perforans (Xp) strain 91-118 and X. gardneri (Xg) strain 101 (ATCC 19865). The genomes were compared with each other and with the previously sequenced Xcv strain 85-10. In addition, the molecular features were predicted that may be required for pathogenicity including the type III secretion apparatus, type III effectors, other secretion systems, quorum sensing systems, adhesins, extracellular polysaccharide, and lipopolysaccharide determinants. Several novel type III effectors from Xg strain 101 and Xv strain 1111 genomes were computationally identified and their translocation was validated using a reporter gene assay. A homolog to Ax21, the elicitor of XA21-mediated resistance in rice, and a functional Ax21 sulfation system were identified in Xcv. Genes encoding proteins with functions mediated by type II and type IV secretion systems have also been compared, including enzymes involved in cell wall deconstruction, as contributors to pathogenicity. Conclusions Comparative genomic analyses revealed considerable diversity among bacterial spot pathogens, providing new insights into differences and similarities that may explain the diverse nature of these strains. Genes specific to pepper pathogens, such as the O-antigen of the lipopolysaccharide cluster

  17. Comparative genomics reveals diversity among xanthomonads infecting tomato and pepper

    Directory of Open Access Journals (Sweden)

    Koebnik Ralf

    2011-03-01

    Full Text Available Abstract Background Bacterial spot of tomato and pepper is caused by four Xanthomonas species and is a major plant disease in warm humid climates. The four species are distinct from each other based on physiological and molecular characteristics. The genome sequence of strain 85-10, a member of one of the species, Xanthomonas euvesicatoria (Xcv has been previously reported. To determine the relationship of the four species at the genome level and to investigate the molecular basis of their virulence and differing host ranges, draft genomic sequences of members of the other three species were determined and compared to strain 85-10. Results We sequenced the genomes of X. vesicatoria (Xv strain 1111 (ATCC 35937, X. perforans (Xp strain 91-118 and X. gardneri (Xg strain 101 (ATCC 19865. The genomes were compared with each other and with the previously sequenced Xcv strain 85-10. In addition, the molecular features were predicted that may be required for pathogenicity including the type III secretion apparatus, type III effectors, other secretion systems, quorum sensing systems, adhesins, extracellular polysaccharide, and lipopolysaccharide determinants. Several novel type III effectors from Xg strain 101 and Xv strain 1111 genomes were computationally identified and their translocation was validated using a reporter gene assay. A homolog to Ax21, the elicitor of XA21-mediated resistance in rice, and a functional Ax21 sulfation system were identified in Xcv. Genes encoding proteins with functions mediated by type II and type IV secretion systems have also been compared, including enzymes involved in cell wall deconstruction, as contributors to pathogenicity. Conclusions Comparative genomic analyses revealed considerable diversity among bacterial spot pathogens, providing new insights into differences and similarities that may explain the diverse nature of these strains. Genes specific to pepper pathogens, such as the O-antigen of the

  18. Diversity and Evolution in the Genome of Clostridium difficile.

    Science.gov (United States)

    Knight, Daniel R; Elliott, Briony; Chang, Barbara J; Perkins, Timothy T; Riley, Thomas V

    2015-07-01

    Clostridium difficile infection (CDI) is the leading cause of antimicrobial and health care-associated diarrhea in humans, presenting a significant burden to global health care systems. In the last 2 decades, PCR- and sequence-based techniques, particularly whole-genome sequencing (WGS), have significantly furthered our knowledge of the genetic diversity, evolution, epidemiology, and pathogenicity of this once enigmatic pathogen. C. difficile is taxonomically distinct from many other well-known clostridia, with a diverse population structure comprising hundreds of strain types spread across at least 6 phylogenetic clades. The C. difficile species is defined by a large diverse pangenome with extreme levels of evolutionary plasticity that has been shaped over long time periods by gene flux and recombination, often between divergent lineages. These evolutionary events are in response to environmental and anthropogenic activities and have led to the rapid emergence and worldwide dissemination of virulent clonal lineages. Moreover, genome analysis of large clinically relevant data sets has improved our understanding of CDI outbreaks, transmission, and recurrence. The epidemiology of CDI has changed dramatically over the last 15 years, and CDI may have a foodborne or zoonotic etiology. The WGS era promises to continue to redefine our view of this significant pathogen. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  19. Patterns of genome size diversity in bats (order Chiroptera).

    Science.gov (United States)

    Smith, Jillian D L; Bickham, John W; Gregory, T Ryan

    2013-08-01

    Despite being a group of particular interest in considering relationships between genome size and metabolic parameters, bats have not been well studied from this perspective. This study presents new estimates for 121 "microbat" species from 12 families and complements a previous study on members of the family Pteropodidae ("megabats"). The results confirm that diversity in genome size in bats is very limited even compared with other mammals, varying approximately 2-fold from 1.63 pg in Lophostoma carrikeri to 3.17 pg in Rhinopoma hardwickii and averaging only 2.35 pg ± 0.02 SE (versus 3.5 pg overall for mammals). However, contrary to some other vertebrate groups, and perhaps owing to the narrow range observed, genome size correlations were not apparent with any chromosomal, physiological, flight-related, developmental, or ecological characteristics within the order Chiroptera. Genome size is positively correlated with measures of body size in bats, though the strength of the relationships differs between pteropodids ("megabats") and nonpteropodids ("microbats").

  20. Exceptionally diverse morphotypes and genomes of crenarchaeal hyperthermophilic viruses

    DEFF Research Database (Denmark)

    Prangishvili, D; Garrett, R A

    2004-01-01

    The remarkable diversity of the morphologies of viruses found in terrestrial hydrothermal environments with temperatures >80 degrees C is unprecedented for aquatic ecosystems. The best-studied viruses from these habitats have been assigned to novel viral families: Fuselloviridae, Lipothrixviridae...... no significant matches to sequences in public databases. This suggests that these hyperthermophilic viruses have exceptional biochemical solutions for biological functions. Specific features of genome organization, as well as strategies for DNA replication, suggest that phylogenetic relationships exist between...... crenarchaeal rudiviruses and the large eukaryal DNA viruses: poxviruses, the African swine fever virus and Chlorella viruses. Sequence patterns at the ends of the linear genome of the lipothrixvirus AFV1 are reminiscent of the telomeric ends of linear eukaryal chromosomes and suggest that a primitive telomeric...

  1. Genomes, diversity and resistance gene analogues in Musa species.

    Science.gov (United States)

    Azhar, M; Heslop-Harrison, J S

    2008-01-01

    Resistance genes (R genes) in plants are abundant and may represent more than 1% of all the genes. Their diversity is critical to the recognition and response to attack from diverse pathogens. Like many other crops, banana and plantain face attacks from potentially devastating fungal and bacterial diseases, increased by a combination of worldwide spread of pathogens, exploitation of a small number of varieties, new pathogen mutations, and the lack of effective, benign and cheap chemical control. The challenge for plant breeders is to identify and exploit genetic resistances to diseases, which is particularly difficult in banana and plantain where the valuable cultivars are sterile, parthenocarpic and mostly triploid so conventional genetic analysis and breeding is impossible. In this paper, we review the nature of R genes and the key motifs, particularly in the Nucleotide Binding Sites (NBS), Leucine Rich Repeat (LRR) gene class. We present data about identity, nature and evolutionary diversity of the NBS domains of Musa R genes in diploid wild species with the Musa acuminata (A), M. balbisiana (B), M. schizocarpa (S), M. textilis (T), M. velutina and M. ornata genomes, and from various cultivated hybrid and triploid accessions, using PCR primers to isolate the domains from genomic DNA. Of 135 new sequences, 75% of the sequenced clones had uninterrupted open reading frames (ORFs), and phylogenetic UPGMA tree construction showed four clusters, one from Musa ornata, one largely from the B and T genomes, one from A and M. velutina, and the largest with A, B, T and S genomes. Only genes of the coiled-coil (non-TIR) class were found, typical of the grasses and presumably monocotyledons. The analysis of R genes in cultivated banana and plantain, and their wild relatives, has implications for identification and selection of resistance genes within the genus which may be useful for plant selection and breeding and also for defining relationships and genome evolution

  2. Metabolic Genes within Cyanophage Genomes: Implications for Diversity and Evolution

    Directory of Open Access Journals (Sweden)

    E-Bin Gao

    2016-09-01

    Full Text Available Cyanophages, a group of viruses specifically infecting cyanobacteria, are genetically diverse and extensively abundant in water environments. As a result of selective pressure, cyanophages often acquire a range of metabolic genes from host genomes. The host-derived genes make a significant contribution to the ecological success of cyanophages. In this review, we summarize the host-derived metabolic genes, as well as their origin and roles in cyanophage evolution and important host metabolic pathways, such as the light-dependent reactions of photosynthesis, the pentose phosphate pathway, nutrient acquisition and nucleotide biosynthesis. We also discuss the suitability of the host-derived metabolic genes as potential diagnostic markers for the detection of genetic diversity of cyanophages in natural environments.

  3. Genetics, Genomics and Evolution of Ergot Alkaloid Diversity

    Directory of Open Access Journals (Sweden)

    Carolyn A. Young

    2015-04-01

    Full Text Available The ergot alkaloid biosynthesis system has become an excellent model to study evolutionary diversification of specialized (secondary metabolites. This is a very diverse class of alkaloids with various neurotropic activities, produced by fungi in several orders of the phylum Ascomycota, including plant pathogens and protective plant symbionts in the family Clavicipitaceae. Results of comparative genomics and phylogenomic analyses reveal multiple examples of three evolutionary processes that have generated ergot-alkaloid diversity: gene gains, gene losses, and gene sequence changes that have led to altered substrates or product specificities of the enzymes that they encode (neofunctionalization. The chromosome ends appear to be particularly effective engines for gene gains, losses and rearrangements, but not necessarily for neofunctionalization. Changes in gene expression could lead to accumulation of various pathway intermediates and affect levels of different ergot alkaloids. Genetic alterations associated with interspecific hybrids of Epichloë species suggest that such variation is also selectively favored. The huge structural diversity of ergot alkaloids probably represents adaptations to a wide variety of ecological situations by affecting the biological spectra and mechanisms of defense against herbivores, as evidenced by the diverse pharmacological effects of ergot alkaloids used in medicine.

  4. Pervasive, Genome-Wide Transcription in the Organelle Genomes of Diverse Plastid-Bearing Protists

    Directory of Open Access Journals (Sweden)

    Matheus Sanitá Lima

    2017-11-01

    Full Text Available Organelle genomes are among the most sequenced kinds of chromosome. This is largely because they are small and widely used in molecular studies, but also because next-generation sequencing technologies made sequencing easier, faster, and cheaper. However, studies of organelle RNA have not kept pace with those of DNA, despite huge amounts of freely available eukaryotic RNA-sequencing (RNA-seq data. Little is known about organelle transcription in nonmodel species, and most of the available eukaryotic RNA-seq data have not been mined for organelle transcripts. Here, we use publicly available RNA-seq experiments to investigate organelle transcription in 30 diverse plastid-bearing protists with varying organelle genomic architectures. Mapping RNA-seq data to organelle genomes revealed pervasive, genome-wide transcription, regardless of the taxonomic grouping, gene organization, or noncoding content. For every species analyzed, transcripts covered ≥85% of the mitochondrial and/or plastid genomes (all of which were ≤105 kb, indicating that most of the organelle DNA—coding and noncoding—is transcriptionally active. These results follow earlier studies of model species showing that organellar transcription is coupled and ubiquitous across the genome, requiring significant downstream processing of polycistronic transcripts. Our findings suggest that noncoding organelle DNA can be transcriptionally active, raising questions about the underlying function of these transcripts and underscoring the utility of publicly available RNA-seq data for recovering complete genome sequences. If pervasive transcription is also found in bigger organelle genomes (>105 kb and across a broader range of eukaryotes, this could indicate that noncoding organelle RNAs are regulating fundamental processes within eukaryotic cells.

  5. A Glimpse of the genomic diversity of haloarchaeal tailed viruses

    Directory of Open Access Journals (Sweden)

    Ana eSencilo

    2014-03-01

    Full Text Available Tailed viruses are the most common isolates infecting prokaryotic hosts residing hypersaline environments. Archaeal tailed viruses represent only a small portion of all characterized tailed viruses of prokaryotes. But even this small dataset revealed that archaeal tailed viruses have many similarities to their counterparts infecting bacteria, the bacteriophages. Shared functional homologues and similar genome organizations suggested that all microbial tailed viruses have common virion architectural and assembly principles. Recent structural studies have provided evidence justifying this thereby grouping archaeal and bacterial tailed viruses into a single lineage. Currently there are 17 haloarchaeal tailed viruses with entirely sequenced genomes. Nine viruses have at least one close relative among the 17 viruses and, according to the similarities, can be divided into three groups. Two other viruses share some homologues and therefore are distantly related, whereas the rest of the viruses are rather divergent (or singletons. Comparative genomics analysis of these viruses offers a glimpse into the genetic diversity and structure of haloarchaeal tailed virus communities.

  6. Phenotypic Heterogeneity of Genomically-Diverse Isolates of Streptococcus mutans

    Science.gov (United States)

    Palmer, Sara R.; Miller, James H.; Abranches, Jacqueline; Zeng, Lin; Lefebure, Tristan; Richards, Vincent P.; Lemos, José A.; Stanhope, Michael J.; Burne, Robert A.

    2013-01-01

    High coverage, whole genome shotgun (WGS) sequencing of 57 geographically- and genetically-diverse isolates of Streptococcus mutans from individuals of known dental caries status was recently completed. Of the 57 sequenced strains, fifteen isolates, were selected based primarily on differences in gene content and phenotypic characteristics known to affect virulence and compared with the reference strain UA159. A high degree of variability in these properties was observed between strains, with a broad spectrum of sensitivities to low pH, oxidative stress (air and paraquat) and exposure to competence stimulating peptide (CSP). Significant differences in autolytic behavior and in biofilm development in glucose or sucrose were also observed. Natural genetic competence varied among isolates, and this was correlated to the presence or absence of competence genes, comCDE and comX, and to bacteriocins. In general strains that lacked the ability to become competent possessed fewer genes for bacteriocins and immunity proteins or contained polymorphic variants of these genes. WGS sequence analysis of the pan-genome revealed, for the first time, components of a Type VII secretion system in several S. mutans strains, as well as two putative ORFs that encode possible collagen binding proteins located upstream of the cnm gene, which is associated with host cell invasiveness. The virulence of these particular strains was assessed in a wax-worm model. This is the first study to combine a comprehensive analysis of key virulence-related phenotypes with extensive genomic analysis of a pathogen that evolved closely with humans. Our analysis highlights the phenotypic diversity of S. mutans isolates and indicates that the species has evolved a variety of adaptive strategies to persist in the human oral cavity and, when conditions are favorable, to initiate disease. PMID:23613838

  7. Phenotypic heterogeneity of genomically-diverse isolates of Streptococcus mutans.

    Directory of Open Access Journals (Sweden)

    Sara R Palmer

    Full Text Available High coverage, whole genome shotgun (WGS sequencing of 57 geographically- and genetically-diverse isolates of Streptococcus mutans from individuals of known dental caries status was recently completed. Of the 57 sequenced strains, fifteen isolates, were selected based primarily on differences in gene content and phenotypic characteristics known to affect virulence and compared with the reference strain UA159. A high degree of variability in these properties was observed between strains, with a broad spectrum of sensitivities to low pH, oxidative stress (air and paraquat and exposure to competence stimulating peptide (CSP. Significant differences in autolytic behavior and in biofilm development in glucose or sucrose were also observed. Natural genetic competence varied among isolates, and this was correlated to the presence or absence of competence genes, comCDE and comX, and to bacteriocins. In general strains that lacked the ability to become competent possessed fewer genes for bacteriocins and immunity proteins or contained polymorphic variants of these genes. WGS sequence analysis of the pan-genome revealed, for the first time, components of a Type VII secretion system in several S. mutans strains, as well as two putative ORFs that encode possible collagen binding proteins located upstream of the cnm gene, which is associated with host cell invasiveness. The virulence of these particular strains was assessed in a wax-worm model. This is the first study to combine a comprehensive analysis of key virulence-related phenotypes with extensive genomic analysis of a pathogen that evolved closely with humans. Our analysis highlights the phenotypic diversity of S. mutans isolates and indicates that the species has evolved a variety of adaptive strategies to persist in the human oral cavity and, when conditions are favorable, to initiate disease.

  8. Genome Size Diversity and Its Impact on the Evolution of Land Plants

    Directory of Open Access Journals (Sweden)

    Jaume Pellicer

    2018-02-01

    Full Text Available Genome size is a biodiversity trait that shows staggering diversity across eukaryotes, varying over 64,000-fold. Of all major taxonomic groups, land plants stand out due to their staggering genome size diversity, ranging ca. 2400-fold. As our understanding of the implications and significance of this remarkable genome size diversity in land plants grows, it is becoming increasingly evident that this trait plays not only an important role in shaping the evolution of plant genomes, but also in influencing plant community assemblages at the ecosystem level. Recent advances and improvements in novel sequencing technologies, as well as analytical tools, make it possible to gain critical insights into the genomic and epigenetic mechanisms underpinning genome size changes. In this review we provide an overview of our current understanding of genome size diversity across the different land plant groups, its implications on the biology of the genome and what future directions need to be addressed to fill key knowledge gaps.

  9. Integrated analysis of whole genome and transcriptome sequencing reveals diverse transcriptomic aberrations driven by somatic genomic changes in liver cancers.

    Directory of Open Access Journals (Sweden)

    Yuichi Shiraishi

    Full Text Available Recent studies applying high-throughput sequencing technologies have identified several recurrently mutated genes and pathways in multiple cancer genomes. However, transcriptional consequences from these genomic alterations in cancer genome remain unclear. In this study, we performed integrated and comparative analyses of whole genomes and transcriptomes of 22 hepatitis B virus (HBV-related hepatocellular carcinomas (HCCs and their matched controls. Comparison of whole genome sequence (WGS and RNA-Seq revealed much evidence that various types of genomic mutations triggered diverse transcriptional changes. Not only splice-site mutations, but also silent mutations in coding regions, deep intronic mutations and structural changes caused splicing aberrations. HBV integrations generated diverse patterns of virus-human fusion transcripts depending on affected gene, such as TERT, CDK15, FN1 and MLL4. Structural variations could drive over-expression of genes such as WNT ligands, with/without creating gene fusions. Furthermore, by taking account of genomic mutations causing transcriptional aberrations, we could improve the sensitivity of deleterious mutation detection in known cancer driver genes (TP53, AXIN1, ARID2, RPS6KA3, and identified recurrent disruptions in putative cancer driver genes such as HNF4A, CPS1, TSC1 and THRAP3 in HCCs. These findings indicate genomic alterations in cancer genome have diverse transcriptomic effects, and integrated analysis of WGS and RNA-Seq can facilitate the interpretation of a large number of genomic alterations detected in cancer genome.

  10. Karyotype diversity and genome size variation in Neotropical Maxillariinae orchids.

    Science.gov (United States)

    Moraes, A P; Koehler, S; Cabral, J S; Gomes, S S L; Viccini, L F; Barros, F; Felix, L P; Guerra, M; Forni-Martins, E R

    2017-03-01

    Orchidaceae is a widely distributed plant family with very diverse vegetative and floral morphology, and such variability is also reflected in their karyotypes. However, since only a low proportion of Orchidaceae has been analysed for chromosome data, greater diversity may await to be unveiled. Here we analyse both genome size (GS) and karyotype in two subtribes recently included in the broadened Maxillariinea to detect how much chromosome and GS variation there is in these groups and to evaluate which genome rearrangements are involved in the species evolution. To do so, the GS (14 species), the karyotype - based on chromosome number, heterochromatic banding and 5S and 45S rDNA localisation (18 species) - was characterised and analysed along with published data using phylogenetic approaches. The GS presented a high phylogenetic correlation and it was related to morphological groups in Bifrenaria (larger plants - higher GS). The two largest GS found among genera were caused by different mechanisms: polyploidy in Bifrenaria tyrianthina and accumulation of repetitive DNA in Scuticaria hadwenii. The chromosome number variability was caused mainly through descending dysploidy, and x=20 was estimated as the base chromosome number. Combining GS and karyotype data with molecular phylogeny, our data provide a more complete scenario of the karyotype evolution in Maxillariinae orchids, allowing us to suggest, besides dysploidy, that inversions and transposable elements as two mechanisms involved in the karyotype evolution. Such karyotype modifications could be associated with niche changes that occurred during species evolution. © 2016 German Botanical Society and The Royal Botanical Society of the Netherlands.

  11. The Human Genome Diversity (HGD) Project. Summary document

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1993-12-31

    In 1991 a group of human geneticists and molecular biologists proposed to the scientific community that a world wide survey be undertaken of variation in the human genome. To aid their considerations, the committee therefore decided to hold a small series of international workshops to explore the major scientific issues involved. The intention was to define a framework for the project which could provide a basis for much wider and more detailed discussion and planning--it was recognized that the successful implementation of the proposed project, which has come to be known as the Human Genome Diversity (HGD) Project, would not only involve scientists but also various national and international non-scientific groups all of which should contribute to the project`s development. The international HGD workshop held in Sardinia in September 1993 was the last in the initial series of planning workshops. As such it not only explored new ground but also pulled together into a more coherent form much of the formal and informal discussion that had taken place in the preceding two years. This report presents the deliberations of the Sardinia workshop within a consideration of the overall development of the HGD Project to date.

  12. The genome diversity and karyotype evolution of mammals

    Directory of Open Access Journals (Sweden)

    Trifonov Vladimir A

    2011-10-01

    Full Text Available Abstract The past decade has witnessed an explosion of genome sequencing and mapping in evolutionary diverse species. While full genome sequencing of mammals is rapidly progressing, the ability to assemble and align orthologous whole chromosome regions from more than a few species is still not possible. The intense focus on building of comparative maps for companion (dog and cat, laboratory (mice and rat and agricultural (cattle, pig, and horse animals has traditionally been used as a means to understand the underlying basis of disease-related or economically important phenotypes. However, these maps also provide an unprecedented opportunity to use multispecies analysis as a tool for inferring karyotype evolution. Comparative chromosome painting and related techniques are now considered to be the most powerful approaches in comparative genome studies. Homologies can be identified with high accuracy using molecularly defined DNA probes for fluorescence in situ hybridization (FISH on chromosomes of different species. Chromosome painting data are now available for members of nearly all mammalian orders. In most orders, there are species with rates of chromosome evolution that can be considered as 'default' rates. The number of rearrangements that have become fixed in evolutionary history seems comparatively low, bearing in mind the 180 million years of the mammalian radiation. Comparative chromosome maps record the history of karyotype changes that have occurred during evolution. The aim of this review is to provide an overview of these recent advances in our endeavor to decipher the karyotype evolution of mammals by integrating the published results together with some of our latest unpublished results.

  13. Genetic diversity and trait genomic prediction in a pea diversity panel.

    Science.gov (United States)

    Burstin, Judith; Salloignon, Pauline; Chabert-Martinello, Marianne; Magnin-Robert, Jean-Bernard; Siol, Mathieu; Jacquin, Françoise; Chauveau, Aurélie; Pont, Caroline; Aubert, Grégoire; Delaitre, Catherine; Truntzer, Caroline; Duc, Gérard

    2015-02-21

    Pea (Pisum sativum L.), a major pulse crop grown for its protein-rich seeds, is an important component of agroecological cropping systems in diverse regions of the world. New breeding challenges imposed by global climate change and new regulations urge pea breeders to undertake more efficient methods of selection and better take advantage of the large genetic diversity present in the Pisum sativum genepool. Diversity studies conducted so far in pea used Simple Sequence Repeat (SSR) and Retrotransposon Based Insertion Polymorphism (RBIP) markers. Recently, SNP marker panels have been developed that will be useful for genetic diversity assessment and marker-assisted selection. A collection of diverse pea accessions, including landraces and cultivars of garden, field or fodder peas as well as wild peas was characterised at the molecular level using newly developed SNP markers, as well as SSR markers and RBIP markers. The three types of markers were used to describe the structure of the collection and revealed different pictures of the genetic diversity among the collection. SSR showed the fastest rate of evolution and RBIP the slowest rate of evolution, pointing to their contrasted mode of evolution. SNP markers were then used to predict phenotypes -the date of flowering (BegFlo), the number of seeds per plant (Nseed) and thousand seed weight (TSW)- that were recorded for the collection. Different statistical methods were tested including the LASSO (Least Absolute Shrinkage ans Selection Operator), PLS (Partial Least Squares), SPLS (Sparse Partial Least Squares), Bayes A, Bayes B and GBLUP (Genomic Best Linear Unbiased Prediction) methods and the structure of the collection was taken into account in the prediction. Despite a limited number of 331 markers used for prediction, TSW was reliably predicted. The development of marker assisted selection has not reached its full potential in pea until now. This paper shows that the high-throughput SNP arrays that are being

  14. The high-quality draft genome of peach (Prunus persica) identifies unique patterns of genetic diversity, domestication and genome evolution.

    Science.gov (United States)

    Verde, Ignazio; Abbott, Albert G; Scalabrin, Simone; Jung, Sook; Shu, Shengqiang; Marroni, Fabio; Zhebentyayeva, Tatyana; Dettori, Maria Teresa; Grimwood, Jane; Cattonaro, Federica; Zuccolo, Andrea; Rossini, Laura; Jenkins, Jerry; Vendramin, Elisa; Meisel, Lee A; Decroocq, Veronique; Sosinski, Bryon; Prochnik, Simon; Mitros, Therese; Policriti, Alberto; Cipriani, Guido; Dondini, Luca; Ficklin, Stephen; Goodstein, David M; Xuan, Pengfei; Del Fabbro, Cristian; Aramini, Valeria; Copetti, Dario; Gonzalez, Susana; Horner, David S; Falchi, Rachele; Lucas, Susan; Mica, Erica; Maldonado, Jonathan; Lazzari, Barbara; Bielenberg, Douglas; Pirona, Raul; Miculan, Mara; Barakat, Abdelali; Testolin, Raffaele; Stella, Alessandra; Tartarini, Stefano; Tonutti, Pietro; Arús, Pere; Orellana, Ariel; Wells, Christina; Main, Dorrie; Vizzotto, Giannina; Silva, Herman; Salamini, Francesco; Schmutz, Jeremy; Morgante, Michele; Rokhsar, Daniel S

    2013-05-01

    Rosaceae is the most important fruit-producing clade, and its key commercially relevant genera (Fragaria, Rosa, Rubus and Prunus) show broadly diverse growth habits, fruit types and compact diploid genomes. Peach, a diploid Prunus species, is one of the best genetically characterized deciduous trees. Here we describe the high-quality genome sequence of peach obtained from a completely homozygous genotype. We obtained a complete chromosome-scale assembly using Sanger whole-genome shotgun methods. We predicted 27,852 protein-coding genes, as well as noncoding RNAs. We investigated the path of peach domestication through whole-genome resequencing of 14 Prunus accessions. The analyses suggest major genetic bottlenecks that have substantially shaped peach genome diversity. Furthermore, comparative analyses showed that peach has not undergone recent whole-genome duplication, and even though the ancestral triplicated blocks in peach are fragmentary compared to those in grape, all seven paleosets of paralogs from the putative paleoancestor are detectable.

  15. The Great Migration and African-American Genomic Diversity.

    Directory of Open Access Journals (Sweden)

    Soheil Baharian

    2016-05-01

    Full Text Available We present a comprehensive assessment of genomic diversity in the African-American population by studying three genotyped cohorts comprising 3,726 African-Americans from across the United States that provide a representative description of the population across all US states and socioeconomic status. An estimated 82.1% of ancestors to African-Americans lived in Africa prior to the advent of transatlantic travel, 16.7% in Europe, and 1.2% in the Americas, with increased African ancestry in the southern United States compared to the North and West. Combining demographic models of ancestry and those of relatedness suggests that admixture occurred predominantly in the South prior to the Civil War and that ancestry-biased migration is responsible for regional differences in ancestry. We find that recent migrations also caused a strong increase in genetic relatedness among geographically distant African-Americans. Long-range relatedness among African-Americans and between African-Americans and European-Americans thus track north- and west-bound migration routes followed during the Great Migration of the twentieth century. By contrast, short-range relatedness patterns suggest comparable mobility of ∼15-16km per generation for African-Americans and European-Americans, as estimated using a novel analytical model of isolation-by-distance.

  16. The Great Migration and African-American Genomic Diversity.

    Science.gov (United States)

    Baharian, Soheil; Barakatt, Maxime; Gignoux, Christopher R; Shringarpure, Suyash; Errington, Jacob; Blot, William J; Bustamante, Carlos D; Kenny, Eimear E; Williams, Scott M; Aldrich, Melinda C; Gravel, Simon

    2016-05-01

    We present a comprehensive assessment of genomic diversity in the African-American population by studying three genotyped cohorts comprising 3,726 African-Americans from across the United States that provide a representative description of the population across all US states and socioeconomic status. An estimated 82.1% of ancestors to African-Americans lived in Africa prior to the advent of transatlantic travel, 16.7% in Europe, and 1.2% in the Americas, with increased African ancestry in the southern United States compared to the North and West. Combining demographic models of ancestry and those of relatedness suggests that admixture occurred predominantly in the South prior to the Civil War and that ancestry-biased migration is responsible for regional differences in ancestry. We find that recent migrations also caused a strong increase in genetic relatedness among geographically distant African-Americans. Long-range relatedness among African-Americans and between African-Americans and European-Americans thus track north- and west-bound migration routes followed during the Great Migration of the twentieth century. By contrast, short-range relatedness patterns suggest comparable mobility of ∼15-16km per generation for African-Americans and European-Americans, as estimated using a novel analytical model of isolation-by-distance.

  17. Reconstruction of Diverse Verrucomicrobial Genomes from Metagenome Datasets of Freshwater Reservoirs

    Directory of Open Access Journals (Sweden)

    Pedro J. Cabello-Yeves

    2017-11-01

    Full Text Available The phylum Verrucomicrobia contains freshwater representatives which remain poorly studied at the genomic, taxonomic, and ecological levels. In this work we present eighteen new reconstructed verrucomicrobial genomes from two freshwater reservoirs located close to each other (Tous and Amadorio, Spain. These metagenome-assembled genomes (MAGs display a remarkable taxonomic diversity inside the phylum and comprise wide ranges of estimated genome sizes (from 1.8 to 6 Mb. Among all Verrucomicrobia studied we found some of the smallest genomes of the Spartobacteria and Opitutae classes described so far. Some of the Opitutae family MAGs were small, cosmopolitan, with a general heterotrophic metabolism with preference for carbohydrates, and capable of xylan, chitin, or cellulose degradation. Besides, we assembled large copiotroph genomes, which contain a higher number of transporters, polysaccharide degrading pathways and in general more strategies for the uptake of nutrients and carbohydrate-based metabolic pathways in comparison with the representatives with the smaller genomes. The diverse genomes revealed interesting features like green-light absorbing rhodopsins and a complete set of genes involved in nitrogen fixation. The large diversity in genome sizes and physiological properties emphasize the diversity of this clade in freshwaters enlarging even further the already broad eco-physiological range of these microbes.

  18. Maize inbreds exhibit high levels of copy number variation (CNV) and presence/absence variation (PAV) in genome content.

    Science.gov (United States)

    Springer, Nathan M; Ying, Kai; Fu, Yan; Ji, Tieming; Yeh, Cheng-Ting; Jia, Yi; Wu, Wei; Richmond, Todd; Kitzman, Jacob; Rosenbaum, Heidi; Iniguez, A Leonardo; Barbazuk, W Brad; Jeddeloh, Jeffrey A; Nettleton, Daniel; Schnable, Patrick S

    2009-11-01

    Following the domestication of maize over the past approximately 10,000 years, breeders have exploited the extensive genetic diversity of this species to mold its phenotype to meet human needs. The extent of structural variation, including copy number variation (CNV) and presence/absence variation (PAV), which are thought to contribute to the extraordinary phenotypic diversity and plasticity of this important crop, have not been elucidated. Whole-genome, array-based, comparative genomic hybridization (CGH) revealed a level of structural diversity between the inbred lines B73 and Mo17 that is unprecedented among higher eukaryotes. A detailed analysis of altered segments of DNA conservatively estimates that there are several hundred CNV sequences among the two genotypes, as well as several thousand PAV sequences that are present in B73 but not Mo17. Haplotype-specific PAVs contain hundreds of single-copy, expressed genes that may contribute to heterosis and to the extraordinary phenotypic diversity of this important crop.

  19. Comparative assessment of genetic diversity in cytoplasmic and nuclear genome of upland cotton.

    Science.gov (United States)

    Egamberdiev, Sharof S; Saha, Sukumar; Salakhutdinov, Ilkhom; Jenkins, Johnie N; Deng, Dewayne; Y Abdurakhmonov, Ibrokhim

    2016-06-01

    The importance of the cytoplasmic genome for many economically important traits is well documented in several crop species, including cotton. There is no report on application of cotton chloroplast specific SSR markers as a diagnostic tool to study genetic diversity among improved Upland cotton lines. The complete plastome sequence information in GenBank provided us an opportunity to report on 17 chloroplast specific SSR markers using a cost-effective data mining strategy. Here we report the comparative analysis of genetic diversity among a set of 42 improved Upland cotton lines using SSR markers specific to chloroplast and nuclear genome, respectively. Our results revealed that low to moderate level of genetic diversity existed in both nuclear and cytoplasm genome among this set of cotton lines. However, the specific estimation suggested that genetic diversity is lower in cytoplasmic genome compared to the nuclear genome among this set of Upland cotton lines. In summary, this research is important from several perspectives. We detected a set of cytoplasm genome specific SSR primer pairs by using a cost-effective data mining strategy. We reported for the first time the genetic diversity in the cytoplasmic genome within a set of improved Upland cotton accessions. Results revealed that the genetic diversity in cytoplasmic genome is narrow, compared to the nuclear genome within this set of Upland cotton accessions. Our results suggested that most of these polymorphic chloroplast SSRs would be a valuable complementary tool in addition to the nuclear SSR in the study of evolution, gene flow and genetic diversity in Upland cotton.

  20. Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes

    Energy Technology Data Exchange (ETDEWEB)

    Ohm, Robin A.; Feau, Nicolas; Henrissat, Bernard; Schoch, Conrad L.; Horwitz, Benjamin A.; Barry, Kerrie W.; Condon, Bradford J.; Copeland, Alex C.; Dhillon, Braham; Glaser, Fabian; Hesse, Cedar N.; Kosti, Idit; LaButti, Kurt; Lindquist, Erika A.; Lucas, Susan; Salamov, Asaf A.; Bradshaw, Rosie E.; Ciuffetti, Lynda; Hamelin, Richard C.; Kema, Gert H. J.; Lawrence, Christopher; Scott, James A.; Spatafora, Joseph W.; Turgeon, B. Gillian; de Wit, Pierre J. G. M.; Zhong, Shaobin; Goodwin, Stephen B.; Grigoriev, Igor V.

    2013-03-05

    The class of Dothideomycetes is one of the largest and most diverse groups of fungi. Many are plant pathogens and pose a serious threat to agricultural crops that are grown for biofuel, food or feed. Most Dothideomycetes have only a single host plant, and related species can have very diverse hosts. Eighteen genomes of Dothideomycetes have currently been sequenced by the Joint Genome Institute and other sequencing centers. Here we describe the results of comparative analyses of the fungi in this group.

  1. Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Doethideomycetes Fungi

    Energy Technology Data Exchange (ETDEWEB)

    Ohm, Robin A.; Feau, Nicolas; Henrissat, Bernard; Schoch, Conrad L.; Horwitz, Benjamin A.; Barry, Kerrie W.; Condon, Bradford J.; Copeland, Alex C.; Dhillon, Braham; Glaser, Fabien; Hesse, Cedar N.; Kosti, Idit; LaButti, Kurt; Lindquist, Erika A.; Lucas, Susan; Salamov, Asaf A.; Bradshaw, Rosie E.; Ciuffetti, Lynda; Hamelin, Richard C.; Kema, Gert H. J.; Lawrence, Christopher; Scott, James A.; Spatafora, Joseph W.; Turgeon, B. Gillian; de Wit, Pierre J. G. M.; Zhong, Shaobin; Goodwin, Stephen B.; Grigoriev, Igor V.

    2012-03-13

    The class of Dothideomycetes is one of the largest and most diverse groups of fungi. Many are plant pathogens and pose a serious threat to agricultural crops grown for biofuel, food or feed. Most Dothideomycetes have only a single host and related species can have very diverse host plants. Eighteen genomes of Dothideomycetes have currently been sequenced by the Joint Genome Institute and other sequencing centers. Here we describe the results of comparative analyses of the fungi in this group.

  2. Wild emmer genome architecture and diversity elucidate wheat evolution and domestication.

    Science.gov (United States)

    Avni, Raz; Nave, Moran; Barad, Omer; Baruch, Kobi; Twardziok, Sven O; Gundlach, Heidrun; Hale, Iago; Mascher, Martin; Spannagl, Manuel; Wiebe, Krystalee; Jordan, Katherine W; Golan, Guy; Deek, Jasline; Ben-Zvi, Batsheva; Ben-Zvi, Gil; Himmelbach, Axel; MacLachlan, Ron P; Sharpe, Andrew G; Fritz, Allan; Ben-David, Roi; Budak, Hikmet; Fahima, Tzion; Korol, Abraham; Faris, Justin D; Hernandez, Alvaro; Mikel, Mark A; Levy, Avraham A; Steffenson, Brian; Maccaferri, Marco; Tuberosa, Roberto; Cattivelli, Luigi; Faccioli, Primetta; Ceriotti, Aldo; Kashkush, Khalil; Pourkheirandish, Mohammad; Komatsuda, Takao; Eilam, Tamar; Sela, Hanan; Sharon, Amir; Ohad, Nir; Chamovitz, Daniel A; Mayer, Klaus F X; Stein, Nils; Ronen, Gil; Peleg, Zvi; Pozniak, Curtis J; Akhunov, Eduard D; Distelfeld, Assaf

    2017-07-07

    Wheat (Triticum spp.) is one of the founder crops that likely drove the Neolithic transition to sedentary agrarian societies in the Fertile Crescent more than 10,000 years ago. Identifying genetic modifications underlying wheat's domestication requires knowledge about the genome of its allo-tetraploid progenitor, wild emmer (T. turgidum ssp. dicoccoides). We report a 10.1-gigabase assembly of the 14 chromosomes of wild tetraploid wheat, as well as analyses of gene content, genome architecture, and genetic diversity. With this fully assembled polyploid wheat genome, we identified the causal mutations in Brittle Rachis 1 (TtBtr1) genes controlling shattering, a key domestication trait. A study of genomic diversity among wild and domesticated accessions revealed genomic regions bearing the signature of selection under domestication. This reference assembly will serve as a resource for accelerating the genome-assisted improvement of modern wheat varieties. Copyright © 2017, American Association for the Advancement of Science.

  3. Genomic diversity and evolution of the head crest in the rock pigeon

    DEFF Research Database (Denmark)

    Shapiro, Michael D.; Kronenberg, Zev; Li, Cai

    2013-01-01

    The geographic origins of breeds and the genetic basis of variation within the widely distributed and phenotypically diverse domestic rock pigeon (Columba livia) remain largely unknown. We generated a rock pigeon reference genome and additional genome sequences representing domestic and feral pop...

  4. Relationship between metabolic and genomic diversity in sesame (Sesamum indicum L.).

    Science.gov (United States)

    Laurentin, Hernán; Ratzinger, Astrid; Karlovsky, Petr

    2008-05-29

    Diversity estimates in cultivated plants provide a rationale for conservation strategies and support the selection of starting material for breeding programs. Diversity measures applied to crops usually have been limited to the assessment of genome polymorphism at the DNA level. Occasionally, selected morphological features are recorded and the content of key chemical constituents determined, but unbiased and comprehensive chemical phenotypes have not been included systematically in diversity surveys. Our objective in this study was to assess metabolic diversity in sesame by nontargeted metabolic profiling and elucidate the relationship between metabolic and genome diversity in this crop. Ten sesame accessions were selected that represent most of the genome diversity of sesame grown in India, Western Asia, Sudan and Venezuela based on previous AFLP studies. Ethanolic seed extracts were separated by HPLC, metabolites were ionized by positive and negative electrospray and ions were detected with an ion trap mass spectrometer in full-scan mode for m/z from 50 to 1000. Genome diversity was determined by Amplified Fragment Length Polymorphism (AFLP) using eight primer pair combinations. The relationship between biodiversity at the genome and at the metabolome levels was assessed by correlation analysis and multivariate statistics. Patterns of diversity at the genomic and metabolic levels differed, indicating that selection played a significant role in the evolution of metabolic diversity in sesame. This result implies that when used for the selection of genotypes in breeding and conservation, diversity assessment based on neutral DNA markers should be complemented with metabolic profiles. We hypothesize that this applies to all crops with a long history of domestication that possess commercially relevant traits affected by chemical phenotypes.

  5. Genome Size Diversity in Lilium (Liliaceae Is Correlated with Karyotype and Environmental Traits

    Directory of Open Access Journals (Sweden)

    Yun-peng Du

    2017-07-01

    Full Text Available Genome size (GS diversity is of fundamental biological importance. The occurrence of giant genomes in angiosperms is restricted to just a few lineages in the analyzed genome size of plant species so far. It is still an open question whether GS diversity is shaped by neutral or natural selection. The genus Lilium, with giant genomes, is phylogenetically and horticulturally important and is distributed throughout the northern hemisphere. GS diversity in Lilium and the underlying evolutionary mechanisms are poorly understood. We performed a comprehensive study involving phylogenetically independent analysis on 71 species to explore the diversity and evolution of GS and its correlation with karyological and environmental traits within Lilium (including Nomocharis. The strong phylogenetic signal detected for GS in the genus provides evidence consistent with that the repetitive DNA may be the primary contributors to the GS diversity, while the significant positive relationships detected between GS and the haploid chromosome length (HCL provide insights into patterns of genome evolution. The relationships between GS and karyotypes indicate that ancestral karyotypes of Lilium are likely to have exhibited small genomes, low diversity in centromeric index (CVCI values and relatively high relative variation in chromosome length (CVCL values. Significant relationships identified between GS and annual temperature and between GS and annual precipitation suggest that adaptation to habitat strongly influences GS diversity. We conclude that GS in Lilium is shaped by both neutral (genetic drift and adaptive evolution. These findings will have important consequences for understanding the evolution of giant plant genomes, and exploring the role of repetitive DNA fraction and chromosome changes in a plant group with large genomes and conservation of chromosome number.

  6. Horizontal gene transfer from diverse bacteria to an insect genome enables a tripartite nested mealybug symbiosis.

    Science.gov (United States)

    Husnik, Filip; Nikoh, Naruo; Koga, Ryuichi; Ross, Laura; Duncan, Rebecca P; Fujie, Manabu; Tanaka, Makiko; Satoh, Nori; Bachtrog, Doris; Wilson, Alex C C; von Dohlen, Carol D; Fukatsu, Takema; McCutcheon, John P

    2013-06-20

    The smallest reported bacterial genome belongs to Tremblaya princeps, a symbiont of Planococcus citri mealybugs (PCIT). Tremblaya PCIT not only has a 139 kb genome, but possesses its own bacterial endosymbiont, Moranella endobia. Genome and transcriptome sequencing, including genome sequencing from a Tremblaya lineage lacking intracellular bacteria, reveals that the extreme genomic degeneracy of Tremblaya PCIT likely resulted from acquiring Moranella as an endosymbiont. In addition, at least 22 expressed horizontally transferred genes from multiple diverse bacteria to the mealybug genome likely complement missing symbiont genes. However, none of these horizontally transferred genes are from Tremblaya, showing that genome reduction in this symbiont has not been enabled by gene transfer to the host nucleus. Our results thus indicate that the functioning of this three-way symbiosis is dependent on genes from at least six lineages of organisms and reveal a path to intimate endosymbiosis distinct from that followed by organelles. Copyright © 2013 Elsevier Inc. All rights reserved.

  7. Whole genome sequences of the USMARC sheep diversity panel v 2.4 aligned to the ovine reference genome assembly

    Science.gov (United States)

    A searchable and publicly viewable set of mapped genomes from 96 rams from 9 US sheep breeds was created. The nine pure breeds were selected to represent genetic diversity for traits such as fertility, prolificacy, maternal ability, growth rate, carcass leanness, wool quality, mature weight, and lo...

  8. The characterization of goat genetic diversity : Towards a genomic approach

    NARCIS (Netherlands)

    Ajmone-Marsan, P.; Colli, L.; Han, J. L.; Achilli, A.; Lancioni, H.; Joost, S.; Crepaldi, P.; Pilla, F.; Stella, A.; Taberlet, P.; Boettcher, P.; Negrini, R.; Lenstra, J. A.|info:eu-repo/dai/nl/067852335

    2014-01-01

    The investigation of genetic diversity at molecular level has been proposed as a valuable complement and sometimes proxy to phenotypic diversity of local breeds and is presently considered as one of the FAO priorities for breed characterization. By recommending a set of selected molecular markers

  9. Rice Chloroplast Genome Variation Architecture and Phylogenetic Dissection in Diverse Oryza Species Assessed by Whole-Genome Resequencing.

    Science.gov (United States)

    Tong, Wei; Kim, Tae-Sung; Park, Yong-Jin

    2016-12-01

    Chloroplast genome variations have been detected, despite its overall conserved structure, which has been valuable for plant population genetics and evolutionary studies. Here, we described chloroplast variation architecture of 383 rice accessions from diverse regions and different ecotypes, in order to mine the rice chloroplast genome variation architecture and phylogenetic. A total of 3677 variations across the chloroplast genome were identified with an average density of 27.33 per kb, in which wild rice showing a higher variation density than cultivated groups. Chloroplast genome nucleotide diversity investigation indicated a high degree of diversity in wild rice than in cultivated rice. Genetic distance estimation revealed that African rice showed a low level of breeding and connectivity with the Asian rice, suggesting the big distinction of them. Population structure and principal component analysis revealed the existence of clear clustering of African and Asian rice, as well as the indica and japonica in Asian cultivated rice. Phylogenetic analysis based on maximum likelihood and Bayesian inference methods and the population splits test suggested and supported the independent origins of indica and japonica within Asian cultivated rice. In addition, the African cultivated rice was thought to be domesticated differently from Asian cultivated rice. The chloroplast genome variation architecture in Asian and African rice are different, as well as within Asian or African rice. Wild rice and cultivated rice also have distinct nucleotide diversity or genetic distance. In chloroplast level, the independent origins of indica and japonica within Asian cultivated rice were suggested and the African cultivated rice was thought to be domesticated differently from Asian cultivated rice. These results will provide more candidate evidence for the further rice chloroplast genomic and evolution studies.

  10. Genome sequence and genetic diversity of European ash trees

    DEFF Research Database (Denmark)

    Sollars, Elizabeth S A; Harper, Andrea L; Kelly, Laura J

    2017-01-01

    Ash trees (genus Fraxinus, family Oleaceae) are widespread throughout the Northern Hemisphere, but are being devastated in Europe by the fungus Hymenoscyphus fraxineus, causing ash dieback, and in North America by the herbivorous beetle Agrilus planipennis. Here we sequence the genome of a low......-heterozygosity Fraxinus excelsior tree from Gloucestershire, UK, annotating 38,852 protein-coding genes of which 25% appear ash specific when compared with the genomes of ten other plant species. Analyses of paralogous genes suggest a whole-genome duplication shared with olive (Olea europaea, Oleaceae). We also re......-sequence 37 F. excelsior trees from Europe, finding evidence for apparent long-term decline in effective population size. Using our reference sequence, we re-analyse association transcriptomic data, yielding improved markers for reduced susceptibility to ash dieback. Surveys of these markers in British...

  11. Estimating variation within the genes and inferring the phylogeny of 186 sequenced diverse Escherichia coli genomes

    DEFF Research Database (Denmark)

    Kaas, Rolf Sommer; Rundsten, Carsten Friis; Ussery, David

    2012-01-01

    for creating better phylogenies, for determination of molecular clocks and for improved typing techniques. Results We find 3,051 gene clusters/families present in at least 95% of the genomes and 1,702 gene clusters present in 100% of the genomes. The former 'soft core' of about 3,000 gene families is perhaps...... more biologically relevant, especially considering that many of these genome sequences are draft quality. The E. coli pan-genome for this set of isolates contains 16,373 gene clusters. A core-gene tree, based on alignment and a pan-genome tree based on gene presence/absence, maps the relatedness...... of the 186 sequenced E. coli genomes. The core-gene tree displays high confidence and divides the E. coli strains into the observed MLST type clades and also separates defined phylotypes. Conclusion The results of comparing a large and diverse E. coli dataset support the theory that reliable and good...

  12. Fundamental differences in diversity and genomic population structure between Atlantic and Pacific Prochlorococcus.

    Science.gov (United States)

    Kashtan, Nadav; Roggensack, Sara E; Berta-Thompson, Jessie W; Grinberg, Maor; Stepanauskas, Ramunas; Chisholm, Sallie W

    2017-09-01

    The Atlantic and Pacific Oceans represent different biogeochemical regimes in which the abundant marine cyanobacterium Prochlorococcus thrives. We have shown that Prochlorococcus populations in the Atlantic are composed of hundreds of genomically, and likely ecologically, distinct coexisting subpopulations with distinct genomic backbones. Here we ask if differences in the ecology and selection pressures between the Atlantic and Pacific are reflected in the diversity and genomic composition of their indigenous Prochlorococcus populations. We applied large-scale single-cell genomics and compared the cell-by-cell genomic composition of wild populations of co-occurring cells from samples from Station ALOHA off Hawaii, and from Bermuda Atlantic Time Series Station off Bermuda. We reveal fundamental differences in diversity and genomic structure of populations between the sites. The Pacific populations are more diverse than those in the Atlantic, composed of significantly more coexisting subpopulations and lacking dominant subpopulations. Prochlorococcus from the two sites seem to be composed of mostly non-overlapping distinct sets of subpopulations with different genomic backbones-likely reflecting different sets of ocean-specific micro-niches. Furthermore, phylogenetically closely related strains carry ocean-associated nutrient acquisition genes likely reflecting differences in major selection pressures between the oceans. This differential selection, along with geographic separation, clearly has a significant role in shaping these populations.

  13. Endozoicomonas genomes reveal functional adaptation and plasticity in bacterial strains symbiotically associated with diverse marine hosts

    KAUST Repository

    Neave, Matthew J.

    2017-01-17

    Endozoicomonas bacteria are globally distributed and often abundantly associated with diverse marine hosts including reef-building corals, yet their function remains unknown. In this study we generated novel Endozoicomonas genomes from single cells and metagenomes obtained directly from the corals Stylophora pistillata, Pocillopora verrucosa, and Acropora humilis. We then compared these culture-independent genomes to existing genomes of bacterial isolates acquired from a sponge, sea slug, and coral to examine the functional landscape of this enigmatic genus. Sequencing and analysis of single cells and metagenomes resulted in four novel genomes with 60–76% and 81–90% genome completeness, respectively. These data also confirmed that Endozoicomonas genomes are large and are not streamlined for an obligate endosymbiotic lifestyle, implying that they have free-living stages. All genomes show an enrichment of genes associated with carbon sugar transport and utilization and protein secretion, potentially indicating that Endozoicomonas contribute to the cycling of carbohydrates and the provision of proteins to their respective hosts. Importantly, besides these commonalities, the genomes showed evidence for differential functional specificity and diversification, including genes for the production of amino acids. Given this metabolic diversity of Endozoicomonas we propose that different genotypes play disparate roles and have diversified in concert with their hosts.

  14. Chimpanzee genomic diversity reveals ancient admixture with bonobos

    DEFF Research Database (Denmark)

    de Manuel, Marc; Kuhlwilm, Martin; Frandsen, Peter

    2016-01-01

    Our closest living relatives, chimpanzees and bonobos, have a complex demographic history. We analyzed the high-coverage whole genomes of 75 wild-born chimpanzees and bonobos from 10 countries in Africa. We found that chimpanzee population substructure makes genetic information a good predictor...... of geographic origin at country and regional scales. Multiple lines of evidence suggest that gene flow occurred from bonobos into the ancestors of central and eastern chimpanzees between 200,000 and 550,000 years ago, probably with subsequent spread into Nigeria-Cameroon chimpanzees. Together with another......, possibly more recent contact (after 200,000 years ago), bonobos contributed less than 1% to the central chimpanzee genomes. Admixture thus appears to have been widespread during hominid evolution....

  15. Genome diversity of Pseudomonas aeruginosa PAO1 laboratory strains.

    Science.gov (United States)

    Klockgether, Jens; Munder, Antje; Neugebauer, Jens; Davenport, Colin F; Stanke, Frauke; Larbig, Karen D; Heeb, Stephan; Schöck, Ulrike; Pohl, Thomas M; Wiehlmann, Lutz; Tümmler, Burkhard

    2010-02-01

    Pseudomonas aeruginosa PAO1 is the most commonly used strain for research on this ubiquitous and metabolically versatile opportunistic pathogen. Strain PAO1, a derivative of the original Australian PAO isolate, has been distributed worldwide to laboratories and strain collections. Over decades discordant phenotypes of PAO1 sublines have emerged. Taking the existing PAO1-UW genome sequence (named after the University of Washington, which led the sequencing project) as a blueprint, the genome sequences of reference strains MPAO1 and PAO1-DSM (stored at the German Collection for Microorganisms and Cell Cultures [DSMZ]) were resolved by physical mapping and deep short read sequencing-by-synthesis. MPAO1 has been the source of near-saturation libraries of transposon insertion mutants, and PAO1-DSM is identical in its SpeI-DpnI restriction map with the original isolate. The major genomic differences of MPAO1 and PAO1-DSM in comparison to PAO1-UW are the lack of a large inversion, a duplication of a mobile 12-kb prophage region carrying a distinct integrase and protein phosphatases or kinases, deletions of 3 to 1,006 bp in size, and at least 39 single-nucleotide substitutions, 17 of which affect protein sequences. The PAO1 sublines differed in their ability to cope with nutrient limitation and their virulence in an acute murine airway infection model. Subline PAO1-DSM outnumbered the two other sublines in late stationary growth phase. In conclusion, P. aeruginosa PAO1 shows an ongoing microevolution of genotype and phenotype that jeopardizes the reproducibility of research. High-throughput genome resequencing will resolve more cases and could become a proper quality control for strain collections.

  16. Natural selection shaped the rise and fall of passenger pigeon genomic diversity.

    Science.gov (United States)

    Murray, Gemma G R; Soares, André E R; Novak, Ben J; Schaefer, Nathan K; Cahill, James A; Baker, Allan J; Demboski, John R; Doll, Andrew; Da Fonseca, Rute R; Fulton, Tara L; Gilbert, M Thomas P; Heintzman, Peter D; Letts, Brandon; McIntosh, George; O'Connell, Brendan L; Peck, Mark; Pipes, Marie-Lorraine; Rice, Edward S; Santos, Kathryn M; Sohrweide, A Gregory; Vohr, Samuel H; Corbett-Detig, Russell B; Green, Richard E; Shapiro, Beth

    2017-11-17

    The extinct passenger pigeon was once the most abundant bird in North America, and possibly the world. Although theory predicts that large populations will be more genetically diverse, passenger pigeon genetic diversity was surprisingly low. To investigate this disconnect, we analyzed 41 mitochondrial and 4 nuclear genomes from passenger pigeons and 2 genomes from band-tailed pigeons, which are passenger pigeons' closest living relatives. Passenger pigeons' large population size appears to have allowed for faster adaptive evolution and removal of harmful mutations, driving a huge loss in their neutral genetic diversity. These results demonstrate the effect that selection can have on a vertebrate genome and contradict results that suggested that population instability contributed to this species's surprisingly rapid extinction. Copyright © 2017, American Association for the Advancement of Science.

  17. Genome sequence and genetic diversity of European ash trees.

    Science.gov (United States)

    Sollars, Elizabeth S A; Harper, Andrea L; Kelly, Laura J; Sambles, Christine M; Ramirez-Gonzalez, Ricardo H; Swarbreck, David; Kaithakottil, Gemy; Cooper, Endymion D; Uauy, Cristobal; Havlickova, Lenka; Worswick, Gemma; Studholme, David J; Zohren, Jasmin; Salmon, Deborah L; Clavijo, Bernardo J; Li, Yi; He, Zhesi; Fellgett, Alison; McKinney, Lea Vig; Nielsen, Lene Rostgaard; Douglas, Gerry C; Kjær, Erik Dahl; Downie, J Allan; Boshier, David; Lee, Steve; Clark, Jo; Grant, Murray; Bancroft, Ian; Caccamo, Mario; Buggs, Richard J A

    2017-01-12

    Ash trees (genus Fraxinus, family Oleaceae) are widespread throughout the Northern Hemisphere, but are being devastated in Europe by the fungus Hymenoscyphus fraxineus, causing ash dieback, and in North America by the herbivorous beetle Agrilus planipennis. Here we sequence the genome of a low-heterozygosity Fraxinus excelsior tree from Gloucestershire, UK, annotating 38,852 protein-coding genes of which 25% appear ash specific when compared with the genomes of ten other plant species. Analyses of paralogous genes suggest a whole-genome duplication shared with olive (Olea europaea, Oleaceae). We also re-sequence 37 F. excelsior trees from Europe, finding evidence for apparent long-term decline in effective population size. Using our reference sequence, we re-analyse association transcriptomic data, yielding improved markers for reduced susceptibility to ash dieback. Surveys of these markers in British populations suggest that reduced susceptibility to ash dieback may be more widespread in Great Britain than in Denmark. We also present evidence that susceptibility of trees to H. fraxineus is associated with their iridoid glycoside levels. This rapid, integrated, multidisciplinary research response to an emerging health threat in a non-model organism opens the way for mitigation of the epidemic.

  18. The family Rhabdoviridae: mono- and bipartite negative-sense RNA viruses with diverse genome organization and common evolutionary origins

    Science.gov (United States)

    Dietzgen, Ralf G.; Kondo, Hideki; Goodin, Michael M.; Kurath, Gael; Vasilakis, Nikos

    2017-01-01

    The family Rhabdoviridae consists of mostly enveloped, bullet-shaped or bacilliform viruses with a negative-sense, single-stranded RNA genome that infect vertebrates, invertebrates or plants. This ecological diversity is reflected by the diversity and complexity of their genomes. Five canonical structural protein genes are conserved in all rhabdoviruses, but may be overprinted, overlapped or interspersed with several novel and diverse accessory genes. This review gives an overview of the characteristics and diversity of rhabdoviruses, their taxonomic classification, replication mechanism, properties of classical rhabdoviruses such as rabies virus and rhabdoviruses with complex genomes, rhabdoviruses infecting aquatic species, and plant rhabdoviruses with both mono- and bipartite genomes.

  19. Population genomics diversity of Plasmodium falciparum in malaria ...

    African Journals Online (AJOL)

    Abstract. Background: Plasmodium falciparum, the most dangerous malaria parasite species to humans remains an important public health concern in Okelele, a rural community in Ilorin, Kwara State, Nigeria. There is however little information about the genetic diversity of Plasmodium falciparum in Nigeria. Objective: To ...

  20. Population genomics diversity of Plasmodium falciparum in malaria ...

    African Journals Online (AJOL)

    Background: Plasmodium falciparum, the most dangerous malaria parasite species to humans remains an important public health concern in Okelele, a rural community in Ilorin, Kwara State, Nigeria. There is however little information about the genetic diversity of Plasmodium falciparum in Nigeria. Objective: To determine ...

  1. SSR Analysis on Diversity of AA Genome Oryza Species in the Southeast and South Asia

    Directory of Open Access Journals (Sweden)

    Jian-zhen LU

    2008-12-01

    Full Text Available To investigate genetic diversities among the AA genome Oryza species in the Southeast and South Asia, a total of 428 accessions of the AA genome Oryza species were genotyped using 36 simple sequence repeats (SSR markers distributed throughout the rice genome. All of the 36 SSR markers generated polymorphic bands, revealing 100% polymorphism. The number of alleles per locus ranged from 3 to 17 with the mean of 8.6. The Nei's genetic diversity index (He ranged from 0.337 at RM455 to 0.865 at RM169 with an average value of 0.650. The genetic diversity of the AA genome Oryza species in the Southeast Asia was obviously higher than that in the South Asia. Among the detected Oryza species in the South and Southeast Asia, O. rufipogon showed the highest genetic diversity. Meanwhile, a higher genetic differentiation (Fst was found among the detected Oryza species in the Southeast Asia than in the South Asia. The Fst value between O. nivara and O. sativa was the highest. The results from the number of specific alleles, specific loci, and allele frequency confirmed the greater genetic variation among the detected species. In addition, the specific allele in RM161 displayed higher frequency (0.193, suggesting its important function in identifying Oryza species of AA genome.

  2. Tales of diversity: Genomic and morphological characteristics of forty-six Arthrobacter phages.

    Directory of Open Access Journals (Sweden)

    Karen K Klyczek

    Full Text Available The vast bacteriophage population harbors an immense reservoir of genetic information. Almost 2000 phage genomes have been sequenced from phages infecting hosts in the phylum Actinobacteria, and analysis of these genomes reveals substantial diversity, pervasive mosaicism, and novel mechanisms for phage replication and lysogeny. Here, we describe the isolation and genomic characterization of 46 phages from environmental samples at various geographic locations in the U.S. infecting a single Arthrobacter sp. strain. These phages include representatives of all three virion morphologies, and Jasmine is the first sequenced podovirus of an actinobacterial host. The phages also span considerable sequence diversity, and can be grouped into 10 clusters according to their nucleotide diversity, and two singletons each with no close relatives. However, the clusters/singletons appear to be genomically well separated from each other, and relatively few genes are shared between clusters. Genome size varies from among the smallest of siphoviral phages (15,319 bp to over 70 kbp, and G+C contents range from 45-68%, compared to 63.4% for the host genome. Although temperate phages are common among other actinobacterial hosts, these Arthrobacter phages are primarily lytic, and only the singleton Galaxy is likely temperate.

  3. Genome-Based Studies of Marine Microorganisms to Maximize the Diversity of Natural Products Discovery for Medical Treatments

    OpenAIRE

    Xin-Qing Zhao

    2011-01-01

    Marine microorganisms are rich source for natural products which play important roles in pharmaceutical industry. Over the past decade, genome-based studies of marine microorganisms have unveiled the tremendous diversity of the producers of natural products and also contributed to the efficiency of harness the strain diversity and chemical diversity, as well as the genetic diversity of marine microorganisms for the rapid discovery and generation of new natural products. In the meantime, genom...

  4. High Whole-Genome Sequence Diversity of Human Papillomavirus Type 18 Isolates

    Directory of Open Access Journals (Sweden)

    Pascal van der Weele

    2018-02-01

    Full Text Available Background: The most commonly found human papillomavirus (HPV types in cervical cancer are HPV16 and HPV18. Genome variants of these types have been associated with differential carcinogenic potential. To date, only a handful of studies have described HPV18 whole genome sequencing results. Here we describe HPV18 variant diversity and conservation of persistent infections in a longitudinal retrospective cohort study. Methods: Cervical self-samples were obtained annually over four years and genotyped on the SPF10-DEIA-LiPA25 platform. Clearing and persistent HPV18 positive infections were selected, amplified in two overlapping fragments, and sequenced using 32 sequence primers. Results: Complete viral genomes were obtained from 25 participants with persistent and 26 participants with clearing HPV18 infections, resulting in 52 unique HPV18 genomes. Sublineage A3 was predominant in this population. The consensus viral genome was completely conserved over time in persistent infections, with one exception, where different HPV18 variants were identified in follow-up samples. Conclusions: This study identified a diverse set of HPV18 variants. In persistent infections, the consensus viral genome is conserved. The identification of only one HPV18 infection with different major variants in follow-up implies that this is a potentially rare event. This dataset adds 52 HPV18 genome variants to Genbank, more than doubling the currently available HPV18 information resource, and all but one variant are unique additions.

  5. Genomic diversity in Onchocerca volvulus and its Wolbachia endosymbiont.

    Science.gov (United States)

    Choi, Young-Jun; Tyagi, Rahul; McNulty, Samantha N; Rosa, Bruce A; Ozersky, Philip; Martin, John; Hallsworth-Pepin, Kymberlie; Unnasch, Thomas R; Norice, Carmelle T; Nutman, Thomas B; Weil, Gary J; Fischer, Peter U; Mitreva, Makedonka

    2016-11-21

    Ongoing elimination efforts have altered the global distribution of Onchocerca volvulus, the agent of river blindness, and further population restructuring is expected as efforts continue. Therefore, a better understanding of population genetic processes and their effect on biogeography is needed to support elimination goals. We describe O. volvulus genome variation in 27 isolates from the early 1990s (before widespread mass treatment) from four distinct locales: Ecuador, Uganda, the West African forest and the West African savanna. We observed genetic substructuring between Ecuador and West Africa and between the West African forest and savanna bioclimes, with evidence of unidirectional gene flow from savanna to forest strains. We identified forest:savanna-discriminatory genomic regions and report a set of ancestry informative loci that can be used to differentiate between forest, savanna and admixed isolates, which has not previously been possible. We observed mito-nuclear discordance possibly stemming from incomplete lineage sorting. The catalogue of the nuclear, mitochondrial and endosymbiont DNA variants generated in this study will support future basic and translational onchocerciasis research, with particular relevance for ongoing control programmes, and boost efforts to characterize drug, vaccine and diagnostic targets.

  6. Lactobacillus paracasei comparative genomics: towards species pan-genome definition and exploitation of diversity.

    Directory of Open Access Journals (Sweden)

    Tamara Smokvina

    Full Text Available Lactobacillus paracasei is a member of the normal human and animal gut microbiota and is used extensively in the food industry in starter cultures for dairy products or as probiotics. With the development of low-cost, high-throughput sequencing techniques it has become feasible to sequence many different strains of one species and to determine its "pan-genome". We have sequenced the genomes of 34 different L. paracasei strains, and performed a comparative genomics analysis. We analysed genome synteny and content, focussing on the pan-genome, core genome and variable genome. Each genome was shown to contain around 2800-3100 protein-coding genes, and comparative analysis identified over 4200 ortholog groups that comprise the pan-genome of this species, of which about 1800 ortholog groups make up the conserved core. Several factors previously associated with host-microbe interactions such as pili, cell-envelope proteinase, hydrolases p40 and p75 or the capacity to produce short branched-chain fatty acids (bkd operon are part of the L. paracasei core genome present in all analysed strains. The variome consists mainly of hypothetical proteins, phages, plasmids, transposon/conjugative elements, and known functions such as sugar metabolism, cell-surface proteins, transporters, CRISPR-associated proteins, and EPS biosynthesis proteins. An enormous variety and variability of sugar utilization gene cassettes were identified, with each strain harbouring between 25-53 cassettes, reflecting the high adaptability of L. paracasei to different niches. A phylogenomic tree was constructed based on total genome contents, and together with an analysis of horizontal gene transfer events we conclude that evolution of these L. paracasei strains is complex and not always related to niche adaptation. The results of this genome content comparison was used, together with high-throughput growth experiments on various carbohydrates, to perform gene-trait matching analysis

  7. Genetic diversity in the modern horse illustrated from genome-wide SNP data.

    Directory of Open Access Journals (Sweden)

    Jessica L Petersen

    Full Text Available Horses were domesticated from the Eurasian steppes 5,000-6,000 years ago. Since then, the use of horses for transportation, warfare, and agriculture, as well as selection for desired traits and fitness, has resulted in diverse populations distributed across the world, many of which have become or are in the process of becoming formally organized into closed, breeding populations (breeds. This report describes the use of a genome-wide set of autosomal SNPs and 814 horses from 36 breeds to provide the first detailed description of equine breed diversity. F(ST calculations, parsimony, and distance analysis demonstrated relationships among the breeds that largely reflect geographic origins and known breed histories. Low levels of population divergence were observed between breeds that are relatively early on in the process of breed development, and between those with high levels of within-breed diversity, whether due to large population size, ongoing outcrossing, or large within-breed phenotypic diversity. Populations with low within-breed diversity included those which have experienced population bottlenecks, have been under intense selective pressure, or are closed populations with long breed histories. These results provide new insights into the relationships among and the diversity within breeds of horses. In addition these results will facilitate future genome-wide association studies and investigations into genomic targets of selection.

  8. Genetic Diversity in the Modern Horse Illustrated from Genome-Wide SNP Data

    Science.gov (United States)

    Petersen, Jessica L.; Mickelson, James R.; Cothran, E. Gus; Andersson, Lisa S.; Axelsson, Jeanette; Bailey, Ernie; Bannasch, Danika; Binns, Matthew M.; Borges, Alexandre S.; Brama, Pieter; da Câmara Machado, Artur; Distl, Ottmar; Felicetti, Michela; Fox-Clipsham, Laura; Graves, Kathryn T.; Guérin, Gérard; Haase, Bianca; Hasegawa, Telhisa; Hemmann, Karin; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Lohi, Hannes; Lopes, Maria Susana; McGivney, Beatrice A.; Mikko, Sofia; Orr, Nicholas; Penedo, M. Cecilia T; Piercy, Richard J.; Raekallio, Marja; Rieder, Stefan; Røed, Knut H.; Silvestrelli, Maurizio; Swinburne, June; Tozaki, Teruaki; Vaudin, Mark; M. Wade, Claire; McCue, Molly E.

    2013-01-01

    Horses were domesticated from the Eurasian steppes 5,000–6,000 years ago. Since then, the use of horses for transportation, warfare, and agriculture, as well as selection for desired traits and fitness, has resulted in diverse populations distributed across the world, many of which have become or are in the process of becoming formally organized into closed, breeding populations (breeds). This report describes the use of a genome-wide set of autosomal SNPs and 814 horses from 36 breeds to provide the first detailed description of equine breed diversity. FST calculations, parsimony, and distance analysis demonstrated relationships among the breeds that largely reflect geographic origins and known breed histories. Low levels of population divergence were observed between breeds that are relatively early on in the process of breed development, and between those with high levels of within-breed diversity, whether due to large population size, ongoing outcrossing, or large within-breed phenotypic diversity. Populations with low within-breed diversity included those which have experienced population bottlenecks, have been under intense selective pressure, or are closed populations with long breed histories. These results provide new insights into the relationships among and the diversity within breeds of horses. In addition these results will facilitate future genome-wide association studies and investigations into genomic targets of selection. PMID:23383025

  9. Tomato Fruits Show Wide Phenomic Diversity but Fruit Developmental Genes Show Low Genomic Diversity.

    Directory of Open Access Journals (Sweden)

    Vijee Mohan

    Full Text Available Domestication of tomato has resulted in large diversity in fruit phenotypes. An intensive phenotyping of 127 tomato accessions from 20 countries revealed extensive morphological diversity in fruit traits. The diversity in fruit traits clustered the accessions into nine classes and identified certain promising lines having desirable traits pertaining to total soluble salts (TSS, carotenoids, ripening index, weight and shape. Factor analysis of the morphometric data from Tomato Analyzer showed that the fruit shape is a complex trait shared by several factors. The 100% variance between round and flat fruit shapes was explained by one discriminant function having a canonical correlation of 0.874 by stepwise discriminant analysis. A set of 10 genes (ACS2, COP1, CYC-B, RIN, MSH2, NAC-NOR, PHOT1, PHYA, PHYB and PSY1 involved in various plant developmental processes were screened for SNP polymorphism by EcoTILLING. The genetic diversity in these genes revealed a total of 36 non-synonymous and 18 synonymous changes leading to the identification of 28 haplotypes. The average frequency of polymorphism across the genes was 0.038/Kb. Significant negative Tajima'D statistic in two of the genes, ACS2 and PHOT1 indicated the presence of rare alleles in low frequency. Our study indicates that while there is low polymorphic diversity in the genes regulating plant development, the population shows wider phenotype diversity. Nonetheless, morphological and genetic diversity of the present collection can be further exploited as potential resources in future.

  10. Prophage Integrase Typing Is a Useful Indicator of Genomic Diversity in Salmonella enterica

    Directory of Open Access Journals (Sweden)

    Anna Colavecchio

    2017-07-01

    Full Text Available Salmonella enterica is a bacterial species that is a major cause of illness in humans and food-producing animals. S. enterica exhibits considerable inter-serovar diversity, as evidenced by the large number of host adapted serovars that have been identified. The development of methods to assess genome diversity in S. enterica will help to further define the limits of diversity in this foodborne pathogen. Thus, we evaluated a PCR assay, which targets prophage integrase genes, as a rapid method to investigate S. enterica genome diversity. To evaluate the PCR prophage integrase assay, 49 isolates of S. enterica were selected, including 19 clinical isolates from clonal serovars (Enteritidis and Heidelberg that commonly cause human illness, and 30 isolates from food-associated Salmonella serovars that rarely cause human illness. The number of integrase genes identified by the PCR assay was compared to the number of integrase genes within intact prophages identified by whole genome sequencing and phage finding program PHASTER. The PCR assay identified a total of 147 prophage integrase genes within the 49 S. enterica genomes (79 integrase genes in the food-associated Salmonella isolates, 50 integrase genes in S. Enteritidis, and 18 integrase genes in S. Heidelberg. In comparison, whole genome sequencing and PHASTER identified a total of 75 prophage integrase genes within 102 intact prophages in the 49 S. enterica genomes (44 integrase genes in the food-associated Salmonella isolates, 21 integrase genes in S. Enteritidis, and 9 integrase genes in S. Heidelberg. Collectively, both the PCR assay and PHASTER identified the presence of a large diversity of prophage integrase genes in the food-associated isolates compared to the clinical isolates, thus indicating a high degree of diversity in the food-associated isolates, and confirming the clonal nature of S. Enteritidis and S. Heidelberg. Moreover, PHASTER revealed a diversity of 29 different types of

  11. Artificial selection with traditional or genomic relationships: consequences in coancestry and genetic diversity.

    Science.gov (United States)

    Rodríguez-Ramilo, Silvia Teresa; García-Cortés, Luis Alberto; de Cara, María Ángeles Rodríguez

    2015-01-01

    Estimated breeding values (EBVs) are traditionally obtained from pedigree information. However, EBVs from high-density genotypes can have higher accuracy than EBVs from pedigree information. At the same time, it has been shown that EBVs from genomic data lead to lower increases in inbreeding compared with traditional selection based on genealogies. Here we evaluate the performance with BLUP selection based on genealogical coancestry with three different genome-based coancestry estimates: (1) an estimate based on shared segments of homozygosity, (2) an approach based on SNP-by-SNP count corrected by allelic frequencies, and (3) the identity by state methodology. We evaluate the effect of different population sizes, different number of genomic markers, and several heritability values for a quantitative trait. The performance of the different measures of coancestry in BLUP is evaluated in the true breeding values after truncation selection and also in terms of coancestry and diversity maintained. Accordingly, cross-performances were also carried out, that is, how prediction based on genealogical records impacts the three other measures of coancestry and inbreeding, and viceversa. Our results show that the genetic gains are very similar for all four coancestries, but the genomic-based methods are superior to using genealogical coancestries in terms of maintaining diversity measured as observed heterozygosity. Furthermore, the measure of coancestry based on shared segments of the genome seems to provide slightly better results on some scenarios, and the increase in inbreeding and loss in diversity is only slightly larger than the other genomic selection methods in those scenarios. Our results shed light on genomic selection vs. traditional genealogical-based BLUP and make the case to manage the population variability using genomic information to preserve the future success of selection programmes.

  12. Expanding the diversity of mycobacteriophages: insights into genome architecture and evolution.

    Science.gov (United States)

    Pope, Welkin H; Jacobs-Sera, Deborah; Russell, Daniel A; Peebles, Craig L; Al-Atrache, Zein; Alcoser, Turi A; Alexander, Lisa M; Alfano, Matthew B; Alford, Samantha T; Amy, Nichols E; Anderson, Marie D; Anderson, Alexander G; Ang, Andrew A S; Ares, Manuel; Barber, Amanda J; Barker, Lucia P; Barrett, Jonathan M; Barshop, William D; Bauerle, Cynthia M; Bayles, Ian M; Belfield, Katherine L; Best, Aaron A; Borjon, Agustin; Bowman, Charles A; Boyer, Christine A; Bradley, Kevin W; Bradley, Victoria A; Broadway, Lauren N; Budwal, Keshav; Busby, Kayla N; Campbell, Ian W; Campbell, Anne M; Carey, Alyssa; Caruso, Steven M; Chew, Rebekah D; Cockburn, Chelsea L; Cohen, Lianne B; Corajod, Jeffrey M; Cresawn, Steven G; Davis, Kimberly R; Deng, Lisa; Denver, Dee R; Dixon, Breyon R; Ekram, Sahrish; Elgin, Sarah C R; Engelsen, Angela E; English, Belle E V; Erb, Marcella L; Estrada, Crystal; Filliger, Laura Z; Findley, Ann M; Forbes, Lauren; Forsyth, Mark H; Fox, Tyler M; Fritz, Melissa J; Garcia, Roberto; George, Zindzi D; Georges, Anne E; Gissendanner, Christopher R; Goff, Shannon; Goldstein, Rebecca; Gordon, Kobie C; Green, Russell D; Guerra, Stephanie L; Guiney-Olsen, Krysta R; Guiza, Bridget G; Haghighat, Leila; Hagopian, Garrett V; Harmon, Catherine J; Harmson, Jeremy S; Hartzog, Grant A; Harvey, Samuel E; He, Siping; He, Kevin J; Healy, Kaitlin E; Higinbotham, Ellen R; Hildebrandt, Erin N; Ho, Jason H; Hogan, Gina M; Hohenstein, Victoria G; Holz, Nathan A; Huang, Vincent J; Hufford, Ericka L; Hynes, Peter M; Jackson, Arrykka S; Jansen, Erica C; Jarvik, Jonathan; Jasinto, Paul G; Jordan, Tuajuanda C; Kasza, Tomas; Katelyn, Murray A; Kelsey, Jessica S; Kerrigan, Larisa A; Khaw, Daryl; Kim, Junghee; Knutter, Justin Z; Ko, Ching-Chung; Larkin, Gail V; Laroche, Jennifer R; Latif, Asma; Leuba, Kohana D; Leuba, Sequoia I; Lewis, Lynn O; Loesser-Casey, Kathryn E; Long, Courtney A; Lopez, A Javier; Lowery, Nicholas; Lu, Tina Q; Mac, Victor; Masters, Isaac R; McCloud, Jazmyn J; McDonough, Molly J; Medenbach, Andrew J; Menon, Anjali; Miller, Rachel; Morgan, Brandon K; Ng, Patrick C; Nguyen, Elvis; Nguyen, Katrina T; Nguyen, Emilie T; Nicholson, Kaylee M; Parnell, Lindsay A; Peirce, Caitlin E; Perz, Allison M; Peterson, Luke J; Pferdehirt, Rachel E; Philip, Seegren V; Pogliano, Kit; Pogliano, Joe; Polley, Tamsen; Puopolo, Erica J; Rabinowitz, Hannah S; Resiss, Michael J; Rhyan, Corwin N; Robinson, Yetta M; Rodriguez, Lauren L; Rose, Andrew C; Rubin, Jeffrey D; Ruby, Jessica A; Saha, Margaret S; Sandoz, James W; Savitskaya, Judith; Schipper, Dale J; Schnitzler, Christine E; Schott, Amanda R; Segal, J Bradley; Shaffer, Christopher D; Sheldon, Kathryn E; Shepard, Erica M; Shepardson, Jonathan W; Shroff, Madav K; Simmons, Jessica M; Simms, Erika F; Simpson, Brandy M; Sinclair, Kathryn M; Sjoholm, Robert L; Slette, Ingrid J; Spaulding, Blaire C; Straub, Clark L; Stukey, Joseph; Sughrue, Trevor; Tang, Tin-Yun; Tatyana, Lyons M; Taylor, Stephen B; Taylor, Barbara J; Temple, Louise M; Thompson, Jasper V; Tokarz, Michael P; Trapani, Stephanie E; Troum, Alexander P; Tsay, Jonathan; Tubbs, Anthony T; Walton, Jillian M; Wang, Danielle H; Wang, Hannah; Warner, John R; Weisser, Emilie G; Wendler, Samantha C; Weston-Hafer, Kathleen A; Whelan, Hilary M; Williamson, Kurt E; Willis, Angelica N; Wirtshafter, Hannah S; Wong, Theresa W; Wu, Phillip; Yang, Yun jeong; Yee, Brandon C; Zaidins, David A; Zhang, Bo; Zúniga, Melina Y; Hendrix, Roger W; Hatfull, Graham F

    2011-01-27

    Mycobacteriophages are viruses that infect mycobacterial hosts such as Mycobacterium smegmatis and Mycobacterium tuberculosis. All mycobacteriophages characterized to date are dsDNA tailed phages, and have either siphoviral or myoviral morphotypes. However, their genetic diversity is considerable, and although sixty-two genomes have been sequenced and comparatively analyzed, these likely represent only a small portion of the diversity of the mycobacteriophage population at large. Here we report the isolation, sequencing and comparative genomic analysis of 18 new mycobacteriophages isolated from geographically distinct locations within the United States. Although no clear correlation between location and genome type can be discerned, these genomes expand our knowledge of mycobacteriophage diversity and enhance our understanding of the roles of mobile elements in viral evolution. Expansion of the number of mycobacteriophages grouped within Cluster A provides insights into the basis of immune specificity in these temperate phages, and we also describe a novel example of apparent immunity theft. The isolation and genomic analysis of bacteriophages by freshman college students provides an example of an authentic research experience for novice scientists.

  13. Expanding the Diversity of Mycobacteriophages: Insights into Genome Architecture and Evolution

    Science.gov (United States)

    Pope, Welkin H.; Jacobs-Sera, Deborah; Russell, Daniel A.; Peebles, Craig L.; Al-Atrache, Zein; Alcoser, Turi A.; Alexander, Lisa M.; Alfano, Matthew B.; Alford, Samantha T.; Amy, Nichols E.; Anderson, Marie D.; Anderson, Alexander G.; Ang, Andrew A. S.; Ares, Manuel; Barber, Amanda J.; Barker, Lucia P.; Barrett, Jonathan M.; Barshop, William D.; Bauerle, Cynthia M.; Bayles, Ian M.; Belfield, Katherine L.; Best, Aaron A.; Borjon, Agustin; Bowman, Charles A.; Boyer, Christine A.; Bradley, Kevin W.; Bradley, Victoria A.; Broadway, Lauren N.; Budwal, Keshav; Busby, Kayla N.; Campbell, Ian W.; Campbell, Anne M.; Carey, Alyssa; Caruso, Steven M.; Chew, Rebekah D.; Cockburn, Chelsea L.; Cohen, Lianne B.; Corajod, Jeffrey M.; Cresawn, Steven G.; Davis, Kimberly R.; Deng, Lisa; Denver, Dee R.; Dixon, Breyon R.; Ekram, Sahrish; Elgin, Sarah C. R.; Engelsen, Angela E.; English, Belle E. V.; Erb, Marcella L.; Estrada, Crystal; Filliger, Laura Z.; Findley, Ann M.; Forbes, Lauren; Forsyth, Mark H.; Fox, Tyler M.; Fritz, Melissa J.; Garcia, Roberto; George, Zindzi D.; Georges, Anne E.; Gissendanner, Christopher R.; Goff, Shannon; Goldstein, Rebecca; Gordon, Kobie C.; Green, Russell D.; Guerra, Stephanie L.; Guiney-Olsen, Krysta R.; Guiza, Bridget G.; Haghighat, Leila; Hagopian, Garrett V.; Harmon, Catherine J.; Harmson, Jeremy S.; Hartzog, Grant A.; Harvey, Samuel E.; He, Siping; He, Kevin J.; Healy, Kaitlin E.; Higinbotham, Ellen R.; Hildebrandt, Erin N.; Ho, Jason H.; Hogan, Gina M.; Hohenstein, Victoria G.; Holz, Nathan A.; Huang, Vincent J.; Hufford, Ericka L.; Hynes, Peter M.; Jackson, Arrykka S.; Jansen, Erica C.; Jarvik, Jonathan; Jasinto, Paul G.; Jordan, Tuajuanda C.; Kasza, Tomas; Katelyn, Murray A.; Kelsey, Jessica S.; Kerrigan, Larisa A.; Khaw, Daryl; Kim, Junghee; Knutter, Justin Z.; Ko, Ching-Chung; Larkin, Gail V.; Laroche, Jennifer R.; Latif, Asma; Leuba, Kohana D.; Leuba, Sequoia I.; Lewis, Lynn O.; Loesser-Casey, Kathryn E.; Long, Courtney A.; Lopez, A. Javier; Lowery, Nicholas; Lu, Tina Q.; Mac, Victor; Masters, Isaac R.; McCloud, Jazmyn J.; McDonough, Molly J.; Medenbach, Andrew J.; Menon, Anjali; Miller, Rachel; Morgan, Brandon K.; Ng, Patrick C.; Nguyen, Elvis; Nguyen, Katrina T.; Nguyen, Emilie T.; Nicholson, Kaylee M.; Parnell, Lindsay A.; Peirce, Caitlin E.; Perz, Allison M.; Peterson, Luke J.; Pferdehirt, Rachel E.; Philip, Seegren V.; Pogliano, Kit; Pogliano, Joe; Polley, Tamsen; Puopolo, Erica J.; Rabinowitz, Hannah S.; Resiss, Michael J.; Rhyan, Corwin N.; Robinson, Yetta M.; Rodriguez, Lauren L.; Rose, Andrew C.; Rubin, Jeffrey D.; Ruby, Jessica A.; Saha, Margaret S.; Sandoz, James W.; Savitskaya, Judith; Schipper, Dale J.; Schnitzler, Christine E.; Schott, Amanda R.; Segal, J. Bradley; Shaffer, Christopher D.; Sheldon, Kathryn E.; Shepard, Erica M.; Shepardson, Jonathan W.; Shroff, Madav K.; Simmons, Jessica M.; Simms, Erika F.; Simpson, Brandy M.; Sinclair, Kathryn M.; Sjoholm, Robert L.; Slette, Ingrid J.; Spaulding, Blaire C.; Straub, Clark L.; Stukey, Joseph; Sughrue, Trevor; Tang, Tin-Yun; Tatyana, Lyons M.; Taylor, Stephen B.; Taylor, Barbara J.; Temple, Louise M.; Thompson, Jasper V.; Tokarz, Michael P.; Trapani, Stephanie E.; Troum, Alexander P.; Tsay, Jonathan; Tubbs, Anthony T.; Walton, Jillian M.; Wang, Danielle H.; Wang, Hannah; Warner, John R.; Weisser, Emilie G.; Wendler, Samantha C.; Weston-Hafer, Kathleen A.; Whelan, Hilary M.; Williamson, Kurt E.; Willis, Angelica N.; Wirtshafter, Hannah S.; Wong, Theresa W.; Wu, Phillip; Yang, Yun jeong; Yee, Brandon C.; Zaidins, David A.; Zhang, Bo; Zúniga, Melina Y.; Hendrix, Roger W.; Hatfull, Graham F.

    2011-01-01

    Mycobacteriophages are viruses that infect mycobacterial hosts such as Mycobacterium smegmatis and Mycobacterium tuberculosis. All mycobacteriophages characterized to date are dsDNA tailed phages, and have either siphoviral or myoviral morphotypes. However, their genetic diversity is considerable, and although sixty-two genomes have been sequenced and comparatively analyzed, these likely represent only a small portion of the diversity of the mycobacteriophage population at large. Here we report the isolation, sequencing and comparative genomic analysis of 18 new mycobacteriophages isolated from geographically distinct locations within the United States. Although no clear correlation between location and genome type can be discerned, these genomes expand our knowledge of mycobacteriophage diversity and enhance our understanding of the roles of mobile elements in viral evolution. Expansion of the number of mycobacteriophages grouped within Cluster A provides insights into the basis of immune specificity in these temperate phages, and we also describe a novel example of apparent immunity theft. The isolation and genomic analysis of bacteriophages by freshman college students provides an example of an authentic research experience for novice scientists. PMID:21298013

  14. Expanding the diversity of mycobacteriophages: insights into genome architecture and evolution.

    Directory of Open Access Journals (Sweden)

    Welkin H Pope

    2011-01-01

    Full Text Available Mycobacteriophages are viruses that infect mycobacterial hosts such as Mycobacterium smegmatis and Mycobacterium tuberculosis. All mycobacteriophages characterized to date are dsDNA tailed phages, and have either siphoviral or myoviral morphotypes. However, their genetic diversity is considerable, and although sixty-two genomes have been sequenced and comparatively analyzed, these likely represent only a small portion of the diversity of the mycobacteriophage population at large. Here we report the isolation, sequencing and comparative genomic analysis of 18 new mycobacteriophages isolated from geographically distinct locations within the United States. Although no clear correlation between location and genome type can be discerned, these genomes expand our knowledge of mycobacteriophage diversity and enhance our understanding of the roles of mobile elements in viral evolution. Expansion of the number of mycobacteriophages grouped within Cluster A provides insights into the basis of immune specificity in these temperate phages, and we also describe a novel example of apparent immunity theft. The isolation and genomic analysis of bacteriophages by freshman college students provides an example of an authentic research experience for novice scientists.

  15. Mitochondrial genome diversity and population structure of the giant squid Architeuthis

    DEFF Research Database (Denmark)

    Winkelmann, Inger Eleanor Hall; Campos, Paula; Strugnell, Jan

    2013-01-01

    techniques, considerable controversy exists with regard to topics as varied as their taxonomy, biology and even behaviour. In this study, we have characterized the mitochondrial genome (mitogenome) diversity of 43 Architeuthis samples collected from across the range of the species, in order to use genetic...

  16. The genomic diversity and stability of field strains of Suid herpesvirus 1 (Aujeszky's disease virus)

    DEFF Research Database (Denmark)

    Christensen, Laurids Siig; Sørensen, K. J.

    1991-01-01

    The genomic diversity among isolates of suid herpesvirus 1 (SHV-1) collected in the same herd and among clones from the same isolate was studied by restriction fragment pattern (RFP) analysis using BamHI. Tentatively defining a field strain as a transmissible entity, it was concluded that strains...

  17. Nucleotide diversity of the Chlamydomonas reinhardtii plastid genome: addressing the mutational-hazard hypothesis.

    Science.gov (United States)

    Smith, David Roy; Lee, Robert W

    2009-05-27

    The mutational-hazard hypothesis argues that the noncoding-DNA content of a genome is a consequence of the mutation rate (mu) and the effective number of genes per locus in the population (N(g)). The hypothesis predicts that genomes with a high N(g)mu will be more compact than those with a small N(g)mu. Approximations of N(g)mu can be gained by measuring the nucleotide diversity at silent sites (pi(silent)). We addressed the mutation-hazard hypothesis apropos plastid-genome evolution by measuring pi(silent) of the Chlamydomonas reinhardtii plastid DNA (ptDNA), the most noncoding-DNA-dense plastid genome observed to date. The data presented here in conjunction with previously published values of pi(silent) for the C. reinhardtii mitochondrial and nuclear genomes, which are respectively compact and bloated, allow for a complete analysis of nucleotide diversity and genome compactness in all three genetic compartments of this model organism. In C. reinhardtii, the mean estimate of pi(silent) for the ptDNA (14.5 x 10(-3)) is less than that of the nuclear DNA (32 x 10(-3)) and greater than that of the mitochondrial DNA (8.5 x 10(-3)). On average, C. reinhardtii has approximately 4 times more silent-site ptDNA diversity than the mean value reported for land plants, which have more compact plastid genomes. The silent-site nucleotide diversity of the different ptDNA loci that were studied varied significantly: from 0 to 71 x 10(-3) for synonymous sites and from 0 to 42 x 10(-3) for intergenic regions. Our findings on silent-site ptDNA diversity are inconsistent with what would be expected under the mutational-hazard hypothesis and go against the documented trend in other systems of pi(silent) positively correlating with genome compactness. Overall, we highlight the lack of reliable nucleotide-diversity measurements for ptDNA and hope that the values presented here will act as sound data for future research concerning the mutational-hazard hypothesis and plastid evolution in

  18. Ribosomal RNA diversity predicts genome diversity in gut bacteria and their relatives.

    Science.gov (United States)

    Zaneveld, Jesse R; Lozupone, Catherine; Gordon, Jeffrey I; Knight, Rob

    2010-07-01

    The mammalian gut is an attractive model for exploring the general question of how habitat impacts the evolution of gene content. Therefore, we have characterized the relationship between 16 S rRNA gene sequence similarity and overall levels of gene conservation in four groups of species: gut specialists and cosmopolitans, each of which can be divided into pathogens and non-pathogens. At short phylogenetic distances, specialist or cosmopolitan bacteria found in the gut share fewer genes than is typical for genomes that come from non-gut environments, but at longer phylogenetic distances gut bacteria are more similar to each other than are genomes at equivalent evolutionary distances from non-gut environments, suggesting a pattern of short-term specialization but long-term convergence. Moreover, this pattern is observed in both pathogens and non-pathogens, and can even be seen in the plasmids carried by gut bacteria. This observation is consistent with the finding that, despite considerable interpersonal variation in species content, there is surprising functional convergence in the microbiome of different humans. Finally, we observe that even within bacterial species or genera 16S rRNA divergence provides useful information about average conservation of gene content. The results described here should be useful for guiding strain selection to maximize novel gene discovery in large-scale genome sequencing projects, while the approach could be applied in studies seeking to understand the effects of habitat adaptation on genome evolution across other body habitats or environment types.

  19. Diversity through duplication: whole-genome sequencing reveals novel gene retrocopies in the human population.

    Science.gov (United States)

    Richardson, Sandra R; Salvador-Palomeque, Carmen; Faulkner, Geoffrey J

    2014-05-01

    Gene retrocopies are generated by reverse transcription and genomic integration of mRNA. As such, retrocopies present an important exception to the central dogma of molecular biology, and have substantially impacted the functional landscape of the metazoan genome. While an estimated 8,000-17,000 retrocopies exist in the human genome reference sequence, the extent of variation between individuals in terms of retrocopy content has remained largely unexplored. Three recent studies by Abyzov et al., Ewing et al. and Schrider et al. have exploited 1,000 Genomes Project Consortium data, as well as other sources of whole-genome sequencing data, to uncover novel gene retrocopies. Here, we compare the methods and results of these three studies, highlight the impact of retrocopies in human diversity and genome evolution, and speculate on the potential for somatic gene retrocopies to impact cancer etiology and genetic diversity among individual neurons in the mammalian brain. © 2014 The Authors. Bioessays published by WILEY Periodicals, Inc.

  20. Genetic Diversity and Reassortment of Hantaan Virus Tripartite RNA Genomes in Nature, the Republic of Korea.

    Directory of Open Access Journals (Sweden)

    Jeong-Ah Kim

    2016-06-01

    Full Text Available Hantaan virus (HTNV, a negative sense tripartite RNA virus of the Family Bunyaviridae, is the most prevalent hantavirus in the Republic of Korea (ROK. It is the causative agent of Hemorrhagic Fever with Renal Syndrome (HFRS in humans and maintained in the striped field mouse, Apodemus agrarius, the primary zoonotic host. Clinical HFRS cases have been reported commonly in HFRS-endemic areas of Gyeonggi province. Recently, the death of a member of the ROK military from Gangwon province due to HFRS prompted an investigation of the epidemiology and distribution of hantaviruses in Gangwon and Gyeonggi provinces that border the demilitarized zone separating North and South Korea.To elucidate the geographic distribution and molecular diversity of HTNV, whole genome sequences of HTNV Large (L, Medium (M, and Small (S segments were acquired from lung tissues of A. agrarius captured from 2003-2014. Consistent with the clinical incidence of HFRS established by the Korea Centers for Disease Control & Prevention (KCDC, the prevalence of HTNV in naturally infected mice in Gangwon province was lower than for Gyeonggi province. Whole genomic sequences of 34 HTNV strains were identified and a phylogenetic analysis showed geographic diversity of the virus in the limited areas. Reassortment analysis first suggested an occurrence of genetic exchange of HTNV genomes in nature, ROK.This study is the first report to demonstrate the molecular prevalence of HTNV in Gangwon province. Whole genome sequencing of HTNV showed well-supported geographic lineages and the molecular diversity in the northern region of ROK due to a natural reassortment of HTNV genomes. These observations contribute to a better understanding of the genetic diversity and molecular evolution of hantaviruses. Also, the full-length of HTNV tripartite genomes will provide a database for phylogeographic analysis of spatial and temporal outbreaks of hantavirus infection.

  1. Co-invading symbiotic mutualists of Medicago polymorpha retain high ancestral diversity and contain diverse accessory genomes.

    Science.gov (United States)

    Porter, Stephanie S; Faber-Hammond, Joshua J; Friesen, Maren L

    2018-01-01

    Exotic, invasive plants and animals can wreak havoc on ecosystems by displacing natives and altering environmental conditions. However, much less is known about the identities or evolutionary dynamics of the symbiotic microbes that accompany invasive species. Most leguminous plants rely upon symbiotic rhizobium bacteria to fix nitrogen and are incapable of colonizing areas devoid of compatible rhizobia. We compare the genomes of symbiotic rhizobia in a portion of the legume's invaded range with those of the rhizobium symbionts from across the legume's native range. We show that in an area of California the legume Medicago polymorpha has invaded, its Ensifer medicae symbionts: (i) exhibit genome-wide patterns of relatedness that together with historical evidence support host-symbiont co-invasion from Europe into California, (ii) exhibit population genomic patterns consistent with the introduction of the majority of deep diversity from the native range, rather than a genetic bottleneck during colonization of California and (iii) harbor a large set of accessory genes uniquely enriched in binding functions, which could play a role in habitat invasion. Examining microbial symbiont genome dynamics during biological invasions is critical for assessing host-symbiont co-invasions whereby microbial symbiont range expansion underlies plant and animal invasions. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  2. Impact of marker ascertainment bias on genomic selection accuracy and estimates of genetic diversity.

    Directory of Open Access Journals (Sweden)

    Nicolas Heslot

    Full Text Available Genome-wide molecular markers are often being used to evaluate genetic diversity in germplasm collections and for making genomic selections in breeding programs. To accurately predict phenotypes and assay genetic diversity, molecular markers should assay a representative sample of the polymorphisms in the population under study. Ascertainment bias arises when marker data is not obtained from a random sample of the polymorphisms in the population of interest. Genotyping-by-sequencing (GBS is rapidly emerging as a low-cost genotyping platform, even for the large, complex, and polyploid wheat (Triticum aestivum L. genome. With GBS, marker discovery and genotyping occur simultaneously, resulting in minimal ascertainment bias. The previous platform of choice for whole-genome genotyping in many species such as wheat was DArT (Diversity Array Technology and has formed the basis of most of our knowledge about cereals genetic diversity. This study compared GBS and DArT marker platforms for measuring genetic diversity and genomic selection (GS accuracy in elite U.S. soft winter wheat. From a set of 365 breeding lines, 38,412 single nucleotide polymorphism GBS markers were discovered and genotyped. The GBS SNPs gave a higher GS accuracy than 1,544 DArT markers on the same lines, despite 43.9% missing data. Using a bootstrap approach, we observed significantly more clustering of markers and ascertainment bias with DArT relative to GBS. The minor allele frequency distribution of GBS markers had a deficit of rare variants compared to DArT markers. Despite the ascertainment bias of the DArT markers, GS accuracy for three traits out of four was not significantly different when an equal number of markers were used for each platform. This suggests that the gain in accuracy observed using GBS compared to DArT markers was mainly due to a large increase in the number of markers available for the analysis.

  3. Characterization of Human Cytomegalovirus Genome Diversity in Immunocompromised Hosts by Whole-Genome Sequencing Directly From Clinical Specimens.

    Science.gov (United States)

    Hage, Elias; Wilkie, Gavin S; Linnenweber-Held, Silvia; Dhingra, Akshay; Suárez, Nicolás M; Schmidt, Julius J; Kay-Fedorov, Penelope C; Mischak-Weissinger, Eva; Heim, Albert; Schwarz, Anke; Schulz, Thomas F; Davison, Andrew J; Ganzenmueller, Tina

    2017-06-01

    Advances in next-generation sequencing (NGS) technologies allow comprehensive studies of genetic diversity over the entire genome of human cytomegalovirus (HCMV), a significant pathogen for immunocompromised individuals. Next-generation sequencing was performed on target enriched sequence libraries prepared directly from a variety of clinical specimens (blood, urine, breast milk, respiratory samples, biopsies, and vitreous humor) obtained longitudinally or from different anatomical compartments from 20 HCMV-infected patients (renal transplant recipients, stem cell transplant recipients, and congenitally infected children). De novo-assembled HCMV genome sequences were obtained for 57 of 68 sequenced samples. Analysis of longitudinal or compartmental HCMV diversity revealed various patterns: no major differences were detected among longitudinal, intraindividual blood samples from 9 of 15 patients and in most of the patients with compartmental samples, whereas a switch of the major HCMV population was observed in 6 individuals with sequential blood samples and upon compartmental analysis of 1 patient with HCMV retinitis. Variant analysis revealed additional aspects of minor virus population dynamics and antiviral-resistance mutations. In immunosuppressed patients, HCMV can remain relatively stable or undergo drastic genomic changes that are suggestive of the emergence of minor resident strains or de novo infection.

  4. Diversity of 23S rRNA genes within individual prokaryotic genomes.

    Directory of Open Access Journals (Sweden)

    Anna Pei

    Full Text Available BACKGROUND: The concept of ribosomal constraints on rRNA genes is deduced primarily based on the comparison of consensus rRNA sequences between closely related species, but recent advances in whole-genome sequencing allow evaluation of this concept within organisms with multiple rRNA operons. METHODOLOGY/PRINCIPAL FINDINGS: Using the 23S rRNA gene as an example, we analyzed the diversity among individual rRNA genes within a genome. Of 184 prokaryotic species containing multiple 23S rRNA genes, diversity was observed in 113 (61.4% genomes (mean 0.40%, range 0.01%-4.04%. Significant (1.17%-4.04% intragenomic variation was found in 8 species. In 5 of the 8 species, the diversity in the primary structure had only minimal effect on the secondary structure (stem versus loop transition. In the remaining 3 species, the diversity significantly altered local secondary structure, but the alteration appears minimized through complex rearrangement. Intervening sequences (IVS, ranging between 9 and 1471 nt in size, were found in 7 species. IVS in Deinococcus radiodurans and Nostoc sp. encode transposases. T. tengcongensis was the only species in which intragenomic diversity >3% was observed among 4 paralogous 23S rRNA genes. CONCLUSIONS/SIGNIFICANCE: These findings indicate tight ribosomal constraints on individual 23S rRNA genes within a genome. Although classification using primary 23S rRNA sequences could be erroneous, significant diversity among paralogous 23S rRNA genes was observed only once in the 184 species analyzed, indicating little overall impact on the mainstream of 23S rRNA gene-based prokaryotic taxonomy.

  5. Artificial selection with traditional or genomic relationships: consequences in coancestry and genetic diversity

    Directory of Open Access Journals (Sweden)

    Silvia T Rodriguez-Ramilo

    2015-04-01

    Full Text Available Estimated breeding values (EBVs are traditionally obtained from pedigree information. However, EBVs from high-density genotypes can have higher accuracy than EBVs from pedigree information. At the same time, it has been shown that EBVs from genomic data lead to lower increases in inbreeding compared with traditional selection based on genealogies. Here we evaluate the performance with BLUP selection based on genealogical coancestry with three different genome-based coancestry estimates: (1 an estimate based on shared segments of homozygosity, (2 an approach based on SNP-by-SNP count corrected by allelic frequencies, and (3 the identity by state methodology. We evaluate the effect of different population sizes, different numberof genomic markers, and several heritability values for a quantitative trait. The performance of the different measures of coancestry in BLUP is evaluated in the true breeding values after truncation selection and also in terms of coancestry and diversity maintained.Accordingly, cross-performances were also carried out, that is, how prediction based on genealogical records impacts the three other measures of coancestry and inbreeding, and viceversa. Our results show that the genetic gains are very similar for allfour coancestries, but the genomic-based methods are superior to using genealogical coancestries in terms of maintainingdiversity measured as observed heterozygosity. Furthermore, the measure of coancestry based on shared segments of the genome seems to provide slightly better results on some scenarios, and the increase in inbreeding and loss in diversity is only slightly larger than the other genomic selection methods in those scenarios. Our results shed light on genomic selection versus traditional genealogical-based BLUP and make the case to manage the population variability using genomic information to preserve the future success of selection programmes.

  6. Comparative genomics of Mycoplasma: analysis of conserved essential genes and diversity of the pan-genome.

    Directory of Open Access Journals (Sweden)

    Wei Liu

    Full Text Available Mycoplasma, the smallest self-replicating organism with a minimal metabolism and little genomic redundancy, is expected to be a close approximation to the minimal set of genes needed to sustain bacterial life. This study employs comparative evolutionary analysis of twenty Mycoplasma genomes to gain an improved understanding of essential genes. By analyzing the core genome of mycoplasmas, we finally revealed the conserved essential genes set for mycoplasma survival. Further analysis showed that the core genome set has many characteristics in common with experimentally identified essential genes. Several key genes, which are related to DNA replication and repair and can be disrupted in transposon mutagenesis studies, may be critical for bacteria survival especially over long period natural selection. Phylogenomic reconstructions based on 3,355 homologous groups allowed robust estimation of phylogenetic relatedness among mycoplasma strains. To obtain deeper insight into the relative roles of molecular evolution in pathogen adaptation to their hosts, we also analyzed the positive selection pressures on particular sites and lineages. There appears to be an approximate correlation between the divergence of species and the level of positive selection detected in corresponding lineages.

  7. Genomic diversity and introgression in O. sativa reveal the impact of domestication and breeding on the rice genome.

    Directory of Open Access Journals (Sweden)

    Keyan Zhao

    Full Text Available BACKGROUND: The domestication of Asian rice (Oryza sativa was a complex process punctuated by episodes of introgressive hybridization among and between subpopulations. Deep genetic divergence between the two main varietal groups (Indica and Japonica suggests domestication from at least two distinct wild populations. However, genetic uniformity surrounding key domestication genes across divergent subpopulations suggests cultural exchange of genetic material among ancient farmers. METHODOLOGY/PRINCIPAL FINDINGS: In this study, we utilize a novel 1,536 SNP panel genotyped across 395 diverse accessions of O. sativa to study genome-wide patterns of polymorphism, to characterize population structure, and to infer the introgression history of domesticated Asian rice. Our population structure analyses support the existence of five major subpopulations (indica, aus, tropical japonica, temperate japonica and GroupV consistent with previous analyses. Our introgression analysis shows that most accessions exhibit some degree of admixture, with many individuals within a population sharing the same introgressed segment due to artificial selection. Admixture mapping and association analysis of amylose content and grain length illustrate the potential for dissecting the genetic basis of complex traits in domesticated plant populations. CONCLUSIONS/SIGNIFICANCE: Genes in these regions control a myriad of traits including plant stature, blast resistance, and amylose content. These analyses highlight the power of population genomics in agricultural systems to identify functionally important regions of the genome and to decipher the role of human-directed breeding in refashioning the genomes of a domesticated species.

  8. A Genomic Encyclopedia of the Root Nodule Bacteria: assessing genetic diversity through a systematic biogeographic survey.

    Science.gov (United States)

    Reeve, Wayne; Ardley, Julie; Tian, Rui; Eshragi, Leila; Yoon, Je Won; Ngamwisetkun, Pinyaruk; Seshadri, Rekha; Ivanova, Natalia N; Kyrpides, Nikos C

    2015-01-01

    Root nodule bacteria are free-living soil bacteria, belonging to diverse genera within the Alphaproteobacteria and Betaproteobacteria, that have the capacity to form nitrogen-fixing symbioses with legumes. The symbiosis is specific and is governed by signaling molecules produced from both host and bacteria. Sequencing of several model RNB genomes has provided valuable insights into the genetic basis of symbiosis. However, the small number of sequenced RNB genomes available does not currently reflect the phylogenetic diversity of RNB, or the variety of mechanisms that lead to symbiosis in different legume hosts. This prevents a broad understanding of symbiotic interactions and the factors that govern the biogeography of host-microbe symbioses. Here, we outline a proposal to expand the number of sequenced RNB strains, which aims to capture this phylogenetic and biogeographic diversity. Through the Vavilov centers of diversity (Proposal ID: 231) and GEBA-RNB (Proposal ID: 882) projects we will sequence 107 RNB strains, isolated from diverse legume hosts in various geographic locations around the world. The nominated strains belong to nine of the 16 currently validly described RNB genera. They include 13 type strains, as well as elite inoculant strains of high commercial importance. These projects will strongly support systematic sequence-based studies of RNB and contribute to our understanding of the effects of biogeography on the evolution of different species of RNB, as well as the mechanisms that determine the specificity and effectiveness of nodulation and symbiotic nitrogen fixation by RNB with diverse legume hosts.

  9. Genetic diversity and genomic strategies for improving drought and waterlogging tolerance in soybeans.

    Science.gov (United States)

    Valliyodan, Babu; Ye, Heng; Song, Li; Murphy, MacKensie; Shannon, J Grover; Nguyen, Henry T

    2017-04-01

    Drought and its interaction with high temperature are the major abiotic stress factors affecting soybean yield and production stability. Ongoing climate changes are anticipated to intensify drought events, which will further impact crop production and food security. However, excessive water also limits soybean production. The success of soybean breeding programmes for crop improvement is dependent on the extent of genetic variation present in the germplasm base. Screening for natural genetic variation in drought- and flooding tolerance-related traits, including root system architecture, water and nitrogen-fixation efficiency, and yield performance indices, has helped to identify the best resources for genetic studies in soybean. Genomic resources, including whole-genome sequences of diverse germplasms, millions of single-nucleotide polymorphisms, and high-throughput marker genotyping platforms, have expedited gene and marker discovery for translational genomics in soybean. This review highlights the current knowledge of the genetic diversity and quantitative trait loci associated with root system architecture, canopy wilting, nitrogen-fixation ability, and flooding tolerance that contributes to the understanding of drought- and flooding-tolerance mechanisms in soybean. Next-generation mapping approaches and high-throughput phenotyping will facilitate a better understanding of phenotype-genotype associations and help to formulate genomic-assisted breeding strategies, including genomic selection, in soybean for tolerance to drought and flooding stress. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  10. Scanning the landscape of genome architecture of non-O1 and non-O139 Vibrio cholerae by whole genome mapping reveals extensive population genetic diversity.

    Science.gov (United States)

    Chapman, Carol; Henry, Matthew; Bishop-Lilly, Kimberly A; Awosika, Joy; Briska, Adam; Ptashkin, Ryan N; Wagner, Trevor; Rajanna, Chythanya; Tsang, Hsinyi; Johnson, Shannon L; Mokashi, Vishwesh P; Chain, Patrick S G; Sozhamannan, Shanmuga

    2015-01-01

    Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, ordered restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks.

  11. Genome sequence diversity and clues to the evolution of variola (smallpox) virus.

    Science.gov (United States)

    Esposito, Joseph J; Sammons, Scott A; Frace, A Michael; Osborne, John D; Olsen-Rasmussen, Melissa; Zhang, Ming; Govil, Dhwani; Damon, Inger K; Kline, Richard; Laker, Miriam; Li, Yu; Smith, Geoffrey L; Meyer, Hermann; Leduc, James W; Wohlhueter, Robert M

    2006-08-11

    Comparative genomics of 45 epidemiologically varied variola virus isolates from the past 30 years of the smallpox era indicate low sequence diversity, suggesting that there is probably little difference in the isolates' functional gene content. Phylogenetic clustering inferred three clades coincident with their geographical origin and case-fatality rate; the latter implicated putative proteins that mediate viral virulence differences. Analysis of the viral linear DNA genome suggests that its evolution involved direct descent and DNA end-region recombination events. Knowing the sequences will help understand the viral proteome and improve diagnostic test precision, therapeutics, and systems for their assessment.

  12. Genome-based studies of marine microorganisms to maximize the diversity of natural products discovery for medical treatments.

    Science.gov (United States)

    Zhao, Xin-Qing

    2011-01-01

    Marine microorganisms are rich source for natural products which play important roles in pharmaceutical industry. Over the past decade, genome-based studies of marine microorganisms have unveiled the tremendous diversity of the producers of natural products and also contributed to the efficiency of harness the strain diversity and chemical diversity, as well as the genetic diversity of marine microorganisms for the rapid discovery and generation of new natural products. In the meantime, genomic information retrieved from marine symbiotic microorganisms can also be employed for the discovery of new medical molecules from yet-unculturable microorganisms. In this paper, the recent progress in the genomic research of marine microorganisms is reviewed; new tools of genome mining as well as the advance in the activation of orphan pathways and metagenomic studies are summarized. Genome-based research of marine microorganisms will maximize the biodiscovery process and solve the problems of supply and sustainability of drug molecules for medical treatments.

  13. Genome-Based Studies of Marine Microorganisms to Maximize the Diversity of Natural Products Discovery for Medical Treatments

    Directory of Open Access Journals (Sweden)

    Xin-Qing Zhao

    2011-01-01

    Full Text Available Marine microorganisms are rich source for natural products which play important roles in pharmaceutical industry. Over the past decade, genome-based studies of marine microorganisms have unveiled the tremendous diversity of the producers of natural products and also contributed to the efficiency of harness the strain diversity and chemical diversity, as well as the genetic diversity of marine microorganisms for the rapid discovery and generation of new natural products. In the meantime, genomic information retrieved from marine symbiotic microorganisms can also be employed for the discovery of new medical molecules from yet-unculturable microorganisms. In this paper, the recent progress in the genomic research of marine microorganisms is reviewed; new tools of genome mining as well as the advance in the activation of orphan pathways and metagenomic studies are summarized. Genome-based research of marine microorganisms will maximize the biodiscovery process and solve the problems of supply and sustainability of drug molecules for medical treatments.

  14. Extensive Genomic Diversity among Bovine-Adapted Staphylococcus aureus: Evidence for a Genomic Rearrangement within CC97.

    Directory of Open Access Journals (Sweden)

    Kathleen E Budd

    Full Text Available Staphylococcus aureus is an important pathogen associated with both human and veterinary disease and is a common cause of bovine mastitis. Genomic heterogeneity exists between S. aureus strains and has been implicated in the adaptation of specific strains to colonise particular mammalian hosts. Knowledge of the factors required for host specificity and virulence is important for understanding the pathogenesis and management of S. aureus mastitis. In this study, a panel of mastitis-associated S. aureus isolates (n = 126 was tested for resistance to antibiotics commonly used to treat mastitis. Over half of the isolates (52% demonstrated resistance to penicillin and ampicillin but all were susceptible to the other antibiotics tested. S. aureus isolates were further examined for their clonal diversity by Multi-Locus Sequence Typing (MLST. In total, 18 different sequence types (STs were identified and eBURST analysis demonstrated that the majority of isolates grouped into clonal complexes CC97, CC151 or sequence type (ST 136. Analysis of the role of recombination events in determining S. aureus population structure determined that ST diversification through nucleotide substitutions were more likely to be due to recombination compared to point mutation, with regions of the genome possibly acting as recombination hotspots. DNA microarray analysis revealed a large number of differences amongst S. aureus STs in their variable genome content, including genes associated with capsule and biofilm formation and adhesion factors. Finally, evidence for a genomic arrangement was observed within isolates from CC97 with the ST71-like subgroup showing evidence of an IS431 insertion element having replaced approximately 30 kb of DNA including the ica operon and histidine biosynthesis genes, resulting in histidine auxotrophy. This genomic rearrangement may be responsible for the diversification of ST71 into an emerging bovine adapted subgroup.

  15. The expanded diversity of methylophilaceae from Lake Washington through cultivation and genomic sequencing of novel ecotypes.

    Directory of Open Access Journals (Sweden)

    David A C Beck

    Full Text Available We describe five novel Methylophilaceae ecotypes from a single ecological niche in Lake Washington, USA, and compare them to three previously described ecotypes, in terms of their phenotype and genome sequence divergence. Two of the ecotypes appear to represent novel genera within the Methylophilaceae. Genome-based metabolic reconstruction highlights metabolic versatility of Methylophilaceae with respect to methylotrophy and nitrogen metabolism, different ecotypes possessing different combinations of primary substrate oxidation systems (MxaFI-type methanol dehydrogenase versus XoxF-type methanol dehydrogenase; methylamine dehydrogenase versus N-methylglutamate pathway and different potentials for denitrification (assimilatory versus respiratory nitrate reduction. By comparing pairs of closely related genomes, we uncover that site-specific recombination is the main means of genomic evolution and strain divergence, including lateral transfers of genes from both closely- and distantly related taxa. The new ecotypes and the new genomes contribute significantly to our understanding of the extent of genomic and metabolic diversity among organisms of the same family inhabiting the same ecological niche. These organisms also provide novel experimental models for studying the complexity and the function of the microbial communities active in methylotrophy.

  16. Plastid genomics in horticultural species: Importance and applications for plant diversity, evolution and biotechnology

    Directory of Open Access Journals (Sweden)

    Marcelo eRogalski

    2015-07-01

    Full Text Available During the evolution of the eukaryotic cell, plastids and mitochondria arose from an endosymbiotic process, which determined the presence of three genetic compartments into the incipient plant cell. After that, these three genetic materials from host and symbiont suffered several rearrangements, bringing on a complex interaction between nuclear and organellar gene products. Nowadays, plastids harbor a small genome with ~130 genes in a 100-220 kb sequence in higher plants. Plastid genes are mostly highly conserved between plant species, being useful for phylogenetic analysis in higher taxa. However, intergenic spacers have a relatively higher mutation rate and are important markers to study genetic diversity and divergence within natural plant populations. The predominant uniparental inheritance of plastids is like a highly desirable feature for phylogeny studies. Moreover, the gene content and genome rearrangements are efficient tools to capture and understand evolutionary events between different plant species. Currently, genetic engineering of the plastid genome (plastome offers a number of attractive advantages as high-level of foreign protein expression, marker-gene excision, gene expression in operon and transgene containment because of maternal inheritance of plastid genome in most crops. Therefore, plastid genome can be used for adding new characteristics related to synthesis of metabolic compounds, biopharmaceutical and tolerance to biotic and abiotic stresses. Here, we describe the importance and applications of plastid genome as tools for genetic and evolutionary studies, and plastid transformation focusing on increasing the performance of horticultural species in the field.

  17. Genome Microscale Heterogeneity among Wild Potatoes Revealed by Diversity Arrays Technology Marker Sequences

    Directory of Open Access Journals (Sweden)

    Alessandra Traini

    2013-01-01

    Full Text Available Tuber-bearing potato species possess several genes that can be exploited to improve the genetic background of the cultivated potato Solanum tuberosum. Among them, S. bulbocastanum and S. commersonii are well known for their strong resistance to environmental stresses. However, scant information is available for these species in terms of genome organization, gene function, and regulatory networks. Consequently, genomic tools to assist breeding are meager, and efficient exploitation of these species has been limited so far. In this paper, we employed the reference genome sequences from cultivated potato and tomato and a collection of sequences of 1,423 potato Diversity Arrays Technology (DArT markers that show polymorphic representation across the genomes of S. bulbocastanum and/or S. commersonii genotypes. Our results highlighted microscale genome sequence heterogeneity that may play a significant role in functional and structural divergence between related species. Our analytical approach provides knowledge of genome structural and sequence variability that could not be detected by transcriptome and proteome approaches.

  18. Twenty-one genome sequences from Pseudomonas species and 19 genome sequences from diverse bacteria isolated from the rhizosphere and endosphere of Populus deltoides.

    Science.gov (United States)

    Brown, Steven D; Utturkar, Sagar M; Klingeman, Dawn M; Johnson, Courtney M; Martin, Stanton L; Land, Miriam L; Lu, Tse-Yuan S; Schadt, Christopher W; Doktycz, Mitchel J; Pelletier, Dale A

    2012-11-01

    To aid in the investigation of the Populus deltoides microbiome, we generated draft genome sequences for 21 Pseudomonas strains and 19 other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium, and Variovorax were generated.

  19. Twenty-One Genome Sequences from Pseudomonas Species and 19 Genome Sequences from Diverse Bacteria Isolated from the Rhizosphere and Endosphere of Populus deltoides

    OpenAIRE

    Brown, Steven D.; Utturkar, Sagar M.; Klingeman, Dawn M.; Johnson, Courtney M.; Martin, Stanton L.; Land, Miriam L.; Lu, Tse-Yuan S.; Schadt, Christopher W.; Doktycz, Mitchel J.; Pelletier, Dale A.

    2012-01-01

    To aid in the investigation of the Populus deltoides microbiome, we generated draft genome sequences for 21 Pseudomonas strains and 19 other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium, and Variovorax were generated.

  20. Twenty-One Genome Sequences from Pseudomonas Species and 19 Genome Sequences from Diverse Bacteria Isolated from the Rhizosphere and Endosphere of Populus deltoides

    Energy Technology Data Exchange (ETDEWEB)

    Brown, Steven D [ORNL; Utturkar, Sagar M [ORNL; Klingeman, Dawn Marie [ORNL; Johnson, Courtney M [ORNL; Martin, Stanton [ORNL; Land, Miriam L [ORNL; Lu, Tse-Yuan [ORNL; Schadt, Christopher Warren [ORNL; Doktycz, Mitchel John [ORNL; Pelletier, Dale A [ORNL

    2012-01-01

    To aid in the investigation of the Populus deltoides microbiome we generated draft genome sequences for twenty one Pseudomonas and twenty one other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Burkholderia, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium and Variovorax were generated.

  1. Within-host bacterial diversity hinders accurate reconstruction of transmission networks from genomic distance data.

    Science.gov (United States)

    Worby, Colin J; Lipsitch, Marc; Hanage, William P

    2014-03-01

    The prospect of using whole genome sequence data to investigate bacterial disease outbreaks has been keenly anticipated in many quarters, and the large-scale collection and sequencing of isolates from cases is becoming increasingly feasible. While sequence data can provide many important insights into disease spread and pathogen adaptation, it remains unclear how successfully they may be used to estimate individual routes of transmission. Several studies have attempted to reconstruct transmission routes using genomic data; however, these have typically relied upon restrictive assumptions, such as a shared topology of the phylogenetic tree and a lack of within-host diversity. In this study, we investigated the potential for bacterial genomic data to inform transmission network reconstruction. We used simulation models to investigate the origins, persistence and onward transmission of genetic diversity, and examined the impact of such diversity on our estimation of the epidemiological relationship between carriers. We used a flexible distance-based metric to provide a weighted transmission network, and used receiver-operating characteristic (ROC) curves and network entropy to assess the accuracy and uncertainty of the inferred structure. Our results suggest that sequencing a single isolate from each case is inadequate in the presence of within-host diversity, and is likely to result in misleading interpretations of transmission dynamics--under many plausible conditions, this may be little better than selecting transmission links at random. Sampling more frequently improves accuracy, but much uncertainty remains, even if all genotypes are observed. While it is possible to discriminate between clusters of carriers, individual transmission routes cannot be resolved by sequence data alone. Our study demonstrates that bacterial genomic distance data alone provide only limited information on person-to-person transmission dynamics.

  2. Genome-wide linkage disequilibrium and genetic diversity in five populations of Australian domestic sheep.

    Science.gov (United States)

    Al-Mamun, Hawlader Abdullah; Clark, Samuel A; Kwan, Paul; Gondro, Cedric

    2015-11-24

    Knowledge of the genetic structure and overall diversity of livestock species is important to maximise the potential of genome-wide association studies and genomic prediction. Commonly used measures such as linkage disequilibrium (LD), effective population size (N e ), heterozygosity, fixation index (F ST) and runs of homozygosity (ROH) are widely used and help to improve our knowledge about genetic diversity in animal populations. The development of high-density single nucleotide polymorphism (SNP) arrays and the subsequent genotyping of large numbers of animals have greatly increased the accuracy of these population-based estimates. In this study, we used the Illumina OvineSNP50 BeadChip array to estimate and compare LD (measured by r (2) and D'), N e , heterozygosity, F ST and ROH in five Australian sheep populations: three pure breeds, i.e., Merino (MER), Border Leicester (BL), Poll Dorset (PD) and two crossbred populations i.e. F1 crosses of Merino and Border Leicester (MxB) and MxB crossed to Poll Dorset (MxBxP). Compared to other livestock species, the sheep populations that were analysed in this study had low levels of LD and high levels of genetic diversity. The rate of LD decay was greater in Merino than in the other pure breeds. Over short distances (sheep breeds, especially in Merino sheep. The observed range of diversity will influence the design of genome-wide association studies and the results that can be obtained from them. This knowledge will also be useful to design reference populations for genomic prediction of breeding values in sheep.

  3. Acidobacteria form a coherent but highly diverse group within the bacterial domain: evidence from environmental genomics

    DEFF Research Database (Denmark)

    Quaiser, Achim; Ochsenreiter, Torsten; Lanz, Christa

    2003-01-01

    Acidobacteria have been established as a novel phylum of Bacteria that is consistently detected in many different habitats around the globe by 16S rDNA-based molecular surveys. The phylogenetic diversity, ubiquity and abundance of this group, particularly in soil habitats, suggest an important...... insert libraries directly from DNA of a calcerous grassland soil. Genomic fragments of Acidobacteria were identified with specific 16S rDNA probes and sequence analyses of six independently identified clones were performed, representing in total more than 210,000 bp. The 16S rRNA genes of the genomic...... fragments differed between 2.3% and 19.9% and were placed into two different subgroups of Acidobacteria (groups III and V). Although partial co-linearity was found between genomic fragments, the gene content around the rRNA operons was generally not conserved. Phylogenetic reconstructions with orthologues...

  4. Comparative Analysis of 35 Basidiomycete Genomes Reveals Diversity and Uniqueness of the Phylum

    Energy Technology Data Exchange (ETDEWEB)

    Riley, Robert; Salamov, Asaf; Otillar, Robert; Fagnan, Kirsten; Boussau, Bastien; Brown, Daren; Henrissat, Bernard; Levasseur, Anthony; Held, Benjamin; Nagy, Laszlo; Floudas, Dimitris; Morin, Emmanuelle; Manning, Gerard; Baker, Scott; Martin, Francis; Blanchette, Robert; Hibbett, David; Grigoriev, Igor V.

    2013-03-11

    Fungi of the phylum Basidiomycota (basidiomycetes), make up some 37percent of the described fungi, and are important in forestry, agriculture, medicine, and bioenergy. This diverse phylum includes symbionts, pathogens, and saprobes including wood decaying fungi. To better understand the diversity of this phylum we compared the genomes of 35 basidiomycete fungi including 6 newly sequenced genomes. The genomes of basidiomycetes span extremes of genome size, gene number, and repeat content. A phylogenetic tree of Basidiomycota was generated using the Phyldog software, which uses all available protein sequence data to simultaneously infer gene and species trees. Analysis of core genes reveals that some 48percent of basidiomycete proteins are unique to the phylum with nearly half of those (22percent) comprising proteins found in only one organism. Phylogenetic patterns of plant biomass-degrading genes suggest a continuum rather than a sharp dichotomy between the white rot and brown rot modes of wood decay among the members of Agaricomycotina subphylum. There is a correlation of the profile of certain gene families to nutritional mode in Agaricomycotina. Based on phylogenetically-informed PCA analysis of such profiles, we predict that that Botryobasidium botryosum and Jaapia argillacea have properties similar to white rot species, although neither has liginolytic class II fungal peroxidases. Furthermore, we find that both fungi exhibit wood decay with white rot-like characteristics in growth assays. Analysis of the rate of discovery of proteins with no or few homologs suggests the high value of continued sequencing of basidiomycete fungi.

  5. Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes Fungi

    Energy Technology Data Exchange (ETDEWEB)

    Ohm, Robin A.; Feau, Nicolas; Henrissat, Bernard; Schoch, Conrad L.; Horwitz, Benjamin A.; Barry, Kerrie W.; Condon, Bradford J.; Copeland, Alex C.; Dhillon, Braham; Glaser, Fabian; Hesse, Cedar N.; Kosti, Idit; LaButti, Kurt; Lindquist, Erika A.; Lucas, Susan; Salamov, Asaf A.; Bradshaw, Rosie E.; Ciuffetti, Lynda; Hamelin, Richard C.; Kema, Gert H. J.; Lawrence, Christopher; Scott, James A.; Spatafora, Joseph W.; Turgeon, B. Gillian; Wit, Pierre J. G. M. de; Zhong, Shaobin; Goodwin, Stephen B.; Grigoriev, Igor V.

    2012-02-29

    The class Dothideomycetes is one of the largest groups of fungi with a high level of ecological diversity including many plant pathogens infecting a broad range of hosts. Here, we compare genome features of 18 members of this class, including 6 necrotrophs, 9 (hemi)biotrophs and 3 saprotrophs, to analyze genome structure, evolution, and the diverse strategies of pathogenesis. The Dothideomycetes most likely evolved from a common ancestor more than 280 million years ago. The 18 genome sequences differ dramatically in size due to variation in repetitive content, but show much less variation in number of (core) genes. Gene order appears to have been rearranged mostly within chromosomal boundaries by multiple inversions, in extant genomes frequently demarcated by adjacent simple repeats. Several Dothideomycetes contain one or more gene-poor, transposable element (TE)-rich putatively dispensable chromosomes of unknown function. The 18 Dothideomycetes offer an extensive catalogue of genes involved in cellulose degradation, proteolysis, secondary metabolism, and cysteine-rich small secreted proteins. Ancestors of the two major orders of plant pathogens in the Dothideomycetes, the Capnodiales and Pleosporales, may have had different modes of pathogenesis, with the former having fewer of these genes than the latter. Many of these genes are enriched in proximity to transposable elements, suggesting faster evolution because of the effects of repeat induced point (RIP) mutations. A syntenic block of genes, including oxidoreductases, is conserved in most Dothideomycetes and upregulated during infection in L. maculans, suggesting a possible function in response to oxidative stress.

  6. Diverse lifestyles and strategies of plant pathogenesis encoded in the genomes of eighteen Dothideomycetes fungi.

    Directory of Open Access Journals (Sweden)

    Robin A Ohm

    Full Text Available The class Dothideomycetes is one of the largest groups of fungi with a high level of ecological diversity including many plant pathogens infecting a broad range of hosts. Here, we compare genome features of 18 members of this class, including 6 necrotrophs, 9 (hemibiotrophs and 3 saprotrophs, to analyze genome structure, evolution, and the diverse strategies of pathogenesis. The Dothideomycetes most likely evolved from a common ancestor more than 280 million years ago. The 18 genome sequences differ dramatically in size due to variation in repetitive content, but show much less variation in number of (core genes. Gene order appears to have been rearranged mostly within chromosomal boundaries by multiple inversions, in extant genomes frequently demarcated by adjacent simple repeats. Several Dothideomycetes contain one or more gene-poor, transposable element (TE-rich putatively dispensable chromosomes of unknown function. The 18 Dothideomycetes offer an extensive catalogue of genes involved in cellulose degradation, proteolysis, secondary metabolism, and cysteine-rich small secreted proteins. Ancestors of the two major orders of plant pathogens in the Dothideomycetes, the Capnodiales and Pleosporales, may have had different modes of pathogenesis, with the former having fewer of these genes than the latter. Many of these genes are enriched in proximity to transposable elements, suggesting faster evolution because of the effects of repeat induced point (RIP mutations. A syntenic block of genes, including oxidoreductases, is conserved in most Dothideomycetes and upregulated during infection in L. maculans, suggesting a possible function in response to oxidative stress.

  7. Unity in diversity: an overview of the genomic anthropology of India.

    Science.gov (United States)

    Mastana, Sarabjit S

    2014-01-01

    India is considered a treasure for geneticists and evolutionary biologists due to its vast human diversity, consisting of more than 4500 anthropologically well-defined populations (castes, tribes and religious groups). Each population differs in terms of endogamy, language, culture, physical features, geographic and climatic position and genetic architecture. These factors contributed to India-specific genetic variations which may be responsible for various common diseases in India and its migratory populations. As a result, interpretations of the origins and affinities of Indian populations as well as health and disease conditions require complex and sophisticated genetic analysis. Evidence of ancient human dispersals and settlements is preserved in the genome of Indian inhabitants and this has been extensively analysed in conventional and genomic analyses. Using genomic analyses of STRs and Alu on a set of populations, this study estimates the level and extent of genetic variation and its implications. The results show that Indian populations have a higher level of unique genetic diversity which is structured by many social processes and geographical attributes of the country. This overview highlights the need to study the anthropological structure and evolutionary history of Indian populations while designing genomic and epigenomic investigations.

  8. Neutral Theory Predicts the Relative Abundance and Diversity of Genetic Elements in a Broad Array of Eukaryotic Genomes

    Science.gov (United States)

    Serra, François; Becher, Verónica; Dopazo, Hernán

    2013-01-01

    It is universally true in ecological communities, terrestrial or aquatic, temperate or tropical, that some species are very abundant, others are moderately common, and the majority are rare. Likewise, eukaryotic genomes also contain classes or “species” of genetic elements that vary greatly in abundance: DNA transposons, retrotransposons, satellite sequences, simple repeats and their less abundant functional sequences such as RNA or genes. Are the patterns of relative species abundance and diversity similar among ecological communities and genomes? Previous dynamical models of genomic diversity have focused on the selective forces shaping the abundance and diversity of transposable elements (TEs). However, ideally, models of genome dynamics should consider not only TEs, but also the diversity of all genetic classes or “species” populating eukaryotic genomes. Here, in an analysis of the diversity and abundance of genetic elements in >500 eukaryotic chromosomes, we show that the patterns are consistent with a neutral hypothesis of genome assembly in virtually all chromosomes tested. The distributions of relative abundance of genetic elements are quite precisely predicted by the dynamics of an ecological model for which the principle of functional equivalence is the main assumption. We hypothesize that at large temporal scales an overarching neutral or nearly neutral process governs the evolution of abundance and diversity of genetic elements in eukaryotic genomes. PMID:23798991

  9. Neutral theory predicts the relative abundance and diversity of genetic elements in a broad array of eukaryotic genomes.

    Directory of Open Access Journals (Sweden)

    François Serra

    Full Text Available It is universally true in ecological communities, terrestrial or aquatic, temperate or tropical, that some species are very abundant, others are moderately common, and the majority are rare. Likewise, eukaryotic genomes also contain classes or "species" of genetic elements that vary greatly in abundance: DNA transposons, retrotransposons, satellite sequences, simple repeats and their less abundant functional sequences such as RNA or genes. Are the patterns of relative species abundance and diversity similar among ecological communities and genomes? Previous dynamical models of genomic diversity have focused on the selective forces shaping the abundance and diversity of transposable elements (TEs. However, ideally, models of genome dynamics should consider not only TEs, but also the diversity of all genetic classes or "species" populating eukaryotic genomes. Here, in an analysis of the diversity and abundance of genetic elements in >500 eukaryotic chromosomes, we show that the patterns are consistent with a neutral hypothesis of genome assembly in virtually all chromosomes tested. The distributions of relative abundance of genetic elements are quite precisely predicted by the dynamics of an ecological model for which the principle of functional equivalence is the main assumption. We hypothesize that at large temporal scales an overarching neutral or nearly neutral process governs the evolution of abundance and diversity of genetic elements in eukaryotic genomes.

  10. Neutral theory predicts the relative abundance and diversity of genetic elements in a broad array of eukaryotic genomes.

    Science.gov (United States)

    Serra, François; Becher, Verónica; Dopazo, Hernán

    2013-01-01

    It is universally true in ecological communities, terrestrial or aquatic, temperate or tropical, that some species are very abundant, others are moderately common, and the majority are rare. Likewise, eukaryotic genomes also contain classes or "species" of genetic elements that vary greatly in abundance: DNA transposons, retrotransposons, satellite sequences, simple repeats and their less abundant functional sequences such as RNA or genes. Are the patterns of relative species abundance and diversity similar among ecological communities and genomes? Previous dynamical models of genomic diversity have focused on the selective forces shaping the abundance and diversity of transposable elements (TEs). However, ideally, models of genome dynamics should consider not only TEs, but also the diversity of all genetic classes or "species" populating eukaryotic genomes. Here, in an analysis of the diversity and abundance of genetic elements in >500 eukaryotic chromosomes, we show that the patterns are consistent with a neutral hypothesis of genome assembly in virtually all chromosomes tested. The distributions of relative abundance of genetic elements are quite precisely predicted by the dynamics of an ecological model for which the principle of functional equivalence is the main assumption. We hypothesize that at large temporal scales an overarching neutral or nearly neutral process governs the evolution of abundance and diversity of genetic elements in eukaryotic genomes.

  11. Genetic Diversity in Lens Species Revealed by EST and Genomic Simple Sequence Repeat Analysis

    Science.gov (United States)

    Dikshit, Harsh Kumar; Singh, Akanksha; Singh, Dharmendra; Aski, Muraleedhar Sidaram; Prakash, Prapti; Jain, Neelu; Meena, Suresh; Kumar, Shiv; Sarker, Ashutosh

    2015-01-01

    Low productivity of pilosae type lentils grown in South Asia is attributed to narrow genetic base of the released cultivars which results in susceptibility to biotic and abiotic stresses. For enhancement of productivity and production, broadening of genetic base is essentially required. The genetic base of released cultivars can be broadened by using diverse types including bold seeded and early maturing lentils from Mediterranean region and related wild species. Genetic diversity in eighty six accessions of three species of genus Lens was assessed based on twelve genomic and thirty one EST-SSR markers. The evaluated set of genotypes included diverse lentil varieties and advanced breeding lines from Indian programme, two early maturing ICARDA lines and five related wild subspecies/species endemic to the Mediterranean region. Genomic SSRs exhibited higher polymorphism in comparison to EST SSRs. GLLC 598 produced 5 alleles with highest gene diversity value of 0.80. Among the studied subspecies/species 43 SSRs detected maximum number of alleles in L. orientalis. Based on Nei’s genetic distance cultivated lentil L. culinaris subsp. culinaris was found to be close to its wild progenitor L. culinaris subsp. orientalis. The Prichard’s structure of 86 genotypes distinguished different subspecies/species. Higher variability was recorded among individuals within population than among populations. PMID:26381889

  12. Genetic Diversity in Lens Species Revealed by EST and Genomic Simple Sequence Repeat Analysis.

    Directory of Open Access Journals (Sweden)

    Harsh Kumar Dikshit

    Full Text Available Low productivity of pilosae type lentils grown in South Asia is attributed to narrow genetic base of the released cultivars which results in susceptibility to biotic and abiotic stresses. For enhancement of productivity and production, broadening of genetic base is essentially required. The genetic base of released cultivars can be broadened by using diverse types including bold seeded and early maturing lentils from Mediterranean region and related wild species. Genetic diversity in eighty six accessions of three species of genus Lens was assessed based on twelve genomic and thirty one EST-SSR markers. The evaluated set of genotypes included diverse lentil varieties and advanced breeding lines from Indian programme, two early maturing ICARDA lines and five related wild subspecies/species endemic to the Mediterranean region. Genomic SSRs exhibited higher polymorphism in comparison to EST SSRs. GLLC 598 produced 5 alleles with highest gene diversity value of 0.80. Among the studied subspecies/species 43 SSRs detected maximum number of alleles in L. orientalis. Based on Nei's genetic distance cultivated lentil L. culinaris subsp. culinaris was found to be close to its wild progenitor L. culinaris subsp. orientalis. The Prichard's structure of 86 genotypes distinguished different subspecies/species. Higher variability was recorded among individuals within population than among populations.

  13. Entangled fates of holobiont genomes during invasion: nested bacterial and host diversities in Caulerpa taxifolia

    KAUST Repository

    Arnaud-Haond, S.

    2017-01-30

    Successful prevention and mitigation of biological invasions requires retracing the initial steps of introduction, as well as understanding key elements enhancing the adaptability of invasive species. We studied the genetic diversity of the green alga Caulerpa taxifolia and its associated bacterial communities in several areas around the world. The striking congruence of α and ß diversity of the algal genome and endophytic communities reveals a tight association, supporting the holobiont concept as best describing the unit of spreading and invasion. Both genomic compartments support the hypotheses of a unique accidental introduction in the Mediterranean and of multiple invasion events in Southern Australia. In addition to helping with tracing the origin of invasion, bacterial communities exhibit metabolic functions that can potentially enhance adaptability and competitiveness of the consortium they form with their host. We thus hypothesize that low genetic diversities of both host and symbiont communities may contribute to the recent regression in the Mediterranean, in contrast with the persistence of highly diverse assemblages in southern Australia. This study supports the importance of scaling up from the host to the holobiont for a comprehensive understanding of invasions. This article is protected by copyright. All rights reserved.

  14. Unraveling Mycobacterium tuberculosis genomic diversity and evolution in Lisbon, Portugal, a highly drug resistant setting

    KAUST Repository

    Perdigão, João

    2014-11-18

    Background Multidrug- (MDR) and extensively drug resistant (XDR) tuberculosis (TB) presents a challenge to disease control and elimination goals. In Lisbon, Portugal, specific and successful XDR-TB strains have been found in circulation for almost two decades. Results In the present study we have genotyped and sequenced the genomes of 56 Mycobacterium tuberculosis isolates recovered mostly from Lisbon. The genotyping data revealed three major clusters associated with MDR-TB, two of which are associated with XDR-TB. Whilst the genomic data contributed to elucidate the phylogenetic positioning of circulating MDR-TB strains, showing a high predominance of a single SNP cluster group 5. Furthermore, a genome-wide phylogeny analysis from these strains, together with 19 publicly available genomes of Mycobacterium tuberculosis clinical isolates, revealed two major clades responsible for M/XDR-TB in the region: Lisboa3 and Q1 (LAM). The data presented by this study yielded insights on microevolution and identification of novel compensatory mutations associated with rifampicin resistance in rpoB and rpoC. The screening for other structural variations revealed putative clade-defining variants. One deletion in PPE41, found among Lisboa3 isolates, is proposed to contribute to immune evasion and as a selective advantage. Insertion sequence (IS) mapping has also demonstrated the role of IS6110 as a major driver in mycobacterial evolution by affecting gene integrity and regulation. Conclusions Globally, this study contributes with novel genome-wide phylogenetic data and has led to the identification of new genomic variants that support the notion of a growing genomic diversity facing both setting and host adaptation.

  15. Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains

    Directory of Open Access Journals (Sweden)

    Jenny L. Draper

    2017-02-01

    Full Text Available Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged Helicobacter pylori strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these H. pylori isolates, including movement of the transposable element IS607, large and small inversions, multiple single nucleotide polymorphisms, and variation in cagA copy number. The cagA gene was found as 1 to 4 tandem copies located off the cag island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in cagY, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains.

  16. Genome-wide analysis of repeat diversity across the family Musaceae.

    Directory of Open Access Journals (Sweden)

    Petr Novák

    Full Text Available BACKGROUND: The banana family (Musaceae includes genetically a diverse group of species and their diploid and polyploid hybrids that are widely cultivated in the tropics. In spite of their socio-economic importance, the knowledge of Musaceae genomes is basically limited to draft genome assemblies of two species, Musa acuminata and M. balbisiana. Here we aimed to complement this information by analyzing repetitive genome fractions of six species selected to represent various phylogenetic groups within the family. RESULTS: Low-pass sequencing of M. acuminata, M. ornata, M. textilis, M. beccarii, M. balbisiana, and Ensete gilletii genomes was performed using a 454/Roche platform. Sequence reads were subjected to analysis of their overall intra- and inter-specific similarities and, all major repeat families were quantified using graph-based clustering. Maximus/SIRE and Angela lineages of Ty1/copia long terminal repeat (LTR retrotransposons and the chromovirus lineage of Ty3/gypsy elements were found to make up most of highly repetitive DNA in all species (14-34.5% of the genome. However, there were quantitative differences and sequence variations detected for classified repeat families as well as for the bulk of total repetitive DNA. These differences were most pronounced between species from different taxonomic sections of the Musaceae family, whereas pairs of closely related species (M. acuminata/M. ornata and M. beccarii/M. textilis shared similar populations of repetitive elements. CONCLUSIONS: This study provided the first insight into the composition and sequence variation of repetitive parts of Musaceae genomes. It allowed identification of repetitive sequences specific for a single species or a group of species that can be utilized as molecular markers in breeding programs and generated computational resources that will be instrumental in repeat masking and annotation in future genome assembly projects.

  17. Characterization of polyploid wheat genomic diversity using a high‐density 90 000 single nucleotide polymorphism array

    National Research Council Canada - National Science Library

    Wang, Shichen; Wong, Debbie; Forrest, Kerrie; Allen, Alexandra; Chao, Shiaoman; Huang, Bevan E; Maccaferri, Marco; Salvi, Silvio; Milner, Sara G; Cattivelli, Luigi; Mastrangelo, Anna M; Whan, Alex; Stephen, Stuart; Barker, Gary; Wieseke, Ralf; Plieske, Joerg; Lillemo, Morten; Mather, Diane; Appels, Rudi; Dolferus, Rudy; Brown‐Guedira, Gina; Korol, Abraham; Akhunova, Alina R; Feuillet, Catherine; Salse, Jerome; Morgante, Michele; Pozniak, Curtis; Luo, Ming‐Cheng; Dvorak, Jan; Morell, Matthew; Dubcovsky, Jorge; Ganal, Martin; Tuberosa, Roberto; Lawley, Cindy; Mikoulitch, Ivan; Cavanagh, Colin; Edwards, Keith J; Hayden, Matthew; Akhunov, Eduard

    2014-01-01

    High‐density single nucleotide polymorphism ( SNP ) genotyping arrays are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships between individuals in populations and studying marker...

  18. The genomic signature of sexual selection in the genetic diversity of the sex chromosomes and autosomes.

    Science.gov (United States)

    Corl, Ammon; Ellegren, Hans

    2012-07-01

    Genomic levels of variation can help reveal the selective and demographic forces that have affected a species during its history. The relative amount of genetic diversity observed on the sex chromosomes as compared to the autosomes is predicted to differ among monogamous and polygynous species. Many species show departures from the expectation for monogamy, but it can be difficult to conclude that this pattern results from variation in mating system because forces other than sexual selection can act upon sex chromosome genetic diversity. As a critical test of the role of mating system, we compared levels of genetic diversity on the Z chromosome and autosomes of phylogenetically independent pairs of shorebirds that differed in their mating systems. We found general support for sexual selection shaping sex chromosome diversity because most polygynous species showed relatively reduced genetic variation on their Z chromosomes as compared to monogamous species. Differences in levels of genetic diversity between the sex chromosomes and autosomes may therefore contribute to understanding the long-term history of sexual selection experienced by a species. © 2012 The Author(s).

  19. Genomic diversity of necrotic enteritis-associated strains of Clostridium perfringens: a review.

    Science.gov (United States)

    Lacey, Jake A; Johanesen, Priscilla A; Lyras, Dena; Moore, Robert J

    2016-06-01

    The investigation of genomic variation between Clostridium perfringens isolates from poultry has been an important tool to enhance our understanding of the genetic basis of strain pathogenicity and the epidemiology of virulent and avirulent strains within the context of necrotic enteritis (NE). The earliest studies used whole genome profiling techniques such as pulsed-field gel electrophoresis to differentiate isolates and determine their relative levels of relatedness. DNA sequencing has been used to investigate genetic variation in (a) individual genes, such as those encoding the alpha and NetB toxins; (b) panels of housekeeping genes for multi-locus sequence typing and (c) most recently whole genome sequencing to build a more complete picture of genomic differences between isolates. Conclusions drawn from these studies include: differential carriage of large conjugative plasmids accounts for a large proportion of inter-strain differences; plasmid-encoded genes are more highly conserved than chromosomal genes, perhaps indicating a relatively recent origin for the plasmids; isolates from NE-affected birds fall into three distinct sequence-based clades while non-pathogenic isolates from healthy birds tend to be more genomically diverse. Overall, the NE causing strains are closely related to C. perfringens isolates from other birds and other diseases whereas the non-pathogenic poultry strains are generally more remotely related to either the pathogenic strains or the strains from other birds. Genomic analysis has indicated that genes in addition to netB are associated with NE pathogenic isolates. Collectively, this work has resulted in a deeper understanding of the pathogenesis of this important poultry disease.

  20. Structural and sequence diversity of the transposon Galileo in the Drosophila willistoni genome.

    Science.gov (United States)

    Gonçalves, Juliana W; Valiati, Victor Hugo; Delprat, Alejandra; Valente, Vera L S; Ruiz, Alfredo

    2014-09-13

    Galileo is one of three members of the P superfamily of DNA transposons. It was originally discovered in Drosophila buzzatii, in which three segregating chromosomal inversions were shown to have been generated by ectopic recombination between Galileo copies. Subsequently, Galileo was identified in six of 12 sequenced Drosophila genomes, indicating its widespread distribution within this genus. Galileo is strikingly abundant in Drosophila willistoni, a neotropical species that is highly polymorphic for chromosomal inversions, suggesting a role for this transposon in the evolution of its genome. We carried out a detailed characterization of all Galileo copies present in the D. willistoni genome. A total of 191 copies, including 133 with two terminal inverted repeats (TIRs), were classified according to structure in six groups. The TIRs exhibited remarkable variation in their length and structure compared to the most complete copy. Three copies showed extended TIRs due to internal tandem repeats, the insertion of other transposable elements (TEs), or the incorporation of non-TIR sequences into the TIRs. Phylogenetic analyses of the transposase (TPase)-encoding and TIR segments yielded two divergent clades, which we termed Galileo subfamilies V and W. Target-site duplications (TSDs) in D. willistoni Galileo copies were 7- or 8-bp in length, with the consensus sequence GTATTAC. Analysis of the region around the TSDs revealed a target site motif (TSM) with a 15-bp palindrome that may give rise to a stem-loop secondary structure. There is a remarkable abundance and diversity of Galileo copies in the D. willistoni genome, although no functional copies were found. The TIRs in particular have a dynamic structure and extend in different ways, but their ends (required for transposition) are more conserved than the rest of the element. The D. willistoni genome harbors two Galileo subfamilies (V and W) that diverged ~9 million years ago and may have descended from an ancestral

  1. Improved annotation of the insect vector of citrus greening disease: biocuration by a diverse genomics community

    Science.gov (United States)

    Hosmani, Prashant S.; Villalobos-Ayala, Krystal; Miller, Sherry; Shippy, Teresa; Flores, Mirella; Rosendale, Andrew; Cordola, Chris; Bell, Tracey; Mann, Hannah; DeAvila, Gabe; DeAvila, Daniel; Moore, Zachary; Buller, Kyle; Ciolkevich, Kathryn; Nandyal, Samantha; Mahoney, Robert; Van Voorhis, Joshua; Dunlevy, Megan; Farrow, David; Hunter, David; Morgan, Taylar; Shore, Kayla; Guzman, Victoria; Izsak, Allison; Dixon, Danielle E.; Cridge, Andrew; Cano, Liliana; Cao, Xiaolong; Jiang, Haobo; Leng, Nan; Johnson, Shannon; Cantarel, Brandi L.; Richards, Stephen; English, Adam; Shatters, Robert G.; Childers, Chris; Chen, Mei-Ju; Hunter, Wayne; Cilia, Michelle; Mueller, Lukas A.; Munoz-Torres, Monica; Nelson, David; Poelchau, Monica F.; Benoit, Joshua B.; Wiersma-Koch, Helen; D’Elia, Tom; Brown, Susan J.

    2017-01-01

    Abstract The Asian citrus psyllid (Diaphorina citri Kuwayama) is the insect vector of the bacterium Candidatus Liberibacter asiaticus (CLas), the pathogen associated with citrus Huanglongbing (HLB, citrus greening). HLB threatens citrus production worldwide. Suppression or reduction of the insect vector using chemical insecticides has been the primary method to inhibit the spread of citrus greening disease. Accurate structural and functional annotation of the Asian citrus psyllid genome, as well as a clear understanding of the interactions between the insect and CLas, are required for development of new molecular-based HLB control methods. A draft assembly of the D. citri genome has been generated and annotated with automated pipelines. However, knowledge transfer from well-curated reference genomes such as that of Drosophila melanogaster to newly sequenced ones is challenging due to the complexity and diversity of insect genomes. To identify and improve gene models as potential targets for pest control, we manually curated several gene families with a focus on genes that have key functional roles in D. citri biology and CLas interactions. This community effort produced 530 manually curated gene models across developmental, physiological, RNAi regulatory and immunity-related pathways. As previously shown in the pea aphid, RNAi machinery genes putatively involved in the microRNA pathway have been specifically duplicated. A comprehensive transcriptome enabled us to identify a number of gene families that are either missing or misassembled in the draft genome. In order to develop biocuration as a training experience, we included undergraduate and graduate students from multiple institutions, as well as experienced annotators from the insect genomics research community. The resulting gene set (OGS v1.0) combines both automatically predicted and manually curated gene models. Database URL: https://citrusgreening.org/

  2. Diversity of chloroplast genome among local clones of cocoa (Theobroma cacao, L.) from Central Sulawesi

    Science.gov (United States)

    Suwastika, I. Nengah; Pakawaru, Nurul Aisyah; Rifka, Rahmansyah, Muslimin, Ishizaki, Yoko; Cruz, André Freire; Basri, Zainuddin; Shiina, Takashi

    2017-02-01

    Chloroplast genomes typically range in size from 120 to 170 kilo base pairs (kb), which relatively conserved among plant species. Recent evaluation on several species, certain unique regions showed high variability which can be utilized in the phylogenetic analysis. Many fragments of coding regions, introns, and intergenic spacers, such as atpB-rbcL, ndhF, rbcL, rpl16, trnH-psbA, trnL-F, trnS-G, etc., have been used for phylogenetic reconstructions at various taxonomic levels. Based on that status, we would like to analysis the diversity of chloroplast genome within species of local cacao (Theobroma cacao L.) from Central Sulawesi. Our recent data showed, there were more than 20 clones from local farming in Central Sulawesi, and it can be detected based on phenotypic and nuclear-genome-based characterization (RAPD- Random Amplified Polymorphic DNA and SSR- Simple Sequences Repeat) markers. In developing DNA marker for this local cacao, here we also included analysis based on the variation of chloroplast genome. At least several regions such as rpl32-TurnL, it can be considered as chloroplast markers on our local clone of cocoa. Furthermore, we could develop phylogenetic analysis in between clones of cocoa.

  3. A common genomic framework for a diverse assembly of plasmids in the symbiotic nitrogen fixing bacteria.

    Directory of Open Access Journals (Sweden)

    Lisa C Crossman

    2008-07-01

    Full Text Available This work centres on the genomic comparisons of two closely-related nitrogen-fixing symbiotic bacteria, Rhizobium leguminosarum biovar viciae 3841 and Rhizobium etli CFN42. These strains maintain a stable genomic core that is also common to other rhizobia species plus a very variable and significant accessory component. The chromosomes are highly syntenic, whereas plasmids are related by fewer syntenic blocks and have mosaic structures. The pairs of plasmids p42f-pRL12, p42e-pRL11 and p42b-pRL9 as well large parts of p42c with pRL10 are shown to be similar, whereas the symbiotic plasmids (p42d and pRL10 are structurally unrelated and seem to follow distinct evolutionary paths. Even though purifying selection is acting on the whole genome, the accessory component is evolving more rapidly. This component is constituted largely for proteins for transport of diverse metabolites and elements of external origin. The present analysis allows us to conclude that a heterogeneous and quickly diversifying group of plasmids co-exists in a common genomic framework.

  4. Genomic and metabolic diversity of Marine Group I Thaumarchaeota in the mesopelagic of two subtropical gyres.

    Directory of Open Access Journals (Sweden)

    Brandon K Swan

    Full Text Available Marine Group I (MGI Thaumarchaeota are one of the most abundant and cosmopolitan chemoautotrophs within the global dark ocean. To date, no representatives of this archaeal group retrieved from the dark ocean have been successfully cultured. We used single cell genomics to investigate the genomic and metabolic diversity of thaumarchaea within the mesopelagic of the subtropical North Pacific and South Atlantic Ocean. Phylogenetic and metagenomic recruitment analysis revealed that MGI single amplified genomes (SAGs are genetically and biogeographically distinct from existing thaumarchaea cultures obtained from surface waters. Confirming prior studies, we found genes encoding proteins for aerobic ammonia oxidation and the hydrolysis of urea, which may be used for energy production, as well as genes involved in 3-hydroxypropionate/4-hydroxybutyrate and oxidative tricarboxylic acid pathways. A large proportion of protein sequences identified in MGI SAGs were absent in the marine cultures Cenarchaeum symbiosum and Nitrosopumilus maritimus, thus expanding the predicted protein space for this archaeal group. Identifiable genes located on genomic islands with low metagenome recruitment capacity were enriched in cellular defense functions, likely in response to viral infections or grazing. We show that MGI Thaumarchaeota in the dark ocean may have more flexibility in potential energy sources and adaptations to biotic interactions than the existing, surface-ocean cultures.

  5. Diversity and Evolution of Mycobacterium tuberculosis: Moving to Whole-Genome-Based Approaches

    Science.gov (United States)

    Niemann, Stefan; Supply, Philip

    2014-01-01

    Genotyping of clinical Mycobacterium tuberculosis complex (MTBC) strains has become a standard tool for epidemiological tracing and for the investigation of the local and global strain population structure. Of special importance is the analysis of the expansion of multidrug (MDR) and extensively drug-resistant (XDR) strains. Classical genotyping and, more recently, whole-genome sequencing have revealed that the strains of the MTBC are more diverse than previously anticipated. Globally, several phylogenetic lineages can be distinguished whose geographical distribution is markedly variable. Strains of particular (sub)lineages, such as Beijing, seem to be more virulent and associated with enhanced resistance levels and fitness, likely fueling their spread in certain world regions. The upcoming generalization of whole-genome sequencing approaches will expectedly provide more comprehensive insights into the molecular and epidemiological mechanisms involved and lead to better diagnostic and therapeutic tools. PMID:25190252

  6. An evolutionary perspective of how infection drives human genome diversity: the case of malaria.

    Science.gov (United States)

    Mangano, Valentina D; Modiano, David

    2014-10-01

    Infection with malaria parasites has imposed a strong selective pressure on the human genome, promoting the convergent evolution of a diverse range of genetic adaptations, many of which are harboured by the red blood cell, which hosts the pathogenic stage of the Plasmodium life cycle. Recent genome-wide and multi-centre association studies of severe malaria have consistently identified ATP2B4, encoding the major Ca(2+) pump of erythrocytes, as a novel resistance locus. Evidence is also accumulating that interaction occurs among resistance loci, the most recent example being negative epistasis among alpha-thalassemia and haptoglobin type 2. Finally, studies on the effect of haemoglobin S and C on parasite transmission to mosquitoes have suggested that protective variants could increase in frequency enhancing parasite fitness. Copyright © 2014 Elsevier Ltd. All rights reserved.

  7. Intraspecies Genomic Diversity and Long-Term Persistence of Bifidobacterium longum

    Science.gov (United States)

    Chaplin, Andrei V.; Efimov, Boris A.; Smeianov, Vladimir V.; Kafarskaia, Lyudmila I.; Pikina, Alla P.; Shkoporov, Andrei N.

    2015-01-01

    Members of genus Bifidobacterium are Gram-positive bacteria, representing a large part of the human infant microbiota and moderately common in adults. However, our knowledge about their diversity, intraspecific phylogeny and long-term persistence in humans is still limited. Bifidobacterium longum is generally considered to be the most common and prevalent species in the intestinal microbiota. In this work we studied whole genome sequences of 28 strains of B. longum, including 8 sequences described in this paper. Part of these strains were isolated from healthy children during a long observation period (up to 10 years between isolation from the same patient). The three known subspecies (longum, infantis and suis) could be clearly divided using sequence-based phylogenetic methods, gene content and the average nucleotide identity. The profiles of glycoside hydrolase genes reflected the different ecological specializations of these three subspecies. The high impact of horizontal gene transfer on genomic diversity was observed, which is possibly due to a large number of prophages and rapidly spreading plasmids. The pan-genome characteristics of the subspecies longum corresponded to the open pan-genome model. While the major part of the strain-specific genetic loci represented transposons and phage-derived regions, a large number of cell envelope synthesis genes were also observed within this category, representing high variability of cell surface molecules. We observed the cases of isolation of high genetically similar strains of B. longum from the same patients after long periods of time, however, we didn’t succeed in the isolation of genetically identical bacteria: a fact, reflecting the high plasticity of microbiota in children. PMID:26275230

  8. Combining genomic sequencing methods to explore viral diversity and reveal potential virus-host interactions

    Directory of Open Access Journals (Sweden)

    Cheryl-Emiliane Tien Chow

    2015-04-01

    Full Text Available Viral diversity and virus-host interactions in oxygen-starved regions of the ocean, also known as oxygen minimum zones (OMZs, remain relatively unexplored. Microbial community metabolism in OMZs alters nutrient and energy flow through marine food webs, resulting in biological nitrogen loss and greenhouse gas production. Thus, viruses infecting OMZ microbes have the potential to modulate community metabolism with resulting feedback on ecosystem function. Here, we describe viral communities inhabiting oxic surface (10m and oxygen-starved basin (200m waters of Saanich Inlet, a seasonally anoxic fjord on the coast of Vancouver Island, British Columbia using viral metagenomics and complete viral fosmid sequencing on samples collected between April 2007 and April 2010. Of 6459 open reading frames (ORFs predicted across all 34 viral fosmids, 77.6% (n=5010 had no homology to reference viral genomes. These fosmids recruited a higher proportion of viral metagenomic sequences from Saanich Inlet than from nearby northeastern subarctic Pacific Ocean (Line P waters, indicating differences in the viral communities between coastal and open ocean locations. While functional annotations of fosmid ORFs were limited, recruitment to NCBI’s non-redundant ‘nr’ database and publicly available single-cell genomes identified putative viruses infecting marine thaumarchaeal and SUP05 proteobacteria to provide potential host linkages with relevance to coupled biogeochemical cycling processes in OMZ waters. Taken together, these results highlight the power of coupled analyses of multiple sequence data types, such as viral metagenomic and fosmid sequence data with prokaryotic single cell genomes, to chart viral diversity, elucidate genomic and ecological contexts for previously unclassifiable viral sequences, and identify novel host interactions in natural and engineered ecosystems.

  9. Maize (Zea mays L. genome diversity as revealed by RNA-sequencing.

    Directory of Open Access Journals (Sweden)

    Candice N Hansey

    Full Text Available Maize is rich in genetic and phenotypic diversity. Understanding the sequence, structural, and expression variation that contributes to phenotypic diversity would facilitate more efficient varietal improvement. RNA based sequencing (RNA-seq is a powerful approach for transcriptional analysis, assessing sequence variation, and identifying novel transcript sequences, particularly in large, complex, repetitive genomes such as maize. In this study, we sequenced RNA from whole seedlings of 21 maize inbred lines representing diverse North American and exotic germplasm. Single nucleotide polymorphism (SNP detection identified 351,710 polymorphic loci distributed throughout the genome covering 22,830 annotated genes. Tight clustering of two distinct heterotic groups and exotic lines was evident using these SNPs as genetic markers. Transcript abundance analysis revealed minimal variation in the total number of genes expressed across these 21 lines (57.1% to 66.0%. However, the transcribed gene set among the 21 lines varied, with 48.7% expressed in all of the lines, 27.9% expressed in one to 20 lines, and 23.4% expressed in none of the lines. De novo assembly of RNA-seq reads that did not map to the reference B73 genome sequence revealed 1,321 high confidence novel transcripts, of which, 564 loci were present in all 21 lines, including B73, and 757 loci were restricted to a subset of the lines. RT-PCR validation demonstrated 87.5% concordance with the computational prediction of these expressed novel transcripts. Intriguingly, 145 of the novel de novo assembled loci were present in lines from only one of the two heterotic groups consistent with the hypothesis that, in addition to sequence polymorphisms and transcript abundance, transcript presence/absence variation is present and, thereby, may be a mechanism contributing to the genetic basis of heterosis.

  10. Characterizing neutral genomic diversity and selection signatures in indigenous populations of Moroccan goats (Capra hircus) using WGS data.

    Science.gov (United States)

    Benjelloun, Badr; Alberto, Florian J; Streeter, Ian; Boyer, Frédéric; Coissac, Eric; Stucki, Sylvie; BenBati, Mohammed; Ibnelbachyr, Mustapha; Chentouf, Mouad; Bechchari, Abdelmajid; Leempoel, Kevin; Alberti, Adriana; Engelen, Stefan; Chikhi, Abdelkader; Clarke, Laura; Flicek, Paul; Joost, Stéphane; Taberlet, Pierre; Pompanon, François

    2015-01-01

    Since the time of their domestication, goats (Capra hircus) have evolved in a large variety of locally adapted populations in response to different human and environmental pressures. In the present era, many indigenous populations are threatened with extinction due to their substitution by cosmopolitan breeds, while they might represent highly valuable genomic resources. It is thus crucial to characterize the neutral and adaptive genetic diversity of indigenous populations. A fine characterization of whole genome variation in farm animals is now possible by using new sequencing technologies. We sequenced the complete genome at 12× coverage of 44 goats geographically representative of the three phenotypically distinct indigenous populations in Morocco. The study of mitochondrial genomes showed a high diversity exclusively restricted to the haplogroup A. The 44 nuclear genomes showed a very high diversity (24 million variants) associated with low linkage disequilibrium. The overall genetic diversity was weakly structured according to geography and phenotypes. When looking for signals of positive selection in each population we identified many candidate genes, several of which gave insights into the metabolic pathways or biological processes involved in the adaptation to local conditions (e.g., panting in warm/desert conditions). This study highlights the interest of WGS data to characterize livestock genomic diversity. It illustrates the valuable genetic richness present in indigenous populations that have to be sustainably managed and may represent valuable genetic resources for the long-term preservation of the species.

  11. Characterizing neutral genomic diversity and selection signatures in indigenous populations of Moroccan goats (Capra hircus using WGS data

    Directory of Open Access Journals (Sweden)

    Badr eBenjelloun

    2015-04-01

    Full Text Available Since the time of their domestication, goats (Capra hircus have evolved in a large variety of locally adapted populations in response to different human and environmental pressures. In the present era, many indigenous populations are threatened with extinction due to their substitution by cosmopolitan breeds, while they might represent highly valuable genomic resources. It is thus crucial to characterize the neutral and adaptive genetic diversity of indigenous populations. A fine characterization of whole genome variation in farm animals is now possible by using new sequencing technologies. We sequenced the complete genome at 12X coverage of 44 goats geographically representative of the three phenotypically distinct indigenous populations in Morocco. The study of mitochondrial genomes showed a high diversity exclusively restricted to the haplogroup A. The 44 nuclear genomes showed a very high diversity (24 million variants associated with low linkage disequilibrium. The overall genetic diversity was weakly structured according to geography and phenotypes. When looking for signals of positive selection in each population we identified many candidate genes, several of which gave insights into the metabolic pathways or biological processes involved in the adaptation to local conditions (e.g. panting in warm/desert conditions. This study highlights the interest of WGS data to characterize livestock genomic diversity. It illustrates the valuable genetic richness present in indigenous populations that have to be sustainably managed and may represent valuable genetic resources for the long-term preservation of the species.

  12. Comparative genomics reveals high biological diversity and specific adaptations in the industrially and medically important fungal genus Aspergillus.

    Science.gov (United States)

    de Vries, Ronald P; Riley, Robert; Wiebenga, Ad; Aguilar-Osorio, Guillermo; Amillis, Sotiris; Uchima, Cristiane Akemi; Anderluh, Gregor; Asadollahi, Mojtaba; Askin, Marion; Barry, Kerrie; Battaglia, Evy; Bayram, Özgür; Benocci, Tiziano; Braus-Stromeyer, Susanna A; Caldana, Camila; Cánovas, David; Cerqueira, Gustavo C; Chen, Fusheng; Chen, Wanping; Choi, Cindy; Clum, Alicia; Dos Santos, Renato Augusto Corrêa; Damásio, André Ricardo de Lima; Diallinas, George; Emri, Tamás; Fekete, Erzsébet; Flipphi, Michel; Freyberg, Susanne; Gallo, Antonia; Gournas, Christos; Habgood, Rob; Hainaut, Matthieu; Harispe, María Laura; Henrissat, Bernard; Hildén, Kristiina S; Hope, Ryan; Hossain, Abeer; Karabika, Eugenia; Karaffa, Levente; Karányi, Zsolt; Kraševec, Nada; Kuo, Alan; Kusch, Harald; LaButti, Kurt; Lagendijk, Ellen L; Lapidus, Alla; Levasseur, Anthony; Lindquist, Erika; Lipzen, Anna; Logrieco, Antonio F; MacCabe, Andrew; Mäkelä, Miia R; Malavazi, Iran; Melin, Petter; Meyer, Vera; Mielnichuk, Natalia; Miskei, Márton; Molnár, Ákos P; Mulé, Giuseppina; Ngan, Chew Yee; Orejas, Margarita; Orosz, Erzsébet; Ouedraogo, Jean Paul; Overkamp, Karin M; Park, Hee-Soo; Perrone, Giancarlo; Piumi, Francois; Punt, Peter J; Ram, Arthur F J; Ramón, Ana; Rauscher, Stefan; Record, Eric; Riaño-Pachón, Diego Mauricio; Robert, Vincent; Röhrig, Julian; Ruller, Roberto; Salamov, Asaf; Salih, Nadhira S; Samson, Rob A; Sándor, Erzsébet; Sanguinetti, Manuel; Schütze, Tabea; Sepčić, Kristina; Shelest, Ekaterina; Sherlock, Gavin; Sophianopoulou, Vicky; Squina, Fabio M; Sun, Hui; Susca, Antonia; Todd, Richard B; Tsang, Adrian; Unkles, Shiela E; van de Wiele, Nathalie; van Rossen-Uffink, Diana; Oliveira, Juliana Velasco de Castro; Vesth, Tammi C; Visser, Jaap; Yu, Jae-Hyuk; Zhou, Miaomiao; Andersen, Mikael R; Archer, David B; Baker, Scott E; Benoit, Isabelle; Brakhage, Axel A; Braus, Gerhard H; Fischer, Reinhard; Frisvad, Jens C; Goldman, Gustavo H; Houbraken, Jos; Oakley, Berl; Pócsi, István; Scazzocchio, Claudio; Seiboth, Bernhard; vanKuyk, Patricia A; Wortman, Jennifer; Dyer, Paul S; Grigoriev, Igor V

    2017-02-14

    The fungal genus Aspergillus is of critical importance to humankind. Species include those with industrial applications, important pathogens of humans, animals and crops, a source of potent carcinogenic contaminants of food, and an important genetic model. The genome sequences of eight aspergilli have already been explored to investigate aspects of fungal biology, raising questions about evolution and specialization within this genus. We have generated genome sequences for ten novel, highly diverse Aspergillus species and compared these in detail to sister and more distant genera. Comparative studies of key aspects of fungal biology, including primary and secondary metabolism, stress response, biomass degradation, and signal transduction, revealed both conservation and diversity among the species. Observed genomic differences were validated with experimental studies. This revealed several highlights, such as the potential for sex in asexual species, organic acid production genes being a key feature of black aspergilli, alternative approaches for degrading plant biomass, and indications for the genetic basis of stress response. A genome-wide phylogenetic analysis demonstrated in detail the relationship of the newly genome sequenced species with other aspergilli. Many aspects of biological differences between fungal species cannot be explained by current knowledge obtained from genome sequences. The comparative genomics and experimental study, presented here, allows for the first time a genus-wide view of the biological diversity of the aspergilli and in many, but not all, cases linked genome differences to phenotype. Insights gained could be exploited for biotechnological and medical applications of fungi.

  13. Whole-Genome Sequencing Reveals Diverse Models of Structural Variations in Esophageal Squamous Cell Carcinoma.

    Science.gov (United States)

    Cheng, Caixia; Zhou, Yong; Li, Hongyi; Xiong, Teng; Li, Shuaicheng; Bi, Yanghui; Kong, Pengzhou; Wang, Fang; Cui, Heyang; Li, Yaoping; Fang, Xiaodong; Yan, Ting; Li, Yike; Wang, Juan; Yang, Bin; Zhang, Ling; Jia, Zhiwu; Song, Bin; Hu, Xiaoling; Yang, Jie; Qiu, Haile; Zhang, Gehong; Liu, Jing; Xu, Enwei; Shi, Ruyi; Zhang, Yanyan; Liu, Haiyan; He, Chanting; Zhao, Zhenxiang; Qian, Yu; Rong, Ruizhou; Han, Zhiwei; Zhang, Yanlin; Luo, Wen; Wang, Jiaqian; Peng, Shaoliang; Yang, Xukui; Li, Xiangchun; Li, Lin; Fang, Hu; Liu, Xingmin; Ma, Li; Chen, Yunqing; Guo, Shiping; Chen, Xing; Xi, Yanfeng; Li, Guodong; Liang, Jianfang; Yang, Xiaofeng; Guo, Jiansheng; Jia, JunMei; Li, Qingshan; Cheng, Xiaolong; Zhan, Qimin; Cui, Yongping

    2016-02-04

    Comprehensive identification of somatic structural variations (SVs) and understanding their mutational mechanisms in cancer might contribute to understanding biological differences and help to identify new therapeutic targets. Unfortunately, characterization of complex SVs across the whole genome and the mutational mechanisms underlying esophageal squamous cell carcinoma (ESCC) is largely unclear. To define a comprehensive catalog of somatic SVs, affected target genes, and their underlying mechanisms in ESCC, we re-analyzed whole-genome sequencing (WGS) data from 31 ESCCs using Meerkat algorithm to predict somatic SVs and Patchwork to determine copy-number changes. We found deletions and translocations with NHEJ and alt-EJ signature as the dominant SV types, and 16% of deletions were complex deletions. SVs frequently led to disruption of cancer-associated genes (e.g., CDKN2A and NOTCH1) with different mutational mechanisms. Moreover, chromothripsis, kataegis, and breakage-fusion-bridge (BFB) were identified as contributing to locally mis-arranged chromosomes that occurred in 55% of ESCCs. These genomic catastrophes led to amplification of oncogene through chromothripsis-derived double-minute chromosome formation (e.g., FGFR1 and LETM2) or BFB-affected chromosomes (e.g., CCND1, EGFR, ERBB2, MMPs, and MYC), with approximately 30% of ESCCs harboring BFB-derived CCND1 amplification. Furthermore, analyses of copy-number alterations reveal high frequency of whole-genome duplication (WGD) and recurrent focal amplification of CDCA7 that might act as a potential oncogene in ESCC. Our findings reveal molecular defects such as chromothripsis and BFB in malignant transformation of ESCCs and demonstrate diverse models of SVs-derived target genes in ESCCs. These genome-wide SV profiles and their underlying mechanisms provide preventive, diagnostic, and therapeutic implications for ESCCs. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  14. Assessing Genetic Diversity among Brettanomyces Yeasts by DNA Fingerprinting and Whole-Genome Sequencing

    Science.gov (United States)

    Crauwels, Sam; Zhu, Bo; Steensels, Jan; Busschaert, Pieter; De Samblanx, Gorik; Marchal, Kathleen; Willems, Kris A.

    2014-01-01

    Brettanomyces yeasts, with the species Brettanomyces (Dekkera) bruxellensis being the most important one, are generally reported to be spoilage yeasts in the beer and wine industry due to the production of phenolic off flavors. However, B. bruxellensis is also known to be a beneficial contributor in certain fermentation processes, such as the production of certain specialty beers. Nevertheless, despite its economic importance, Brettanomyces yeasts remain poorly understood at the genetic and genomic levels. In this study, the genetic relationship between more than 50 Brettanomyces strains from all presently known species and from several sources was studied using a combination of DNA fingerprinting techniques. This revealed an intriguing correlation between the B. bruxellensis fingerprints and the respective isolation source. To further explore this relationship, we sequenced a (beneficial) beer isolate of B. bruxellensis (VIB X9085; ST05.12/22) and compared its genome sequence with the genome sequences of two wine spoilage strains (AWRI 1499 and CBS 2499). ST05.12/22 was found to be substantially different from both wine strains, especially at the level of single nucleotide polymorphisms (SNPs). In addition, there were major differences in the genome structures between the strains investigated, including the presence of large duplications and deletions. Gene content analysis revealed the presence of 20 genes which were present in both wine strains but absent in the beer strain, including many genes involved in carbon and nitrogen metabolism, and vice versa, no genes that were missing in both AWRI 1499 and CBS 2499 were found in ST05.12/22. Together, this study provides tools to discriminate Brettanomyces strains and provides a first glimpse at the genetic diversity and genome plasticity of B. bruxellensis. PMID:24814796

  15. Genomic diversity among Corynebacterium jeikeium strains and comparison with biochemical characteristics and antimicrobial susceptibilities.

    Science.gov (United States)

    Riegel, P; de Briel, D; Prévost, G; Jehl, F; Monteil, H

    1994-08-01

    Levels of DNA relatedness were determined by performing DNA-DNA hybridization experiments (S1 nuclease procedure) with 13 human isolates exhibiting various antimicrobial susceptibility patterns which had been identified as Corynebacterium jeikeium by classical tests and the API Coryne system and with reference strains of C. jeikeium and related taxa. Twelve of 13 isolates which formed three genomic groups showed between 22 and 75% relatedness with the type strain of C. jeikeium. One of these genomic groups included all the strains resistant to penicillin and gentamicin and is genomically related to the C. jeikeium type strain at the species level. In addition, the reference strain of "Corynebacterium genitalium" biotype II was found to belong to this genospecies and therefore can be considered as a synonym of C. jeikeium. In contrast, one isolate and the reference strains of "Corynebacterium pseudogenitalium" biotypes C-3 and C-4 which were assigned to C. jeikeium by the API Coryne system were less than 10% related to the C. jeikeium type strain. These nongenomically related strains can be differentiated from the jeikeium-related strains on the basis of positive acidification from fructose and growth under anaerobic conditions. Furthermore, these strains exhibited full susceptibility to penicillin whereas the strains related to the C. jeikeium type strain are resistant to or only moderately susceptible to penicillin. No genomic relationship was found between C. jeikeium-related strains and other lipophilic coryneforms, identified as Corynebacterium accolens or Corynebacterium group G or F. Our study demonstrates the necessity to perform the fructose fermentation test or respiratory-type test for the correct identification of lipophilic coryneforms as C. jeikeium. Although these strains show genomic diversity at the species level, in a practical aspect, biochemical properties as well as antimicrobial susceptibility may allow the classification of such isolates in

  16. Global genomic diversity of Oryza sativa varieties revealed by comparative physical mapping.

    Science.gov (United States)

    Wang, Xiaoming; Kudrna, David A; Pan, Yonglong; Wang, Hao; Liu, Lin; Lin, Haiyan; Zhang, Jianwei; Song, Xiang; Goicoechea, Jose Luis; Wing, Rod A; Zhang, Qifa; Luo, Meizhong

    2014-04-01

    Bacterial artificial chromosome (BAC) physical maps embedding a large number of BAC end sequences (BESs) were generated for Oryza sativa ssp. indica varieties Minghui 63 (MH63) and Zhenshan 97 (ZS97) and were compared with the genome sequences of O. sativa spp. japonica cv. Nipponbare and O. sativa ssp. indica cv. 93-11. The comparisons exhibited substantial diversities in terms of large structural variations and small substitutions and indels. Genome-wide BAC-sized and contig-sized structural variations were detected, and the shared variations were analyzed. In the expansion regions of the Nipponbare reference sequence, in comparison to the MH63 and ZS97 physical maps, as well as to the previously constructed 93-11 physical map, the amounts and types of the repeat contents, and the outputs of gene ontology analysis, were significantly different from those of the whole genome. Using the physical maps of four wild Oryza species from OMAP (http://www.omap.org) as a control, we detected many conserved and divergent regions related to the evolution process of O. sativa. Between the BESs of MH63 and ZS97 and the two reference sequences, a total of 1532 polymorphic simple sequence repeats (SSRs), 71,383 SNPs, 1767 multiple nucleotide polymorphisms, 6340 insertions, and 9137 deletions were identified. This study provides independent whole-genome resources for intra- and intersubspecies comparisons and functional genomics studies in O. sativa. Both the comparative physical maps and the GBrowse, which integrated the QTL and molecular markers from GRAMENE (http://www.gramene.org) with our physical maps and analysis results, are open to the public through our Web site (http://gresource.hzau.edu.cn/resource/resource.html).

  17. The impact of spatial structure on viral genomic diversity generated during adaptation to thermal stress.

    Directory of Open Access Journals (Sweden)

    Dilara Ally

    Full Text Available Most clinical and natural microbial communities live and evolve in spatially structured environments. When changes in environmental conditions trigger evolutionary responses, spatial structure can impact the types of adaptive response and the extent to which they spread. In particular, localized competition in a spatial landscape can lead to the emergence of a larger number of different adaptive trajectories than would be found in well-mixed populations. Our goal was to determine how two levels of spatial structure affect genomic diversity in a population and how this diversity is manifested spatially.We serially transferred bacteriophage populations growing at high temperatures (40°C on agar plates for 550 generations at two levels of spatial structure. The level of spatial structure was determined by whether the physical locations of the phage subsamples were preserved or disrupted at each passage to fresh bacterial host populations. When spatial structure of the phage populations was preserved, there was significantly greater diversity on a global scale with restricted and patchy distribution. When spatial structure was disrupted with passaging to fresh hosts, beneficial mutants were spread across the entire plate. This resulted in reduced diversity, possibly due to clonal interference as the most fit mutants entered into competition on a global scale. Almost all substitutions present at the end of the adaptation in the populations with disrupted spatial structure were also present in the populations with structure preserved.Our results are consistent with the patchy nature of the spread of adaptive mutants in a spatial landscape. Spatial structure enhances diversity and slows fixation of beneficial mutants. This added diversity could be beneficial in fluctuating environments. We also connect observed substitutions and their effects on fitness to aspects of phage biology, and we provide evidence that some substitutions exclude each other.

  18. Genome-wide Diversity and Association Mapping for Capsaicinoids and Fruit Weight in Capsicum annuum L.

    Science.gov (United States)

    Nimmakayala, Padma; Abburi, Venkata L; Saminathan, Thangasamy; Alaparthi, Suresh B; Almeida, Aldo; Davenport, Brittany; Nadimi, Marjan; Davidson, Joshua; Tonapi, Krittika; Yadav, Lav; Malkaram, Sridhar; Vajja, Gopinath; Hankins, Gerald; Harris, Robert; Park, Minkyu; Choi, Doil; Stommel, John; Reddy, Umesh K

    2016-11-30

    Accumulated capsaicinoid content and increased fruit size are traits resulting from Capsicum annuum domestication. In this study, we used a diverse collection of C. annuum to generate 66,960 SNPs using genotyping by sequencing. The study identified 1189 haplotypes containing 3413 SNPs. Length of individual linkage disequilibrium (LD) blocks varied along chromosomes, with regions of high and low LD interspersed with an average LD of 139 kb. Principal component analysis (PCA), Bayesian model based population structure analysis and an Euclidean tree built based on identity by state (IBS) indices revealed that the clustering pattern of diverse accessions are in agreement with capsaicin content (CA) and fruit weight (FW) classifications indicating the importance of these traits in shaping modern pepper genome. PCA and IBS were used in a mixed linear model of capsaicin and dihydrocapsaicin content and fruit weight to reduce spurious associations because of confounding effects of subpopulations in genome-wide association study (GWAS). Our GWAS results showed SNPs in Ankyrin-like protein, IKI3 family protein, ABC transporter G family and pentatricopeptide repeat protein are the major markers for capsaicinoids and of 16 SNPs strongly associated with FW in both years of the study, 7 are located in known fruit weight controlling genes.

  19. Patterns of genomic and phenomic diversity in wine and table grapes.

    Science.gov (United States)

    Migicovsky, Zoë; Sawler, Jason; Gardner, Kyle M; Aradhya, Mallikarjuna K; Prins, Bernard H; Schwaninger, Heidi R; Bustamante, Carlos D; Buckler, Edward S; Zhong, Gan-Yuan; Brown, Patrick J; Myles, Sean

    2017-01-01

    Grapes are one of the most economically and culturally important crops worldwide, and they have been bred for both winemaking and fresh consumption. Here we evaluate patterns of diversity across 33 phenotypes collected over a 17-year period from 580 table and wine grape accessions that belong to one of the world's largest grape gene banks, the grape germplasm collection of the United States Department of Agriculture. We find that phenological events throughout the growing season are correlated, and quantify the marked difference in size between table and wine grapes. By pairing publicly available historical phenotype data with genome-wide polymorphism data, we identify large effect loci controlling traits that have been targeted during domestication and breeding, including hermaphroditism, lighter skin pigmentation and muscat aroma. Breeding for larger berries in table grapes was traditionally concentrated in geographic regions where Islam predominates and alcohol was prohibited, whereas wine grapes retained the ancestral smaller size that is more desirable for winemaking in predominantly Christian regions. We uncover a novel locus with a suggestive association with berry size that harbors a signature of positive selection for larger berries. Our results suggest that religious rules concerning alcohol consumption have had a marked impact on patterns of phenomic and genomic diversity in grapes.

  20. Complexity of the Mycoplasma fermentans M64 genome and metabolic essentiality and diversity among mycoplasmas.

    Directory of Open Access Journals (Sweden)

    Hung-Wei Shu

    Full Text Available Recently, the genomes of two Mycoplasma fermentans strains, namely M64 and JER, have been completely sequenced. Gross comparison indicated that the genome of M64 is significantly bigger than the other strain and the difference is mainly contributed by the repetitive sequences including seven families of simple and complex transposable elements ranging from 973 to 23,778 bps. Analysis of these repeats resulted in the identification of a new distinct family of Integrative Conjugal Elements of M. fermentans, designated as ICEF-III. Using the concept of "reaction connectivity", the metabolic capabilities in M. fermentans manifested by the complete and partial connected biomodules were revealed. A comparison of the reported M. pulmonis, M. arthritidis, M. genitalium, B. subtilis, and E. coli essential genes and the genes predicted from the M64 genome indicated that more than 73% of the Mycoplasmas essential genes are preserved in M. fermentans. Further examination of the highly and partly connected reactions by a novel combinatorial phylogenetic tree, metabolic network, and essential gene analysis indicated that some of the pathways (e.g. purine and pyrimidine metabolisms with partial connected reactions may be important for the conversions of intermediate metabolites. Taken together, in light of systems and network analyses, the diversity among the Mycoplasma species was manifested on the variations of their limited metabolic abilities during evolution.

  1. Complexity of the Mycoplasma fermentans M64 genome and metabolic essentiality and diversity among mycoplasmas.

    Science.gov (United States)

    Shu, Hung-Wei; Liu, Tze-Tze; Chan, Huang-I; Liu, Yen-Ming; Wu, Keh-Ming; Shu, Hung-Yu; Tsai, Shih-Feng; Hsiao, Kwang-Jen; Hu, Wensi S; Ng, Wailap Victor

    2012-01-01

    Recently, the genomes of two Mycoplasma fermentans strains, namely M64 and JER, have been completely sequenced. Gross comparison indicated that the genome of M64 is significantly bigger than the other strain and the difference is mainly contributed by the repetitive sequences including seven families of simple and complex transposable elements ranging from 973 to 23,778 bps. Analysis of these repeats resulted in the identification of a new distinct family of Integrative Conjugal Elements of M. fermentans, designated as ICEF-III. Using the concept of "reaction connectivity", the metabolic capabilities in M. fermentans manifested by the complete and partial connected biomodules were revealed. A comparison of the reported M. pulmonis, M. arthritidis, M. genitalium, B. subtilis, and E. coli essential genes and the genes predicted from the M64 genome indicated that more than 73% of the Mycoplasmas essential genes are preserved in M. fermentans. Further examination of the highly and partly connected reactions by a novel combinatorial phylogenetic tree, metabolic network, and essential gene analysis indicated that some of the pathways (e.g. purine and pyrimidine metabolisms) with partial connected reactions may be important for the conversions of intermediate metabolites. Taken together, in light of systems and network analyses, the diversity among the Mycoplasma species was manifested on the variations of their limited metabolic abilities during evolution.

  2. Unlocking the diversity of genebanks: whole-genome marker analysis of Swiss bread wheat and spelt

    KAUST Repository

    Müller, Thomas

    2017-11-04

    Genebanks play a pivotal role in preserving the genetic diversity present among old landraces and wild progenitors of modern crops and they represent sources of agriculturally important genes that were lost during domestication and in modern breeding. However, undesirable genes that negatively affect crop performance are often co-introduced when landraces and wild crop progenitors are crossed with elite cultivars, which often limit the use of genebank material in modern breeding programs. A detailed genetic characterization is an important prerequisite to solve this problem and to make genebank material more accessible to breeding. Here, we genotyped 502 bread wheat and 293 spelt accessions held in the Swiss National Genebank using a 15K wheat SNP array. The material included both spring and winter wheats and consisted of old landraces and modern cultivars. Genome- and sub-genome-wide analyses revealed that spelt and bread wheat form two distinct gene pools. In addition, we identified bread wheat landraces that were genetically distinct from modern cultivars. Such accessions were possibly missed in the early Swiss wheat breeding program and are promising targets for the identification of novel genes. The genetic information obtained in this study is appropriate to perform genome-wide association studies, which will facilitate the identification and transfer of agriculturally important genes from the genebank into modern cultivars through marker-assisted selection.

  3. Unlocking the diversity of genebanks: whole-genome marker analysis of Swiss bread wheat and spelt.

    Science.gov (United States)

    Müller, Thomas; Schierscher-Viret, Beate; Fossati, Dario; Brabant, Cécile; Schori, Arnold; Keller, Beat; Krattinger, Simon G

    2017-11-04

    High-throughput genotyping of Swiss bread wheat and spelt accessions revealed differences in their gene pools and identified bread wheat landraces that were not used in breeding. Genebanks play a pivotal role in preserving the genetic diversity present among old landraces and wild progenitors of modern crops and they represent sources of agriculturally important genes that were lost during domestication and in modern breeding. However, undesirable genes that negatively affect crop performance are often co-introduced when landraces and wild crop progenitors are crossed with elite cultivars, which often limit the use of genebank material in modern breeding programs. A detailed genetic characterization is an important prerequisite to solve this problem and to make genebank material more accessible to breeding. Here, we genotyped 502 bread wheat and 293 spelt accessions held in the Swiss National Genebank using a 15K wheat SNP array. The material included both spring and winter wheats and consisted of old landraces and modern cultivars. Genome- and sub-genome-wide analyses revealed that spelt and bread wheat form two distinct gene pools. In addition, we identified bread wheat landraces that were genetically distinct from modern cultivars. Such accessions were possibly missed in the early Swiss wheat breeding program and are promising targets for the identification of novel genes. The genetic information obtained in this study is appropriate to perform genome-wide association studies, which will facilitate the identification and transfer of agriculturally important genes from the genebank into modern cultivars through marker-assisted selection.

  4. Gene arrangement convergence, diverse intron content, and genetic code modifications in mitochondrial genomes of sphaeropleales (chlorophyta).

    Science.gov (United States)

    Fučíková, Karolina; Lewis, Paul O; González-Halphen, Diego; Lewis, Louise A

    2014-08-08

    The majority of our knowledge about mitochondrial genomes of Viridiplantae comes from land plants, but much less is known about their green algal relatives. In the green algal order Sphaeropleales (Chlorophyta), only one representative mitochondrial genome is currently available-that of Acutodesmus obliquus. Our study adds nine completely sequenced and three partially sequenced mitochondrial genomes spanning the phylogenetic diversity of Sphaeropleales. We show not only a size range of 25-53 kb and variation in intron content (0-11) and gene order but also conservation of 13 core respiratory genes and fragmented ribosomal RNA genes. We also report an unusual case of gene arrangement convergence in Neochloris aquatica, where the two rns fragments were secondarily placed in close proximity. Finally, we report the unprecedented usage of UCG as stop codon in Pseudomuriella schumacherensis. In addition, phylogenetic analyses of the mitochondrial protein-coding genes yield a fully resolved, well-supported phylogeny, showing promise for addressing systematic challenges in green algae. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  5. Genomic diversity among Beijing and non-Beijing Mycobacterium tuberculosis isolates from Myanmar.

    Directory of Open Access Journals (Sweden)

    Ruth Stavrum

    2008-04-01

    Full Text Available The Beijing family of Mycobacterium tuberculosis is dominant in countries in East Asia. Genomic polymorphisms are a source of diversity within the M. tuberculosis genome and may account for the variation of virulence among M. tuberculosis isolates. Till date there are no studies that have examined the genomic composition of M. tuberculosis isolates from the high TB-burden country, Myanmar.Twenty-two M. tuberculosis isolates from Myanmar were screened on whole-genome arrays containing genes from M. tuberculosis H37Rv, M. tuberculosis CDC1551 and M. bovis AF22197. Screening identified 198 deletions or extra regions in the clinical isolates compared to H37Rv. Twenty-two regions differentiated between Beijing and non-Beijing isolates and were verified by PCR on an additional 40 isolates. Six regions (Rv0071-0074 [RD105], Rv1572-1576c [RD149], Rv1585c-1587c [RD149], MT1798-Rv1755c [RD152], Rv1761c [RD152] and Rv0279c were deleted in Beijing isolates, of which 4 (Rv1572-1576c, Rv1585c-1587c, MT1798-Rv1755c and Rv1761c were variably deleted among ST42 isolates, indicating a closer relationship between the Beijing and ST42 lineages. The TbD1 region, Mb1582-Mb1583 was deleted in Beijing and ST42 isolates. One M. bovis gene of unknown function, Mb3184c was present in all isolates, except 11 of 13 ST42 isolates. The CDC1551 gene, MT1360 coding for a putative adenylate cyclase, was present in all Beijing and ST42 isolates (except 1. The pks15/1 gene, coding for a putative virulence factor, was intact in all Beijing and non-Beijing isolates, except in ST42 and ST53 isolates.This study describes previously unreported deletions/extra regions in Beijing and non-Beijing M. tuberculosis isolates. The modern and highly frequent ST42 lineage showed a closer relationship to the hypervirulent Beijing lineage than to the ancient non-Beijing lineages. The pks15/1 gene was disrupted only in modern non-Beijing isolates. This is the first report of an in-depth analysis on

  6. Analysis of genotype diversity and evolution of Dengue virus serotype 2 using complete genomes

    Directory of Open Access Journals (Sweden)

    Vaishali P. Waman

    2016-08-01

    Full Text Available Background Dengue is one of the most common arboviral diseases prevalent worldwide and is caused by Dengue viruses (genus Flavivirus, family Flaviviridae. There are four serotypes of Dengue Virus (DENV-1 to DENV-4, each of which is further subdivided into distinct genotypes. DENV-2 is frequently associated with severe dengue infections and epidemics. DENV-2 consists of six genotypes such as Asian/American, Asian I, Asian II, Cosmopolitan, American and sylvatic. Comparative genomic study was carried out to infer population structure of DENV-2 and to analyze the role of evolutionary and spatiotemporal factors in emergence of diversifying lineages. Methods Complete genome sequences of 990 strains of DENV-2 were analyzed using Bayesian-based population genetics and phylogenetic approaches to infer genetically distinct lineages. The role of spatiotemporal factors, genetic recombination and selection pressure in the evolution of DENV-2 is examined using the sequence-based bioinformatics approaches. Results DENV-2 genetic structure is complex and consists of fifteen subpopulations/lineages. The Asian/American genotype is observed to be diversified into seven lineages. The Asian I, Cosmopolitan and sylvatic genotypes were found to be subdivided into two lineages, each. The populations of American and Asian II genotypes were observed to be homogeneous. Significant evidence of episodic positive selection was observed in all the genes, except NS4A. Positive selection operational on a few codons in envelope gene confers antigenic and lineage diversity in the American strains of Asian/American genotype. Selection on codons of non-structural genes was observed to impact diversification of lineages in Asian I, cosmopolitan and sylvatic genotypes. Evidence of intra/inter-genotype recombination was obtained and the uncertainty in classification of recombinant strains was resolved using the population genetics approach. Discussion Complete genome-based analysis

  7. Genome diversity of marine phages recovered from Mediterranean metagenomes: Size matters.

    Directory of Open Access Journals (Sweden)

    Mario López-Pérez

    2017-09-01

    Full Text Available Marine viruses play a critical role not only in the global geochemical cycles but also in the biology and evolution of their hosts. Despite their importance, viral diversity remains underexplored mostly due to sampling and cultivation challenges. Direct sequencing approaches such as viromics has provided new insights into the marine viral world. As a complementary approach, we analysed 24 microbial metagenomes (>0.2 μm size range obtained from six sites in the Mediterranean Sea that vary by depth, season and filter used to retrieve the fraction. Filter-size comparison showed a significant number of viral sequences that were retained on the larger-pore filters and were different from those found in the viral fraction from the same sample, indicating that some important viral information is missing using only assembly from viromes. Besides, we were able to describe 1,323 viral genomic fragments that were more than 10Kb in length, of which 36 represented complete viral genomes including some of them retrieved from a cross-assembly from different metagenomes. Host prediction based on sequence methods revealed new phage groups belonging to marine prokaryotes like SAR11, Cyanobacteria or SAR116. We also identified the first complete virophage from deep seawater and a new endemic clade of the recently discovered Marine group II Euryarchaeota virus. Furthermore, analysis of viral distribution using metagenomes and viromes indicated that most of the new phages were found exclusively in the Mediterranean Sea and some of them, mostly the ones recovered from deep metagenomes, do not recruit in any database probably indicating higher variability and endemicity in Mediterranean bathypelagic waters. Together these data provide the first detailed picture of genomic diversity, spatial and depth variations of viral communities within the Mediterranean Sea using metagenome assembly.

  8. Identification of genome-wide copy number variations among diverse pig breeds by array CGH

    Directory of Open Access Journals (Sweden)

    Li Yan

    2012-12-01

    Full Text Available Abstract Background Recent studies have shown that copy number variation (CNV in mammalian genomes contributes to phenotypic diversity, including health and disease status. In domestic pigs, CNV has been catalogued by several reports, but the extent of CNV and the phenotypic effects are far from clear. The goal of this study was to identify CNV regions (CNVRs in pigs based on array comparative genome hybridization (aCGH. Results Here a custom-made tiling oligo-nucleotide array was used with a median probe spacing of 2506 bp for screening 12 pigs including 3 Chinese native pigs (one Chinese Erhualian, one Tongcheng and one Yangxin pig, 5 European pigs (one Large White, one Pietrain, one White Duroc and two Landrace pigs, 2 synthetic pigs (Chinese new line DIV pigs and 2 crossbred pigs (Landrace × DIV pigs with a Duroc pig as the reference. Two hundred and fifty-nine CNVRs across chromosomes 1–18 and X were identified, with an average size of 65.07 kb and a median size of 98.74 kb, covering 16.85 Mb or 0.74% of the whole genome. Concerning copy number status, 93 (35.91% CNVRs were called as gains, 140 (54.05% were called as losses and the remaining 26 (10.04% were called as both gains and losses. Of all detected CNVRs, 171 (66.02% and 34 (13.13% CNVRs directly overlapped with Sus scrofa duplicated sequences and pig QTLs, respectively. The CNVRs encompassed 372 full length Ensembl transcripts. Two CNVRs identified by aCGH were validated using real-time quantitative PCR (qPCR. Conclusions Using 720 K array CGH (aCGH we described a map of porcine CNVs which facilitated the identification of structural variations for important phenotypes and the assessment of the genetic diversity of pigs.

  9. Exploring the genomic diversity of black yeasts and relatives (Chaetothyriales, Ascomycota

    Directory of Open Access Journals (Sweden)

    M.M. Teixeira

    2017-03-01

    Full Text Available The order Chaetothyriales (Pezizomycotina, Ascomycetes harbours obligatorily melanised fungi and includes numerous etiologic agents of chromoblastomycosis, phaeohyphomycosis and other diseases of vertebrate hosts. Diseases range from mild cutaneous to fatal cerebral or disseminated infections and affect humans and cold-blooded animals globally. In addition, Chaetothyriales comprise species with aquatic, rock-inhabiting, ant-associated, and mycoparasitic life-styles, as well as species that tolerate toxic compounds, suggesting a high degree of versatile extremotolerance. To understand their biology and divergent niche occupation, we sequenced and annotated a set of 23 genomes of main the human opportunists within the Chaetothyriales as well as related environmental species. Our analyses included fungi with diverse life-styles, namely opportunistic pathogens and closely related saprobes, to identify genomic adaptations related to pathogenesis. Furthermore, ecological preferences of Chaetothyriales were analysed, in conjuncture with the order-level phylogeny based on conserved ribosomal genes. General characteristics, phylogenomic relationships, transposable elements, sex-related genes, protein family evolution, genes related to protein degradation (MEROPS, carbohydrate-active enzymes (CAZymes, melanin synthesis and secondary metabolism were investigated and compared between species. Genome assemblies varied from 25.81 Mb (Capronia coronata to 43.03 Mb (Cladophialophora immunda. The bantiana-clade contained the highest number of predicted genes (12 817 on average as well as larger genomes. We found a low content of mobile elements, with DNA transposons from Tc1/Mariner superfamily being the most abundant across analysed species. Additionally, we identified a reduction of carbohydrate degrading enzymes, specifically many of the Glycosyl Hydrolase (GH class, while most of the Pectin Lyase (PL genes were lost in etiological agents of

  10. Exploring the genomic diversity of black yeasts and relatives (Chaetothyriales, Ascomycota).

    Science.gov (United States)

    Teixeira, M M; Moreno, L F; Stielow, B J; Muszewska, A; Hainaut, M; Gonzaga, L; Abouelleil, A; Patané, J S L; Priest, M; Souza, R; Young, S; Ferreira, K S; Zeng, Q; da Cunha, M M L; Gladki, A; Barker, B; Vicente, V A; de Souza, E M; Almeida, S; Henrissat, B; Vasconcelos, A T R; Deng, S; Voglmayr, H; Moussa, T A A; Gorbushina, A; Felipe, M S S; Cuomo, C A; de Hoog, G Sybren

    2017-03-01

    The order Chaetothyriales (Pezizomycotina, Ascomycetes) harbours obligatorily melanised fungi and includes numerous etiologic agents of chromoblastomycosis, phaeohyphomycosis and other diseases of vertebrate hosts. Diseases range from mild cutaneous to fatal cerebral or disseminated infections and affect humans and cold-blooded animals globally. In addition, Chaetothyriales comprise species with aquatic, rock-inhabiting, ant-associated, and mycoparasitic life-styles, as well as species that tolerate toxic compounds, suggesting a high degree of versatile extremotolerance. To understand their biology and divergent niche occupation, we sequenced and annotated a set of 23 genomes of main the human opportunists within the Chaetothyriales as well as related environmental species. Our analyses included fungi with diverse life-styles, namely opportunistic pathogens and closely related saprobes, to identify genomic adaptations related to pathogenesis. Furthermore, ecological preferences of Chaetothyriales were analysed, in conjuncture with the order-level phylogeny based on conserved ribosomal genes. General characteristics, phylogenomic relationships, transposable elements, sex-related genes, protein family evolution, genes related to protein degradation (MEROPS), carbohydrate-active enzymes (CAZymes), melanin synthesis and secondary metabolism were investigated and compared between species. Genome assemblies varied from 25.81 Mb (Capronia coronata) to 43.03 Mb (Cladophialophora immunda). The bantiana-clade contained the highest number of predicted genes (12 817 on average) as well as larger genomes. We found a low content of mobile elements, with DNA transposons from Tc1/Mariner superfamily being the most abundant across analysed species. Additionally, we identified a reduction of carbohydrate degrading enzymes, specifically many of the Glycosyl Hydrolase (GH) class, while most of the Pectin Lyase (PL) genes were lost in etiological agents of chromoblastomycosis and

  11. Distribution and Genetic Diversity of Bacteriocin Gene Clusters in Rumen Microbial Genomes.

    Science.gov (United States)

    Azevedo, Analice C; Bento, Cláudia B P; Ruiz, Jeronimo C; Queiroz, Marisa V; Mantovani, Hilário C

    2015-10-01

    Some species of ruminal bacteria are known to produce antimicrobial peptides, but the screening procedures have mostly been based on in vitro assays using standardized methods. Recent sequencing efforts have made available the genome sequences of hundreds of ruminal microorganisms. In this work, we performed genome mining of the complete and partial genome sequences of 224 ruminal bacteria and 5 ruminal archaea to determine the distribution and diversity of bacteriocin gene clusters. A total of 46 bacteriocin gene clusters were identified in 33 strains of ruminal bacteria. Twenty gene clusters were related to lanthipeptide biosynthesis, while 11 gene clusters were associated with sactipeptide production, 7 gene clusters were associated with class II bacteriocin production, and 8 gene clusters were associated with class III bacteriocin production. The frequency of strains whose genomes encode putative antimicrobial peptide precursors was 14.4%. Clusters related to the production of sactipeptides were identified for the first time among ruminal bacteria. BLAST analysis indicated that the majority of the gene clusters (88%) encoding putative lanthipeptides contained all the essential genes required for lanthipeptide biosynthesis. Most strains of Streptococcus (66.6%) harbored complete lanthipeptide gene clusters, in addition to an open reading frame encoding a putative class II bacteriocin. Albusin B-like proteins were found in 100% of the Ruminococcus albus strains screened in this study. The in silico analysis provided evidence of novel biosynthetic gene clusters in bacterial species not previously related to bacteriocin production, suggesting that the rumen microbiota represents an underexplored source of antimicrobial peptides. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  12. Using Whole Genome Analysis to Examine Recombination across Diverse Sequence Types of Staphylococcus aureus.

    Directory of Open Access Journals (Sweden)

    Elizabeth M Driebe

    Full Text Available Staphylococcus aureus is an important clinical pathogen worldwide and understanding this organism's phylogeny and, in particular, the role of recombination, is important both to understand the overall spread of virulent lineages and to characterize outbreaks. To further elucidate the phylogeny of S. aureus, 35 diverse strains were sequenced using whole genome sequencing. In addition, 29 publicly available whole genome sequences were included to create a single nucleotide polymorphism (SNP-based phylogenetic tree encompassing 11 distinct lineages. All strains of a particular sequence type fell into the same clade with clear groupings of the major clonal complexes of CC8, CC5, CC30, CC45 and CC1. Using a novel analysis method, we plotted the homoplasy density and SNP density across the whole genome and found evidence of recombination throughout the entire chromosome, but when we examined individual clonal lineages we found very little recombination. However, when we analyzed three branches of multiple lineages, we saw intermediate and differing levels of recombination between them. These data demonstrate that in S. aureus, recombination occurs across major lineages that subsequently expand in a clonal manner. Estimated mutation rates for the CC8 and CC5 lineages were different from each other. While the CC8 lineage rate was similar to previous studies, the CC5 lineage was 100-fold greater. Fifty known virulence genes were screened in all genomes in silico to determine their distribution across major clades. Thirty-three genes were present variably across clades, most of which were not constrained by ancestry, indicating horizontal gene transfer or gene loss.

  13. Bacterial origin of a diverse family of UDP-glycosyltransferase genes in the Tetranychus urticae genome.

    Science.gov (United States)

    Ahn, Seung-Joon; Dermauw, Wannes; Wybouw, Nicky; Heckel, David G; Van Leeuwen, Thomas

    2014-07-01

    UDP-glycosyltransferases (UGTs) catalyze the conjugation of a variety of small lipophilic molecules with uridine diphosphate (UDP) sugars, altering them into more water-soluble metabolites. Thereby, UGTs play an important role in the detoxification of xenobiotics and in the regulation of endobiotics. Recently, the genome sequence was reported for the two-spotted spider mite, Tetranychus urticae, a polyphagous herbivore damaging a number of agricultural crops. Although various gene families implicated in xenobiotic metabolism have been documented in T. urticae, UGTs so far have not. We identified 80 UGT genes in the T. urticae genome, the largest number of UGT genes in a metazoan species reported so far. Phylogenetic analysis revealed that lineage-specific gene expansions increased the diversity of the T. urticae UGT repertoire. Genomic distribution, intron-exon structure and structural motifs in the T. urticae UGTs were also described. In addition, expression profiling after host-plant shifts and in acaricide resistant lines supported an important role for UGT genes in xenobiotic metabolism. Expanded searches of UGTs in other arachnid species (Subphylum Chelicerata), including a spider, a scorpion, two ticks and two predatory mites, unexpectedly revealed the complete absence of UGT genes. However, a centipede (Subphylum Myriapoda) and a water flea and a crayfish (Subphylum Crustacea) contain UGT genes in their genomes similar to insect UGTs, suggesting that the UGT gene family might have been lost early in the Chelicerata lineage and subsequently re-gained in the tetranychid mites. Sequence similarity of T. urticae UGTs and bacterial UGTs and their phylogenetic reconstruction suggest that spider mites acquired UGT genes from bacteria by horizontal gene transfer. Our findings show a unique evolutionary history of the T. urticae UGT gene family among other arthropods and provide important clues to its functions in relation to detoxification and thereby host

  14. Diversity of eukaryotic DNA replication origins revealed by genome-wide analysis of chromatin structure.

    Directory of Open Access Journals (Sweden)

    Nicolas M Berbenetz

    2010-09-01

    Full Text Available Eukaryotic DNA replication origins differ both in their efficiency and in the characteristic time during S phase when they become active. The biological basis for these differences remains unknown, but they could be a consequence of chromatin structure. The availability of genome-wide maps of nucleosome positions has led to an explosion of information about how nucleosomes are assembled at transcription start sites, but no similar maps exist for DNA replication origins. Here we combine high-resolution genome-wide nucleosome maps with comprehensive annotations of DNA replication origins to identify patterns of nucleosome occupancy at eukaryotic replication origins. On average, replication origins contain a nucleosome depleted region centered next to the ACS element, flanked on both sides by arrays of well-positioned nucleosomes. Our analysis identified DNA sequence properties that correlate with nucleosome occupancy at replication origins genome-wide and that are correlated with the nucleosome-depleted region. Clustering analysis of all annotated replication origins revealed a surprising diversity of nucleosome occupancy patterns. We provide evidence that the origin recognition complex, which binds to the origin, acts as a barrier element to position and phase nucleosomes on both sides of the origin. Finally, analysis of chromatin reconstituted in vitro reveals that origins are inherently nucleosome depleted. Together our data provide a comprehensive, genome-wide view of chromatin structure at replication origins and suggest a model of nucleosome positioning at replication origins in which the underlying sequence occludes nucleosomes to permit binding of the origin recognition complex, which then (likely in concert with nucleosome modifiers and remodelers positions nucleosomes adjacent to the origin to promote replication origin function.

  15. Novel Moraxella catarrhalis prophages display hyperconserved non-structural genes despite their genomic diversity.

    Science.gov (United States)

    Ariff, Amir; Wise, Michael J; Kahler, Charlene M; Tay, Chin Yen; Peters, Fanny; Perkins, Timothy T; Chang, Barbara J

    2015-10-24

    Moraxella catarrhalis is an important pathogen that often causes otitis media in children, a disease that is not currently vaccine preventable. Asymptomatic colonisation of the human upper respiratory tract is common and lack of clearance by the immune system is likely due to the emergence of seroresistant genetic lineages. No active bacteriophages or prophages have been described in this species. This study was undertaken to identify and categorise prophages in M. catarrhalis, their genetic diversity and the relationship of such diversity with the host-species phylogeny. This study presents a comparative analysis of 32 putative prophages identified in 95 phylogenetically variable, newly sequenced M. catarrhalis genomes. The prophages were genotypically classified into four diverse clades. The genetic synteny of each clade is similar to the group 1 phage family Siphoviridae, however, they form genotypic clusters that are distinct from other members of this family. No core genetic sequences exist across the 32 prophages despite clades 2, 3, and 4 sharing the most sequence identity. The analysis of non-structural prophage genes (coding the integrase, and terminase), and portal gene showed that the respective genes were identical for clades 2, 3, and 4, but unique for clade 1. Empirical analysis calculated that these genes are unexpectedly hyperconserved, under purifying selection, suggesting a tightly regulated functional role. As such, it is improbable that the prophages are decaying remnants but stable components of a fluctuating, flexible and unpredictable system ultimately maintained by functional constraints on non-structural and packaging genes. Additionally, the plate encoding genes were well conserved across all four prophage clades, and the tail fibre genes, commonly responsible for receptor recognition, were clustered into three major groups distributed across the prophage clades. A pan-genome of 283,622 bp was identified, and the prophages were mapped

  16. The Nephila clavipes genome highlights the diversity of spider silk genes and their complex expression.

    Science.gov (United States)

    Babb, Paul L; Lahens, Nicholas F; Correa-Garhwal, Sandra M; Nicholson, David N; Kim, Eun Ji; Hogenesch, John B; Kuntner, Matjaž; Higgins, Linden; Hayashi, Cheryl Y; Agnarsson, Ingi; Voight, Benjamin F

    2017-06-01

    Spider silks are the toughest known biological materials, yet are lightweight and virtually invisible to the human immune system, and they thus have revolutionary potential for medicine and industry. Spider silks are largely composed of spidroins, a unique family of structural proteins. To investigate spidroin genes systematically, we constructed the first genome of an orb-weaving spider: the golden orb-weaver (Nephila clavipes), which builds large webs using an extensive repertoire of silks with diverse physical properties. We cataloged 28 Nephila spidroins, representing all known orb-weaver spidroin types, and identified 394 repeated coding motif variants and higher-order repetitive cassette structures unique to specific spidroins. Characterization of spidroin expression in distinct silk gland types indicates that glands can express multiple spidroin types. We find evidence of an alternatively spliced spidroin, a spidroin expressed only in venom glands, evolutionary mechanisms for spidroin diversification, and non-spidroin genes with expression patterns that suggest roles in silk production.

  17. Genomic diversity of mumps virus and global distribution of the 12 genotypes.

    Science.gov (United States)

    Jin, Li; Örvell, Claes; Myers, Richard; Rota, Paul A; Nakayama, Tetsuo; Forcic, Dubravko; Hiebert, Joanne; Brown, Kevin E

    2015-03-01

    The WHO recently proposed an updated nomenclature for mumps virus (MuV). WHO currently recognizes 12 genotypes of MuV, assigned letters from A to N (excluding E and M), which are based on the nucleotide sequences of small hydrophobic (SH) and haemagglutinin-neuraminidase (HN) genes. A total of 66 MuV genomes are available in GenBank, representing eight of the 12 genotypes. To complete this dataset, whole genomes of seven isolates representing six genotypes (D, H, I, J, K and L) and one unclassified strain were sequenced. SH and HN genes of other representative strains were also sequenced. The degree of genetic divergence, predicted amino acid substitutions in the HN and fusion (F) proteins and geographic distributions of MuV strains were analysed based on the updated dataset. Nucleotide heterogeneity between genotypes reached 20% within the SH gene, with a maximum of 9% within the HN gene. The geographic and chronologic distributions of the 12 genotypes were summarised. This review contributes to our understanding of strain diversity for wild type MuV, and the results support the current WHO nomenclature. © 2014 Crown copyright. Reviews in Medical Virology © 2014 John Wiley & Sons, Ltd.

  18. Functional Genomics of Novel Secondary Metabolites from Diverse Cyanobacteria Using Untargeted Metabolomics

    Science.gov (United States)

    Baran, Richard; Ivanova, Natalia N.; Jose, Nick; Garcia-Pichel, Ferran; Kyrpides, Nikos C.; Gugger, Muriel; Northen, Trent R.

    2013-01-01

    Mass spectrometry-based metabolomics has become a powerful tool for the detection of metabolites in complex biological systems and for the identification of novel metabolites. We previously identified a number of unexpected metabolites in the cyanobacterium Synechococcus sp. PCC 7002, such as histidine betaine, its derivatives and several unusual oligosaccharides. To test for the presence of these compounds and to assess the diversity of small polar metabolites in other cyanobacteria, we profiled cell extracts of nine strains representing much of the morphological and evolutionary diversification of this phylum. Spectral features in raw metabolite profiles obtained by normal phase liquid chromatography coupled to mass spectrometry (MS) were manually curated so that chemical formulae of metabolites could be assigned. For putative identification, retention times and MS/MS spectra were cross-referenced with those of standards or available sprectral library records. Overall, we detected 264 distinct metabolites. These included indeed different betaines, oligosaccharides as well as additional unidentified metabolites with chemical formulae not present in databases of metabolism. Some of these metabolites were detected only in a single strain, but some were present in more than one. Genomic interrogation of the strains revealed that generally, presence of a given metabolite corresponded well with the presence of its biosynthetic genes, if known. Our results show the potential of combining metabolite profiling and genomics for the identification of novel biosynthetic genes. PMID:24084783

  19. Reconstructing Demography and Social Behavior During the Neolithic Expansion from Genomic Diversity Across Island Southeast Asia.

    Science.gov (United States)

    Vallée, François; Luciani, Aurélien; Cox, Murray P

    2016-12-01

    Archaeology, linguistics, and increasingly genetics are clarifying how populations moved from mainland Asia, through Island Southeast Asia, and out into the Pacific during the farming revolution. Yet key features of this process remain poorly understood, particularly how social behaviors intersected with demographic drivers to create the patterns of genomic diversity observed across Island Southeast Asia today. Such questions are ripe for computer modeling. Here, we construct an agent-based model to simulate human mobility across Island Southeast Asia from the Neolithic period to the present, with a special focus on interactions between individuals with Asian, Papuan, and mixed Asian-Papuan ancestry. Incorporating key features of the region, including its complex geography (islands and sea), demographic drivers (fecundity and migration), and social behaviors (marriage preferences), the model simultaneously tracks a full suite of genomic markers (autosomes, X chromosome, mitochondrial DNA, and Y chromosome). Using Bayesian inference, model parameters were determined that produce simulations that closely resemble the admixture profiles of 2299 individuals from 84 populations across Island Southeast Asia. The results highlight that greater propensity to migrate and elevated birth rates are related drivers behind the expansion of individuals with Asian ancestry relative to individuals with Papuan ancestry, that offspring preferentially resulted from marriages between Asian women and Papuan men, and that in contrast to current thinking, individuals with Asian ancestry were likely distributed across large parts of western Island Southeast Asia before the Neolithic expansion. Copyright © 2016 Vallée et al.

  20. Genetic diversity and genomic signatures of selection among cattle breeds from Siberia, eastern and northern Europe.

    Science.gov (United States)

    Iso-Touru, T; Tapio, M; Vilkki, J; Kiseleva, T; Ammosov, I; Ivanova, Z; Popov, R; Ozerov, M; Kantanen, J

    2016-12-01

    Domestication in the near eastern region had a major impact on the gene pool of humpless taurine cattle (Bos taurus). As a result of subsequent natural and artificial selection, hundreds of different breeds have evolved, displaying a broad range of phenotypic traits. Here, 10 Eurasian B. taurus breeds from different biogeographic and production conditions, which exhibit different demographic histories and have been under artificial selection at various intensities, were investigated using the Illumina BovineSNP50 panel to understand their genetic diversity and population structure. In addition, we scanned genomes from eight breeds for signatures of diversifying selection. Our population structure analysis indicated six distinct breed groups, the most divergent being the Yakutian cattle from Siberia. Selection signals were shared (experimental P-value selection signals in the Yakutian cattle were found on chromosomes 7 and 21, where a miRNA gene and genes related to immune system processes are respectively located. In general, genomic regions indicating selection overlapped with known QTL associated with milk production (e.g. on chromosome 19), reproduction (e.g. on chromosome 24) and meat quality (e.g. on chromosome 7). The selection map created in this study shows that native cattle breeds and their genetic resources represent unique material for future breeding. © 2016 Stichting International Foundation for Animal Genetics.

  1. Population Stratification and Underrepresentation of Indian Subcontinent Genetic Diversity in the 1000 Genomes Project Dataset.

    Science.gov (United States)

    Sengupta, Dhriti; Choudhury, Ananyo; Basu, Analabha; Ramsay, Michèle

    2016-12-31

    Genomic variation in Indian populations is of great interest due to the diversity of ancestral components, social stratification, endogamy and complex admixture patterns. With an expanding population of 1.2 billion, India is also a treasure trove to catalogue innocuous as well as clinically relevant rare mutations. Recent studies have revealed four dominant ancestries in populations from mainland India: Ancestral North-Indian (ANI), Ancestral South-Indian (ASI), Ancestral Tibeto-Burman (ATB) and Ancestral Austro-Asiatic (AAA). The 1000 Genomes Project (KGP) Phase-3 data include about 500 genomes from five linguistically defined Indian-Subcontinent (IS) populations (Punjabi, Gujrati, Bengali, Telugu and Tamil) some of whom are recent migrants to USA or UK. Comparative analyses show that despite the distinct geographic origins of the KGP-IS populations, the ANI component is predominantly represented in this dataset. Previous studies demonstrated population substructure in the HapMap Gujrati population, and we found evidence for additional substructure in the Punjabi and Telugu populations. These substructured populations have characteristic/significant differences in heterozygosity and inbreeding coefficients. Moreover, we demonstrate that the substructure is better explained by factors like differences in proportion of ancestral components, and endogamy driven social structure rather than invoking a novel ancestral component to explain it. Therefore, using language and/or geography as a proxy for an ethnic unit is inadequate for many of the IS populations. This highlights the necessity for more nuanced sampling strategies or corrective statistical approaches, particularly for biomedical and population genetics research in India. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  2. Probing the diversity of chloromethane-degrading bacteria by comparative genomics and isotopic fractionation

    Directory of Open Access Journals (Sweden)

    Thierry eNADALIG

    2014-10-01

    Full Text Available Chloromethane (CH3Cl is produced on earth by a variety of abiotic and biological processes. It is the most important halogenated trace gas in the atmosphere, where it contributes to ozone destruction. Current estimates of the global CH3Cl budget are uncertain and suggest that microorganisms might play a more important role in degrading atmospheric CH3Cl than previously thought. Its degradation by bacteria has been demonstrated in marine, terrestrial and phyllospheric environments. Improving our knowledge of these degradation processes and its magnitude is thus highly relevant for a better understanding of the global budget of CH3Cl.The cmu pathway, for chloromethane utilisation, is the only microbial pathway for CH3Cl degradation elucidated so far, and was characterised in detail in aerobic methylotrophic Alphaproteobacteria. Here, we reveal the potential of using a two-pronged approach involving a combination of comparative genomics and isotopic fractionation during CH3Cl degradation to newly address the question of the diversity of chloromethane-degrading bacteria in the environment.Analysis of available bacterial genome sequences reveals that several bacteria not yet known to degrade CH3Cl contain part or all of the complement of cmu genes required for CH3Cl degradation. These organisms, unlike bacteria shown to grow with CH3Cl using the cmu pathway, are obligate anaerobes. On the other hand, analysis of the complete genome of the chloromethane-degrading bacterium Leisingera methylohalidivorans showed that this bacterium does not contain cmu genes. Isotope fractionation experiments with L. methylohalidivorans suggest that the unknown pathway used by this bacterium for growth with CH3Cl can be differentiated from the cmu pathway. This result opens the prospect that contributions from bacteria with the cmu and Leisingera-type pathways to the atmospheric CH3Cl budget may be teased apart in the future.

  3. A Genome-Scale Model of Shewanella piezotolerans Simulates Mechanisms of Metabolic Diversity and Energy Conservation.

    Science.gov (United States)

    Dufault-Thompson, Keith; Jian, Huahua; Cheng, Ruixue; Li, Jiefu; Wang, Fengping; Zhang, Ying

    2017-01-01

    Shewanella piezotolerans strain WP3 belongs to the group 1 branch of the Shewanella genus and is a piezotolerant and psychrotolerant species isolated from the deep sea. In this study, a genome-scale model was constructed for WP3 using a combination of genome annotation, ortholog mapping, and physiological verification. The metabolic reconstruction contained 806 genes, 653 metabolites, and 922 reactions, including central metabolic functions that represented nonhomologous replacements between the group 1 and group 2 Shewanella species. Metabolic simulations with the WP3 model demonstrated consistency with existing knowledge about the physiology of the organism. A comparison of model simulations with experimental measurements verified the predicted growth profiles under increasing concentrations of carbon sources. The WP3 model was applied to study mechanisms of anaerobic respiration through investigating energy conservation, redox balancing, and the generation of proton motive force. Despite being an obligate respiratory organism, WP3 was predicted to use substrate-level phosphorylation as the primary source of energy conservation under anaerobic conditions, a trait previously identified in other Shewanella species. Further investigation of the ATP synthase activity revealed a positive correlation between the availability of reducing equivalents in the cell and the directionality of the ATP synthase reaction flux. Comparison of the WP3 model with an existing model of a group 2 species, Shewanella oneidensis MR-1, revealed that the WP3 model demonstrated greater flexibility in ATP production under the anaerobic conditions. Such flexibility could be advantageous to WP3 for its adaptation to fluctuating availability of organic carbon sources in the deep sea. IMPORTANCE The well-studied nature of the metabolic diversity of Shewanella bacteria makes species from this genus a promising platform for investigating the evolution of carbon metabolism and energy conservation

  4. Comparative genomics of the classical Bordetella subspecies: the evolution and exchange of virulence-associated diversity amongst closely related pathogens.

    Science.gov (United States)

    Park, Jihye; Zhang, Ying; Buboltz, Anne M; Zhang, Xuqing; Schuster, Stephan C; Ahuja, Umesh; Liu, Minghsun; Miller, Jeff F; Sebaihia, Mohammed; Bentley, Stephen D; Parkhill, Julian; Harvill, Eric T

    2012-10-10

    The classical Bordetella subspecies are phylogenetically closely related, yet differ in some of the most interesting and important characteristics of pathogens, such as host range, virulence and persistence. The compelling picture from previous comparisons of the three sequenced genomes was of genome degradation, with substantial loss of genome content (up to 24%) associated with adaptation to humans. For a more comprehensive picture of lineage evolution, we employed comparative genomic and phylogenomic analyses using seven additional diverse, newly sequenced Bordetella isolates. Genome-wide single nucleotide polymorphism (SNP) analysis supports a reevaluation of the phylogenetic relationships between the classical Bordetella subspecies, and suggests a closer link between ovine and human B. parapertussis lineages than has been previously proposed. Comparative analyses of genome content revealed that only 50% of the pan-genome is conserved in all strains, reflecting substantial diversity of genome content in these closely related pathogens that may relate to their different host ranges, virulence and persistence characteristics. Strikingly, these analyses suggest possible horizontal gene transfer (HGT) events in multiple loci encoding virulence factors, including O-antigen and pertussis toxin (Ptx). Segments of the pertussis toxin locus (ptx) and its secretion system locus (ptl) appear to have been acquired by the classical Bordetella subspecies and are divergent in different lineages, suggesting functional divergence in the classical Bordetellae. Together, these observations, especially in key virulence factors, reveal that multiple mechanisms, such as point mutations, gain or loss of genes, as well as HGTs, contribute to the substantial phenotypic diversity of these versatile subspecies in various hosts.

  5. Comparative genomics of the classical Bordetella subspecies: the evolution and exchange of virulence-associated diversity amongst closely related pathogens

    Directory of Open Access Journals (Sweden)

    Park Jihye

    2012-10-01

    Full Text Available Abstract Background The classical Bordetella subspecies are phylogenetically closely related, yet differ in some of the most interesting and important characteristics of pathogens, such as host range, virulence and persistence. The compelling picture from previous comparisons of the three sequenced genomes was of genome degradation, with substantial loss of genome content (up to 24% associated with adaptation to humans. Results For a more comprehensive picture of lineage evolution, we employed comparative genomic and phylogenomic analyses using seven additional diverse, newly sequenced Bordetella isolates. Genome-wide single nucleotide polymorphism (SNP analysis supports a reevaluation of the phylogenetic relationships between the classical Bordetella subspecies, and suggests a closer link between ovine and human B. parapertussis lineages than has been previously proposed. Comparative analyses of genome content revealed that only 50% of the pan-genome is conserved in all strains, reflecting substantial diversity of genome content in these closely related pathogens that may relate to their different host ranges, virulence and persistence characteristics. Strikingly, these analyses suggest possible horizontal gene transfer (HGT events in multiple loci encoding virulence factors, including O-antigen and pertussis toxin (Ptx. Segments of the pertussis toxin locus (ptx and its secretion system locus (ptl appear to have been acquired by the classical Bordetella subspecies and are divergent in different lineages, suggesting functional divergence in the classical Bordetellae. Conclusions Together, these observations, especially in key virulence factors, reveal that multiple mechanisms, such as point mutations, gain or loss of genes, as well as HGTs, contribute to the substantial phenotypic diversity of these versatile subspecies in various hosts.

  6. Bayesian Analysis of Evolutionary Divergence with Genomic Data under Diverse Demographic Models.

    Science.gov (United States)

    Chung, Yujin; Hey, Jody

    2017-06-01

    We present a new Bayesian method for estimating demographic and phylogenetic history using population genomic data. Several key innovations are introduced that allow the study of diverse models within an Isolation-with-Migration framework. The new method implements a 2-step analysis, with an initial Markov chain Monte Carlo (MCMC) phase that samples simple coalescent trees, followed by the calculation of the joint posterior density for the parameters of a demographic model. In step 1, the MCMC sampling phase, the method uses a reduced state space, consisting of coalescent trees without migration paths, and a simple importance sampling distribution without the demography of interest. Once obtained, a single sample of trees can be used in step 2 to calculate the joint posterior density for model parameters under multiple diverse demographic models, without having to repeat MCMC runs. Because migration paths are not included in the state space of the MCMC phase, but rather are handled by analytic integration in step 2 of the analysis, the method is scalable to a large number of loci with excellent MCMC mixing properties. With an implementation of the new method in the computer program MIST, we demonstrate the method's accuracy, scalability, and other advantages using simulated data and DNA sequences of two common chimpanzee subspecies: Pan troglodytes (P. t.) troglodytes and P. t. verus. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  7. Antigenic and Genomic Diversity of Human Rotavirus VP4 in Two Consecutive Epidemic Seasons in Mexico

    Science.gov (United States)

    Padilla-Noriega, Luis; Méndez-Toss, Martha; Menchaca, Griselda; Contreras, Juan F.; Romero-Guido, Pedro; Puerto, Fernando I.; Guiscafré, Héctor; Mota, Felipe; Herrera, Ismael; Cedillo, Roberto; Muñoz, Onofre; Calva, Juan; Guerrero, María de Lourdes; Coulson, Barbara S.; Greenberg, Harry B.; López, Susana; Arias, Carlos F.

    1998-01-01

    In the present investigation we characterized the antigenic diversity of the VP4 and VP7 proteins in 309 and 261 human rotavirus strains isolated during two consecutive epidemic seasons, respectively, in three different regions of Mexico. G3 was found to be the prevalent VP7 serotype during the first year, being superseded by serotype G1 strains during the second season. To antigenically characterize the VP4 protein of the strains isolated, we used five neutralizing monoclonal antibodies (MAbs) which showed specificity for VP4 serotypes P1A, P1B, and P2 in earlier studies. Eight different patterns of reactivity with these MAbs were found, and the prevalence of three of these patterns varied from one season to the next. The P genotype of a subset of 52 samples was determined by PCR. Among the strains characterized as genotype P[4] and P[8] there were three and five different VP4 MAb reactivity patterns, respectively, indicating that the diversity of neutralization epitopes in VP4 is greater than that previously appreciated by the genomic typing methods. PMID:9620401

  8. Multi-genome analysis identifies functional and phylogenetic diversity of basidiomycete adenylate-forming reductases.

    Science.gov (United States)

    Brandenburger, Eileen; Braga, Daniel; Kombrink, Anja; Lackner, Gerald; Gressler, Julia; Künzler, Markus; Hoffmeister, Dirk

    2016-07-22

    Among the invaluable benefits of basidiomycete genomics is the dramatically enhanced insight into the potential capacity to biosynthesize natural products. This study focuses on adenylate-forming reductases, which is a group of natural product biosynthesis enzymes that resembles non-ribosomal peptide synthetases, yet serves to modify one substrate, rather than to condense two or more building blocks. Phylogenetically, these reductases fall in four classes. The phylogeny of Heterobasidion annosum (Russulales) and Serpula lacrymans (Boletales) adenylate-forming reductases was investigated. We identified a previously unrecognized phylogenetic branch within class III adenylate-forming reductases. Three representatives were heterologously produced and their substrate preferences determined in vitro: NPS9 and NPS11 of S. lacrymans preferred l-threonine and benzoic acid, respectively, while NPS10 of H. annosum accepted phenylpyruvic acid best. We also investigated two class IV adenylate-forming reductases of Coprinopsis cinerea, which each were active with l-alanine, l-valine, and l-serine as substrates. Our results show that adenylate-forming reductases are functionally more diverse than previously recognized. As none of the natural products known from the species investigated in this study includes the identified substrates of their respective reductases, our findings may help further explore the diversity of these basidiomycete secondary metabolomes. Copyright © 2016 Elsevier Inc. All rights reserved.

  9. Genome wide characterization of simple sequence repeats in watermelon genome and their application in comparative mapping and genetic diversity analysis

    Science.gov (United States)

    Simple sequence repeats (SSR) or microsatellite markers are one of the most informative and versatile DNA-based markers. The use of next-generation sequencing technologies allow whole genome sequencing and make it possible to develop large numbers of SSRs through bioinformatic analysis of genome da...

  10. Staphylococcus epidermidis pan-genome sequence analysis reveals diversity of skin commensal and hospital infection-associated isolates.

    Science.gov (United States)

    Conlan, Sean; Mijares, Lilia A; Becker, Jesse; Blakesley, Robert W; Bouffard, Gerard G; Brooks, Shelise; Coleman, Holly; Gupta, Jyoti; Gurson, Natalie; Park, Morgan; Schmidt, Brian; Thomas, Pamela J; Otto, Michael; Kong, Heidi H; Murray, Patrick R; Segre, Julia A

    2012-07-25

    While Staphylococcus epidermidis is commonly isolated from healthy human skin, it is also the most frequent cause of nosocomial infections on indwelling medical devices. Despite its importance, few genome sequences existed and the most frequent hospital-associated lineage, ST2, had not been fully sequenced. We cultivated 71 commensal S. epidermidis isolates from 15 skin sites and compared them with 28 nosocomial isolates from venous catheters and blood cultures. We produced 21 commensal and 9 nosocomial draft genomes, and annotated and compared their gene content, phylogenetic relatedness and biochemical functions. The commensal strains had an open pan-genome with 80% core genes and 20% variable genes. The variable genome was characterized by an overabundance of transposable elements, transcription factors and transporters. Biochemical diversity, as assayed by antibiotic resistance and in vitro biofilm formation, demonstrated the varied phenotypic consequences of this genomic diversity. The nosocomial isolates exhibited both large-scale rearrangements and single-nucleotide variation. We showed that S. epidermidis genomes separate into two phylogenetic groups, one consisting only of commensals. The formate dehydrogenase gene, present only in commensals, is a discriminatory marker between the two groups. Commensal skin S. epidermidis have an open pan-genome and show considerable diversity between isolates, even when derived from a single individual or body site. For ST2, the most common nosocomial lineage, we detect variation between three independent isolates sequenced. Finally, phylogenetic analyses revealed a previously unrecognized group of S. epidermidis strains characterized by reduced virulence and formate dehydrogenase, which we propose as a clinical molecular marker.

  11. Genome-wide-analyses of Listeria monocytogenes from food-processing plants reveals clonal diversity and dates the emergence of persisting sequence types

    DEFF Research Database (Denmark)

    Knudsen, Gitte Maegaard; Nielsen, Jesper Boye; Marvig, Rasmus Lykke

    2017-01-01

    of 80 isolates of Listeria monocytogenes sampled from Danish food processing plants over a time-period of 20 years, and analyzed the sequences together with 10 public available reference genomes to advance our understanding of inter- and intra-plant genomic diversity of L. monocytogenes. Except......Whole genome sequencing is increasing used in epidemiology, e.g. for tracing outbreaks of food-borne diseases. This requires in-depth understanding of pathogen emergence, persistence, and genomic diversity along the food production chain including in food processing plants. We sequenced the genomes...

  12. Genomic Diversity of Burkholderia pseudomallei Clinical Isolates: Subtractive Hybridization Reveals a Burkholderia mallei-Specific Prophage in B. pseudomallei 1026b

    Science.gov (United States)

    2004-06-01

    J. P. Kitajima. 2003. Comparative analysis of the complete genome sequence of Pierce’s disease and citrus varigated chlorosis strains of Xylella...JOURNAL OF BACTERIOLOGY, June 2004, p. 3938–3950 Vol. 186, No. 12 0021-9193/04/$08.000 DOI: 10.1128/JB.186.12.3938–3950.2004 Genomic Diversity of... genomic sequence of B. pseudomallei K96243 was recently determined, but little is known about the overall genetic diversity of this species

  13. Genomic comparison of multi-drug resistant invasive and colonizing Acinetobacter baumannii isolated from diverse human body sites reveals genomic plasticity

    Directory of Open Access Journals (Sweden)

    Hsiao William W

    2011-06-01

    Full Text Available Abstract Background Acinetobacter baumannii has recently emerged as a significant global pathogen, with a surprisingly rapid acquisition of antibiotic resistance and spread within hospitals and health care institutions. This study examines the genomic content of three A. baumannii strains isolated from distinct body sites. Isolates from blood, peri-anal, and wound sources were examined in an attempt to identify genetic features that could be correlated to each isolation source. Results Pulsed-field gel electrophoresis, multi-locus sequence typing and antibiotic resistance profiles demonstrated genotypic and phenotypic variation. Each isolate was sequenced to high-quality draft status, which allowed for comparative genomic analyses with existing A. baumannii genomes. A high resolution, whole genome alignment method detailed the phylogenetic relationships of sequenced A. baumannii and found no correlation between phylogeny and body site of isolation. This method identified genomic regions unique to both those isolates found on the surface of the skin or in wounds, termed colonization isolates, and those identified from body fluids, termed invasive isolates; these regions may play a role in the pathogenesis and spread of this important pathogen. A PCR-based screen of 74 A. baumanii isolates demonstrated that these unique genes are not exclusive to either phenotype or isolation source; however, a conserved genomic region exclusive to all sequenced A. baumannii was identified and verified. Conclusions The results of the comparative genome analysis and PCR assay show that A. baumannii is a diverse and genomically variable pathogen that appears to have the potential to cause a range of human disease regardless of the isolation source.

  14. Diverse pathways of phosphatidylcholine biosynthesis in algae as estimated by labeling studies and genomic sequence analysis.

    Science.gov (United States)

    Sato, Naoki; Mori, Natsumi; Hirashima, Takashi; Moriyama, Takashi

    2016-08-01

    Phosphatidylcholine (PC) is an almost ubiquitous phospholipid in eukaryotic algae and plants but is not found in a few species, for example Chlamydomonas reinhardtii. We recently found that some species of the genus Chlamydomonas possess PC. In the universal pathway, PC is synthesized de novo by methylation of phosphatidylethanolamine (PE) or transfer of phosphocholine from cytidine diphosphate (CDP)-choline to diacylglycerol. Phosphocholine, the direct precursor to CDP-choline, is synthesized either by methylation of phosphoethanolamine or phosphorylation of choline. Here we analyzed the mechanism of PC biosynthesis in two species of Chlamydomonas (asymmetrica and sphaeroides) as well as in a red alga, Cyanidioschyzon merolae. Comparative genomic analysis of enzymes involved in PC biosynthesis indicated that C. merolae possesses only the PE methylation pathway. Radioactive tracer experiments using [(32) P]phosphate showed delayed labeling of PC with respect to PE, which was consistent with the PE methylation pathway. In Chlamydomonas asymmetrica, labeling of PC was detected from the early time of incubation with [(32) P]phosphate, suggesting the operation of phosphoethanolamine methylation pathway. Genomic analysis indeed detected the genes for the phosphoethanolamine methylation pathway. In contrast, the labeling of PC in C. sphaeroides was slow, suggesting that the PE methylation pathway was at work. These results as well as biochemical and computational results uncover an unexpected diversity of the mechanisms for PC biosynthesis in algae. Based on these results, we will discuss plausible mechanisms for the scattered distribution of the ability to biosynthesize PC in the genus Chlamydomonas. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  15. Comparative genomics of plant-associated Pseudomonas spp.: insights into diversity and inheritance of traits involved in multitrophic interactions.

    Directory of Open Access Journals (Sweden)

    Joyce E Loper

    2012-07-01

    Full Text Available We provide here a comparative genome analysis of ten strains within the Pseudomonas fluorescens group including seven new genomic sequences. These strains exhibit a diverse spectrum of traits involved in biological control and other multitrophic interactions with plants, microbes, and insects. Multilocus sequence analysis placed the strains in three sub-clades, which was reinforced by high levels of synteny, size of core genomes, and relatedness of orthologous genes between strains within a sub-clade. The heterogeneity of the P. fluorescens group was reflected in the large size of its pan-genome, which makes up approximately 54% of the pan-genome of the genus as a whole, and a core genome representing only 45-52% of the genome of any individual strain. We discovered genes for traits that were not known previously in the strains, including genes for the biosynthesis of the siderophores achromobactin and pseudomonine and the antibiotic 2-hexyl-5-propyl-alkylresorcinol; novel bacteriocins; type II, III, and VI secretion systems; and insect toxins. Certain gene clusters, such as those for two type III secretion systems, are present only in specific sub-clades, suggesting vertical inheritance. Almost all of the genes associated with multitrophic interactions map to genomic regions present in only a subset of the strains or unique to a specific strain. To explore the evolutionary origin of these genes, we mapped their distributions relative to the locations of mobile genetic elements and repetitive extragenic palindromic (REP elements in each genome. The mobile genetic elements and many strain-specific genes fall into regions devoid of REP elements (i.e., REP deserts and regions displaying atypical tri-nucleotide composition, possibly indicating relatively recent acquisition of these loci. Collectively, the results of this study highlight the enormous heterogeneity of the P. fluorescens group and the importance of the variable genome in tailoring

  16. Probing Genomic Aspects of the Multi-Host Pathogen Clostridium perfringens Reveals Significant Pangenome Diversity, and a Diverse Array of Virulence Factors

    Science.gov (United States)

    Kiu, Raymond; Caim, Shabhonam; Alexander, Sarah; Pachori, Purnima; Hall, Lindsay J.

    2017-01-01

    Clostridium perfringens is an important cause of animal and human infections, however information about the genetic makeup of this pathogenic bacterium is currently limited. In this study, we sought to understand and characterise the genomic variation, pangenomic diversity, and key virulence traits of 56 C. perfringens strains which included 51 public, and 5 newly sequenced and annotated genomes using Whole Genome Sequencing. Our investigation revealed that C. perfringens has an “open” pangenome comprising 11667 genes and 12.6% of core genes, identified as the most divergent single-species Gram-positive bacterial pangenome currently reported. Our computational analyses also defined C. perfringens phylogeny (16S rRNA gene) in relation to some 25 Clostridium species, with C. baratii and C. sardiniense determined to be the closest relatives. Profiling virulence-associated factors confirmed presence of well-characterised C. perfringens-associated exotoxins genes including α-toxin (plc), enterotoxin (cpe), and Perfringolysin O (pfo or pfoA), although interestingly there did not appear to be a close correlation with encoded toxin type and disease phenotype. Furthermore, genomic analysis indicated significant horizontal gene transfer events as defined by presence of prophage genomes, and notably absence of CRISPR defence systems in >70% (40/56) of the strains. In relation to antimicrobial resistance mechanisms, tetracycline resistance genes (tet) and anti-defensins genes (mprF) were consistently detected in silico (tet: 75%; mprF: 100%). However, pre-antibiotic era strain genomes did not encode for tet, thus implying antimicrobial selective pressures in C. perfringens evolutionary history over the past 80 years. This study provides new genomic understanding of this genetically divergent multi-host bacterium, and further expands our knowledge on this medically and veterinary important pathogen. PMID:29312194

  17. Probing Genomic Aspects of the Multi-Host Pathogen Clostridium perfringens Reveals Significant Pangenome Diversity, and a Diverse Array of Virulence Factors

    Directory of Open Access Journals (Sweden)

    Raymond Kiu

    2017-12-01

    Full Text Available Clostridium perfringens is an important cause of animal and human infections, however information about the genetic makeup of this pathogenic bacterium is currently limited. In this study, we sought to understand and characterise the genomic variation, pangenomic diversity, and key virulence traits of 56 C. perfringens strains which included 51 public, and 5 newly sequenced and annotated genomes using Whole Genome Sequencing. Our investigation revealed that C. perfringens has an “open” pangenome comprising 11667 genes and 12.6% of core genes, identified as the most divergent single-species Gram-positive bacterial pangenome currently reported. Our computational analyses also defined C. perfringens phylogeny (16S rRNA gene in relation to some 25 Clostridium species, with C. baratii and C. sardiniense determined to be the closest relatives. Profiling virulence-associated factors confirmed presence of well-characterised C. perfringens-associated exotoxins genes including α-toxin (plc, enterotoxin (cpe, and Perfringolysin O (pfo or pfoA, although interestingly there did not appear to be a close correlation with encoded toxin type and disease phenotype. Furthermore, genomic analysis indicated significant horizontal gene transfer events as defined by presence of prophage genomes, and notably absence of CRISPR defence systems in >70% (40/56 of the strains. In relation to antimicrobial resistance mechanisms, tetracycline resistance genes (tet and anti-defensins genes (mprF were consistently detected in silico (tet: 75%; mprF: 100%. However, pre-antibiotic era strain genomes did not encode for tet, thus implying antimicrobial selective pressures in C. perfringens evolutionary history over the past 80 years. This study provides new genomic understanding of this genetically divergent multi-host bacterium, and further expands our knowledge on this medically and veterinary important pathogen.

  18. Culture Phenotypes of Genomically and Geographically Diverse Mycobacterium avium subsp. paratuberculosis Isolates from Different Hosts▿

    Science.gov (United States)

    Whittington, Richard J.; Marsh, Ian B.; Saunders, Vanessa; Grant, Irene R.; Juste, Ramon; Sevilla, Iker A.; Manning, Elizabeth J. B.; Whitlock, Robert H.

    2011-01-01

    Mycobacterium avium subsp. paratuberculosis causes paratuberculosis (Johne's disease) in ruminants in most countries. Historical data suggest substantial differences in culturability of M. avium subsp. paratuberculosis isolates from small ruminants and cattle; however, a systematic comparison of culture media and isolates from different countries and hosts has not been undertaken. Here, 35 field isolates from the United States, Spain, Northern Ireland, and Australia were propagated in Bactec 12B medium and Middlebrook 7H10 agar, genomically characterized, and subcultured to Lowenstein-Jensen (LJ), Herrold's egg yolk (HEY), modified Middlebrook 7H10, Middlebrook 7H11, and Watson-Reid (WR) agars, all with and without mycobactin J and some with sodium pyruvate. Fourteen genotypes of M. avium subsp. paratuberculosis were represented as determined by BstEII IS900 and IS1311 restriction fragment length polymorphism analysis. There was no correlation between genotype and overall culturability, although most S strains tended to grow poorly on HEY agar. Pyruvate was inhibitory to some isolates. All strains grew on modified Middlebrook 7H10 agar but more slowly and less prolifically on LJ agar. Mycobactin J was required for growth on all media except 7H11 agar, but growth was improved by the addition of mycobactin J to 7H11 agar. WR agar supported the growth of few isolates. The differences in growth of M. avium subsp. paratuberculosis that have historically been reported in diverse settings have been strongly influenced by the type of culture medium used. When an optimal culture medium, such as modified Middlebrook 7H10 agar, is used, very little difference between the growth phenotypes of diverse strains of M. avium subsp. paratuberculosis was observed. This optimal medium is recommended to remove bias in the isolation and cultivation of M. avium subsp. paratuberculosis. PMID:21430104

  19. Leveraging genomic resources of model species for the assessment of diversity and phylogeny in wild and domesticated lentil.

    Science.gov (United States)

    Alo, Fida; Furman, Bonnie J; Akhunov, Eduard; Dvorak, Jan; Gepts, Paul

    2011-01-01

    Advances in comparative genomics have provided significant opportunities for analysis of genetic diversity in species with limited genomic resources, such as the genus Lens. Medicago truncatula expressed sequence tags (ESTs) were aligned with the Arabidopsis thaliana genome sequence to identify conserved exon sequences and splice sites in the ESTs. Conserved primers (CPs) based on M. truncatula EST sequences flanking one or more introns were then designed. A total of 22% of the CPs produced polymerase chain reaction amplicons in lentil and were used to sequence amplicons in 175 wild and 133 domesticated lentil accessions. Analysis of the sequences confirmed that L. nigricans and L. ervoides are well-defined species at the DNA sequence level. Lens culinaris subsp. odemensis, L. culinaris subsp. tomentosus, and L. lamottei may constitute a single taxon pending verification with crossability experiments. Lens culinaris subsp. orientalis is the progenitor of domesticated lentil, L. culinaris subsp. culinaris (as proposed before), but a more specific area of origin can be suggested in southern Turkey. We were also able to detect the divergence, following domestication, of the domesticated gene pool into overlapping large-seeded (megasperma) and small-seeded (microsperma) groups. Lentil domestication led to a loss of genetic diversity of approximately 40%. The approach followed in this research has allowed us to rapidly exploit sequence information from model plant species for the study of genetic diversity of a crop such as lentil with limited genomic resources.

  20. Challenges and strategies for implementing genomic services in diverse settings: experiences from the Implementing GeNomics In pracTicE (IGNITE) network.

    Science.gov (United States)

    Sperber, Nina R; Carpenter, Janet S; Cavallari, Larisa H; J Damschroder, Laura; Cooper-DeHoff, Rhonda M; Denny, Joshua C; Ginsburg, Geoffrey S; Guan, Yue; Horowitz, Carol R; Levy, Kenneth D; Levy, Mia A; Madden, Ebony B; Matheny, Michael E; Pollin, Toni I; Pratt, Victoria M; Rosenman, Marc; Voils, Corrine I; W Weitzel, Kristen; Wilke, Russell A; Ryanne Wu, R; Orlando, Lori A

    2017-05-22

    To realize potential public health benefits from genetic and genomic innovations, understanding how best to implement the innovations into clinical care is important. The objective of this study was to synthesize data on challenges identified by six diverse projects that are part of a National Human Genome Research Institute (NHGRI)-funded network focused on implementing genomics into practice and strategies to overcome these challenges. We used a multiple-case study approach with each project considered as a case and qualitative methods to elicit and describe themes related to implementation challenges and strategies. We describe challenges and strategies in an implementation framework and typology to enable consistent definitions and cross-case comparisons. Strategies were linked to challenges based on expert review and shared themes. Three challenges were identified by all six projects, and strategies to address these challenges varied across the projects. One common challenge was to increase the relative priority of integrating genomics within the health system electronic health record (EHR). Four projects used data warehousing techniques to accomplish the integration. The second common challenge was to strengthen clinicians' knowledge and beliefs about genomic medicine. To overcome this challenge, all projects developed educational materials and conducted meetings and outreach focused on genomic education for clinicians. The third challenge was engaging patients in the genomic medicine projects. Strategies to overcome this challenge included use of mass media to spread the word, actively involving patients in implementation (e.g., a patient advisory board), and preparing patients to be active participants in their healthcare decisions. This is the first collaborative evaluation focusing on the description of genomic medicine innovations implemented in multiple real-world clinical settings. Findings suggest that strategies to facilitate integration of genomic

  1. 'Candidatus Phytoplasma phoenicium' associated with almond witches'-broom disease: from draft genome to genetic diversity among strain populations.

    Science.gov (United States)

    Quaglino, Fabio; Kube, Michael; Jawhari, Maan; Abou-Jawdah, Yusuf; Siewert, Christin; Choueiri, Elia; Sobh, Hana; Casati, Paola; Tedeschi, Rosemarie; Lova, Marina Molino; Alma, Alberto; Bianco, Piero Attilio

    2015-07-30

    Almond witches'-broom (AlmWB), a devastating disease of almond, peach and nectarine in Lebanon, is associated with 'Candidatus Phytoplasma phoenicium'. In the present study, we generated a draft genome sequence of 'Ca. P. phoenicium' strain SA213, representative of phytoplasma strain populations from different host plants, and determined the genetic diversity among phytoplasma strain populations by phylogenetic analyses of 16S rRNA, groEL, tufB and inmp gene sequences. Sequence-based typing and phylogenetic analysis of the gene inmp, coding an integral membrane protein, distinguished AlmWB-associated phytoplasma strains originating from diverse host plants, whereas their 16S rRNA, tufB and groEL genes shared 100 % sequence identity. Moreover, dN/dS analysis indicated positive selection acting on inmp gene. Additionally, the analysis of 'Ca. P. phoenicium' draft genome revealed the presence of integral membrane proteins and effector-like proteins and potential candidates for interaction with hosts. One of the integral membrane proteins was predicted as BI-1, an inhibitor of apoptosis-promoting Bax factor. Bioinformatics analyses revealed the presence of putative BI-1 in draft and complete genomes of other 'Ca. Phytoplasma' species. The genetic diversity within 'Ca. P. phoenicium' strain populations in Lebanon suggested that AlmWB disease could be associated with phytoplasma strains derived from the adaptation of an original strain to diverse hosts. Moreover, the identification of a putative inhibitor of apoptosis-promoting Bax factor (BI-1) in 'Ca. P. phoenicium' draft genome and within genomes of other 'Ca. Phytoplasma' species suggested its potential role as a phytoplasma fitness-increasing factor by modification of the host-defense response.

  2. Population genomic analysis reveals differential evolutionary histories and patterns of diversity across subgenomes and subpopulations of Brassica napus L.

    Directory of Open Access Journals (Sweden)

    Elodie eGazave

    2016-04-01

    Full Text Available The allotetraploid species Brassica napus L. is a global crop of major economic importance, providing canola oil (seed and vegetables for human consumption and fodder and meal for livestock feed. Characterizing the genetic diversity present in the extant germplasm pool of B. napus is fundamental to better conserve, manage and utilize the genetic resources of this species. We used sequence-based genotyping to identify and genotype 30,881 SNPs in a diversity panel of 782 B. napus accessions, representing samples of winter and spring growth habits originating from 33 countries across Europe, Asia and America. We detected strong population structure broadly concordant with growth habit and geography, and identified three major genetic groups: spring (SP, winter Europe (WE, and winter Asia (WA. Subpopulation-specific polymorphism patterns suggest enriched genetic diversity within the WA group and a smaller effective breeding population for the SP group compared to WE. Interestingly, the two subgenomes of B. napus appear to have different geographic origins, with phylogenetic analysis placing WE and WA as basal clades for the other subpopulations in the C and A subgenomes, respectively. Finally, we identified 16 genomic regions where the patterns of diversity differed markedly from the genome-wide average, several of which are suggestive of genomic inversions. The results obtained in this study constitute a valuable resource for worldwide breeding efforts and the genetic dissection and prediction of complex B. napus traits.

  3. Genetic and genomic diversity studies of Acacia symbionts in Senegal reveal new species of Mesorhizobium with a putative geographical pattern.

    Directory of Open Access Journals (Sweden)

    Fatou Diouf

    Full Text Available Acacia senegal (L Willd. and Acacia seyal Del. are highly nitrogen-fixing and moderately salt tolerant species. In this study we focused on the genetic and genomic diversity of Acacia mesorhizobia symbionts from diverse origins in Senegal and investigated possible correlations between the genetic diversity of the strains, their soil of origin, and their tolerance to salinity. We first performed a multi-locus sequence analysis on five markers gene fragments on a collection of 47 mesorhizobia strains of A. senegal and A. seyal from 8 localities. Most of the strains (60% clustered with the M. plurifarium type strain ORS 1032T, while the others form four new clades (MSP1 to MSP4. We sequenced and assembled seven draft genomes: four in the M. plurifarium clade (ORS3356, ORS3365, STM8773 and ORS1032T, one in MSP1 (STM8789, MSP2 (ORS3359 and MSP3 (ORS3324. The average nucleotide identities between these genomes together with the MLSA analysis reveal three new species of Mesorhizobium. A great variability of salt tolerance was found among the strains with a lack of correlation between the genetic diversity of mesorhizobia, their salt tolerance and the soils samples characteristics. A putative geographical pattern of A. senegal symbionts between the dryland north part and the center of Senegal was found, reflecting adaptations to specific local conditions such as the water regime. However, the presence of salt does not seem to be an important structuring factor of Mesorhizobium species.

  4. Human Genome Diversity Project. Summary of planning workshop 3(B): Ethical and human-rights implications

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1993-12-31

    The third planning workshop of the Human Genome Diversity Project was held on the campus of the US National Institutes of Health in Bethesda, Maryland, from February 16 through February 18, 1993. The second day of the workshop was devoted to an exploration of the ethical and human-rights implications of the Project. This open meeting centered on three roundtables, involving 12 invited participants, and the resulting discussions among all those present. Attendees and their affiliations are listed in the attached Appendix A. The discussion was guided by a schedule and list of possible issues, distributed to all present and attached as Appendix B. This is a relatively complete, and thus lengthy, summary of the comments at the meeting. The beginning of the summary sets out as conclusions some issues on which there appeared to be widespread agreement, but those conclusions are not intended to serve as a set of detailed recommendations. The meeting organizer is distributing his recommendations in a separate memorandum; recommendations from others who attended the meeting are welcome and will be distributed by the meeting organizer to the participants and to the Project committee.

  5. Integrating Diverse Types of Genomic Data to Identify Genes that Underlie Adverse Pregnancy Phenotypes.

    Directory of Open Access Journals (Sweden)

    Jibril Hirbo

    Full Text Available Progress in understanding complex genetic diseases has been bolstered by synthetic approaches that overlay diverse data types and analyses to identify functionally important genes. Pre-term birth (PTB, a major complication of pregnancy, is a leading cause of infant mortality worldwide. A major obstacle in addressing PTB is that the mechanisms controlling parturition and birth timing remain poorly understood. Integrative approaches that overlay datasets derived from comparative genomics with function-derived ones have potential to advance our understanding of the genetics of birth timing, and thus provide insights into the genes that may contribute to PTB. We intersected data from fast evolving coding and non-coding gene regions in the human and primate lineage with data from genes expressed in the placenta, from genes that show enriched expression only in the placenta, as well as from genes that are differentially expressed in four distinct PTB clinical subtypes. A large fraction of genes that are expressed in placenta, and differentially expressed in PTB clinical subtypes (23-34% are fast evolving, and are associated with functions that include adhesion neurodevelopmental and immune processes. Functional categories of genes that express fast evolution in coding regions differ from those linked to fast evolution in non-coding regions. Finally, there is a surprising lack of overlap between fast evolving genes that are differentially expressed in four PTB clinical subtypes. Integrative approaches, especially those that incorporate evolutionary perspectives, can be successful in identifying potential genetic contributions to complex genetic diseases, such as PTB.

  6. Diverse data supports the transition of filamentous fungal model organisms into the post-genomics era

    Energy Technology Data Exchange (ETDEWEB)

    McCluskey, Kevin; Baker, Scott E.

    2017-02-17

    Filamentous fungi have been important as model organisms since the beginning of modern biological inquiry and have benefitted from open data since the earliest genetic maps were shared. From early origins in simple Mendelian genetics of mating types, parasexual genetics of colony colour, and the foundational demonstration of the segregation of a nutritional requirement, the contribution of research systems utilising filamentous fungi has spanned the biochemical genetics era, through the molecular genetics era, and now are at the very foundation of diverse omics approaches to research and development. Fungal model organisms have come from most major taxonomic groups although Ascomycete filamentous fungi have seen the most major sustained effort. In addition to the published material about filamentous fungi, shared molecular tools have found application in every area of fungal biology. Similarly, shared data has contributed to the success of model systems. The scale of data supporting research with filamentous fungi has grown by 10 to 12 orders of magnitude. From genetic to molecular maps, expression databases, and finally genome resources, the open and collaborative nature of the research communities has assured that the rising tide of data has lifted all of the research systems together.

  7. Hunter-gatherer genomic diversity suggests a southern African origin for modern humans.

    Science.gov (United States)

    Henn, Brenna M; Gignoux, Christopher R; Jobin, Matthew; Granka, Julie M; Macpherson, J M; Kidd, Jeffrey M; Rodríguez-Botigué, Laura; Ramachandran, Sohini; Hon, Lawrence; Brisbin, Abra; Lin, Alice A; Underhill, Peter A; Comas, David; Kidd, Kenneth K; Norman, Paul J; Parham, Peter; Bustamante, Carlos D; Mountain, Joanna L; Feldman, Marcus W

    2011-03-29

    Africa is inferred to be the continent of origin for all modern human populations, but the details of human prehistory and evolution in Africa remain largely obscure owing to the complex histories of hundreds of distinct populations. We present data for more than 580,000 SNPs for several hunter-gatherer populations: the Hadza and Sandawe of Tanzania, and the ≠Khomani Bushmen of South Africa, including speakers of the nearly extinct N|u language. We find that African hunter-gatherer populations today remain highly differentiated, encompassing major components of variation that are not found in other African populations. Hunter-gatherer populations also tend to have the lowest levels of genome-wide linkage disequilibrium among 27 African populations. We analyzed geographic patterns of linkage disequilibrium and population differentiation, as measured by F(ST), in Africa. The observed patterns are consistent with an origin of modern humans in southern Africa rather than eastern Africa, as is generally assumed. Additionally, genetic variation in African hunter-gatherer populations has been significantly affected by interaction with farmers and herders over the past 5,000 y, through both severe population bottlenecks and sex-biased migration. However, African hunter-gatherer populations continue to maintain the highest levels of genetic diversity in the world.

  8. Genome size diversity in angiosperms and its influence on gene space.

    Science.gov (United States)

    Dodsworth, Steven; Leitch, Andrew R; Leitch, Ilia J

    2015-12-01

    Genome size varies c. 2400-fold in angiosperms (flowering plants), although the range of genome size is skewed towards small genomes, with a mean genome size of 1C=5.7Gb. One of the most crucial factors governing genome size in angiosperms is the relative amount and activity of repetitive elements. Recently, there have been new insights into how these repeats, previously discarded as 'junk' DNA, can have a significant impact on gene space (i.e. the part of the genome comprising all the genes and gene-related DNA). Here we review these new findings and explore in what ways genome size itself plays a role in influencing how repeats impact genome dynamics and gene space, including gene expression. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.

  9. Genome-wide diversity and differentiation in New World populations of the human malaria parasite Plasmodium vivax.

    Directory of Open Access Journals (Sweden)

    Thais C de Oliveira

    2017-07-01

    Full Text Available The Americas were the last continent colonized by humans carrying malaria parasites. Plasmodium falciparum from the New World shows very little genetic diversity and greater linkage disequilibrium, compared with its African counterparts, and is clearly subdivided into local, highly divergent populations. However, limited available data have revealed extensive genetic diversity in American populations of another major human malaria parasite, P. vivax.We used an improved sample preparation strategy and next-generation sequencing to characterize 9 high-quality P. vivax genome sequences from northwestern Brazil. These new data were compared with publicly available sequences from recently sampled clinical P. vivax isolates from Brazil (BRA, total n = 11 sequences, Peru (PER, n = 23, Colombia (COL, n = 31, and Mexico (MEX, n = 19.We found that New World populations of P. vivax are as diverse (nucleotide diversity π between 5.2 × 10-4 and 6.2 × 10-4 as P. vivax populations from Southeast Asia, where malaria transmission is substantially more intense. They display several non-synonymous nucleotide substitutions (some of them previously undescribed in genes known or suspected to be involved in antimalarial drug resistance, such as dhfr, dhps, mdr1, mrp1, and mrp-2, but not in the chloroquine resistance transporter ortholog (crt-o gene. Moreover, P. vivax in the Americas is much less geographically substructured than local P. falciparum populations, with relatively little between-population genome-wide differentiation (pairwise FST values ranging between 0.025 and 0.092. Finally, P. vivax populations show a rapid decline in linkage disequilibrium with increasing distance between pairs of polymorphic sites, consistent with very frequent outcrossing. We hypothesize that the high diversity of present-day P. vivax lineages in the Americas originated from successive migratory waves and subsequent admixture between parasite lineages from geographically

  10. Using genome-wide measures of coancestry to maintain diversity and fitness in endangered and domestic pig populations

    Science.gov (United States)

    Bosse, Mirte; Megens, Hendrik-Jan; Madsen, Ole; Crooijmans, Richard P.M.A.; Ryder, Oliver A.; Austerlitz, Frédéric; Groenen, Martien A.M.; de Cara, M. Angeles R.

    2015-01-01

    Conservation and breeding programs aim at maintaining the most diversity, thereby avoiding deleterious effects of inbreeding while maintaining enough variation from which traits of interest can be selected. Theoretically, the most diversity is maintained using optimal contributions based on many markers to calculate coancestries, but this can decrease fitness by maintaining linked deleterious variants. The heterogeneous patterns of coancestry displayed in pigs make them an excellent model to test these predictions. We propose methods to measure coancestry and fitness from resequencing data and use them in population management. We analyzed the resequencing data of Sus cebifrons, a highly endangered porcine species from the Philippines, and genotype data from the Pietrain domestic breed. By analyzing the demographic history of Sus cebifrons, we inferred two past bottlenecks that resulted in some inbreeding load. In Pietrain, we analyzed signatures of selection possibly associated with commercial traits. We also simulated the management of each population to assess the performance of different optimal contribution methods to maintain diversity, fitness, and selection signatures. Maximum genetic diversity was maintained using marker-by-marker coancestry, and least using genealogical coancestry. Using a measure of coancestry based on shared segments of the genome achieved the best results in terms of diversity and fitness. However, this segment-based management eliminated signatures of selection. We demonstrate that maintaining both diversity and fitness depends on the genomic distribution of deleterious variants, which is shaped by demographic and selection histories. Our findings show the importance of genomic and next-generation sequencing information in the optimal design of breeding or conservation programs. PMID:26063737

  11. Complete genome analysis of 33 ecologically and biologically diverse Rift Valley fever virus strains reveals widespread virus movement and low genetic diversity due to recent common ancestry.

    Science.gov (United States)

    Bird, Brian H; Khristova, Marina L; Rollin, Pierre E; Ksiazek, Thomas G; Nichol, Stuart T

    2007-03-01

    Rift Valley fever (RVF) virus is a mosquito-borne RNA virus responsible for large explosive outbreaks of acute febrile disease in humans and livestock in Africa with significant mortality and economic impact. The successful high-throughput generation of the complete genome sequence was achieved for 33 diverse RVF virus strains collected from throughout Africa and Saudi Arabia from 1944 to 2000, including strains differing in pathogenicity in disease models. While several distinct virus genetic lineages were determined, which approximately correlate with geographic origin, multiple exceptions indicative of long-distance virus movement have been found. Virus strains isolated within an epidemic (e.g., Mauritania, 1987, or Egypt, 1977 to 1978) exhibit little diversity, while those in enzootic settings (e.g., 1970s Zimbabwe) can be highly diverse. In addition, the large Saudi Arabian RVF outbreak in 2000 appears to have involved virus introduction from East Africa, based on the close ancestral relationship of a 1998 East African virus. Virus genetic diversity was low (approximately 5%) and primarily involved accumulation of mutations at an average of 2.9 x 10(-4) substitutions/site/year, although some evidence of RNA segment reassortment was found. Bayesian analysis of current RVF virus genetic diversity places the most recent common ancestor of these viruses in the late 1800s, the colonial period in Africa, a time of dramatic changes in agricultural practices and introduction of nonindigenous livestock breeds. In addition to insights into the evolution and ecology of RVF virus, these genomic data also provide a foundation for the design of molecular detection assays and prototype vaccines useful in combating this important disease.

  12. Draft genome of spinach and transcriptome diversity of 120 Spinacia accessions

    OpenAIRE

    Xu, Chenxi; Jiao, Chen; Sun, Honghe; Cai, Xiaofeng; Wang, Xiaoli; Ge, Chenhui; Zheng, Yi; Liu, Wenli; Sun, Xuepeng; Xu, Yimin; Deng, Jie; Zhang, Zhonghua; Huang, Sanwen; Dai, Shaojun; Mou, Beiquan

    2017-01-01

    Spinach is an important leafy vegetable enriched with multiple necessary nutrients. Here we report the draft genome sequence of spinach (Spinacia oleracea, 2n=12), which contains 25,495 protein-coding genes. The spinach genome is highly repetitive with 74.4% of its content in the form of transposable elements. No recent whole genome duplication events are observed in spinach. Genome syntenic analysis between spinach and sugar beet suggests substantial inter- and intra-chromosome rearrangement...

  13. The Pinus taeda genome is characterized by diverse and highly diverged repetitive sequences

    Directory of Open Access Journals (Sweden)

    Yandell Mark

    2010-07-01

    Full Text Available Abstract Background In today's age of genomic discovery, no attempt has been made to comprehensively sequence a gymnosperm genome. The largest genus in the coniferous family Pinaceae is Pinus, whose 110-120 species have extremely large genomes (c. 20-40 Gb, 2N = 24. The size and complexity of these genomes have prompted much speculation as to the feasibility of completing a conifer genome sequence. Conifer genomes are reputed to be highly repetitive, but there is little information available on the nature and identity of repetitive units in gymnosperms. The pines have extensive genetic resources, with approximately 329000 ESTs from eleven species and genetic maps in eight species, including a dense genetic map of the twelve linkage groups in Pinus taeda. Results We present here the Sanger sequence and annotation of ten P. taeda BAC clones and Genome Analyzer II whole genome shotgun (WGS sequences representing 7.5% of the genome. Computational annotation of ten BACs predicts three putative protein-coding genes and at least fifteen likely pseudogenes in nearly one megabase of sequence. We found three conifer-specific LTR retroelements in the BACs, and tentatively identified at least 15 others based on evidence from the distantly related angiosperms. Alignment of WGS sequences to the BACs indicates that 80% of BAC sequences have similar copies (≥ 75% nucleotide identity elsewhere in the genome, but only 23% have identical copies (99% identity. The three most common repetitive elements in the genome were identified and, when combined, represent less than 5% of the genome. Conclusions This study indicates that the majority of repeats in the P. taeda genome are 'novel' and will therefore require additional BAC or genomic sequencing for accurate characterization. The pine genome contains a very large number of diverged and probably defunct repetitive elements. This study also provides new evidence that sequencing a pine genome using a WGS approach is

  14. Genetic diversity and structure of elite cotton germplasm (Gossypium hirsutum L.) using genome-wide SNP data.

    Science.gov (United States)

    Ai, XianTao; Liang, YaJun; Wang, JunDuo; Zheng, JuYun; Gong, ZhaoLong; Guo, JiangPing; Li, XueYuan; Qu, YanYing

    2017-10-01

    Cotton (Gossypium spp.) is the most important natural textile fiber crop, and Gossypium hirsutum L. is responsible for 90% of the annual cotton crop in the world. Information on cotton genetic diversity and population structure is essential for new breeding lines. In this study, we analyzed population structure and genetic diversity of 288 elite Gossypium hirsutum cultivar accessions collected from around the world, and especially from China, using genome-wide single nucleotide polymorphisms (SNP) markers. The average polymorphsim information content (PIC) was 0.25, indicating a relatively low degree of genetic diversity. Population structure analysis revealed extensive admixture and identified three subgroups. Phylogenetic analysis supported the subgroups identified by STRUCTURE. The results from both population structure and phylogenetic analysis were, for the most part, in agreement with pedigree information. Analysis of molecular variance revealed a larger amount of variation was due to diversity within the groups. Establishment of genetic diversity and population structure from this study could be useful for genetic and genomic analysis and systematic utilization of the standing genetic variation in upland cotton.

  15. Extreme genome diversity in the hyper-prevalent parasitic eukaryote Blastocystis.

    Science.gov (United States)

    Gentekaki, Eleni; Curtis, Bruce A; Stairs, Courtney W; Klimeš, Vladimír; Eliáš, Marek; Salas-Leiva, Dayana E; Herman, Emily K; Eme, Laura; Arias, Maria C; Henrissat, Bernard; Hilliou, Frédérique; Klute, Mary J; Suga, Hiroshi; Malik, Shehre-Banoo; Pightling, Arthur W; Kolisko, Martin; Rachubinski, Richard A; Schlacht, Alexander; Soanes, Darren M; Tsaousis, Anastasios D; Archibald, John M; Ball, Steven G; Dacks, Joel B; Clark, C Graham; van der Giezen, Mark; Roger, Andrew J

    2017-09-01

    Blastocystis is the most prevalent eukaryotic microbe colonizing the human gut, infecting approximately 1 billion individuals worldwide. Although Blastocystis has been linked to intestinal disorders, its pathogenicity remains controversial because most carriers are asymptomatic. Here, the genome sequence of Blastocystis subtype (ST) 1 is presented and compared to previously published sequences for ST4 and ST7. Despite a conserved core of genes, there is unexpected diversity between these STs in terms of their genome sizes, guanine-cytosine (GC) content, intron numbers, and gene content. ST1 has 6,544 protein-coding genes, which is several hundred more than reported for ST4 and ST7. The percentage of proteins unique to each ST ranges from 6.2% to 20.5%, greatly exceeding the differences observed within parasite genera. Orthologous proteins also display extreme divergence in amino acid sequence identity between STs (i.e., 59%-61% median identity), on par with observations of the most distantly related species pairs of parasite genera. The STs also display substantial variation in gene family distributions and sizes, especially for protein kinase and protease gene families, which could reflect differences in virulence. It remains to be seen to what extent these inter-ST differences persist at the intra-ST level. A full 26% of genes in ST1 have stop codons that are created on the mRNA level by a novel polyadenylation mechanism found only in Blastocystis. Reconstructions of pathways and organellar systems revealed that ST1 has a relatively complete membrane-trafficking system and a near-complete meiotic toolkit, possibly indicating a sexual cycle. Unlike some intestinal protistan parasites, Blastocystis ST1 has near-complete de novo pyrimidine, purine, and thiamine biosynthesis pathways and is unique amongst studied stramenopiles in being able to metabolize α-glucans rather than β-glucans. It lacks all genes encoding heme-containing cytochrome P450 proteins

  16. Extreme genome diversity in the hyper-prevalent parasitic eukaryote Blastocystis

    Science.gov (United States)

    Gentekaki, Eleni; Stairs, Courtney W.; Klimeš, Vladimír; Eliáš, Marek; Salas-Leiva, Dayana E.; Herman, Emily K.; Eme, Laura; Arias, Maria C.; Henrissat, Bernard; Hilliou, Frédérique; Klute, Mary J.; Suga, Hiroshi; Malik, Shehre-Banoo; Pightling, Arthur W.; Kolisko, Martin; Rachubinski, Richard A.; Schlacht, Alexander; Soanes, Darren M.; Tsaousis, Anastasios D.; Archibald, John M.; Ball, Steven G.; Dacks, Joel B.; Clark, C. Graham; van der Giezen, Mark; Roger, Andrew J.

    2017-01-01

    Blastocystis is the most prevalent eukaryotic microbe colonizing the human gut, infecting approximately 1 billion individuals worldwide. Although Blastocystis has been linked to intestinal disorders, its pathogenicity remains controversial because most carriers are asymptomatic. Here, the genome sequence of Blastocystis subtype (ST) 1 is presented and compared to previously published sequences for ST4 and ST7. Despite a conserved core of genes, there is unexpected diversity between these STs in terms of their genome sizes, guanine-cytosine (GC) content, intron numbers, and gene content. ST1 has 6,544 protein-coding genes, which is several hundred more than reported for ST4 and ST7. The percentage of proteins unique to each ST ranges from 6.2% to 20.5%, greatly exceeding the differences observed within parasite genera. Orthologous proteins also display extreme divergence in amino acid sequence identity between STs (i.e., 59%–61% median identity), on par with observations of the most distantly related species pairs of parasite genera. The STs also display substantial variation in gene family distributions and sizes, especially for protein kinase and protease gene families, which could reflect differences in virulence. It remains to be seen to what extent these inter-ST differences persist at the intra-ST level. A full 26% of genes in ST1 have stop codons that are created on the mRNA level by a novel polyadenylation mechanism found only in Blastocystis. Reconstructions of pathways and organellar systems revealed that ST1 has a relatively complete membrane-trafficking system and a near-complete meiotic toolkit, possibly indicating a sexual cycle. Unlike some intestinal protistan parasites, Blastocystis ST1 has near-complete de novo pyrimidine, purine, and thiamine biosynthesis pathways and is unique amongst studied stramenopiles in being able to metabolize α-glucans rather than β-glucans. It lacks all genes encoding heme-containing cytochrome P450 proteins

  17. Comparative genomic analysis reveals a diverse repertoire of genes involved in prokaryote-eukaryote interactions within the Pseudovibrio genus.

    Directory of Open Access Journals (Sweden)

    Stefano eRomano

    2016-03-01

    Full Text Available Strains of the Pseudovibrio genus have been detected worldwide, mainly as part of bacterial communities associated with marine invertebrates, particularly sponges. This recurrent association has been considered as an indication of a symbiotic relationship between these microbes and their host. Until recently, the availability of only two genomes, belonging to closely related strains, has limited the knowledge on the genomic and physiological features of the genus to a single phylogenetic lineage.Here we present 10 newly sequenced genomes of Pseudovibrio strains isolated from marine sponges from the west coast of Ireland, and including the other two publicly available genomes we performed an extensive comparative genomic analysis. Homogeneity was apparent in terms of both the orthologous genes and the metabolic features shared amongst the 12 strains. At the genomic level, a key physiological difference observed amongst the isolates was the presence only in strain P. axinellae AD2 of genes encoding proteins involved in assimilatory nitrate reduction, which was then proved experimentally. We then focused on studying those systems known to be involved in the interactions with eukaryotic and prokaryotic cells. This analysis revealed that the genus harbors a large diversity of toxin-like proteins, secretion systems and their potential effectors. Their distribution in the genus was not always consistent with the phylogenetic relationship of the strains. Finally, our analyses identified new genomic islands encoding potential toxin-immunity systems, previously unknown in the genus.Our analyses shed new light on the Pseudovibrio genus, indicating a large diversity of both metabolic features and systems for interacting with the host. The diversity in both distribution and abundance of these systems amongst the strains underlines how metabolically and phylogenetically similar bacteria may use different strategies to interact with the host and find a niche

  18. Genome sequence and genetic diversity of the common carp, Cyprinus carpio.

    Science.gov (United States)

    Xu, Peng; Zhang, Xiaofeng; Wang, Xumin; Li, Jiongtang; Liu, Guiming; Kuang, Youyi; Xu, Jian; Zheng, Xianhu; Ren, Lufeng; Wang, Guoliang; Zhang, Yan; Huo, Linhe; Zhao, Zixia; Cao, Dingchen; Lu, Cuiyun; Li, Chao; Zhou, Yi; Liu, Zhanjiang; Fan, Zhonghua; Shan, Guangle; Li, Xingang; Wu, Shuangxiu; Song, Lipu; Hou, Guangyuan; Jiang, Yanliang; Jeney, Zsigmond; Yu, Dan; Wang, Li; Shao, Changjun; Song, Lai; Sun, Jing; Ji, Peifeng; Wang, Jian; Li, Qiang; Xu, Liming; Sun, Fanyue; Feng, Jianxin; Wang, Chenghui; Wang, Shaolin; Wang, Baosen; Li, Yan; Zhu, Yaping; Xue, Wei; Zhao, Lan; Wang, Jintu; Gu, Ying; Lv, Weihua; Wu, Kejing; Xiao, Jingfa; Wu, Jiayan; Zhang, Zhang; Yu, Jun; Sun, Xiaowen

    2014-11-01

    The common carp, Cyprinus carpio, is one of the most important cyprinid species and globally accounts for 10% of freshwater aquaculture production. Here we present a draft genome of domesticated C. carpio (strain Songpu), whose current assembly contains 52,610 protein-coding genes and approximately 92.3% coverage of its paleotetraploidized genome (2n = 100). The latest round of whole-genome duplication has been estimated to have occurred approximately 8.2 million years ago. Genome resequencing of 33 representative individuals from worldwide populations demonstrates a single origin for C. carpio in 2 subspecies (C. carpio Haematopterus and C. carpio carpio). Integrative genomic and transcriptomic analyses were used to identify loci potentially associated with traits including scaling patterns and skin color. In combination with the high-resolution genetic map, the draft genome paves the way for better molecular studies and improved genome-assisted breeding of C. carpio and other closely related species.

  19. Investigations into genome diversity of Haemophilus influenzae using whole genome sequencing of clinical isolates and laboratory transformants.

    Science.gov (United States)

    Power, Peter M; Bentley, Stephen D; Parkhill, Julian; Moxon, E Richard; Hood, Derek W

    2012-11-23

    Haemophilus influenzae is an important human commensal pathogen associated with significant levels of disease. High-throughput DNA sequencing was used to investigate differences in genome content within this species. Genomic DNA sequence was obtained from 85 strains of H. influenzae and from other related species, selected based on geographical site of isolation, disease association and documented genotypic and phenotypic differences. When compared by Mauve alignment these indicated groupings of H. influenzae that were consistent with previously published analyses; capsule expressing strains fell into two distinct groups and those of serotype b (Hib) were found in two closely positioned lineages. For 18 Hib strains representing both lineages we found many discrete regions (up to 40% of the total genome) displaying sequence variation when compared to a common reference strain. Evidence that this naturally occurring pattern of inter-strain variation in H. influenzae can be mediated by transformation was obtained through sequencing DNA obtained from a pool of 200 independent transformants of a recipient (strain Rd) using donor DNA from a heterologous Hib strain (Eagan). Much of the inter-strain variation in genome sequence in H. influenzae is likely the result of inter-strain exchanges of DNA, most plausibly through transformation.

  20. Investigating the global genomic diversity of Escherichia coli using a multi-genome DNA microarray platform with novel gene prediction strategies

    Directory of Open Access Journals (Sweden)

    LeClerc Joseph E

    2011-07-01

    Full Text Available Abstract Background The gene content of a diverse group of 183 unique Escherichia coli and Shigella isolates was determined using the Affymetrix GeneChip® E. coli Genome 2.0 Array, originally designed for transcriptome analysis, as a genotyping tool. The probe set design utilized by this array provided the opportunity to determine the gene content of each strain very accurately and reliably. This array constitutes 10,112 independent genes representing four individual E. coli genomes, therefore providing the ability to survey genes of several different pathogen types. The entire ECOR collection, 80 EHEC-like isolates, and a diverse set of isolates from our FDA strain repository were included in our analysis. Results From this study we were able to define sets of genes that correspond to, and therefore define, the EHEC pathogen type. Furthermore, our sampling of 63 unique strains of O157:H7 showed the ability of this array to discriminate between closely related strains. We found that individual strains of O157:H7 differed, on average, by 197 probe sets. Finally, we describe an analysis method that utilizes the power of the probe sets to determine accurately the presence/absence of each gene represented on this array. Conclusions These elements provide insights into understanding the microbial diversity that exists within extant E. coli populations. Moreover, these data demonstrate that this novel microarray-based analysis is a powerful tool in the field of molecular epidemiology and the newly emerging field of microbial forensics.

  1. Comparative genomic analysis of 45 type strains of the genus Bifidobacterium: a snapshot of its genetic diversity and evolution.

    Science.gov (United States)

    Sun, Zhihong; Zhang, Wenyi; Guo, Chenyi; Yang, Xianwei; Liu, Wenjun; Wu, Yarong; Song, Yuqin; Kwok, Lai Yu; Cui, Yujun; Menghe, Bilige; Yang, Ruifu; Hu, Liangping; Zhang, Heping

    2015-01-01

    Bifidobacteria are well known for their human health-promoting effects and are therefore widely applied in the food industry. Members of the Bifidobacterium genus were first identified from the human gastrointestinal tract and were then found to be widely distributed across various ecological niches. Although the genetic diversity of Bifidobacterium has been determined based on several marker genes or a few genomes, the global diversity and evolution scenario for the entire genus remain unresolved. The present study comparatively analyzed the genomes of 45 type strains. We built a robust genealogy for Bifidobacterium based on 402 core genes and defined its root according to the phylogeny of the tree of bacteria. Our results support that all human isolates are of younger lineages, and although species isolated from bees dominate the more ancient lineages, the bee was not necessarily the original host for bifidobacteria. Moreover, the species isolated from different hosts are enriched with specific gene sets, suggesting host-specific adaptation. Notably, bee-specific genes are strongly associated with respiratory metabolism and are potential in helping those bacteria adapt to the oxygen-rich gut environment in bees. This study provides a snapshot of the genetic diversity and evolution of Bifidobacterium, paving the way for future studies on the taxonomy and functional genomics of the genus.

  2. Intraspecies Genomic Diversity and Natural Population Structure of the Meat-Borne Lactic Acid Bacterium Lactobacillus sakei▿ †

    Science.gov (United States)

    Chaillou, Stéphane; Daty, Marie; Baraige, Fabienne; Dudez, Anne-Marie; Anglade, Patricia; Jones, Rhys; Alpert, Carl-Alfred; Champomier-Vergès, Marie-Christine; Zagorec, Monique

    2009-01-01

    Lactobacillus sakei is a food-borne bacterium naturally found in meat and fish products. A study was performed to examine the intraspecies diversity among 73 isolates sourced from laboratory collections in several different countries. Pulsed-field gel electrophoresis analysis demonstrated a 25% variation in genome size between isolates, ranging from 1,815 kb to 2,310 kb. The relatedness between isolates was then determined using a PCR-based method that detects the possession of 60 chromosomal genes belonging to the flexible gene pool. Ten different strain clusters were identified that had noticeable differences in their average genome size reflecting the natural population structure. The results show that many different genotypes may be isolated from similar types of meat products, suggesting a complex ecological habitat in which intraspecies diversity may be required for successful adaptation. Finally, proteomic analysis revealed a slight difference between the migration patterns of highly abundant GapA isoforms of the two prevailing L. sakei subspecies (sakei and carnosus). This analysis was used to affiliate the genotypic clusters with the corresponding subspecies. These findings reveal for the first time the extent of intraspecies genomic diversity in L. sakei. Consequently, identification of molecular subtypes may in the future prove valuable for a better understanding of microbial ecosystems in food products. PMID:19114527

  3. Comparative genomic analysis of 45 type strains of the genus Bifidobacterium: a snapshot of its genetic diversity and evolution.

    Directory of Open Access Journals (Sweden)

    Zhihong Sun

    Full Text Available Bifidobacteria are well known for their human health-promoting effects and are therefore widely applied in the food industry. Members of the Bifidobacterium genus were first identified from the human gastrointestinal tract and were then found to be widely distributed across various ecological niches. Although the genetic diversity of Bifidobacterium has been determined based on several marker genes or a few genomes, the global diversity and evolution scenario for the entire genus remain unresolved. The present study comparatively analyzed the genomes of 45 type strains. We built a robust genealogy for Bifidobacterium based on 402 core genes and defined its root according to the phylogeny of the tree of bacteria. Our results support that all human isolates are of younger lineages, and although species isolated from bees dominate the more ancient lineages, the bee was not necessarily the original host for bifidobacteria. Moreover, the species isolated from different hosts are enriched with specific gene sets, suggesting host-specific adaptation. Notably, bee-specific genes are strongly associated with respiratory metabolism and are potential in helping those bacteria adapt to the oxygen-rich gut environment in bees. This study provides a snapshot of the genetic diversity and evolution of Bifidobacterium, paving the way for future studies on the taxonomy and functional genomics of the genus.

  4. Family-wide Structural Characterization and Genomic Comparisons Decode the Diversity-oriented Biosynthesis of Thalassospiramides by Marine Proteobacteria.

    Science.gov (United States)

    Zhang, Weipeng; Lu, Liang; Lai, Qiliang; Zhu, Beika; Li, Zhongrui; Xu, Ying; Shao, Zongze; Herrup, Karl; Moore, Bradley S; Ross, Avena C; Qian, Pei-Yuan

    2016-12-30

    The thalassospiramide lipopeptides have great potential for therapeutic applications; however, their structural and functional diversity and biosynthesis are poorly understood. Here, by cultivating 130 Rhodospirillaceae strains sampled from oceans worldwide, we discovered 21 new thalassospiramide analogues and demonstrated their neuroprotective effects. To investigate the diversity of biosynthetic gene cluster (BGC) architectures, we sequenced the draft genomes of 28 Rhodospirillaceae strains. Our family-wide genomic analysis revealed three types of dysfunctional BGCs and four functional BGCs whose architectures correspond to four production patterns. This correlation allowed us to reassess the "diversity-oriented biosynthesis" proposed for the microbial production of thalassospiramides, which involves iteration of several key modules. Preliminary evolutionary investigation suggested that the functional BGCs could have arisen through module/domain loss, whereas the dysfunctional BGCs arose through horizontal gene transfer. Further comparative genomics indicated that thalassospiramide production is likely to be attendant on particular genes/pathways for amino acid metabolism, signaling transduction, and compound efflux. Our findings provide a systematic understanding of thalassospiramide production and new insights into the underlying mechanism. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  5. Comparative Genomic Analysis of 45 Type Strains of the Genus Bifidobacterium: A Snapshot of Its Genetic Diversity and Evolution

    Science.gov (United States)

    Sun, Zhihong; Zhang, Wenyi; Guo, Chenyi; Yang, Xianwei; Liu, Wenjun; Wu, Yarong; Song, Yuqin; Kwok, Lai Yu; Cui, Yujun; Menghe, Bilige; Yang, Ruifu; Hu, Liangping; Zhang, Heping

    2015-01-01

    Bifidobacteria are well known for their human health-promoting effects and are therefore widely applied in the food industry. Members of the Bifidobacterium genus were first identified from the human gastrointestinal tract and were then found to be widely distributed across various ecological niches. Although the genetic diversity of Bifidobacterium has been determined based on several marker genes or a few genomes, the global diversity and evolution scenario for the entire genus remain unresolved. The present study comparatively analyzed the genomes of 45 type strains. We built a robust genealogy for Bifidobacterium based on 402 core genes and defined its root according to the phylogeny of the tree of bacteria. Our results support that all human isolates are of younger lineages, and although species isolated from bees dominate the more ancient lineages, the bee was not necessarily the original host for bifidobacteria. Moreover, the species isolated from different hosts are enriched with specific gene sets, suggesting host-specific adaptation. Notably, bee-specific genes are strongly associated with respiratory metabolism and are potential in helping those bacteria adapt to the oxygen-rich gut environment in bees. This study provides a snapshot of the genetic diversity and evolution of Bifidobacterium, paving the way for future studies on the taxonomy and functional genomics of the genus. PMID:25658111

  6. Genomic patterns of nucleotide diversity in divergent populations of U.S. weedy rice

    Science.gov (United States)

    2010-01-01

    Background Weedy rice (red rice), a conspecific weed of cultivated rice (Oryza sativa L.), is a significant problem throughout the world and an emerging threat in regions where it was previously absent. Despite belonging to the same species complex as domesticated rice and its wild relatives, the evolutionary origins of weedy rice remain unclear. We use genome-wide patterns of single nucleotide polymorphism (SNP) variation in a broad geographic sample of weedy, domesticated, and wild Oryza samples to infer the origin and demographic processes influencing U.S. weedy rice evolution. Results We find greater population structure than has been previously reported for U.S. weedy rice, and that the multiple, genetically divergent populations have separate origins. The two main U.S. weedy rice populations share genetic backgrounds with cultivated O. sativa varietal groups not grown commercially in the U.S., suggesting weed origins from domesticated ancestors. Hybridization between weedy groups and between weedy rice and local crops has also led to the evolution of distinct U.S. weedy rice populations. Demographic simulations indicate differences among the main weedy groups in the impact of bottlenecks on their establishment in the U.S., and in the timing of divergence from their cultivated relatives. Conclusions Unlike prior research, we did not find unambiguous evidence for U.S. weedy rice originating via hybridization between cultivated and wild Oryza species. Our results demonstrate the potential for weedy life-histories to evolve directly from within domesticated lineages. The diverse origins of U.S. weedy rice populations demonstrate the multiplicity of evolutionary forces that can influence the emergence of weeds from a single species complex. PMID:20550656

  7. Genomic diversity in Mycobacterium leprae isolates from leprosy cases in South India.

    Science.gov (United States)

    Das, Madhusmita; Chaitanya, V Sundeep; Kanmani, K; Rajan, Lakshmi; Ebenezer, Mannam

    2016-11-01

    The Objective of this study was to identify the strain diversity of Mycobacterium leprae in terms of SNP types and subtypes stratified as per genomic single nucleotide polymorphisms, in clinical isolates of leprosy patients from a tertiary care leprosy center in South India. Further, the associations of SNP types with clinical outcomes in leprosy were also investigated. DNA was extracted from excisional skin biopsies of a total of 172 newly diagnosed untreated leprosy patients from a clinic in Tamil Nadu, in south India, that also serves patients from neighboring states. All the leprosy patients were those who voluntarily reported at the clinic during the study period of one year i.e., 2015. Clinical and histopathological details were collected at diagnosis and leprosy was confirmed through bacteriological smear examination and PCR for M. leprae specific RLEP region. SNP types and subtypes were determined by PCR amplification and Sanger sequencing of PCR products. M. leprae specific RLEP gene amplification was achieved in 160 out of 172 patients. Among 160 specimens 118(73.75%) were type 1 and 42 (26.25%) were type 2 and on subtyping it was noted that 88/160 (55.00%) were 1D, 25/160 (15.62%) 1C, 5/160 (3.12%) 1A, 33/160 (20.62%) 2G and 9/160 (5.62%) were 2H. Our results indicated that subtype 1D is predominant in the south Indian population. We also noted 2G, 1C and 1A in the patient sample tested. Additionally we identified subtype 2H for the first time in India. Copyright © 2016. Published by Elsevier B.V.

  8. Cytotype diversity and genome size variation in eastern Asian polyploid Cardamine (Brassicaceae) species.

    Science.gov (United States)

    Marhold, Karol; Kudoh, Hiroshi; Pak, Jae-Hong; Watanabe, Kuniaki; Spaniel, Stanislav; Lihová, Judita

    2010-02-01

    Intraspecific ploidy-level variation is an important aspect of a species' genetic make-up, which may lend insight into its evolutionary history and future potential. The present study explores this phenomenon in a group of eastern Asian Cardamine species. Plant material was sampled from 59 localities in Japan and Korea, which were used in karyological (chromosome counting) and flow cytometric analyses. The absolute nuclear DNA content (in pg) was measured using propidium iodide and the relative nuclear DNA content (in arbitrary units) was measured using 4,6-diamidino-2-phenylindole fluorochrome. Substantial cytotype diversity was found, with strikingly different distribution patterns between the species. Two cytotypes were found in C. torrentis sensu lato (4x and 8x, in C. valida and C. torrentis sensu stricto, respectively), which displays a north-south geographical pattern in Japan. Hypotheses regarding their origin and colonization history in the Japanese archipelago are discussed. In Korean C. amaraeiformis, only tetraploids were found, and these populations may in fact belong to C. valida. C. yezoensis was found to harbour as many as six cytotypes in Japan, ranging from hexa- to dodecaploids. Ploidy levels do not show any obvious geographical pattern; populations with mixed ploidy levels, containing two to four cytotypes, are frequently observed throughout the range. C. schinziana, an endemic of Hokkaido, has hexa- and octoploid populations. Previous chromosome records are also revised, showing that they are largely based on misidentified material or misinterpreted names. Sampling of multiple populations and utilization of the efficient flow cytometric approach allowed the detection of large-scale variation in ploidy levels and genome size variation attributable to aneuploidy. These data will be essential in further phylogenetic and evolutionary studies.

  9. Microbial iron management mechanisms in extremely acidic environments: comparative genomics evidence for diversity and versatility

    Directory of Open Access Journals (Sweden)

    Nieto Pamela A

    2008-11-01

    uptake systems could reflect their obligatory occupation of extremely low pH environments where high concentrations of soluble iron may always be available and were oxidized sulfur species might not compromise iron speciation dynamics. Presence of bacterioferritin in the Acidithiobacilli, polyphosphate accumulation functions and variants of FieF-like diffusion facilitators in both Acidithiobacilli and Leptospirilla, indicate that they may remove or store iron under conditions of variable availability. In addition, the Fe(II-oxidizing capacity of both A. ferrooxidans and Leptospirilla could itself be a way to evade iron stress imposed by readily available Fe(II ions at low pH. Fur regulatory sites have been predicted for a number of gene clusters including iron related and non-iron related functions in both the Acidithiobacilli and Leptospirilla, laying the foundation for the future discovery of iron regulated and iron-phosphate coordinated regulatory control circuits. Conclusion In silico analyses of the genomes of acidophilic bacteria are beginning to tease apart the mechanisms that mediate iron uptake and homeostasis in low pH environments. Initial models pinpoint significant differences in abundance and diversity of iron management mechanisms between Leptospirilla and Acidithiobacilli, and begin to reveal how these two groups respond to iron cycling and iron fluctuations in naturally acidic environments and in industrial operations. Niche partitions and ecological successions between acidophilic microorganisms may be partially explained by these observed differences. Models derived from these analyses pave the way for improved hypothesis testing and well directed experimental investigation. In addition, aspects of these models should challenge investigators to evaluate alternative iron management strategies in non-acidophilic model organisms.

  10. Hidden diversity revealed by genome-resolved metagenomics of iron-oxidizing microbial mats from Lō'ihi Seamount, Hawai'i.

    Science.gov (United States)

    Fullerton, Heather; Hager, Kevin W; McAllister, Sean M; Moyer, Craig L

    2017-08-01

    The Zetaproteobacteria are ubiquitous in marine environments, yet this class of Proteobacteria is only represented by a few closely-related cultured isolates. In high-iron environments, such as diffuse hydrothermal vents, the Zetaproteobacteria are important members of the community driving its structure. Biogeography of Zetaproteobacteria has shown two ubiquitous operational taxonomic units (OTUs), yet much is unknown about their genomic diversity. Genome-resolved metagenomics allows for the specific binning of microbial genomes based on genomic signatures present in composite metagenome assemblies. This resulted in the recovery of 93 genome bins, of which 34 were classified as Zetaproteobacteria. Form II ribulose 1,5-bisphosphate carboxylase genes were recovered from nearly all the Zetaproteobacteria genome bins. In addition, the Zetaproteobacteria genome bins contain genes for uptake and utilization of bioavailable nitrogen, detoxification of arsenic, and a terminal electron acceptor adapted for low oxygen concentration. Our results also support the hypothesis of a Cyc2-like protein as the site for iron oxidation, now detected across a majority of the Zetaproteobacteria genome bins. Whole genome comparisons showed a high genomic diversity across the Zetaproteobacteria OTUs and genome bins that were previously unidentified by SSU rRNA gene analysis. A single lineage of cosmopolitan Zetaproteobacteria (zOTU 2) was found to be monophyletic, based on cluster analysis of average nucleotide identity and average amino acid identity comparisons. From these data, we can begin to pinpoint genomic adaptations of the more ecologically ubiquitous Zetaproteobacteria, and further understand their environmental constraints and metabolic potential.

  11. Development of novel InDel markers and genetic diversity in Chenopodium quinoa through whole-genome re-sequencing.

    Science.gov (United States)

    Zhang, Tifu; Gu, Minfeng; Liu, Yuhe; Lv, Yuanda; Zhou, Ling; Lu, Haiyan; Liang, Shuaiqiang; Bao, Huabin; Zhao, Han

    2017-09-05

    Quinoa (Chenopodium quinoa Willd.) is a balanced nutritional crop, but its breeding improvement has been limited by the lack of information on its genetics and genomics. Therefore, it is necessary to obtain knowledge on genomic variation, population structure, and genetic diversity and to develop novel Insertion/Deletion (InDel) markers for quinoa by whole-genome re-sequencing. We re-sequenced 11 quinoa accessions and obtained a coverage depth between approximately 7× to 23× the quinoa genome. Based on the 1453-megabase (Mb) assembly from the reference accession Riobamba, 8,441,022 filtered bi-allelic single nucleotide polymorphisms (SNPs) and 842,783 filtered InDels were identified, with an estimated SNP and InDel density of 5.81 and 0.58 per kilobase (kb). From the genomic InDel variations, 85 dimorphic InDel markers were newly developed and validated. Together with the 62 simple sequence repeat (SSR) markers reported, a total of 147 markers were used for genotyping the 129 quinoa accessions. Molecular grouping analysis showed classification into two major groups, the Andean highland (composed of the northern and southern highland subgroups) and Chilean coastal, based on combined STRUCTURE, phylogenetic tree and PCA (Principle Component Analysis) analyses. Further analysis of the genetic diversity exhibited a decreasing tendency from the Chilean coast group to the Andean highland group, and the gene flow between subgroups was more frequent than that between the two subgroups and the Chilean coastal group. The majority of the variations (approximately 70%) were found through an analysis of molecular variation (AMOVA) due to the diversity between the groups. This was congruent with the observation of a highly significant FST value (0.705) between the groups, demonstrating significant genetic differentiation between the Andean highland type of quinoa and the Chilean coastal type. Moreover, a core set of 16 quinoa germplasms that capture all 362 alleles was

  12. Nucleotide diversity in the mitochondrial and nuclear compartments of Chlamydomonas reinhardtii: investigating the origins of genome architecture

    Directory of Open Access Journals (Sweden)

    Lee Robert W

    2008-05-01

    Full Text Available Abstract Background The magnitude of intronic and intergenic DNA can vary substantially both within and among evolutionary lineages; however, the forces responsible for this disparity in genome compactness are conjectural. One explanation, termed the mutational-burden hypothesis, posits that genome compactness is primarily driven by two nonadaptive processes: mutation and random genetic drift – the effects of which can be discerned by measuring the nucleotide diversity at silent sites (πsilent, defined as noncoding sites and the synonymous sites of protein-coding regions. The mutational-burden hypothesis holds that πsilent is negatively correlated to genome compactness. We used the model organism Chlamydomonas reinhardtii, which has a streamlined, coding-dense mitochondrial genome and an noncompact, intron-rich nuclear genome, to investigate the mutational-burden hypothesis. For measuring πsilent we sequenced the complete mitochondrial genome and portions of 7 nuclear genes from 7 geographical isolates of C. reinhardtii. Results We found significantly more nucleotide diversity in the nuclear compartment of C. reinhardtii than in the mitochondrial compartment: net values of πsilent for the nuclear and mitochondrial genomes were 32 × 10-3 and 8.5 × 10-3, respectively; and when insertions and deletions (indels are factored in, these values become 49 × 10-3 for the nuclear DNA and 11 × 10-3 for the mitochondrial DNA (mtDNA. Furthermore, our investigations of C. reinhardtii revealed 4 previously undiscovered mitochondrial introns, one of which contains a fragment of the large-subunit (LSU rRNA gene and another of which is found in a region of the LSU-rRNA gene not previously reported (for any taxon to contain introns. Conclusion At first glance our results are in opposition to the mutational-burden hypothesis: πsilent was approximately 4 times greater in the nuclear compartment of C. reinhardtii relative to the mitochondrial compartment

  13. The Chloroplast Genome Sequence of Scutellaria baicalensis Provides Insight into Intraspecific and Interspecific Chloroplast Genome Diversity in Scutellaria

    Directory of Open Access Journals (Sweden)

    Dan Jiang

    2017-09-01

    Full Text Available Scutellaria baicalensis Georgi (Lamiaceae is the source of the well-known traditional Chinese medicine “HuangQin” (Radix Scutellariae. Natural sources of S. baicalensis are rapidly declining due to high market demand and overexploitation. Moreover, the commercial products of Radix Scutellariae have often been found to contain adulterants in recent years, which may give rise to issues regarding drug efficacy and safety. In this study, we developed valuable chloroplast molecular resources by comparing intraspecific and interspecific chloroplast genome. The S. baicalensis chloroplast genome is a circular molecule consisting of two single-copy regions separated by a pair of inverted repeats. Comparative analyses of three Scutellaria chloroplast genomes revealed six variable regions (trnH-psbA, trnK-rps16, petN-psbM, trnT-trnL, petA-psbJ, and ycf1 that could be used as DNA barcodes. There were 25 single nucleotide polymorphisms(SNPs and 29 indels between the two S. baicalensis genotypes. All of the indels occurred within non-coding regions. Phylogenetic analysis suggested that Scutellarioideae is a sister taxon to Lamioideae. These resources could be used to explore the variation present in Scutellaria populations and for further evolutionary, phylogenetic, barcoding and genetic engineering studies, in addition to effective exploration and conservation of S. baicalensis.

  14. Defining the diverse spectrum of inversions, complex structural variation, and chromothripsis in the morbid human genome.

    Science.gov (United States)

    Collins, Ryan L; Brand, Harrison; Redin, Claire E; Hanscom, Carrie; Antolik, Caroline; Stone, Matthew R; Glessner, Joseph T; Mason, Tamara; Pregno, Giulia; Dorrani, Naghmeh; Mandrile, Giorgia; Giachino, Daniela; Perrin, Danielle; Walsh, Cole; Cipicchio, Michelle; Costello, Maura; Stortchevoi, Alexei; An, Joon-Yong; Currall, Benjamin B; Seabra, Catarina M; Ragavendran, Ashok; Margolin, Lauren; Martinez-Agosto, Julian A; Lucente, Diane; Levy, Brynn; Sanders, Stephan J; Wapner, Ronald J; Quintero-Rivera, Fabiola; Kloosterman, Wigard; Talkowski, Michael E

    2017-03-06

    Structural variation (SV) influences genome organization and contributes to human disease. However, the complete mutational spectrum of SV has not been routinely captured in disease association studies. We sequenced 689 participants with autism spectrum disorder (ASD) and other developmental abnormalities to construct a genome-wide map of large SV. Using long-insert jumping libraries at 105X mean physical coverage and linked-read whole-genome sequencing from 10X Genomics, we document seven major SV classes at ~5 kb SV resolution. Our results encompass 11,735 distinct large SV sites, 38.1% of which are novel and 16.8% of which are balanced or complex. We characterize 16 recurrent subclasses of complex SV (cxSV), revealing that: (1) cxSV are larger and rarer than canonical SV; (2) each genome harbors 14 large cxSV on average; (3) 84.4% of large cxSVs involve inversion; and (4) most large cxSV (93.8%) have not been delineated in previous studies. Rare SVs are more likely to disrupt coding and regulatory non-coding loci, particularly when truncating constrained and disease-associated genes. We also identify multiple cases of catastrophic chromosomal rearrangements known as chromoanagenesis, including somatic chromoanasynthesis, and extreme balanced germline chromothripsis events involving up to 65 breakpoints and 60.6 Mb across four chromosomes, further defining rare categories of extreme cxSV. These data provide a foundational map of large SV in the morbid human genome and demonstrate a previously underappreciated abundance and diversity of cxSV that should be considered in genomic studies of human disease.

  15. Recombination Enhances HIV-1 Envelope Diversity by Facilitating the Survival of Latent Genomic Fragments in the Plasma Virus Population.

    Directory of Open Access Journals (Sweden)

    Taina T Immonen

    2015-12-01

    Full Text Available HIV-1 is subject to immune pressure exerted by the host, giving variants that escape the immune response an advantage. Virus released from activated latent cells competes against variants that have continually evolved and adapted to host immune pressure. Nevertheless, there is increasing evidence that virus displaying a signal of latency survives in patient plasma despite having reduced fitness due to long-term immune memory. We investigated the survival of virus with latent envelope genomic fragments by simulating within-host HIV-1 sequence evolution and the cycling of viral lineages in and out of the latent reservoir. Our model incorporates a detailed mutation process including nucleotide substitution, recombination, latent reservoir dynamics, diversifying selection pressure driven by the immune response, and purifying selection pressure asserted by deleterious mutations. We evaluated the ability of our model to capture sequence evolution in vivo by comparing our simulated sequences to HIV-1 envelope sequence data from 16 HIV-infected untreated patients. Empirical sequence divergence and diversity measures were qualitatively and quantitatively similar to those of our simulated HIV-1 populations, suggesting that our model invokes realistic trends of HIV-1 genetic evolution. Moreover, reconstructed phylogenies of simulated and patient HIV-1 populations showed similar topological structures. Our simulation results suggest that recombination is a key mechanism facilitating the persistence of virus with latent envelope genomic fragments in the productively infected cell population. Recombination increased the survival probability of latent virus forms approximately 13-fold. Prevalence of virus with latent fragments in productively infected cells was observed in only 2% of simulations when we ignored recombination, while the proportion increased to 27% of simulations when we allowed recombination. We also found that the selection pressures exerted

  16. The dnd operon for DNA phosphorothioation modification system in Escherichia coli is located in diverse genomic islands.

    Science.gov (United States)

    Ho, Wing Sze; Ou, Hong-Yu; Yeo, Chew Chieng; Thong, Kwai Lin

    2015-03-17

    Strains of Escherichia coli that are non-typeable by pulsed-field gel electrophoresis (PFGE) due to in-gel degradation can influence their molecular epidemiological data. The DNA degradation phenotype (Dnd(+)) is mediated by the dnd operon that encode enzymes catalyzing the phosphorothioation of DNA, rendering the modified DNA susceptible to oxidative cleavage during a PFGE run. In this study, a PCR assay was developed to detect the presence of the dnd operon in Dnd(+) E. coli strains and to improve their typeability. Investigations into the genetic environments of the dnd operon in various E. coli strains led to the discovery that the dnd operon is harboured in various diverse genomic islands. The dndBCDE genes (dnd operon) were detected in all Dnd(+) E. coli strains by PCR. The addition of thiourea improved the typeability of Dnd(+) E. coli strains to 100% using PFGE and the Dnd(+) phenotype can be observed in both clonal and genetically diverse E. coli strains. Genomic analysis of 101 dnd operons from genome sequences of Enterobacteriaceae revealed that the dnd operons of the same bacterial species were generally clustered together in the phylogenetic tree. Further analysis of dnd operons of 52 E. coli genomes together with their respective immediate genetic environments revealed a total of 7 types of genetic organizations, all of which were found to be associated with genomic islands designated dnd-encoding GIs. The dnd-encoding GIs displayed mosaic structure and the genomic context of the 7 islands (with 1 representative genome from each type of genetic organization) were also highly variable, suggesting multiple recombination events. This is also the first report where two dnd operons were found within a strain although the biological implication is unknown. Surprisingly, dnd operons were frequently found in pathogenic E. coli although their link with virulence has not been explored. Genomic islands likely play an important role in facilitating the horizontal

  17. Determination of Elizabethkingia Diversity by MALDI-TOF Mass Spectrometry and Whole-Genome Sequencing

    DEFF Research Database (Denmark)

    Eriksen, Helle Brander; Gumpert, Heidi; Faurholt, Cecilie Haase

    2017-01-01

    In a hospital-acquired infection with multidrug-resistant Elizabethkingia, matrix-assisted laser desorption/ionization time-of-flight mass spectrometry and 16S rRNA gene analysis identified the pathogen as Elizabethkingia miricola. Whole-genome sequencing, genus-level core genome analysis...

  18. The genomics of speciation in Drosophila: diversity, divergence, and introgression estimated using low-coverage genome sequencing.

    Directory of Open Access Journals (Sweden)

    Rob J Kulathinal

    2009-07-01

    Full Text Available In nature, closely related species may hybridize while still retaining their distinctive identities. Chromosomal regions that experience reduced recombination in hybrids, such as within inversions, have been hypothesized to contribute to the maintenance of species integrity. Here, we examine genomic sequences from closely related fruit fly taxa of the Drosophila pseudoobscura subgroup to reconstruct their evolutionary histories and past patterns of genic exchange. Partial genomic assemblies were generated from two subspecies of Drosophila pseudoobscura (D. ps. and an outgroup species, D. miranda. These new assemblies were compared to available assemblies of D. ps. pseudoobscura and D. persimilis, two species with overlapping ranges in western North America. Within inverted regions, nucleotide divergence among each pair of the three species is comparable, whereas divergence between D. ps. pseudoobscura and D. persimilis in non-inverted regions is much lower and closer to levels of intraspecific variation. Using molecular markers flanking each of the major chromosomal inversions, we identify strong crossover suppression in F(1 hybrids extending over 2 megabase pairs (Mbp beyond the inversion breakpoints. These regions of crossover suppression also exhibit the high nucleotide divergence associated with inverted regions. Finally, by comparison to a geographically isolated subspecies, D. ps. bogotana, our results suggest that autosomal gene exchange between the North American species, D. ps. pseudoobscura and D. persimilis, occurred since the split of the subspecies, likely within the last 200,000 years. We conclude that chromosomal rearrangements have been vital to the ongoing persistence of these species despite recent hybridization. Our study serves as a proof-of-principle on how whole genome sequencing can be applied to formulate and test hypotheses about species formation in lesser-known non-model systems.

  19. Genome mining of the genetic diversity in the Aspergillus genus - from a collection of more than 30 Aspergillus species

    DEFF Research Database (Denmark)

    Rasmussen, Jane Lind Nybo; Vesth, Tammi Camilla; Theobald, Sebastian

    In the era of high-throughput sequencing, comparative genomics can be applied for evaluating species diversity. In this project we aim to compare the genomes of 300 species of filamentous fungi from the Aspergillus genus, a complex task. To be able to define species, clade, and core features, thi...

  20. Citrus Greening-HLB Genome Resources Group: A bioinformatics resource for diverse projects related to the biology and diagnosis of citrus greening/HLB

    OpenAIRE

    Saha, Surya; Lindeberg, Magdalen

    2013-01-01

    The CG-HLB Genome Resources Group serves as a bioinformatics resource for diverse projects related to the biology and diagnosis of citrus greening/HLB. Current projects include the identification of bacterial endosymbionts in the Diaphorina citri (Asian citrus psyllid) metagenome. Mapping sequence reads to reference genomes shows evidence for Wolbachia and an enterobacterial endosymbiont related to Salmonella in the metagenome. A draft genome sequence has been produced for the Wolbachia endos...

  1. Clone libraries and single cell genome amplification reveal extended diversity of uncultivated magnetotactic bacteria from marine and freshwater environments.

    Science.gov (United States)

    Kolinko, Sebastian; Wanner, Gerhard; Katzmann, Emanuel; Kiemer, Felizitas; Fuchs, Bernhard M; Schüler, Dirk

    2013-05-01

    Magnetotactic bacteria (MTB), which orient along the earth's magnetic field using magnetosomes, are ubiquitous and abundant in marine and freshwater environments. Previous phylogenetic analysis of diverse MTB has been limited to few cultured species and the most abundant and conspicuous members of natural populations, which were assigned to various lineages of the Proteobacteria, the Nitrospirae phylum as well as the candidate division OP3. However, their known phylogenetic diversity still not matches the large morphological and ultrastructural variability of uncultured MTB found in environmental communities. Here, we used analysis of 16S rRNA gene clone libraries in combination with microsorting and whole-genome amplification to systematically address the entire diversity of uncultured MTB from two different habitats. This approach revealed extensive and novel diversity of MTB within the freshwater and marine sediment samples. In total, single-cell analysis identified eight different phylotypes, which were only partly represented in the clone libraries, and which could be unambiguously assigned to their respective morphotypes. Identified MTB belonged to the Alphaproteobacteria (seven species) and the Nitrospirae phylum (two species). End-sequencing of a small insert library created from WGA-derived DNA of a novel conspicuous magnetotactic vibrio identified genes with highest similarity to two cultivated MTB as well as to other phylogenetic groups. In conclusion, the combination of metagenomic cloning and single cell sorting represents a powerful approach to recover maximum bacterial diversity including low-abundant magnetotactic phylotypes from environmental samples and also provides access to genomic analysis of uncultivated MTB. © 2012 Society for Applied Microbiology and Blackwell Publishing Ltd.

  2. Genomic and transcriptomic analyses reveal differential regulation of diverse terpenoid and polyketides secondary metabolites in Hericium erinaceus.

    Science.gov (United States)

    Chen, Juan; Zeng, Xu; Yang, Yan Long; Xing, Yong Mei; Zhang, Qi; Li, Jia Mei; Ma, Ke; Liu, Hong Wei; Guo, Shun Xing

    2017-08-31

    The lion's mane mushroom Hericium erinaceus is a famous traditional medicinal fungus credited with anti-dementia activity and a producer of cyathane diterpenoid natural products (erinacines) useful against nervous system diseases. To date, few studies have explored the biosynthesis of these compounds, although their chemical synthesis is known. Here, we report the first genome and tanscriptome sequence of the medicinal fungus H. erinaceus. The size of the genome is 39.35 Mb, containing 9895 gene models. The genome of H. erinaceus reveals diverse enzymes and a large family of cytochrome P450 (CYP) proteins involved in the biosynthesis of terpenoid backbones, diterpenoids, sesquiterpenes and polyketides. Three gene clusters related to terpene biosynthesis and one gene cluster for polyketides biosynthesis (PKS) were predicted. Genes involved in terpenoid biosynthesis were generally upregulated in mycelia, while the PKS gene was upregulated in the fruiting body. Comparative genome analysis of 42 fungal species of Basidiomycota revealed that most edible and medicinal mushroom show many more gene clusters involved in terpenoid and polyketide biosynthesis compared to the pathogenic fungi. None of the gene clusters for terpenoid or polyketide biosynthesis were predicted in the poisonous mushroom Amanita muscaria. Our findings may facilitate future discovery and biosynthesis of bioactive secondary metabolites from H. erinaceus and provide fundamental information for exploring the secondary metabolites in other Basidiomycetes.

  3. Which Individuals To Choose To Update the Reference Population? Minimizing the Loss of Genetic Diversity in Animal Genomic Selection Programs

    Directory of Open Access Journals (Sweden)

    Sonia E. Eynard

    2018-01-01

    Full Text Available Genomic selection (GS is commonly used in livestock and increasingly in plant breeding. Relying on phenotypes and genotypes of a reference population, GS allows performance prediction for young individuals having only genotypes. This is expected to achieve fast high genetic gain but with a potential loss of genetic diversity. Existing methods to conserve genetic diversity depend mostly on the choice of the breeding individuals. In this study, we propose a modification of the reference population composition to mitigate diversity loss. Since the high cost of phenotyping is the limiting factor for GS, our findings are of major economic interest. This study aims to answer the following questions: how would decisions on the reference population affect the breeding population, and how to best select individuals to update the reference population and balance maximizing genetic gain and minimizing loss of genetic diversity? We investigated three updating strategies for the reference population: random, truncation, and optimal contribution (OC strategies. OC maximizes genetic merit for a fixed loss of genetic diversity. A French Montbéliarde dairy cattle population with 50K SNP chip genotypes and simulations over 10 generations were used to compare these different strategies using milk production as the trait of interest. Candidates were selected to update the reference population. Prediction bias and both genetic merit and diversity were measured. Changes in the reference population composition slightly affected the breeding population. Optimal contribution strategy appeared to be an acceptable compromise to maintain both genetic gain and diversity in the reference and the breeding populations.

  4. Genomic patterns in Acropora cervicornis show extensive population structure and variable genetic diversity.

    Science.gov (United States)

    Drury, Crawford; Schopmeyer, Stephanie; Goergen, Elizabeth; Bartels, Erich; Nedimyer, Ken; Johnson, Meaghan; Maxwell, Kerry; Galvan, Victor; Manfrino, Carrie; Lirman, Diego

    2017-08-01

    Threatened Caribbean coral communities can benefit from high-resolution genetic data used to inform management and conservation action. We use Genotyping by Sequencing (GBS) to investigate genetic patterns in the threatened coral, Acropora cervicornis, across the Florida Reef Tract (FRT) and the western Caribbean. Results show extensive population structure at regional scales and resolve previously unknown structure within the FRT. Different regions also exhibit up to threefold differences in genetic diversity (He), suggesting targeted management based on the goals and resources of each population is needed. Patterns of genetic diversity have a strong spatial component, and our results show Broward and the Lower Keys are among the most diverse populations in Florida. The genetic diversity of Caribbean staghorn coral is concentrated within populations and within individual reefs (AMOVA), highlighting the complex mosaic of population structure. This variance structure is similar over regional and local scales, which suggests that in situ nurseries are adequately capturing natural patterns of diversity, representing a resource that can replicate the average diversity of wild assemblages, serving to increase intraspecific diversity and potentially leading to improved biodiversity and ecosystem function. Results presented here can be translated into specific goals for the recovery of A. cervicornis, including active focus on low diversity areas, protection of high diversity and connectivity, and practical thresholds for responsible restoration.

  5. Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication.

    Science.gov (United States)

    Wu, G Albert; Prochnik, Simon; Jenkins, Jerry; Salse, Jerome; Hellsten, Uffe; Murat, Florent; Perrier, Xavier; Ruiz, Manuel; Scalabrin, Simone; Terol, Javier; Takita, Marco Aurélio; Labadie, Karine; Poulain, Julie; Couloux, Arnaud; Jabbari, Kamel; Cattonaro, Federica; Del Fabbro, Cristian; Pinosio, Sara; Zuccolo, Andrea; Chapman, Jarrod; Grimwood, Jane; Tadeo, Francisco R; Estornell, Leandro H; Muñoz-Sanz, Juan V; Ibanez, Victoria; Herrero-Ortega, Amparo; Aleza, Pablo; Pérez-Pérez, Julián; Ramón, Daniel; Brunel, Dominique; Luro, François; Chen, Chunxian; Farmerie, William G; Desany, Brian; Kodira, Chinnappa; Mohiuddin, Mohammed; Harkins, Tim; Fredrikson, Karin; Burns, Paul; Lomsadze, Alexandre; Borodovsky, Mark; Reforgiato, Giuseppe; Freitas-Astúa, Juliana; Quetier, Francis; Navarro, Luis; Roose, Mikeal; Wincker, Patrick; Schmutz, Jeremy; Morgante, Michele; Machado, Marcos Antonio; Talon, Manuel; Jaillon, Olivier; Ollitrault, Patrick; Gmitter, Frederick; Rokhsar, Daniel

    2014-07-01

    Cultivated citrus are selections from, or hybrids of, wild progenitor species whose identities and contributions to citrus domestication remain controversial. Here we sequence and compare citrus genomes--a high-quality reference haploid clementine genome and mandarin, pummelo, sweet-orange and sour-orange genomes--and show that cultivated types derive from two progenitor species. Although cultivated pummelos represent selections from one progenitor species, Citrus maxima, cultivated mandarins are introgressions of C. maxima into the ancestral mandarin species Citrus reticulata. The most widely cultivated citrus, sweet orange, is the offspring of previously admixed individuals, but sour orange is an F1 hybrid of pure C. maxima and C. reticulata parents, thus implying that wild mandarins were part of the early breeding germplasm. A Chinese wild 'mandarin' diverges substantially from C. reticulata, thus suggesting the possibility of other unrecognized wild citrus species. Understanding citrus phylogeny through genome analysis clarifies taxonomic relationships and facilitates sequence-directed genetic improvement.

  6. DivStat: a user-friendly tool for single nucleotide polymorphism analysis of genomic diversity.

    Directory of Open Access Journals (Sweden)

    Inês Soares

    Full Text Available Recent developments have led to an enormous increase of publicly available large genomic data, including complete genomes. The 1000 Genomes Project was a major contributor, releasing the results of sequencing a large number of individual genomes, and allowing for a myriad of large scale studies on human genetic variation. However, the tools currently available are insufficient when the goal concerns some analyses of data sets encompassing more than hundreds of base pairs and when considering haplotype sequences of single nucleotide polymorphisms (SNPs. Here, we present a new and potent tool to deal with large data sets allowing the computation of a variety of summary statistics of population genetic data, increasing the speed of data analysis.

  7. Chaperone-usher fimbriae in a diverse selection of Gallibacterium genomes

    National Research Council Canada - National Science Library

    Kudirkienė, Eglė; Bager, Ragnhild J; Johnson, Timothy J; Bojesen, Anders M

    2014-01-01

    ...) family was recently shown to be an important virulence factor and vaccine candidate. To reveal the distribution and variability of CU fimbriae 22 genomes of the avian host-restricted bacteria Gallibacterium spp. were investigated...

  8. Chromosomal copy number variation, selection and uneven rates of recombination reveal cryptic genome diversity linked to pathogenicity.

    Directory of Open Access Journals (Sweden)

    Rhys A Farrer

    Full Text Available Pathogenic fungi constitute a growing threat to both plant and animal species on a global scale. Despite a clonal mode of reproduction dominating the population genetic structure of many fungi, putatively asexual species are known to adapt rapidly when confronted by efforts to control their growth and transmission. However, the mechanisms by which adaptive diversity is generated across a clonal background are often poorly understood. We sequenced a global panel of the emergent amphibian pathogen, Batrachochytrium dendrobatidis (Bd, to high depth and characterized rapidly changing features of its genome that we believe hold the key to the worldwide success of this organism. Our analyses show three processes that contribute to the generation of de novo diversity. Firstly, we show that the majority of wild isolates manifest chromosomal copy number variation that changes over short timescales. Secondly, we show that cryptic recombination occurs within all lineages of Bd, leading to large regions of the genome being in linkage equilibrium, and is preferentially associated with classes of genes of known importance for virulence in other pathosystems. Finally, we show that these classes of genes are under directional selection, and that this has predominantly targeted the Global Panzootic Lineage (BdGPL. Our analyses show that Bd manifests an unusually dynamic genome that may have been shaped by its association with the amphibian host. The rates of variation that we document likely explain the high levels of phenotypic variability that have been reported for Bd, and suggests that the dynamic genome of this pathogen has contributed to its success across multiple biomes and host-species.

  9. Draft genome of spinach and transcriptome diversity of 120 Spinacia accessions

    Science.gov (United States)

    Xu, Chenxi; Jiao, Chen; Sun, Honghe; Cai, Xiaofeng; Wang, Xiaoli; Ge, Chenhui; Zheng, Yi; Liu, Wenli; Sun, Xuepeng; Xu, Yimin; Deng, Jie; Zhang, Zhonghua; Huang, Sanwen; Dai, Shaojun; Mou, Beiquan; Wang, Quanxi; Fei, Zhangjun; Wang, Quanhua

    2017-01-01

    Spinach is an important leafy vegetable enriched with multiple necessary nutrients. Here we report the draft genome sequence of spinach (Spinacia oleracea, 2n=12), which contains 25,495 protein-coding genes. The spinach genome is highly repetitive with 74.4% of its content in the form of transposable elements. No recent whole genome duplication events are observed in spinach. Genome syntenic analysis between spinach and sugar beet suggests substantial inter- and intra-chromosome rearrangements during the Caryophyllales genome evolution. Transcriptome sequencing of 120 cultivated and wild spinach accessions reveals more than 420 K variants. Our data suggests that S. turkestanica is likely the direct progenitor of cultivated spinach and spinach domestication has a weak bottleneck. We identify 93 domestication sweeps in the spinach genome, some of which are associated with important agronomic traits including bolting, flowering and leaf numbers. This study offers insights into spinach evolution and domestication and provides resources for spinach research and improvement. PMID:28537264

  10. Small- and large-scale heterogeneity in genetic variation across the collard flycatcher genome: implications for estimating genetic diversity in nonmodel organisms.

    Science.gov (United States)

    Ingvarsson, Pär K; Wang, Jing

    2017-07-01

    Population genetic studies in nonmodel organisms are often hampered by a lack of reference genomes that are essential for whole-genome resequencing. In the light of this, genotyping methods have been developed to effectively eliminate the need for a reference genome, such as genotyping by sequencing or restriction site-associated DNA sequencing (RAD-seq). However, what remains relatively poorly studied is how accurately these methods capture both average and variation in genetic diversity across an organism's genome. In this issue of Molecular Ecology Resources, Dutoit et al. (2016) use whole-genome resequencing data from the collard flycatcher to assess what factors drive heterogeneity in nucleotide diversity across the genome. Using these data, they then simulate how well different sequencing designs, including RAD sequencing, could capture most of the variation in genetic diversity. They conclude that for evolutionary and conservation-related studies focused on the estimating genomic diversity, researchers should emphasize the number of loci analysed over the number of individuals sequenced. © 2017 John Wiley & Sons Ltd.

  11. Digital genotyping of sorghum – a diverse plant species with a large repeat-rich genome

    Science.gov (United States)

    2013-01-01

    Background Rapid acquisition of accurate genotyping information is essential for all genetic marker-based studies. For species with relatively small genomes, complete genome resequencing is a feasible approach for genotyping; however, for species with large and highly repetitive genomes, the acquisition of whole genome sequences for the purpose of genotyping is still relatively inefficient and too expensive to be carried out on a high-throughput basis. Sorghum bicolor is a C4 grass with a sequenced genome size of ~730 Mb, of which ~80% is highly repetitive. We have developed a restriction enzyme targeted genome resequencing method for genetic analysis, termed Digital Genotyping (DG), to be applied to sorghum and other grass species with large repeat-rich genomes. Results DG templates are generated using one of three methylation sensitive restriction enzymes that recognize a nested set of 4, 6 or 8 bp GC-rich sequences, enabling varying depth of analysis and integration of results among assays. Variation in sequencing efficiency among DG markers was correlated with template GC-content and length. The expected DG allele sequence was obtained 97.3% of the time with a ratio of expected to alternative allele sequence acquisition of >20:1. A genetic map aligned to the sorghum genome sequence with an average resolution of 1.47 cM was constructed using 1,772 DG markers from 137 recombinant inbred lines. The DG map enhanced the detection of QTL for variation in plant height and precisely aligned QTL such as Dw3 to underlying genes/alleles. Higher-resolution NgoMIV-based DG haplotypes were used to trace the origin of DNA on SBI-06, spanning Ma1 and Dw2 from progenitors to BTx623 and IS3620C. DG marker analysis identified the correct location of two miss-assembled regions and located seven super contigs in the sorghum reference genome sequence. Conclusion DG technology provides a cost-effective approach to rapidly generate accurate genotyping data in sorghum. Currently

  12. Comparison of 26 sphingomonad genomes reveals diverse environmental adaptations and biodegradative capabilities.

    Science.gov (United States)

    Aylward, Frank O; McDonald, Bradon R; Adams, Sandra M; Valenzuela, Alejandra; Schmidt, Rebeccah A; Goodwin, Lynne A; Woyke, Tanja; Currie, Cameron R; Suen, Garret; Poulsen, Michael

    2013-06-01

    Sphingomonads comprise a physiologically versatile group within the Alphaproteobacteria that includes strains of interest for biotechnology, human health, and environmental nutrient cycling. In this study, we compared 26 sphingomonad genome sequences to gain insight into their ecology, metabolic versatility, and environmental adaptations. Our multilocus phylogenetic and average amino acid identity (AAI) analyses confirm that Sphingomonas, Sphingobium, Sphingopyxis, and Novosphingobium are well-resolved monophyletic groups with the exception of Sphingomonas sp. strain SKA58, which we propose belongs to the genus Sphingobium. Our pan-genomic analysis of sphingomonads reveals numerous species-specific open reading frames (ORFs) but few signatures of genus-specific cores. The organization and coding potential of the sphingomonad genomes appear to be highly variable, and plasmid-mediated gene transfer and chromosome-plasmid recombination, together with prophage- and transposon-mediated rearrangements, appear to play prominent roles in the genome evolution of this group. We find that many of the sphingomonad genomes encode numerous oxygenases and glycoside hydrolases, which are likely responsible for their ability to degrade various recalcitrant aromatic compounds and polysaccharides, respectively. Many of these enzymes are encoded on megaplasmids, suggesting that they may be readily transferred between species. We also identified enzymes putatively used for the catabolism of sulfonate and nitroaromatic compounds in many of the genomes, suggesting that plant-based compounds or chemical contaminants may be sources of nitrogen and sulfur. Many of these sphingomonads appear to be adapted to oligotrophic environments, but several contain genomic features indicative of host associations. Our work provides a basis for understanding the ecological strategies employed by sphingomonads and their role in environmental nutrient cycling.

  13. The USDA barley core collection: genetic diversity, population structure, and potential for genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    María Muñoz-Amatriaín

    Full Text Available New sources of genetic diversity must be incorporated into plant breeding programs if they are to continue increasing grain yield and quality, and tolerance to abiotic and biotic stresses. Germplasm collections provide a source of genetic and phenotypic diversity, but characterization of these resources is required to increase their utility for breeding programs. We used a barley SNP iSelect platform with 7,842 SNPs to genotype 2,417 barley accessions sampled from the USDA National Small Grains Collection of 33,176 accessions. Most of the accessions in this core collection are categorized as landraces or cultivars/breeding lines and were obtained from more than 100 countries. Both STRUCTURE and principal component analysis identified five major subpopulations within the core collection, mainly differentiated by geographical origin and spike row number (an inflorescence architecture trait. Different patterns of linkage disequilibrium (LD were found across the barley genome and many regions of high LD contained traits involved in domestication and breeding selection. The genotype data were used to define 'mini-core' sets of accessions capturing the majority of the allelic diversity present in the core collection. These 'mini-core' sets can be used for evaluating traits that are difficult or expensive to score. Genome-wide association studies (GWAS of 'hull cover', 'spike row number', and 'heading date' demonstrate the utility of the core collection for locating genetic factors determining important phenotypes. The GWAS results were referenced to a new barley consensus map containing 5,665 SNPs. Our results demonstrate that GWAS and high-density SNP genotyping are effective tools for plant breeders interested in accessing genetic diversity in large germplasm collections.

  14. Genomic analysis of diversity, population structure, virulence, and antimicrobial resistance in Klebsiella pneumoniae, an urgent threat to public health.

    Science.gov (United States)

    Holt, Kathryn E; Wertheim, Heiman; Zadoks, Ruth N; Baker, Stephen; Whitehouse, Chris A; Dance, David; Jenney, Adam; Connor, Thomas R; Hsu, Li Yang; Severin, Juliëtte; Brisse, Sylvain; Cao, Hanwei; Wilksch, Jonathan; Gorrie, Claire; Schultz, Mark B; Edwards, David J; Nguyen, Kinh Van; Nguyen, Trung Vu; Dao, Trinh Tuyet; Mensink, Martijn; Minh, Vien Le; Nhu, Nguyen Thi Khanh; Schultsz, Constance; Kuntaman, Kuntaman; Newton, Paul N; Moore, Catrin E; Strugnell, Richard A; Thomson, Nicholas R

    2015-07-07

    Klebsiella pneumoniae is now recognized as an urgent threat to human health because of the emergence of multidrug-resistant strains associated with hospital outbreaks and hypervirulent strains associated with severe community-acquired infections. K. pneumoniae is ubiquitous in the environment and can colonize and infect both plants and animals. However, little is known about the population structure of K. pneumoniae, so it is difficult to recognize or understand the emergence of clinically important clones within this highly genetically diverse species. Here we present a detailed genomic framework for K. pneumoniae based on whole-genome sequencing of more than 300 human and animal isolates spanning four continents. Our data provide genome-wide support for the splitting of K. pneumoniae into three distinct species, KpI (K. pneumoniae), KpII (K. quasipneumoniae), and KpIII (K. variicola). Further, for K. pneumoniae (KpI), the entity most frequently associated with human infection, we show the existence of >150 deeply branching lineages including numerous multidrug-resistant or hypervirulent clones. We show K. pneumoniae has a large accessory genome approaching 30,000 protein-coding genes, including a number of virulence functions that are significantly associated with invasive community-acquired disease in humans. In our dataset, antimicrobial resistance genes were common among human carriage isolates and hospital-acquired infections, which generally lacked the genes associated with invasive disease. The convergence of virulence and resistance genes potentially could lead to the emergence of untreatable invasive K. pneumoniae infections; our data provide the whole-genome framework against which to track the emergence of such threats.

  15. Genomic analysis of diversity, population structure, virulence, and antimicrobial resistance in Klebsiella pneumoniae, an urgent threat to public health

    Science.gov (United States)

    Holt, Kathryn E.; Wertheim, Heiman; Zadoks, Ruth N.; Baker, Stephen; Whitehouse, Chris A.; Dance, David; Jenney, Adam; Connor, Thomas R.; Hsu, Li Yang; Severin, Juliëtte; Brisse, Sylvain; Cao, Hanwei; Wilksch, Jonathan; Gorrie, Claire; Schultz, Mark B.; Edwards, David J.; Nguyen, Kinh Van; Nguyen, Trung Vu; Dao, Trinh Tuyet; Mensink, Martijn; Minh, Vien Le; Nhu, Nguyen Thi Khanh; Schultsz, Constance; Kuntaman, Kuntaman; Newton, Paul N.; Moore, Catrin E.; Strugnell, Richard A.; Thomson, Nicholas R.

    2015-01-01

    Klebsiella pneumoniae is now recognized as an urgent threat to human health because of the emergence of multidrug-resistant strains associated with hospital outbreaks and hypervirulent strains associated with severe community-acquired infections. K. pneumoniae is ubiquitous in the environment and can colonize and infect both plants and animals. However, little is known about the population structure of K. pneumoniae, so it is difficult to recognize or understand the emergence of clinically important clones within this highly genetically diverse species. Here we present a detailed genomic framework for K. pneumoniae based on whole-genome sequencing of more than 300 human and animal isolates spanning four continents. Our data provide genome-wide support for the splitting of K. pneumoniae into three distinct species, KpI (K. pneumoniae), KpII (K. quasipneumoniae), and KpIII (K. variicola). Further, for K. pneumoniae (KpI), the entity most frequently associated with human infection, we show the existence of >150 deeply branching lineages including numerous multidrug-resistant or hypervirulent clones. We show K. pneumoniae has a large accessory genome approaching 30,000 protein-coding genes, including a number of virulence functions that are significantly associated with invasive community-acquired disease in humans. In our dataset, antimicrobial resistance genes were common among human carriage isolates and hospital-acquired infections, which generally lacked the genes associated with invasive disease. The convergence of virulence and resistance genes potentially could lead to the emergence of untreatable invasive K. pneumoniae infections; our data provide the whole-genome framework against which to track the emergence of such threats. PMID:26100894

  16. House spider genome uncovers evolutionary shifts in the diversity and expression of black widow venom proteins associated with extreme toxicity.

    Science.gov (United States)

    Gendreau, Kerry L; Haney, Robert A; Schwager, Evelyn E; Wierschin, Torsten; Stanke, Mario; Richards, Stephen; Garb, Jessica E

    2017-02-16

    Black widow spiders are infamous for their neurotoxic venom, which can cause extreme and long-lasting pain. This unusual venom is dominated by latrotoxins and latrodectins, two protein families virtually unknown outside of the black widow genus Latrodectus, that are difficult to study given the paucity of spider genomes. Using tissue-, sex- and stage-specific expression data, we analyzed the recently sequenced genome of the house spider (Parasteatoda tepidariorum), a close relative of black widows, to investigate latrotoxin and latrodectin diversity, expression and evolution. We discovered at least 47 latrotoxin genes in the house spider genome, many of which are tandem-arrayed. Latrotoxins vary extensively in predicted structural domains and expression, implying their significant functional diversification. Phylogenetic analyses show latrotoxins have substantially duplicated after the Latrodectus/Parasteatoda split and that they are also related to proteins found in endosymbiotic bacteria. Latrodectin genes are less numerous than latrotoxins, but analyses show their recruitment for venom function from neuropeptide hormone genes following duplication, inversion and domain truncation. While latrodectins and other peptides are highly expressed in house spider and black widow venom glands, latrotoxins account for a far smaller percentage of house spider venom gland expression. The house spider genome sequence provides novel insights into the evolution of venom toxins once considered unique to black widows. Our results greatly expand the size of the latrotoxin gene family, reinforce its narrow phylogenetic distribution, and provide additional evidence for the lateral transfer of latrotoxins between spiders and bacterial endosymbionts. Moreover, we strengthen the evidence for the evolution of latrodectin venom genes from the ecdysozoan Ion Transport Peptide (ITP)/Crustacean Hyperglycemic Hormone (CHH) neuropeptide superfamily. The lower expression of latrotoxins in

  17. Metabolic diversity and ecological niches of Achromatium populations revealed with single-cell genomic sequencing

    Directory of Open Access Journals (Sweden)

    Muammar eMansor

    2015-08-01

    Full Text Available Large, sulfur-cycling, calcite-precipitating bacteria in the genus Achromatium represent a significant proportion of bacterial communities near sediment-water interfaces throughout the world. Our understanding of their potentially crucial roles in calcium, carbon, sulfur, nitrogen, and iron cycling is limited because they have not been cultured or sequenced using environmental genomics approaches to date. We utilized single-cell genomic sequencing to obtain one incomplete and two nearly complete draft genomes for Achromatium collected at Warm Mineral Springs, FL. Based on 16S rRNA gene sequences, the three cells represent distinct and relatively distant Achromatium populations (91-92% identity. The draft genomes encode key genes involved in sulfur and hydrogen oxidation; oxygen, nitrogen and polysulfide respiration; carbon and nitrogen fixation; organic carbon assimilation and storage; chemotaxis; twitching motility; antibiotic resistance; and membrane transport. Known genes for iron and manganese energy metabolism were not detected. The presence of pyrophosphatase and vacuolar (V-type ATPases, which are generally rare in bacterial genomes, suggests a role for these enzymes in calcium transport, proton pumping, and/or energy generation in the membranes of calcite-containing inclusions.

  18. The humankind genome: from genetic diversity to the origin of human diseases.

    Science.gov (United States)

    Belizário, Jose E

    2013-12-01

    Genome-wide association studies have failed to establish common variant risk for the majority of common human diseases. The underlying reasons for this failure are explained by recent studies of resequencing and comparison of over 1200 human genomes and 10 000 exomes, together with the delineation of DNA methylation patterns (epigenome) and full characterization of coding and noncoding RNAs (transcriptome) being transcribed. These studies have provided the most comprehensive catalogues of functional elements and genetic variants that are now available for global integrative analysis and experimental validation in prospective cohort studies. With these datasets, researchers will have unparalleled opportunities for the alignment, mining, and testing of hypotheses for the roles of specific genetic variants, including copy number variations, single nucleotide polymorphisms, and indels as the cause of specific phenotypes and diseases. Through the use of next-generation sequencing technologies for genotyping and standardized ontological annotation to systematically analyze the effects of genomic variation on humans and model organism phenotypes, we will be able to find candidate genes and new clues for disease's etiology and treatment. This article describes essential concepts in genetics and genomic technologies as well as the emerging computational framework to comprehensively search websites and platforms available for the analysis and interpretation of genomic data.

  19. Diversity and evolution of root-knot nematodes, genus Meloidogyne: new insights from the genomic era.

    Science.gov (United States)

    Castagnone-Sereno, Philippe; Danchin, Etienne G J; Perfus-Barbeoch, Laetitia; Abad, Pierre

    2013-01-01

    Root-knot nematodes (RKNs) (Meloidogyne spp.) are obligate endoparasites of major worldwide economic importance. They exhibit a wide continuum of variation in their reproductive strategies, ranging from amphimixis to obligatory mitotic parthenogenesis. Molecular phylogenetic studies have highlighted divergence between mitotic and meiotic parthenogenetic RKN species and probable interspecific hybridization as critical steps in their speciation and diversification process. The recent completion of the genomes of two RKNs, Meloidogyne hapla and Meloidogyne incognita, that exhibit striking differences in their mode of reproduction (with and without sex, respectively), their geographic distribution, and their host range has opened the way for deciphering the evolutionary significance of (a)sexual reproduction in these parasites. Accumulating evidence suggests that whole-genome duplication (in M. incognita) and horizontal gene transfers (HGTs) represent major forces that have shaped the genome of current RKN species and may account for the extreme adaptive capacities and parasitic success of these nematodes.

  20. Comparison of 26 sphingomonad genomes reveals diverse environmental adaptations and biodegradative capabilities

    DEFF Research Database (Denmark)

    Aylward, Frank O.; McDonald, Bradon R.; Adams, Sandra M.

    2013-01-01

    versatility, and environmental adaptations. Our multilocus phylogenetic and average amino acid identity (AAI) analyses confirm that Sphingomonas, Sphingobium, Sphingopyxis, and Novosphingobium are well-resolved monophyletic groups with the exception of Sphingomonas sp. strain SKA58, which we propose belongs......Sphingomonads comprise a physiologically versatile group within the Alphaproteobacteria that includes strains of interest for biotechnology, human health, and environmental nutrient cycling. In this study, we compared 26 sphingomonad genome sequences to gain insight into their ecology, metabolic...... compounds in many of the genomes, suggesting that plant-based compounds or chemical contaminants may be sources of nitrogen and sulfur. Many of these sphingomonads appear to be adapted to oligotrophic environments, but several contain genomic features indicative of host associations. Our work provides...

  1. Environmental Whole-Genome Amplification to Access Microbial Diversity in Contaminated Sediments

    Energy Technology Data Exchange (ETDEWEB)

    Abulencia, C.B.; Wyborski, D.L.; Garcia, J.; Podar, M.; Chen, W.; Chang, S.H.; Chang, H.W.; Watson, D.; Brodie,E.I.; Hazen, T.C.; Keller, M.

    2005-12-10

    Low-biomass samples from nitrate and heavy metal contaminated soils yield DNA amounts that have limited use for direct, native analysis and screening. Multiple displacement amplification (MDA) using ?29 DNA polymerase was used to amplify whole genomes from environmental, contaminated, subsurface sediments. By first amplifying the genomic DNA (gDNA), biodiversity analysis and gDNA library construction of microbes found in contaminated soils were made possible. The MDA method was validated by analyzing amplified genome coverage from approximately five Escherichia coli cells, resulting in 99.2 percent genome coverage. The method was further validated by confirming overall representative species coverage and also an amplification bias when amplifying from a mix of eight known bacterial strains. We extracted DNA from samples with extremely low cell densities from a U.S. Department of Energy contaminated site. After amplification, small subunit rRNA analysis revealed relatively even distribution of species across several major phyla. Clone libraries were constructed from the amplified gDNA, and a small subset of clones was used for shotgun sequencing. BLAST analysis of the library clone sequences showed that 64.9 percent of the sequences had significant similarities to known proteins, and ''clusters of orthologous groups'' (COG) analysis revealed that more than half of the sequences from each library contained sequence similarity to known proteins. The libraries can be readily screened for native genes or any target of interest. Whole-genome amplification of metagenomic DNA from very minute microbial sources, while introducing an amplification bias, will allow access to genomic information that was not previously accessible.

  2. Comparative analysis of the Oenococcus oeni pan genome reveals genetic diversity in industrially-relevant pathways

    Directory of Open Access Journals (Sweden)

    Borneman Anthony R

    2012-08-01

    Full Text Available Abstract Background Oenococcus oeni, a member of the lactic acid bacteria, is one of a limited number of microorganisms that not only survive, but actively proliferate in wine. It is also unusual as, unlike the majority of bacteria present in wine, it is beneficial to wine quality rather than causing spoilage. These benefits are realised primarily through catalysing malolactic fermentation, but also through imparting other positive sensory properties. However, many of these industrially-important secondary attributes have been shown to be strain-dependent and their genetic basis it yet to be determined. Results In order to investigate the scale and scope of genetic variation in O. oeni, we have performed whole-genome sequencing on eleven strains of this bacterium, bringing the total number of strains for which genome sequences are available to fourteen. While any single strain of O. oeni was shown to contain around 1800 protein-coding genes, in-depth comparative annotation based on genomic synteny and protein orthology identified over 2800 orthologous open reading frames that comprise the pan genome of this species, and less than 1200 genes that make up the conserved genomic core present in all of the strains. The expansion of the pan genome relative to the coding potential of individual strains was shown to be due to the varied presence and location of multiple distinct bacteriophage sequences and also in various metabolic functions with potential impacts on the industrial performance of this species, including cell wall exopolysaccharide biosynthesis, sugar transport and utilisation and amino acid biosynthesis. Conclusions By providing a large cohort of sequenced strains, this study provides a broad insight into the genetic variation present within O. oeni. This data is vital to understanding and harnessing the phenotypic variation present in this economically-important species.

  3. Probing genomic diversity and evolution of Streptococcus suis serotype 2 by NimbleGen tiling arrays

    Directory of Open Access Journals (Sweden)

    Liao Hui

    2011-05-01

    Full Text Available Abstract Background Our previous studies revealed that a new disease form of streptococcal toxic shock syndrome (STSS is associated with specific Streptococcus suis serotype 2 (SS2 strains. To achieve a better understanding of the pathogenicity and evolution of SS2 at the whole-genome level, comparative genomic analysis of 18 SS2 strains, selected on the basis of virulence and geographic origin, was performed using NimbleGen tiling arrays. Results Our results demonstrate that SS2 isolates have highly divergent genomes. The 89K pathogenicity island (PAI, which has been previously recognized as unique to the Chinese epidemic strains causing STSS, was partially included in some other virulent and avirulent strains. The ABC-type transport systems, encoded by 89K, were hypothesized to greatly contribute to the catastrophic features of STSS. Moreover, we identified many polymorphisms in genes encoding candidate or known virulence factors, such as PlcR, lipase, sortases, the pilus-associated proteins, and the response regulator RevS and CtsR. On the basis of analysis of regions of differences (RDs across the entire genome for the 18 selected SS2 strains, a model of microevolution for these strains is proposed, which provides clues into Streptococcus pathogenicity and evolution. Conclusions Our deep comparative genomic analysis of the 89K PAI present in the genome of SS2 strains revealed details into how some virulent strains acquired genes that may contribute to STSS, which may lead to better environmental monitoring of epidemic SS2 strains.

  4. Linking secondary metabolites to gene clusters through genome sequencing of six diverse Aspergillus species

    DEFF Research Database (Denmark)

    Kjærbølling, Inge; Vesth, Tammi C.; Frisvad, Jens C.

    2018-01-01

    The fungal genus of Aspergillus is highly interesting, containing everything from industrial cell factories, model organisms, and human pathogens. In particular, this group has a prolific production of bioactive secondary metabolites (SMs). In this work, four diverse Aspergillus species (A...

  5. Repertoire of SSRs in the Castor Bean Genome and Their Utilization in Genetic Diversity Analysis in Jatropha curcas

    Directory of Open Access Journals (Sweden)

    Arti Sharma

    2011-01-01

    Full Text Available Castor bean and Jatropha contain seed oil of industrial importance, share taxonomical and biochemical similarities, which can be explored for identifying SSRs in the whole genome sequence of castor bean and utilized in Jatropha curcas. Whole genome analysis of castor bean identified 5,80,986 SSRs with a frequency of 1 per 680 bp. Genomic distribution of SSRs revealed that 27% were present in the non-genic region whereas 73% were also present in the putative genic regions with 26% in 5′UTRs, 25% in introns, 16% in 3′UTRs and 6% in the exons. Dinucleotide repeats were more frequent in introns, 5′UTRs and 3′UTRs whereas trinucleotide repeats were predominant in the exons. The transferability of randomly selected 302 SSRs, from castor bean to 49 J. curcas genotypes and 8 Jatropha species other than J. curcas, showed that 211 (~70% amplified on Jatropha out of which 7.58% showed polymorphisms in J. curcas genotypes and 12.32% in Jatropha species. The higher rate of transferability of SSR markers from castor bean to Jatropha coupled with a good level of PIC (polymorphic information content value (0.2 in J. curcas genotypes and 0.6 in Jatropha species suggested that SSRs would be useful in germplasm analysis, linkage mapping, diversity studies and phylogenetic relationships, and so forth, in J. curcas as well as other Jatropha species.

  6. Bayesian Joint Modeling of Multiple Gene Networks and Diverse Genomic Data to Identify Target Genes of a Transcription Factor.

    Science.gov (United States)

    Wei, Peng; Pan, Wei

    2012-01-01

    We consider integrative modeling of multiple gene networks and diverse genomic data, including protein-DNA binding, gene expression and DNA sequence data, to accurately identify the regulatory target genes of a transcription factor (TF). Rather than treating all the genes equally and independently a priori in existing joint modeling approaches, we incorporate the biological prior knowledge that neighboring genes on a gene network tend to be (or not to be) regulated together by a TF. A key contribution of our work is that, to maximize the use of all existing biological knowledge, we allow incorporation of multiple gene networks into joint modeling of genomic data by introducing a mixture model based on the use of multiple Markov random fields (MRFs). Another important contribution of our work is to allow different genomic data to be correlated and to examine the validity and effect of the independence assumption as adopted in existing methods. Due to a fully Bayesian approach, inference about model parameters can be carried out based on MCMC samples. Application to an E. coli data set, together with simulation studies, demonstrates the utility and statistical efficiency gains with the proposed joint model.

  7. Repertoire of SSRs in the Castor Bean Genome and Their Utilization in Genetic Diversity Analysis in Jatropha curcas

    Science.gov (United States)

    Sharma, Arti; Chauhan, Rajinder Singh

    2011-01-01

    Castor bean and Jatropha contain seed oil of industrial importance, share taxonomical and biochemical similarities, which can be explored for identifying SSRs in the whole genome sequence of castor bean and utilized in Jatropha curcas. Whole genome analysis of castor bean identified 5,80,986 SSRs with a frequency of 1 per 680 bp. Genomic distribution of SSRs revealed that 27% were present in the non-genic region whereas 73% were also present in the putative genic regions with 26% in 5′UTRs, 25% in introns, 16% in 3′UTRs and 6% in the exons. Dinucleotide repeats were more frequent in introns, 5′UTRs and 3′UTRs whereas trinucleotide repeats were predominant in the exons. The transferability of randomly selected 302 SSRs, from castor bean to 49 J. curcas genotypes and 8 Jatropha species other than J. curcas, showed that 211 (∼70%) amplified on Jatropha out of which 7.58% showed polymorphisms in J. curcas genotypes and 12.32% in Jatropha species. The higher rate of transferability of SSR markers from castor bean to Jatropha coupled with a good level of PIC (polymorphic information content) value (0.2 in J. curcas genotypes and 0.6 in Jatropha species) suggested that SSRs would be useful in germplasm analysis, linkage mapping, diversity studies and phylogenetic relationships, and so forth, in J. curcas as well as other Jatropha species. PMID:21687555

  8. Genomic Characterization of Dairy Associated Leuconostoc Species and Diversity of Leuconostocs in Undefined Mixed Mesophilic Starter Cultures.

    Science.gov (United States)

    Frantzen, Cyril A; Kot, Witold; Pedersen, Thomas B; Ardö, Ylva M; Broadbent, Jeff R; Neve, Horst; Hansen, Lars H; Dal Bello, Fabio; Østlie, Hilde M; Kleppen, Hans P; Vogensen, Finn K; Holo, Helge

    2017-01-01

    Undefined mesophilic mixed (DL-type) starter cultures are composed of predominantly Lactococcus lactis subspecies and 1-10% Leuconostoc spp. The composition of the Leuconostoc population in the starter culture ultimately affects the characteristics and the quality of the final product. The scientific basis for the taxonomy of dairy relevant leuconostocs can be traced back 50 years, and no documentation on the genomic diversity of leuconostocs in starter cultures exists. We present data on the Leuconostoc population in five DL-type starter cultures commonly used by the dairy industry. The analyses were performed using traditional cultivation methods, and further augmented by next-generation DNA sequencing methods. Bacterial counts for starter cultures cultivated on two different media, MRS and MPCA, revealed large differences in the relative abundance of leuconostocs. Most of the leuconostocs in two of the starter cultures were unable to grow on MRS, emphasizing the limitations of culture-based methods and the importance of careful media selection or use of culture independent methods. Pan-genomic analysis of 59 Leuconostoc genomes enabled differentiation into twelve robust lineages. The genomic analyses show that the dairy-associated leuconostocs are highly adapted to their environment, characterized by the acquisition of genotype traits, such as the ability to metabolize citrate. In particular, Leuconostoc mesenteroides subsp. cremoris display telltale signs of a degenerative evolution, likely resulting from a long period of growth in milk in association with lactococci. Great differences in the metabolic potential between Leuconostoc species and subspecies were revealed. Using targeted amplicon sequencing, the composition of the Leuconostoc population in the five commercial starter cultures was shown to be significantly different. Three of the cultures were dominated by Ln. mesenteroides subspecies cremoris. Leuconostoc pseudomesenteroides dominated in two of the

  9. Gene transfers from diverse bacteria compensate for reductive genome evolution in the chromatophore of Paulinella chromatophora.

    Science.gov (United States)

    Nowack, Eva C M; Price, Dana C; Bhattacharya, Debashish; Singer, Anna; Melkonian, Michael; Grossman, Arthur R

    2016-10-25

    Plastids, the photosynthetic organelles, originated >1 billion y ago via the endosymbiosis of a cyanobacterium. The resulting proliferation of primary producers fundamentally changed global ecology. Endosymbiotic gene transfer (EGT) from the intracellular cyanobacterium to the nucleus is widely recognized as a critical factor in the evolution of photosynthetic eukaryotes. The contribution of horizontal gene transfers (HGTs) from other bacteria to plastid establishment remains more controversial. A novel perspective on this issue is provided by the amoeba Paulinella chromatophora, which contains photosynthetic organelles (chromatophores) that are only 60-200 million years old. Chromatophore genome reduction entailed the loss of many biosynthetic pathways including those for numerous amino acids and cofactors. How the host cell compensates for these losses remains unknown, because the presence of bacteria in all available P. chromatophora cultures excluded elucidation of the full metabolic capacity and occurrence of HGT in this species. Here we generated a high-quality transcriptome and draft genome assembly from the first bacteria-free P. chromatophora culture to deduce rules that govern organelle integration into cellular metabolism. Our analyses revealed that nuclear and chromatophore gene inventories provide highly complementary functions. At least 229 nuclear genes were acquired via HGT from various bacteria, of which only 25% putatively arose through EGT from the chromatophore genome. Many HGT-derived bacterial genes encode proteins that fill gaps in critical chromatophore pathways/processes. Our results demonstrate a dominant role for HGT in compensating for organelle genome reduction and suggest that phagotrophy may be a major driver of HGT.

  10. Draft genome of spinach and transcriptome diversity of 120 Spinacia accessions

    Science.gov (United States)

    Spinach is an important leafy vegetable enriched with multiple and necessary nutrients. It belongs to the order of Caryophyllales, which constitutes the basal clade in core eudicots. Here we report the draft genome sequence of cultivated spinach (Spinacia oleracea, 2n=12), which contains 25,495 prot...

  11. Wild emmer genome architecture and diversity elucidate wheat evolution and domestication

    Science.gov (United States)

    Wheat (Triticum spp.) is one of the founder crops that likely drove the Neolithic transition to sedentary agrarian societies in the Fertile Crescent over 10,000 years ago. Identifying genetic modifications underlying wheat's domestication requires knowledge of the genome of its allo-tetraploid proge...

  12. Novel Insights into the Diversity of Catabolic Metabolism from Ten Haloarchaeal Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Anderson, Iain; Scheuner, Carmen; Goker, Markus; Mavromatis, Kostas; Hooper, Sean D.; Porat, Iris; Klenk, Hans-Peter; Ivanova, Natalia; Kyrpides, Nikos

    2011-05-03

    The extremely halophilic archaea are present worldwide in saline environments and have important biotechnological applications. Ten complete genomes of haloarchaea are now available, providing an opportunity for comparative analysis. We report here the comparative analysis of five newly sequenced haloarchaeal genomes with five previously published ones. Whole genome trees based on protein sequences provide strong support for deep relationships between the ten organisms. Using a soft clustering approach, we identified 887 protein clusters present in all halophiles. Of these core clusters, 112 are not found in any other archaea and therefore constitute the haloarchaeal signature. Four of the halophiles were isolated from water, and four were isolated from soil or sediment. Although there are few habitat-specific clusters, the soil/sediment halophiles tend to have greater capacity for polysaccharide degradation, siderophore synthesis, and cell wall modification. Halorhabdus utahensis and Haloterrigena turkmenica encode over forty glycosyl hydrolases each, and may be capable of breaking down naturally occurring complex carbohydrates. H. utahensis is specialized for growth on carbohydrates and has few amino acid degradation pathways. It uses the non-oxidative pentose phosphate pathway instead of the oxidative pathway, giving it more flexibility in the metabolism of pentoses. These new genomes expand our understanding of haloarchaeal catabolic pathways, providing a basis for further experimental analysis, especially with regard to carbohydrate metabolism. Halophilic glycosyl hydrolases for use in biofuel production are more likely to be found in halophiles isolated from soil or sediment.

  13. Chaperone-usher fimbriae in a diverse selection of Gallibacterium genomes

    DEFF Research Database (Denmark)

    Kudirkiene, Egle; Bager, Ragnhild Jørgensen; Johnson, Timothy J.

    2014-01-01

    encoding a putative major fimbrial subunit, a chaperone, an usher and a fimbrial adhesin. Five fimbrial clusters (Flf-Flf4) and eight conserved domain groups were defined to accommodate the identified fimbriae. Although, the number of different fimbrial clusters in individual Gallibacterium genomes was low...

  14. Genome sequence analysis with MonetDB: a case study on Ebola virus diversity

    NARCIS (Netherlands)

    C.P. Cijvat (Robin); S. Manegold (Stefan); M.L. Kersten (Martin); G.W. Klau (Gunnar); A. Schönhuth (Alexander); T. Marschall (Tobias); Y. Zhang (Ying)

    2015-01-01

    htmlabstractNext-generation sequencing (NGS) technology has led the life sciences into the big data era. Today, sequencing genomes takes little time and cost, but results in terabytes of data to be stored and analysed. Biologists are often exposed to excessively time consuming and error-prone data

  15. Genome sequence analysis with MonetDB - A case study on Ebola virus diversity

    NARCIS (Netherlands)

    Cijvat, R.; Manegold, S.; Kersten, M.; Klau, G.W.; Schönhuth, A.; Marschall, T.; Zhang, Y.

    2015-01-01

    Next-generation sequencing (NGS) technology has led the life sciences into the big data era. Today, sequencing genomes takes little time and cost, but yields terabytes of data to be stored and analyzed. Biologists are often exposed to excessively time consuming and error-prone data management and

  16. Genetic diversity and genome-wide association analysis of cooking time in dry bean (Phaseolus vulgaris L.).

    Science.gov (United States)

    Cichy, Karen A; Wiesinger, Jason A; Mendoza, Fernando A

    2015-08-01

    Fivefold diversity for cooking time found in a panel of 206 Phaseolus vulgaris accessions. Fastest accession cooks nearly 20 min faster than average.   SNPs associated with cooking time on Pv02, 03, and 06. Dry beans (Phaseolus vulgaris L.) are a nutrient dense food and a dietary staple in parts of Africa and Latin America. One of the major factors that limits greater utilization of beans is their long cooking times compared to other foods. Cooking time is an important trait with implications for gender equity, nutritional value of diets, and energy utilization. Very little is known about the genetic diversity and genomic regions involved in determining cooking time. The objective of this research was to assess cooking time on a panel of 206 P. vulgaris accessions, use genome- wide association analysis (GWAS) to identify genomic regions influencing this trait, and to test the ability to predict cooking time by raw seed characteristics. In this study 5.5-fold variation for cooking time was found and five bean accessions were identified which cook in less than 27 min across 2 years, where the average cooking time was 37 min. One accession, ADP0367 cooked nearly 20 min faster than average. Four of these five accessions showed close phylogenetic relationship based on a NJ tree developed with ~5000 SNP markers, suggesting a potentially similar underlying genetic mechanism. GWAS revealed regions on chromosomes Pv02, Pv03, and Pv06 associated with cooking time. Vis/NIR scanning of raw seed explained 68 % of the phenotypic variation for cooking time, suggesting with additional experimentation, it may be possible to use this spectroscopy method to non-destructively identify fast cooking lines as part of a breeding program.

  17. Genomic diversity and phylogenetic relationships of human papillomavirus 16 (HPV16) in Nepal.

    Science.gov (United States)

    Makowsky, Robert; Lhaki, Pema; Wiener, Howard W; Bhatta, Madhav P; Cullen, Michael; Johnson, Derek C; Perry, Rodney T; Lama, Mingma; Boland, Joseph F; Yeager, Meredith; Ghimire, Sarita; Broker, Thomas R; Shrestha, Sadeep

    2016-12-01

    Sequence variants in HPV16 confer differences in oncogenic potential; however, to date there have not been any HPV sequence studies performed in Nepal. The objective of this study was to characterize HPV16 viral genome sequences from Nepal compared to a reference sequence in order to determine their lineages. Additionally, we sought to determine if five High-grade Squamous Intraepithelial Lesion (HSIL) subjects were genetically distinct from the non-HSIL subjects. DNA was isolated from exfoliated cervical cells from 17 individuals in Nepal who were previously identified to be HPV16-positive. A custom HPV16 Ion Ampliseq panel of multiplexed degenerate primers was designed that generated 47 overlapping amplicons and covered 99% of the viral genome for all known HPV16 variant lineages. All sequence data were processed through a custom quality control and analysis pipeline of sequence comparisons and phylogenetic analysis. There were high similarities across the genomes, with two major indels observed in the non-coding region between E5 and L2. Compared to the PAVE reference HPV16 genome, there were up to 9, 4, 38, 27, 8, 7, 52, and 32 nucleotide variants in the E6, E7, E1, E2, E4, E5, L2, and L1 genes in the Nepalese samples, respectively. Based on sequence variation, HPV16 from Nepal falls across the A, C, and D lineages in this study. We found no evidence of genetic distinctness between HSIL and non-HSIL subjects. The evolutionary and pathological characteristics of the representative HPV16 genomes from Nepal seem similar to results from other parts of the world and provide the basis for further studies. Copyright © 2016 Elsevier B.V. All rights reserved.

  18. Insights into the genetic structure and diversity of 38 South Asian Indians from deep whole-genome sequencing.

    Directory of Open Access Journals (Sweden)

    Lai-Ping Wong

    2014-05-01

    Full Text Available South Asia possesses a significant amount of genetic diversity due to considerable intergroup differences in culture and language. There have been numerous reports on the genetic structure of Asian Indians, although these have mostly relied on genotyping microarrays or targeted sequencing of the mitochondria and Y chromosomes. Asian Indians in Singapore are primarily descendants of immigrants from Dravidian-language-speaking states in south India, and 38 individuals from the general population underwent deep whole-genome sequencing with a target coverage of 30X as part of the Singapore Sequencing Indian Project (SSIP. The genetic structure and diversity of these samples were compared against samples from the Singapore Sequencing Malay Project and populations in Phase 1 of the 1,000 Genomes Project (1 KGP. SSIP samples exhibited greater intra-population genetic diversity and possessed higher heterozygous-to-homozygous genotype ratio than other Asian populations. When compared against a panel of well-defined Asian Indians, the genetic makeup of the SSIP samples was closely related to South Indians. However, even though the SSIP samples clustered distinctly from the Europeans in the global population structure analysis with autosomal SNPs, eight samples were assigned to mitochondrial haplogroups that were predominantly present in Europeans and possessed higher European admixture than the remaining samples. An analysis of the relative relatedness between SSIP with two archaic hominins (Denisovan, Neanderthal identified higher ancient admixture in East Asian populations than in SSIP. The data resource for these samples is publicly available and is expected to serve as a valuable complement to the South Asian samples in Phase 3 of 1 KGP.

  19. Secondary uses and the governance of de-identified data: Lessons from the human genome diversity panel

    Directory of Open Access Journals (Sweden)

    Lee Sandra S-J

    2011-09-01

    Full Text Available Abstract Background Recent changes to regulatory guidance in the US and Europe have complicated oversight of secondary research by rendering most uses of de-identified data exempt from human subjects oversight. To identify the implications of such guidelines for harms to participants and communities, this paper explores the secondary uses of one de-identified DNA sample collection with limited oversight: the Human Genome Diversity Project (HGDP-Centre d'Etude du Polymorphisme Humain, Fondation Jean Dausset (CEPH Human Genome Diversity Panel. Methods Using a combination of keyword and cited reference search, we identified English-language scientific articles published between 2002 and 2009 that reported analysis of HGDP Diversity Panel samples and/or data. We then reviewed each article to identify the specific research use to which the samples and/or data was applied. Secondary uses were categorized according to the type and kind of research supported by the collection. Results A wide variety of secondary uses were identified from 148 peer-reviewed articles. While the vast majority of these uses were consistent with the original intent of the collection, a minority of published reports described research whose primary findings could be regarded as controversial, objectionable, or potentially stigmatizing in their interpretation. Conclusions We conclude that potential risks to participants and communities cannot be wholly eliminated by anonymization of individual data and suggest that explicit review of proposed secondary uses, by a Data Access Committee or similar internal oversight body with suitable stakeholder representation, should be a required component of the trustworthy governance of any repository of data or specimens.

  20. Insights into the genetic structure and diversity of 38 South Asian Indians from deep whole-genome sequencing.

    Science.gov (United States)

    Wong, Lai-Ping; Lai, Jason Kuan-Han; Saw, Woei-Yuh; Ong, Rick Twee-Hee; Cheng, Anthony Youzhi; Pillai, Nisha Esakimuthu; Liu, Xuanyao; Xu, Wenting; Chen, Peng; Foo, Jia-Nee; Tan, Linda Wei-Lin; Koo, Seok-Hwee; Soong, Richie; Wenk, Markus Rene; Lim, Wei-Yen; Khor, Chiea-Chuen; Little, Peter; Chia, Kee-Seng; Teo, Yik-Ying

    2014-05-01

    South Asia possesses a significant amount of genetic diversity due to considerable intergroup differences in culture and language. There have been numerous reports on the genetic structure of Asian Indians, although these have mostly relied on genotyping microarrays or targeted sequencing of the mitochondria and Y chromosomes. Asian Indians in Singapore are primarily descendants of immigrants from Dravidian-language-speaking states in south India, and 38 individuals from the general population underwent deep whole-genome sequencing with a target coverage of 30X as part of the Singapore Sequencing Indian Project (SSIP). The genetic structure and diversity of these samples were compared against samples from the Singapore Sequencing Malay Project and populations in Phase 1 of the 1,000 Genomes Project (1 KGP). SSIP samples exhibited greater intra-population genetic diversity and possessed higher heterozygous-to-homozygous genotype ratio than other Asian populations. When compared against a panel of well-defined Asian Indians, the genetic makeup of the SSIP samples was closely related to South Indians. However, even though the SSIP samples clustered distinctly from the Europeans in the global population structure analysis with autosomal SNPs, eight samples were assigned to mitochondrial haplogroups that were predominantly present in Europeans and possessed higher European admixture than the remaining samples. An analysis of the relative relatedness between SSIP with two archaic hominins (Denisovan, Neanderthal) identified higher ancient admixture in East Asian populations than in SSIP. The data resource for these samples is publicly available and is expected to serve as a valuable complement to the South Asian samples in Phase 3 of 1 KGP.

  1. Exploring Genomic Diversity Using Metagenomics of Deep-Sea Subsurface Microbes from the Louisville Seamount and the South Pacific Gyre

    Science.gov (United States)

    Tully, B. J.; Sylvan, J. B.; Heidelberg, J. F.; Huber, J. A.

    2014-12-01

    There are many limitations involved with sampling microbial diversity from deep-sea subsurface environments, ranging from physical sample collection, low microbial biomass, culturing at in situ conditions, and inefficient nucleic acid extractions. As such, we are continually modifying our methods to obtain better results and expanding what we know about microbes in these environments. Here we present analysis of metagenomes sequences from samples collected from 120 m within the Louisville Seamount and from the top 5-10cm of the sediment in the center of the south Pacific gyre (SPG). Both systems are low biomass with ~102 and ~104 cells per cm3 for Louisville Seamount samples analyzed and the SPG sediment, respectively. The Louisville Seamount represents the first in situ subseafloor basalt and the SPG sediments represent the first in situ low biomass sediment microbial metagenomes. Both of these environments, subseafloor basalt and sediments underlying oligotrophic ocean gyres, represent large provinces of the seafloor environment that remain understudied. Despite the low biomass and DNA generated from these samples, we have generated 16 near complete genomes (5 from Louisville and 11 from the SPG) from the two metagenomic datasets. These genomes are estimated to be between 51-100% complete and span a range of phylogenetic groups, including the Proteobacteria, Actinobacteria, Firmicutes, Chloroflexi, and unclassified bacterial groups. With these genomes, we have assessed potential functional capabilities of these organisms and performed a comparative analysis between the environmental genomes and previously sequenced relatives to determine possible adaptations that may elucidate survival mechanisms for these low energy environments. These methods illustrate a baseline analysis that can be applied to future metagenomic deep-sea subsurface datasets and will help to further our understanding of microbiology within these environments.

  2. Genome-wide genetic diversity and differentially selected regions among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep.

    Directory of Open Access Journals (Sweden)

    Lifan Zhang

    Full Text Available Sheep are among the major economically important livestock species worldwide because the animals produce milk, wool, skin, and meat. In the present study, the Illumina OvineSNP50 BeadChip was used to investigate genetic diversity and genome selection among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds from the United States. After quality-control filtering of SNPs (single nucleotide polymorphisms, we used 48,026 SNPs, including 46,850 SNPs on autosomes that were in Hardy-Weinberg equilibrium and 1,176 SNPs on chromosome × for analysis. Phylogenetic analysis based on all 46,850 SNPs clearly separated Suffolk from Rambouillet, Columbia, Polypay, and Targhee, which was not surprising as Rambouillet contributed to the synthesis of the later three breeds. Based on pair-wise estimates of F(ST, significant genetic differentiation appeared between Suffolk and Rambouillet (F(ST = 0.1621, while Rambouillet and Targhee had the closest relationship (F(ST = 0.0681. A scan of the genome revealed 45 and 41 differentially selected regions (DSRs between Suffolk and Rambouillet and among Rambouillet-related breed populations, respectively. Our data indicated that regions 13 and 24 between Suffolk and Rambouillet might be good candidates for evaluating breed differences. Furthermore, ovine genome v3.1 assembly was used as reference to link functionally known homologous genes to economically important traits covered by these differentially selected regions. In brief, our present study provides a comprehensive genome-wide view on within- and between-breed genetic differentiation, biodiversity, and evolution among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds. These results may provide new guidance for the synthesis of new breeds with different breeding objectives.

  3. Population stratification in the context of diverse epidemiologic surveys sans genome-wide data

    Directory of Open Access Journals (Sweden)

    Matthew T. Oetjens

    2016-05-01

    Full Text Available Population stratification or confounding by genetic ancestry is a potential cause of false associations in genetic association studies. Estimation of and adjustment for genetic ancestry has become common practice thanks in part to the availability of ancestry informative markers on genome-wide association study (GWAS arrays. While array data is now widespread, these data are not ubiquitous as several large epidemiologic and clinic-based studies lack genome-wide data. One such large epidemiologic-based study lacking genome-wide data accessible to investigators is the National Health and Nutrition Examination Surveys (NHANES, population-based cross-sectional surveys of Americans linked to demographic, health, and lifestyle data conducted by the Centers for Disease Control and Prevention. DNA samples (n=14,998 were extracted from biospecimens from consented NHANES participants between 1991-1994 (NHANES III, phase 2 and 1999-2002 and represent three major self-identified racial/ethnic groups: non-Hispanic whites (n=6,634, non-Hispanic blacks (n=3,458, and Mexican Americans (n=3,950. We as the Epidemiologic Architecture for Genes Linked to Environment (EAGLE study genotyped candidate gene and GWAS-identified index variants in NHANES as part of the larger Population Architecture using Genomics and Epidemiology (PAGE I study for collaborative genetic association studies. To enable basic quality control such as estimation of genetic ancestry to control for population stratification in NHANES san genome-wide data, we outline here strategies that use limited genetic data to identify the markers optimal for characterizing genetic ancestry. From among 411 and 295 autosomal SNPs available in NHANES III and NHANES 1999-2002, we demonstrate that markers with ancestry information can be identified to estimate global ancestry. Despite limited resolution, global genetic ancestry is highly correlated with self-identified race for the majority of participants

  4. Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication

    Science.gov (United States)

    Wu, G. Albert; Prochnik, Simon; Jenkins, Jerry; Salse, Jerome; Hellsten, Uffe; Murat, Florent; Perrier, Xavier; Ruiz, Manuel; Scalabrin, Simone; Terol, Javier; Takita, Marco Aurélio; Labadie, Karine; Poulain, Julie; Couloux, Arnaud; Jabbari, Kamel; Cattonaro, Federica; Del Fabbro, Cristian; Pinosio, Sara; Zuccolo, Andrea; Chapman, Jarrod; Grimwood, Jane; Tadeo, Francisco R.; Estornell, Leandro H.; Muñoz-Sanz, Juan V.; Ibanez, Victoria; Herrero-Ortega, Amparo; Aleza, Pablo; Pérez-Pérez, Julián; Ramón, Daniel; Brunel, Dominique; Luro, François; Chen, Chunxian; Farmerie, William G.; Desany, Brian; Kodira, Chinnappa; Mohiuddin, Mohammed; Harkins, Tim; Fredrikson, Karin; Burns, Paul; Lomsadze, Alexandre; Borodovsky, Mark; Reforgiato, Giuseppe; Freitas-Astúa, Juliana; Quetier, Francis; Navarro, Luis; Roose, Mikeal; Wincker, Patrick; Schmutz, Jeremy; Morgante, Michele; Machado, Marcos Antonio; Talon, Manuel; Jaillon, Olivier; Ollitrault, Patrick; Gmitter, Frederick; Rokhsar, Daniel

    2014-01-01

    The domestication of citrus, is poorly understood. Cultivated types are selections from, or hybrids of, wild progenitor species, whose identities and contributions remain controversial. By comparative analysis of a collection of citrus genomes, including a high quality haploid reference, we show that cultivated types were derived from two progenitor species. Though cultivated pummelos represent selections from a single progenitor species, C. maxima, cultivated mandarins are introgressions of C. maxima into the ancestral mandarin species, C. reticulata. The most widely cultivated citrus, sweet orange, is the offspring of previously admixed individuals, but sour orange is an F1 hybrid of pure C. maxima and C. reticulata parents, implying that wild mandarins were part of the early breeding germplasm. A wild “mandarin” from China exhibited substantial divergence from C. reticulata, suggesting the possibility of other unrecognized wild citrus species. Understanding citrus phylogeny through genome analysis clarifies taxonomic relationships and enables sequence-directed genetic improvement. PMID:24908277

  5. Genomic and transcriptomic evidence for scavenging of diverse organic compounds by widespread deep-sea archaea

    Science.gov (United States)

    Li, Meng; Baker, Brett J.; Anantharaman, Karthik; Jain, Sunit; Breier, John A.; Dick, Gregory J.

    2015-01-01

    Microbial activity is one of the most important processes to mediate the flux of organic carbon from the ocean surface to the seafloor. However, little is known about the microorganisms that underpin this key step of the global carbon cycle in the deep oceans. Here we present genomic and transcriptomic evidence that five ubiquitous archaeal groups actively use proteins, carbohydrates, fatty acids and lipids as sources of carbon and energy at depths ranging from 800 to 4,950 m in hydrothermal vent plumes and pelagic background seawater across three different ocean basins. Genome-enabled metabolic reconstructions and gene expression patterns show that these marine archaea are motile heterotrophs with extensive mechanisms for scavenging organic matter. Our results shed light on the ecological and physiological properties of ubiquitous marine archaea and highlight their versatile metabolic strategies in deep oceans that might play a critical role in global carbon cycling. PMID:26573375

  6. Reconstruction of diverse verrucomicrobial genomes from metagenome datasets of freshwater reservoirs

    Czech Academy of Sciences Publication Activity Database

    Cabello-Yeves, P.J.; Ghai, Rohit; Mehrshad, Maliheh; Picazo, A.; Camacho, A.; Rodriguez-Valera, F.

    2017-01-01

    Roč. 8, Nov (2017), č. článku 2131. ISSN 1664-302X R&D Projects: GA ČR GA17-04828S Grant - others:AV ČR(CZ) L200961651 Institutional support: RVO:60077344 Keywords : freshwater Verrucomicrobia * metagenomics * rhodopsin * nitrogen fixation * genome streamlining Subject RIV: EE - Microbiology, Virology Impact factor: 4.076, year: 2016

  7. Chitinase family GH18: evolutionary insights from the genomic history of a diverse protein family

    Directory of Open Access Journals (Sweden)

    Aronson Nathan N

    2007-06-01

    Full Text Available Abstract Background Chitinases (EC.3.2.1.14 hydrolyze the β-1,4-linkages in chitin, an abundant N-acetyl-β-D-glucosamine polysaccharide that is a structural component of protective biological matrices such as insect exoskeletons and fungal cell walls. The glycoside hydrolase 18 (GH18 family of chitinases is an ancient gene family widely expressed in archea, prokaryotes and eukaryotes. Mammals are not known to synthesize chitin or metabolize it as a nutrient, yet the human genome encodes eight GH18 family members. Some GH18 proteins lack an essential catalytic glutamic acid and are likely to act as lectins rather than as enzymes. This study used comparative genomic analysis to address the evolutionary history of the GH18 multiprotein family, from early eukaryotes to mammals, in an effort to understand the forces that shaped the human genome content of chitinase related proteins. Results Gene duplication and loss according to a birth-and-death model of evolution is a feature of the evolutionary history of the GH18 family. The current human family likely originated from ancient genes present at the time of the bilaterian expansion (approx. 550 mya. The family expanded in the chitinous protostomes C. elegans and D. melanogaster, declined in early deuterostomes as chitin synthesis disappeared, and expanded again in late deuterostomes with a significant increase in gene number after the avian/mammalian split. Conclusion This comprehensive genomic study of animal GH18 proteins reveals three major phylogenetic groups in the family: chitobiases, chitinases/chitolectins, and stabilin-1 interacting chitolectins. Only the chitinase/chitolectin group is associated with expansion in late deuterostomes. Finding that the human GH18 gene family is closely linked to the human major histocompatibility complex paralogon on chromosome 1, together with the recent association of GH18 chitinase activity with Th2 cell inflammation, suggests that its late expansion

  8. Distinct patterns of mitochondrial genome diversity in bonobos (Pan paniscus and humans

    Directory of Open Access Journals (Sweden)

    Zsurka Gábor

    2010-09-01

    Full Text Available Abstract Background We have analyzed the complete mitochondrial genomes of 22 Pan paniscus (bonobo, pygmy chimpanzee individuals to assess the detailed mitochondrial DNA (mtDNA phylogeny of this close relative of Homo sapiens. Results We identified three major clades among bonobos that separated approximately 540,000 years ago, as suggested by Bayesian analysis. Incidentally, we discovered that the current reference sequence for bonobo likely is a hybrid of the mitochondrial genomes of two distant individuals. When comparing spectra of polymorphic mtDNA sites in bonobos and humans, we observed two major differences: (i Of all 31 bonobo mtDNA homoplasies, i.e. nucleotide changes that occurred independently on separate branches of the phylogenetic tree, 13 were not homoplasic in humans. This indicates that at least a part of the unstable sites of the mitochondrial genome is species-specific and difficult to be explained on the basis of a mutational hotspot concept. (ii A comparison of the ratios of non-synonymous to synonymous changes (dN/dS among polymorphic positions in bonobos and in 4902 Homo sapiens mitochondrial genomes revealed a remarkable difference in the strength of purifying selection in the mitochondrial genes of the F0F1-ATPase complex. While in bonobos this complex showed a similar low value as complexes I and IV, human haplogroups displayed 2.2 to 7.6 times increased dN/dS ratios when compared to bonobos. Conclusions Some variants of mitochondrially encoded subunits of the ATPase complex in humans very likely decrease the efficiency of energy conversion leading to production of extra heat. Thus, we hypothesize that the species-specific release of evolutionary constraints for the mitochondrial genes of the proton-translocating ATPase is a consequence of altered heat homeostasis in modern humans.

  9. Distinct patterns of mitochondrial genome diversity in bonobos (Pan paniscus) and humans.

    Science.gov (United States)

    Zsurka, Gábor; Kudina, Tatiana; Peeva, Viktoriya; Hallmann, Kerstin; Elger, Christian E; Khrapko, Konstantin; Kunz, Wolfram S

    2010-09-02

    We have analyzed the complete mitochondrial genomes of 22 Pan paniscus (bonobo, pygmy chimpanzee) individuals to assess the detailed mitochondrial DNA (mtDNA) phylogeny of this close relative of Homo sapiens. We identified three major clades among bonobos that separated approximately 540,000 years ago, as suggested by Bayesian analysis. Incidentally, we discovered that the current reference sequence for bonobo likely is a hybrid of the mitochondrial genomes of two distant individuals. When comparing spectra of polymorphic mtDNA sites in bonobos and humans, we observed two major differences: (i) Of all 31 bonobo mtDNA homoplasies, i.e. nucleotide changes that occurred independently on separate branches of the phylogenetic tree, 13 were not homoplasic in humans. This indicates that at least a part of the unstable sites of the mitochondrial genome is species-specific and difficult to be explained on the basis of a mutational hotspot concept. (ii) A comparison of the ratios of non-synonymous to synonymous changes (dN/dS) among polymorphic positions in bonobos and in 4902 Homo sapiens mitochondrial genomes revealed a remarkable difference in the strength of purifying selection in the mitochondrial genes of the F0F1-ATPase complex. While in bonobos this complex showed a similar low value as complexes I and IV, human haplogroups displayed 2.2 to 7.6 times increased dN/dS ratios when compared to bonobos. Some variants of mitochondrially encoded subunits of the ATPase complex in humans very likely decrease the efficiency of energy conversion leading to production of extra heat. Thus, we hypothesize that the species-specific release of evolutionary constraints for the mitochondrial genes of the proton-translocating ATPase is a consequence of altered heat homeostasis in modern humans.

  10. Organised genome dynamics in the Escherichia coli species results in highly diverse adaptive paths.

    Directory of Open Access Journals (Sweden)

    Marie Touchon

    2009-01-01

    Full Text Available The Escherichia coli species represents one of the best-studied model organisms, but also encompasses a variety of commensal and pathogenic strains that diversify by high rates of genetic change. We uniformly (re- annotated the genomes of 20 commensal and pathogenic E. coli strains and one strain of E. fergusonii (the closest E. coli related species, including seven that we sequenced to completion. Within the approximately 18,000 families of orthologous genes, we found approximately 2,000 common to all strains. Although recombination rates are much higher than mutation rates, we show, both theoretically and using phylogenetic inference, that this does not obscure the phylogenetic signal, which places the B2 phylogenetic group and one group D strain at the basal position. Based on this phylogeny, we inferred past evolutionary events of gain and loss of genes, identifying functional classes under opposite selection pressures. We found an important adaptive role for metabolism diversification within group B2 and Shigella strains, but identified few or no extraintestinal virulence-specific genes, which could render difficult the development of a vaccine against extraintestinal infections. Genome flux in E. coli is confined to a small number of conserved positions in the chromosome, which most often are not associated with integrases or tRNA genes. Core genes flanking some of these regions show higher rates of recombination, suggesting that a gene, once acquired by a strain, spreads within the species by homologous recombination at the flanking genes. Finally, the genome's long-scale structure of recombination indicates lower recombination rates, but not higher mutation rates, at the terminus of replication. The ensuing effect of background selection and biased gene conversion may thus explain why this region is A+T-rich and shows high sequence divergence but low sequence polymorphism. Overall, despite a very high gene flow, genes co-exist in an

  11. Organised genome dynamics in the Escherichia coli species results in highly diverse adaptive paths.

    Science.gov (United States)

    Touchon, Marie; Hoede, Claire; Tenaillon, Olivier; Barbe, Valérie; Baeriswyl, Simon; Bidet, Philippe; Bingen, Edouard; Bonacorsi, Stéphane; Bouchier, Christiane; Bouvet, Odile; Calteau, Alexandra; Chiapello, Hélène; Clermont, Olivier; Cruveiller, Stéphane; Danchin, Antoine; Diard, Médéric; Dossat, Carole; Karoui, Meriem El; Frapy, Eric; Garry, Louis; Ghigo, Jean Marc; Gilles, Anne Marie; Johnson, James; Le Bouguénec, Chantal; Lescat, Mathilde; Mangenot, Sophie; Martinez-Jéhanne, Vanessa; Matic, Ivan; Nassif, Xavier; Oztas, Sophie; Petit, Marie Agnès; Pichon, Christophe; Rouy, Zoé; Ruf, Claude Saint; Schneider, Dominique; Tourret, Jérôme; Vacherie, Benoit; Vallenet, David; Médigue, Claudine; Rocha, Eduardo P C; Denamur, Erick

    2009-01-01

    The Escherichia coli species represents one of the best-studied model organisms, but also encompasses a variety of commensal and pathogenic strains that diversify by high rates of genetic change. We uniformly (re-) annotated the genomes of 20 commensal and pathogenic E. coli strains and one strain of E. fergusonii (the closest E. coli related species), including seven that we sequenced to completion. Within the approximately 18,000 families of orthologous genes, we found approximately 2,000 common to all strains. Although recombination rates are much higher than mutation rates, we show, both theoretically and using phylogenetic inference, that this does not obscure the phylogenetic signal, which places the B2 phylogenetic group and one group D strain at the basal position. Based on this phylogeny, we inferred past evolutionary events of gain and loss of genes, identifying functional classes under opposite selection pressures. We found an important adaptive role for metabolism diversification within group B2 and Shigella strains, but identified few or no extraintestinal virulence-specific genes, which could render difficult the development of a vaccine against extraintestinal infections. Genome flux in E. coli is confined to a small number of conserved positions in the chromosome, which most often are not associated with integrases or tRNA genes. Core genes flanking some of these regions show higher rates of recombination, suggesting that a gene, once acquired by a strain, spreads within the species by homologous recombination at the flanking genes. Finally, the genome's long-scale structure of recombination indicates lower recombination rates, but not higher mutation rates, at the terminus of replication. The ensuing effect of background selection and biased gene conversion may thus explain why this region is A+T-rich and shows high sequence divergence but low sequence polymorphism. Overall, despite a very high gene flow, genes co-exist in an organised genome.

  12. Genomic Diversity and the Microenvironment as Drivers of Progression in DCIS

    Science.gov (United States)

    2016-10-01

    analysis . Since we are working with small amounts of FFPE DNA, standard methodologies do not readily apply. We have settled on the Genome Center...used in the analysis . B) Violin plots showing the distribution of the percentage of the exome covered by two different depths (20 and 40 reads...samples used in Aim 1. We will employ automated image analysis to compute microenvironmental divergence to determine if specific components of the TME, or

  13. ‘Candidatus Competibacter'-lineage genomes retrieved from metagenomes reveal functional metabolic diversity

    Science.gov (United States)

    McIlroy, Simon J; Albertsen, Mads; Andresen, Eva K; Saunders, Aaron M; Kristiansen, Rikke; Stokholm-Bjerregaard, Mikkel; Nielsen, Kåre L; Nielsen, Per H

    2014-01-01

    The glycogen-accumulating organism (GAO) ‘Candidatus Competibacter' (Competibacter) uses aerobically stored glycogen to enable anaerobic carbon uptake, which is subsequently stored as polyhydroxyalkanoates (PHAs). This biphasic metabolism is key for the Competibacter to survive under the cyclic anaerobic-‘feast': aerobic-‘famine' regime of enhanced biological phosphorus removal (EBPR) wastewater treatment systems. As they do not contribute to phosphorus (P) removal, but compete for resources with the polyphosphate-accumulating organisms (PAO), thought responsible for P removal, their proliferation theoretically reduces the EBPR capacity. In this study, two complete genomes from Competibacter were obtained from laboratory-scale enrichment reactors through metagenomics. Phylogenetic analysis identified the two genomes, ‘Candidatus Competibacter denitrificans' and ‘Candidatus Contendobacter odensis', as being affiliated with Competibacter-lineage subgroups 1 and 5, respectively. Both have genes for glycogen and PHA cycling and for the metabolism of volatile fatty acids. Marked differences were found in their potential for the Embden–Meyerhof–Parnas and Entner–Doudoroff glycolytic pathways, as well as for denitrification, nitrogen fixation, fermentation, trehalose synthesis and utilisation of glucose and lactate. Genetic comparison of P metabolism pathways with sequenced PAOs revealed the absence of the Pit phosphate transporter in the Competibacter-lineage genomes—identifying a key metabolic difference with the PAO physiology. These genomes are the first from any GAO organism and provide new insights into the complex interaction and niche competition between PAOs and GAOs in EBPR systems. PMID:24173461

  14. Spatial and temporal diversity in genomic instability processes defines lung cancer evolution.

    Science.gov (United States)

    de Bruin, Elza C; McGranahan, Nicholas; Mitter, Richard; Salm, Max; Wedge, David C; Yates, Lucy; Jamal-Hanjani, Mariam; Shafi, Seema; Murugaesu, Nirupa; Rowan, Andrew J; Grönroos, Eva; Muhammad, Madiha A; Horswell, Stuart; Gerlinger, Marco; Varela, Ignacio; Jones, David; Marshall, John; Voet, Thierry; Van Loo, Peter; Rassl, Doris M; Rintoul, Robert C; Janes, Sam M; Lee, Siow-Ming; Forster, Martin; Ahmad, Tanya; Lawrence, David; Falzon, Mary; Capitanio, Arrigo; Harkins, Timothy T; Lee, Clarence C; Tom, Warren; Teefe, Enock; Chen, Shann-Ching; Begum, Sharmin; Rabinowitz, Adam; Phillimore, Benjamin; Spencer-Dene, Bradley; Stamp, Gordon; Szallasi, Zoltan; Matthews, Nik; Stewart, Aengus; Campbell, Peter; Swanton, Charles

    2014-10-10

    Spatial and temporal dissection of the genomic changes occurring during the evolution of human non-small cell lung cancer (NSCLC) may help elucidate the basis for its dismal prognosis. We sequenced 25 spatially distinct regions from seven operable NSCLCs and found evidence of branched evolution, with driver mutations arising before and after subclonal diversification. There was pronounced intratumor heterogeneity in copy number alterations, translocations, and mutations associated with APOBEC cytidine deaminase activity. Despite maintained carcinogen exposure, tumors from smokers showed a relative decrease in smoking-related mutations over time, accompanied by an increase in APOBEC-associated mutations. In tumors from former smokers, genome-doubling occurred within a smoking-signature context before subclonal diversification, which suggested that a long period of tumor latency had preceded clinical detection. The regionally separated driver mutations, coupled with the relentless and heterogeneous nature of the genome instability processes, are likely to confound treatment success in NSCLC. Copyright © 2014, American Association for the Advancement of Science.

  15. Penicillium arizonense, a new, genome sequenced fungal species, reveals a high chemical diversity in secreted metabolites

    DEFF Research Database (Denmark)

    Grijseels, Sietske; Nielsen, Jens Christian; Randelovic, Milica

    2016-01-01

    of biosynthetic gene clusters in P. arizonense responsible for the synthesis of all detected compounds except curvulinic acid. The capacity to produce biomass degrading enzymes and the identification of a high chemical diversity in secreted bioactive secondary metabolites, offers a broad range of potential...

  16. Diversity of genomic electropherotypes of naturally occurring equine herpesvirus 1 isolates in Argentina

    Directory of Open Access Journals (Sweden)

    C.M. Galosi

    1998-06-01

    Full Text Available The genomes of 10 equine herpesvirus 1 (EHV-1 strains isolated in Argentina from 1979 to 1991, and a Japanese HH1 reference strain were compared by restriction endonuclease analysis. Two restriction enzymes, BamHI and BglII, were used and analysis of the electropherotypes did not show significant differences among isolates obtained from horses with different clinical signs. This suggests that the EHV-1 isolates studied, which circulated in Argentina for more than 10 years, belong to a single genotype.

  17. Genomic analysis of globally diverse Mycobacterium tuberculosis strains provides insights into emergence and spread of multidrug resistance

    Science.gov (United States)

    Manson, Abigail L.; Cohen, Keira A.; Abeel, Thomas; Desjardins, Christopher A.; Armstrong, Derek T.; Barry, Clifton E.; Brand, Jeannette; Chapman, Sinéad B.; Cho, Sang-Nae; Gabrielian, Andrei; Gomez, James; Jodals, Andreea M.; Joloba, Moses; Jureen, Pontus; Lee, Jong Seok; Malinga, Lesibana; Maiga, Mamoudou; Nordenberg, Dale; Noroc, Ecaterina; Romancenco, Elena; Salazar, Alex; Ssengooba, Willy; Velayati, A. A.; Winglee, Kathryn; Zalutskaya, Aksana; Via, Laura E.; Cassell, Gail H.; Dorman, Susan E.; Ellner, Jerrold; Farnia, Parissa; Galagan, James E.; Rosenthal, Alex; Crudu, Valeriu; Homorodean, Daniela; Hsueh, Po-Ren; Narayanan, Sujatha; Pym, Alexander S.; Skrahina, Alena; Swaminathan, Soumya; Van der Walt, Martie; Alland, David; Bishai, William R.; Cohen, Ted; Hoffner, Sven; Birren, Bruce W.; Earl, Ashlee M.

    2017-01-01

    Multidrug-resistant tuberculosis (MDR-TB), caused by drug resistant strains of Mycobacterium tuberculosis, is an increasingly serious problem worldwide. In this study, we examined a dataset of 5,310 M. tuberculosis whole genome sequences from five continents. Despite great diversity with respect to geographic point of isolation, genetic background and drug resistance, patterns of drug resistance emergence were conserved globally. We have identified harbinger mutations that often precede MDR. In particular, the katG S315T mutation, conferring resistance to isoniazid, overwhelmingly arose before rifampicin resistance across all lineages, geographic regions, and time periods. Molecular diagnostics that include markers for rifampicin resistance alone will be insufficient to identify pre-MDR strains. Incorporating knowledge of pre-MDR polymorphisms, particularly katG S315, into molecular diagnostics will enable targeted treatment of patients with pre-MDR-TB to prevent further development of MDR-TB. PMID:28092681

  18. The draft genome of watermelon (Citrullus lanatus) and resequencing of 20 diverse accessions

    DEFF Research Database (Denmark)

    Guo, Shaogui; Zhang, Jianguo; Sun, Honghe

    2013-01-01

    an evolutionary scenario for the origin of the 11 watermelon chromosomes derived from a 7-chromosome paleohexaploid eudicot ancestor. Resequencing of 20 watermelon accessions representing three different C. lanatus subspecies produced numerous haplotypes and identified the extent of genetic diversity...... into aspects of phloem-based vascular signaling in common between watermelon and cucumber and identified genes crucial to valuable fruit-quality traits, including sugar accumulation and citrulline metabolism....

  19. Genomic diversity and immunomodulatory activity of Lactobacillus plantarum isolated from dairy products.

    Science.gov (United States)

    Zago, M; Scaltriti, E; Bonvini, B; Fornasari, M E; Penna, G; Massimiliano, L; Carminati, D; Rescigno, M; Giraffa, G

    2017-08-24

    In this study, we aimed to investigate some functional characteristics and the immunomodulatory properties of three strains of Lactobacillus plantarum of dairy origin which, in a previous screening, showed to be candidate probiotics. Genome sequencing and comparative genomics, which confirmed the presence of genes involved in folate and riboflavin production and in the immune response of dendritic cells (DCs), prompted us to investigate the ability of the three strains to accumulate the two vitamins and their immunomodulation properties. The ability of the three strains to release antioxidant components in milk was also investigated. Small amounts of folate and riboflavin were produced by the three strains, while they showed a good antioxidant capacity in milk with FRAP method. The immune response experiments well correlated with the presence of candidate genes influencing in DCs cytokine response to L. plantarum. Specifically, the amounts of secreted cytokins by DCs after stimulation with cells of Lp790, Lp813 and Lp998 resulted pro-inflammatory whereas stimulation with culture supernatants (postbiotics) inhibited the release of interleukin (IL)-12p70 and increased the release of the anti-inflammatory IL-10 cytokine. This study adds further evidence on the importance of L. plantarum in human health. Understanding how probiotics (or postbiotics) work in preclinical models can allow a rational choice of the different strains for clinical and/or commercial use.

  20. Diverse Genome-wide Association Studies Associate the IL12/IL23 Pathway with Crohn Disease

    Science.gov (United States)

    Wang, Kai; Zhang, Haitao; Kugathasan, Subra; Annese, Vito; Bradfield, Jonathan P.; Russell, Richard K.; Sleiman, Patrick M.A.; Imielinski, Marcin; Glessner, Joseph; Hou, Cuiping; Wilson, David C.; Walters, Thomas; Kim, Cecilia; Frackelton, Edward C.; Lionetti, Paolo; Barabino, Arrigo; Van Limbergen, Johan; Guthery, Stephen; Denson, Lee; Piccoli, David; Li, Mingyao; Dubinsky, Marla; Silverberg, Mark; Griffiths, Anne; Grant, Struan F.A.; Satsangi, Jack; Baldassano, Robert; Hakonarson, Hakon

    2009-01-01

    Previous genome-wide association (GWA) studies typically focus on single-locus analysis, which may not have the power to detect the majority of genuinely associated loci. Here, we applied pathway analysis using Affymetrix SNP genotype data from the Wellcome Trust Case Control Consortium (WTCCC) and uncovered significant association between Crohn Disease (CD) and the IL12/IL23 pathway, harboring 20 genes (p = 8 × 10−5). Interestingly, the pathway contains multiple genes (IL12B and JAK2) or homologs of genes (STAT3 and CCR6) that were recently identified as genuine susceptibility genes only through meta-analysis of several GWA studies. In addition, the pathway contains other susceptibility genes for CD, including IL18R1, JUN, IL12RB1, and TYK2, which do not reach genome-wide significance by single-marker association tests. The observed pathway-specific association signal was subsequently replicated in three additional GWA studies of European and African American ancestry generated on the Illumina HumanHap550 platform. Our study suggests that examination beyond individual SNP hits, by focusing on genetic networks and pathways, is important to unleashing the true power of GWA studies. PMID:19249008

  1. Whole-Genome Analysis of Diversity and SNP-Major Gene Association in Peach Germplasm.

    Directory of Open Access Journals (Sweden)

    Diego Micheletti

    Full Text Available Peach was domesticated in China more than four millennia ago and from there it spread world-wide. Since the middle of the last century, peach breeding programs have been very dynamic generating hundreds of new commercial varieties, however, in most cases such varieties derive from a limited collection of parental lines (founders. This is one reason for the observed low levels of variability of the commercial gene pool, implying that knowledge of the extent and distribution of genetic variability in peach is critical to allow the choice of adequate parents to confer enhanced productivity, adaptation and quality to improved varieties. With this aim we genotyped 1,580 peach accessions (including a few closely related Prunus species maintained and phenotyped in five germplasm collections (four European and one Chinese with the International Peach SNP Consortium 9K SNP peach array. The study of population structure revealed the subdivision of the panel in three main populations, one mainly made up of Occidental varieties from breeding programs (POP1OCB, one of Occidental landraces (POP2OCT and the third of Oriental accessions (POP3OR. Analysis of linkage disequilibrium (LD identified differential patterns of genome-wide LD blocks in each of the populations. Phenotypic data for seven monogenic traits were integrated in a genome-wide association study (GWAS. The significantly associated SNPs were always in the regions predicted by linkage analysis, forming haplotypes of markers. These diagnostic haplotypes could be used for marker-assisted selection (MAS in modern breeding programs.

  2. Consanguinity around the world: what do the genomic data of the HGDP-CEPH diversity panel tell us?

    Science.gov (United States)

    Leutenegger, Anne-Louise; Sahbatou, Mourad; Gazal, Steven; Cann, Howard; Génin, Emmanuelle

    2011-05-01

    Inbreeding coefficients and consanguineous mating types are usually inferred from population surveys or pedigree studies. Here, we present a method to estimate them from dense genome-wide single-nucleotide polymorphism genotypes and apply it to 940 unrelated individuals from the Human Genome Diversity Panel (HGDP-CEPH). Inbreeding is observed in almost all populations of the panel, and the highest inbreeding levels and frequencies of inbred individuals are found in populations of the Middle East, Central South Asia and the Americas. In these regions, first cousin (1C) marriages are the most frequent, but we also observed marriages between double first cousins (2 × 1C) and between avuncular (AV) pairs. Interestingly, if 2 × 1C marriages are preferred to AV marriages in Central South Asia and the Middle East, the contrary is found in the Americas. There are thus some regional trends but there are also some important differences between populations within a region. Individual results can be found on the CEPH website at ftp://ftp.cephb.fr/hgdp_hbd/.

  3. Genomic Organization, Transcriptomic Analysis, and Functional Characterization of Avian α- and β-Keratins in Diverse Feather Forms

    Science.gov (United States)

    Fan, Wen-Lang; Yan, Jie; Chen, Chih-Kuan; Lai, Yu-Ting; Wu, Siao-Man; Mao, Chi-Tang; Chen, Jun-Jie; Lu, Mei-Yeh Jade; Ho, Meng-Ru; Widelitz, Randall B.; Chen, Chih-Feng; Chuong, Cheng-Ming; Li, Wen-Hsiung

    2014-01-01

    Feathers are hallmark avian integument appendages, although they were also present on theropods. They are composed of flexible corneous materials made of α- and β-keratins, but their genomic organization and their functional roles in feathers have not been well studied. First, we made an exhaustive search of α- and β-keratin genes in the new chicken genome assembly (Galgal4). Then, using transcriptomic analysis, we studied α- and β-keratin gene expression patterns in five types of feather epidermis. The expression patterns of β-keratin genes were different in different feather types, whereas those of α-keratin genes were less variable. In addition, we obtained extensive α- and β-keratin mRNA in situ hybridization data, showing that α-keratins and β-keratins are preferentially expressed in different parts of the feather components. Together, our data suggest that feather morphological and structural diversity can largely be attributed to differential combinations of α- and β-keratin genes in different intrafeather regions and/or feather types from different body parts. The expression profiles provide new insights into the evolutionary origin and diversification of feathers. Finally, functional analysis using mutant chicken keratin forms based on those found in the human α-keratin mutation database led to abnormal phenotypes. This demonstrates that the chicken can be a convenient model for studying the molecular biology of human keratin-based diseases. PMID:25152353

  4. Diversity and distribution of alpha satellite DNA in the genome of an Old World monkey: Cercopithecus solatus.

    Science.gov (United States)

    Cacheux, Lauriane; Ponger, Loïc; Gerbault-Seureau, Michèle; Richard, Florence Anne; Escudé, Christophe

    2016-11-14

    Alpha satellite is the major repeated DNA element of primate centromeres. Evolution of these tandemly repeated sequences has led to the existence of numerous families of monomers exhibiting specific organizational patterns. The limited amount of information available in non-human primates is a restriction to the understanding of the evolutionary dynamics of alpha satellite DNA. We carried out the targeted high-throughput sequencing of alpha satellite monomers and dimers from the Cercopithecus solatus genome, an Old World monkey from the Cercopithecini tribe. Computational approaches were used to infer the existence of sequence families and to study how these families are organized with respect to each other. While previous studies had suggested that alpha satellites in Old World monkeys were poorly diversified, our analysis provides evidence for the existence of at least four distinct families of sequences within the studied species and of higher order organizational patterns. Fluorescence in situ hybridization using oligonucleotide probes that are able to target each family in a specific way showed that the different families had distinct distributions on chromosomes and were not homogeneously distributed between chromosomes. Our new approach provides an unprecedented and comprehensive view of the diversity and organization of alpha satellites in a species outside the hominoid group. We consider these data with respect to previously known alpha satellite families and to potential mechanisms for satellite DNA evolution. Applying this approach to other species will open new perspectives regarding the integration of satellite DNA into comparative genomic and cytogenetic studies.

  5. Whole-genome resequencing and transcriptomic analysis to identify genes involved in leaf-color diversity in ornamental rice plants.

    Directory of Open Access Journals (Sweden)

    Chang-Kug Kim

    Full Text Available Rice field art is a large-scale art form in which people design rice fields using various kinds of ornamental rice plants with different leaf colors. Leaf color-related genes play an important role in the study of chlorophyll biosynthesis, chloroplast structure and function, and anthocyanin biosynthesis. Despite the role of different metabolites in the traditional relationship between leaf and color, comprehensive color-specific metabolite studies of ornamental rice have been limited. We performed whole-genome resequencing and transcriptomic analysis of regulatory patterns and genetic diversity among different rice cultivars to discover new genetic mechanisms that promote enhanced levels of various leaf colors. We resequenced the genomes of 10 rice leaf-color accessions to an average of 40× reads depth and >95% coverage and performed 30 RNA-seq experiments using the 10 rice accessions sampled at three developmental stages. The sequencing results yielded a total of 1,814 × 106 reads and identified an average of 713,114 SNPs per rice accession. Based on our analysis of the DNA variation and gene expression, we selected 47 candidate genes. We used an integrated analysis of the whole-genome resequencing data and the RNA-seq data to divide the candidate genes into two groups: genes related to macronutrient (i.e., magnesium and sulfur transport and genes related to flavonoid pathways, including anthocyanidin biosynthesis. We verified the candidate genes with quantitative real-time PCR using transgenic T-DNA insertion mutants. Our study demonstrates the potential of integrated screening methods combined with genetic-variation and transcriptomic data to isolate genes involved in complex biosynthetic networks and pathways.

  6. Genomic diversity and differentiation of a managed island wild boar population

    DEFF Research Database (Denmark)

    Iacolina, Laura; Scandura, Massimo; J. Goedbloed, Daniel

    2016-01-01

    .169). Such evidences were mostly unaffected by an uneven sample size, although clustering results in reference populations changed when the number of individuals was standardized. Runs of homozygosity (ROHs) pattern and distribution in Sardinian WB are consistent with a past expansion following a bottleneck (small...... ROHs) and recent population substructuring (highly homozygous individuals). The observed effect of a non-random selection of Sardinian individuals on diversity, FST and ROH estimates, stressed the importance of sampling design in the study of structured or introgressed populations. Our results support...

  7. Genetic diversity and genomic resources available for the small millet crops to accelerate a New Green Revolution.

    Science.gov (United States)

    Goron, Travis L; Raizada, Manish N

    2015-01-01

    Small millets are nutrient-rich food sources traditionally grown and consumed by subsistence farmers in Asia and Africa. They include finger millet (Eleusine coracana), foxtail millet (Setaria italica), kodo millet (Paspalum scrobiculatum), proso millet (Panicum miliaceum), barnyard millet (Echinochloa spp.), and little millet (Panicum sumatrense). Local farmers value the small millets for their nutritional and health benefits, tolerance to extreme stress including drought, and ability to grow under low nutrient input conditions, ideal in an era of climate change and steadily depleting natural resources. Little scientific attention has been paid to these crops, hence they have been termed "orphan cereals." Despite this challenge, an advantageous quality of the small millets is that they continue to be grown in remote regions of the world which has preserved their biodiversity, providing breeders with unique alleles for crop improvement. The purpose of this review, first, is to highlight the diverse traits of each small millet species that are valued by farmers and consumers which hold potential for selection, improvement or mechanistic study. For each species, the germplasm, genetic and genomic resources available will then be described as potential tools to exploit this biodiversity. The review will conclude with noting current trends and gaps in the literature and make recommendations on how to better preserve and utilize diversity within these species to accelerate a New Green Revolution for subsistence farmers in Asia and Africa.

  8. Brucella spp. of amphibians comprise genomically diverse motile strains competent for replication in macrophages and survival in mammalian hosts

    Science.gov (United States)

    Al Dahouk, Sascha; Köhler, Stephan; Occhialini, Alessandra; Jiménez de Bagüés, María Pilar; Hammerl, Jens Andre; Eisenberg, Tobias; Vergnaud, Gilles; Cloeckaert, Axel; Zygmunt, Michel S.; Whatmore, Adrian M.; Melzer, Falk; Drees, Kevin P.; Foster, Jeffrey T.; Wattam, Alice R.; Scholz, Holger C.

    2017-01-01

    Twenty-one small Gram-negative motile coccobacilli were isolated from 15 systemically diseased African bullfrogs (Pyxicephalus edulis), and were initially identified as Ochrobactrum anthropi by standard microbiological identification systems. Phylogenetic reconstructions using combined molecular analyses and comparative whole genome analysis of the most diverse of the bullfrog strains verified affiliation with the genus Brucella and placed the isolates in a cluster containing B. inopinata and the other non-classical Brucella species but also revealed significant genetic differences within the group. Four representative but molecularly and phenotypically diverse strains were used for in vitro and in vivo infection experiments. All readily multiplied in macrophage-like murine J774-cells, and their overall intramacrophagic growth rate was comparable to that of B. inopinata BO1 and slightly higher than that of B. microti CCM 4915. In the BALB/c murine model of infection these strains replicated in both spleen and liver, but were less efficient than B. suis 1330. Some strains survived in the mammalian host for up to 12 weeks. The heterogeneity of these novel strains hampers a single species description but their phenotypic and genetic features suggest that they represent an evolutionary link between a soil-associated ancestor and the mammalian host-adapted pathogenic Brucella species. PMID:28300153

  9. An ancestry informative marker set for determining continental origin: validation and extension using human genome diversity panels

    Directory of Open Access Journals (Sweden)

    Gregersen Peter K

    2009-07-01

    Full Text Available Abstract Background Case-control genetic studies of complex human diseases can be confounded by population stratification. This issue can be addressed using panels of ancestry informative markers (AIMs that can provide substantial population substructure information. Previously, we described a panel of 128 SNP AIMs that were designed as a tool for ascertaining the origins of subjects from Europe, Sub-Saharan Africa, Americas, and East Asia. Results In this study, genotypes from Human Genome Diversity Panel populations were used to further evaluate a 93 SNP AIM panel, a subset of the 128 AIMS set, for distinguishing continental origins. Using both model-based and relatively model-independent methods, we here confirm the ability of this AIM set to distinguish diverse population groups that were not previously evaluated. This study included multiple population groups from Oceana, South Asia, East Asia, Sub-Saharan Africa, North and South America, and Europe. In addition, the 93 AIM set provides population substructure information that can, for example, distinguish Arab and Ashkenazi from Northern European population groups and Pygmy from other Sub-Saharan African population groups. Conclusion These data provide additional support for using the 93 AIM set to efficiently identify continental subject groups for genetic studies, to identify study population outliers, and to control for admixture in association studies.

  10. Elaeis oleifera Genomic-SSR Markers: Exploitation in Oil Palm Germplasm Diversity and Cross-Amplification in Arecaceae

    Science.gov (United States)

    Zaki, Noorhariza Mohd; Singh, Rajinder; Rosli, Rozana; Ismail, Ismanizan

    2012-01-01

    Species-specific simple sequence repeat (SSR) markers are favored for genetic studies and marker-assisted selection (MAS) breeding for oil palm genetic improvement. This report characterizes 20 SSR markers from an Elaeis oleifera genomic library (gSSR). Characterization of the repeat type in 2000 sequences revealed a high percentage of di-nucleotides (63.6%), followed by tri-nucleotides (24.2%). Primer pairs were successfully designed for 394 of the E. oleifera gSSRs. Subsequent analysis showed the ability of the 20 selected E. oleifera gSSR markers to reveal genetic diversity in the genus Elaeis. The average Polymorphism Information Content (PIC) value for the SSRs was 0.402, with the tri-repeats showing the highest average PIC (0.626). Low values of observed heterozygosity (Ho) (0.164) and highly positive fixation indices (Fis) in the E. oleifera germplasm collection, compared to the E. guineensis, indicated an excess of homozygosity in E. oleifera. The transferability of the markers to closely related palms, Elaeis guineensis, Cocos nucifera and ornamental palms is also reported. Sequencing the amplicons of three selected E. oleifera gSSRs across both species and palm taxa revealed variations in the repeat-units. The study showed the potential of E. oleifera gSSR markers to reveal genetic diversity in the genus Elaeis. The markers are also a valuable genetic resource for studying E. oleifera and other genus in the Arecaceae family. PMID:22605966

  11. Elaeis oleifera Genomic-SSR Markers: Exploitation in Oil Palm Germplasm Diversity and Cross-Amplification in Arecaceae

    Directory of Open Access Journals (Sweden)

    Ismanizan Ismail

    2012-03-01

    Full Text Available Species-specific simple sequence repeat (SSR markers are favored for genetic studies and marker-assisted selection (MAS breeding for oil palm genetic improvement. This report characterizes 20 SSR markers from an Elaeis oleifera genomic library (gSSR. Characterization of the repeat type in 2000 sequences revealed a high percentage of di-nucleotides (63.6%, followed by tri-nucleotides (24.2%. Primer pairs were successfully designed for 394 of the E. oleifera gSSRs. Subsequent analysis showed the ability of the 20 selected E. oleifera gSSR markers to reveal genetic diversity in the genus Elaeis. The average Polymorphism Information Content (PIC value for the SSRs was 0.402, with the tri-repeats showing the highest average PIC (0.626. Low values of observed heterozygosity (Ho (0.164 and highly positive fixation indices (Fis in the E. oleifera germplasm collection, compared to the E. guineensis, indicated an excess of homozygosity in E. oleifera. The transferability of the markers to closely related palms, Elaeis guineensis, Cocos nucifera and ornamental palms is also reported. Sequencing the amplicons of three selected E. oleifera gSSRs across both species and palm taxa revealed variations in the repeat-units. The study showed the potential of E. oleifera gSSR markers to reveal genetic diversity in the genus Elaeis. The markers are also a valuable genetic resource for studying E. oleifera and other genus in the Arecaceae family.

  12. Genomic diversity of Saccharomyces cerevisiae yeasts associated with alcoholic fermentation of bacanora produced by artisanal methods.

    Science.gov (United States)

    Álvarez-Ainza, M L; Zamora-Quiñonez, K A; Moreno-Ibarra, G M; Acedo-Félix, E

    2015-03-01

    Bacanora is a spirituous beverage elaborated with Agave angustifolia Haw in an artisanal process. Natural fermentation is mostly performed with native yeasts and bacteria. In this study, 228 strains of yeast like Saccharomyces were isolated from the natural alcoholic fermentation on the production of bacanora. Restriction analysis of the amplified region ITS1-5.8S-ITS2 of the ribosomal DNA genes (RFLPr) were used to confirm the genus, and 182 strains were identified as Saccharomyces cerevisiae. These strains displayed high genomic variability in their chromosomes profiles by karyotyping. Electrophoretic profiles of the strains evaluated showed a large number of chromosomes the size of which ranged between 225 and 2200 kpb approximately.

  13. Complete genome sequences of two distinct and diverse Citrus tristeza virus isolates from New Zealand.

    Science.gov (United States)

    Harper, S J; Dawson, T E; Pearson, M N

    2009-01-01

    Two Citrus tristeza virus (CTV) isolates from New Zealand that display distinct phenotypes were isolated, examined and sequenced in full. The first isolate, NZ-M16, is largely asymptomatic and non-transmissible by the aphid vector Toxoptera citricida, while the second, NZ-B18, is highly transmissible and induces very severe symptoms on C. sinensis and C. aurantii. Phylogenetic analysis of the genome sequences showed that both isolates were approximately 90-93% similar to the VT and T318 isolates but possessed only 89% identity to one another. Based on sequence identity, both isolates are VT subtypes, with NZ-M16 being T3-like, while NZ-B18 is a member of a novel subtype with B165 from India.

  14. Genetic diversity of Streptococcus suis isolates as determined by comparative genome hybridization

    Directory of Open Access Journals (Sweden)

    Thi Hoa

    2011-07-01

    Full Text Available Abstract Background Streptococcus suis is a zoonotic pathogen that causes infections in young piglets. S. suis is a heterogeneous species. Thirty-three different capsular serotypes have been described, that differ in virulence between as well as within serotypes. Results In this study, the correlation between gene content, serotype, phenotype and virulence among 55 S. suis strains was studied using Comparative Genome Hybridization (CGH. Clustering of CGH data divided S. suis isolates into two clusters, A and B. Cluster A isolates could be discriminated from cluster B isolates based on the protein expression of extracellular factor (EF. Cluster A contained serotype 1 and 2 isolates that were correlated with virulence. Cluster B mainly contained serotype 7 and 9 isolates. Genetic similarity was observed between serotype 7 and serotype 2 isolates that do not express muramidase released protein (MRP and EF (MRP-EF-, suggesting these isolates originated from a common founder. Profiles of 25 putative virulence-associated genes of S. suis were determined among the 55 isolates. Presence of all 25 genes was shown for cluster A isolates, whereas cluster B isolates lacked one or more putative virulence genes. Divergence of S. suis isolates was further studied based on the presence of 39 regions of difference. Conservation of genes was evaluated by the definition of a core genome that contained 78% of all ORFs in P1/7. Conclusions In conclusion, we show that CGH is a valuable method to study distribution of genes or gene clusters among isolates in detail, yielding information on genetic similarity, and virulence traits of S. suis isolates.

  15. Molecular diversity and phylogeny of Triticum-Aegilops species possessing D genome revealed by SSR and ISSR markers

    Directory of Open Access Journals (Sweden)

    Moradkhani Hoda

    2015-12-01

    Full Text Available The aim of this study is investigation the applicability of SSR and ISSR markers in evaluating the genetic relationships in twenty accessions of Aegilops and Triticum species with D genome in different ploidy levels. Totally, 119 bands and 46 alleles were detected using ten primers for ISSR and SSR markers, respectively. Polymorphism Information Content values for all primers ranged from 0.345 to 0.375 with an average of 0.367 for SSR, and varied from 0.29 to 0.44 with the average 0.37 for ISSR marker. Analysis of molecular variance (AMOVA revealed that 81% (ISSR and 84% (SSR of variability was partitioned among individuals within populations. Comparing the genetic diversity of Aegilops and Triticum accessions, based on genetic parameters, shows that genetic variation of Ae. crassa and Ae. tauschii species are higher than other species, especially in terms of Nei’s gene diversity. Cluster analysis, based on both markers, separated total accessions in three groups. However, classification based on SSR marker data was not conformed to classification according to ISSR marker data. Principal co-ordinate analysis (PCoA for SSR and ISSR data showed that, the first two components clarified 53.48% and 49.91% of the total variation, respectively. This analysis (PCoA, also, indicated consistent patterns of genetic relationships for ISSR data sets, however, the grouping of accessions was not completely accorded to their own geographical origins. Consequently, a high level of genetic diversity was revealed from the accessions sampled from different eco-geographical regions of Iran.

  16. Whole Genome Sequencing of Danish Staphylococcus argenteus Reveals a Genetically Diverse Collection with Clear Separation from Staphylococcus aureus

    OpenAIRE

    Hansen, Thomas A.; Bartels, Mette D.; Hogh, Silje V.; Dons, Lone E.; Pedersen, Michael; Jensen, Thoger G.; Kemp, Michael; Skov, Marianne N.; Gumpert, Heidi; Worning, Peder; Westh, Henrik

    2017-01-01

    Staphylococcus argenteus (S. argenteus) is a newly identified Staphylococcus species that has been misidentified as Staphylococcus aureus (S. aureus) and is clinically relevant. We identified 25 S. argenteus genomes in our collection of whole genome sequenced S. aureus. These genomes were compared to publicly available genomes and a phylogeny revealed seven clusters corresponding to seven clonal complexes. The genome of S. argenteus was found to be different from the genome of S. aureus and a...

  17. Patterns of Genome-Wide Nucleotide Diversity in the Gynodioecious Plant Thymus vulgaris Are Compatible with Recent Sweeps of Cytoplasmic Genes.

    Science.gov (United States)

    Mollion, Maeva; Ehlers, Bodil K; Figuet, Emeric; Santoni, Sylvain; Lenormand, Thomas; Maurice, Sandrine; Galtier, Nicolas; Bataillon, Thomas

    2018-01-01

    Gynodioecy is a sexual dimorphism where females coexist with hermaphrodite individuals. In most cases, this dimorphism involves the interaction of cytoplasmic male sterility (CMS) genes and nuclear restorer genes. Two scenarios can account for how these interactions maintain gynodioecy. Either CMS genes recurrently enter populations at low frequency via mutation or migration and go to fixation unimpeded (successive sweeps), or CMS genes maintain polymorphism over evolutionary time through interactions with a nuclear restorer allele (balanced polymorphism). To distinguish between these scenarios, we used transcriptome sequencing in gynodioecious Thymus vulgaris and surveyed genome-wide diversity in 18 naturally occurring individuals sampled from populations at a local geographic scale. We contrast the amount and patterns of nucleotide diversity in the nuclear and cytoplasmic genome, and find ample diversity at the nuclear level (π = 0.019 at synonymous sites) but reduced genetic diversity and an excess of rare polymorphisms in the cytoplasmic genome relative to the nuclear genome. Our finding is incompatible with the maintenance of gynodioecy via scenarios invoking long-term balancing selection, and instead suggests the recent fixation of CMS lineages in the populations studied. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  18. Standardized subsets of the HGDP-CEPH Human Genome Diversity Cell Line Panel, accounting for atypical and duplicated samples and pairs of close relatives.

    Science.gov (United States)

    Rosenberg, Noah A

    2006-11-01

    The HGDP-CEPH Human Genome Diversity Cell Line Panel is a widely-used resource for studies of human genetic variation. Here, pairs of close relatives that have been included in the panel are identified. Together with information on atypical and duplicated samples, the inferred relative pairs suggest standardized subsets of the panel for use in future population-genetic studies.

  19. Genetic Diversity Analysis with 454 Pyrosequencing and Genomic Reduction Confirmed the Eastern and Western Division in the Cultivated Barley Gene Pool

    Directory of Open Access Journals (Sweden)

    Yong-Bi Fu

    2011-11-01

    Full Text Available Next-generation DNA sequencing (NGS technologies can survey sequence variation on a genome-wide scale, but their utility for crop genetic diversity analysis is poorly known. Many challenges remain in their applications, including sampling complex genomes, identifying single nucleotide polymorphisms (SNPs, and analyzing missing data. This study presented a practical application of the Roche 454 GS FLX Titanium technology in combination with genomic reduction and an advanced bioinformatics tool to analyze the genetic relationships of 16 diverse barley ( L. landraces. A full 454 run generated roughly 1.7 million sequence reads with a total length of 612 Mbp. Application of the computational pipeline called DIAL (de novo identification of alleles identified 2578 contigs and 3980 SNPs. Sanger sequencing of four barley samples confirmed 85 of the 100 selected contigs and 288 of the 620 putative SNPs and identified 735 new SNPs and 39 new indels. Several diversity analyses revealed the eastern and western division in the barley samples. The division is compatible with those inferred with 156 microsatellite alleles of the same 16 samples and consistent with our current knowledge about cultivated barley. These results help to illustrate the utility of NGS technologies for crop diversity studies. The NGS application also provides a new informative set of genomic resources for barley research.

  20. Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with high-altitude pulmonary hypertension

    Science.gov (United States)

    The availability of whole genome sequence (WGS) data has made it possible to discover protein variants in silico. However, bovine WGS databases comprised of related influential sires from relatively few breeds tend to under represent the breadth of genetic diversity in U.S. beef cattle. Thus, our ...

  1. Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with pulmonary hypertension

    Science.gov (United States)

    The availability of whole genome sequence (WGS) data has made it possible to discover protein variants in silico. However, existing bovine WGS databases do not show data in a form conducive to protein variant analysis, and tend to under represent the breadth of genetic diversity in U.S. beef cattle...

  2. Genome-wide association analysis of seedling traits in diverse Sorghum germplasm under thermal stress.

    Science.gov (United States)

    Chopra, Ratan; Burow, Gloria; Burke, John J; Gladman, Nicholas; Xin, Zhanguo

    2017-01-13

    Climate variability due to fluctuation in temperature is a worldwide concern that imperils crop production. The need to understand how the germplasm variation in major crops can be utilized to aid in discovering and developing breeding lines that can withstand and adapt to temperature fluctuations is more necessary than ever. Here, we analyzed the genetic variation associated with responses to thermal stresses in a sorghum association panel (SAP) representing major races and working groups to identify single nucleotide polymorphisms (SNPs) that are associated with resilience to temperature stress in a major cereal crop. The SAP exhibited extensive variation for seedling traits under cold and heat stress. Genome-wide analyses identified 30 SNPs that were strongly associated with traits measured at seedling stage under cold stress and tagged genes that act as regulators of anthocyanin expression and soluble carbohydrate metabolism. Meanwhile, 12 SNPs were significantly associated with seedling traits under heat stress and these SNPs tagged genes that function in sugar metabolism, and ion transport pathways. Evaluation of co-expression networks for genes near the significantly associated SNPs indicated complex gene interactions for cold and heat stresses in sorghum. We focused and validated the expression of four genes in the network of Sb06g025040, a basic-helix-loop-helix (bHLH) transcription factor that was proposed to be involved in purple color pigmentation of leaf, and observed that genes in this network were upregulated during cold stress in a moderately tolerant line as compared to the more sensitive line. This study facilitated the tagging of genome regions associated with variation in seedling traits of sorghum under cold and heat stress. These findings show the potential of genotype information for development of temperature resilient sorghum cultivars and further characterization of genes and their networks responsible for adaptation to thermal stresses

  3. Primer in Genetics and Genomics, Article 3-Explaining Human Diversity: The Role of DNA.

    Science.gov (United States)

    Read, Catherine Y

    2017-05-01

    Genetic variation lays the foundation for diversity and enables humans to adapt to changing environments. The order of the nucleotides adenine, guanine, cytosine, and thymine on the deoxyribonucleic acid (DNA) molecules of the nuclear chromosomes and mitochondrial DNA (mtDNA) plays an important role in normal cell division, tissue development, and reproduction but is susceptible to alteration from a large number of random, inherited, or environmental events. Variations can range from a change in a single nucleotide to duplication of entire chromosomes. Single nucleotide polymorphisms are the major source of human heterogeneity. Other variations that can alter phenotypes and adversely impact growth, development, and health include copy number variations, aneuploidies, and structural alterations such as deletions, translocations, inversions, duplications, insertions, or mutations in mtDNA. In addition, DNA rearrangements in somatic cells underlie the uncontrolled cell growth found in cancer. This article explores the mechanisms by which variations in DNA arise and the impact those changes can have on human health.

  4. High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells.

    Science.gov (United States)

    Boyle, Alan P; Song, Lingyun; Lee, Bum-Kyu; London, Darin; Keefe, Damian; Birney, Ewan; Iyer, Vishwanath R; Crawford, Gregory E; Furey, Terrence S

    2011-03-01

    Regulation of gene transcription in diverse cell types is determined largely by varied sets of cis-elements where transcription factors bind. Here we demonstrate that data from a single high-throughput DNase I hypersensitivity assay can delineate hundreds of thousands of base-pair resolution in vivo footprints in human cells that precisely mark individual transcription factor-DNA interactions. These annotations provide a unique resource for the investigation of cis-regulatory elements. We find that footprints for specific transcription factors correlate with ChIP-seq enrichment and can accurately identify functional versus nonfunctional transcription factor motifs. We also find that footprints reveal a unique evolutionary conservation pattern that differentiates functional footprinted bases from surrounding DNA. Finally, detailed analysis of CTCF footprints suggests multiple modes of binding and a novel DNA binding motif upstream of the primary binding site.

  5. Integrated Genomic Analysis of Diverse Induced Pluripotent Stem Cells from the Progenitor Cell Biology Consortium

    Directory of Open Access Journals (Sweden)

    Nathan Salomonis

    2016-07-01

    Full Text Available The rigorous characterization of distinct induced pluripotent stem cells (iPSC derived from multiple reprogramming technologies, somatic sources, and donors is required to understand potential sources of variability and downstream potential. To achieve this goal, the Progenitor Cell Biology Consortium performed comprehensive experimental and genomic analyses of 58 iPSC from ten laboratories generated using a variety of reprogramming genes, vectors, and cells. Associated global molecular characterization studies identified functionally informative correlations in gene expression, DNA methylation, and/or copy-number variation among key developmental and oncogenic regulators as a result of donor, sex, line stability, reprogramming technology, and cell of origin. Furthermore, X-chromosome inactivation in PSC produced highly correlated differences in teratoma-lineage staining and regulator expression upon differentiation. All experimental results, and raw, processed, and metadata from these analyses, including powerful tools, are interactively accessible from a new online portal at https://www.synapse.org to serve as a reusable resource for the stem cell community.

  6. Polysaccharide utilization locus and CAZYme genome repertoires reveal diverse ecological adaptation of Prevotella species.

    Science.gov (United States)

    Accetto, Tomaž; Avguštin, Gorazd

    2015-10-01

    The results of metagenomic studies have clearly established that bacteria of the genus Prevotella represent one of the important groups found in the oral cavity and large intestine of man, and they also dominate the rumen. They belong to the Bacteroidetes, a phylum well-known for its polysaccharide degrading potential that stems from the outer membrane-localized enzyme/binding protein complexes encoded in polysaccharide utilization loci (PULs). Dozens of Prevotella species have been described, primarily from the oral cavity, and many of them occur simultaneously at the same sites, but research on their ecological adaptation has been neglected. Therefore, in this study, the repertoires of PULs and carbohydrate acting enzymes (CAZYmes) found in Prevotella genomes were analyzed and it was concluded that the Prevotella species were widely heterogeneous in this respect and displayed several distinct adaptations with regard to the number, source and nature of the substrates apparently preferred for growth. Copyright © 2015 Elsevier GmbH. All rights reserved.

  7. A geographically-diverse collection of 418 human gut microbiome pathway genome databases

    KAUST Repository

    Hahn, Aria S.

    2017-04-11

    Advances in high-throughput sequencing are reshaping how we perceive microbial communities inhabiting the human body, with implications for therapeutic interventions. Several large-scale datasets derived from hundreds of human microbiome samples sourced from multiple studies are now publicly available. However, idiosyncratic data processing methods between studies introduce systematic differences that confound comparative analyses. To overcome these challenges, we developed GutCyc, a compendium of environmental pathway genome databases (ePGDBs) constructed from 418 assembled human microbiome datasets using MetaPathways, enabling reproducible functional metagenomic annotation. We also generated metabolic network reconstructions for each metagenome using the Pathway Tools software, empowering researchers and clinicians interested in visualizing and interpreting metabolic pathways encoded by the human gut microbiome. For the first time, GutCyc provides consistent annotations and metabolic pathway predictions, making possible comparative community analyses between health and disease states in inflammatory bowel disease, Crohn’s disease, and type 2 diabetes. GutCyc data products are searchable online, or may be downloaded and explored locally using MetaPathways and Pathway Tools.

  8. Aboriginal Australian mitochondrial genome variation - an increased understanding of population antiquity and diversity

    Science.gov (United States)

    Nagle, Nano; van Oven, Mannis; Wilcox, Stephen; van Holst Pellekaan, Sheila; Tyler-Smith, Chris; Xue, Yali; Ballantyne, Kaye N.; Wilcox, Leah; Papac, Luka; Cooke, Karen; van Oorschot, Roland A. H.; McAllister, Peter; Williams, Lesley; Kayser, Manfred; Mitchell, R. John; Adhikarla, Syama; Adler, Christina J.; Balanovska, Elena; Balanovsky, Oleg; Bertranpetit, Jaume; Clarke, Andrew C.; Comas, David; Cooper, Alan; der Sarkissian, Clio S. I.; Dulik, Matthew C.; Gaieski, Jill B.; Ganeshprasad, Arunkumar; Haak, Wolfgang; Haber, Marc; Hobbs, Angela; Javed, Asif; Jin, Li; Kaplan, Matthew E.; Li, Shilin; Martínez-Cruz, Begoña; Matisoo-Smith, Elizabeth A.; Melé, Marta; Merchant, Nirav C.; Owings, Amanda C.; Parida, Laxmi; Pitchappan, Ramasamy; Platt, Daniel E.; Quintana-Murci, Lluis; Renfrew, Colin; Royyuru, Ajay K.; Santhakumari, Arun Varatharajan; Santos, Fabrício R.; Schurr, Theodore G.; Soodyall, Himla; Soria Hernanz, David F.; Swamikrishnan, Pandikumar; Vilar, Miguel G.; Wells, R. Spencer; Zalloua, Pierre A.; Ziegle, Janet S.

    2017-03-01

    Aboriginal Australians represent one of the oldest continuous cultures outside Africa, with evidence indicating that their ancestors arrived in the ancient landmass of Sahul (present-day New Guinea and Australia) ~55 thousand years ago. Genetic studies, though limited, have demonstrated both the uniqueness and antiquity of Aboriginal Australian genomes. We have further resolved known Aboriginal Australian mitochondrial haplogroups and discovered novel indigenous lineages by sequencing the mitogenomes of 127 contemporary Aboriginal Australians. In particular, the more common haplogroups observed in our dataset included M42a, M42c, S, P5 and P12, followed by rarer haplogroups M15, M16, N13, O, P3, P6 and P8. We propose some major phylogenetic rearrangements, such as in haplogroup P where we delinked P4a and P4b and redefined them as P4 (New Guinean) and P11 (Australian), respectively. Haplogroup P2b was identified as a novel clade potentially restricted to Torres Strait Islanders. Nearly all Aboriginal Australian mitochondrial haplogroups detected appear to be ancient, with no evidence of later introgression during the Holocene. Our findings greatly increase knowledge about the geographic distribution and phylogenetic structure of mitochondrial lineages that have survived in contemporary descendants of Australia’s first settlers.

  9. Aboriginal Australian mitochondrial genome variation – an increased understanding of population antiquity and diversity

    Science.gov (United States)

    Nagle, Nano; van Oven, Mannis; Wilcox, Stephen; van Holst Pellekaan, Sheila; Tyler-Smith, Chris; Xue, Yali; Ballantyne, Kaye N.; Wilcox, Leah; Papac, Luka; Cooke, Karen; van Oorschot, Roland A. H.; McAllister, Peter; Williams, Lesley; Kayser, Manfred; Mitchell, R. John; Adhikarla, Syama; Adler, Christina J.; Balanovska, Elena; Balanovsky, Oleg; Bertranpetit, Jaume; Clarke, Andrew C.; Comas, David; Cooper, Alan; Der Sarkissian, Clio S. I.; Dulik, Matthew C.; Gaieski, Jill B.; GaneshPrasad, ArunKumar; Haak, Wolfgang; Haber, Marc; Hobbs, Angela; Javed, Asif; Jin, Li; Kaplan, Matthew E.; Li, Shilin; Martínez-Cruz, Begoña; Matisoo-Smith, Elizabeth A.; Melé, Marta; Merchant, Nirav C.; Owings, Amanda C.; Parida, Laxmi; Pitchappan, Ramasamy; Platt, Daniel E.; Quintana-Murci, Lluis; Renfrew, Colin; Royyuru, Ajay K.; Santhakumari, Arun Varatharajan; Santos, Fabrício R.; Schurr, Theodore G.; Soodyall, Himla; Soria Hernanz, David F.; Swamikrishnan, Pandikumar; Vilar, Miguel G.; Wells, R. Spencer; Zalloua, Pierre A.; Ziegle, Janet S.

    2017-01-01

    Aboriginal Australians represent one of the oldest continuous cultures outside Africa, with evidence indicating that their ancestors arrived in the ancient landmass of Sahul (present-day New Guinea and Australia) ~55 thousand years ago. Genetic studies, though limited, have demonstrated both the uniqueness and antiquity of Aboriginal Australian genomes. We have further resolved known Aboriginal Australian mitochondrial haplogroups and discovered novel indigenous lineages by sequencing the mitogenomes of 127 contemporary Aboriginal Australians. In particular, the more common haplogroups observed in our dataset included M42a, M42c, S, P5 and P12, followed by rarer haplogroups M15, M16, N13, O, P3, P6 and P8. We propose some major phylogenetic rearrangements, such as in haplogroup P where we delinked P4a and P4b and redefined them as P4 (New Guinean) and P11 (Australian), respectively. Haplogroup P2b was identified as a novel clade potentially restricted to Torres Strait Islanders. Nearly all Aboriginal Australian mitochondrial haplogroups detected appear to be ancient, with no evidence of later introgression during the Holocene. Our findings greatly increase knowledge about the geographic distribution and phylogenetic structure of mitochondrial lineages that have survived in contemporary descendants of Australia’s first settlers. PMID:28287095

  10. Comparative evolutionary genomics of Corynebacterium with special reference to codon and amino acid usage diversities.

    Science.gov (United States)

    Pal, Shilpee; Sarkar, Indrani; Roy, Ayan; Mohapatra, Pradeep K Das; Mondal, Keshab C; Sen, Arnab

    2018-02-01

    The present study has been aimed to the comparative analysis of high GC composition containing Corynebacterium genomes and their evolutionary study by exploring codon and amino acid usage patterns. Phylogenetic study by MLSA approach, indel analysis and BLAST matrix differentiated Corynebacterium species in pathogenic and non-pathogenic clusters. Correspondence analysis on synonymous codon usage reveals that, gene length, optimal codon frequencies and tRNA abundance affect the gene expression of Corynebacterium. Most of the optimal codons as well as translationally optimal codons are C ending i.e. RNY (R-purine, N-any nucleotide base, and Y-pyrimidine) and reveal translational selection pressure on codon bias of Corynebacterium. Amino acid usage is affected by hydrophobicity, aromaticity, protein energy cost, etc. Highly expressed genes followed the cost minimization hypothesis and are less diverged at their synonymous positions of codons. Functional analysis of core genes shows significant difference in pathogenic and non-pathogenic Corynebacterium. The study reveals close relationship between non-pathogenic and opportunistic pathogenic Corynebaterium as well as between molecular evolution and survival niches of the organism.

  11. Integrated Genomic Analysis of Diverse Induced Pluripotent Stem Cells from the Progenitor Cell Biology Consortium.

    Science.gov (United States)

    Salomonis, Nathan; Dexheimer, Phillip J; Omberg, Larsson; Schroll, Robin; Bush, Stacy; Huo, Jeffrey; Schriml, Lynn; Ho Sui, Shannan; Keddache, Mehdi; Mayhew, Christopher; Shanmukhappa, Shiva Kumar; Wells, James; Daily, Kenneth; Hubler, Shane; Wang, Yuliang; Zambidis, Elias; Margolin, Adam; Hide, Winston; Hatzopoulos, Antonis K; Malik, Punam; Cancelas, Jose A; Aronow, Bruce J; Lutzko, Carolyn

    2016-07-12

    The rigorous characterization of distinct induced pluripotent stem cells (iPSC) derived from multiple reprogramming technologies, somatic sources, and donors is required to understand potential sources of variability and downstream potential. To achieve this goal, the Progenitor Cell Biology Consortium performed comprehensive experimental and genomic analyses of 58 iPSC from ten laboratories generated using a variety of reprogramming genes, vectors, and cells. Associated global molecular characterization studies identified functionally informative correlations in gene expression, DNA methylation, and/or copy-number variation among key developmental and oncogenic regulators as a result of donor, sex, line stability, reprogramming technology, and cell of origin. Furthermore, X-chromosome inactivation in PSC produced highly correlated differences in teratoma-lineage staining and regulator expression upon differentiation. All experimental results, and raw, processed, and metadata from these analyses, including powerful tools, are interactively accessible from a new online portal at https://www.synapse.org to serve as a reusable resource for the stem cell community. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  12. High-throughput sequencing of Bacillus anthracis in France: investigating genome diversity and population structure using whole-genome SNP discovery.

    Science.gov (United States)

    Girault, Guillaume; Blouin, Yann; Vergnaud, Gilles; Derzelle, Sylviane

    2014-04-16

    Single nucleotide polymorphisms (SNPs) are ideal signatures for subtyping monomorphic pathogens such as Bacillus anthracis. Here we report the use of next-generation sequencing technology to investigate the historical, geographic and genetic diversity of Bacillus anthracis in France. 122 strains isolated over a 60-years period throughout the country were whole-genome sequenced and comparative analyses were carried out with a focus on SNPs discovery to discriminate regional sub-groups of strains. A total of 1581 chromosomal SNPs precisely establish the phylogenetic relationships existing between the French strains. Phylogeography patterns within the three canSNP sub-lineages present in France (i.e. B.Br.CNEVA, A.Br.011/009 and A.Br.001/002) were observed. One of the more remarkable findings was the identification of a variety of genotypes within the A.Br.011/009 sub-group that are persisting in the different regions of France. The 560 SNPs defining the A.Br.011/009- affiliated French strains split the Trans-Eurasian sub-group into six distinct branches without any intermediate nodes. Distinct sub-branches, with some geographic clustering, were resolved. The 345 SNPs defining the major B.Br CNEVA sub-lineage clustered three main phylogeographic clades, the Alps, the Pyrenees, and the Massif Central, with a small Saône-et-Loire sub-cluster nested within the latter group. The French strains affiliated to the minor A.Br.001/002 group were characterized by 226 SNPs. All recent isolates collected from the Doubs department were closely related. Identification of SNPs from whole-genome sequences facilitates high-resolution strain tracking and provides the level of discrimination required for outbreak investigations. Eight diagnostic SNPs, representative of the main French-specific phylogeographic clusters, were therefore selected and developed into high-resolution melting SNP discriminative assays. This work has established one of the most accurate phylogenetic

  13. Genetic Diversity and Genome Wide Association Study of β-Glucan Content in Tetraploid Wheat Grains.

    Directory of Open Access Journals (Sweden)

    Ilaria Marcotuli

    Full Text Available Non-starch polysaccharides (NSPs have many health benefits, including immunomodulatory activity, lowering serum cholesterol, a faecal bulking effect, enhanced absorption of certain minerals, prebiotic effects and the amelioration of type II diabetes. The principal components of the NSP in cereal grains are (1,3;1,4-β-glucans and arabinoxylans. Although (1,3;1,4-β-glucan (hereafter called β-glucan is not the most representative component of wheat cell walls, it is one of the most important types of soluble fibre in terms of its proven beneficial effects on human health. In the present work we explored the genetic variability of β-glucan content in grains from a tetraploid wheat collection that had been genotyped with a 90k-iSelect array, and combined this data to carry out an association analysis. The β-glucan content, expressed as a percentage w/w of grain dry weight, ranged from 0.18% to 0.89% across the collection. Our analysis identified seven genomic regions associated with β-glucan, located on chromosomes 1A, 2A (two, 2B, 5B and 7A (two, confirming the quantitative nature of this trait. Analysis of marker trait associations (MTAs in syntenic regions of several grass species revealed putative candidate genes that might influence β-glucan levels in the endosperm, possibly via their participation in carbon partitioning. These include the glycosyl hydrolases endo-β-(1,4-glucanase (cellulase, β-amylase, (1,4-β-xylan endohydrolase, xylanase inhibitor protein I, isoamylase and the glycosyl transferase starch synthase II.

  14. Basidiomycete DyPs: Genomic diversity, structural-functional aspects, reaction mechanism and environmental significance.

    Science.gov (United States)

    Linde, Dolores; Ruiz-Dueñas, Francisco J; Fernández-Fueyo, Elena; Guallar, Victor; Hammel, Kenneth E; Pogni, Rebecca; Martínez, Angel T

    2015-05-15

    The first enzyme with dye-decolorizing peroxidase (DyP) activity was described in 1999 from an arthroconidial culture of the fungus Bjerkandera adusta. However, the first DyP sequence had been deposited three years before, as a peroxidase gene from a culture of an unidentified fungus of the family Polyporaceae (probably Irpex lacteus). Since the first description, fewer than ten basidiomycete DyPs have been purified and characterized, but a large number of sequences are available from genomes. DyPs share a general fold and heme location with chlorite dismutases and other DyP-type related proteins (such as Escherichia coli EfeB), forming the CDE superfamily. Taking into account the lack of an evolutionary relationship with the catalase-peroxidase superfamily, the observed heme pocket similarities must be considered as a convergent type of evolution to provide similar reactivity to the enzyme cofactor. Studies on the Auricularia auricula-judae DyP showed that high-turnover oxidation of anthraquinone type and other DyP substrates occurs via long-range electron transfer from an exposed tryptophan (Trp377, conserved in most basidiomycete DyPs), whose catalytic radical was identified in the H2O2-activated enzyme. The existence of accessory oxidation sites in DyP is suggested by the residual activity observed after site-directed mutagenesis of the above tryptophan. DyP degradation of substituted anthraquinone dyes (such as Reactive Blue 5) most probably proceeds via typical one-electron peroxidase oxidations and product breakdown without a DyP-catalyzed hydrolase reaction. Although various DyPs are able to break down phenolic lignin model dimers, and basidiomycete DyPs also present marginal activity on nonphenolic dimers, a significant contribution to lignin degradation is unlikely because of the low activity on high redox-potential substrates. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  15. Whole-genome analyses of Enterococcus faecium isolates with diverse daptomycin MICs.

    Science.gov (United States)

    Diaz, Lorena; Tran, Truc T; Munita, Jose M; Miller, William R; Rincon, Sandra; Carvajal, Lina P; Wollam, Aye; Reyes, Jinnethe; Panesso, Diana; Rojas, Natalia L; Shamoo, Yousif; Murray, Barbara E; Weinstock, George M; Arias, Cesar A

    2014-08-01

    Daptomycin (DAP) is a lipopeptide antibiotic frequently used as a "last-resort" antibiotic against vancomycin-resistant Enterococcus faecium (VRE). However, an important limitation for DAP therapy against VRE is the emergence of resistance during therapy. Mutations in regulatory systems involved in cell envelope homeostasis are postulated to be important mediators of DAP resistance in E. faecium. Thus, in order to gain insights into the genetic bases of DAP resistance in E. faecium, we investigated the presence of changes in 43 predicted proteins previously associated with DAP resistance in enterococci and staphylococci using the genomes of 19 E. faecium with different DAP MICs (range, 3 to 48 μg/ml). Bodipy-DAP (BDP-DAP) binding to the cell membrane assays and time-kill curves (DAP alone and with ampicillin) were performed. Genetic changes involving two major pathways were identified: (i) LiaFSR, a regulatory system associated with the cell envelope stress response, and (ii) YycFGHIJ, a system involved in the regulation of cell wall homeostasis. Thr120 → Ala and Trp73 → Cys substitutions in LiaS and LiaR, respectively, were the most common changes identified. DAP bactericidal activity was abolished in the presence of liaFSR or yycFGHIJ mutations regardless of the DAP MIC and was restored in the presence of ampicillin, but only in representatives of the LiaFSR pathway. Reduced binding of BDP-DAP to the cell surface was the predominant finding correlating with resistance in isolates with DAP MICs above the susceptibility breakpoint. Our findings suggest that genotypic information may be crucial to predict response to DAP plus β-lactam combinations and continue to question the DAP breakpoint of 4 μg/ml. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  16. Genome-wide diversity in the levant reveals recent structuring by culture.

    Directory of Open Access Journals (Sweden)

    Marc Haber

    Full Text Available The Levant is a region in the Near East with an impressive record of continuous human existence and major cultural developments since the Paleolithic period. Genetic and archeological studies present solid evidence placing the Middle East and the Arabian Peninsula as the first stepping-stone outside Africa. There is, however, little understanding of demographic changes in the Middle East, particularly the Levant, after the first Out-of-Africa expansion and how the Levantine peoples relate genetically to each other and to their neighbors. In this study we analyze more than 500,000 genome-wide SNPs in 1,341 new samples from the Levant and compare them to samples from 48 populations worldwide. Our results show recent genetic stratifications in the Levant are driven by the religious affiliations of the populations within the region. Cultural changes within the last two millennia appear to have facilitated/maintained admixture between culturally similar populations from the Levant, Arabian Peninsula, and Africa. The same cultural changes seem to have resulted in genetic isolation of other groups by limiting admixture with culturally different neighboring populations. Consequently, Levant populations today fall into two main groups: one sharing more genetic characteristics with modern-day Europeans and Central Asians, and the other with closer genetic affinities to other Middle Easterners and Africans. Finally, we identify a putative Levantine ancestral component that diverged from other Middle Easterners ∼23,700-15,500 years ago during the last glacial period, and diverged from Europeans ∼15,900-9,100 years ago between the last glacial warming and the start of the Neolithic.

  17. Genetic determinants of lipid traits in diverse populations from the population architecture using genomics and epidemiology (PAGE study.

    Directory of Open Access Journals (Sweden)

    Logan Dumitrescu

    2011-06-01

    Full Text Available For the past five years, genome-wide association studies (GWAS have identified hundreds of common variants associated with human diseases and traits, including high-density lipoprotein cholesterol (HDL-C, low-density lipoprotein cholesterol (LDL-C, and triglyceride (TG levels. Approximately 95 loci associated with lipid levels have been identified primarily among populations of European ancestry. The Population Architecture using Genomics and Epidemiology (PAGE study was established in 2008 to characterize GWAS-identified variants in diverse population-based studies. We genotyped 49 GWAS-identified SNPs associated with one or more lipid traits in at least two PAGE studies and across six racial/ethnic groups. We performed a meta-analysis testing for SNP associations with fasting HDL-C, LDL-C, and ln(TG levels in self-identified European American (~20,000, African American (~9,000, American Indian (~6,000, Mexican American/Hispanic (~2,500, Japanese/East Asian (~690, and Pacific Islander/Native Hawaiian (~175 adults, regardless of lipid-lowering medication use. We replicated 55 of 60 (92% SNP associations tested in European Americans at p<0.05. Despite sufficient power, we were unable to replicate ABCA1 rs4149268 and rs1883025, CETP rs1864163, and TTC39B rs471364 previously associated with HDL-C and MAFB rs6102059 previously associated with LDL-C. Based on significance (p<0.05 and consistent direction of effect, a majority of replicated genotype-phentoype associations for HDL-C, LDL-C, and ln(TG in European Americans generalized to African Americans (48%, 61%, and 57%, American Indians (45%, 64%, and 77%, and Mexican Americans/Hispanics (57%, 56%, and 86%. Overall, 16 associations generalized across all three populations. For the associations that did not generalize, differences in effect sizes, allele frequencies, and linkage disequilibrium offer clues to the next generation of association studies for these traits.

  18. Mitochondrial genome diversity at the Bering Strait area highlights prehistoric human migrations from Siberia to northern North America.

    Science.gov (United States)

    Dryomov, Stanislav V; Nazhmidenova, Azhar M; Shalaurova, Sophia A; Morozov, Igor V; Tabarev, Andrei V; Starikovskaya, Elena B; Sukernik, Rem I

    2015-10-01

    The patterns of prehistoric migrations across the Bering Land Bridge are far from being completely understood: there still exists a significant gap in our knowledge of the population history of former Beringia. Here, through comprehensive survey of mitochondrial DNA genomes retained in 'relic' populations, the Maritime Chukchi, Siberian Eskimos, and Commander Aleuts, we explore genetic contribution of prehistoric Siberians/Asians to northwestern Native Americans. Overall, 201 complete mitochondrial sequences (52 new and 149 published) were selected in the reconstruction of trees encompassing mtDNA lineages that are restricted to Coastal Chukotka and Alaska, the Canadian Arctic, Greenland, and the Aleutian chain. Phylogeography of the resulting mtDNA genomes (mitogenomes) considerably extends the range and intrinsic diversity of haplogroups (eg, A2a, A2b, D2a, and D4b1a2a1) that emerged and diversified in postglacial central Beringia, defining independent origins of Neo-Eskimos versus Paleo-Eskimos, Aleuts, and Tlingit (Na-Dene). Specifically, Neo-Eskimos, ancestral to modern Inuit, not only appear to be of the High Arctic origin but also to harbor Altai/Sayan-related ancestry. The occurrence of the haplogroup D2a1b haplotypes in Chukotka (Sireniki) introduces the possibility that the traces of Paleo-Eskimos have not been fully erased by spread of the Neo-Eskimos or their descendants. Our findings are consistent with the recurrent gene flow model of multiple streams of expansions to northern North America from northeastern Eurasia in late Pleistocene-early Holocene.

  19. Mitochondrial Genome Sequence of the Scabies Mite Provides Insight into the Genetic Diversity of Individual Scabies Infections.

    Directory of Open Access Journals (Sweden)

    Ehtesham Mofiz

    2016-02-01

    Full Text Available The scabies mite, Sarcoptes scabiei, is an obligate parasite of the skin that infects humans and other animal species, causing scabies, a contagious disease characterized by extreme itching. Scabies infections are a major health problem, particularly in remote Indigenous communities in Australia, where co-infection of epidermal scabies lesions by Group A Streptococci or Staphylococcus aureus is thought to be responsible for the high rate of rheumatic heart disease and chronic kidney disease. We collected and separately sequenced mite DNA from several pools of thousands of whole mites from a porcine model of scabies (S. scabiei var. suis and two human patients (S. scabiei var. hominis living in different regions of northern Australia. Our sequencing samples the mite and its metagenome, including the mite gut flora and the wound micro-environment. Here, we describe the mitochondrial genome of the scabies mite. We developed a new de novo assembly pipeline based on a bait-and-reassemble strategy, which produced a 14 kilobase mitochondrial genome sequence assembly. We also annotated 35 genes and have compared these to other Acari mites. We identified single nucleotide polymorphisms (SNPs and used these to infer the presence of six haplogroups in our samples, Remarkably, these fall into two closely-related clades with one clade including both human and pig varieties. This supports earlier findings that only limited genetic differences may separate some human and animal varieties, and raises the possibility of cross-host infections. Finally, we used these mitochondrial haplotypes to show that the genetic diversity of individual infections is typically small with 1-3 distinct haplotypes per infestation.

  20. Genomic organization, transcriptomic analysis, and functional characterization of avian α- and β-keratins in diverse feather forms.

    Science.gov (United States)

    Ng, Chen Siang; Wu, Ping; Fan, Wen-Lang; Yan, Jie; Chen, Chih-Kuan; Lai, Yu-Ting; Wu, Siao-Man; Mao, Chi-Tang; Chen, Jun-Jie; Lu, Mei-Yeh Jade; Ho, Meng-Ru; Widelitz, Randall B; Chen, Chih-Feng; Chuong, Cheng-Ming; Li, Wen-Hsiung

    2014-08-24

    Feathers are hallmark avian integument appendages, although they were also present on theropods. They are composed of flexible corneous materials made of α- and β-keratins, but their genomic organization and their functional roles in feathers have not been well studied. First, we made an exhaustive search of α- and β-keratin genes in the new chicken genome assembly (Galgal4). Then, using transcriptomic analysis, we studied α- and β-keratin gene expression patterns in five types of feather epidermis. The expression patterns of β-keratin genes were different in different feather types, whereas those of α-keratin genes were less variable. In addition, we obtained extensive α- and β-keratin mRNA in situ hybridization data, showing that α-keratins and β-keratins are preferentially expressed in different parts of the feather components. Together, our data suggest that feather morphological and structural diversity can largely be attributed to differential combinations of α- and β-keratin genes in different intrafeather regions and/or feather types from different body parts. The expression profiles provide new insights into the evolutionary origin and diversification of feathers. Finally, functional analysis using mutant chicken keratin forms based on those found in the human α-keratin mutation database led to abnormal phenotypes. This demonstrates that the chicken can be a convenient model for studying the molecular biology of human keratin-based diseases. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  1. Targeted genomic enrichment and sequencing of CyHV-3 from carp tissues confirms low nucleotide diversity and mixed genotype infections

    Directory of Open Access Journals (Sweden)

    Saliha Hammoumi

    2016-09-01

    Full Text Available Koi herpesvirus disease (KHVD is an emerging disease that causes mass mortality in koi and common carp, Cyprinus carpio L. Its causative agent is Cyprinid herpesvirus 3 (CyHV-3, also known as koi herpesvirus (KHV. Although data on the pathogenesis of this deadly virus is relatively abundant in the literature, still little is known about its genomic diversity and about the molecular mechanisms that lead to such a high virulence. In this context, we developed a new strategy for sequencing full-length CyHV-3 genomes directly from infected fish tissues. Total genomic DNA extracted from carp gill tissue was specifically enriched with CyHV-3 sequences through hybridization to a set of nearly 2 million overlapping probes designed to cover the entire genome length, using KHV-J sequence (GenBank accession number AP008984 as reference. Applied to 7 CyHV-3 specimens from Poland and Indonesia, this targeted genomic enrichment enabled recovery of the full genomes with >99.9% reference coverage. The enrichment rate was directly correlated to the estimated number of viral copies contained in the DNA extracts used for library preparation, which varied between ∼5000 and ∼2×107. The average sequencing depth was >200 for all samples, thus allowing the search for variants with high confidence. Sequence analyses highlighted a significant proportion of intra-specimen sequence heterogeneity, suggesting the presence of mixed infections in all investigated fish. They also showed that inter-specimen genetic diversity at the genome scale was very low (>99.95% of sequence identity. By enabling full genome comparisons directly from infected fish tissues, this new method will be valuable to trace outbreaks rapidly and at a reasonable cost, and in turn to understand the transmission routes of CyHV-3.

  2. Diversity and distribution of nuclease bacteriocins in bacterial genomes revealed using Hidden Markov Models.

    Science.gov (United States)

    Sharp, Connor; Bray, James; Housden, Nicholas G; Maiden, Martin C J; Kleanthous, Colin

    2017-07-01

    Bacteria exploit an arsenal of antimicrobial peptides and proteins to compete with each other. Three main competition systems have been described: type six secretion systems (T6SS); contact dependent inhibition (CDI); and bacteriocins. Unlike T6SS and CDI systems, bacteriocins do not require contact between bacteria but are diffusible toxins released into the environment. Identified almost a century ago, our understanding of bacteriocin distribution and prevalence in bacterial populations remains poor. In the case of protein bacteriocins, this is because of high levels of sequence diversity and difficulties in distinguishing their killing domains from those of other competition systems. Here, we develop a robust bioinformatics pipeline exploiting Hidden Markov Models for the identification of nuclease bacteriocins (NBs) in bacteria of which, to-date, only a handful are known. NBs are large (>60 kDa) toxins that target nucleic acids (DNA, tRNA or rRNA) in the cytoplasm of susceptible bacteria, usually closely related to the producing organism. We identified >3000 NB genes located on plasmids or on the chromosome from 53 bacterial species distributed across different ecological niches, including human, animals, plants, and the environment. A newly identified NB predicted to be specific for Pseudomonas aeruginosa (pyocin Sn) was produced and shown to kill P. aeruginosa thereby validating our pipeline. Intriguingly, while the genes encoding the machinery needed for NB translocation across the cell envelope are widespread in Gram-negative bacteria, NBs are found exclusively in γ-proteobacteria. Similarity network analysis demonstrated that NBs fall into eight groups each with a distinct arrangement of protein domains involved in import. The only structural feature conserved across all groups was a sequence motif critical for cell-killing that is generally not found in bacteriocins targeting the periplasm, implying a specific role in translocating the nuclease to the

  3. Comparative genomic analysis of Tropheryma whipplei strains reveals that diversity among clinical isolates is mainly related to the WiSP proteins.

    Science.gov (United States)

    La, My-Van; Crapoulet, Nicolas; Barbry, Pascal; Raoult, Didier; Renesto, Patricia

    2007-10-02

    The aim of this study was to analyze the genomic diversity of several Tropheryma whipplei strains by microarray-based comparative genomic hybridization. Fifteen clinical isolates originating from biopsy samples recovered from different countries were compared with the T. whipplei Twist strain. For each isolate, the genes were defined as either present or absent/divergent using the GACK analysis software. Genomic changes were then further characterized by PCR and sequencing. The results revealed a limited genetic variation among the T. whipplei isolates, with at most 2.24% of the probes exhibiting differential hybridization against the Twist strain. The main variation was found in genes encoding the WiSP membrane protein family. This work also demonstrated a 19.2 kb-pair deletion within the T. whipplei DIG15 strain. This deletion occurs in the same region as the previously described large genomic rearrangement between Twist and TW08/27. Thus, this can be considered as a major hot-spot for intra-specific T. whipplei differentiation. Analysis of this deleted region confirmed the role of WND domains in generating T. whipplei diversity. This work provides the first comprehensive genomic comparison of several T. whipplei isolates. It reveals that clinical isolates originating from various geographic and biological sources exhibit a high conservation rate, indicating that T. whipplei rarely interacts with exogenous DNA. Remarkably, frequent inter-strain variations were dicovered that affected members of the WiSP family.

  4. Comparative genomic analysis of Tropheryma whipplei strains reveals that diversity among clinical isolates is mainly related to the WiSP proteins

    Directory of Open Access Journals (Sweden)

    Raoult Didier

    2007-10-01

    Full Text Available Abstract Background The aim of this study was to analyze the genomic diversity of several Tropheryma whipplei strains by microarray-based comparative genomic hybridization. Fifteen clinical isolates originating from biopsy samples recovered from different countries were compared with the T. whipplei Twist strain. For each isolate, the genes were defined as either present or absent/divergent using the GACK analysis software. Genomic changes were then further characterized by PCR and sequencing. Results The results revealed a limited genetic variation among the T. whipplei isolates, with at most 2.24% of the probes exhibiting differential hybridization against the Twist strain. The main variation was found in genes encoding the WiSP membrane protein family. This work also demonstrated a 19.2 kb-pair deletion within the T. whipplei DIG15 strain. This deletion occurs in the same region as the previously described large genomic rearrangement between Twist and TW08/27. Thus, this can be considered as a major hot-spot for intra-specific T. whipplei differentiation. Analysis of this deleted region confirmed the role of WND domains in generating T. whipplei diversity. Conclusion This work provides the first comprehensive genomic comparison of several T. whipplei isolates. It reveals that clinical isolates originating from various geographic and biological sources exhibit a high conservation rate, indicating that T. whipplei rarely interacts with exogenous DNA. Remarkably, frequent inter-strain variations were dicovered that affected members of the WiSP family.

  5. Comparative Genomics Revealed Genetic Diversity and Species/Strain-Level Differences in Carbohydrate Metabolism of Three Probiotic Bifidobacterial Species

    Science.gov (United States)

    Odamaki, Toshitaka; Horigome, Ayako; Sugahara, Hirosuke; Hashikura, Nanami; Minami, Junichi; Xiao, Jin-zhong; Abe, Fumiaki

    2015-01-01

    Strains of Bifidobacterium longum, Bifidobacterium breve, and Bifidobacterium animalis are widely used as probiotics in the food industry. Although numerous studies have revealed the properties and functionality of these strains, it is uncertain whether these characteristics are species common or strain specific. To address this issue, we performed a comparative genomic analysis of 49 strains belonging to these three bifidobacterial species to describe their genetic diversity and to evaluate species-level differences. There were 166 common clusters between strains of B. breve and B. longum, whereas there were nine common clusters between strains of B. animalis and B. longum and four common clusters between strains of B. animalis and B. breve. Further analysis focused on carbohydrate metabolism revealed the existence of certain strain-dependent genes, such as those encoding enzymes for host glycan utilisation or certain membrane transporters, and many genes commonly distributed at the species level, as was previously reported in studies with limited strains. As B. longum and B. breve are human-residential bifidobacteria (HRB), whereas B. animalis is a non-HRB species, several of the differences in these species' gene distributions might be the result of their adaptations to the nutrient environment. This information may aid both in selecting probiotic candidates and in understanding their potential function as probiotics. PMID:26236711

  6. Comparative Genomics Revealed Genetic Diversity and Species/Strain-Level Differences in Carbohydrate Metabolism of Three Probiotic Bifidobacterial Species

    Directory of Open Access Journals (Sweden)

    Toshitaka Odamaki

    2015-01-01

    Full Text Available Strains of Bifidobacterium longum, Bifidobacterium breve, and Bifidobacterium animalis are widely used as probiotics in the food industry. Although numerous studies have revealed the properties and functionality of these strains, it is uncertain whether these characteristics are species common or strain specific. To address this issue, we performed a comparative genomic analysis of 49 strains belonging to these three bifidobacterial species to describe their genetic diversity and to evaluate species-level differences. There were 166 common clusters between strains of B. breve and B. longum, whereas there were nine common clusters between strains of B. animalis and B. longum and four common clusters between strains of B. animalis and B. breve. Further analysis focused on carbohydrate metabolism revealed the existence of certain strain-dependent genes, such as those encoding enzymes for host glycan utilisation or certain membrane transporters, and many genes commonly distributed at the species level, as was previously reported in studies with limited strains. As B. longum and B. breve are human-residential bifidobacteria (HRB, whereas B. animalis is a non-HRB species, several of the differences in these species’ gene distributions might be the result of their adaptations to the nutrient environment. This information may aid both in selecting probiotic candidates and in understanding their potential function as probiotics.

  7. The Genomics Education Partnership: Successful Integration of Research into Laboratory Classes at a Diverse Group of Undergraduate Institutions

    Science.gov (United States)

    Shaffer, Christopher D.; Alvarez, Consuelo; Bailey, Cheryl; Barnard, Daron; Bhalla, Satish; Chandrasekaran, Chitra; Chandrasekaran, Vidya; Chung, Hui-Min; Dorer, Douglas R.; Du, Chunguang; Eckdahl, Todd T.; Poet, Jeff L.; Frohlich, Donald; Goodman, Anya L.; Gosser, Yuying; Hauser, Charles; Hoopes, Laura L. M.; Johnson, Diana; Jones, Christopher J.; Kaehler, Marian; Kokan, Nighat; Kopp, Olga R.; Kuleck, Gary A.; McNeil, Gerard; Moss, Robert; Myka, Jennifer L.; Nagengast, Alexis; Morris, Robert; Overvoorde, Paul J.; Shoop, Elizabeth; Parrish, Susan; Reed, Kelynne; Regisford, E. Gloria; Revie, Dennis; Rosenwald, Anne G.; Saville, Ken; Schroeder, Stephanie; Shaw, Mary; Skuse, Gary; Smith, Christopher; Smith, Mary; Spana, Eric P.; Spratt, Mary; Stamm, Joyce; Thompson, Jeff S.; Wawersik, Matthew; Wilson, Barbara A.; Youngblom, Jim; Leung, Wilson; Buhler, Jeremy; Mardis, Elaine R.; Lopatto, David; Elgin, Sarah C. R.

    2010-01-01

    Genomics is not only essential for students to understand biology but also provides unprecedented opportunities for undergraduate research. The goal of the Genomics Education Partnership (GEP), a collaboration between a growing number of colleges and universities around the country and the Department of Biology and Genome Center of Washington…

  8. The Diversity of Sequence and Chromosomal Distribution of New Transposable Element-Related Segments in the Rye Genome Revealed by FISH and Lineage Annotation

    Directory of Open Access Journals (Sweden)

    Yingxin Zhang

    2017-10-01

    Full Text Available Transposable elements (TEs in plant genomes exhibit a great variety of structure, sequence content and copy number, making them important drivers for species diversity and genome evolution. Even though a genome-wide statistic summary of TEs in rye has been obtained using high-throughput DNA sequencing technology, the accurate diversity of TEs in rye, as well as their chromosomal distribution and evolution, remains elusive due to the repetitive sequence assembling problems and the high dynamic and nested nature of TEs. In this study, using genomic plasmid library construction combined with dot-blot hybridization and fluorescence in situ hybridization (FISH analysis, we successfully isolated 70 unique FISH-positive TE-related sequences including 47 rye genome specific ones: 30 showed homology or partial homology with previously FISH characterized sequences and 40 have not been characterized. Among the 70 sequences, 48 sequences carried Ty3/gypsy-derived segments, 7 sequences carried Ty1/copia-derived segments and 15 sequences carried segments homologous with multiple TE families. 26 TE lineages were found in the 70 sequences, and among these lineages, Wilma was found in sequences dispersed in all chromosome regions except telomeric positions; Abiba was found in sequences predominantly located at pericentromeric and centromeric positions; Wis, Carmilla, and Inga were found in sequences displaying signals dispersed from distal regions toward pericentromeric positions; except DNA transposon lineages, all the other lineages were found in sequences displaying signals dispersed from proximal regions toward distal regions. A high percentage (21.4% of chimeric sequences were identified in this study and their high abundance in rye genome suggested that new TEs might form through recombination and nested transposition. Our results also gave proofs that diverse TE lineages were arranged at centromeric and pericentromeric positions in rye, and lineages like

  9. Proceedings of the SMBE Tri-National Young Investigators' Workshop 2005. Insight into the diversity and evolution of the cryptomonad nucleomorph genome.

    Science.gov (United States)

    Lane, Christopher E; Khan, Hameed; MacKinnon, Melissa; Fong, Anna; Theophilou, Stan; Archibald, John M

    2006-05-01

    loci. These results provide a glimpse into the genetic diversity of nucleomorph genomes in cryptomonads and set the stage for more comprehensive sequence-based studies in closely and distantly related taxa.

  10. Development, characterization and use of genomic SSR markers for assessment of genetic diversity in some Saudi date palm (Phoenix dactylifera L. cultivars

    Directory of Open Access Journals (Sweden)

    Sulieman A. Al-Faifi

    2016-05-01

    Conclusions: The developed microsatellite markers are additional values to date palm characterization tools that can be used by researchers in population genetics, cultivar identification as well as genetic resource exploration and management. The tested cultivars exhibited a significant amount of genetic diversity and could be suitable for successful breeding program. Genomic sequences generated from this study are available at the National Center for Biotechnology Information (NCBI, Sequence Read Archive (Accession numbers. LIBGSS_039019.

  11. Whole Genome Sequencing of Danish Staphylococcus argenteus Reveals a Genetically Diverse Collection with Clear Separation from Staphylococcus aureus.

    Science.gov (United States)

    Hansen, Thomas A; Bartels, Mette D; Høgh, Silje V; Dons, Lone E; Pedersen, Michael; Jensen, Thøger G; Kemp, Michael; Skov, Marianne N; Gumpert, Heidi; Worning, Peder; Westh, Henrik

    2017-01-01

    Staphylococcus argenteus (S. argenteus) is a newly identified Staphylococcus species that has been misidentified as Staphylococcus aureus (S. aureus) and is clinically relevant. We identified 25 S. argenteus genomes in our collection of whole genome sequenced S. aureus. These genomes were compared to publicly available genomes and a phylogeny revealed seven clusters corresponding to seven clonal complexes. The genome of S. argenteus was found to be different from the genome of S. aureus and a core genome analysis showed that ~33% of the total gene pool was shared between the two species, at 90% homology level. An assessment of mobile elements shows flow of SCCmec cassettes, plasmids, phages, and pathogenicity islands, between S. argenteus and S. aureus. This dataset emphasizes that S. argenteus and S. aureus are two separate species that share genetic material.

  12. Whole Genome Sequencing of Danish Staphylococcus argenteus Reveals a Genetically Diverse Collection with Clear Separation from Staphylococcus aureus

    DEFF Research Database (Denmark)

    Hansen, Thomas A; Bartels, Mette D; Høgh, Silje V

    2017-01-01

    Staphylococcus argenteus (S. argenteus) is a newly identified Staphylococcus species that has been misidentified as Staphylococcus aureus (S. aureus) and is clinically relevant. We identified 25 S. argenteus genomes in our collection of whole genome sequenced S. aureus. These genomes were compared...... to publicly available genomes and a phylogeny revealed seven clusters corresponding to seven clonal complexes. The genome of S. argenteus was found to be different from the genome of S. aureus and a core genome analysis showed that ~33% of the total gene pool was shared between the two species, at 90......% homology level. An assessment of mobile elements shows flow of SCCmec cassettes, plasmids, phages, and pathogenicity islands, between S. argenteus and S. aureus. This dataset emphasizes that S. argenteus and S. aureus are two separate species that share genetic material....

  13. Whole Genome Sequencing of Danish Staphylococcus argenteus Reveals a Genetically Diverse Collection with Clear Separation from Staphylococcus aureus

    Directory of Open Access Journals (Sweden)

    Thomas A. Hansen

    2017-08-01

    Full Text Available Staphylococcus argenteus (S. argenteus is a newly identified Staphylococcus species that has been misidentified as Staphylococcus aureus (S. aureus and is clinically relevant. We identified 25 S. argenteus genomes in our collection of whole genome sequenced S. aureus. These genomes were compared to publicly available genomes and a phylogeny revealed seven clusters corresponding to seven clonal complexes. The genome of S. argenteus was found to be different from the genome of S. aureus and a core genome analysis showed that ~33% of the total gene pool was shared between the two species, at 90% homology level. An assessment of mobile elements shows flow of SCCmec cassettes, plasmids, phages, and pathogenicity islands, between S. argenteus and S. aureus. This dataset emphasizes that S. argenteus and S. aureus are two separate species that share genetic material.

  14. The genome of the endophytic bacterium H. frisingense GSF30T identifies diverse strategies in the Herbaspirillum genus to interact with plants

    Directory of Open Access Journals (Sweden)

    Daniel eStraub

    2013-06-01

    Full Text Available The diazotrophic, bacterial endophyte Herbaspirillum frisingense GSF30T has been identified in biomass grasses grown in temperate climate, including the highly nitrogen-efficient grass Miscanthus. Its genome was annotated and compared with related Herbaspirillum species from diverse habitats, including H. seropedicae, and further well-characterized endophytes. The analysis revealed that Herbaspirillum frisingense lacks a type III secretion system that is present in some related Herbaspirillum grass endophytes. Together with the lack of components of the type II secretion system, the genomic inventory indicates distinct interaction scenarios of endophytic Herbaspirillum strains with plants. Differences in respiration, carbon, nitrogen and cell wall metabolism among Herbaspirillum isolates partially correlate with their different habitats. Herbaspirillum frisingense is closely related to strains isolated from the rhizosphere of phragmites and from well water, but these lack nitrogen fixation and metabolism genes. Within grass endophytes, the high diversity in their genomic inventory suggests that even individual plant species provide distinct, highly diverse metabolic niches for successful endophyte-plant associations.

  15. Diversity arrays technology (DArT for pan-genomic evolutionary studies of non-model organisms.

    Directory of Open Access Journals (Sweden)

    Karen E James

    Full Text Available BACKGROUND: High-throughput tools for pan-genomic study, especially the DNA microarray platform, have sparked a remarkable increase in data production and enabled a shift in the scale at which biological investigation is possible. The use of microarrays to examine evolutionary relationships and processes, however, is predominantly restricted to model or near-model organisms. METHODOLOGY/PRINCIPAL FINDINGS: This study explores the utility of Diversity Arrays Technology (DArT in evolutionary studies of non-model organisms. DArT is a hybridization-based genotyping method that uses microarray technology to identify and type DNA polymorphism. Theoretically applicable to any organism (even one for which no prior genetic data are available, DArT has not yet been explored in exclusively wild sample sets, nor extensively examined in a phylogenetic framework. DArT recovered 1349 markers of largely low copy-number loci in two lineages of seed-free land plants: the diploid fern Asplenium viride and the haploid moss Garovaglia elegans. Direct sequencing of 148 of these DArT markers identified 30 putative loci including four routinely sequenced for evolutionary studies in plants. Phylogenetic analyses of DArT genotypes reveal phylogeographic and substrate specificity patterns in A. viride, a lack of phylogeographic pattern in Australian G. elegans, and additive variation in hybrid or mixed samples. CONCLUSIONS/SIGNIFICANCE: These results enable methodological recommendations including procedures for detecting and analysing DArT markers tailored specifically to evolutionary investigations and practical factors informing the decision to use DArT, and raise evolutionary hypotheses concerning substrate specificity and biogeographic patterns. Thus DArT is a demonstrably valuable addition to the set of existing molecular approaches used to infer biological phenomena such as adaptive radiations, population dynamics, hybridization, introgression, ecological

  16. Heterozygous genome assembly via binary classification of homologous sequence.

    Science.gov (United States)

    Bodily, Paul M; Fujimoto, M; Ortega, Cameron; Okuda, Nozomu; Price, Jared C; Clement, Mark J; Snell, Quinn

    2015-01-01

    Genome assemblers to date have predominantly targeted haploid reference reconstruction from homozygous data. When applied to diploid genome assembly, these assemblers perform poorly, owing to the violation of assumptions during both the contigging and scaffolding phases. Effective tools to overcome these problems are in growing demand. Increasing parameter stringency during contigging is an effective solution to obtaining haplotype-specific contigs; however, effective algorithms for scaffolding such contigs are lacking. We present a stand-alone scaffolding algorithm, ScaffoldScaffolder, designed specifically for scaffolding diploid genomes. The algorithm identifies homologous sequences as found in "bubble" structures in scaffold graphs. Machine learning classification is used to then classify sequences in partial bubbles as homologous or non-homologous sequences prior to reconstructing haplotype-specific scaffolds. We define four new metrics for assessing diploid scaffolding accuracy: contig sequencing depth, contig homogeneity, phase group homogeneity, and heterogeneity between phase groups. We demonstrate the viability of using bubbles to identify heterozygous homologous contigs, which we term homolotigs. We show that machine learning classification trained on these homolotig pairs can be used effectively for identifying homologous sequences elsewhere in the data with high precision (assuming error-free reads). More work is required to comparatively analyze this approach on real data with various parameters and classifiers against other diploid genome assembly methods. However, the initial results of ScaffoldScaffolder supply validity to the idea of employing machine learning in the difficult task of diploid genome assembly. Software is available at http://bioresearch.byu.edu/scaffoldscaffolder.

  17. Genome survey of pistachio (Pistacia vera L.) by next generation sequencing: Development of novel SSR markers and genetic diversity in Pistacia species.

    Science.gov (United States)

    Ziya Motalebipour, Elmira; Kafkas, Salih; Khodaeiaminjan, Mortaza; Çoban, Nergiz; Gözel, Hatice

    2016-12-07

    Pistachio (Pistacia vera L.) is one of the most important nut crops in the world. There are about 11 wild species in the genus Pistacia, and they have importance as rootstock seed sources for cultivated P. vera and forest trees. Published information on the pistachio genome is limited. Therefore, a genome survey is necessary to obtain knowledge on the genome structure of pistachio by next generation sequencing. Simple sequence repeat (SSR) markers are useful tools for germplasm characterization, genetic diversity analysis, and genetic linkage mapping, and may help to elucidate genetic relationships among pistachio cultivars and species. To explore the genome structure of pistachio, a genome survey was performed using the Illumina platform at approximately 40× coverage depth in the P. vera cv. Siirt. The K-mer analysis indicated that pistachio has a genome that is about 600 Mb in size and is highly heterozygous. The assembly of 26.77 Gb Illumina data produced 27,069 scaffolds at N50 = 3.4 kb with a total of 513.5 Mb. A total of 59,280 SSR motifs were detected with a frequency of 8.67 kb. A total of 206 SSRs were used to characterize 24 P. vera cultivars and 20 wild Pistacia genotypes (four genotypes from each five wild Pistacia species) belonging to P. atlantica, P. integerrima, P. chinenesis, P. terebinthus, and P. lentiscus genotypes. Overall 135 SSR loci amplified in all 44 cultivars and genotypes, 41 were polymorphic in six Pistacia species. The novel SSR loci developed from cultivated pistachio were highly transferable to wild Pistacia species. The results from a genome survey of pistachio suggest that the genome size of pistachio is about 600 Mb with a high heterozygosity rate. This information will help to design whole genome sequencing strategies for pistachio. The newly developed novel polymorphic SSRs in this study may help germplasm characterization, genetic diversity, and genetic linkage mapping studies in the genus Pistacia.

  18. Diversity and Composition of Sulfate-Reducing Microbial Communities Based on Genomic DNA and RNA Transcription in Production Water of High Temperature and Corrosive Oil Reservoir

    Science.gov (United States)

    Li, Xiao-Xiao; Liu, Jin-Feng; Zhou, Lei; Mbadinga, Serge M.; Yang, Shi-Zhong; Gu, Ji-Dong; Mu, Bo-Zhong

    2017-01-01

    Deep subsurface petroleum reservoir ecosystems harbor a high diversity of microorganisms, and microbial influenced corrosion is a major problem for the petroleum industry. Here, we used high-throughput sequencing to explore the microbial communities based on genomic 16S rDNA and metabolically active 16S rRNA analyses of production water samples with different extents of corrosion from a high-temperature oil reservoir. Results showed that Desulfotignum and Roseovarius were the most abundant genera in both genomic and active bacterial communities of all the samples. Both genomic and active archaeal communities were mainly composed of Archaeoglobus and Methanolobus. Within both bacteria and archaea, the active and genomic communities were compositionally distinct from one another across the different oil wells (bacteria p = 0.002; archaea p = 0.01). In addition, the sulfate-reducing microorganisms (SRMs) were specifically assessed by Sanger sequencing of functional genes aprA and dsrA encoding the enzymes adenosine-5′-phosphosulfate reductase and dissimilatory sulfite reductase, respectively. Functional gene analysis indicated that potentially active Archaeoglobus, Desulfotignum, Desulfovibrio, and Thermodesulforhabdus were frequently detected, with Archaeoglobus as the most abundant and active sulfate-reducing group. Canonical correspondence analysis revealed that the SRM communities in petroleum reservoir system were closely related to pH of the production water and sulfate concentration. This study highlights the importance of distinguishing the metabolically active microorganisms from the genomic community and extends our knowledge on the active SRM communities in corrosive petroleum reservoirs. PMID:28638372

  19. Diversity and Composition of Sulfate-Reducing Microbial Communities Based on Genomic DNA and RNA Transcription in Production Water of High Temperature and Corrosive Oil Reservoir

    Directory of Open Access Journals (Sweden)

    Xiao-Xiao Li

    2017-06-01

    Full Text Available Deep subsurface petroleum reservoir ecosystems harbor a high diversity of microorganisms, and microbial influenced corrosion is a major problem for the petroleum industry. Here, we used high-throughput sequencing to explore the microbial communities based on genomic 16S rDNA and metabolically active 16S rRNA analyses of production water samples with different extents of corrosion from a high-temperature oil reservoir. Results showed that Desulfotignum and Roseovarius were the most abundant genera in both genomic and active bacterial communities of all the samples. Both genomic and active archaeal communities were mainly composed of Archaeoglobus and Methanolobus. Within both bacteria and archaea, the active and genomic communities were compositionally distinct from one another across the different oil wells (bacteria p = 0.002; archaea p = 0.01. In addition, the sulfate-reducing microorganisms (SRMs were specifically assessed by Sanger sequencing of functional genes aprA and dsrA encoding the enzymes adenosine-5′-phosphosulfate reductase and dissimilatory sulfite reductase, respectively. Functional gene analysis indicated that potentially active Archaeoglobus, Desulfotignum, Desulfovibrio, and Thermodesulforhabdus were frequently detected, with Archaeoglobus as the most abundant and active sulfate-reducing group. Canonical correspondence analysis revealed that the SRM communities in petroleum reservoir system were closely related to pH of the production water and sulfate concentration. This study highlights the importance of distinguishing the metabolically active microorganisms from the genomic community and extends our knowledge on the active SRM communities in corrosive petroleum reservoirs.

  20. Adaptation of maize to temperate climates: mid-density genome-wide association genetics and diversity patterns reveal key genomic regions, with a major contribution of the Vgt2 (ZCN8 locus.

    Directory of Open Access Journals (Sweden)

    Sophie Bouchet

    Full Text Available The migration of maize from tropical to temperate climates was accompanied by a dramatic evolution in flowering time. To gain insight into the genetic architecture of this adaptive trait, we conducted a 50K SNP-based genome-wide association and diversity investigation on a panel of tropical and temperate American and European representatives. Eighteen genomic regions were associated with flowering time. The number of early alleles cumulated along these regions was highly correlated with flowering time. Polymorphism in the vicinity of the ZCN8 gene, which is the closest maize homologue to Arabidopsis major flowering time (FT gene, had the strongest effect. This polymorphism is in the vicinity of the causal factor of Vgt2 QTL. Diversity was lower, whereas differentiation and LD were higher for associated loci compared to the rest of the genome, which is consistent with selection acting on flowering time during maize migration. Selection tests also revealed supplementary loci that were highly differentiated among groups and not associated with flowering time in our panel, whereas they were in other linkage-based studies. This suggests that allele fixation led to a lack of statistical power when structure and relatedness were taken into account in a linear mixed model. Complementary designs and analysis methods are necessary to unravel the architecture of complex traits. Based on linkage disequilibrium (LD estimates corrected for population structure, we concluded that the number of SNPs genotyped should be at least doubled to capture all QTLs contributing to the genetic architecture of polygenic traits in this panel. These results show that maize flowering time is controlled by numerous QTLs of small additive effect and that strong polygenic selection occurred under cool climatic conditions. They should contribute to more efficient genomic predictions of flowering time and facilitate the dissemination of diverse maize genetic resources under a wide

  1. Evolutionary genomics of mycovirus-related dsRNA viruses reveals cross-family horizontal gene transfer and evolution of diverse viral lineages.

    Science.gov (United States)

    Liu, Huiquan; Fu, Yanping; Xie, Jiatao; Cheng, Jiasen; Ghabrial, Said A; Li, Guoqing; Peng, Youliang; Yi, Xianhong; Jiang, Daohong

    2012-06-20

    Double-stranded (ds) RNA fungal viruses are typically isometric single-shelled particles that are classified into three families, Totiviridae, Partitiviridae and Chrysoviridae, the members of which possess monopartite, bipartite and quadripartite genomes, respectively. Recent findings revealed that mycovirus-related dsRNA viruses are more diverse than previously recognized. Although an increasing number of viral complete genomic sequences have become available, the evolution of these diverse dsRNA viruses remains to be clarified. This is particularly so since there is little evidence for horizontal gene transfer (HGT) among dsRNA viruses. In this study, we report the molecular properties of two novel dsRNA mycoviruses that were isolated from a field strain of Sclerotinia sclerotiorum, Sunf-M: one is a large monopartite virus representing a distinct evolutionary lineage of dsRNA viruses; the other is a new member of the family Partitiviridae. Comprehensive phylogenetic analysis and genome comparison revealed that there are at least ten monopartite, three bipartite, one tripartite and three quadripartite lineages in the known dsRNA mycoviruses and that the multipartite lineages have possibly evolved from different monopartite dsRNA viruses. Moreover, we found that homologs of the S7 Domain, characteristic of members of the genus phytoreovirus in family Reoviridae are widely distributed in diverse dsRNA viral lineages, including chrysoviruses, endornaviruses and some unclassified dsRNA mycoviruses. We further provided evidence that multiple HGT events may have occurred among these dsRNA viruses from different families. Our study provides an insight into the phylogeny and evolution of mycovirus-related dsRNA viruses and reveals that the occurrence of HGT between different virus species and the development of multipartite genomes during evolution are important macroevolutionary mechanisms in dsRNA viruses.

  2. Evolutionary genomics of mycovirus-related dsRNA viruses reveals cross-family horizontal gene transfer and evolution of diverse viral lineages

    Directory of Open Access Journals (Sweden)

    Liu Huiquan

    2012-06-01

    Full Text Available Abstract Background Double-stranded (ds RNA fungal viruses are typically isometric single-shelled particles that are classified into three families, Totiviridae, Partitiviridae and Chrysoviridae, the members of which possess monopartite, bipartite and quadripartite genomes, respectively. Recent findings revealed that mycovirus-related dsRNA viruses are more diverse than previously recognized. Although an increasing number of viral complete genomic sequences have become available, the evolution of these diverse dsRNA viruses remains to be clarified. This is particularly so since there is little evidence for horizontal gene transfer (HGT among dsRNA viruses. Results In this study, we report the molecular properties of two novel dsRNA mycoviruses that were isolated from a field strain of Sclerotinia sclerotiorum, Sunf-M: one is a large monopartite virus representing a distinct evolutionary lineage of dsRNA viruses; the other is a new member of the family Partitiviridae. Comprehensive phylogenetic analysis and genome comparison revealed that there are at least ten monopartite, three bipartite, one tripartite and three quadripartite lineages in the known dsRNA mycoviruses and that the multipartite lineages have possibly evolved from different monopartite dsRNA viruses. Moreover, we found that homologs of the S7 Domain, characteristic of members of the genus phytoreovirus in family Reoviridae are widely distributed in diverse dsRNA viral lineages, including chrysoviruses, endornaviruses and some unclassified dsRNA mycoviruses. We further provided evidence that multiple HGT events may have occurred among these dsRNA viruses from different families. Conclusions Our study provides an insight into the phylogeny and evolution of mycovirus-related dsRNA viruses and reveals that the occurrence of HGT between different virus species and the development of multipartite genomes during evolution are important macroevolutionary mechanisms in dsRNA viruses.

  3. Cyanobacterial life at low O(2): community genomics and function reveal metabolic versatility and extremely low diversity in a Great Lakes sinkhole mat.

    Science.gov (United States)

    Voorhies, A A; Biddanda, B A; Kendall, S T; Jain, S; Marcus, D N; Nold, S C; Sheldon, N D; Dick, G J

    2012-05-01

    Cyanobacteria are renowned as the mediators of Earth's oxygenation. However, little is known about the cyanobacterial communities that flourished under the low-O(2) conditions that characterized most of their evolutionary history. Microbial mats in the submerged Middle Island Sinkhole of Lake Huron provide opportunities to investigate cyanobacteria under such persistent low-O(2) conditions. Here, venting groundwater rich in sulfate and low in O(2) supports a unique benthic ecosystem of purple-colored cyanobacterial mats. Beneath the mat is a layer of carbonate that is enriched in calcite and to a lesser extent dolomite. In situ benthic metabolism chambers revealed that the mats are net sinks for O(2), suggesting primary production mechanisms other than oxygenic photosynthesis. Indeed, (14)C-bicarbonate uptake studies of autotrophic production show variable contributions from oxygenic and anoxygenic photosynthesis and chemosynthesis, presumably because of supply of sulfide. These results suggest the presence of either facultatively anoxygenic cyanobacteria or a mix of oxygenic/anoxygenic types of cyanobacteria. Shotgun metagenomic sequencing revealed a remarkably low-diversity mat community dominated by just one genotype most closely related to the cyanobacterium Phormidium autumnale, for which an essentially complete genome was reconstructed. Also recovered were partial genomes from a second genotype of Phormidium and several Oscillatoria. Despite the taxonomic simplicity, diverse cyanobacterial genes putatively involved in sulfur oxidation were identified, suggesting a diversity of sulfide physiologies. The dominant Phormidium genome reflects versatile metabolism and physiology that is specialized for a communal lifestyle under fluctuating redox conditions and light availability. Overall, this study provides genomic and physiologic insights into low-O(2) cyanobacterial mat ecosystems that played crucial geobiological roles over long stretches of Earth history.

  4. Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with pulmonary hypertension [version 2; referees: 2 approved

    Directory of Open Access Journals (Sweden)

    Michael P. Heaton

    2016-10-01

    Full Text Available The availability of whole genome sequence (WGS data has made it possible to discover protein variants in silico. However, existing bovine WGS databases do not show data in a form conducive to protein variant analysis, and tend to under represent the breadth of genetic diversity in global beef cattle. Thus, our first aim was to use 96 beef sires, sharing minimal pedigree relationships, to create a searchable and publicly viewable set of mapped genomes relevant for 19 popular breeds of U.S. cattle. Our second aim was to identify protein variants encoded by the bovine endothelial PAS domain-containing protein 1 gene (EPAS1, a gene associated with pulmonary hypertension in Angus cattle. The identity and quality of genomic sequences were verified by comparing WGS genotypes to those derived from other methods. The average read depth, genotype scoring rate, and genotype accuracy exceeded 14, 99%, and 99%, respectively. The 96 genomes were used to discover four amino acid variants encoded by EPAS1 (E270Q, P362L, A671G, and L701F and confirm two variants previously associated with disease (A606T and G610S. The six EPAS1 missense mutations were verified with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry assays, and their frequencies were estimated in a separate collection of 1154 U.S. cattle representing 46 breeds. A rooted phylogenetic tree of eight polypeptide sequences provided a framework for evaluating the likely order of mutations and potential impact of EPAS1 alleles on the adaptive response to chronic hypoxia in U.S. cattle. This public, whole genome resource facilitates in silico identification of protein variants in diverse types of U.S. beef cattle, and provides a means of translating WGS data into a practical biological and evolutionary context for generating and testing hypotheses.

  5. Extraction of ribosomal RNA and genomic DNA from soil for studying the diversity of the indigenous bacterial community

    NARCIS (Netherlands)

    Duarte, G.F.; Rosado, A.S.; Keijzer-Wolters, A.C.; Elsas, van J.D.

    1998-01-01

    A method for the indirect (cell extraction followed by nucleic acid extraction) isolation of bacterial ribosomal RNA (rRNA) and genomic DNA from soil was developed. The protocol allowed for the rapid parallel extraction of genomic DNA as well as small and large ribosomal subunit RNA from four soils

  6. Genome characterization and genetic diversity of sweet potato symptomless virus 1: a mastrevirus with an unusual nonanucleotide

    Science.gov (United States)

    Complete genomic sequences of nine isolates of sweet potato symptomless virus 1 (SPSMV-1), a virus of genus Mastrevirus in the family Geminiviridae, was determined to be 2,559-2,602 nucleotides from sweet potato accessions from different countries. These isolates shared genomic sequence identities o...

  7. The draft genome of watermelon (Citrullus lanatus) and resequencing of 20 diverse accessions

    DEFF Research Database (Denmark)

    Guo, Shaogui; Zhang, Jianguo; Sun, Honghe

    2013-01-01

    Watermelon, Citrullus lanatus, is an important cucurbit crop grown throughout the world. Here we report a high-quality draft genome sequence of the east Asia watermelon cultivar 97103 (2n = 2× = 22) containing 23,440 predicted protein-coding genes. Comparative genomics analysis provided an evolut...

  8. The Genomics Education Partnership: Successful Integration of Research into Laboratory Classes at a Diverse Group of Undergraduate Institutions

    Science.gov (United States)

    Shaffer, Christopher D.; Alvarez, Consuelo; Bailey, Cheryl; Barnard, Daron; Bhalla, Satish; Chandrasekaran, Chitra; Chandrasekaran, Vidya; Chung, Hui-Min; Dorer, Douglas R.; Du, Chunguang; Eckdahl, Todd T.; Poet, Jeff L.; Frohlich, Donald; Goodman, Anya L.; Gosser, Yuying; Hauser, Charles; Hoopes, Laura L.M.; Johnson, Diana; Jones, Christopher J.; Kaehler, Marian; Kokan, Nighat; Kopp, Olga R.; Kuleck, Gary A.; McNeil, Gerard; Moss, Robert; Myka, Jennifer L.; Nagengast, Alexis; Morris, Robert; Overvoorde, Paul J.; Shoop, Elizabeth; Parrish, Susan; Reed, Kelynne; Regisford, E. Gloria; Revie, Dennis; Rosenwald, Anne G.; Saville, Ken; Schroeder, Stephanie; Shaw, Mary; Skuse, Gary; Smith, Christopher; Smith, Mary; Spana, Eric P.; Spratt, Mary; Stamm, Joyce; Thompson, Jeff S.; Wawersik, Matthew; Wilson, Barbara A.; Youngblom, Jim; Leung, Wilson; Buhler, Jeremy; Mardis, Elaine R.; Lopatto, David

    2010-01-01

    Genomics is not only essential for students to understand biology but also provides unprecedented opportunities for undergraduate research. The goal of the Genomics Education Partnership (GEP), a collaboration between a growing number of colleges and universities around the country and the Department of Biology and Genome Center of Washington University in St. Louis, is to provide such research opportunities. Using a versatile curriculum that has been adapted to many different class settings, GEP undergraduates undertake projects to bring draft-quality genomic sequence up to high quality and/or participate in the annotation of these sequences. GEP undergraduates have improved more than 2 million bases of draft genomic sequence from several species of Drosophila and have produced hundreds of gene models using evidence-based manual annotation. Students appreciate their ability to make a contribution to ongoing research, and report increased independence and a more active learning approach after participation in GEP projects. They show knowledge gains on pre- and postcourse quizzes about genes and genomes and in bioinformatic analysis. Participating faculty also report professional gains, increased access to genomics-related technology, and an overall positive experience. We have found that using a genomics research project as the core of a laboratory course is rewarding for both faculty and students. PMID:20194808

  9. Using sheep genomes from diverse U.S. breeds to identify missense variants in genes affecting fecundity

    Science.gov (United States)

    Background: Access to sheep genome sequences significantly improves the chances of identifying genes that may influence the health, welfare, and productivity of these animals. Methods: A public, searchable DNA sequence resource for U.S. sheep was created with whole genome sequence (WGS) of 96 rams. ...

  10. Annotated Draft Genome Assemblies for the Northern Bobwhite (Colinus virginianus) and the Scaled Quail (Callipepla squamata) Reveal Disparate Estimates of Modern Genome Diversity and Historic Effective Population Size.

    Science.gov (United States)

    Oldeschulte, David L; Halley, Yvette A; Wilson, Miranda L; Bhattarai, Eric K; Brashear, Wesley; Hill, Joshua; Metz, Richard P; Johnson, Charles D; Rollins, Dale; Peterson, Markus J; Bickhart, Derek M; Decker, Jared E; Sewell, John F; Seabury, Christopher M

    2017-09-07

    Northern bobwhite ( Colinus virginianus ; hereafter bobwhite) and scaled quail ( Callipepla squamata ) populations have suffered precipitous declines across most of their US ranges. Illumina-based first- (v1.0) and second- (v2.0) generation draft genome assemblies for the scaled quail and the bobwhite produced N50 scaffold sizes of 1.035 and 2.042 Mb, thereby producing a 45-fold improvement in contiguity over the existing bobwhite assembly, and ≥90% of the assembled genomes were captured within 1313 and 8990 scaffolds, respectively. The scaled quail assembly (v1.0 = 1.045 Gb) was ∼20% smaller than the bobwhite (v2.0 = 1.254 Gb), which was supported by kmer-based estimates of genome size. Nevertheless, estimates of GC content (41.72%; 42.66%), genome-wide repetitive content (10.40%; 10.43%), and MAKER-predicted protein coding genes (17,131; 17,165) were similar for the scaled quail (v1.0) and bobwhite (v2.0) assemblies, respectively. BUSCO analyses utilizing 3023 single-copy orthologs revealed a high level of assembly completeness for the scaled quail (v1.0; 84.8%) and the bobwhite (v2.0; 82.5%), as verified by comparison with well-established avian genomes. We also detected 273 putative segmental duplications in the scaled quail genome (v1.0), and 711 in the bobwhite genome (v2.0), including some that were shared among both species. Autosomal variant prediction revealed ∼2.48 and 4.17 heterozygous variants per kilobase within the scaled quail (v1.0) and bobwhite (v2.0) genomes, respectively, and estimates of historic effective population size were uniformly higher for the bobwhite across all time points in a coalescent model. However, large-scale declines were predicted for both species beginning ∼15-20 KYA. Copyright © 2017 Oldeschulte et al.

  11. Genetic diversity and linkage disequilibrium studies on a 3.1-Mb genomic region of chromosome 3B in European and Asian bread wheat (Triticum aestivum L.) populations.

    Science.gov (United States)

    Hao, C Y; Perretant, M R; Choulet, F; Wang, L F; Paux, E; Sourdille, P; Zhang, X Y; Feuillet, C; Balfourier, Francois

    2010-11-01

    Genetic diversity and linkage disequilibrium (LD) were investigated in 376 Asian and European accessions of bread wheat (Triticum aestivum L.). After a first and rapid screening about diversity and genetic structure at the whole genome scale using 70 simple sequence repeats (SSRs), we focused on a sequenced contig (ctg954) of 3.1 Mb located on the short arm of chromosome 3B of cv. Chinese Spring, using 32 SSRs and 10 single nucleotide polymorphisms. This contig is part of a multiple fungal resistance region. Mean polymorphism information content value on the 32 SSRs was slightly higher in the Asian genepool (0.396) than that for the European (0.329) pool. Compared with results at the whole genome scale, data from this 3.1-Mb region indicated similar trends in genetic diversity indices between both genepools. Population structure and molecular variance analyses demonstrated significant genetic differentiation and geographical subdivision in both groups of accessions. Concerning LD at the contig level, the European population had a significantly higher mean r(2) value (0.23) than the Asian population (0.18), indicating a stronger LD in the European material. With a mean of 1 marker every 74 kb, the resolution reached here allowed to perform a detailed comparative analysis of the LD and genetic diversity along the complete 3.1-Mb region in both genepools. A sliding-window approach revealed some interesting regions of the contig where LD is increasing when genetic diversity is decreasing. This study provides an in-depth understanding of molecular population genetics in European and Asian wheat gene pools, and prospects for association mapping of important sources of fungal disease resistance.

  12. Whole genome sequences of the USMARC beef cattle diversity panel v2.9 aligned to the bovine reference genome assembly

    Science.gov (United States)

    A searchable and publicly viewable set of mapped genomes from 96 beef sires from 19 popular breeds of U.S. cattle was created. These sires with minimal pedigree relationships, represent >99% of the germplasm used in the US beef industry circa 2000. The group is estimated to contain more than 187 u...

  13. Chromosome-level genome map provides insights into diverse defense mechanisms in the medicinal fungus Ganoderma sinense

    Science.gov (United States)

    Zhu, Yingjie; Xu, Jiang; Sun, Chao; Zhou, Shiguo; Xu, Haibin; Nelson, David R.; Qian, Jun; Song, Jingyuan; Luo, Hongmei; Xiang, Li; Li, Ying; Xu, Zhichao; Ji, Aijia; Wang, Lizhi; Lu, Shanfa; Hayward, Alice; Sun, Wei; Li, Xiwen; Schwartz, David C.; Wang, Yitao; Chen, Shilin

    2015-01-01

    Fungi have evolved powerful genomic and chemical defense systems to protect themselves against genetic destabilization and other organisms. However, the precise molecular basis involved in fungal defense remain largely unknown in Basidiomycetes. Here the complete genome sequence, as well as DNA methylation patterns and small RNA transcriptomes, was analyzed to provide a holistic overview of secondary metabolism and defense processes in the model medicinal fungus, Ganoderma sinense. We reported the 48.96 Mb genome sequence of G. sinense, consisting of 12 chromosomes and encoding 15,688 genes. More than thirty gene clusters involved in the biosynthesis of secondary metabolites, as well as a large array of genes responsible for their transport and regulation were highlighted. In addition, components of genome defense mechanisms, namely repeat-induced point mutation (RIP), DNA methylation and small RNA-mediated gene silencing, were revealed in G. sinense. Systematic bioinformatic investigation of the genome and methylome suggested that RIP and DNA methylation combinatorially maintain G. sinense genome stability by inactivating invasive genetic material and transposable elements. The elucidation of the G. sinense genome and epigenome provides an unparalleled opportunity to advance our understanding of secondary metabolism and fungal defense mechanisms. PMID:26046933

  14. Rapid and accurate species and genomic species identification and exhaustive population diversity assessment of Agrobacterium spp. using recA-based PCR.

    Science.gov (United States)

    Shams, M; Vial, L; Chapulliot, D; Nesme, X; Lavire, C

    2013-07-01

    Agrobacteria are common soil bacteria that interact with plants as commensals, plant growth promoting rhizobacteria or alternatively as pathogens. Indigenous agrobacterial populations are composites, generally with several species and/or genomic species and several strains per species. We thus developed a recA-based PCR approach to accurately identify and specifically detect agrobacteria at various taxonomic levels. Specific primers were designed for all species and/or genomic species of Agrobacterium presently known, including 11 genomic species of the Agrobacterium tumefaciens complex (G1-G9, G13 and G14, among which only G2, G4, G8 and G14 still received a Latin epithet: pusense, radiobacter, fabrum and nepotum, respectively), A. larrymoorei, A. rubi, R. skierniewicense, A. sp. 1650, and A. vitis, and for the close relative Allorhizobium undicola. Specific primers were also designed for superior taxa, Agrobacterium spp. and Rhizobiaceace. Primer specificities were assessed with target and non-target pure culture DNAs as well as with DNAs extracted from composite agrobacterial communities. In addition, we showed that the amplicon cloning-sequencing approach used with Agrobacterium-specific or Rhizobiaceae-specific primers is a way to assess the agrobacterial diversity of an indigenous agrobacterial population. Hence, the agrobacterium-specific primers designed in the present study enabled the first accurate and rapid identification of all species and/or genomic species of Agrobacterium, as well as their direct detection in environmental samples. Copyright © 2013 Elsevier GmbH. All rights reserved.

  15. High-throughput sequencing analysis reveals the genetic diversity of different regions of the murine norovirus genome during in vitro replication.

    Science.gov (United States)

    Mauroy, Axel; Taminiau, Bernard; Nezer, Carine; Ghurburrun, Elsa; Baurain, Denis; Daube, Georges; Thiry, Etienne

    2017-04-01

    In this study, we report the genetic diversity and nucleotide mutation rates of five representative regions of the murine norovirus genome during in vitro passages. The mutation rates were similar in genomic regions encompassing partial coding sequences for non-structural (NS) 1-2, NS5, NS6, NS7 proteins within open reading frame (ORF) 1. In a region encoding a portion of the major capsid protein (VP1) within ORF2 (also including the ORF4 region) and a portion of the minor structural protein (VP2), the mutation rates were estimated to be at least one order of magnitude higher. The VP2 coding region was found to have the highest mutation rate.

  16. Use of Comparative Genomics To Characterize the Diversity of Acinetobacter baumannii Surveillance Isolates in a Health Care Institution.

    Science.gov (United States)

    Wallace, Lalena; Daugherty, Sean C; Nagaraj, Sushma; Johnson, J Kristie; Harris, Anthony D; Rasko, David A

    2016-10-01

    Despite the increasing prevalence of the nosocomial pathogen Acinetobacter baumannii, little is known about which genomic components contribute to clinical presentation of this important pathogen. Most whole-genome comparisons of A. baumannii have focused on specific genomic regions associated with phenotypes in a limited number of genomes. In this work, we describe the results of a whole-genome comparative analysis of 254 surveillance isolates of Acinetobacter species, 203 of which were A. baumannii, isolated from perianal swabs and sputum samples collected as part of an infection control active surveillance program at the University of Maryland Medical Center. The collection of surveillance isolates includes both carbapenem-susceptible and -resistant isolates. Based on the whole-genome phylogeny, the A. baumannii isolates collected belong to two major phylogenomic lineages. Results from multilocus sequence typing indicated that one of the major phylogenetic groups of A. baumannii was comprised solely of strains from the international clonal lineage 2. The genomic content of the A. baumannii isolates was examined using large-scale BLAST score ratio analysis to identify genes that are associated with carbapenem-susceptible and -resistant isolates, as well as genes potentially associated with the source of isolation. This analysis revealed a number of genes that were exclusive or at greater frequency in each of these classifications. This study is the most comprehensive genomic comparison of Acinetobacter isolates from a surveillance study to date and provides important information that will contribute to our understanding of the success of A. baumannii as a human pathogen. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  17. Significant variance in genetic diversity among populations of Schistosoma haematobium detected using microsatellite DNA loci from a genome-wide database.

    Science.gov (United States)

    Glenn, Travis C; Lance, Stacey L; McKee, Anna M; Webster, Bonnie L; Emery, Aidan M; Zerlotini, Adhemar; Oliveira, Guilherme; Rollinson, David; Faircloth, Brant C

    2013-10-17

    Urogenital schistosomiasis caused by Schistosoma haematobium is widely distributed across Africa and is increasingly being targeted for control. Genome sequences and population genetic parameters can give insight into the potential for population- or species-level drug resistance. Microsatellite DNA loci are genetic markers in wide use by Schistosoma researchers, but there are few primers available for S. haematobium. We sequenced 1,058,114 random DNA fragments from clonal cercariae collected from a snail infected with a single Schistosoma haematobium miracidium. We assembled and aligned the S. haematobium sequences to the genomes of S. mansoni and S. japonicum, identifying microsatellite DNA loci across all three species and designing primers to amplify the loci in S. haematobium. To validate our primers, we screened 32 randomly selected primer pairs with population samples of S. haematobium. We designed >13,790 primer pairs to amplify unique microsatellite loci in S. haematobium, (available at http://www.cebio.org/projetos/schistosoma-haematobium-genome). The three Schistosoma genomes contained similar overall frequencies of microsatellites, but the frequency and length distributions of specific motifs differed among species. We identified 15 primer pairs that amplified consistently and were easily scored. We genotyped these 15 loci in S. haematobium individuals from six locations: Zanzibar had the highest levels of diversity; Malawi, Mauritius, Nigeria, and Senegal were nearly as diverse; but the sample from South Africa was much less diverse. About half of the primers in the database of Schistosoma haematobium microsatellite DNA loci should yield amplifiable and easily scored polymorphic markers, thus providing thousands of potential markers. Sequence conservation among S. haematobium, S. japonicum, and S. mansoni is relatively high, thus it should now be possible to identify markers that are universal among Schistosoma species (i.e., using DNA sequences

  18. Genome-wide association study and genetic diversity analysis on nitrogen use efficiency in a Central European winter wheat (Triticum aestivum L. collection.

    Directory of Open Access Journals (Sweden)

    István Monostori

    Full Text Available To satisfy future demands, the increase of wheat (Triticum aestivum L. yield is inevitable. Simultaneously, maintaining high crop productivity and efficient use of nutrients, especially nitrogen use efficiency (NUE, are essential for sustainable agriculture. NUE and its components are inherently complex and highly influenced by environmental factors, nitrogen management practices and genotypic variation. Therefore, a better understanding of their genetic basis and regulation is fundamental. To investigate NUE-related traits and their genetic and environmental regulation, field trials were evaluated in a Central European wheat collection of 93 cultivars at two nitrogen input levels across three seasons. This elite germplasm collection was genotyped on DArTseq® genotypic platform to identify loci affecting N-related complex agronomic traits. To conduct robust genome-wide association mapping, the genetic diversity, population structure and linkage disequilibrium were examined. Population structure was investigated by various methods and two subpopulations were identified. Their separation is based on the breeding history of the cultivars, while analysis of linkage disequilibrium suggested that selective pressures had acted on genomic regions bearing loci with remarkable agronomic importance. Besides NUE, genetic basis for variation in agronomic traits indirectly affecting NUE and its components, moreover genetic loci underlying response to nitrogen fertilisation were also determined. Altogether, 183 marker-trait associations (MTA were identified spreading over almost the entire genome. We found that most of the MTAs were environmental-dependent. The present study identified several associated markers in those genomic regions where previous reports had found genes or quantitative trait loci influencing the same traits, while most of the MTAs revealed new genomic regions. Our data provides an overview of the allele composition of bread wheat

  19. Transcriptional Slippage and RNA Editing Increase the Diversity of Transcripts in Chloroplasts: Insight from Deep Sequencing of Vigna radiata Genome and Transcriptome.

    Directory of Open Access Journals (Sweden)

    Ching-Ping Lin

    Full Text Available We performed deep sequencing of the nuclear and organellar genomes of three mungbean genotypes: Vigna radiata ssp. sublobata TC1966, V. radiata var. radiata NM92 and the recombinant inbred line RIL59 derived from a cross between TC1966 and NM92. Moreover, we performed deep sequencing of the RIL59 transcriptome to investigate transcript variability. The mungbean chloroplast genome has a quadripartite structure including a pair of inverted repeats separated by two single copy regions. A total of 213 simple sequence repeats were identified in the chloroplast genomes of NM92 and RIL59; 78 single nucleotide variants and nine indels were discovered in comparing the chloroplast genomes of TC1966 and NM92. Analysis of the mungbean chloroplast transcriptome revealed mRNAs that were affected by transcriptional slippage and RNA editing. Transcriptional slippage frequency was positively correlated with the length of simple sequence repeats of the mungbean chloroplast genome (R2=0.9911. In total, 41 C-to-U editing sites were found in 23 chloroplast genes and in one intergenic spacer. No editing site that swapped U to C was found. A combination of bioinformatics and experimental methods revealed that the plastid-encoded RNA polymerase-transcribed genes psbF and ndhA are affected by transcriptional slippage in mungbean and in main lineages of land plants, including three dicots (Glycine max, Brassica rapa, and Nicotiana tabacum, two monocots (Oryza sativa and Zea mays, two gymnosperms (Pinus taeda and Ginkgo biloba and one moss (Physcomitrella patens. Transcript analysis of the rps2 gene showed that transcriptional slippage could affect transcripts at single sequence repeat regions with poly-A runs. It showed that transcriptional slippage together with incomplete RNA editing may cause sequence diversity of transcripts in chloroplasts of land plants.

  20. Genetic Diversity, Population Structure, and Linkage Disequilibrium of an Association-Mapping Panel Revealed by Genome-Wide SNP Markers in Sesame

    Science.gov (United States)

    Cui, Chengqi; Mei, Hongxian; Liu, Yanyang; Zhang, Haiyang; Zheng, Yongzhan

    2017-01-01

    The characterization of genetic diversity and population structure can be used in tandem to detect reliable phenotype–genotype associations. In the present study, we genotyped a set of 366 sesame germplasm accessions by using 89,924 single-nucleotide polymorphisms (SNPs). The number of SNPs on each chromosome was consistent with the physical length of the respective chromosome, and the average marker density was approximately 2.67 kb/SNP. The genetic diversity analysis showed that the average nucleotide diversity of the panel was 1.1 × 10-3, with averages of 1.0 × 10-4, 2.7 × 10-4, and 3.6 × 10-4 obtained, respectively for three identified subgroups of the panel: Pop 1, Pop 2, and the Mixed. The genetic structure analysis revealed that these sesame germplasm accessions were structured primarily along the basis of their geographic collection, and that an extensive admixture occurred in the panel. The genome-wide linkage disequilibrium (LD) analysis showed that an average LD extended up to ∼99 kb. The genetic diversity and population structure revealed in this study should provide guidance to the future design of association studies and the systematic utilization of the genetic variation characterizing the sesame panel. PMID:28729877

  1. Diversity of the Ty-1 copia retrotransposon Tos17 in rice (Oryza sativa L.) and the AA genome of the Oryza genus.

    Science.gov (United States)

    Petit, Julie; Bourgeois, Emmanuelle; Stenger, Wilfried; Bès, Martine; Droc, Gaétan; Meynard, Donaldo; Courtois, Brigitte; Ghesquière, Alain; Sabot, François; Panaud, Olivier; Guiderdoni, Emmanuel

    2009-12-01

    Retrotransposons are mobile genetic elements, ubiquitous in Eukaryotic genomes, which have proven to be major genetic tools in determining phylogeny and structuring genetic diversity, notably in plants. We investigate here the diversity of the Ty1-copia retrotransposon Tos17 in the cultivated rice of Asian origin (Oryza sativa L.) and related AA genome species of the Oryza genus, to contribute understanding of the complex evolutionary history in this group of species through that of the element in the lineages. In that aim, we used a combination of Southern hybridization with a reverse transcriptase (RT) probe and an adapter-PCR mediated amplification, which allowed the sequencing of the genomic regions flanking Tos17 insertions. This analysis was carried out in a collection of 47 A-genome Oryza species accessions and 202 accessions of a core collection of Oryza sativa L. representative of the diversity of the species. Our Southern hybridization results show that Tos17 is present in all the accessions of the A-genome Oryza species, except for the South American species O. glumaepatula and the African species O. glaberrima and O. breviligulata. In O. sativa, the number of putative copies of Tos17 per accession ranged from 1 to 11 and multivariate analysis based on presence/absence of putative copies yielded a varietal clustering which is consistent with the isozyme classification of rice. Adapter PCR amplification and sequencing of flanking regions of Tos17 insertions in A-genome species other than O. sativa, followed by anchoring on the Nipponbare genome sequence, revealed 13 insertion sites of Tos17 in the surveyed O. rufipogon and O. longistaminata accessions, including one shared by both species. In O. sativa, the same approach revealed 25 insertions in the 6 varietal groups. Four insertion sites located on chromosomes 1, 2, 10, and 11 were found orthologous in O. rufipogon and O. sativa. The chromosome 1 insertion was also shared between O. rufipogon and O

  2. Genetic diversity, molecular phylogeny and selection evidence of the silkworm mitochondria implicated by complete resequencing of 41 genomes

    Directory of Open Access Journals (Sweden)

    Tellier Laurent C

    2010-03-01

    Full Text Available Abstract Background Mitochondria are a valuable resource for studying the evolutionary process and deducing phylogeny. A few mitochondria genomes have been sequenced, but a comprehensive picture of the domestication event for silkworm mitochondria remains to be established. In this study, we integrate the extant data, and perform a whole genome resequencing of Japanese wild silkworm to obtain breakthrough results in silkworm mitochondrial (mt population, and finally use these to deduce a more comprehensive phylogeny of the Bombycidae. Results We identified 347 single nucleotide polymorphisms (SNPs in the mt genome, but found no past recombination event to have occurred in the silkworm progenitor. A phylogeny inferred from these whole genome SNPs resulted in a well-classified tree, confirming that the domesticated silkworm, Bombyx mori, most recently diverged from the Chinese wild silkworm, rather than from the Japanese wild silkworm. We showed that the population sizes of the domesticated and Chinese wild silkworms both experience neither expansion nor contraction. We also discovered that one mt gene, named cytochrome b, shows a strong signal of positive selection in the domesticated clade. This gene is related to energy metabolism, and may have played an important role during silkworm domestication. Conclusions We present a comparative analysis on 41 mt genomes of B. mori and B. mandarina from China and Japan. With these, we obtain a much clearer picture of the evolution history of the silkworm. The data and analyses presented here aid our understanding of the silkworm in general, and provide a crucial insight into silkworm phylogeny.

  3. Systematic inference of copy-number genotypes from personal genome sequencing data reveals extensive olfactory receptor gene content diversity.

    Science.gov (United States)

    Waszak, Sebastian M; Hasin, Yehudit; Zichner, Thomas; Olender, Tsviya; Keydar, Ifat; Khen, Miriam; Stütz, Adrian M; Schlattl, Andreas; Lancet, Doron; Korbel, Jan O

    2010-11-11

    Copy-number variations (CNVs) are widespread in the human genome, but comprehensive assignments of integer locus copy-numbers (i.e., copy-number genotypes) that, for example, enable discrimination of homozygous from heterozygous CNVs, have remained challenging. Here we present CopySeq, a novel computational approach with an underlying statistical framework that analyzes the depth-of-coverage of high-throughput DNA sequencing reads, and can incorporate paired-end and breakpoint junction analysis based CNV-analysis approaches, to infer locus copy-number genotypes. We benchmarked CopySeq by genotyping 500 chromosome 1 CNV regions in 150 personal genomes sequenced at low-coverage. The assessed copy-number genotypes were highly concordant with our performed qPCR experiments (Pearson correlation coefficient 0.94), and with the published results of two microarray platforms (95-99% concordance). We further demonstrated the utility of CopySeq for analyzing gene regions enriched for segmental duplications by comprehensively inferring copy-number genotypes in the CNV-enriched >800 olfactory receptor (OR) human gene and pseudogene loci. CopySeq revealed that OR loci display an extensive range of locus copy-numbers across individuals, with zero to two copies in some OR loci, and two to nine copies in others. Among genetic variants affecting OR loci we identified deleterious variants including CNVs and SNPs affecting ~15% and ~20% of the human OR gene repertoire, respectively, implying that genetic variants with a possible impact on smell perception are widespread. Finally, we found that for several OR loci the reference genome appears to represent a minor-frequency variant, implying a necessary revision of the OR repertoire for future functional studies. CopySeq can ascertain genomic structural variation in specific gene families as well as at a genome-wide scale, where it may enable the quantitative evaluation of CNVs in genome-wide association studies involving high

  4. Comparative genomic analysis of the MHC: the evolution of class I duplication blocks, diversity and complexity from shark to man.

    Science.gov (United States)

    Kulski, Jerzy K; Shiina, Takashi; Anzai, Tatsuya; Kohara, Sakae; Inoko, Hidetoshi

    2002-12-01

    The major histocompatibility complex (MHC) genomic region is composed of a group of linked genes involved functionally with the adaptive and innate immune systems. The class I and class II genes are intrinsic features of the MHC and have been found in all the jawed vertebrates studied so far. The MHC genomic regions of the human and the chicken (B locus) have been fully sequenced and mapped, and the mouse MHC sequence is almost finished. Information on the MHC genomic structures (size, complexity, genic and intergenic composition and organization, gene order and number) of other vertebrates is largely limited or nonexistent. Therefore, we are mapping, sequencing and analyzing the MHC genomic regions of different human haplotypes and at least eight nonhuman species. Here, we review our progress with these sequences and compare the human MHC structure with that of the nonhuman primates (chimpanzee and rhesus macaque), other mammals (pigs, mice and rats) and nonmammalian vertebrates such as birds (chicken and quail), bony fish (medaka, pufferfish and zebrafish) and cartilaginous fish (nurse shark). This comparison reveals a complex MHC structure for mammals and a relatively simpler design for nonmammalian animals with a hypothetical prototypic structure for the shark. In the mammalian MHC, there are two to five different class I duplication blocks embedded within a framework of conserved nonclass I and/or nonclass II genes. With a few exceptions, the class I framework genes are absent from the MHC of birds, bony fish and sharks. Comparative genomics of the MHC reveal a highly plastic region with major structural differences between the mammalian and nonmammalian vertebrates. Additional genomic data are needed on animals of the reptilia, crocodilia and marsupial classes to find the origins of the class I framework genes and examples of structures that may be intermediate between the simple and complex MHC organizations of birds and mammals, respectively.

  5. Comparative genomic and proteomic analyses of two Mycoplasma agalactiae strains: clues to the macro- and micro-events that are shaping mycoplasma diversity

    Directory of Open Access Journals (Sweden)

    Mangenot Sophie

    2010-02-01

    Full Text Available Abstract Background While the genomic era is accumulating a tremendous amount of data, the question of how genomics can describe a bacterial species remains to be fully addressed. The recent sequencing of the genome of the Mycoplasma agalactiae type strain has challenged our general view on mycoplasmas by suggesting that these simple bacteria are able to exchange significant amount of genetic material via horizontal gene transfer. Yet, events that are shaping mycoplasma genomes and that are underlining diversity within this species have to be fully evaluated. For this purpose, we compared two strains that are representative of the genetic spectrum encountered in this species: the type strain PG2 which genome is already available and a field strain, 5632, which was fully sequenced and annotated in this study. Results The two genomes differ by ca. 130 kbp with that of 5632 being the largest (1006 kbp. The make up of this additional genetic material mainly corresponds (i to mobile genetic elements and (ii to expanded repertoire of gene families that encode putative surface proteins and display features of highly-variable systems. More specifically, three entire copies of a previously described integrative conjugative element are found in 5632 that accounts for ca. 80 kbp. Other mobile genetic elements, found in 5632 but not in PG2, are the more classical insertion sequences which are related to those found in two other ruminant pathogens, M. bovis and M. mycoides subsp. mycoides SC. In 5632, repertoires of gene families encoding surface proteins are larger due to gene duplication. Comparative proteomic analyses of the two strains indicate that the additional coding capacity of 5632 affects the overall architecture of the surface and suggests the occurrence of new phase variable systems based on single nucleotide polymorphisms. Conclusion Overall, comparative analyses of two M. agalactiae strains revealed a very dynamic genome which structure has

  6. Comparative genomic and proteomic analyses of two Mycoplasma agalactiae strains: clues to the macro- and micro-events that are shaping mycoplasma diversity.

    Science.gov (United States)

    Nouvel, Laurent X; Sirand-Pugnet, Pascal; Marenda, Marc S; Sagné, Eveline; Barbe, Valérie; Mangenot, Sophie; Schenowitz, Chantal; Jacob, Daniel; Barré, Aurélien; Claverol, Stéphane; Blanchard, Alain; Citti, Christine

    2010-02-02

    While the genomic era is accumulating a tremendous amount of data, the question of how genomics can describe a bacterial species remains to be fully addressed. The recent sequencing of the genome of the Mycoplasma agalactiae type strain has challenged our general view on mycoplasmas by suggesting that these simple bacteria are able to exchange significant amount of genetic material via horizontal gene transfer. Yet, events that are shaping mycoplasma genomes and that are underlining diversity within this species have to be fully evaluated. For this purpose, we compared two strains that are representative of the genetic spectrum encountered in this species: the type strain PG2 which genome is already available and a field strain, 5632, which was fully sequenced and annotated in this study. The two genomes differ by ca. 130 kbp with that of 5632 being the largest (1006 kbp). The make up of this additional genetic material mainly corresponds (i) to mobile genetic elements and (ii) to expanded repertoire of gene families that encode putative surface proteins and display features of highly-variable systems. More specifically, three entire copies of a previously described integrative conjugative element are found in 5632 that accounts for ca. 80 kbp. Other mobile genetic elements, found in 5632 but not in PG2, are the more classical insertion sequences which are related to those found in two other ruminant pathogens, M. bovis and M. mycoides subsp. mycoides SC. In 5632, repertoires of gene families encoding surface proteins are larger due to gene duplication. Comparative proteomic analyses of the two strains indicate that the additional coding capacity of 5632 affects the overall architecture of the surface and suggests the occurrence of new phase variable systems based on single nucleotide polymorphisms. Overall, comparative analyses of two M. agalactiae strains revealed a very dynamic genome which structure has been shaped by gene flow among ruminant mycoplasmas and

  7. Genetic diversity of the obligate intracellular bacterium Chlamydophila pneumoniae by genome-wide analysis of single nucleotide polymorphisms: evidence for highly clonal population structure

    Directory of Open Access Journals (Sweden)

    Solbach Werner

    2007-10-01

    Full Text Available Abstract Background Chlamydophila pneumoniae is an obligate intracellular bacterium that replicates in a biphasic life cycle within eukaryotic host cells. Four published genomes revealed an identity of > 99 %. This remarkable finding raised questions about the existence of distinguishable genotypes in correlation with geographical and anatomical origin. Results We studied the genetic diversity of C. pneumoniae by analysing synonymous single nucleotide polymorphisms (sSNPs that are under reduced selection pressure. We conducted an in silico analysis of the four sequenced genomes, chose 232 representative sSNPs and analysed the loci in 38 C. pneumoniae isolates. We identified 15 different genotypes that were separated in four major clusters. Clusters were not associated with anatomical or geographical origin. However, animal lineages are basal on the C. pneumomiae phylogeny, suggesting a recent transmission to humans through successive bottlenecks some 150,000 years ago. A lack of detectable variation in 17 isolates emphasizes the extraordinary genetic conservation of this species and the high clonality of the population. Moreover, the largest cluster, which encompasses 80% of all analysed strains, is an extremely young clade, that went through an important population expansion some 3,300 years ago. Conclusion sSNPs have proven useful as a sensitive marker to gain new insights into genetic diversity, population structure and evolutionary history of C. pneumoniae.

  8. The Arsenic Resistance-Associated Listeria Genomic Island LGI2 Exhibits Sequence and Integration Site Diversity and a Propensity for Three Listeria monocytogenes Clones with Enhanced Virulence.

    Science.gov (United States)

    Lee, Sangmi; Ward, Todd J; Jima, Dereje D; Parsons, Cameron; Kathariou, Sophia

    2017-11-01

    In the foodborne pathogen Listeria monocytogenes , arsenic resistance is encountered primarily in serotype 4b clones considered to have enhanced virulence and is associated with an arsenic resistance gene cluster within a 35-kb chromosomal region, Listeria genomic island 2 (LGI2). LGI2 was first identified in strain Scott A and includes genes putatively involved in arsenic and cadmium resistance, DNA integration, conjugation, and pathogenicity. However, the genomic localization and sequence content of LGI2 remain poorly characterized. Here we investigated 85 arsenic-resistant L. monocytogenes strains, mostly of serotype 4b. All but one of the 70 serotype 4b strains belonged to clonal complex 1 (CC1), CC2, and CC4, three major clones associated with enhanced virulence. PCR analysis suggested that 53 strains (62.4%) harbored an island highly similar to LGI2 of Scott A, frequently (42/53) in the same location as Scott A ( LMOf2365_2257 homolog). Random-primed PCR and whole-genome sequencing revealed seven novel insertion sites, mostly internal to chromosomal coding sequences, among strains harboring LGI2 outside the LMOf2365_2257 homolog. Interestingly, many CC1 strains harbored a noticeably diversified LGI2 (LGI2-1) in a unique location ( LMOf2365_0902 homolog) and with a novel additional gene. With few exceptions, the tested LGI2 genes were not detected in arsenic-resistant strains of serogroup 1/2, which instead often harbored a Tn 554 -associated arsenic resistance determinant not encountered in serotype 4b. These findings indicate that in L. monocytogenes , LGI2 has a propensity for certain serotype 4b clones, exhibits content diversity, and is highly promiscuous, suggesting an ability to mobilize various accessory genes into diverse chromosomal loci. IMPORTANCE Listeria monocytogenes is widely distributed in the environment and causes listeriosis, a foodborne disease with high mortality and morbidity. Arsenic and other heavy metals can powerfully shape the

  9. Novel viral genomes identified from six metagenomes reveal wide distribution of archaeal viruses and high viral diversity in terrestrial hot springs

    DEFF Research Database (Denmark)

    Islin, Sóley Ruth; Menzel, Peter; Krogh, Anders

    2016-01-01

    Limited by culture-dependent methods the number of viruses identified from thermophilic Archaea and Bacteria is still very small. In this study we retrieved viral sequences from six hot spring metagenomes isolated worldwide, revealing a wide distribution of four archaeal viral families, Ampullavi......Limited by culture-dependent methods the number of viruses identified from thermophilic Archaea and Bacteria is still very small. In this study we retrieved viral sequences from six hot spring metagenomes isolated worldwide, revealing a wide distribution of four archaeal viral families......, Ampullaviridae, Bicaudaviridae, Lipothrixviridae and Rudiviridae. Importantly, we identified ten complete or near complete viral genomes allowing, for the first time, an assessment of genome conservation and evolution of the Ampullaviridae family as well as Sulfolobus Monocaudavirus 1 (SMV1) related viruses....... Among the novel genomes, one belongs to a putative thermophilic virus infecting the bacterium Hydrogenobaculum, for which no virus has been reported in the literature. Moreover, a high viral diversity was observed in the metagenomes, especially among the Lipothrixviridae, as indicated by the large...

  10. From genomes to genotypes: molecular epidemiological analysis of Chlamydia gallinacea reveals a high level of genetic diversity for this newly emerging chlamydial pathogen.

    Science.gov (United States)

    Guo, Weina; Jelocnik, Martina; Li, Jing; Sachse, Konrad; Polkinghorne, Adam; Pannekoek, Yvonne; Kaltenboeck, Bernhard; Gong, Jiansen; You, Jinfeng; Wang, Chengming

    2017-12-06

    Chlamydia (C.) gallinacea is a recently identified bacterium that mainly infects domestic chickens. Demonstration of C. gallinacea in human atypical pneumonia suggests its zoonotic potential. Its prevalence in chickens exceeds that of C. psittaci, but genetic and genomic research on C. gallinacea is still at the beginning. In this study, we conducted whole-genome sequencing of C. gallinacea strain JX-1 isolated from an asymptomatic chicken, and comparative genomic analysis between C. gallinacea strains and related chlamydial species. The genome of C. gallinacea JX-1 was sequenced by single-molecule, real-time technology and is comprised of a 1,059,522-bp circular chromosome with an overall G + C content of 37.93% and sequence similarity of 99.4% to type strain 08-1274/3. In addition, a plasmid designated pJX-1, almost identical to p1274 of the type strain, except for two point mutations, was only found in field strains from chicken, but not in other hosts. In contrast to chlamydial species with notably variable polymorphic membrane protein (pmp) genes and plasticity zone (PZ), these regions were conserved in both C. gallinacea strains. There were 15 predicted pmp genes, but only B, A, E1, H, G1 and G2 were apparently intact in both strains. In comparison to chlamydial species where the PZ may be up to 50 kbp, C. gallinacea strains displayed gene content reduction in the PZ (14 kbp), with strain JX-1 having a premature STOP codon in the cytotoxin (tox) gene, while tox gene is intact in the type strain. In multilocus sequence typing (MLST), 15 C. gallinacea STs were identified among 25 strains based on cognate MLST allelic profiles of the concatenated sequences. The type strain and all Chinese strains belong to two distinct phylogenetic clades. Clade of the Chinese strains separated into 14 genetically distinct lineages, thus revealing considerable genetic diversity of C. gallinacea strains in China. In this first detailed comparative genomic analysis of C

  11. Salmonella strains isolated from Galápagos iguanas show spatial structuring of serovar and genomic diversity.

    Directory of Open Access Journals (Sweden)

    Emily W Lankau

    Full Text Available It is thought that dispersal limitation primarily structures host-associated bacterial populations because host distributions inherently limit transmission opportunities. However, enteric bacteria may disperse great distances during food-borne outbreaks. It is unclear if such rapid long-distance dispersal events happen regularly in natural systems or if these events represent an anthropogenic exception. We characterized Salmonella enterica isolates from the feces of free-living Galápagos land and marine iguanas from five sites on four islands using serotyping and genomic fingerprinting. Each site hosted unique and nearly exclusive serovar assemblages. Genomic fingerprint analysis offered a more complex model of S. enterica biogeography, with evidence of both unique strain pools and of spatial population structuring along a geographic gradient. These findings suggest that even relatively generalist enteric bacteria may be strongly dispersal limited in a natural system with strong barriers, such as oceanic divides. Yet, these differing results seen on two typing methods also suggests that genomic variation is less dispersal limited, allowing for different ecological processes to shape biogeographical patterns of the core and flexible portions of this bacterial species' genome.

  12. Ancient population structure in Phoenix dactylifera revealed by genome-wide genotyping of geographically diverse date palm cultivars

    Science.gov (United States)

    The date palm was one of the earliest cultivated fruit trees and is intimately tied to the history of human migration. With no true known wild ancestor little is known about the genetic origins and the effect of human cultivation on the date palm. Recent genome projects have just begun to provide th...

  13. Genome-wide distribution of genetic diversity and linkage disequilibrium in a mass-selected population of maritime pine.

    NARCIS (Netherlands)

    Plomion, C.; Chancerel, E.; Endelman, J.; Lamy, J.B.; Mandrou, E.; Lesur, I.; Ehrenmann, F.; Isik, F.; Bink, M.C.A.M.; Heerwaarden, van J.; Bouffier, L.

    2014-01-01

    BACKGROUND: The accessibility of high-throughput genotyping technologies has contributed greatly to the development of genomic resources in non-model organisms. High-density genotyping arrays have only recently been developed for some economically important species such as conifers. The potential

  14. Draft Assembly of Elite Inbred Line PH207 Provides Insights into Genomic and Transcriptome Diversity in Maize[OPEN

    Science.gov (United States)

    Soifer, Ilya; Barad, Omer; Shem-Tov, Doron; Baruch, Kobi; Lu, Fei; Hernandez, Alvaro G.; Wright, Chris L.; Koehler, Klaus; Buell, C. Robin; de Leon, Natalia

    2016-01-01

    Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison of these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools. PMID:27803309

  15. Comparative genomics reveals high biological diversity and specific adaptations in the industrially and medically important fungal genus Aspergillus

    DEFF Research Database (Denmark)

    de Vries, Ronald P.; Riley, Robert; Wiebenga, Ad

    2017-01-01

    Background:  The fungal genus Aspergillus is of critical importance to humankind. Species include those with industrial applications, important pathogens of humans, animals and crops, a source of potent carcinogenic contaminants of food, and an important genetic model. The genome sequences of eig...

  16. Draft assembly of elite inbred line PH207 provides insights into genomic and transcriptome diversity in maize

    Science.gov (United States)

    Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft geno...

  17. Genomic Features of the Human Dopamine Transporter Gene and Its Potential Epigenetic States: Implications for Phenotypic Diversity

    Energy Technology Data Exchange (ETDEWEB)

    Shumay, E.; Shumay, E.; Fowler, J.S.; Volkow, N.D.

    2010-06-01

    Human dopamine transporter gene (DAT1 or SLC6A3) has been associated with various brain-related diseases and behavioral traits and, as such, has been investigated intensely in experimental- and clinical-settings. However, the abundance of research data has not clarified the biological mechanism of DAT regulation; similarly, studies of DAT genotype-phenotype associations yielded inconsistent results. Hence, our understanding of the control of the DAT protein product is incomplete; having this knowledge is critical, since DAT plays the major role in the brain's dopaminergic circuitry. Accordingly, we reevaluated the genomic attributes of the SLC6A3 gene that might confer sensitivity to regulation, hypothesizing that its unique genomic characteristics might facilitate highly dynamic, region-specific DAT expression, so enabling multiple regulatory modes. Our comprehensive bioinformatic analyzes revealed very distinctive genomic characteristics of the SLC6A3, including high inter-individual variability of its sequence (897 SNPs, about 90 repeats and several CNVs spell out all abbreviations in abstract) and pronounced sensitivity to regulation by epigenetic mechanisms, as evident from the GC-bias composition (0.55) of the SLC6A3, and numerous intragenic CpG islands (27 CGIs). We propose that this unique combination of the genomic features and the regulatory attributes enables the differential expression of the DAT1 gene and fulfills seemingly contradictory demands to its regulation; that is, robustness of region-specific expression and functional dynamics.

  18. Genome characterization and genetic diversity of Sweet potato symptomless virus 1: a mastrevirus with an unusual nonanucleotide

    Science.gov (United States)

    Next generation sequencing of small interfering RNAs (siRNAs) revealed the presence of Sweet potato symptomless virus 1 (SPSMV-1), a recently described virus in the genus Mastrevirus of the family Geminiviridae, in both a diseased and a symptomless sweet potato plant. Its full-length genome of 2602 ...

  19. The nuclear genome of Rhazya stricta and the evolution of alkaloid diversity in a medically relevant clade of Apocynaceae.

    Science.gov (United States)

    Sabir, Jamal S M; Jansen, Robert K; Arasappan, Dhivya; Calderon, Virginie; Noutahi, Emmanuel; Zheng, Chunfang; Park, Seongjun; Sabir, Meshaal J; Baeshen, Mohammed N; Hajrah, Nahid H; Khiyami, Mohammad A; Baeshen, Nabih A; Obaid, Abdullah Y; Al-Malki, Abdulrahman L; Sankoff, David; El-Mabrouk, Nadia; Ruhlman, Tracey A

    2016-09-22

    Alkaloid accumulation in plants is activated in response to stress, is limited in distribution and specific alkaloid repertoires are variable across taxa. Rauvolfioideae (Apocynaceae, Gentianales) represents a major center of structural expansion in the monoterpenoid indole alkaloids (MIAs) yielding thousands of unique molecules including highly valuable chemotherapeutics. The paucity of genome-level data for Apocynaceae precludes a deeper understanding of MIA pathway evolution hindering the elucidation of remaining pathway enzymes and the improvement of MIA availability in planta or in vitro. We sequenced the nuclear genome of Rhazya stricta (Apocynaceae, Rauvolfioideae) and present this high quality assembly in comparison with that of coffee (Rubiaceae, Coffea canephora, Gentianales) and others to investigate the evolution of genome-scale features. The annotated Rhazya genome was used to develop the community resource, RhaCyc, a metabolic pathway database. Gene family trees were constructed to identify homologs of MIA pathway genes and to examine their evolutionary history. We found that, unlike Coffea, the Rhazya lineage has experienced many structural rearrangements. Gene tree analyses suggest recent, lineage-specific expansion and diversification among homologs encoding MIA pathway genes in Gentianales and provide candidate sequences with the potential to close gaps in characterized pathways and support prospecting for new MIA production avenues.

  20. Characterization of Vibrio parahaemolyticus clinical strains from Maryland (2012-2013 and comparisons to a locally and globally diverse V. parahaemolyticus strains by Whole-Genome Sequence Analysis

    Directory of Open Access Journals (Sweden)

    Julie eHaendiges

    2015-02-01

    Full Text Available Vibrio parahaemolyticus is the leading cause of foodborne illnesses in the US associated with the consumption of raw shellfish. Previous population studies of V. parahaemolyticus have used Multi-Locus Sequence Typing (MLST or Pulsed Field Gel Electrophoresis (PFGE. Whole genome sequencing (WGS provides a much higher level of resolution, but has been used to characterize only a few United States (US clinical isolates. Here we report the WGS characterization of 34 genomes of V. parahaemolyticus strains that were isolated from clinical cases in the state of Maryland (MD during two years (2012-2013. Among these MD isolates, 28% were negative for tdh and trh, 8% were tdh positive only, 11% were trh positive only, and 53% contained both genes. We compared this set of V. parahaemolyticus genomes to those of a collection of 17 archival strains from the US (10 previously sequenced strains and 7 from NCBI, collected between 1988 and 2004 and 15 international strains, isolated from geographically-diverse environmental and clinical sources (collected between 1980 and 2010. A WGS phylogenetic analysis of these strains revealed the regional outbreak strains from MD are highly diverse and yet genetically distinct from the international strains. Some of the MD strains caused outbreaks two years in a row, indicating a local source of contamination (e.g. ST631. Advances in WGS will enable this type of analysis to become routine, providing an excellent tool for improved surveillance. Databases built with phylogenetic data will help pinpoint sources of contamination in future outbreaks and contribute to faster outbreak control.

  1. Genome-wide discovery and validation of Eucalyptus small RNAs reveals variable patterns of conservation and diversity across species of Myrtaceae.

    Science.gov (United States)

    Pappas, Marília de Castro Rodrigues; Pappas, Georgios Joannis; Grattapaglia, Dario

    2015-12-29

    Micro RNAs are a class of small non coding RNAs of 20-24 nucleotides transcribed as single stranded precursors from MIR gene loci. Initially described as post-transcriptional regulators involved in development, two decades ago, miRNAs have been proven to regulate a wide range of processes in plants such as germination, morphology and responses to biotic and abiotic stress. Despite wide conservation in plants, a number of miRNAs are lineage specific. We describe the first genome wide survey of Eucalyptus miRNAs based on high throughput sequencing. In addition to discovering small RNA sequences, MIR loci were mapped onto the reference genome and interspecific variability investigated. Sequencing was carried out for the two most world widely planted species, E. grandis and E. globulus. To maximize discovery, E. grandis samples were from BRASUZ1, the same tree whose genome provided the reference sequence. Interspecific analysis reinforces the variability in small RNA repertoire even between closely related species. Characterization of Eucalyptus small RNA sequences showed 95 orthologous to conserved miRNAs and 193 novel miRNAs. In silico target prediction confirmed 163 novel miRNAs and degradome sequencing experimentally confirmed several hundred targets. Experimental evidence based on the exclusive expression of a set of small RNAs across 16 species within Myrtaceae further highlighted variable patterns of conservation and diversity of these regulatory elements. The description of miRNAs in Eucalyptus contributes to scientific knowledge of this vast genre, which is the most widely planted hardwood crop in the tropical and subtropical world, adding another important element to the annotation of Eucalyptus grandis reference genome.

  2. Whole-Genome Relationships among Francisella Bacteria of Diverse Origins Define New Species and Provide Specific Regions for Detection.

    Science.gov (United States)

    Challacombe, Jean F; Petersen, Jeannine M; Gallegos-Graves, La Verne; Hodge, David; Pillai, Segaran; Kuske, Cheryl R

    2017-02-01

    Francisella tularensis is a highly virulent zoonotic pathogen that causes tularemia and, because of weaponization efforts in past world wars, is considered a tier 1 biothreat agent. Detection and surveillance of F. tularensis may be confounded by the presence of uncharacterized, closely related organisms. Through DNA-based diagnostics and environmental surveys, novel clinical and environmental Francisella isolates have been obtained in recent years. Here we present 7 new Francisella genomes and a comparison of their characteristics to each other and to 24 publicly available genomes as well as a comparative analysis of 16S rRNA and sdhA genes from over 90 Francisella strains. Delineation of new species in bacteria is challenging, especially when isolates having very close genomic characteristics exhibit different physiological features-for example, when some are virulent pathogens in humans and animals while others are nonpathogenic or are opportunistic pathogens. Species resolution within Francisella varies with analyses of single genes, multiple gene or protein sets, or whole-genome comparisons of nucleic acid and amino acid sequences. Analyses focusing on single genes (16S rRNA, sdhA), multiple gene sets (virulence genes, lipopolysaccharide [LPS] biosynthesis genes, pathogenicity island), and whole-genome comparisons (nucleotide and protein) gave congruent results, but with different levels of discrimination confidence. We designate four new species within the genus; Francisella opportunistica sp. nov. (MA06-7296), Francisella salina sp. nov. (TX07-7308), Francisella uliginis sp. nov. (TX07-7310), and Francisella frigiditurris sp. nov. (CA97-1460). This study provides a robust comparative framework to discern species and virulence features of newly detected Francisella bacteria. DNA-based detection and sequencing methods have identified thousands of new bacteria in the human body and the environment. In most cases, there are no cultured isolates that correspond

  3. Mitochondrial genome diversity and population structure of the giant squid Architeuthis: genetics sheds new light on one of the most enigmatic marine species

    Science.gov (United States)

    Winkelmann, Inger; Campos, Paula F.; Strugnell, Jan; Cherel, Yves; Smith, Peter J.; Kubodera, Tsunemi; Allcock, Louise; Kampmann, Marie-Louise; Schroeder, Hannes; Guerra, Angel; Norman, Mark; Finn, Julian; Ingrao, Debra; Clarke, Malcolm; Gilbert, M. Thomas P.

    2013-01-01

    Despite its charismatic appeal to both scientists and the general public, remarkably little is known about the giant squid Architeuthis, one of the largest of the invertebrates. Although specimens of Architeuthis are becoming more readily available owing to the advancement of deep-sea fishing techniques, considerable controversy exists with regard to topics as varied as their taxonomy, biology and even behaviour. In this study, we have characterized the mitochondrial genome (mitogenome) diversity of 43 Architeuthis samples collected from across the range of the species, in order to use genetic information to provide new and otherwise difficult to obtain insights into the life of this animal. The results show no detectable phylogenetic structure at the mitochondrial level and, furthermore, that the level of nucleotide diversity is exceptionally low. These observations are consistent with the hypotheses that there is only one global species of giant squid, Architeuthis dux (Steenstrup, 1857), and that it is highly vagile, possibly dispersing through both a drifting paralarval stage and migration of larger individuals. Demographic history analyses of the genetic data suggest that there has been a recent population expansion or selective sweep, which may explain the low level of genetic diversity. PMID:23516246

  4. Genomic data reveal a loss of diversity in two species of tuco-tucos (genus Ctenomys) following a volcanic eruption.

    Science.gov (United States)

    Hsu, Jeremy L; Crawford, Jeremy Chase; Tammone, Mauro N; Ramakrishnan, Uma; Lacey, Eileen A; Hadly, Elizabeth A

    2017-11-24

    Marked reductions in population size can trigger corresponding declines in genetic variation. Understanding the precise genetic consequences of such reductions, however, is often challenging due to the absence of robust pre- and post-reduction datasets. Here, we use heterochronous genomic data from samples obtained before and immediately after the 2011 eruption of the Puyehue-Cordón Caulle volcanic complex in Patagonia to explore the genetic impacts of this event on two parapatric species of rodents, the colonial tuco-tuco (Ctenomys sociabilis) and the Patagonian tuco-tuco (C. haigi). Previous analyses using microsatellites revealed no post-eruption changes in genetic variation in C. haigi, but an unexpected increase in variation in C. sociabilis. To explore this outcome further, we used targeted gene capture to sequence over 2,000 putatively neutral regions for both species. Our data revealed that, contrary to the microsatellite analyses, the eruption was associated with a small but significant decrease in genetic variation in both species. We suggest that genome-level analyses provide greater power than traditional molecular markers to detect the genetic consequences of population size changes, particularly changes that are recent, short-term, or modest in size. Consequently, genomic analyses promise to generate important new insights into the effects of specific environmental events on demography and genetic variation.

  5. Extending Comprehensive Cancer Center Expertise in Clinical Cancer Genetics and Genomics to Diverse Communities: The power of partnership

    Science.gov (United States)

    MacDonald, Deborah J.; Blazer, Kathleen R.; Weitzel, Jeffrey N.

    2012-01-01

    Rapidly evolving genetic and genomic technologies for genetic cancer risk assessment (GCRA) are revolutionizing our approach to targeted therapy and cancer screening and prevention, heralding the era of personalized medicine. Although many academic medical centers provide GCRA services, most people receive their medical care in the community setting. Yet, few community clinicians have the knowledge or time needed to adequately select, apply and interpret genetic/genomic tests. This article describes alternative approaches to the delivery of GCRA services, profiling the City of Hope Cancer Screening & Prevention Program Network (CSPPN) academic and community-based health center partnership as a model for the delivery of the highest quality evidence-based GCRA services while promoting research participation in the community setting. Growth of the CSPPN was enabled by information technology, with videoconferencing for telemedicine and web conferencing for remote participation in interdisciplinary genetics tumor boards. Grant support facilitated the establishment of an underserved minority outreach clinic in the regional County hospital. Innovative clinician education, technology and collaboration are powerful tools to extend GCRA expertise from a NCI-designated Comprehensive Cancer Center, enabling diffusion of evidenced-base genetic/genomic information and best practice into the community setting. PMID:20495088

  6. Population genomic analysis reveals differential evolutionary histories and patterns of diversity across subgenomes and subpopulations of Brassica napus L.

    Science.gov (United States)

    Brassica napus (L.) is a crop of major economic importance that produces canola oil (seed), vegetables, fodder and animal meal. Characterizing the genetic diversity present in the extant germplasm pool of B. napus is fundamental to better conserve, manage and utilize the genetic resources of this s...

  7. Diversity and strain specificity of plant cell wall degrading enzymes revealed by the draft genome of Ruminococcus flavefaciens FD-1.

    Science.gov (United States)

    Berg Miller, Margret E; Antonopoulos, Dionysios A; Rincon, Marco T; Band, Mark; Bari, Albert; Akraiko, Tatsiana; Hernandez, Alvaro; Thimmapuram, Jyothi; Henrissat, Bernard; Coutinho, Pedro M; Borovok, Ilya; Jindou, Sadanari; Lamed, Raphael; Flint, Harry J; Bayer, Edward A; White, Bryan A

    2009-08-14

    Ruminococcus flavefaciens is a predominant cellulolytic rumen bacterium, which forms a multi-enzyme cellulosome complex that could play an integral role in the ability of this bacterium to degrade plant cell