WorldWideScience

Sample records for sequence diversity coverage

  1. Increased sequence diversity coverage improves detection of HIV-Specific T cell responses

    DEFF Research Database (Denmark)

    Frahm, N.; Kaufmann, D.E.; Yusim, K.

    2007-01-01

    The accurate identification of HIV-specific T cell responses is important for determining the relationship between immune response, viral control, and disease progression. HIV-specific immune responses are usually measured using peptide sets based on consensus sequences, which frequently miss res...

  2. Aspects of coverage in medical DNA sequencing

    Directory of Open Access Journals (Sweden)

    Wilson Richard K

    2008-05-01

    Full Text Available Abstract Background DNA sequencing is now emerging as an important component in biomedical studies of diseases like cancer. Short-read, highly parallel sequencing instruments are expected to be used heavily for such projects, but many design specifications have yet to be conclusively established. Perhaps the most fundamental of these is the redundancy required to detect sequence variations, which bears directly upon genomic coverage and the consequent resolving power for discerning somatic mutations. Results We address the medical sequencing coverage problem via an extension of the standard mathematical theory of haploid coverage. The expected diploid multi-fold coverage, as well as its generalization for aneuploidy are derived and these expressions can be readily evaluated for any project. The resulting theory is used as a scaling law to calibrate performance to that of standard BAC sequencing at 8× to 10× redundancy, i.e. for expected coverages that exceed 99% of the unique sequence. A differential strategy is formalized for tumor/normal studies wherein tumor samples are sequenced more deeply than normal ones. In particular, both tumor alleles should be detected at least twice, while both normal alleles are detected at least once. Our theory predicts these requirements can be met for tumor and normal redundancies of approximately 26× and 21×, respectively. We explain why these values do not differ by a factor of 2, as might intuitively be expected. Future technology developments should prompt even deeper sequencing of tumors, but the 21× value for normal samples is essentially a constant. Conclusion Given the assumptions of standard coverage theory, our model gives pragmatic estimates for required redundancy. The differential strategy should be an efficient means of identifying potential somatic mutations for further study.

  3. Diverse Politics, Diverse News Coverage? A Longitudinal Study of Diversity of Dutch Political News During Two Decades of Election Campaigns

    NARCIS (Netherlands)

    van Hoof, A.M.J.; Jacobi, C.; Ruigrok, N.

    2014-01-01

    Although diverse political news has been recognized a requirement for a well functioning democracy, longitudinal research into this topic is sparse. In this paper, we analyze the development of diversity in election coverage in the Netherlands between 1994 and 2012. We distinguish between diversity

  4. Diverse politics, diverse news coverage? A longitudinal study of diversity in Dutch political news during two decades of election campaigns

    NARCIS (Netherlands)

    van Hoof, A.M.J.; Jacobi, C.; Ruigrok, N.; van Atteveldt, W.H.

    2014-01-01

    Although diverse political news has been recognized as a requirement for a well-functioning democracy, longitudinal research into this topic is sparse. In this article, we analyse the development of diversity in election coverage in the Netherlands between 1994 and 2012. We distinguish between

  5. Building a model: developing genomic resources for common milkweed (Asclepias syriaca with low coverage genome sequencing

    Directory of Open Access Journals (Sweden)

    Weitemier Kevin

    2011-05-01

    Full Text Available Abstract Background Milkweeds (Asclepias L. have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L. could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. Results A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp and 5S rDNA (120 bp sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp, with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae unigenes (median coverage of 0.29× and 66% of single copy orthologs (COSII in asterids (median coverage of 0.14×. From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites and phylogenetics (low-copy nuclear genes studies were developed. Conclusions The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species

  6. Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing.

    Science.gov (United States)

    Straub, Shannon C K; Fishbein, Mark; Livshultz, Tatyana; Foster, Zachary; Parks, Matthew; Weitemier, Kevin; Cronn, Richard C; Liston, Aaron

    2011-05-04

    Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species and its relatives. This study represents a first

  7. Expanding Kenya's protected areas under the Convention on Biological Diversity to maximize coverage of plant diversity.

    Science.gov (United States)

    Scherer, Laura; Curran, Michael; Alvarez, Miguel

    2017-04-01

    Biodiversity is highly valuable and critically threatened by anthropogenic degradation of the natural environment. In response, governments have pledged enhanced protected-area coverage, which requires scarce biological data to identify conservation priorities. To assist this effort, we mapped conservation priorities in Kenya based on maximizing alpha (species richness) and beta diversity (species turnover) of plant communities while minimizing economic costs. We used plant-cover percentages from vegetation surveys of over 2000 plots to build separate models for each type of diversity. Opportunity and management costs were based on literature data and interviews with conservation organizations. Species richness was predicted to be highest in a belt from Lake Turkana through Mount Kenya and in a belt parallel to the coast, and species turnover was predicted to be highest in western Kenya and along the coast. Our results suggest the expanding reserve network should focus on the coast and northeastern provinces of Kenya, where new biological surveys would also fill biological data gaps. Meeting the Convention on Biological Diversity target of 17% terrestrial coverage by 2020 would increase representation of Kenya's plant communities by 75%. However, this would require about 50 times more funds than Kenya has received thus far from the Global Environment Facility. © 2016 Society for Conservation Biology.

  8. Analysis of transposable elements in the genome of Asparagus officinalis from high coverage sequence data.

    Science.gov (United States)

    Li, Shu-Fen; Gao, Wu-Jun; Zhao, Xin-Peng; Dong, Tian-Yu; Deng, Chuan-Liang; Lu, Long-Dou

    2014-01-01

    Asparagus officinalis is an economically and nutritionally important vegetable crop that is widely cultivated and is used as a model dioecious species to study plant sex determination and sex chromosome evolution. To improve our understanding of its genome composition, especially with respect to transposable elements (TEs), which make up the majority of the genome, we performed Illumina HiSeq2000 sequencing of both male and female asparagus genomes followed by bioinformatics analysis. We generated 17 Gb of sequence (12×coverage) and assembled them into 163,406 scaffolds with a total cumulated length of 400 Mbp, which represent about 30% of asparagus genome. Overall, TEs masked about 53% of the A. officinalis assembly. Majority of the identified TEs belonged to LTR retrotransposons, which constitute about 28% of genomic DNA, with Ty1/copia elements being more diverse and accumulated to higher copy numbers than Ty3/gypsy. Compared with LTR retrotransposons, non-LTR retrotransposons and DNA transposons were relatively rare. In addition, comparison of the abundance of the TE groups between male and female genomes showed that the overall TE composition was highly similar, with only slight differences in the abundance of several TE groups, which is consistent with the relatively recent origin of asparagus sex chromosomes. This study greatly improves our knowledge of the repetitive sequence construction of asparagus, which facilitates the identification of TEs responsible for the early evolution of plant sex chromosomes and is helpful for further studies on this dioecious plant.

  9. A Poisson hierarchical modelling approach to detecting copy number variation in sequence coverage data

    OpenAIRE

    Sep?lveda, Nuno; Campino, Susana G; Assefa, Samuel A; Sutherland, Colin J; Pain5, Arnab; Clark, Taane G

    2013-01-01

    BACKGROUND: The advent of next generation sequencing technology has accelerated efforts to map and catalogue copy number variation (CNV) in genomes of important micro-organisms for public health. A typical analysis of the sequence data involves mapping reads onto a reference genome, calculating the respective coverage, and detecting regions with too-low or too-high coverage (deletions and amplifications, respectively). Current CNV detection methods rely on statistical assumptions (e.g., a Poi...

  10. Dynamics of domain coverage of the protein sequence universe

    Science.gov (United States)

    2012-01-01

    Background The currently known protein sequence space consists of millions of sequences in public databases and is rapidly expanding. Assigning sequences to families leads to a better understanding of protein function and the nature of the protein universe. However, a large portion of the current protein space remains unassigned and is referred to as its “dark matter”. Results Here we suggest that true size of “dark matter” is much larger than stated by current definitions. We propose an approach to reducing the size of “dark matter” by identifying and subtracting regions in protein sequences that are not likely to contain any domain. Conclusions Recent improvements in computational domain modeling result in a decrease, albeit slowly, in the relative size of “dark matter”; however, its absolute size increases substantially with the growth of sequence data. PMID:23157439

  11. Dynamics of domain coverage of the protein sequence universe

    Directory of Open Access Journals (Sweden)

    Rekapalli Bhanu

    2012-11-01

    Full Text Available Abstract Background The currently known protein sequence space consists of millions of sequences in public databases and is rapidly expanding. Assigning sequences to families leads to a better understanding of protein function and the nature of the protein universe. However, a large portion of the current protein space remains unassigned and is referred to as its “dark matter”. Results Here we suggest that true size of “dark matter” is much larger than stated by current definitions. We propose an approach to reducing the size of “dark matter” by identifying and subtracting regions in protein sequences that are not likely to contain any domain. Conclusions Recent improvements in computational domain modeling result in a decrease, albeit slowly, in the relative size of “dark matter”; however, its absolute size increases substantially with the growth of sequence data.

  12. Pigs in sequence space: A 0.66X coverage pig genome survey based on shotgun sequencing

    DEFF Research Database (Denmark)

    Wernersson, Rasmus; Schierup, M.H.; Jorgensen, F.G.

    2005-01-01

    sequences (0.66X coverage) from the pig genome. The data are hereby released (NCBI Trace repository with center name "SDJVP", and project name "Sino-Danish Pig Genome Project") together with an initial evolutionary analysis. The non-repetitive fraction of the sequences was aligned to the UCSC human...

  13. Coverage and Rate of Downlink Sequence Transmissions with Reliability Guarantees

    DEFF Research Database (Denmark)

    Park, Jihong; Popovski, Petar

    2017-01-01

    Real-time distributed control is a promising application of 5G in which communication links should satisfy certain reliability guarantees. In this letter, we derive closed-form maximum average rate when a device (e.g. industrial machine) downloads a sequence of n operational commands through cell...

  14. Diversity and coverage of seagrass ecosystems in south-west ...

    African Journals Online (AJOL)

    We discuss the natural and anthropogenic factors that may account for the observed low diversity of seagrasses in southern Velondriake, including overfishing, beach-seining, cyclones, siltation and mangrove deforestation. Based on these baseline surveys, as well as discussions with local communities, it is recommended ...

  15. Single-base resolution and long-coverage sequencing based on single-molecule nanomanipulation

    International Nuclear Information System (INIS)

    An Hongjie; Huang Jiehuan; Lue Ming; Li Xueling; Lue Junhong; Li Haikuo; Zhang Yi; Li Minqian; Hu Jun

    2007-01-01

    We show new approaches towards a novel single-molecule sequencing strategy which consists of high-resolution positioning isolation of overlapping DNA fragments with atomic force microscopy (AFM), subsequent single-molecule PCR amplification and conventional Sanger sequencing. In this study, a DNA labelling technique was used to guarantee the accuracy in positioning the target DNA. Single-molecule multiplex PCR was carried out to test the contamination. The results showed that the two overlapping DNA fragments isolated by AFM could be successfully sequenced with high quality and perfect contiguity, indicating that single-base resolution and long-coverage sequencing have been achieved simultaneously

  16. Progress towards the Conventionon Biological Diversity terrestrial2010 and marine 2012 targets forprotected area coverage

    DEFF Research Database (Denmark)

    Coad, Lauren; Burgess, Neil David; Fish, Lucy

    2010-01-01

    coverage targets. National protected areas data from the WDPA have been used to measure progress in protected areas coverage at global, regional and national scale. The mean protected area coverage per nation was 12.2% for terrestrial area, and only 5.1% for near-shore marine area. Variation in protected......Protected area coverage targets set by the Convention on Biological Diversity (CBD) for both terrestrial and marine environments provide a major incentive for governments to review and upgrade their protected area systems. Assessing progress towards these targets will form an important component...... of the work of the Xth CBD Conference of Parties meeting to be held in Japan in 2010. The World Database on Protected Areas (WDPA) is the largest assembly of data on the world's terrestrial and marine protected areas and, as such, represents a fundamental tool in tracking progress towards protected area...

  17. Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes

    DEFF Research Database (Denmark)

    Albertsen, Mads; Hugenholtz, Philip; Skarshewski, Adam

    2013-01-01

    Reference genomes are required to understand the diverse roles of microorganisms in ecology, evolution, human and animal health, but most species remain uncultured. Here we present a sequence composition–independent approach to recover high-quality microbial genomes from deeply sequenced metageno......Reference genomes are required to understand the diverse roles of microorganisms in ecology, evolution, human and animal health, but most species remain uncultured. Here we present a sequence composition–independent approach to recover high-quality microbial genomes from deeply sequenced...

  18. Accurate and High-Coverage Immune Repertoire Sequencing Reveals Characteristics of Antibody Repertoire Diversification in Young Children with Malaria

    Science.gov (United States)

    Jiang, Ning

    Accurately measuring the immune repertoire sequence composition, diversity, and abundance is important in studying repertoire response in infections, vaccinations, and cancer immunology. Using molecular identifiers (MIDs) to tag mRNA molecules is an effective method in improving the accuracy of immune repertoire sequencing (IR-seq). However, it is still difficult to use IR-seq on small amount of clinical samples to achieve a high coverage of the repertoire diversities. This is especially challenging in studying infections and vaccinations where B cell subpopulations with fewer cells, such as memory B cells or plasmablasts, are often of great interest to study somatic mutation patterns and diversity changes. Here, we describe an approach of IR-seq based on the use of MIDs in combination with a clustering method that can reveal more than 80% of the antibody diversity in a sample and can be applied to as few as 1,000 B cells. We applied this to study the antibody repertoires of young children before and during an acute malaria infection. We discovered unexpectedly high levels of somatic hypermutation (SHM) in infants and revealed characteristics of antibody repertoire development in young children that would have a profound impact on immunization in children.

  19. Progress in strategies for sequence diversity library creation for ...

    African Journals Online (AJOL)

    As the simplest technique of protein engineering, directed evolution has been ... An experiment of directed evolution comprises mutant libraries creation and ... evolution, sequence diversity creation, novel strategy, computational design, ...

  20. A Poisson hierarchical modelling approach to detecting copy number variation in sequence coverage data

    KAUST Repository

    Sepú lveda, Nuno; Campino, Susana G; Assefa, Samuel A; Sutherland, Colin J; Pain, Arnab; Clark, Taane G

    2013-01-01

    Background: The advent of next generation sequencing technology has accelerated efforts to map and catalogue copy number variation (CNV) in genomes of important micro-organisms for public health. A typical analysis of the sequence data involves mapping reads onto a reference genome, calculating the respective coverage, and detecting regions with too-low or too-high coverage (deletions and amplifications, respectively). Current CNV detection methods rely on statistical assumptions (e.g., a Poisson model) that may not hold in general, or require fine-tuning the underlying algorithms to detect known hits. We propose a new CNV detection methodology based on two Poisson hierarchical models, the Poisson-Gamma and Poisson-Lognormal, with the advantage of being sufficiently flexible to describe different data patterns, whilst robust against deviations from the often assumed Poisson model.Results: Using sequence coverage data of 7 Plasmodium falciparum malaria genomes (3D7 reference strain, HB3, DD2, 7G8, GB4, OX005, and OX006), we showed that empirical coverage distributions are intrinsically asymmetric and overdispersed in relation to the Poisson model. We also demonstrated a low baseline false positive rate for the proposed methodology using 3D7 resequencing data and simulation. When applied to the non-reference isolate data, our approach detected known CNV hits, including an amplification of the PfMDR1 locus in DD2 and a large deletion in the CLAG3.2 gene in GB4, and putative novel CNV regions. When compared to the recently available FREEC and cn.MOPS approaches, our findings were more concordant with putative hits from the highest quality array data for the 7G8 and GB4 isolates.Conclusions: In summary, the proposed methodology brings an increase in flexibility, robustness, accuracy and statistical rigour to CNV detection using sequence coverage data. 2013 Seplveda et al.; licensee BioMed Central Ltd.

  1. A Poisson hierarchical modelling approach to detecting copy number variation in sequence coverage data.

    Science.gov (United States)

    Sepúlveda, Nuno; Campino, Susana G; Assefa, Samuel A; Sutherland, Colin J; Pain, Arnab; Clark, Taane G

    2013-02-26

    The advent of next generation sequencing technology has accelerated efforts to map and catalogue copy number variation (CNV) in genomes of important micro-organisms for public health. A typical analysis of the sequence data involves mapping reads onto a reference genome, calculating the respective coverage, and detecting regions with too-low or too-high coverage (deletions and amplifications, respectively). Current CNV detection methods rely on statistical assumptions (e.g., a Poisson model) that may not hold in general, or require fine-tuning the underlying algorithms to detect known hits. We propose a new CNV detection methodology based on two Poisson hierarchical models, the Poisson-Gamma and Poisson-Lognormal, with the advantage of being sufficiently flexible to describe different data patterns, whilst robust against deviations from the often assumed Poisson model. Using sequence coverage data of 7 Plasmodium falciparum malaria genomes (3D7 reference strain, HB3, DD2, 7G8, GB4, OX005, and OX006), we showed that empirical coverage distributions are intrinsically asymmetric and overdispersed in relation to the Poisson model. We also demonstrated a low baseline false positive rate for the proposed methodology using 3D7 resequencing data and simulation. When applied to the non-reference isolate data, our approach detected known CNV hits, including an amplification of the PfMDR1 locus in DD2 and a large deletion in the CLAG3.2 gene in GB4, and putative novel CNV regions. When compared to the recently available FREEC and cn.MOPS approaches, our findings were more concordant with putative hits from the highest quality array data for the 7G8 and GB4 isolates. In summary, the proposed methodology brings an increase in flexibility, robustness, accuracy and statistical rigour to CNV detection using sequence coverage data.

  2. A Poisson hierarchical modelling approach to detecting copy number variation in sequence coverage data

    KAUST Repository

    Sepúlveda, Nuno

    2013-02-26

    Background: The advent of next generation sequencing technology has accelerated efforts to map and catalogue copy number variation (CNV) in genomes of important micro-organisms for public health. A typical analysis of the sequence data involves mapping reads onto a reference genome, calculating the respective coverage, and detecting regions with too-low or too-high coverage (deletions and amplifications, respectively). Current CNV detection methods rely on statistical assumptions (e.g., a Poisson model) that may not hold in general, or require fine-tuning the underlying algorithms to detect known hits. We propose a new CNV detection methodology based on two Poisson hierarchical models, the Poisson-Gamma and Poisson-Lognormal, with the advantage of being sufficiently flexible to describe different data patterns, whilst robust against deviations from the often assumed Poisson model.Results: Using sequence coverage data of 7 Plasmodium falciparum malaria genomes (3D7 reference strain, HB3, DD2, 7G8, GB4, OX005, and OX006), we showed that empirical coverage distributions are intrinsically asymmetric and overdispersed in relation to the Poisson model. We also demonstrated a low baseline false positive rate for the proposed methodology using 3D7 resequencing data and simulation. When applied to the non-reference isolate data, our approach detected known CNV hits, including an amplification of the PfMDR1 locus in DD2 and a large deletion in the CLAG3.2 gene in GB4, and putative novel CNV regions. When compared to the recently available FREEC and cn.MOPS approaches, our findings were more concordant with putative hits from the highest quality array data for the 7G8 and GB4 isolates.Conclusions: In summary, the proposed methodology brings an increase in flexibility, robustness, accuracy and statistical rigour to CNV detection using sequence coverage data. 2013 Seplveda et al.; licensee BioMed Central Ltd.

  3. Genomic Selection Using Genotyping-By-Sequencing Data with Different Coverage Depth in Perennial Ryegrass

    DEFF Research Database (Denmark)

    Cericola, Fabio; Fé, Dario; Janss, Luc

    2015-01-01

    the diagonal elements by estimating the amount of genetic variance caused by the reduction of the coverage depth. Secondly we developed a method to scale the relationship matrix by taking into account the overall amount of pairwise non-missing loci between all families. Rust resistance and heading date were......Genotyping by sequencing (GBS) allows generating up to millions of molecular markers with a cost per sample which is proportional to the level of multiplexing. Increasing the sample multiplexing decreases the genotyping price but also reduces the numbers of reads per marker. In this work we...... investigated how this reduction of the coverage depth affects the genomic relationship matrices used to estimated breeding value of F2 family pools in perennial ryegrass. A total of 995 families were genotyped via GBS providing more than 1.8M allele frequency estimates for each family with an average coverage...

  4. Determination of genetic relatedness from low-coverage human genome sequences using pedigree simulations.

    Science.gov (United States)

    Martin, Michael D; Jay, Flora; Castellano, Sergi; Slatkin, Montgomery

    2017-08-01

    We develop and evaluate methods for inferring relatedness among individuals from low-coverage DNA sequences of their genomes, with particular emphasis on sequences obtained from fossil remains. We suggest the major factors complicating the determination of relatedness among ancient individuals are sequencing depth, the number of overlapping sites, the sequencing error rate and the presence of contamination from present-day genetic sources. We develop a theoretical model that facilitates the exploration of these factors and their relative effects, via measurement of pairwise genetic distances, without calling genotypes, and determine the power to infer relatedness under various scenarios of varying sequencing depth, present-day contamination and sequencing error. The model is validated by a simulation study as well as the analysis of aligned sequences from present-day human genomes. We then apply the method to the recently published genome sequences of ancient Europeans, developing a statistical treatment to determine confidence in assigned relatedness that is, in some cases, more precise than previously reported. As the majority of ancient specimens are from animals, this method would be applicable to investigate kinship in nonhuman remains. The developed software grups (Genetic Relatedness Using Pedigree Simulations) is implemented in Python and freely available. © 2017 John Wiley & Sons Ltd.

  5. Pigs in sequence space: A 0.66X coverage pig genome survey based on shotgun sequencing

    Directory of Open Access Journals (Sweden)

    Li Wei

    2005-05-01

    Full Text Available Abstract Background Comparative whole genome analysis of Mammalia can benefit from the addition of more species. The pig is an obvious choice due to its economic and medical importance as well as its evolutionary position in the artiodactyls. Results We have generated ~3.84 million shotgun sequences (0.66X coverage from the pig genome. The data are hereby released (NCBI Trace repository with center name "SDJVP", and project name "Sino-Danish Pig Genome Project" together with an initial evolutionary analysis. The non-repetitive fraction of the sequences was aligned to the UCSC human-mouse alignment and the resulting three-species alignments were annotated using the human genome annotation. Ultra-conserved elements and miRNAs were identified. The results show that for each of these types of orthologous data, pig is much closer to human than mouse is. Purifying selection has been more efficient in pig compared to human, but not as efficient as in mouse, and pig seems to have an isochore structure most similar to the structure in human. Conclusion The addition of the pig to the set of species sequenced at low coverage adds to the understanding of selective pressures that have acted on the human genome by bisecting the evolutionary branch between human and mouse with the mouse branch being approximately 3 times as long as the human branch. Additionally, the joint alignment of the shot-gun sequences to the human-mouse alignment offers the investigator a rapid way to defining specific regions for analysis and resequencing.

  6. Advancing Eucalyptus genomics: identification and sequencing of lignin biosynthesis genes from deep-coverage BAC libraries

    Directory of Open Access Journals (Sweden)

    Kudrna David

    2011-03-01

    Full Text Available Abstract Background Eucalyptus species are among the most planted hardwoods in the world because of their rapid growth, adaptability and valuable wood properties. The development and integration of genomic resources into breeding practice will be increasingly important in the decades to come. Bacterial artificial chromosome (BAC libraries are key genomic tools that enable positional cloning of important traits, synteny evaluation, and the development of genome framework physical maps for genetic linkage and genome sequencing. Results We describe the construction and characterization of two deep-coverage BAC libraries EG_Ba and EG_Bb obtained from nuclear DNA fragments of E. grandis (clone BRASUZ1 digested with HindIII and BstYI, respectively. Genome coverages of 17 and 15 haploid genome equivalents were estimated for EG_Ba and EG_Bb, respectively. Both libraries contained large inserts, with average sizes ranging from 135 Kb (Eg_Bb to 157 Kb (Eg_Ba, very low extra-nuclear genome contamination providing a probability of finding a single copy gene ≥ 99.99%. Libraries were screened for the presence of several genes of interest via hybridizations to high-density BAC filters followed by PCR validation. Five selected BAC clones were sequenced and assembled using the Roche GS FLX technology providing the whole sequence of the E. grandis chloroplast genome, and complete genomic sequences of important lignin biosynthesis genes. Conclusions The two E. grandis BAC libraries described in this study represent an important milestone for the advancement of Eucalyptus genomics and forest tree research. These BAC resources have a highly redundant genome coverage (> 15×, contain large average inserts and have a very low percentage of clones with organellar DNA or empty vectors. These publicly available BAC libraries are thus suitable for a broad range of applications in genetic and genomic research in Eucalyptus and possibly in related species of Myrtaceae

  7. AD-LIBS: inferring ancestry across hybrid genomes using low-coverage sequence data.

    Science.gov (United States)

    Schaefer, Nathan K; Shapiro, Beth; Green, Richard E

    2017-04-04

    Inferring the ancestry of each region of admixed individuals' genomes is useful in studies ranging from disease gene mapping to speciation genetics. Current methods require high-coverage genotype data and phased reference panels, and are therefore inappropriate for many data sets. We present a software application, AD-LIBS, that uses a hidden Markov model to infer ancestry across hybrid genomes without requiring variant calling or phasing. This approach is useful for non-model organisms and in cases of low-coverage data, such as ancient DNA. We demonstrate the utility of AD-LIBS with synthetic data. We then use AD-LIBS to infer ancestry in two published data sets: European human genomes with Neanderthal ancestry and brown bear genomes with polar bear ancestry. AD-LIBS correctly infers 87-91% of ancestry in simulations and produces ancestry maps that agree with published results and global ancestry estimates in humans. In brown bears, we find more polar bear ancestry than has been published previously, using both AD-LIBS and an existing software application for local ancestry inference, HAPMIX. We validate AD-LIBS polar bear ancestry maps by recovering a geographic signal within bears that mirrors what is seen in SNP data. Finally, we demonstrate that AD-LIBS is more effective than HAPMIX at inferring ancestry when preexisting phased reference data are unavailable and genomes are sequenced to low coverage. AD-LIBS is an effective tool for ancestry inference that can be used even when few individuals are available for comparison or when genomes are sequenced to low coverage. AD-LIBS is therefore likely to be useful in studies of non-model or ancient organisms that lack large amounts of genomic DNA. AD-LIBS can therefore expand the range of studies in which admixture mapping is a viable tool.

  8. Next generation sequencing reveals the hidden diversity of zooplankton assemblages.

    Directory of Open Access Journals (Sweden)

    Penelope K Lindeque

    Full Text Available BACKGROUND: Zooplankton play an important role in our oceans, in biogeochemical cycling and providing a food source for commercially important fish larvae. However, difficulties in correctly identifying zooplankton hinder our understanding of their roles in marine ecosystem functioning, and can prevent detection of long term changes in their community structure. The advent of massively parallel next generation sequencing technology allows DNA sequence data to be recovered directly from whole community samples. Here we assess the ability of such sequencing to quantify richness and diversity of a mixed zooplankton assemblage from a productive time series site in the Western English Channel. METHODOLOGY/PRINCIPLE FINDINGS: Plankton net hauls (200 µm were taken at the Western Channel Observatory station L4 in September 2010 and January 2011. These samples were analysed by microscopy and metagenetic analysis of the 18S nuclear small subunit ribosomal RNA gene using the 454 pyrosequencing platform. Following quality control a total of 419,041 sequences were obtained for all samples. The sequences clustered into 205 operational taxonomic units using a 97% similarity cut-off. Allocation of taxonomy by comparison with the National Centre for Biotechnology Information database identified 135 OTUs to species level, 11 to genus level and 1 to order, <2.5% of sequences were classified as unknowns. By comparison a skilled microscopic analyst was able to routinely enumerate only 58 taxonomic groups. CONCLUSIONS: Metagenetics reveals a previously hidden taxonomic richness, especially for Copepoda and hard-to-identify meroplankton such as Bivalvia, Gastropoda and Polychaeta. It also reveals rare species and parasites. We conclude that Next Generation Sequencing of 18S amplicons is a powerful tool for elucidating the true diversity and species richness of zooplankton communities. While this approach allows for broad diversity assessments of plankton it may

  9. Preliminary Genomic Characterization of Ten Hardwood Tree Species from Multiplexed Low Coverage Whole Genome Sequencing.

    Directory of Open Access Journals (Sweden)

    Margaret Staton

    Full Text Available Forest health issues are on the rise in the United States, resulting from introduction of alien pests and diseases, coupled with abiotic stresses related to climate change. Increasingly, forest scientists are finding genetic/genomic resources valuable in addressing forest health issues. For a set of ten ecologically and economically important native hardwood tree species representing a broad phylogenetic spectrum, we used low coverage whole genome sequencing from multiplex Illumina paired ends to economically profile their genomic content. For six species, the genome content was further analyzed by flow cytometry in order to determine the nuclear genome size. Sequencing yielded a depth of 0.8X to 7.5X, from which in silico analysis yielded preliminary estimates of gene and repetitive sequence content in the genome for each species. Thousands of genomic SSRs were identified, with a clear predisposition toward dinucleotide repeats and AT-rich repeat motifs. Flanking primers were designed for SSR loci for all ten species, ranging from 891 loci in sugar maple to 18,167 in redbay. In summary, we have demonstrated that useful preliminary genome information including repeat content, gene content and useful SSR markers can be obtained at low cost and time input from a single lane of Illumina multiplex sequence.

  10. Sequence diversity and evolution of antimicrobial peptides in invertebrates.

    Science.gov (United States)

    Tassanakajon, Anchalee; Somboonwiwat, Kunlaya; Amparyup, Piti

    2015-02-01

    Antimicrobial peptides (AMPs) are evolutionarily ancient molecules that act as the key components in the invertebrate innate immunity against invading pathogens. Several AMPs have been identified and characterized in invertebrates, and found to display considerable diversity in their amino acid sequence, structure and biological activity. AMP genes appear to have rapidly evolved, which might have arisen from the co-evolutionary arms race between host and pathogens, and enabled organisms to survive in different microbial environments. Here, the sequence diversity of invertebrate AMPs (defensins, cecropins, crustins and anti-lipopolysaccharide factors) are presented to provide a better understanding of the evolution pattern of these peptides that play a major role in host defense mechanisms. Copyright © 2014 Elsevier Ltd. All rights reserved.

  11. Alternative Enzymes Lead to Improvements in Sequence Coverage and PTM Analysis

    Science.gov (United States)

    Hooper, Kyle; Rosenblatt, Michael; Urh, Marjeta; Saveliev, Sergei; Hosfield, Chris; Kobs, Gary; Ford, Michael; Jones, Richard; Amunugama, Ravi; Allen, David; Brazas, Robert

    2013-01-01

    The profiling of proteins using biological mass spectrometry (bottom up proteomics) most commonly requires trypsin. Trypsin is advantageous in that it produces peptides of optimal charge and size. However, for applications in which the proteins under investigation are part of a complex mixture or not isolated at high levels (i.e. low ng from an immunoprecipitation), sequence coverage is rarely complete. In addition, we have found that in several cases, like phosphorylation, acetylation, and methylation, alternative proteases are required to prepare peptides suitable for MS detection. This poster will provide specific examples which demonstrate this observation. For example, the application of a combined Trypsin/ Lys-C mixture reduces the number of missed cleavages by more than 3-fold producing samples with lower CV's (for biological replicates). The mixture is also well-suited for the complete proteolysis of hydrophobic, compact proteins. The addition of chymotrypsin and elastase has been found to be useful for identifying phosphorylation sites on proteins, especially on sequences where the site of phosphorylation inhibits trypsin (i.e. proximal to K or R). Many epigenetic applications have focused on histone modifications, like lysine acetylation and arginine methylation. Alternative proteases like Asp-N, Glu-C, and chymotrypsin have been especially useful given the fact that the modified K and R residues are resistant to c-terminal cleavage by trypsin. Finally, in the case of serum profiling, the addition of the endoglycosidase, PNGase F has been found to improve sequence coverage due to the removal of N-linked glycans.

  12. Characterisation of the genetic diversity of Brucella by multilocus sequencing

    Directory of Open Access Journals (Sweden)

    MacMillan Alastair P

    2007-04-01

    Full Text Available Abstract Background Brucella species include economically important zoonotic pathogens that can infect a wide range of animals. There are currently six classically recognised species of Brucella although, as yet unnamed, isolates from various marine mammal species have been reported. In order to investigate genetic relationships within the group and identify potential diagnostic markers we have sequenced multiple genetic loci from a large sample of Brucella isolates representing the known diversity of the genus. Results Nine discrete genomic loci corresponding to 4,396 bp of sequence were examined from 160 Brucella isolates. By assigning each distinct allele at a locus an arbitrary numerical designation the population was found to represent 27 distinct sequence types (STs. Diversity at each locus ranged from 1.03–2.45% while overall genetic diversity equated to 1.5%. Most loci examined represent housekeeping gene loci and, in all but one case, the ratio of non-synonymous to synonymous change was substantially Brucella species, B. abortus, B. melitensis, B. ovis and B. neotomae correspond to well-separated clusters. With the exception of biovar 5, B. suis isolates cluster together, although they form a more diverse group than other classical species with a number of distinct STs corresponding to the remaining four biovars. B. canis isolates are located on the same branch very closely related to, but distinguishable from, B. suis biovar 3 and 4 isolates. Marine mammal isolates represent a distinct, though rather weakly supported, cluster within which individual STs display one of three clear host preferences. Conclusion The sequence database provides a powerful dataset for addressing ongoing controversies in Brucella taxonomy and a tool for unambiguously placing atypical, phenotypically discordant or newly emerging Brucella isolates. Furthermore, by using the phylogenetic backbone described here, robust and rationally selected markers for use in

  13. CoverageAnalyzer (CAn: A Tool for Inspection of Modification Signatures in RNA Sequencing Profiles

    Directory of Open Access Journals (Sweden)

    Ralf Hauenschild

    2016-11-01

    Full Text Available Combination of reverse transcription (RT and deep sequencing has emerged as a powerful instrument for the detection of RNA modifications, a field that has seen a recent surge in activity because of its importance in gene regulation. Recent studies yielded high-resolution RT signatures of modified ribonucleotides relying on both sequence-dependent mismatch patterns and reverse transcription arrests. Common alignment viewers lack specialized functionality, such as filtering, tailored visualization, image export and differential analysis. Consequently, the community will profit from a platform seamlessly connecting detailed visual inspection of RT signatures and automated screening for modification candidates. CoverageAnalyzer (CAn was developed in response to the demand for a powerful inspection tool. It is freely available for all three main operating systems. With SAM file format as standard input, CAn is an intuitive and user-friendly tool that is generally applicable to the large community of biomedical users, starting from simple visualization of RNA sequencing (RNA-Seq data, up to sophisticated modification analysis with significance-based modification candidate calling.

  14. Multilocus sequence analysis of Treponema denticola strains of diverse origin

    Directory of Open Access Journals (Sweden)

    Mo Sisu

    2013-02-01

    Full Text Available Abstract Background The oral spirochete bacterium Treponema denticola is associated with both the incidence and severity of periodontal disease. Although the biological or phenotypic properties of a significant number of T. denticola isolates have been reported in the literature, their genetic diversity or phylogeny has never been systematically investigated. Here, we describe a multilocus sequence analysis (MLSA of 20 of the most highly studied reference strains and clinical isolates of T. denticola; which were originally isolated from subgingival plaque samples taken from subjects from China, Japan, the Netherlands, Canada and the USA. Results The sequences of the 16S ribosomal RNA gene, and 7 conserved protein-encoding genes (flaA, recA, pyrH, ppnK, dnaN, era and radC were successfully determined for each strain. Sequence data was analyzed using a variety of bioinformatic and phylogenetic software tools. We found no evidence of positive selection or DNA recombination within the protein-encoding genes, where levels of intraspecific sequence polymorphism varied from 18.8% (flaA to 8.9% (dnaN. Phylogenetic analysis of the concatenated protein-encoding gene sequence data (ca. 6,513 nucleotides for each strain using Bayesian and maximum likelihood approaches indicated that the T. denticola strains were monophyletic, and formed 6 well-defined clades. All analyzed T. denticola strains appeared to have a genetic origin distinct from that of ‘Treponema vincentii’ or Treponema pallidum. No specific geographical relationships could be established; but several strains isolated from different continents appear to be closely related at the genetic level. Conclusions Our analyses indicate that previous biological and biophysical investigations have predominantly focused on a subset of T. denticola strains with a relatively narrow range of genetic diversity. Our methodology and results establish a genetic framework for the discrimination and phylogenetic

  15. Overview of recurrent chromosomal losses in retinoblastoma detected by low coverage next generation sequencing

    Science.gov (United States)

    García-Chequer, A.J.; Méndez-Tenorio, A.; Olguín-Ruiz, G.; Sánchez-Vallejo, C.; Isa, P.; Arias, C.F.; Torres, J.; Hernández-Angeles, A.; Ramírez-Ortiz, M.A.; Lara, C.; Cabrera-Muñoz, M.L.; Sadowinski-Pine, S.; Bravo-Ortiz, J.C.; Ramón-García, G.; Diegopérez-Ramírez, J.; Ramírez-Reyes, G.; Casarrubias-Islas, R.; Ramírez, J.; Orjuela, M.A.; Ponce-Castañeda, M.V.

    2016-01-01

    Genes are frequently lost or gained in malignant tumors and the analysis of these changes can be informative about the underlying tumor biology. Retinoblastoma is a pediatric intraocular malignancy, and since deletions in chromosome 13 have been described in this tumor, we performed genome wide sequencing with the Illumina platform to test whether recurrent losses could be detected in low coverage data from DNA pools of Rb cases. An in silico reference profile for each pool was created from the human genome sequence GRCh37p5; a chromosome integrity score and a graphics 40 Kb window analysis approach, allowed us to identify with high resolution previously reported non random recurrent losses in all chromosomes of these tumors. We also found a pattern of gains and losses associated to clear and dark cytogenetic bands respectively. We further analyze a pool of medulloblastoma and found a more stable genomic profile and previously reported losses in this tumor. This approach facilitates identification of recurrent deletions from many patients that may be biological relevant for tumor development. PMID:26883451

  16. Finding the right coverage : The impact of coverage and sequence quality on single nucleotide polymorphism genotyping error rates

    NARCIS (Netherlands)

    Fountain, Emily D.; Pauli, Jonathan N.; Reid, Brendan N.; Palsboll, Per J.; Peery, M. Zachariah

    Restriction-enzyme-based sequencing methods enable the genotyping of thousands of single nucleotide polymorphism (SNP) loci in nonmodel organisms. However, in contrast to traditional genetic markers, genotyping error rates in SNPs derived from restriction-enzyme-based methods remain largely unknown.

  17. Low-coverage MiSeq next generation sequencing reveals the mitochondrial genome of the Eastern Rock Lobster, Sagmariasus verreauxi.

    Science.gov (United States)

    Doyle, Stephen R; Griffith, Ian S; Murphy, Nick P; Strugnell, Jan M

    2015-01-01

    The complete mitochondrial genome of the Eastern Rock lobster, Sagmariasus verreauxi, is reported for the first time. Using low-coverage, long read MiSeq next generation sequencing, we constructed and determined the mtDNA genome organization of the 15,470 bp sequence from two isolates from Eastern Tasmania, Australia and Northern New Zealand, and identified 46 polymorphic nucleotides between the two sequences. This genome sequence and its genetic polymorphisms will likely be useful in understanding the distribution and population connectivity of the Eastern Rock Lobster, and in the fisheries management of this commercially important species.

  18. Transcriptome sequencing from diverse human populations reveals differentiated regulatory architecture.

    Directory of Open Access Journals (Sweden)

    Alicia R Martin

    2014-08-01

    Full Text Available Large-scale sequencing efforts have documented extensive genetic variation within the human genome. However, our understanding of the origins, global distribution, and functional consequences of this variation is far from complete. While regulatory variation influencing gene expression has been studied within a handful of populations, the breadth of transcriptome differences across diverse human populations has not been systematically analyzed. To better understand the spectrum of gene expression variation, alternative splicing, and the population genetics of regulatory variation in humans, we have sequenced the genomes, exomes, and transcriptomes of EBV transformed lymphoblastoid cell lines derived from 45 individuals in the Human Genome Diversity Panel (HGDP. The populations sampled span the geographic breadth of human migration history and include Namibian San, Mbuti Pygmies of the Democratic Republic of Congo, Algerian Mozabites, Pathan of Pakistan, Cambodians of East Asia, Yakut of Siberia, and Mayans of Mexico. We discover that approximately 25.0% of the variation in gene expression found amongst individuals can be attributed to population differences. However, we find few genes that are systematically differentially expressed among populations. Of this population-specific variation, 75.5% is due to expression rather than splicing variability, and we find few genes with strong evidence for differential splicing across populations. Allelic expression analyses indicate that previously mapped common regulatory variants identified in eight populations from the International Haplotype Map Phase 3 project have similar effects in our seven sampled HGDP populations, suggesting that the cellular effects of common variants are shared across diverse populations. Together, these results provide a resource for studies analyzing functional differences across populations by estimating the degree of shared gene expression, alternative splicing, and

  19. Fully immunized child: coverage, timing and sequencing of routine immunization in an urban poor settlement in Nairobi, Kenya.

    Science.gov (United States)

    Mutua, Martin Kavao; Kimani-Murage, Elizabeth; Ngomi, Nicholas; Ravn, Henrik; Mwaniki, Peter; Echoka, Elizabeth

    2016-01-01

    More efforts have been put in place to increase full immunization coverage rates in the last decade. Little is known about the levels and consequences of delaying or vaccinating children in different schedules. Vaccine effectiveness depends on the timing of its administration, and it is not optimal if given early, delayed or not given as recommended. Evidence of non-specific effects of vaccines is well documented and could be linked to timing and sequencing of immunization. This paper documents the levels of coverage, timing and sequencing of routine childhood vaccines. The study was conducted between 2007 and 2014 in two informal urban settlements in Nairobi. A total of 3856 children, aged 12-23 months and having a vaccination card seen were included in analysis. Vaccination dates recorded from the cards seen were used to define full immunization coverage, timeliness and sequencing. Proportions, medians and Kaplan-Meier curves were used to assess and describe the levels of full immunization coverage, vaccination delays and sequencing. The findings indicate that 67 % of the children were fully immunized by 12 months of age. Missing measles and third doses of polio and pentavalent vaccine were the main reason for not being fully immunized. Delays were highest for third doses of polio and pentavalent and measles. About 22 % of fully immunized children had vaccines in an out-of-sequence manner with 18 % not receiving pentavalent together with polio vaccine as recommended. Results show higher levels of missed opportunities and low coverage of routine childhood vaccinations given at later ages. New strategies are needed to enable health care providers and parents/guardians to work together to increase the levels of completion of all required vaccinations. In particular, more focus is needed on vaccines given in multiple doses (polio, pentavalent and pneumococcal conjugate vaccines).

  20. IdentiCS – Identification of coding sequence and in silico reconstruction of the metabolic network directly from unannotated low-coverage bacterial genome sequence

    Directory of Open Access Journals (Sweden)

    Zeng An-Ping

    2004-08-01

    Full Text Available Abstract Background A necessary step for a genome level analysis of the cellular metabolism is the in silico reconstruction of the metabolic network from genome sequences. The available methods are mainly based on the annotation of genome sequences including two successive steps, the prediction of coding sequences (CDS and their function assignment. The annotation process takes time. The available methods often encounter difficulties when dealing with unfinished error-containing genomic sequence. Results In this work a fast method is proposed to use unannotated genome sequence for predicting CDSs and for an in silico reconstruction of metabolic networks. Instead of using predicted genes or CDSs to query public databases, entries from public DNA or protein databases are used as queries to search a local database of the unannotated genome sequence to predict CDSs. Functions are assigned to the predicted CDSs simultaneously. The well-annotated genome of Salmonella typhimurium LT2 is used as an example to demonstrate the applicability of the method. 97.7% of the CDSs in the original annotation are correctly identified. The use of SWISS-PROT-TrEMBL databases resulted in an identification of 98.9% of CDSs that have EC-numbers in the published annotation. Furthermore, two versions of sequences of the bacterium Klebsiella pneumoniae with different genome coverage (3.9 and 7.9 fold, respectively are examined. The results suggest that a 3.9-fold coverage of the bacterial genome could be sufficiently used for the in silico reconstruction of the metabolic network. Compared to other gene finding methods such as CRITICA our method is more suitable for exploiting sequences of low genome coverage. Based on the new method, a program called IdentiCS (Identification of Coding Sequences from Unfinished Genome Sequences is delivered that combines the identification of CDSs with the reconstruction, comparison and visualization of metabolic networks (free to download

  1. Analysis of sequence diversity through internal transcribed spacers and simple sequence repeats to identify Dendrobium species.

    Science.gov (United States)

    Liu, Y T; Chen, R K; Lin, S J; Chen, Y C; Chin, S W; Chen, F C; Lee, C Y

    2014-04-08

    The Orchidaceae is one of the largest and most diverse families of flowering plants. The Dendrobium genus has high economic potential as ornamental plants and for medicinal purposes. In addition, the species of this genus are able to produce large crops. However, many Dendrobium varieties are very similar in outward appearance, making it difficult to distinguish one species from another. This study demonstrated that the 12 Dendrobium species used in this study may be divided into 2 groups by internal transcribed spacer (ITS) sequence analysis. Red and yellow flowers may also be used to separate these species into 2 main groups. In particular, the deciduous characteristic is associated with the ITS genetic diversity of the A group. Of 53 designed simple sequence repeat (SSR) primer pairs, 7 pairs were polymorphic for polymerase chain reaction products that were amplified from a specific band. The results of this study demonstrate that these 7 SSR primer pairs may potentially be used to identify Dendrobium species and their progeny in future studies.

  2. Exploring the potential of second-generation sequencing in diverse biological contexts

    DEFF Research Database (Denmark)

    Fordyce, Sarah Louise

    Second generation sequencing (SGS) has revolutionized the study of DNA, allowing massive parallel sequencing of nucleic acids with unprecedented depths of coverage. The research undertaken in this thesis occurred in parallel with the increased accessibility of SGS platforms for routine genetic...

  3. Vaccination coverage and out-of-sequence vaccinations in rural Guinea-Bissau

    DEFF Research Database (Denmark)

    Hornshøj, Linda; Benn, Christine Stabell; Fernandes, Manuel

    2012-01-01

    OBJECTIVE: The WHO aims for 90% coverage of the Expanded Program on Immunization (EPI), which in Guinea-Bissau included BCG vaccine at birth, three doses of diphtheria-tetanus-pertussis vaccine (DTP) and oral polio vaccine (OPV) at 6, 10 and 14 weeks and measles vaccine (MV) at 9 months when...

  4. Low diversity in the mitogenome of sperm whales revealed by next-generation sequencing

    Science.gov (United States)

    Alana Alexander; Debbie Steel; Beth Slikas; Kendra Hoekzema; Colm Carraher; Matthew Parks; Richard Cronn; C. Scott Baker

    2012-01-01

    Large population sizes and global distributions generally associate with high mitochondrial DNA control region (CR) diversity. The sperm whale (Physeter macrocephalus) is an exception, showing low CR diversity relative to other cetaceans; however, diversity levels throughout the remainder of the sperm whale mitogenome are unknown. We sequenced 20...

  5. Heap: a highly sensitive and accurate SNP detection tool for low-coverage high-throughput sequencing data

    KAUST Repository

    Kobayashi, Masaaki

    2017-04-20

    Recent availability of large-scale genomic resources enables us to conduct so called genome-wide association studies (GWAS) and genomic prediction (GP) studies, particularly with next-generation sequencing (NGS) data. The effectiveness of GWAS and GP depends on not only their mathematical models, but the quality and quantity of variants employed in the analysis. In NGS single nucleotide polymorphism (SNP) calling, conventional tools ideally require more reads for higher SNP sensitivity and accuracy. In this study, we aimed to develop a tool, Heap, that enables robustly sensitive and accurate calling of SNPs, particularly with a low coverage NGS data, which must be aligned to the reference genome sequences in advance. To reduce false positive SNPs, Heap determines genotypes and calls SNPs at each site except for sites at the both ends of reads or containing a minor allele supported by only one read. Performance comparison with existing tools showed that Heap achieved the highest F-scores with low coverage (7X) restriction-site associated DNA sequencing reads of sorghum and rice individuals. This will facilitate cost-effective GWAS and GP studies in this NGS era. Code and documentation of Heap are freely available from https://github.com/meiji-bioinf/heap (29 March 2017, date last accessed) and our web site (http://bioinf.mind.meiji.ac.jp/lab/en/tools.html (29 March 2017, date last accessed)).

  6. Genomic Variance Estimation Based on Genotyping-by-Sequencing with Different Coverage in Perennial Ryegrass

    DEFF Research Database (Denmark)

    Ashraf, Bilal; Fé, Dario; Jensen, Just

    2014-01-01

    at each SNP in family pools or polyploids. There are, however, several statistical challenges associated with this method, including low sequencing depth and missing values. Low sequencing depth results in inaccuracies in estimates of allele frequencies for each SNP. In this work we have focused...

  7. Simultaneous and complete genome sequencing of influenza A and B with high coverage by Illumina MiSeq Platform.

    Science.gov (United States)

    Rutvisuttinunt, Wiriya; Chinnawirotpisan, Piyawan; Simasathien, Sriluck; Shrestha, Sanjaya K; Yoon, In-Kyu; Klungthong, Chonticha; Fernandez, Stefan

    2013-11-01

    Active global surveillance and characterization of influenza viruses are essential for better preparation against possible pandemic events. Obtaining comprehensive information about the influenza genome can improve our understanding of the evolution of influenza viruses and emergence of new strains, and improve the accuracy when designing preventive vaccines. This study investigated the use of deep sequencing by the next-generation sequencing (NGS) Illumina MiSeq Platform to obtain complete genome sequence information from influenza virus isolates. The influenza virus isolates were cultured from 6 respiratory acute clinical specimens collected in Thailand and Nepal. DNA libraries obtained from each viral isolate were mixed and all were sequenced simultaneously. Total information of 2.6 Gbases was obtained from a 455±14 K/mm2 density with 95.76% (8,571,655/8,950,724 clusters) of the clusters passing quality control (QC) filters. Approximately 93.7% of all sequences from Read1 and 83.5% from Read2 contained high quality sequences that were ≥Q30, a base calling QC score standard. Alignments analysis identified three seasonal influenza A H3N2 strains, one 2009 pandemic influenza A H1N1 strain and two influenza B strains. The nearly entire genomes of all six virus isolates yielded equal or greater than 600-fold sequence coverage depth. MiSeq Platform identified seasonal influenza A H3N2, 2009 pandemic influenza A H1N1and influenza B in the DNA library mixtures efficiently. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.

  8. An Enumerative Combinatorics Model for Fragmentation Patterns in RNA Sequencing Provides Insights into Nonuniformity of the Expected Fragment Starting-Point and Coverage Profile.

    Science.gov (United States)

    Prakash, Celine; Haeseler, Arndt Von

    2017-03-01

    RNA sequencing (RNA-seq) has emerged as the method of choice for measuring the expression of RNAs in a given cell population. In most RNA-seq technologies, sequencing the full length of RNA molecules requires fragmentation into smaller pieces. Unfortunately, the issue of nonuniform sequencing coverage across a genomic feature has been a concern in RNA-seq and is attributed to biases for certain fragments in RNA-seq library preparation and sequencing. To investigate the expected coverage obtained from fragmentation, we develop a simple fragmentation model that is independent of bias from the experimental method and is not specific to the transcript sequence. Essentially, we enumerate all configurations for maximal placement of a given fragment length, F, on transcript length, T, to represent every possible fragmentation pattern, from which we compute the expected coverage profile across a transcript. We extend this model to incorporate general empirical attributes such as read length, fragment length distribution, and number of molecules of the transcript. We further introduce the fragment starting-point, fragment coverage, and read coverage profiles. We find that the expected profiles are not uniform and that factors such as fragment length to transcript length ratio, read length to fragment length ratio, fragment length distribution, and number of molecules influence the variability of coverage across a transcript. Finally, we explore a potential application of the model where, with simulations, we show that it is possible to correctly estimate the transcript copy number for any transcript in the RNA-seq experiment.

  9. Mariner Mars 1971 television picture catalog. Volume 2: Sequence design and picture coverage

    Science.gov (United States)

    Koskela, P. E.; Helton, M. R.; Seeley, L. N.; Zawacki, S. J.

    1972-01-01

    A collection of data relating to the Mariner 9 TV picture is presented. The data are arranged to offer speedy identification of what took place during entire science cycles, on individual revolutions, and during individual science links or sequences. Summary tables present the nominal design for each of the major picture-taking cycles, along with the sequences actually taken on each revolution. These tables permit identification at a glance, all TV sequences and the corresponding individual pictures for the first 262 revolutions (primary mission). A list of TV pictures, categorized according to their latitude and longitude, is also provided. Orthographic and/or mercator plots for all pictures, along with pertinent numerical data for their center points are presented. Other tables and plots of interest are also included. This document is based upon data contained in the Supplementary Experiment Data Record (SEDR) files as of 21 August 1972.

  10. LOX: Inferring level of expression from diverse methods of census sequencing

    KAUST Repository

    Zhang, Zhang

    2010-06-10

    Summary: We present LOX (Level Of eXpression) that estimates the Level Of gene eXpression from high-throughput-expressed sequence datasets with multiple treatments or samples. Unlike most analyses, LOX incorporates a gene bias model that facilitates integration of diverse transcriptomic sequencing data that arises when transcriptomic data have been produced using diverse experimental methodologies. LOX integrates overall sequence count tallies normalized by total expressed sequence count to provide expression levels for each gene relative to all treatments as well as Bayesian credible intervals. © The Author 2010. Published by Oxford University Press. All rights reserved.

  11. LOX: Inferring level of expression from diverse methods of census sequencing

    KAUST Repository

    Zhang, Zhang; Ló pez-Girá ldez, Francesc Francisco; Townsend, Jeffrey P.

    2010-01-01

    Summary: We present LOX (Level Of eXpression) that estimates the Level Of gene eXpression from high-throughput-expressed sequence datasets with multiple treatments or samples. Unlike most analyses, LOX incorporates a gene bias model that facilitates integration of diverse transcriptomic sequencing data that arises when transcriptomic data have been produced using diverse experimental methodologies. LOX integrates overall sequence count tallies normalized by total expressed sequence count to provide expression levels for each gene relative to all treatments as well as Bayesian credible intervals. © The Author 2010. Published by Oxford University Press. All rights reserved.

  12. Evaluation of genetic diversity in rice using simple sequence repeats ...

    African Journals Online (AJOL)

    The genetic diversity of 64 rice genotypes using 20 SSR primers on chromosome number 7-12 was investigated. DNA was extracted by modified cetyl trimethyl ammonium bromide (CTAB) method. The banding pattern was recorded in the form of 0-1 data sheet which was analyzed using unweighted pair group method with ...

  13. Insights into the genetic structure and diversity of 38 South Asian Indians from deep whole-genome sequencing.

    Directory of Open Access Journals (Sweden)

    Lai-Ping Wong

    2014-05-01

    Full Text Available South Asia possesses a significant amount of genetic diversity due to considerable intergroup differences in culture and language. There have been numerous reports on the genetic structure of Asian Indians, although these have mostly relied on genotyping microarrays or targeted sequencing of the mitochondria and Y chromosomes. Asian Indians in Singapore are primarily descendants of immigrants from Dravidian-language-speaking states in south India, and 38 individuals from the general population underwent deep whole-genome sequencing with a target coverage of 30X as part of the Singapore Sequencing Indian Project (SSIP. The genetic structure and diversity of these samples were compared against samples from the Singapore Sequencing Malay Project and populations in Phase 1 of the 1,000 Genomes Project (1 KGP. SSIP samples exhibited greater intra-population genetic diversity and possessed higher heterozygous-to-homozygous genotype ratio than other Asian populations. When compared against a panel of well-defined Asian Indians, the genetic makeup of the SSIP samples was closely related to South Indians. However, even though the SSIP samples clustered distinctly from the Europeans in the global population structure analysis with autosomal SNPs, eight samples were assigned to mitochondrial haplogroups that were predominantly present in Europeans and possessed higher European admixture than the remaining samples. An analysis of the relative relatedness between SSIP with two archaic hominins (Denisovan, Neanderthal identified higher ancient admixture in East Asian populations than in SSIP. The data resource for these samples is publicly available and is expected to serve as a valuable complement to the South Asian samples in Phase 3 of 1 KGP.

  14. Insights into the genetic structure and diversity of 38 South Asian Indians from deep whole-genome sequencing.

    Science.gov (United States)

    Wong, Lai-Ping; Lai, Jason Kuan-Han; Saw, Woei-Yuh; Ong, Rick Twee-Hee; Cheng, Anthony Youzhi; Pillai, Nisha Esakimuthu; Liu, Xuanyao; Xu, Wenting; Chen, Peng; Foo, Jia-Nee; Tan, Linda Wei-Lin; Koo, Seok-Hwee; Soong, Richie; Wenk, Markus Rene; Lim, Wei-Yen; Khor, Chiea-Chuen; Little, Peter; Chia, Kee-Seng; Teo, Yik-Ying

    2014-05-01

    South Asia possesses a significant amount of genetic diversity due to considerable intergroup differences in culture and language. There have been numerous reports on the genetic structure of Asian Indians, although these have mostly relied on genotyping microarrays or targeted sequencing of the mitochondria and Y chromosomes. Asian Indians in Singapore are primarily descendants of immigrants from Dravidian-language-speaking states in south India, and 38 individuals from the general population underwent deep whole-genome sequencing with a target coverage of 30X as part of the Singapore Sequencing Indian Project (SSIP). The genetic structure and diversity of these samples were compared against samples from the Singapore Sequencing Malay Project and populations in Phase 1 of the 1,000 Genomes Project (1 KGP). SSIP samples exhibited greater intra-population genetic diversity and possessed higher heterozygous-to-homozygous genotype ratio than other Asian populations. When compared against a panel of well-defined Asian Indians, the genetic makeup of the SSIP samples was closely related to South Indians. However, even though the SSIP samples clustered distinctly from the Europeans in the global population structure analysis with autosomal SNPs, eight samples were assigned to mitochondrial haplogroups that were predominantly present in Europeans and possessed higher European admixture than the remaining samples. An analysis of the relative relatedness between SSIP with two archaic hominins (Denisovan, Neanderthal) identified higher ancient admixture in East Asian populations than in SSIP. The data resource for these samples is publicly available and is expected to serve as a valuable complement to the South Asian samples in Phase 3 of 1 KGP.

  15. Performance Evaluation of NIPT in Detection of Chromosomal Copy Number Variants Using Low-Coverage Whole-Genome Sequencing of Plasma DNA

    DEFF Research Database (Denmark)

    Liu, Hongtai; Gao, Ya; Hu, Zhiyang

    2016-01-01

    , including 33 CNVs samples and 886 normal samples from September 1, 2011 to May 31, 2013, were enrolled in this study. The samples were randomly rearranged and blindly sequenced by low-coverage (about 7M reads) whole-genome sequencing of plasma DNA. Fetal CNVs were detected by Fetal Copy-number Analysis...

  16. Genetic diversity in breonadia salicina based on intra-species sequence variation of chloroplast dna spacer sequence

    International Nuclear Information System (INIS)

    Qurainy, F.A.; Gaafar, A.R.Z.

    2014-01-01

    Assessment and knowledge of the genetic diversity and variation within and between populations of rare and endangered plants is very important for effective conservation. Intergenic spacer sequences variation of psbA-trnH locus of chloroplast genome was assessed within Breonadia salicina (Rubiaceae), a critically endangered and endemic plant species to South western part of Kingdom of Saudi Arabia. The obtained sequence data from 19 individuals in three populations revealed nine haplotypes. The aligned sequences obtained from the overall Saudi accessions extended to 355 bp, revealing nine haplotypes. A high level of haplotype diversity (Hd = 0.842) and low level of nucleotide diversity (Pi = 0.0058) were detected. Consistently, both hierarchical analysis of molecular variance (AMOVA) and constructed neighbor-joining tree indicated null genetic differentiation among populations. This level of differentiation between populations or between regions in psbA-trnH sequences may be due to effects of the abundance of ancestral haplotype sharing and the presence of private haplotypes fixed for each population. Furthermore, the results revealed almost the same level of genetic diversity in comparison with Yemeni accessions, in which Saudi accessions were sharing three haplotypes from the four haplotypes found in Yemeni accessions. (author)

  17. Common Wheat Chromosome 5B Composition Analysis Using Low-Coverage 454 Sequencing

    Czech Academy of Sciences Publication Activity Database

    Sergeeva, E.M.; Afonnikov, D. A.; Koltunova, M. K.; Gusev, V.D.; Miroshnichenko, L. A.; Vrána, Jan; Kubaláková, Marie; Poncet, C.; Sourdille, P.; Feuillet, C.; Doležel, Jaroslav; Salina, E.A.

    2014-01-01

    Roč. 7, č. 2 (2014) ISSN 1940-3372 R&D Projects: GA ČR GBP501/12/G090; GA MŠk(CZ) LO1204 Grant - others:GA MŠk(CZ) ED0007/01/01 Program:ED Institutional support: RVO:61389030 Keywords : GENOME SHOTGUN SEQUENCES * IN-SITU HYBRIDIZATION * HEXAPLOID WHEAT Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 3.933, year: 2014

  18. The European sea bass Dicentrarchus labrax genome puzzle: comparative BAC-mapping and low coverage shotgun sequencing

    Directory of Open Access Journals (Sweden)

    Volckaert Filip AM

    2010-01-01

    Full Text Available Abstract Background Food supply from the ocean is constrained by the shortage of domesticated and selected fish. Development of genomic models of economically important fishes should assist with the removal of this bottleneck. European sea bass Dicentrarchus labrax L. (Moronidae, Perciformes, Teleostei is one of the most important fishes in European marine aquaculture; growing genomic resources put it on its way to serve as an economic model. Results End sequencing of a sea bass genomic BAC-library enabled the comparative mapping of the sea bass genome using the three-spined stickleback Gasterosteus aculeatus genome as a reference. BAC-end sequences (102,690 were aligned to the stickleback genome. The number of mappable BACs was improved using a two-fold coverage WGS dataset of sea bass resulting in a comparative BAC-map covering 87% of stickleback chromosomes with 588 BAC-contigs. The minimum size of 83 contigs covering 50% of the reference was 1.2 Mbp; the largest BAC-contig comprised 8.86 Mbp. More than 22,000 BAC-clones aligned with both ends to the reference genome. Intra-chromosomal rearrangements between sea bass and stickleback were identified. Size distributions of mapped BACs were used to calculate that the genome of sea bass may be only 1.3 fold larger than the 460 Mbp stickleback genome. Conclusions The BAC map is used for sequencing single BACs or BAC-pools covering defined genomic entities by second generation sequencing technologies. Together with the WGS dataset it initiates a sea bass genome sequencing project. This will allow the quantification of polymorphisms through resequencing, which is important for selecting highly performing domesticated fish.

  19. Genome sequence and genetic diversity of European ash trees.

    Science.gov (United States)

    Sollars, Elizabeth S A; Harper, Andrea L; Kelly, Laura J; Sambles, Christine M; Ramirez-Gonzalez, Ricardo H; Swarbreck, David; Kaithakottil, Gemy; Cooper, Endymion D; Uauy, Cristobal; Havlickova, Lenka; Worswick, Gemma; Studholme, David J; Zohren, Jasmin; Salmon, Deborah L; Clavijo, Bernardo J; Li, Yi; He, Zhesi; Fellgett, Alison; McKinney, Lea Vig; Nielsen, Lene Rostgaard; Douglas, Gerry C; Kjær, Erik Dahl; Downie, J Allan; Boshier, David; Lee, Steve; Clark, Jo; Grant, Murray; Bancroft, Ian; Caccamo, Mario; Buggs, Richard J A

    2017-01-12

    Ash trees (genus Fraxinus, family Oleaceae) are widespread throughout the Northern Hemisphere, but are being devastated in Europe by the fungus Hymenoscyphus fraxineus, causing ash dieback, and in North America by the herbivorous beetle Agrilus planipennis. Here we sequence the genome of a low-heterozygosity Fraxinus excelsior tree from Gloucestershire, UK, annotating 38,852 protein-coding genes of which 25% appear ash specific when compared with the genomes of ten other plant species. Analyses of paralogous genes suggest a whole-genome duplication shared with olive (Olea europaea, Oleaceae). We also re-sequence 37 F. excelsior trees from Europe, finding evidence for apparent long-term decline in effective population size. Using our reference sequence, we re-analyse association transcriptomic data, yielding improved markers for reduced susceptibility to ash dieback. Surveys of these markers in British populations suggest that reduced susceptibility to ash dieback may be more widespread in Great Britain than in Denmark. We also present evidence that susceptibility of trees to H. fraxineus is associated with their iridoid glycoside levels. This rapid, integrated, multidisciplinary research response to an emerging health threat in a non-model organism opens the way for mitigation of the epidemic.

  20. Calibrating snakehead diversity with DNA barcodes: expanding taxonomic coverage to enable identification of potential and established invasive species.

    Directory of Open Access Journals (Sweden)

    Natasha R Serrao

    Full Text Available Detecting and documenting the occurrence of invasive species outside their native range requires tools to support their identification. This can be challenging for taxa with diverse life stages and/or problematic or unresolved morphological taxonomies. DNA barcoding provides a potent method for identifying invasive species, as it allows for species identification at all life stages, including fragmentary remains. It also provides an efficient interim taxonomic framework for quantifying cryptic genetic diversity by parsing barcode sequences into discontinuous haplogroup clusters (typical of reproductively isolated species and labelling them with unique alphanumeric identifiers. Snakehead fishes are a diverse group of opportunistic predators endemic to Asia and Africa that may potentially pose significant threats as aquatic invasive species. At least three snakehead species (Channa argus, C. maculata, and C. marulius are thought to have entered North America through the aquarium and live-food fish markets, and have established populations, yet their origins remain unclear. The objectives of this study were to assemble a library of DNA barcode sequences derived from expert identified reference specimens in order to determine the identity and aid invasion pathway analysis of the non-indigenous species found in North America using DNA barcodes. Sequences were obtained from 121 tissue samples representing 25 species and combined with public records from GenBank for a total of 36 putative species, which then partitioned into 49 discrete haplogroups. Multiple divergent clusters were observed within C. gachua, C. marulius, C. punctata and C. striata suggesting the potential presence of cryptic species diversity within these lineages. Our findings demonstrate that DNA barcoding is a valuable tool for species identification in challenging and under-studied taxonomic groups such as snakeheads, and provides a useful framework for inferring invasion pathway

  1. High coverage of the complete mitochondrial genome of the rare Gray's beaked whale (Mesoplodon grayi) using Illumina next generation sequencing.

    Science.gov (United States)

    Thompson, Kirsten F; Patel, Selina; Williams, Liam; Tsai, Peter; Constantine, Rochelle; Baker, C Scott; Millar, Craig D

    2016-01-01

    Using an Illumina platform, we shot-gun sequenced the complete mitochondrial genome of Gray's beaked whale (Mesoplodon grayi) to an average coverage of 152X. We performed a de novo assembly using SOAPdenovo2 and determined the total mitogenome length to be 16,347 bp. The nucleotide composition was asymmetric (33.3% A, 24.6% C, 12.6% G, 29.5% T) with an overall GC content of 37.2%. The gene organization was similar to that of other cetaceans with 13 protein-coding genes, 2 rRNAs (12S and 16S), 22 predicted tRNAs and 1 control region or D-loop. We found no evidence of heteroplasmy or nuclear copies of mitochondrial DNA in this individual. Beaked whales within the genus Mesoplodon are rarely seen at sea and their basic biology is poorly understood. These data will contribute to resolving the phylogeography and population ecology of this speciose group.

  2. Repetitive sequences: the hidden diversity of heterochromatin in prochilodontid fish

    Directory of Open Access Journals (Sweden)

    Maria L. Terencio

    2015-08-01

    Full Text Available The structure and organization of repetitive elements in fish genomes are still relatively poorly understood, although most of these elements are believed to be located in heterochromatic regions. Repetitive elements are considered essential in evolutionary processes as hotspots for mutations and chromosomal rearrangements, among other functions – thus providing new genomic alternatives and regulatory sites for gene expression. The present study sought to characterize repetitive DNA sequences in the genomes of Semaprochilodus insignis (Jardine & Schomburgk, 1841 and Semaprochilodus taeniurus (Valenciennes, 1817 and identify regions of conserved syntenic blocks in this genome fraction of three species of Prochilodontidae (S. insignis, S. taeniurus, and Prochilodus lineatus (Valenciennes, 1836 by cross-FISH using Cot-1 DNA (renaturation kinetics probes. We found that the repetitive fractions of the genomes of S. insignis and S. taeniurus have significant amounts of conserved syntenic blocks in hybridization sites, but with low degrees of similarity between them and the genome of P. lineatus, especially in relation to B chromosomes. The cloning and sequencing of the repetitive genomic elements of S. insignis and S. taeniurus using Cot-1 DNA identified 48 fragments that displayed high similarity with repetitive sequences deposited in public DNA databases and classified as microsatellites, transposons, and retrotransposons. The repetitive fractions of the S. insignis and S. taeniurus genomes exhibited high degrees of conserved syntenic blocks in terms of both the structures and locations of hybridization sites, but a low degree of similarity with the syntenic blocks of the P. lineatus genome. Future comparative analyses of other prochilodontidae species will be needed to advance our understanding of the organization and evolution of the genomes in this group of fish.

  3. Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data

    Directory of Open Access Journals (Sweden)

    Niko eBeerenwinkel

    2012-09-01

    Full Text Available Many viruses, including the clinically relevant RNA viruses HIV and HCV, exist in large populations and display high genetic heterogeneity within and between infected hosts. Assessing intra-patient viral genetic diversity is essential for understanding the evolutionary dynamics of viruses, for designing effective vaccines, and for the success of antiviral therapy. Next-generation sequencing technologies allow the rapid and cost-effective acquisition of thousands to millions of short DNA sequences from a single sample. However, this approach entails several challenges in experimental design and computational data analysis. Here, we review the entire process of inferring viral diversity from sample collection to computing measures of genetic diversity. We discuss sample preparation, including reverse transcription and amplification, and the effect of experimental conditions on diversity estimates due to in vitro base substitutions, insertions, deletions, and recombination. The use of different next-generation sequencing platforms and their sequencing error profiles are compared in the context of various applications of diversity estimation, ranging from the detection of single nucleotide variants to the reconstruction of whole-genome haplotypes. We describe the statistical and computational challenges arising from these technical artifacts, and we review existing approaches, including available software, for their solution. Finally, we discuss open problems, and highlight successful biomedical applications and potential future clinical use of next-generation sequencing to estimate viral diversity.

  4. Global sequence diversity of the lactate dehydrogenase gene in Plasmodium falciparum.

    Science.gov (United States)

    Simpalipan, Phumin; Pattaradilokrat, Sittiporn; Harnyuttanakorn, Pongchai

    2018-01-09

    Antigen-detecting rapid diagnostic tests (RDTs) have been recommended by the World Health Organization for use in remote areas to improve malaria case management. Lactate dehydrogenase (LDH) of Plasmodium falciparum is one of the main parasite antigens employed by various commercial RDTs. It has been hypothesized that the poor detection of LDH-based RDTs is attributed in part to the sequence diversity of the gene. To test this, the present study aimed to investigate the genetic diversity of the P. falciparum ldh gene in Thailand and to construct the map of LDH sequence diversity in P. falciparum populations worldwide. The ldh gene was sequenced for 50 P. falciparum isolates in Thailand and compared with hundreds of sequences from P. falciparum populations worldwide. Several indices of molecular variation were calculated, including the proportion of polymorphic sites, the average nucleotide diversity index (π), and the haplotype diversity index (H). Tests of positive selection and neutrality tests were performed to determine signatures of natural selection on the gene. Mean genetic distance within and between species of Plasmodium ldh was analysed to infer evolutionary relationships. Nucleotide sequences of P. falciparum ldh could be classified into 9 alleles, encoding 5 isoforms of LDH. L1a was the most common allelic type and was distributed in P. falciparum populations worldwide. Plasmodium falciparum ldh sequences were highly conserved, with haplotype and nucleotide diversity values of 0.203 and 0.0004, respectively. The extremely low genetic diversity was maintained by purifying selection, likely due to functional constraints. Phylogenetic analysis inferred the close genetic relationship of P. falciparum to malaria parasites of great apes, rather than to other human malaria parasites. This study revealed the global genetic variation of the ldh gene in P. falciparum, providing knowledge for improving detection of LDH-based RDTs and supporting the candidacy of

  5. Transposon fingerprinting using low coverage whole genome shotgun sequencing in cacao (Theobroma cacao L.) and related species.

    Science.gov (United States)

    Sveinsson, Saemundur; Gill, Navdeep; Kane, Nolan C; Cronk, Quentin

    2013-07-24

    Transposable elements (TEs) and other repetitive elements are a large and dynamically evolving part of eukaryotic genomes, especially in plants where they can account for a significant proportion of genome size. Their dynamic nature gives them the potential for use in identifying and characterizing crop germplasm. However, their repetitive nature makes them challenging to study using conventional methods of molecular biology. Next generation sequencing and new computational tools have greatly facilitated the investigation of TE variation within species and among closely related species. (i) We generated low-coverage Illumina whole genome shotgun sequencing reads for multiple individuals of cacao (Theobroma cacao) and related species. These reads were analysed using both an alignment/mapping approach and a de novo (graph based clustering) approach. (ii) A standard set of ultra-conserved orthologous sequences (UCOS) standardized TE data between samples and provided phylogenetic information on the relatedness of samples. (iii) The mapping approach proved highly effective within the reference species but underestimated TE abundance in interspecific comparisons relative to the de novo methods. (iv) Individual T. cacao accessions have unique patterns of TE abundance indicating that the TE composition of the genome is evolving actively within this species. (v) LTR/Gypsy elements are the most abundant, comprising c.10% of the genome. (vi) Within T. cacao the retroelement families show an order of magnitude greater sequence variability than the DNA transposon families. (vii) Theobroma grandiflorum has a similar TE composition to T. cacao, but the related genus Herrania is rather different, with LTRs making up a lower proportion of the genome, perhaps because of a massive presence (c. 20%) of distinctive low complexity satellite-like repeats in this genome. (i) Short read alignment/mapping to reference TE contigs provides a simple and effective method of investigating

  6. Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples

    Directory of Open Access Journals (Sweden)

    Maley Carlo C

    2008-10-01

    Full Text Available Abstract Background Genomes store information for building and maintaining organisms. Complete sequencing of many genomes provides the opportunity to study and compare global information properties of those genomes. Results We have analyzed aspects of the information content of Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, Saccharomyces cerevisiae, and Escherichia coli (K-12 genomes. Virtually all possible (> 98% 12 bp oligomers appear in vertebrate genomes while 98% to D. melanogaster (12–17 bp, C. elegans (11–17 bp, A. thaliana (11–17 bp, S. cerevisiae (10–16 bp and E. coli (9–15 bp. Frequencies of unique oligomers in the genomes follow similar patterns. We identified a set of 2.6 M 15-mers that are more than 1 nucleotide different from all 15-mers in the human genome and so could be used as probes to detect microbes in human samples. In a human sample, these probes would detect 100% of the 433 currently fully sequenced prokaryotes and 75% of the 3065 fully sequenced viruses. The human genome is significantly more compact in sequence space than a random genome. We identified the most frequent 5- to 20-mers in the human genome, which may prove useful as PCR primers. We also identified a bacterium, Anaeromyxobacter dehalogenans, which has an exceptionally low diversity of oligomers given the size of its genome and its GC content. The entropy of coding regions in the human genome is significantly higher than non-coding regions and chromosomes. However chromosomes 1, 2, 9, 12 and 14 have a relatively high proportion of coding DNA without high entropy, and chromosome 20 is the opposite with a low frequency of coding regions but relatively high entropy. Conclusion Measures of the frequency of oligomers are useful for designing PCR assays and for identifying chromosomes and organisms with hidden structure that had not been previously recognized. This information may be used to detect

  7. Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples

    Science.gov (United States)

    Liu, Zhandong; Venkatesh, Santosh S; Maley, Carlo C

    2008-01-01

    Background Genomes store information for building and maintaining organisms. Complete sequencing of many genomes provides the opportunity to study and compare global information properties of those genomes. Results We have analyzed aspects of the information content of Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, Saccharomyces cerevisiae, and Escherichia coli (K-12) genomes. Virtually all possible (> 98%) 12 bp oligomers appear in vertebrate genomes while 98% to < 2% of possible oligomers in D. melanogaster (12–17 bp), C. elegans (11–17 bp), A. thaliana (11–17 bp), S. cerevisiae (10–16 bp) and E. coli (9–15 bp). Frequencies of unique oligomers in the genomes follow similar patterns. We identified a set of 2.6 M 15-mers that are more than 1 nucleotide different from all 15-mers in the human genome and so could be used as probes to detect microbes in human samples. In a human sample, these probes would detect 100% of the 433 currently fully sequenced prokaryotes and 75% of the 3065 fully sequenced viruses. The human genome is significantly more compact in sequence space than a random genome. We identified the most frequent 5- to 20-mers in the human genome, which may prove useful as PCR primers. We also identified a bacterium, Anaeromyxobacter dehalogenans, which has an exceptionally low diversity of oligomers given the size of its genome and its GC content. The entropy of coding regions in the human genome is significantly higher than non-coding regions and chromosomes. However chromosomes 1, 2, 9, 12 and 14 have a relatively high proportion of coding DNA without high entropy, and chromosome 20 is the opposite with a low frequency of coding regions but relatively high entropy. Conclusion Measures of the frequency of oligomers are useful for designing PCR assays and for identifying chromosomes and organisms with hidden structure that had not been previously recognized. This information may be used to

  8. Salmonella enterica Prophage Sequence Profiles Reflect Genome Diversity and Can Be Used for High Discrimination Subtyping

    Directory of Open Access Journals (Sweden)

    Walid Mottawea

    2018-05-01

    Full Text Available Non-typhoidal Salmonella is a leading cause of foodborne illness worldwide. Prompt and accurate identification of the sources of Salmonella responsible for disease outbreaks is crucial to minimize infections and eliminate ongoing sources of contamination. Current subtyping tools including single nucleotide polymorphism (SNP typing may be inadequate, in some instances, to provide the required discrimination among epidemiologically unrelated Salmonella strains. Prophage genes represent the majority of the accessory genes in bacteria genomes and have potential to be used as high discrimination markers in Salmonella. In this study, the prophage sequence diversity in different Salmonella serovars and genetically related strains was investigated. Using whole genome sequences of 1,760 isolates of S. enterica representing 151 Salmonella serovars and 66 closely related bacteria, prophage sequences were identified from assembled contigs using PHASTER. We detected 154 different prophages in S. enterica genomes. Prophage sequences were highly variable among S. enterica serovars with a median ± interquartile range (IQR of 5 ± 3 prophage regions per genome. While some prophage sequences were highly conserved among the strains of specific serovars, few regions were lineage specific. Therefore, strains belonging to each serovar could be clustered separately based on their prophage content. Analysis of S. Enteritidis isolates from seven outbreaks generated distinct prophage profiles for each outbreak. Taken altogether, the diversity of the prophage sequences correlates with genome diversity. Prophage repertoires provide an additional marker for differentiating S. enterica subtypes during foodborne outbreaks.

  9. Population diversity of Diaphorina citri (Hemiptera: Liviidae) in China based on whole mitochondrial genome sequences.

    Science.gov (United States)

    Wu, Fengnian; Jiang, Hongyan; Beattie, G Andrew C; Holford, Paul; Chen, Jianchi; Wallis, Christopher M; Zheng, Zheng; Deng, Xiaoling; Cen, Yijing

    2018-04-24

    Diaphorina citri (Asian citrus psyllid; ACP) transmits 'Candidatus Liberibacter asiaticus' associated with citrus Huanglongbing (HLB). ACP has been reported in 11 provinces/regions in China, yet its population diversity remains unclear. In this study, we evaluated ACP population diversity in China using representative whole mitochondrial genome (mitogenome) sequences. Additional mitogenome sequences outside China were also acquired and evaluated. The sizes of the 27 ACP mitogenome sequences ranged from 14 986 to 15 030 bp. Along with three previously published mitogenome sequences, the 30 sequences formed three major mitochondrial groups (MGs): MG1, present in southwestern China and occurring at elevations above 1000 m; MG2, present in southeastern China and Southeast Asia (Cambodia, Indonesia, Malaysia, and Vietnam) and occurring at elevations below 180 m; and MG3, present in the USA and Pakistan. Single nucleotide polymorphisms in five genes (cox2, atp8, nad3, nad1 and rrnL) contributed mostly in the ACP diversity. Among these genes, rrnL had the most variation. Mitogenome sequences analyses revealed two major phylogenetic groups of ACP present in China as well as a possible unique group present currently in Pakistan and the USA. The information could have significant implications for current ACP control and HLB management. © 2018 Society of Chemical Industry. © 2018 Society of Chemical Industry.

  10. Protein-Level Integration Strategy of Multiengine MS Spectra Search Results for Higher Confidence and Sequence Coverage.

    Science.gov (United States)

    Zhao, Panpan; Zhong, Jiayong; Liu, Wanting; Zhao, Jing; Zhang, Gong

    2017-12-01

    Multiple search engines based on various models have been developed to search MS/MS spectra against a reference database, providing different results for the same data set. How to integrate these results efficiently with minimal compromise on false discoveries is an open question due to the lack of an independent, reliable, and highly sensitive standard. We took the advantage of the translating mRNA sequencing (RNC-seq) result as a standard to evaluate the integration strategies of the protein identifications from various search engines. We used seven mainstream search engines (Andromeda, Mascot, OMSSA, X!Tandem, pFind, InsPecT, and ProVerB) to search the same label-free MS data sets of human cell lines Hep3B, MHCCLM3, and MHCC97H from the Chinese C-HPP Consortium for Chromosomes 1, 8, and 20. As expected, the union of seven engines resulted in a boosted false identification, whereas the intersection of seven engines remarkably decreased the identification power. We found that identifications of at least two out of seven engines resulted in maximizing the protein identification power while minimizing the ratio of suspicious/translation-supported identifications (STR), as monitored by our STR index, based on RNC-Seq. Furthermore, this strategy also significantly improves the peptides coverage of the protein amino acid sequence. In summary, we demonstrated a simple strategy to significantly improve the performance for shotgun mass spectrometry by protein-level integrating multiple search engines, maximizing the utilization of the current MS spectra without additional experimental work.

  11. Somatosensory neuron types identified by high-coverage single-cell RNA-sequencing and functional heterogeneity

    Science.gov (United States)

    Li, Chang-Lin; Li, Kai-Cheng; Wu, Dan; Chen, Yan; Luo, Hao; Zhao, Jing-Rong; Wang, Sa-Shuang; Sun, Ming-Ming; Lu, Ying-Jin; Zhong, Yan-Qing; Hu, Xu-Ye; Hou, Rui; Zhou, Bei-Bei; Bao, Lan; Xiao, Hua-Sheng; Zhang, Xu

    2016-01-01

    Sensory neurons are distinguished by distinct signaling networks and receptive characteristics. Thus, sensory neuron types can be defined by linking transcriptome-based neuron typing with the sensory phenotypes. Here we classify somatosensory neurons of the mouse dorsal root ganglion (DRG) by high-coverage single-cell RNA-sequencing (10 950 ± 1 218 genes per neuron) and neuron size-based hierarchical clustering. Moreover, single DRG neurons responding to cutaneous stimuli are recorded using an in vivo whole-cell patch clamp technique and classified by neuron-type genetic markers. Small diameter DRG neurons are classified into one type of low-threshold mechanoreceptor and five types of mechanoheat nociceptors (MHNs). Each of the MHN types is further categorized into two subtypes. Large DRG neurons are categorized into four types, including neurexophilin 1-expressing MHNs and mechanical nociceptors (MNs) expressing BAI1-associated protein 2-like 1 (Baiap2l1). Mechanoreceptors expressing trafficking protein particle complex 3-like and Baiap2l1-marked MNs are subdivided into two subtypes each. These results provide a new system for cataloging somatosensory neurons and their transcriptome databases. PMID:26691752

  12. Estimating and comparing microbial diversity in the presence of sequencing errors

    Science.gov (United States)

    Chiu, Chun-Huo

    2016-01-01

    Estimating and comparing microbial diversity are statistically challenging due to limited sampling and possible sequencing errors for low-frequency counts, producing spurious singletons. The inflated singleton count seriously affects statistical analysis and inferences about microbial diversity. Previous statistical approaches to tackle the sequencing errors generally require different parametric assumptions about the sampling model or about the functional form of frequency counts. Different parametric assumptions may lead to drastically different diversity estimates. We focus on nonparametric methods which are universally valid for all parametric assumptions and can be used to compare diversity across communities. We develop here a nonparametric estimator of the true singleton count to replace the spurious singleton count in all methods/approaches. Our estimator of the true singleton count is in terms of the frequency counts of doubletons, tripletons and quadrupletons, provided these three frequency counts are reliable. To quantify microbial alpha diversity for an individual community, we adopt the measure of Hill numbers (effective number of taxa) under a nonparametric framework. Hill numbers, parameterized by an order q that determines the measures’ emphasis on rare or common species, include taxa richness (q = 0), Shannon diversity (q = 1, the exponential of Shannon entropy), and Simpson diversity (q = 2, the inverse of Simpson index). A diversity profile which depicts the Hill number as a function of order q conveys all information contained in a taxa abundance distribution. Based on the estimated singleton count and the original non-singleton frequency counts, two statistical approaches (non-asymptotic and asymptotic) are developed to compare microbial diversity for multiple communities. (1) A non-asymptotic approach refers to the comparison of estimated diversities of standardized samples with a common finite sample size or sample completeness. This

  13. Complete sequence and diversity of a maize-associated Polerovirus in East Africa

    Science.gov (United States)

    Since 2011-2012, Maize lethal necrosis (MLN) has emerged in East Africa, causing massive yield loss and propelling research to identify viruses and virus populations present in maize. As expected, next generation sequencing (NGS) has revealed diverse and abundant viruses from the family Potyviridae,...

  14. Nucleotide Sequence Diversity and Linkage Disequilibrium of Four Nuclear Loci in Foxtail Millet (Setaria italica.

    Directory of Open Access Journals (Sweden)

    Shui-Lian He

    Full Text Available Foxtail millet (Setaria italica (L. Beauv is one of the earliest domesticated grains, which has been cultivated in northern China by 8,700 years before present (YBP and across Eurasia by 4,000 YBP. Owing to a small genome and diploid nature, foxtail millet is a tractable model crop for studying functional genomics of millets and bioenergy grasses. In this study, we examined nucleotide sequence diversity, geographic structure, and levels of linkage disequilibrium at four nuclear loci (ADH1, G3PDH, IGS1 and TPI1 in representative samples of 311 landrace accessions across its cultivated range. Higher levels of nucleotide sequence and haplotype diversity were observed in samples from China relative to other sampled regions. Genetic assignment analysis classified the accessions into seven clusters based on nucleotide sequence polymorphisms. Intralocus LD decayed rapidly to half the initial value within ~1.2 kb or less.

  15. Nucleotide Sequence Diversity and Linkage Disequilibrium of Four Nuclear Loci in Foxtail Millet (Setaria italica).

    Science.gov (United States)

    He, Shui-Lian; Yang, Yang; Morrell, Peter L; Yi, Ting-Shuang

    2015-01-01

    Foxtail millet (Setaria italica (L.) Beauv) is one of the earliest domesticated grains, which has been cultivated in northern China by 8,700 years before present (YBP) and across Eurasia by 4,000 YBP. Owing to a small genome and diploid nature, foxtail millet is a tractable model crop for studying functional genomics of millets and bioenergy grasses. In this study, we examined nucleotide sequence diversity, geographic structure, and levels of linkage disequilibrium at four nuclear loci (ADH1, G3PDH, IGS1 and TPI1) in representative samples of 311 landrace accessions across its cultivated range. Higher levels of nucleotide sequence and haplotype diversity were observed in samples from China relative to other sampled regions. Genetic assignment analysis classified the accessions into seven clusters based on nucleotide sequence polymorphisms. Intralocus LD decayed rapidly to half the initial value within ~1.2 kb or less.

  16. HIV-1 envelope sequence-based diversity measures for identifying recent infections.

    Directory of Open Access Journals (Sweden)

    Alexis Kafando

    Full Text Available Identifying recent HIV-1 infections is crucial for monitoring HIV-1 incidence and optimizing public health prevention efforts. To identify recent HIV-1 infections, we evaluated and compared the performance of 4 sequence-based diversity measures including percent diversity, percent complexity, Shannon entropy and number of haplotypes targeting 13 genetic segments within the env gene of HIV-1. A total of 597 diagnostic samples obtained in 2013 and 2015 from recently and chronically HIV-1 infected individuals were selected. From the selected samples, 249 (134 from recent versus 115 from chronic infections env coding regions, including V1-C5 of gp120 and the gp41 ectodomain of HIV-1, were successfully amplified and sequenced by next generation sequencing (NGS using the Illumina MiSeq platform. The ability of the four sequence-based diversity measures to correctly identify recent HIV infections was evaluated using the frequency distribution curves, median and interquartile range and area under the curve (AUC of the receiver operating characteristic (ROC. Comparing the median and interquartile range and evaluating the frequency distribution curves associated with the 4 sequence-based diversity measures, we observed that the percent diversity, number of haplotypes and Shannon entropy demonstrated significant potential to discriminate recent from chronic infections (p<0.0001. Using the AUC of ROC analysis, only the Shannon entropy measure within three HIV-1 env segments could accurately identify recent infections at a satisfactory level. The env segments were gp120 C2_1 (AUC = 0.806, gp120 C2_3 (AUC = 0.805 and gp120 V3 (AUC = 0.812. Our results clearly indicate that the Shannon entropy measure represents a useful tool for predicting HIV-1 infection recency.

  17. Genotyping-By-Sequencing for Plant Genetic Diversity Analysis: A Lab Guide for SNP Genotyping

    Directory of Open Access Journals (Sweden)

    Gregory W. Peterson

    2014-10-01

    Full Text Available Genotyping-by-sequencing (GBS has recently emerged as a promising genomic approach for exploring plant genetic diversity on a genome-wide scale. However, many uncertainties and challenges remain in the application of GBS, particularly in non-model species. Here, we present a GBS protocol we developed and use for plant genetic diversity analysis. It uses two restriction enzymes to reduce genome complexity, applies Illumina multiplexing indexes for barcoding and has a custom bioinformatics pipeline for genotyping. This genetic diversity-focused GBS (gd-GBS protocol can serve as an easy-to-follow lab guide to assist a researcher through every step of a GBS application with five main components: sample preparation, library assembly, sequencing, SNP calling and diversity analysis. Specifically, in this presentation, we provide a brief overview of the GBS approach, describe the gd-GBS procedures, illustrate it with an application to analyze genetic diversity in 20 flax (Linum usitatissimum L. accessions and discuss related issues in GBS application. Following these lab bench procedures and using the custom bioinformatics pipeline, one could generate genome-wide SNP genotype data for a conventional genetic diversity analysis of a non-model plant species.

  18. Comparison of two next-generation sequencing kits for diagnosis of epileptic disorders with a user-friendly tool for displaying gene coverage, DeCovA

    Directory of Open Access Journals (Sweden)

    Sarra Dimassi

    2015-12-01

    Full Text Available In recent years, molecular genetics has been playing an increasing role in the diagnostic process of monogenic epilepsies. Knowing the genetic basis of one patient's epilepsy provides accurate genetic counseling and may guide therapeutic options. Genetic diagnosis of epilepsy syndromes has long been based on Sanger sequencing and search for large rearrangements using MLPA or DNA arrays (array-CGH or SNP-array. Recently, next-generation sequencing (NGS was demonstrated to be a powerful approach to overcome the wide clinical and genetic heterogeneity of epileptic disorders. Coverage is critical for assessing the quality and accuracy of results from NGS. However, it is often a difficult parameter to display in practice. The aim of the study was to compare two library-building methods (Haloplex, Agilent and SeqCap EZ, Roche for a targeted panel of 41 genes causing monogenic epileptic disorders. We included 24 patients, 20 of whom had known disease-causing mutations. For each patient both libraries were built in parallel and sequenced on an Ion Torrent Personal Genome Machine (PGM. To compare coverage and depth, we developed a simple homemade tool, named DeCovA (Depth and Coverage Analysis. DeCovA displays the sequencing depth of each base and the coverage of target genes for each genomic position. The fraction of each gene covered at different thresholds could be easily estimated. None of the two methods used, namely NextGene and Ion Reporter, were able to identify all the known mutations/CNVs displayed by the 20 patients. Variant detection rate was globally similar for the two techniques and DeCovA showed that failure to detect a mutation was mainly related to insufficient coverage.

  19. Evaluating Methods for Isolating Total RNA and Predicting the Success of Sequencing Phylogenetically Diverse Plant Transcriptomes

    Science.gov (United States)

    Bruskiewich, Richard; Burris, Jason N.; Carrigan, Charlotte T.; Chase, Mark W.; Clarke, Neil D.; Covshoff, Sarah; dePamphilis, Claude W.; Edger, Patrick P.; Goh, Falicia; Graham, Sean; Greiner, Stephan; Hibberd, Julian M.; Jordon-Thaden, Ingrid; Kutchan, Toni M.; Leebens-Mack, James; Melkonian, Michael; Miles, Nicholas; Myburg, Henrietta; Patterson, Jordan; Pires, J. Chris; Ralph, Paula; Rolf, Megan; Sage, Rowan F.; Soltis, Douglas; Soltis, Pamela; Stevenson, Dennis; Stewart, C. Neal; Surek, Barbara; Thomsen, Christina J. M.; Villarreal, Juan Carlos; Wu, Xiaolei; Zhang, Yong; Deyholos, Michael K.; Wong, Gane Ka-Shu

    2012-01-01

    Next-generation sequencing plays a central role in the characterization and quantification of transcriptomes. Although numerous metrics are purported to quantify the quality of RNA, there have been no large-scale empirical evaluations of the major determinants of sequencing success. We used a combination of existing and newly developed methods to isolate total RNA from 1115 samples from 695 plant species in 324 families, which represents >900 million years of phylogenetic diversity from green algae through flowering plants, including many plants of economic importance. We then sequenced 629 of these samples on Illumina GAIIx and HiSeq platforms and performed a large comparative analysis to identify predictors of RNA quality and the diversity of putative genes (scaffolds) expressed within samples. Tissue types (e.g., leaf vs. flower) varied in RNA quality, sequencing depth and the number of scaffolds. Tissue age also influenced RNA quality but not the number of scaffolds ≥1000 bp. Overall, 36% of the variation in the number of scaffolds was explained by metrics of RNA integrity (RIN score), RNA purity (OD 260/230), sequencing platform (GAIIx vs HiSeq) and the amount of total RNA used for sequencing. However, our results show that the most commonly used measures of RNA quality (e.g., RIN) are weak predictors of the number of scaffolds because Illumina sequencing is robust to variation in RNA quality. These results provide novel insight into the methods that are most important in isolating high quality RNA for sequencing and assembling plant transcriptomes. The methods and recommendations provided here could increase the efficiency and decrease the cost of RNA sequencing for individual labs and genome centers. PMID:23185583

  20. A robust, simple genotyping-by-sequencing (GBS approach for high diversity species.

    Directory of Open Access Journals (Sweden)

    Robert J Elshire

    Full Text Available Advances in next generation technologies have driven the costs of DNA sequencing down to the point that genotyping-by-sequencing (GBS is now feasible for high diversity, large genome species. Here, we report a procedure for constructing GBS libraries based on reducing genome complexity with restriction enzymes (REs. This approach is simple, quick, extremely specific, highly reproducible, and may reach important regions of the genome that are inaccessible to sequence capture approaches. By using methylation-sensitive REs, repetitive regions of genomes can be avoided and lower copy regions targeted with two to three fold higher efficiency. This tremendously simplifies computationally challenging alignment problems in species with high levels of genetic diversity. The GBS procedure is demonstrated with maize (IBM and barley (Oregon Wolfe Barley recombinant inbred populations where roughly 200,000 and 25,000 sequence tags were mapped, respectively. An advantage in species like barley that lack a complete genome sequence is that a reference map need only be developed around the restriction sites, and this can be done in the process of sample genotyping. In such cases, the consensus of the read clusters across the sequence tagged sites becomes the reference. Alternatively, for kinship analyses in the absence of a reference genome, the sequence tags can simply be treated as dominant markers. Future application of GBS to breeding, conservation, and global species and population surveys may allow plant breeders to conduct genomic selection on a novel germplasm or species without first having to develop any prior molecular tools, or conservation biologists to determine population structure without prior knowledge of the genome or diversity in the species.

  1. Genetic diversity of mtDNA D-loop sequences in four native Chinese chicken breeds.

    Science.gov (United States)

    Guo, H W; Li, C; Wang, X N; Li, Z J; Sun, G R; Li, G X; Liu, X J; Kang, X T; Han, R L

    2017-10-01

    1. To explore the genetic diversity of Chinese indigenous chicken breeds, a 585 bp fragment of the mitochondrial DNA (mtDNA) region was sequenced in 102 birds from the Xichuan black-bone chicken, Yunyang black-bone chicken and Lushi chicken. In addition, 30 mtDNA D-loop sequences of Silkie fowls were downloaded from NCBI. The mtDNA D-loop sequence polymorphism and maternal origin of 4 chicken breeds were analysed in this study. 2. The results showed that a total of 33 mutation sites and 28 haplotypes were detected in the 4 chicken breeds. The haplotype diversity and nucleotide diversity of these 4 native breeds were 0.916 ± 0.014 and 0.012 ± 0.002, respectively. Three clusters were formed in 4 Chinese native chickens and 12 reference breeds. Both the Xichuan black-bone chicken and Yunyang black-bone chicken were grouped into one cluster. Four haplogroups (A, B, C and E) emerged in the median-joining network in these breeds. 3. It was concluded that these 4 Chinese chicken breeds had high genetic diversity. The phylogenetic tree and median network profiles showed that Chinese native chickens and its neighbouring countries had at least two maternal origins, one from Yunnan, China and another from Southeast Asia or its surrounding area.

  2. Genetic diversity analysis of Leuconostoc mesenteroides from Korean vegetables and food products by multilocus sequence typing.

    Science.gov (United States)

    Sharma, Anshul; Kaur, Jasmine; Lee, Sulhee; Park, Young-Seo

    2018-06-01

    In the present study, 35 Leuconostoc mesenteroides strains isolated from vegetables and food products from South Korea were studied by multilocus sequence typing (MLST) of seven housekeeping genes (atpA, groEL, gyrB, pheS, pyrG, rpoA, and uvrC). The fragment sizes of the seven amplified housekeeping genes ranged in length from 366 to 1414 bp. Sequence analysis indicated 27 different sequence types (STs) with 25 of them being represented by a single strain indicating high genetic diversity, whereas the remaining 2 were characterized by five strains each. In total, 220 polymorphic nucleotide sites were detected among seven housekeeping genes. The phylogenetic analysis based on the STs of the seven loci indicated that the 35 strains belonged to two major groups, A (28 strains) and B (7 strains). Split decomposition analysis showed that intraspecies recombination played a role in generating diversity among strains. The minimum spanning tree showed that the evolution of the STs was not correlated with food source. This study signifies that the multilocus sequence typing is a valuable tool to access the genetic diversity among L. mesenteroides strains from South Korea and can be used further to monitor the evolutionary changes.

  3. Characterization of sequence diversity in Plasmodium falciparum SERA5 from Indian isolates

    Directory of Open Access Journals (Sweden)

    Rahul C.N

    2015-06-01

    Full Text Available Objective: To characterize the sequence diversity of blood-stage Plasmodium falciparum serine repeat antigen-5 (PfSERA5 which is lacking in a malaria-endemic country like India. Methods: In this study, parasitic DNA was obtained from field isolates collected from various geographic regions. Subsequently, PfSERA5 gene sequence was PCR amplified and DNA sequenced. Results: We reported the existence of unique repeat polymorphisms and novel haplotypes for both the octamer repeat (OR and serine repeat (SR regions of the N-terminal fragment of PfSERA5 from Indian isolates. Several isolates from India were identical to low-frequency African haplotypes. Unique finding of our study was an Indian isolate showing deletion in a perfectly conserved 14 mer sequence within octamer repeat. Indian haplotypes reported in this study were found to be distributed into the three earlier classified allelic clusters of FCR3, K1 and Honduras showcasing broad diversity as compared to worldwide haplotypes. Conclusions: This study is the first report on genetic diversity of PfSERA5 antigen from India. Further evaluation of these haplotypes by serotyping would provide useful information for investigating variant-specific immunity and aid in malaria vaccine research.

  4. Development of novel InDel markers and genetic diversity in Chenopodium quinoa through whole-genome re-sequencing.

    Science.gov (United States)

    Zhang, Tifu; Gu, Minfeng; Liu, Yuhe; Lv, Yuanda; Zhou, Ling; Lu, Haiyan; Liang, Shuaiqiang; Bao, Huabin; Zhao, Han

    2017-09-05

    Quinoa (Chenopodium quinoa Willd.) is a balanced nutritional crop, but its breeding improvement has been limited by the lack of information on its genetics and genomics. Therefore, it is necessary to obtain knowledge on genomic variation, population structure, and genetic diversity and to develop novel Insertion/Deletion (InDel) markers for quinoa by whole-genome re-sequencing. We re-sequenced 11 quinoa accessions and obtained a coverage depth between approximately 7× to 23× the quinoa genome. Based on the 1453-megabase (Mb) assembly from the reference accession Riobamba, 8,441,022 filtered bi-allelic single nucleotide polymorphisms (SNPs) and 842,783 filtered InDels were identified, with an estimated SNP and InDel density of 5.81 and 0.58 per kilobase (kb). From the genomic InDel variations, 85 dimorphic InDel markers were newly developed and validated. Together with the 62 simple sequence repeat (SSR) markers reported, a total of 147 markers were used for genotyping the 129 quinoa accessions. Molecular grouping analysis showed classification into two major groups, the Andean highland (composed of the northern and southern highland subgroups) and Chilean coastal, based on combined STRUCTURE, phylogenetic tree and PCA (Principle Component Analysis) analyses. Further analysis of the genetic diversity exhibited a decreasing tendency from the Chilean coast group to the Andean highland group, and the gene flow between subgroups was more frequent than that between the two subgroups and the Chilean coastal group. The majority of the variations (approximately 70%) were found through an analysis of molecular variation (AMOVA) due to the diversity between the groups. This was congruent with the observation of a highly significant F ST value (0.705) between the groups, demonstrating significant genetic differentiation between the Andean highland type of quinoa and the Chilean coastal type. Moreover, a core set of 16 quinoa germplasms that capture all 362 alleles was

  5. Detection of Diverse Novel Bat Astrovirus Sequences in the Czech Republic.

    Science.gov (United States)

    Dufkova, Lucie; Straková, Petra; Širmarová, Jana; Salát, Jiří; Moutelíková, Romana; Chrudimský, Tomáš; Bartonička, Tomáš; Nowotny, Norbert; Růžek, Daniel

    2015-08-01

    Astroviruses are a major cause of gastroenteritis in humans and animals. Recently, novel groups of astroviruses were identified in apparently healthy insectivorous bats. We report the detection of diverse novel astrovirus sequences in nine different European bat species: Eptesicus serotinus, Hypsugo savii, Myotis emarginatus, M. mystacinus, Nyctalus noctula, Pipistrellus nathusii or P. pygmaeus, P. pipistrellus, Vespertilio murinus, and Rhinolophus hipposideros. In six bat species, astrovirus sequences were detected for the first time. One astrovirus strain detected in R. hipposideros clustered phylogenetically with Chinese astrovirus strains originating from bats of the families Rhinolophidae and Hipposideridae. All other Czech astrovirus sequences from vesper bats formed, together with one Hungarian sequence, a separate monophyletic lineage within the bat astrovirus group. These findings provide new insights into the molecular epidemiology, ecology, and prevalence of astroviruses in European bat populations.

  6. Phylogenetic diversity in the core group of Peziza inferred from ITS sequences and morphology

    DEFF Research Database (Denmark)

    Hansen, K.; Læssøe, Thomas; Pfister, D.H.

    2002-01-01

    Species delimitation within the core group of Peziza is highly controversial. The group, typified by P. vesiculosa, is morphologically coherent and in previous analyses of LSU rDNA sequences it formed a highly supported clade. Phylogenetic diversity and species limits were investigated within......), shallowly cup- to disc-shaped apothecia (A) and large (up to 15 cm), deeply cup-shaped to expanded apothecia (B). The overall exciple structure (a stratified or non-stratified medullary layer) and to some degree spore surface relief, likewise support the groupings. Clade A contains taxa with smooth...... that populations on a diverse array of substrates may be closely related, or indeed, conspecific....

  7. Genome sequence diversity and clues to the evolution of variola (smallpox) virus.

    Science.gov (United States)

    Esposito, Joseph J; Sammons, Scott A; Frace, A Michael; Osborne, John D; Olsen-Rasmussen, Melissa; Zhang, Ming; Govil, Dhwani; Damon, Inger K; Kline, Richard; Laker, Miriam; Li, Yu; Smith, Geoffrey L; Meyer, Hermann; Leduc, James W; Wohlhueter, Robert M

    2006-08-11

    Comparative genomics of 45 epidemiologically varied variola virus isolates from the past 30 years of the smallpox era indicate low sequence diversity, suggesting that there is probably little difference in the isolates' functional gene content. Phylogenetic clustering inferred three clades coincident with their geographical origin and case-fatality rate; the latter implicated putative proteins that mediate viral virulence differences. Analysis of the viral linear DNA genome suggests that its evolution involved direct descent and DNA end-region recombination events. Knowing the sequences will help understand the viral proteome and improve diagnostic test precision, therapeutics, and systems for their assessment.

  8. Using sequence similarity networks for visualization of relationships across diverse protein superfamilies.

    Directory of Open Access Journals (Sweden)

    Holly J Atkinson

    Full Text Available The dramatic increase in heterogeneous types of biological data--in particular, the abundance of new protein sequences--requires fast and user-friendly methods for organizing this information in a way that enables functional inference. The most widely used strategy to link sequence or structure to function, homology-based function prediction, relies on the fundamental assumption that sequence or structural similarity implies functional similarity. New tools that extend this approach are still urgently needed to associate sequence data with biological information in ways that accommodate the real complexity of the problem, while being accessible to experimental as well as computational biologists. To address this, we have examined the application of sequence similarity networks for visualizing functional trends across protein superfamilies from the context of sequence similarity. Using three large groups of homologous proteins of varying types of structural and functional diversity--GPCRs and kinases from humans, and the crotonase superfamily of enzymes--we show that overlaying networks with orthogonal information is a powerful approach for observing functional themes and revealing outliers. In comparison to other primary methods, networks provide both a good representation of group-wise sequence similarity relationships and a strong visual and quantitative correlation with phylogenetic trees, while enabling analysis and visualization of much larger sets of sequences than trees or multiple sequence alignments can easily accommodate. We also define important limitations and caveats in the application of these networks. As a broadly accessible and effective tool for the exploration of protein superfamilies, sequence similarity networks show great potential for generating testable hypotheses about protein structure-function relationships.

  9. Using sequence similarity networks for visualization of relationships across diverse protein superfamilies.

    Science.gov (United States)

    Atkinson, Holly J; Morris, John H; Ferrin, Thomas E; Babbitt, Patricia C

    2009-01-01

    The dramatic increase in heterogeneous types of biological data--in particular, the abundance of new protein sequences--requires fast and user-friendly methods for organizing this information in a way that enables functional inference. The most widely used strategy to link sequence or structure to function, homology-based function prediction, relies on the fundamental assumption that sequence or structural similarity implies functional similarity. New tools that extend this approach are still urgently needed to associate sequence data with biological information in ways that accommodate the real complexity of the problem, while being accessible to experimental as well as computational biologists. To address this, we have examined the application of sequence similarity networks for visualizing functional trends across protein superfamilies from the context of sequence similarity. Using three large groups of homologous proteins of varying types of structural and functional diversity--GPCRs and kinases from humans, and the crotonase superfamily of enzymes--we show that overlaying networks with orthogonal information is a powerful approach for observing functional themes and revealing outliers. In comparison to other primary methods, networks provide both a good representation of group-wise sequence similarity relationships and a strong visual and quantitative correlation with phylogenetic trees, while enabling analysis and visualization of much larger sets of sequences than trees or multiple sequence alignments can easily accommodate. We also define important limitations and caveats in the application of these networks. As a broadly accessible and effective tool for the exploration of protein superfamilies, sequence similarity networks show great potential for generating testable hypotheses about protein structure-function relationships.

  10. Exploring the environmental diversity of kinetoplastid flagellates in the high-throughput DNA sequencing era

    Directory of Open Access Journals (Sweden)

    Claudia Masini d’Avila-Levy

    2015-01-01

    Full Text Available The class Kinetoplastea encompasses both free-living and parasitic species from a wide range of hosts. Several representatives of this group are responsible for severe human diseases and for economic losses in agriculture and livestock. While this group encompasses over 30 genera, most of the available information has been derived from the vertebrate pathogenic genera Leishmaniaand Trypanosoma.Recent studies of the previously neglected groups of Kinetoplastea indicated that the actual diversity is much higher than previously thought. This article discusses the known segment of kinetoplastid diversity and how gene-directed Sanger sequencing and next-generation sequencing methods can help to deepen our knowledge of these interesting protists.

  11. AST: an automated sequence-sampling method for improving the taxonomic diversity of gene phylogenetic trees.

    Science.gov (United States)

    Zhou, Chan; Mao, Fenglou; Yin, Yanbin; Huang, Jinling; Gogarten, Johann Peter; Xu, Ying

    2014-01-01

    A challenge in phylogenetic inference of gene trees is how to properly sample a large pool of homologous sequences to derive a good representative subset of sequences. Such a need arises in various applications, e.g. when (1) accuracy-oriented phylogenetic reconstruction methods may not be able to deal with a large pool of sequences due to their high demand in computing resources; (2) applications analyzing a collection of gene trees may prefer to use trees with fewer operational taxonomic units (OTUs), for instance for the detection of horizontal gene transfer events by identifying phylogenetic conflicts; and (3) the pool of available sequences is biased towards extensively studied species. In the past, the creation of subsamples often relied on manual selection. Here we present an Automated sequence-Sampling method for improving the Taxonomic diversity of gene phylogenetic trees, AST, to obtain representative sequences that maximize the taxonomic diversity of the sampled sequences. To demonstrate the effectiveness of AST, we have tested it to solve four problems, namely, inference of the evolutionary histories of the small ribosomal subunit protein S5 of E. coli, 16 S ribosomal RNAs and glycosyl-transferase gene family 8, and a study of ancient horizontal gene transfers from bacteria to plants. Our results show that the resolution of our computational results is almost as good as that of manual inference by domain experts, hence making the tool generally useful to phylogenetic studies by non-phylogeny specialists. The program is available at http://csbl.bmb.uga.edu/~zhouchan/AST.php.

  12. [Study on Microbial Diversity of Peri-implantitis Subgingival by High-throughput Sequencing].

    Science.gov (United States)

    Li, Zhi-jie; Wang, Shao-guo; Li, Yue-hong; Tu, Dong-xiang; Liu, Shi-yun; Nie, Hong-bing; Li, Zhi-qiang; Zhang, Ju-mei

    2015-07-01

    To study microbial diversity of peri-implantitis subgingival with high-throughput sequencing, and investigate microbiological etiology of peri-implantitis. Subgingival plaques were sampled from the patients with peri-implantitis (D group) and non-peri-implantitis subjects (N group). The microbiological diversity of the subgingival plaques was detected by sequencing V4 region of 16S rRNA with Illumina Miseq platform. The diversity of the community structure was analyzed using Mothur software. A total of 156 507 gene sequences were detected in nine samples and 4 402 operational taxonomic units (OTUs) were found. Selenomonas, Pseudomonas, and Fusobacterium were dominant bacteria in D group, while Fusobacterium, Veillonella and Streptococcus were dominant bacteria in N group. Differences between peri-implantitis and non-peri-implantitis bacterial communities were observed at all phylogenetic levels by LEfSe, which was also found in PcoA test. The occurrence of peri-implantitis is not only related to periodontitis pathogenic microbe, but also related with the changes of oral microbial community structure. Treponema, Herbaspirillum, Butyricimonas and Phaeobacte may be closely related to the occurrence and development of peri-implantitis.

  13. Strategies for achieving high sequencing accuracy for low diversity samples and avoiding sample bleeding using illumina platform.

    Science.gov (United States)

    Mitra, Abhishek; Skrzypczak, Magdalena; Ginalski, Krzysztof; Rowicka, Maga

    2015-01-01

    Sequencing microRNA, reduced representation sequencing, Hi-C technology and any method requiring the use of in-house barcodes result in sequencing libraries with low initial sequence diversity. Sequencing such data on the Illumina platform typically produces low quality data due to the limitations of the Illumina cluster calling algorithm. Moreover, even in the case of diverse samples, these limitations are causing substantial inaccuracies in multiplexed sample assignment (sample bleeding). Such inaccuracies are unacceptable in clinical applications, and in some other fields (e.g. detection of rare variants). Here, we discuss how both problems with quality of low-diversity samples and sample bleeding are caused by incorrect detection of clusters on the flowcell during initial sequencing cycles. We propose simple software modifications (Long Template Protocol) that overcome this problem. We present experimental results showing that our Long Template Protocol remarkably increases data quality for low diversity samples, as compared with the standard analysis protocol; it also substantially reduces sample bleeding for all samples. For comprehensiveness, we also discuss and compare experimental results from alternative approaches to sequencing low diversity samples. First, we discuss how the low diversity problem, if caused by barcodes, can be avoided altogether at the barcode design stage. Second and third, we present modified guidelines, which are more stringent than the manufacturer's, for mixing low diversity samples with diverse samples and lowering cluster density, which in our experience consistently produces high quality data from low diversity samples. Fourth and fifth, we present rescue strategies that can be applied when sequencing results in low quality data and when there is no more biological material available. In such cases, we propose that the flowcell be re-hybridized and sequenced again using our Long Template Protocol. Alternatively, we discuss how

  14. Strategies for achieving high sequencing accuracy for low diversity samples and avoiding sample bleeding using illumina platform.

    Directory of Open Access Journals (Sweden)

    Abhishek Mitra

    Full Text Available Sequencing microRNA, reduced representation sequencing, Hi-C technology and any method requiring the use of in-house barcodes result in sequencing libraries with low initial sequence diversity. Sequencing such data on the Illumina platform typically produces low quality data due to the limitations of the Illumina cluster calling algorithm. Moreover, even in the case of diverse samples, these limitations are causing substantial inaccuracies in multiplexed sample assignment (sample bleeding. Such inaccuracies are unacceptable in clinical applications, and in some other fields (e.g. detection of rare variants. Here, we discuss how both problems with quality of low-diversity samples and sample bleeding are caused by incorrect detection of clusters on the flowcell during initial sequencing cycles. We propose simple software modifications (Long Template Protocol that overcome this problem. We present experimental results showing that our Long Template Protocol remarkably increases data quality for low diversity samples, as compared with the standard analysis protocol; it also substantially reduces sample bleeding for all samples. For comprehensiveness, we also discuss and compare experimental results from alternative approaches to sequencing low diversity samples. First, we discuss how the low diversity problem, if caused by barcodes, can be avoided altogether at the barcode design stage. Second and third, we present modified guidelines, which are more stringent than the manufacturer's, for mixing low diversity samples with diverse samples and lowering cluster density, which in our experience consistently produces high quality data from low diversity samples. Fourth and fifth, we present rescue strategies that can be applied when sequencing results in low quality data and when there is no more biological material available. In such cases, we propose that the flowcell be re-hybridized and sequenced again using our Long Template Protocol. Alternatively

  15. Targeted genomic enrichment and sequencing of CyHV-3 from carp tissues confirms low nucleotide diversity and mixed genotype infections

    Directory of Open Access Journals (Sweden)

    Saliha Hammoumi

    2016-09-01

    Full Text Available Koi herpesvirus disease (KHVD is an emerging disease that causes mass mortality in koi and common carp, Cyprinus carpio L. Its causative agent is Cyprinid herpesvirus 3 (CyHV-3, also known as koi herpesvirus (KHV. Although data on the pathogenesis of this deadly virus is relatively abundant in the literature, still little is known about its genomic diversity and about the molecular mechanisms that lead to such a high virulence. In this context, we developed a new strategy for sequencing full-length CyHV-3 genomes directly from infected fish tissues. Total genomic DNA extracted from carp gill tissue was specifically enriched with CyHV-3 sequences through hybridization to a set of nearly 2 million overlapping probes designed to cover the entire genome length, using KHV-J sequence (GenBank accession number AP008984 as reference. Applied to 7 CyHV-3 specimens from Poland and Indonesia, this targeted genomic enrichment enabled recovery of the full genomes with >99.9% reference coverage. The enrichment rate was directly correlated to the estimated number of viral copies contained in the DNA extracts used for library preparation, which varied between ∼5000 and ∼2×107. The average sequencing depth was >200 for all samples, thus allowing the search for variants with high confidence. Sequence analyses highlighted a significant proportion of intra-specimen sequence heterogeneity, suggesting the presence of mixed infections in all investigated fish. They also showed that inter-specimen genetic diversity at the genome scale was very low (>99.95% of sequence identity. By enabling full genome comparisons directly from infected fish tissues, this new method will be valuable to trace outbreaks rapidly and at a reasonable cost, and in turn to understand the transmission routes of CyHV-3.

  16. Oral treponeme major surface protein: Sequence diversity and distributions within periodontal niches.

    Science.gov (United States)

    You, M; Chan, Y; Lacap-Bugler, D C; Huo, Y-B; Gao, W; Leung, W K; Watt, R M

    2017-12-01

    Treponema denticola and other species (phylotypes) of oral spirochetes are widely considered to play important etiological roles in periodontitis and other oral infections. The major surface protein (Msp) of T. denticola is directly implicated in several pathological mechanisms. Here, we have analyzed msp sequence diversity across 68 strains of oral phylogroup 1 and 2 treponemes; including reference strains of T. denticola, Treponema putidum, Treponema medium, 'Treponema vincentii', and 'Treponema sinensis'. All encoded Msp proteins contained highly conserved, taxon-specific signal peptides, and shared a predicted 'three-domain' structure. A clone-based strategy employing 'msp-specific' polymerase chain reaction primers was used to analyze msp gene sequence diversity present in subgingival plaque samples collected from a group of individuals with chronic periodontitis (n=10), vs periodontitis-free controls (n=10). We obtained 626 clinical msp gene sequences, which were assigned to 21 distinct 'clinical msp genotypes' (95% sequence identity cut-off). The most frequently detected clinical msp genotype corresponded to T. denticola ATCC 35405 T , but this was not correlated to disease status. UniFrac and libshuff analysis revealed that individuals with periodontitis and periodontitis-free controls harbored significantly different communities of treponeme clinical msp genotypes (Pdiversity than periodontitis-free controls (Mann-Whitney U-test, Pdiversity of Treponema clinical msp genotypes within their subgingival niches. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  17. Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis.

    Science.gov (United States)

    Zheng, Jin-Shuang; Sun, Cheng-Zhen; Zhang, Shu-Ning; Hou, Xi-Lin; Bonnema, Guusje

    2016-01-01

    A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention had been paid to the chromosomal distribution and cytogenetic diversity of these sequences. In this paper, we report the distribution characterization of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSR repeats with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes B. rapa ssp. chinensis.

  18. Phylogeny and genetic diversity of Bridgeoporus nobilissimus inferred using mitochondrial and nuclear rDNA sequences

    Science.gov (United States)

    Redberg, G.L.; Hibbett, D.S.; Ammirati, J.F.; Rodriguez, R.J.

    2003-01-01

    The genetic diversity and phylogeny of Bridgeoporus nobilissimus have been analyzed. DNA was extracted from spores collected from individual fruiting bodies representing six geographically distinct populations in Oregon and Washington. Spore samples collected contained low levels of bacteria, yeast and a filamentous fungal species. Using taxon-specific PCR primers, it was possible to discriminate among rDNA from bacteria, yeast, a filamentous associate and B. nobilissimus. Nuclear rDNA internal transcribed spacer (ITS) region sequences of B. nobilissimus were compared among individuals representing six populations and were found to have less than 2% variation. These sequences also were used to design dual and nested PCR primers for B. nobilissimus-specific amplification. Mitochondrial small-subunit rDNA sequences were used in a phylogenetic analysis that placed B. nobilissimus in the hymenochaetoid clade, where it was associated with Oxyporus and Schizopora.

  19. Twenty-one genome sequences from Pseudomonas species and 19 genome sequences from diverse bacteria isolated from the rhizosphere and endosphere of Populus deltoides.

    Science.gov (United States)

    Brown, Steven D; Utturkar, Sagar M; Klingeman, Dawn M; Johnson, Courtney M; Martin, Stanton L; Land, Miriam L; Lu, Tse-Yuan S; Schadt, Christopher W; Doktycz, Mitchel J; Pelletier, Dale A

    2012-11-01

    To aid in the investigation of the Populus deltoides microbiome, we generated draft genome sequences for 21 Pseudomonas strains and 19 other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium, and Variovorax were generated.

  20. Twenty-One Genome Sequences from Pseudomonas Species and 19 Genome Sequences from Diverse Bacteria Isolated from the Rhizosphere and Endosphere of Populus deltoides

    Energy Technology Data Exchange (ETDEWEB)

    Brown, Steven D [ORNL; Utturkar, Sagar M [ORNL; Klingeman, Dawn Marie [ORNL; Johnson, Courtney M [ORNL; Martin, Stanton [ORNL; Land, Miriam L [ORNL; Lu, Tse-Yuan [ORNL; Schadt, Christopher Warren [ORNL; Doktycz, Mitchel John [ORNL; Pelletier, Dale A [ORNL

    2012-01-01

    To aid in the investigation of the Populus deltoides microbiome we generated draft genome sequences for twenty one Pseudomonas and twenty one other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Burkholderia, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium and Variovorax were generated.

  1. Hunting down frame shifts: Ecological analysis of diverse functional gene sequences

    Directory of Open Access Journals (Sweden)

    Michal eStrejcek

    2015-11-01

    Full Text Available Functional gene ecological analyses using amplicon sequencing can be challenging as translated sequences are often burdened with shifted reading frames. The aim of this work was to evaluate several bioinformatics tools designed to correct errors which arise during sequencing in an effort to reduce the number of frame-shifts (FS. Genes encoding for alpha subunits of biphenyl (bphA and benzoate (benA dioxygenases were used as model sequences. FrameBot, a FS correction tool, was able to reduce the number of detected FS to zero. However, up to 43.1% of sequences were discarded by FrameBot as non-specific targets. Therefore, we proposed a de novo mode of FrameBot for FS correction, which works on a similar basis as common chimera identifying platforms and is not dependent on reference sequences. By nature of FrameBot de novo design, it is crucial to provide it with data as error free as possible. We tested the ability of several publicly available correction tools to decrease the number of errors in the data sets. The combination of Maximum Expected Error (MEE filtering and single linkage pre-clustering (SLP proved the most efficient read procession. Applying FrameBot de novo on the processed data enabled analysis of BphA sequences with minimal losses of potentially functional sequences not homologous to those previously known. This experiment also demonstrated the extensive diversity of dioxygenases in soil. A script which performs FrameBot de novo is presented in the supplementary material to the study and the tool was implemented into FunGene Pipeline available at http://fungene.cme.msu.edu/FunGenePipeline/ and https://github.com/rdpstaff/Framebot.

  2. Multilocus sequence analysis (MLSA) of Bradyrhizobium strains: revealing high diversity of tropical diazotrophic symbiotic bacteria.

    Science.gov (United States)

    Delamuta, Jakeline Renata Marçon; Ribeiro, Renan Augusto; Menna, Pâmela; Bangel, Eliane Villamil; Hungria, Mariangela

    2012-04-01

    Symbiotic association of several genera of bacteria collectively called as rhizobia and plants belonging to the family Leguminosae (=Fabaceae) results in the process of biological nitrogen fixation, playing a key role in global N cycling, and also bringing relevant contributions to the agriculture. Bradyrhizobium is considered as the ancestral of all nitrogen-fixing rhizobial species, probably originated in the tropics. The genus encompasses a variety of diverse bacteria, but the diversity captured in the analysis of the 16S rRNA is often low. In this study, we analyzed twelve Bradyrhizobium strains selected from previous studies performed by our group for showing high genetic diversity in relation to the described species. In addition to the 16S rRNA, five housekeeping genes (recA, atpD, glnII, gyrB and rpoB) were analyzed in the MLSA (multilocus sequence analysis) approach. Analysis of each gene and of the concatenated housekeeping genes captured a considerably higher level of genetic diversity, with indication of putative new species. The results highlight the high genetic variability associated with Bradyrhizobium microsymbionts of a variety of legumes. In addition, the MLSA approach has proved to represent a rapid and reliable method to be employed in phylogenetic and taxonomic studies, speeding the identification of the still poorly known diversity of nitrogen-fixing rhizobia in the tropics.

  3. Determining Clostridium difficile intra-taxa diversity by mining multilocus sequence typing databases.

    Science.gov (United States)

    Muñoz, Marina; Ríos-Chaparro, Dora Inés; Patarroyo, Manuel Alfonso; Ramírez, Juan David

    2017-03-14

    Multilocus sequence typing (MLST) is a highly discriminatory typing strategy; it is reproducible and scalable. There is a MLST scheme for Clostridium difficile (CD), a gram positive bacillus causing different pathologies of the gastrointestinal tract. This work was aimed at describing the frequency of sequence types (STs) and Clades (C) reported and evalute the intra-taxa diversity in the CD MLST database (CD-MLST-db) using an MLSA approach. Analysis of 1778 available isolates showed that clade 1 (C1) was the most frequent worldwide (57.7%), followed by C2 (29.1%). Regarding sequence types (STs), it was found that ST-1, belonging to C2, was the most frequent. The isolates analysed came from 17 countries, mostly from the United Kingdom (UK) (1541 STs, 87.0%). The diversity of the seven housekeeping genes in the MLST scheme was evaluated, and alleles from the profiles (STs), for identifying CD population structure. It was found that adk and atpA are conserved genes allowing a limited amount of clusters to be discriminated; however, different genes such as drx, glyA and particularly sodA showed high diversity indexes and grouped CD populations in many clusters, suggesting that these genes' contribution to CD typing should be revised. It was identified that CD STs reported to date have a mostly clonal population structure with foreseen events of recombination; however, one group of STs was not assigned to a clade being highly different containing at least nine well-supported clusters, suggesting a greater amount of clades for CD. This study shows the usefulness of CD-MLST-db as a tool for studying CD distribution and population structure, identifying the need for reviewing the usefulness of sodA as housekeeping gene within the MLST scheme and suggesting the existence of a greater amount of CD clades. The study also shows the plausible exchange of genetic material between STs, contributing towards intra-taxa genetic diversity.

  4. A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

    Directory of Open Access Journals (Sweden)

    Glass John I

    2010-07-01

    Full Text Available Abstract Background Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT. Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the

  5. A comparison of genotyping-by-sequencing analysis methods on low-coverage crop datasets shows advantages of a new workflow, GB-eaSy.

    Science.gov (United States)

    Wickland, Daniel P; Battu, Gopal; Hudson, Karen A; Diers, Brian W; Hudson, Matthew E

    2017-12-28

    Genotyping-by-sequencing (GBS), a method to identify genetic variants and quickly genotype samples, reduces genome complexity by using restriction enzymes to divide the genome into fragments whose ends are sequenced on short-read sequencing platforms. While cost-effective, this method produces extensive missing data and requires complex bioinformatics analysis. GBS is most commonly used on crop plant genomes, and because crop plants have highly variable ploidy and repeat content, the performance of GBS analysis software can vary by target organism. Here we focus our analysis on soybean, a polyploid crop with a highly duplicated genome, relatively little public GBS data and few dedicated tools. We compared the performance of five GBS pipelines using low-coverage Illumina sequence data from three soybean populations. To address issues identified with existing methods, we developed GB-eaSy, a GBS bioinformatics workflow that incorporates widely used genomics tools, parallelization and automation to increase the accuracy and accessibility of GBS data analysis. Compared to other GBS pipelines, GB-eaSy rapidly and accurately identified the greatest number of SNPs, with SNP calls closely concordant with whole-genome sequencing of selected lines. Across all five GBS analysis platforms, SNP calls showed unexpectedly low convergence but generally high accuracy, indicating that the workflows arrived at largely complementary sets of valid SNP calls on the low-coverage data analyzed. We show that GB-eaSy is approximately as good as, or better than, other leading software solutions in the accuracy, yield and missing data fraction of variant calling, as tested on low-coverage genomic data from soybean. It also performs well relative to other solutions in terms of the run time and disk space required. In addition, GB-eaSy is built from existing open-source, modular software packages that are regularly updated and commonly used, making it straightforward to install and maintain

  6. A transcriptome resource for the koala (Phascolarctos cinereus): insights into koala retrovirus transcription and sequence diversity.

    Science.gov (United States)

    Hobbs, Matthew; Pavasovic, Ana; King, Andrew G; Prentis, Peter J; Eldridge, Mark D B; Chen, Zhiliang; Colgan, Donald J; Polkinghorne, Adam; Wilkins, Marc R; Flanagan, Cheyne; Gillett, Amber; Hanger, Jon; Johnson, Rebecca N; Timms, Peter

    2014-09-11

    The koala, Phascolarctos cinereus, is a biologically unique and evolutionarily distinct Australian arboreal marsupial. The goal of this study was to sequence the transcriptome from several tissues of two geographically separate koalas, and to create the first comprehensive catalog of annotated transcripts for this species, enabling detailed analysis of the unique attributes of this threatened native marsupial, including infection by the koala retrovirus. RNA-Seq data was generated from a range of tissues from one male and one female koala and assembled de novo into transcripts using Velvet-Oases. Transcript abundance in each tissue was estimated. Transcripts were searched for likely protein-coding regions and a non-redundant set of 117,563 putative protein sequences was produced. In similarity searches there were 84,907 (72%) sequences that aligned to at least one sequence in the NCBI nr protein database. The best alignments were to sequences from other marsupials. After applying a reciprocal best hit requirement of koala sequences to those from tammar wallaby, Tasmanian devil and the gray short-tailed opossum, we estimate that our transcriptome dataset represents approximately 15,000 koala genes. The marsupial alignment information was used to look for potential gene duplications and we report evidence for copy number expansion of the alpha amylase gene, and of an aldehyde reductase gene.Koala retrovirus (KoRV) transcripts were detected in the transcriptomes. These were analysed in detail and the structure of the spliced envelope gene transcript was determined. There was appreciable sequence diversity within KoRV, with 233 sites in the KoRV genome showing small insertions/deletions or single nucleotide polymorphisms. Both koalas had sequences from the KoRV-A subtype, but the male koala transcriptome has, in addition, sequences more closely related to the KoRV-B subtype. This is the first report of a KoRV-B-like sequence in a wild population. This transcriptomic

  7. Combining genomic sequencing methods to explore viral diversity and reveal potential virus-host interactions

    Directory of Open Access Journals (Sweden)

    Cheryl-Emiliane Tien Chow

    2015-04-01

    Full Text Available Viral diversity and virus-host interactions in oxygen-starved regions of the ocean, also known as oxygen minimum zones (OMZs, remain relatively unexplored. Microbial community metabolism in OMZs alters nutrient and energy flow through marine food webs, resulting in biological nitrogen loss and greenhouse gas production. Thus, viruses infecting OMZ microbes have the potential to modulate community metabolism with resulting feedback on ecosystem function. Here, we describe viral communities inhabiting oxic surface (10m and oxygen-starved basin (200m waters of Saanich Inlet, a seasonally anoxic fjord on the coast of Vancouver Island, British Columbia using viral metagenomics and complete viral fosmid sequencing on samples collected between April 2007 and April 2010. Of 6459 open reading frames (ORFs predicted across all 34 viral fosmids, 77.6% (n=5010 had no homology to reference viral genomes. These fosmids recruited a higher proportion of viral metagenomic sequences from Saanich Inlet than from nearby northeastern subarctic Pacific Ocean (Line P waters, indicating differences in the viral communities between coastal and open ocean locations. While functional annotations of fosmid ORFs were limited, recruitment to NCBI’s non-redundant ‘nr’ database and publicly available single-cell genomes identified putative viruses infecting marine thaumarchaeal and SUP05 proteobacteria to provide potential host linkages with relevance to coupled biogeochemical cycling processes in OMZ waters. Taken together, these results highlight the power of coupled analyses of multiple sequence data types, such as viral metagenomic and fosmid sequence data with prokaryotic single cell genomes, to chart viral diversity, elucidate genomic and ecological contexts for previously unclassifiable viral sequences, and identify novel host interactions in natural and engineered ecosystems.

  8. ITS2 sequence-structure phylogeny reveals diverse endophytic Pseudocercospora fungi on poplars.

    Science.gov (United States)

    Yan, Dong-Hui; Gao, Qian; Sun, Xiaoming; Song, Xiaoyu; Li, Hongchang

    2018-04-01

    For matching the new fungal nomenclature to abolish pleomorphic names for a fungus, a genus Pseudocercospora s. str. was suggested to host holomorphic Pseudocercosproa fungi. But the Pseudocercosproa fungi need extra phylogenetic loci to clarify their taxonomy and diversity for their existing and coming species. Internal transcribed spacer 2 (ITS2) secondary structures have been promising in charactering species phylogeny in plants, animals and fungi. In present study, a conserved model of ITS2 secondary structures was confirmed on fungi in Pseudocercospora s. str. genus using RNAshape program. The model has a typical eukaryotic four-helix ITS2 secondary structure. But a single U base occurred in conserved motif of U-U mismatch in Helix 2, and a UG emerged in UGGU motif in Helix 3 to Pseudocercospora fungi. The phylogeny analyses based on the ITS2 sequence-secondary structures with compensatory base change characterizations are able to delimit more species for Pseudocercospora s. str. than phylogenic inferences of traditional multi-loci alignments do. The model was employed to explore the diversity of endophytic Pseudocercospora fungi in poplar trees. The analysis results also showed that endophytic Pseudocercospora fungi were diverse in species and evolved a specific lineage in poplar trees. This work suggested that ITS2 sequence-structures could become as additionally significant loci for species phylogenetic and taxonomic studies on Pseudocerospora fungi, and that Pseudocercospora endophytes could be important roles to Pseudocercospora fungi's evolution and function in ecology.

  9. High diversity of picornaviruses in rats from different continents revealed by deep sequencing.

    Science.gov (United States)

    Hansen, Thomas Arn; Mollerup, Sarah; Nguyen, Nam-Phuong; White, Nicole E; Coghlan, Megan; Alquezar-Planas, David E; Joshi, Tejal; Jensen, Randi Holm; Fridholm, Helena; Kjartansdóttir, Kristín Rós; Mourier, Tobias; Warnow, Tandy; Belsham, Graham J; Bunce, Michael; Willerslev, Eske; Nielsen, Lars Peter; Vinner, Lasse; Hansen, Anders Johannes

    2016-08-17

    Outbreaks of zoonotic diseases in humans and livestock are not uncommon, and an important component in containment of such emerging viral diseases is rapid and reliable diagnostics. Such methods are often PCR-based and hence require the availability of sequence data from the pathogen. Rattus norvegicus (R. norvegicus) is a known reservoir for important zoonotic pathogens. Transmission may be direct via contact with the animal, for example, through exposure to its faecal matter, or indirectly mediated by arthropod vectors. Here we investigated the viral content in rat faecal matter (n=29) collected from two continents by analyzing 2.2 billion next-generation sequencing reads derived from both DNA and RNA. Among other virus families, we found sequences from members of the Picornaviridae to be abundant in the microbiome of all the samples. Here we describe the diversity of the picornavirus-like contigs including near-full-length genomes closely related to the Boone cardiovirus and Theiler's encephalomyelitis virus. From this study, we conclude that picornaviruses within R. norvegicus are more diverse than previously recognized. The virome of R. norvegicus should be investigated further to assess the full potential for zoonotic virus transmission.

  10. [Bacterial diversity in sequencing batch biofilm reactor (SBBR) for landfill leachate treatment using PCR-DGGE].

    Science.gov (United States)

    Xiao, Yong; Yang, Zhao-hui; Zeng, Guang-ming; Ma, Yan-he; Liu, You-sheng; Wang, Rong-juan; Xu, Zheng-yong

    2007-05-01

    For studying the bacterial diversity and the mechanism of denitrification in sequencing bath biofilm reactor (SBBR) treating landfill leachate to provide microbial evidence for technique improvements, total microbial DNA was extracted from samples which were collected from natural landfill leachate and biofilm of a SBBR that could efficiently remove NH4+ -N and COD of high concentration. 16S rDNA fragments were amplified from the total DNA successfully using a pair of universal bacterial 16S rDNA primer, GC341F and 907R, and then were used for denaturing gradient gel electrophoresis (DGGE) analysis. The bands in the gel were analyzed by statistical methods and excided from the gel for sequencing, and the sequences were used for homology analysis and then two phylogenetic trees were constructed using DNAStar software. Results indicated that the bacterial diversity of the biofilm in SBBR and the landfill leachate was abundant, and no obvious change of community structure happened during running in the biofilm, in which most bacteria came from the landfill leachate. There may be three different modes of denitrification in the reactor because several different nitrifying bacteria, denitrifying bacteria and anaerobic ammonia oxidation bacteria coexisted in it. The results provided some valuable references for studying microbiological mechanism of denitrification in SBBR.

  11. Prevalence of single nucleotide polymorphism among 27 diverse alfalfa genotypes as assessed by transcriptome sequencing

    Directory of Open Access Journals (Sweden)

    Li Xuehui

    2012-10-01

    Full Text Available Abstract Background Alfalfa, a perennial, outcrossing species, is a widely planted forage legume producing highly nutritious biomass. Currently, improvement of cultivated alfalfa mainly relies on recurrent phenotypic selection. Marker assisted breeding strategies can enhance alfalfa improvement efforts, particularly if many genome-wide markers are available. Transcriptome sequencing enables efficient high-throughput discovery of single nucleotide polymorphism (SNP markers for a complex polyploid species. Result The transcriptomes of 27 alfalfa genotypes, including elite breeding genotypes, parents of mapping populations, and unimproved wild genotypes, were sequenced using an Illumina Genome Analyzer IIx. De novo assembly of quality-filtered 72-bp reads generated 25,183 contigs with a total length of 26.8 Mbp and an average length of 1,065 bp, with an average read depth of 55.9-fold for each genotype. Overall, 21,954 (87.2% of the 25,183 contigs represented 14,878 unique protein accessions. Gene ontology (GO analysis suggested that a broad diversity of genes was represented in the resulting sequences. The realignment of individual reads to the contigs enabled the detection of 872,384 SNPs and 31,760 InDels. High resolution melting (HRM analysis was used to validate 91% of 192 putative SNPs identified by sequencing. Both allelic variants at about 95% of SNP sites identified among five wild, unimproved genotypes are still present in cultivated alfalfa, and all four US breeding programs also contain a high proportion of these SNPs. Thus, little evidence exists among this dataset for loss of significant DNA sequence diversity from either domestication or breeding of alfalfa. Structure analysis indicated that individuals from the subspecies falcata, the diploid subspecies caerulea, and the tetraploid subspecies sativa (cultivated tetraploid alfalfa were clearly separated. Conclusion We used transcriptome sequencing to discover large numbers of SNPs

  12. Diversity and functions of bacterial community in drinking water biofilms revealed by high-throughput sequencing

    Science.gov (United States)

    Chao, Yuanqing; Mao, Yanping; Wang, Zhiping; Zhang, Tong

    2015-06-01

    The development of biofilms in drinking water (DW) systems may cause various problems to water quality. To investigate the community structure of biofilms on different pipe materials and the global/specific metabolic functions of DW biofilms, PCR-based 454 pyrosequencing data for 16S rRNA genes and Illumina metagenomic data were generated and analysed. Considerable differences in bacterial diversity and taxonomic structure were identified between biofilms formed on stainless steel and biofilms formed on plastics, indicating that the metallic materials facilitate the formation of higher diversity biofilms. Moreover, variations in several dominant genera were observed during biofilm formation. Based on PCA analysis, the global functions in the DW biofilms were similar to other DW metagenomes. Beyond the global functions, the occurrences and abundances of specific protective genes involved in the glutathione metabolism, the SoxRS system, the OxyR system, RpoS regulated genes, and the production/degradation of extracellular polymeric substances were also evaluated. A near-complete and low-contamination draft genome was constructed from the metagenome of the DW biofilm, based on the coverage and tetranucleotide frequencies, and identified as a Bradyrhizobiaceae-like bacterium according to a phylogenetic analysis. Our findings provide new insight into DW biofilms, especially in terms of their metabolic functions.

  13. Analysis of Plasmodium falciparum diversity in natural infections by deep sequencing

    Science.gov (United States)

    Manske, Magnus; Miotto, Olivo; Campino, Susana; Auburn, Sarah; Almagro-Garcia, Jacob; Maslen, Gareth; O’Brien, Jack; Djimde, Abdoulaye; Doumbo, Ogobara; Zongo, Issaka; Ouedraogo, Jean-Bosco; Michon, Pascal; Mueller, Ivo; Siba, Peter; Nzila, Alexis; Borrmann, Steffen; Kiara, Steven M.; Marsh, Kevin; Jiang, Hongying; Su, Xin-Zhuan; Amaratunga, Chanaki; Fairhurst, Rick; Socheat, Duong; Nosten, Francois; Imwong, Mallika; White, Nicholas J.; Sanders, Mandy; Anastasi, Elisa; Alcock, Dan; Drury, Eleanor; Oyola, Samuel; Quail, Michael A.; Turner, Daniel J.; Rubio, Valentin Ruano; Jyothi, Dushyanth; Amenga-Etego, Lucas; Hubbart, Christina; Jeffreys, Anna; Rowlands, Kate; Sutherland, Colin; Roper, Cally; Mangano, Valentina; Modiano, David; Tan, John C.; Ferdig, Michael T.; Amambua-Ngwa, Alfred; Conway, David J.; Takala-Harrison, Shannon; Plowe, Christopher V.; Rayner, Julian C.; Rockett, Kirk A.; Clark, Taane G.; Newbold, Chris I.; Berriman, Matthew; MacInnis, Bronwyn; Kwiatkowski, Dominic P.

    2013-01-01

    Malaria elimination strategies require surveillance of the parasite population for genetic changes that demand a public health response, such as new forms of drug resistance. 1,2 Here we describe methods for large-scale analysis of genetic variation in Plasmodium falciparum by deep sequencing of parasite DNA obtained from the blood of patients with malaria, either directly or after short term culture. Analysis of 86,158 exonic SNPs that passed genotyping quality control in 227 samples from Africa, Asia and Oceania provides genome-wide estimates of allele frequency distribution, population structure and linkage disequilibrium. By comparing the genetic diversity of individual infections with that of the local parasite population, we derive a metric of within-host diversity that is related to the level of inbreeding in the population. An open-access web application has been established for exploration of regional differences in allele frequency and of highly differentiated loci in the P. falciparum genome. PMID:22722859

  14. Molecular detection and sequence characterization of diverse rhabdoviruses in bats, China.

    Science.gov (United States)

    Xu, Lin; Wu, Jianmin; Jiang, Tinglei; Qin, Shaomin; Xia, Lele; Li, Xingyu; He, Biao; Tu, Changchun

    2018-01-15

    The Rhabdoviridae is among the most diverse families of RNA viruses and currently classified into 18 genera with some rhabdoviruses lethal to humans and other animals. Herein, we describe genetic characterization of three novel rhabdoviruses from bats in China. Of these, two viruses (Jinghong bat virus and Benxi bat virus) found in Rhinolophus bats showed a phylogenetic relationship with vesiculoviruses, and sequence analyses indicate that they represent two new species within the genus Vesiculovirus. The remaining Yangjiang bat virus found in Hipposideros larvatus bats were only distantly related to currently known rhabdoviruses. Copyright © 2017 Elsevier B.V. All rights reserved.

  15. Photobiont diversity in lichens from metal-rich substrata based on ITS rDNA sequences.

    Science.gov (United States)

    Backor, Martin; Peksa, Ondrej; Skaloud, Pavel; Backorová, Miriam

    2010-05-01

    The photobiont is considered as the more sensitive partner of lichen symbiosis in metal pollution. For this reason the presence of a metal tolerant photobiont in lichens may be a key factor of ecological success of lichens growing on metal polluted substrata. The photobiont inventory was examined for terricolous lichen community growing in Cu mine-spoil heaps derived by historical mining. Sequences of internal transcribed spacer (ITS) were phylogenetically analyzed using maximum likelihood analyses. A total of 50 ITS algal sequences were obtained from 22 selected lichen taxa collected at three Cu mine-spoil heaps and two control localities. Algae associated with Cladonia and Stereocaulon were identified as members of several Asterochloris lineages, photobionts of cetrarioid lichens clustered with Trebouxia hypogymniae ined. We did not find close relationship between heavy metal content (in localities as well as lichen thalli) and photobiont diversity. Presence of multiple algal genotypes in single lichen thallus has been confirmed. Copyright 2009 Elsevier Inc. All rights reserved.

  16. Molecular sequence data of hepatitis B virus and genetic diversity after vaccination.

    Science.gov (United States)

    van Ballegooijen, W Marijn; van Houdt, Robin; Bruisten, Sylvia M; Boot, Hein J; Coutinho, Roel A; Wallinga, Jacco

    2009-12-15

    The effect of vaccination programs on transmission of infectious disease is usually assessed by monitoring programs that rely on notifications of symptomatic illness. For monitoring of infectious diseases with a high proportion of asymptomatic cases or a low reporting rate, molecular sequence data combined with modern coalescent-based techniques offer a complementary tool to assess transmission. Here, the authors investigate the added value of using viral sequence data to monitor a vaccination program that was started in 1998 and was targeted against hepatitis B virus in men who have sex with men in Amsterdam, the Netherlands. The incidence in this target group, as estimated from the notifications of acute infections with hepatitis B virus, was low; therefore, there was insufficient power to show a significant change in incidence. In contrast, the genetic diversity, as estimated from the viral sequence collected from the target group, revealed a marked decrease after vaccination was introduced. Taken together, the findings suggest that introduction of vaccination coincided with a change in the target group toward behavior with a higher risk of infection. The authors argue that molecular sequence data provide a powerful additional monitoring instrument, next to conventional case registration, for assessing the impact of vaccination.

  17. Genome Microscale Heterogeneity among Wild Potatoes Revealed by Diversity Arrays Technology Marker Sequences

    Directory of Open Access Journals (Sweden)

    Alessandra Traini

    2013-01-01

    Full Text Available Tuber-bearing potato species possess several genes that can be exploited to improve the genetic background of the cultivated potato Solanum tuberosum. Among them, S. bulbocastanum and S. commersonii are well known for their strong resistance to environmental stresses. However, scant information is available for these species in terms of genome organization, gene function, and regulatory networks. Consequently, genomic tools to assist breeding are meager, and efficient exploitation of these species has been limited so far. In this paper, we employed the reference genome sequences from cultivated potato and tomato and a collection of sequences of 1,423 potato Diversity Arrays Technology (DArT markers that show polymorphic representation across the genomes of S. bulbocastanum and/or S. commersonii genotypes. Our results highlighted microscale genome sequence heterogeneity that may play a significant role in functional and structural divergence between related species. Our analytical approach provides knowledge of genome structural and sequence variability that could not be detected by transcriptome and proteome approaches.

  18. Genome Microscale Heterogeneity among Wild Potatoes Revealed by Diversity Arrays Technology Marker Sequences.

    Science.gov (United States)

    Traini, Alessandra; Iorizzo, Massimo; Mann, Harpartap; Bradeen, James M; Carputo, Domenico; Frusciante, Luigi; Chiusano, Maria Luisa

    2013-01-01

    Tuber-bearing potato species possess several genes that can be exploited to improve the genetic background of the cultivated potato Solanum tuberosum. Among them, S. bulbocastanum and S. commersonii are well known for their strong resistance to environmental stresses. However, scant information is available for these species in terms of genome organization, gene function, and regulatory networks. Consequently, genomic tools to assist breeding are meager, and efficient exploitation of these species has been limited so far. In this paper, we employed the reference genome sequences from cultivated potato and tomato and a collection of sequences of 1,423 potato Diversity Arrays Technology (DArT) markers that show polymorphic representation across the genomes of S. bulbocastanum and/or S. commersonii genotypes. Our results highlighted microscale genome sequence heterogeneity that may play a significant role in functional and structural divergence between related species. Our analytical approach provides knowledge of genome structural and sequence variability that could not be detected by transcriptome and proteome approaches.

  19. Sequence diversity and copy number variation of Mutator-like transposases in wheat

    Directory of Open Access Journals (Sweden)

    Nobuaki Asakura

    2008-01-01

    Full Text Available Partial transposase-coding sequences of Mutator-like elements (MULEs were isolated from a wild einkorn wheat, Triticum urartu, by degenerate PCR. The isolated sequences were classified into a MuDR or Class I clade and divided into two distinct subclasses (subclass I and subclass II. The average pair-wise identity between members of both subclasses was 58.8% at the nucleotide sequence level. Sequence diversity of subclass I was larger than that of subclass II. DNA gel blot analysis showed that subclass I was present as low copy number elements in the genomes of all Triticum and Aegilops accessions surveyed, while subclass II was present as high copy number elements. These two subclasses seemed uncapable of recognizing each other for transposition. The number of copies of subclass II elements was much higher in Aegilops with the S, Sl and D genomes and polyploid Triticum species than in diploid Triticum with the A genome, indicating that active transposition occurred in S, Sl and D genomes before polyploidization. DNA gel blot analysis of six species selected from three subfamilies of Poaceae demonstrated that only the tribe Triticeae possessed both subclasses. These results suggest that the differentiation of these two subclasses occurred before or immediately after the establishment of the tribe Triticeae.

  20. Enrichment allows identification of diverse, rare elements in metagenomic resistome-virulome sequencing.

    Science.gov (United States)

    Noyes, Noelle R; Weinroth, Maggie E; Parker, Jennifer K; Dean, Chris J; Lakin, Steven M; Raymond, Robert A; Rovira, Pablo; Doster, Enrique; Abdo, Zaid; Martin, Jennifer N; Jones, Kenneth L; Ruiz, Jaime; Boucher, Christina A; Belk, Keith E; Morley, Paul S

    2017-10-17

    Shotgun metagenomic sequencing is increasingly utilized as a tool to evaluate ecological-level dynamics of antimicrobial resistance and virulence, in conjunction with microbiome analysis. Interest in use of this method for environmental surveillance of antimicrobial resistance and pathogenic microorganisms is also increasing. In published metagenomic datasets, the total of all resistance- and virulence-related sequences accounts for enrichment system that incorporates unique molecular indices to count DNA molecules and correct for enrichment bias. The use of the bait-capture and enrichment system significantly increased on-target sequencing of the resistome-virulome, enabling detection of an additional 1441 gene accessions and revealing a low-abundance portion of the resistome-virulome that was more diverse and compositionally different than that detected by more traditional metagenomic assays. The low-abundance portion of the resistome-virulome also contained resistance genes with public health importance, such as extended-spectrum betalactamases, that were not detected using traditional shotgun metagenomic sequencing. In addition, the use of the bait-capture and enrichment system enabled identification of rare resistance gene haplotypes that were used to discriminate between sample origins. These results demonstrate that the rare resistome-virulome contains valuable and unique information that can be utilized for both surveillance and population genetic investigations of resistance. Access to the rare resistome-virulome using the bait-capture and enrichment system validated in this study can greatly advance our understanding of microbiome-resistome dynamics.

  1. On the Use of Diversity Measures in Longitudinal Sequencing Studies of Microbial Communities.

    Science.gov (United States)

    Wagner, Brandie D; Grunwald, Gary K; Zerbe, Gary O; Mikulich-Gilbertson, Susan K; Robertson, Charles E; Zemanick, Edith T; Harris, J Kirk

    2018-01-01

    Identification of the majority of organisms present in human-associated microbial communities is feasible with the advent of high throughput sequencing technology. As substantial variability in microbiota communities is seen across subjects, the use of longitudinal study designs is important to better understand variation of the microbiome within individual subjects. Complex study designs with longitudinal sample collection require analytic approaches to account for this additional source of variability. A common approach to assessing community changes is to evaluate the change in alpha diversity (the variety and abundance of organisms in a community) over time. However, there are several commonly used alpha diversity measures and the use of different measures can result in different estimates of magnitude of change and different inferences. It has recently been proposed that diversity profile curves are useful for clarifying these differences, and may provide a more complete picture of the community structure. However, it is unclear how to utilize these curves when interest is in evaluating changes in community structure over time. We propose the use of a bi-exponential function in a longitudinal model that accounts for repeated measures on each subject to compare diversity profiles over time. Furthermore, it is possible that no change in alpha diversity (single community/sample) may be observed despite the presence of a highly divergent community composition. Thus, it is also important to use a beta diversity measure (similarity between multiple communities/samples) that captures changes in community composition. Ecological methods developed to evaluate temporal turnover have currently only been applied to investigate changes of a single community over time. We illustrate the extension of this approach to multiple communities of interest (i.e., subjects) by modeling the beta diversity measure over time. With this approach, a rate of change in community

  2. Exploring fungal diversity in deep-sea sediments from Okinawa Trough using high-throughput Illumina sequencing

    Science.gov (United States)

    Zhang, Xiao-Yong; Wang, Guang-Hua; Xu, Xin-Ya; Nong, Xu-Hua; Wang, Jie; Amin, Muhammad; Qi, Shu-Hua

    2016-10-01

    The present study investigated the fungal diversity in four different deep-sea sediments from Okinawa Trough using high-throughput Illumina sequencing of the nuclear ribosomal internal transcribed spacer-1 (ITS1). A total of 40,297 fungal ITS1 sequences clustered into 420 operational taxonomic units (OTUs) with 97% sequence similarity and 170 taxa were recovered from these sediments. Most ITS1 sequences (78%) belonged to the phylum Ascomycota, followed by Basidiomycota (17.3%), Zygomycota (1.5%) and Chytridiomycota (0.8%), and a small proportion (2.4%) belonged to unassigned fungal phyla. Compared with previous studies on fungal diversity of sediments from deep-sea environments by culture-dependent approach and clone library analysis, the present result suggested that Illumina sequencing had been dramatically accelerating the discovery of fungal community of deep-sea sediments. Furthermore, our results revealed that Sordariomycetes was the most diverse and abundant fungal class in this study, challenging the traditional view that the diversity of Sordariomycetes phylotypes was low in the deep-sea environments. In addition, more than 12 taxa accounted for 21.5% sequences were found to be rarely reported as deep-sea fungi, suggesting the deep-sea sediments from Okinawa Trough harbored a plethora of different fungal communities compared with other deep-sea environments. To our knowledge, this study is the first exploration of the fungal diversity in deep-sea sediments from Okinawa Trough using high-throughput Illumina sequencing.

  3. High levels of diversity characterize mandrill (Mandrillus sphinx) Mhc-DRB sequences.

    Science.gov (United States)

    Abbott, Kristin M; Wickings, E Jean; Knapp, Leslie A

    2006-08-01

    The major histocompatibility complex (MHC) is highly polymorphic in most primate species studied thus far. The rhesus macaque (Macaca mulatta) has been studied extensively and the Mhc-DRB region demonstrates variability similar to humans. The extent of MHC diversity is relatively unknown for other Old World monkeys (OWM), especially among genera other than Macaca. A molecular survey of the Mhc-DRB region in mandrills (Mandrillus sphinx) revealed extensive variability, suggesting that other OWMs may also possess high levels of Mhc-DRB polymorphism. In the present study, 33 Mhc-DRB loci were identified from only 13 animals. Eleven were wild-born and presumed to be unrelated and two were captive-born twins. Two to seven different sequences were identified for each individual, suggesting that some mandrills may have as many as four Mhc-DRB loci on a single haplotype. From these sequences, representatives of at least six Mhc-DRB loci or lineages were identified. As observed in other primates, some new lineages may have arisen through the process of gene conversion. These findings indicate that mandrills have Mhc-DRB diversity not unlike rhesus macaques and humans.

  4. [Influence of PCR cycle number on microbial diversity analysis through next generation sequencing].

    Science.gov (United States)

    An, Yunhe; Gao, Lijuan; Li, Junbo; Tian, Yanjie; Wang, Jinlong; Zheng, Xuejuan; Wu, Huijuan

    2016-08-25

    Using of high throughput sequencing technology to study the microbial diversity in complex samples has become one of the hottest issues in the field of microbial diversity research. In this study, the soil and sheep rumen chyme samples were used to extract DNA, respectively. Then the 25 ng total DNA was used to amplify the 16S rRNA V3 region with 20, 25, 30 PCR cycles, and the final sequencing library was constructed by mixing equal amounts of purified PCR products. Finally, the operational taxonomic unit (OUT) amount, rarefaction curve, microbial number and species were compared through data analysis. It was found that at the same amount of DNA template, the proportion of the community composition was not the best with more numbers of PCR cycle, although the species number was much more. In all, when the PCR cycle number is 25, the number of species and proportion of the community composition were the most optimal both in soil or chyme samples.

  5. Extracellular DNA amplicon sequencing reveals high levels of benthic eukaryotic diversity in the central Red Sea

    KAUST Repository

    Pearman, John K.

    2015-11-01

    The present study aims to characterize the benthic eukaryotic biodiversity patterns at a coarse taxonomic level in three areas of the central Red Sea (a lagoon, an offshore area in Thuwal and a shallow coastal area near Jeddah) based on extracellular DNA. High-throughput amplicon sequencing targeting the V9 region of the 18S rRNA gene was undertaken for 32 sediment samples. High levels of alpha-diversity were detected with 16,089 operational taxonomic units (OTUs) being identified. The majority of the OTUs were assigned to Metazoa (29.2%), Alveolata (22.4%) and Stramenopiles (17.8%). Stramenopiles (Diatomea) and Alveolata (Ciliophora) were frequent in a lagoon and in shallower coastal stations, whereas metazoans (Arthropoda: Maxillopoda) were dominant in deeper offshore stations. Only 24.6% of total OTUs were shared among all areas. Beta-diversity was generally lower between the lagoon and Jeddah (nearshore) than between either of those and the offshore area, suggesting a nearshore–offshore biodiversity gradient. The current approach allowed for a broad-range of benthic eukaryotic biodiversity to be analysed with significantly less labour than would be required by other traditional taxonomic approaches. Our findings suggest that next generation sequencing techniques have the potential to provide a fast and standardised screening of benthic biodiversity at large spatial and temporal scales.

  6. Rare recombination events generate sequence diversity among balancer chromosomes in Drosophila melanogaster.

    Science.gov (United States)

    Miller, Danny E; Cook, Kevin R; Yeganeh Kazemi, Nazanin; Smith, Clarissa B; Cockrell, Alexandria J; Hawley, R Scott; Bergman, Casey M

    2016-03-08

    Multiply inverted balancer chromosomes that suppress exchange with their homologs are an essential part of the Drosophila melanogaster genetic toolkit. Despite their widespread use, the organization of balancer chromosomes has not been characterized at the molecular level, and the degree of sequence variation among copies of balancer chromosomes is unknown. To map inversion breakpoints and study potential diversity in descendants of a structurally identical balancer chromosome, we sequenced a panel of laboratory stocks containing the most widely used X chromosome balancer, First Multiple 7 (FM7). We mapped the locations of FM7 breakpoints to precise euchromatic coordinates and identified the flanking sequence of breakpoints in heterochromatic regions. Analysis of SNP variation revealed megabase-scale blocks of sequence divergence among currently used FM7 stocks. We present evidence that this divergence arose through rare double-crossover events that replaced a female-sterile allele of the singed gene (sn(X2)) on FM7c with a sequence from balanced chromosomes. We propose that although double-crossover events are rare in individual crosses, many FM7c chromosomes in the Bloomington Drosophila Stock Center have lost sn(X2) by this mechanism on a historical timescale. Finally, we characterize the original allele of the Bar gene (B(1)) that is carried on FM7, and validate the hypothesis that the origin and subsequent reversion of the B(1) duplication are mediated by unequal exchange. Our results reject a simple nonrecombining, clonal mode for the laboratory evolution of balancer chromosomes and have implications for how balancer chromosomes should be used in the design and interpretation of genetic experiments in Drosophila.

  7. Origin, diversity and maturation of human antiviral antibodies analyzed by high-throughput sequencing

    Directory of Open Access Journals (Sweden)

    Ponraj ePrabakaran

    2012-08-01

    Full Text Available Our understanding of how antibodies are generated and function could help develop effective vaccines and antibody-based therapeutics against viruses such as HIV-1, SARS Coronavirus (CoV, and Hendra and Nipah viruses (henipaviruses. Although broadly neutralizing antibodies (bnAbs against the HIV-1 were observed in patients, elicitation of such bnAbs remains a major challenge when compared to other viral targets. We previously hypothesized that HIV-1 could have evolved a strategy to evade the immune system due to absent or very weak binding of germline antibodies to the conserved epitopes that may not be sufficient to initiate and/or maintain an effective immune response. To further explore our hypothesis, we used the 454 sequence analysis of a large naïve library of human IgM antibodies which had been used for selecting antibodies against SARS Coronavirus (CoV receptor-binding domain (RBD, and soluble G proteins (sG of Hendra and Nipah viruses (henipaviruses. We found that the human IgM repertoires from the 454 sequencing have diverse germline usages, recombination patterns, junction diversity and a lower extent of somatic mutation. In this study, we identified germline intermediates of antibodies specific to HIV-1 and other viruses as observed in normal individuals, and compared their genetic diversity and somatic mutation level along with available structural and functional data. Further computational analysis will provide framework for understanding the underlying genetic and molecular determinants related to maturation pathways of antiviral bnAbs that could be useful for applying novel approaches to the design of effective vaccine immunogens and antibody-based therapeutics.

  8. Abundance and genetic diversity of nifH gene sequences in anthropogenically affected Brazilian mangrove sediments.

    Science.gov (United States)

    Dias, Armando Cavalcante Franco; Pereira e Silva, Michele de Cassia; Cotta, Simone Raposo; Dini-Andreote, Francisco; Soares, Fábio Lino; Salles, Joana Falcão; Azevedo, João Lúcio; van Elsas, Jan Dirk; Andreote, Fernando Dini

    2012-11-01

    Although mangroves represent ecosystems of global importance, the genetic diversity and abundance of functional genes that are key to their functioning scarcely have been explored. Here, we present a survey based on the nifH gene across transects of sediments of two mangrove systems located along the coast line of São Paulo state (Brazil) which differed by degree of disturbance, i.e., an oil-spill-affected and an unaffected mangrove. The diazotrophic communities were assessed by denaturing gradient gel electrophoresis (DGGE), quantitative PCR (qPCR), and clone libraries. The nifH gene abundance was similar across the two mangrove sediment systems, as evidenced by qPCR. However, the nifH-based PCR-DGGE profiles revealed clear differences between the mangroves. Moreover, shifts in the nifH gene diversities were noted along the land-sea transect within the previously oiled mangrove. The nifH gene diversity depicted the presence of nitrogen-fixing bacteria affiliated with a wide range of taxa, encompassing members of the Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Firmicutes, and also a group of anaerobic sulfate-reducing bacteria. We also detected a unique mangrove-specific cluster of sequences denoted Mgv-nifH. Our results indicate that nitrogen-fixing bacterial guilds can be partially endemic to mangroves, and these communities are modulated by oil contamination, which has important implications for conservation strategies.

  9. Abundance and Genetic Diversity of nifH Gene Sequences in Anthropogenically Affected Brazilian Mangrove Sediments

    Science.gov (United States)

    Dias, Armando Cavalcante Franco; Pereira e Silva, Michele de Cassia; Cotta, Simone Raposo; Dini-Andreote, Francisco; Soares, Fábio Lino; Salles, Joana Falcão; Azevedo, João Lúcio; van Elsas, Jan Dirk

    2012-01-01

    Although mangroves represent ecosystems of global importance, the genetic diversity and abundance of functional genes that are key to their functioning scarcely have been explored. Here, we present a survey based on the nifH gene across transects of sediments of two mangrove systems located along the coast line of São Paulo state (Brazil) which differed by degree of disturbance, i.e., an oil-spill-affected and an unaffected mangrove. The diazotrophic communities were assessed by denaturing gradient gel electrophoresis (DGGE), quantitative PCR (qPCR), and clone libraries. The nifH gene abundance was similar across the two mangrove sediment systems, as evidenced by qPCR. However, the nifH-based PCR-DGGE profiles revealed clear differences between the mangroves. Moreover, shifts in the nifH gene diversities were noted along the land-sea transect within the previously oiled mangrove. The nifH gene diversity depicted the presence of nitrogen-fixing bacteria affiliated with a wide range of taxa, encompassing members of the Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Firmicutes, and also a group of anaerobic sulfate-reducing bacteria. We also detected a unique mangrove-specific cluster of sequences denoted Mgv-nifH. Our results indicate that nitrogen-fixing bacterial guilds can be partially endemic to mangroves, and these communities are modulated by oil contamination, which has important implications for conservation strategies. PMID:22941088

  10. Genotyping-by-Sequencing Analysis for Determining Population Structure of Finger Millet Germplasm of Diverse Origins

    Directory of Open Access Journals (Sweden)

    Anil Kumar

    2016-07-01

    Full Text Available Finger millet [ (L. Gaertn.] is grown mainly by subsistence farmers in arid and semiarid regions of the world. To broaden its genetic base and to boost its production, it is of paramount importance to characterize and genotype the diverse gene pool of this important food and nutritional security crop. However, as a result of nonavailability of the genome sequence of finger millet, the progress could not be made in realizing the molecular basis of unique qualities of the crop. In the present investigation, attempts have been made to characterize the genetically diverse collection of 113 finger millet accessions through whole-genome genotyping-by-sequencing (GBS, which resulted in a genome-wide set of 23,000 single-nucleotide polymorphisms (SNPs segregating across the entire collection and several thousand SNPs segregating within every accession. A model-based population structure analysis reveals the presence of three subpopulations among the finger millet accessions, which are in parallel with the results of phylogenetic analysis. The observed population structure is consistent with the hypothesis that finger millet was domesticated first in Africa, and from there it was introduced to India some 3000 yr ago. A total of 1128 gene ontology (GO terms were assigned to SNP-carrying genes for three main categories: biological process, cellular component, and molecular function. Facilitated access to high-throughput genotyping and sequencing technologies are likely to improve the breeding process in developing countries, and as such, this data will be very useful to breeders who are working for the genetic improvement of finger millet.

  11. Genotyping-by-Sequencing Analysis for Determining Population Structure of Finger Millet Germplasm of Diverse Origins.

    Science.gov (United States)

    Kumar, Anil; Sharma, Divya; Tiwari, Apoorv; Jaiswal, J P; Singh, N K; Sood, Salej

    2016-07-01

    Finger millet [ (L.) Gaertn.] is grown mainly by subsistence farmers in arid and semiarid regions of the world. To broaden its genetic base and to boost its production, it is of paramount importance to characterize and genotype the diverse gene pool of this important food and nutritional security crop. However, as a result of nonavailability of the genome sequence of finger millet, the progress could not be made in realizing the molecular basis of unique qualities of the crop. In the present investigation, attempts have been made to characterize the genetically diverse collection of 113 finger millet accessions through whole-genome genotyping-by-sequencing (GBS), which resulted in a genome-wide set of 23,000 single-nucleotide polymorphisms (SNPs) segregating across the entire collection and several thousand SNPs segregating within every accession. A model-based population structure analysis reveals the presence of three subpopulations among the finger millet accessions, which are in parallel with the results of phylogenetic analysis. The observed population structure is consistent with the hypothesis that finger millet was domesticated first in Africa, and from there it was introduced to India some 3000 yr ago. A total of 1128 gene ontology (GO) terms were assigned to SNP-carrying genes for three main categories: biological process, cellular component, and molecular function. Facilitated access to high-throughput genotyping and sequencing technologies are likely to improve the breeding process in developing countries, and as such, this data will be very useful to breeders who are working for the genetic improvement of finger millet. Copyright © 2016 Crop Science Society of America.

  12. Benchmark Evaluation of True Single Molecular Sequencing to Determine Cystic Fibrosis Airway Microbiome Diversity

    Directory of Open Access Journals (Sweden)

    Andrea Hahn

    2018-05-01

    Full Text Available Cystic fibrosis (CF is an autosomal recessive disease associated with recurrent lung infections that can lead to morbidity and mortality. The impact of antibiotics for treatment of acute pulmonary exacerbations on the CF airway microbiome remains unclear with prior studies giving conflicting results and being limited by their use of 16S ribosomal RNA sequencing. Our primary objective was to validate the use of true single molecular sequencing (tSMS and PathoScope in the analysis of the CF airway microbiome. Three control samples were created with differing amounts of Burkholderia cepacia, Pseudomonas aeruginosa, and Prevotella melaninogenica, three common bacteria found in cystic fibrosis lungs. Paired sputa were also obtained from three study participants with CF before and >6 days after initiation of antibiotics. Antibiotic resistant B. cepacia and P. aeruginosa were identified in concurrently obtained respiratory cultures. Direct sequencing was performed using tSMS, and filtered reads were aligned to reference genomes from NCBI using PathoScope and Kraken and unique clade-specific marker genes using MetaPhlAn. A total of 180–518 K of 6–12 million filtered reads were aligned for each sample. Detection of known pathogens in control samples was most successful using PathoScope. In the CF sputa, alpha diversity measures varied based on the alignment method used, but similar trends were found between pre- and post-antibiotic samples. PathoScope outperformed Kraken and MetaPhlAn in our validation study of artificial bacterial community controls and also has advantages over Kraken and MetaPhlAn of being able to determine bacterial strains and the presence of fungal organisms. PathoScope can be confidently used when evaluating metagenomic data to determine CF airway microbiome diversity.

  13. Diversity and Structure of Diazotrophic Communities in Mangrove Rhizosphere, Revealed by High-Throughput Sequencing.

    Science.gov (United States)

    Zhang, Yanying; Yang, Qingsong; Ling, Juan; Van Nostrand, Joy D; Shi, Zhou; Zhou, Jizhong; Dong, Junde

    2017-01-01

    Diazotrophic communities make an essential contribution to the productivity through providing new nitrogen. However, knowledge of the roles that both mangrove tree species and geochemical parameters play in shaping mangove rhizosphere diazotrophic communities is still elusive. Here, a comprehensive examination of the diversity and structure of microbial communities in the rhizospheres of three mangrove species, Rhizophora apiculata , Avicennia marina , and Ceriops tagal , was undertaken using high - throughput sequencing of the 16S rRNA and nifH genes. Our results revealed a great diversity of both the total microbial composition and the diazotrophic composition specifically in the mangrove rhizosphere. Deltaproteobacteria and Gammaproteobacteria were both ubiquitous and dominant, comprising an average of 45.87 and 86.66% of total microbial and diazotrophic communities, respectively. Sulfate-reducing bacteria belonging to the Desulfobacteraceae and Desulfovibrionaceae were the dominant diazotrophs. Community statistical analyses suggested that both mangrove tree species and additional environmental variables played important roles in shaping total microbial and potential diazotroph communities in mangrove rhizospheres. In contrast to the total microbial community investigated by analysis of 16S rRNA gene sequences, most of the dominant diazotrophic groups identified by nifH gene sequences were significantly different among mangrove species. The dominant diazotrophs of the family Desulfobacteraceae were positively correlated with total phosphorus, but negatively correlated with the nitrogen to phosphorus ratio. The Pseudomonadaceae were positively correlated with the concentration of available potassium, suggesting that diazotrophs potentially play an important role in biogeochemical cycles, such as those of nitrogen, phosphorus, sulfur, and potassium, in the mangrove ecosystem.

  14. Diversity and Structure of Diazotrophic Communities in Mangrove Rhizosphere, Revealed by High-Throughput Sequencing

    Directory of Open Access Journals (Sweden)

    Yanying Zhang

    2017-10-01

    Full Text Available Diazotrophic communities make an essential contribution to the productivity through providing new nitrogen. However, knowledge of the roles that both mangrove tree species and geochemical parameters play in shaping mangove rhizosphere diazotrophic communities is still elusive. Here, a comprehensive examination of the diversity and structure of microbial communities in the rhizospheres of three mangrove species, Rhizophora apiculata, Avicennia marina, and Ceriops tagal, was undertaken using high-throughput sequencing of the 16S rRNA and nifH genes. Our results revealed a great diversity of both the total microbial composition and the diazotrophic composition specifically in the mangrove rhizosphere. Deltaproteobacteria and Gammaproteobacteria were both ubiquitous and dominant, comprising an average of 45.87 and 86.66% of total microbial and diazotrophic communities, respectively. Sulfate-reducing bacteria belonging to the Desulfobacteraceae and Desulfovibrionaceae were the dominant diazotrophs. Community statistical analyses suggested that both mangrove tree species and additional environmental variables played important roles in shaping total microbial and potential diazotroph communities in mangrove rhizospheres. In contrast to the total microbial community investigated by analysis of 16S rRNA gene sequences, most of the dominant diazotrophic groups identified by nifH gene sequences were significantly different among mangrove species. The dominant diazotrophs of the family Desulfobacteraceae were positively correlated with total phosphorus, but negatively correlated with the nitrogen to phosphorus ratio. The Pseudomonadaceae were positively correlated with the concentration of available potassium, suggesting that diazotrophs potentially play an important role in biogeochemical cycles, such as those of nitrogen, phosphorus, sulfur, and potassium, in the mangrove ecosystem.

  15. Complete sequence and diversity of a maize-associated Polerovirus in East Africa.

    Science.gov (United States)

    Massawe, Deogracious P; Stewart, Lucy R; Kamatenesi, Jovia; Asiimwe, Theodore; Redinbaugh, Margaret G

    2018-06-01

    Since 2011-2012, Maize lethal necrosis (MLN) has emerged in East Africa, causing massive yield loss and propelling research to identify viruses and virus populations present in maize. As expected, next generation sequencing (NGS) has revealed diverse and abundant viruses from the family Potyviridae, primarily sugarcane mosaic virus (SCMV), and maize chlorotic mottle virus (MCMV) (Tombusviridae), which are known to cause MLN by synergistic co-infection. In addition to these expected viruses, we identified a virus in the genus Polerovirus (family Luteoviridae) in 104/172 samples selected for MLN or other potential virus symptoms from Kenya, Uganda, Rwanda, and Tanzania. This polerovirus (MF974579) nucleotide sequence is 97% identical to maize-associated viruses recently reported in China, termed 'maize yellow mosaic virus' (MaYMV) and maize yellow dwarf virus (MaYMV; KU291101, KU291107, MYDV-RMV2; KT992824); and 99% identical to MaYMV (KY684356) infecting sugarcane and itch grass in Nigeria; 83% identical to a barley-associated polerovirus recently identified in Korea (BVG; KT962089); and 79% identical to the U.S. maize-infecting polerovirus maize yellow dwarf virus (MYDV-RMV; KT992824). Nucleotide sequences from ORF0 of 20 individual East African isolates collected from Kenya, Uganda, Rwanda, and Tanzania shared 98% or higher identity, and were detected in 104/172 (60.5%) of samples collected for virus-like symptoms, indicating extensive prevalence but limited diversity of this virus in East Africa. We refer to this virus as "MYDV-like polerovirus" until symptoms of the virus in maize are known.

  16. Sequence diversity and differential expression of major phenylpropanoid-flavonoid biosynthetic genes among three mango varieties.

    Science.gov (United States)

    Hoang, Van L T; Innes, David J; Shaw, P Nicholas; Monteith, Gregory R; Gidley, Michael J; Dietzgen, Ralf G

    2015-07-30

    Mango fruits contain a broad spectrum of phenolic compounds which impart potential health benefits; their biosynthesis is catalysed by enzymes in the phenylpropanoid-flavonoid (PF) pathway. The aim of this study was to reveal the variability in genes involved in the PF pathway in three different mango varieties Mangifera indica L., a member of the family Anacardiaceae: Kensington Pride (KP), Irwin (IW) and Nam Doc Mai (NDM) and to determine associations with gene expression and mango flavonoid profiles. A close evolutionary relationship between mango genes and those from the woody species poplar of the Salicaceae family (Populus trichocarpa) and grape of the Vitaceae family (Vitis vinifera), was revealed through phylogenetic analysis of PF pathway genes. We discovered 145 SNPs in total within coding sequences with an average frequency of one SNP every 316 bp. Variety IW had the highest SNP frequency (one SNP every 258 bp) while KP and NDM had similar frequencies (one SNP every 369 bp and 360 bp, respectively). The position in the PF pathway appeared to influence the extent of genetic diversity of the encoded enzymes. The entry point enzymes phenylalanine lyase (PAL), cinnamate 4-mono-oxygenase (C4H) and chalcone synthase (CHS) had low levels of SNP diversity in their coding sequences, whereas anthocyanidin reductase (ANR) showed the highest SNP frequency followed by flavonoid 3'-hydroxylase (F3'H). Quantitative PCR revealed characteristic patterns of gene expression that differed between mango peel and flesh, and between varieties. The combination of mango expressed sequence tags and availability of well-established reference PF biosynthetic genes from other plant species allowed the identification of coding sequences of genes that may lead to the formation of important flavonoid compounds in mango fruits and facilitated characterisation of single nucleotide polymorphisms between varieties. We discovered an association between the extent of sequence variation and

  17. Detecting differential DNA methylation from sequencing of bisulfite converted DNA of diverse species.

    Science.gov (United States)

    Huh, Iksoo; Wu, Xin; Park, Taesung; Yi, Soojin V

    2017-07-21

    DNA methylation is one of the most extensively studied epigenetic modifications of genomic DNA. In recent years, sequencing of bisulfite-converted DNA, particularly via next-generation sequencing technologies, has become a widely popular method to study DNA methylation. This method can be readily applied to a variety of species, dramatically expanding the scope of DNA methylation studies beyond the traditionally studied human and mouse systems. In parallel to the increasing wealth of genomic methylation profiles, many statistical tools have been developed to detect differentially methylated loci (DMLs) or differentially methylated regions (DMRs) between biological conditions. We discuss and summarize several key properties of currently available tools to detect DMLs and DMRs from sequencing of bisulfite-converted DNA. However, the majority of the statistical tools developed for DML/DMR analyses have been validated using only mammalian data sets, and less priority has been placed on the analyses of invertebrate or plant DNA methylation data. We demonstrate that genomic methylation profiles of non-mammalian species are often highly distinct from those of mammalian species using examples of honey bees and humans. We then discuss how such differences in data properties may affect statistical analyses. Based on these differences, we provide three specific recommendations to improve the power and accuracy of DML and DMR analyses of invertebrate data when using currently available statistical tools. These considerations should facilitate systematic and robust analyses of DNA methylation from diverse species, thus advancing our understanding of DNA methylation. © The Author 2017. Published by Oxford University Press.

  18. Impact of Human Immunodeficiency Virus Type-1 Sequence Diversity on Antiretroviral Therapy Outcomes

    Directory of Open Access Journals (Sweden)

    Allison Langs-Barlow

    2014-10-01

    Full Text Available Worldwide circulating HIV-1 genomes show extensive variation represented by different subtypes, polymorphisms and drug-resistant strains. Reports on the impact of sequence variation on antiretroviral therapy (ART outcomes are mixed. In this review, we summarize relevant published data from both resource-rich and resource-limited countries in the last 10 years on the impact of HIV-1 sequence diversity on treatment outcomes. The prevalence of transmission of drug resistant mutations (DRMs varies considerably, ranging from 0% to 27% worldwide. Factors such as geographic location, access and availability to ART, duration since inception of treatment programs, quality of care, risk-taking behaviors, mode of transmission, and viral subtype all dictate the prevalence in a particular geographical region. Although HIV-1 subtype may not be a good predictor of treatment outcome, review of emerging evidence supports the fact that HIV-1 genome sequence-resulting from natural polymorphisms or drug-associated mutations-matters when it comes to treatment outcomes. Therefore, continued surveillance of drug resistant variants in both treatment-naïve and treatment-experienced populations is needed to reduce the transmission of DRMs and to optimize the efficacy of the current ART armamentarium.

  19. Development of simple sequence repeat markers and diversity analysis in alfalfa (Medicago sativa L.).

    Science.gov (United States)

    Wang, Zan; Yan, Hongwei; Fu, Xinnian; Li, Xuehui; Gao, Hongwen

    2013-04-01

    Efficient and robust molecular markers are essential for molecular breeding in plant. Compared to dominant and bi-allelic markers, multiple alleles of simple sequence repeat (SSR) markers are particularly informative and superior in genetic linkage map and QTL mapping in autotetraploid species like alfalfa. The objective of this study was to enrich SSR markers directly from alfalfa expressed sequence tags (ESTs). A total of 12,371 alfalfa ESTs were retrieved from the National Center for Biotechnology Information. Total 774 SSR-containing ESTs were identified from 716 ESTs. On average, one SSR was found per 7.7 kb of EST sequences. Tri-nucleotide repeats (48.8 %) was the most abundant motif type, followed by di-(26.1 %), tetra-(11.5 %), penta-(9.7 %), and hexanucleotide (3.9 %). One hundred EST-SSR primer pairs were successfully designed and 29 exhibited polymorphism among 28 alfalfa accessions. The allele number per marker ranged from two to 21 with an average of 6.8. The PIC values ranged from 0.195 to 0.896 with an average of 0.608, indicating a high level of polymorphism of the EST-SSR markers. Based on the 29 EST-SSR markers, assessment of genetic diversity was conducted and found that Medicago sativa ssp. sativa was clearly different from the other subspecies. The high transferability of those EST-SSR markers was also found for relative species.

  20. Identification and characterization of rhizospheric microbial diversity by 16S ribosomal RNA gene sequencing

    Directory of Open Access Journals (Sweden)

    Muhammad Naveed

    2014-09-01

    Full Text Available In the present study, samples of rhizosphere and root nodules were collected from different areas of Pakistan to isolate plant growth promoting rhizobacteria. Identification of bacterial isolates was made by 16S rRNA gene sequence analysis and taxonomical confirmation on EzTaxon Server. The identified bacterial strains were belonged to 5 genera i.e. Ensifer, Bacillus, Pseudomona, Leclercia and Rhizobium. Phylogenetic analysis inferred from 16S rRNA gene sequences showed the evolutionary relationship of bacterial strains with the respective genera. Based on phylogenetic analysis, some candidate novel species were also identified. The bacterial strains were also characterized for morphological, physiological, biochemical tests and glucose dehydrogenase (gdh gene that involved in the phosphate solublization using cofactor pyrroloquinolone quinone (PQQ. Seven rhizoshperic and 3 root nodulating stains are positive for gdh gene. Furthermore, this study confirms a novel association between microbes and their hosts like field grown crops, leguminous and non-leguminous plants. It was concluded that a diverse group of bacterial population exist in the rhizosphere and root nodules that might be useful in evaluating the mechanisms behind plant microbial interactions and strains QAU-63 and QAU-68 have sequence similarity of 97 and 95% which might be declared as novel after further taxonomic characterization.

  1. New var reconstruction algorithm exposes high var sequence diversity in a single geographic location in Mali.

    Science.gov (United States)

    Dara, Antoine; Drábek, Elliott F; Travassos, Mark A; Moser, Kara A; Delcher, Arthur L; Su, Qi; Hostelley, Timothy; Coulibaly, Drissa; Daou, Modibo; Dembele, Ahmadou; Diarra, Issa; Kone, Abdoulaye K; Kouriba, Bourema; Laurens, Matthew B; Niangaly, Amadou; Traore, Karim; Tolo, Youssouf; Fraser, Claire M; Thera, Mahamadou A; Djimde, Abdoulaye A; Doumbo, Ogobara K; Plowe, Christopher V; Silva, Joana C

    2017-03-28

    Encoded by the var gene family, highly variable Plasmodium falciparum erythrocyte membrane protein-1 (PfEMP1) proteins mediate tissue-specific cytoadherence of infected erythrocytes, resulting in immune evasion and severe malaria disease. Sequencing and assembling the 40-60 var gene complement for individual infections has been notoriously difficult, impeding molecular epidemiological studies and the assessment of particular var elements as subunit vaccine candidates. We developed and validated a novel algorithm, Exon-Targeted Hybrid Assembly (ETHA), to perform targeted assembly of var gene sequences, based on a combination of Pacific Biosciences and Illumina data. Using ETHA, we characterized the repertoire of var genes in 12 samples from uncomplicated malaria infections in children from a single Malian village and showed them to be as genetically diverse as vars from isolates from around the globe. The gene var2csa, a member of the var family associated with placental malaria pathogenesis, was present in each genome, as were vars previously associated with severe malaria. ETHA, a tool to discover novel var sequences from clinical samples, will aid the understanding of malaria pathogenesis and inform the design of malaria vaccines based on PfEMP1. ETHA is available at: https://sourceforge.net/projects/etha/ .

  2. mtDNA sequence diversity of Hazara ethnic group from Pakistan.

    Science.gov (United States)

    Rakha, Allah; Fatima; Peng, Min-Sheng; Adan, Atif; Bi, Rui; Yasmin, Memona; Yao, Yong-Gang

    2017-09-01

    The present study was undertaken to investigate mitochondrial DNA (mtDNA) control region sequences of Hazaras from Pakistan, so as to generate mtDNA reference database for forensic casework in Pakistan and to analyze phylogenetic relationship of this particular ethnic group with geographically proximal populations. Complete mtDNA control region (nt 16024-576) sequences were generated through Sanger Sequencing for 319 Hazara individuals from Quetta, Baluchistan. The population sample set showed a total of 189 distinct haplotypes, belonging mainly to West Eurasian (51.72%), East & Southeast Asian (29.78%) and South Asian (18.50%) haplogroups. Compared with other populations from Pakistan, the Hazara population had a relatively high haplotype diversity (0.9945) and a lower random match probability (0.0085). The dataset has been incorporated into EMPOP database under accession number EMP00680. The data herein comprises the largest, and likely most thoroughly examined, control region mtDNA dataset from Hazaras of Pakistan. Copyright © 2017 Elsevier B.V. All rights reserved.

  3. Assessing the Diversity of Rodent-Borne Viruses: Exploring of High-Throughput Sequencing and Classical Amplification/Sequencing Approaches.

    Science.gov (United States)

    Drewes, Stephan; Straková, Petra; Drexler, Jan F; Jacob, Jens; Ulrich, Rainer G

    2017-01-01

    Rodents are distributed throughout the world and interact with humans in many ways. They provide vital ecosystem services, some species are useful models in biomedical research and some are held as pet animals. However, many rodent species can have adverse effects such as damage to crops and stored produce, and they are of health concern because of the transmission of pathogens to humans and livestock. The first rodent viruses were discovered by isolation approaches and resulted in break-through knowledge in immunology, molecular and cell biology, and cancer research. In addition to rodent-specific viruses, rodent-borne viruses are causing a large number of zoonotic diseases. Most prominent examples are reemerging outbreaks of human hemorrhagic fever disease cases caused by arena- and hantaviruses. In addition, rodents are reservoirs for vector-borne pathogens, such as tick-borne encephalitis virus and Borrelia spp., and may carry human pathogenic agents, but likely are not involved in their transmission to human. In our days, next-generation sequencing or high-throughput sequencing (HTS) is revolutionizing the speed of the discovery of novel viruses, but other molecular approaches, such as generic RT-PCR/PCR and rolling circle amplification techniques, contribute significantly to the rapidly ongoing process. However, the current knowledge still represents only the tip of the iceberg, when comparing the known human viruses to those known for rodents, the mammalian taxon with the largest species number. The diagnostic potential of HTS-based metagenomic approaches is illustrated by their use in the discovery and complete genome determination of novel borna- and adenoviruses as causative disease agents in squirrels. In conclusion, HTS, in combination with conventional RT-PCR/PCR-based approaches, resulted in a drastically increased knowledge of the diversity of rodent viruses. Future improvements of the used workflows, including bioinformatics analysis, will further

  4. a Comparison of Morphological Taxonomy and Next Generation DNA Sequencing for the Assessment of Zooplankton Diversity

    Science.gov (United States)

    Harvey, J.; Fisher, J. L.; Johnson, S.; Morgan, S.; Peterson, W. T.; Satterthwaite, E. V.; Vrijenhoek, R. C.

    2016-02-01

    Our ability to accurately characterize the diversity of planktonic organisms is affected by both the methods we use to collect water samples and our approaches to assessing sample contents. Plankton nets collect organisms from high volumes of water, but integrate sample contents along the net's path. In contrast, plankton pumps collect water from discrete depths. Autonomous underwater vehicles (AUVs) can collect water samples with pinpoint accuracy from physical features such as upwelling fronts or biological features such as phytoplankton blooms, but sample volumes are necessarily much smaller than those possible with nets. Characterization of plankton diversity and abundances in water samples may also vary with the assessment method we apply. Morphological taxonomy provides visual identification and enumeration of organisms via microscopy, but is labor intensive. Next generation DNA sequencing (NGS) shows great promise for assessing plankton diversity in water samples but accurate assessment of relative abundances may not be possible in all cases. Comparison of morphological taxonomy to molecular approaches is necessary to identify areas of overlap and also areas of disagreement between these methods. We have compared morphological taxonomic assessments to mitochondrial COI and nuclear 28S ribosomal RNA NGS results for plankton net samples collected in Monterey bay, California. We have made a similar comparison for plankton pump samples, and have also applied our NGS methods to targeted, small volume water samples collected by an AUV. Our goal is to communicate current results and lessons learned regarding application of traditional taxonomy and novel molecular approaches to the study of plankton diversity in spatially and temporally variable, coastal marine environments.

  5. Child health security in China: a survey of child health insurance coverage in diverse areas of the country.

    Science.gov (United States)

    Xiong, Juyang; Hipgrave, David; Myklebust, Karoline; Guo, Sufang; Scherpbier, Robert W; Tong, Xuetao; Yao, Lan; Moran, Andrew E

    2013-11-01

    China embarked on an ambitious health system reform in 2009, and pledged to achieve universal health insurance coverage by 2020. However, there are gaps in access to healthcare for some children in China. We assessed health insurance status and associated variables among children under five in twelve communities in 2010: two urban community health centers and two rural township health centers in each of three municipalities located in China's distinctly different East, Central and Western regions. Information on demographic and socio-economic variables and children's insurance status was gathered from parents or caregivers of all children enrolled in local health programs, and others recruited from the local communities. Only 62% of 1131 children assessed were insured. This figure did not vary across geographic regions, but urban children were less likely to be insured than rural children. In multivariate analysis, infants were 2.44 times more likely to be uninsured than older children and children having at least one migrant parent were 1.90 times more likely to be uninsured than those living with non-migrant parents. Low maternal education was also associated with being uninsured. Gaps in China's child health insurance coverage might be bridged if newborns are automatically covered from birth, and if insurance is extended to all urban migrant children, regardless of the family's residential registration status and size. Copyright © 2013 Elsevier Ltd. All rights reserved.

  6. Using Whole Genome Analysis to Examine Recombination across Diverse Sequence Types of Staphylococcus aureus.

    Directory of Open Access Journals (Sweden)

    Elizabeth M Driebe

    Full Text Available Staphylococcus aureus is an important clinical pathogen worldwide and understanding this organism's phylogeny and, in particular, the role of recombination, is important both to understand the overall spread of virulent lineages and to characterize outbreaks. To further elucidate the phylogeny of S. aureus, 35 diverse strains were sequenced using whole genome sequencing. In addition, 29 publicly available whole genome sequences were included to create a single nucleotide polymorphism (SNP-based phylogenetic tree encompassing 11 distinct lineages. All strains of a particular sequence type fell into the same clade with clear groupings of the major clonal complexes of CC8, CC5, CC30, CC45 and CC1. Using a novel analysis method, we plotted the homoplasy density and SNP density across the whole genome and found evidence of recombination throughout the entire chromosome, but when we examined individual clonal lineages we found very little recombination. However, when we analyzed three branches of multiple lineages, we saw intermediate and differing levels of recombination between them. These data demonstrate that in S. aureus, recombination occurs across major lineages that subsequently expand in a clonal manner. Estimated mutation rates for the CC8 and CC5 lineages were different from each other. While the CC8 lineage rate was similar to previous studies, the CC5 lineage was 100-fold greater. Fifty known virulence genes were screened in all genomes in silico to determine their distribution across major clades. Thirty-three genes were present variably across clades, most of which were not constrained by ancestry, indicating horizontal gene transfer or gene loss.

  7. Penicillium arizonense, a new, genome sequenced fungal species, reveals a high chemical diversity in secreted metabolites

    DEFF Research Database (Denmark)

    Grijseels, Sietske; Nielsen, Jens Christian; Randelovic, Milica

    2016-01-01

    A new soil-borne species belonging to the Penicillium section Canescentia is described, Penicillium arizonense sp. nov. (type strain CBS 141311T = IBT 12289T). The genome was sequenced and assembled into 33.7 Mb containing 12,502 predicted genes. A phylogenetic assessment based on marker genes...... confirmed the grouping of P. arizonense within section Canescentia. Compared to related species, P. arizonense proved to encode a high number of proteins involved in carbohydrate metabolism, in particular hemicellulases. Mining the genome for genes involved in secondary metabolite biosynthesis resulted...... of biosynthetic gene clusters in P. arizonense responsible for the synthesis of all detected compounds except curvulinic acid. The capacity to produce biomass degrading enzymes and the identification of a high chemical diversity in secreted bioactive secondary metabolites, offers a broad range of potential...

  8. Genetic diversity of clinical isolates of Bacillus cereus using multilocus sequence typing

    Directory of Open Access Journals (Sweden)

    Pruckler James M

    2008-11-01

    Full Text Available Abstract Background Bacillus cereus is most commonly associated with foodborne illness (diarrheal and emetic but is also an opportunistic pathogen that can cause severe and fatal infections. Several multilocus sequence typing (MLST schemes have recently been developed to genotype B. cereus and analysis has suggested a clonal or weakly clonal population structure for B. cereus and its close relatives B. anthracis and B. thuringiensis. In this study we used MLST to determine if B. cereus isolates associated with illnesses of varying severity (e.g., severe, systemic vs. gastrointestinal (GI illness were clonal or formed clonal complexes. Results A retrospective analysis of 55 clinical B. cereus isolates submitted to the Centers for Disease Control and Prevention between 1954 and 2004 was conducted. Clinical isolates from severe infections (n = 27, gastrointestinal (GI illness (n = 18, and associated isolates from food (n = 10 were selected for analysis using MLST. The 55 isolates were diverse and comprised 38 sequence types (ST in two distinct clades. Of the 27 isolates associated with serious illness, 13 clustered in clade 1 while 14 were in clade 2. Isolates associated with GI illness were also found throughout clades 1 and 2, while no isolates in this study belonged to clade 3. All the isolates from this study belonging to the clade 1/cereus III lineage were associated with severe disease while isolates belonging to clade1/cereus II contained isolates primarily associated with severe disease and emetic illness. Only three STs were observed more than once for epidemiologically distinct isolates. Conclusion STs of clinical B. cereus isolates were phylogenetically diverse and distributed among two of three previously described clades. Greater numbers of strains will need to be analyzed to confirm if specific lineages or clonal complexes are more likely to contain clinical isolates or be associated with specific illness, similar to B. anthracis and

  9. Seasonal diversity and dynamics of haptophytes in the Skagerrak, Norway, explored by high-throughput sequencing

    Science.gov (United States)

    Egge, Elianne Sirnæs; Johannessen, Torill Vik; Andersen, Tom; Eikrem, Wenche; Bittner, Lucie; Larsen, Aud; Sandaa, Ruth-Anne; Edvardsen, Bente

    2015-01-01

    Microalgae in the division Haptophyta play key roles in the marine ecosystem and in global biogeochemical processes. Despite their ecological importance, knowledge on seasonal dynamics, community composition and abundance at the species level is limited due to their small cell size and few morphological features visible under the light microscope. Here, we present unique data on haptophyte seasonal diversity and dynamics from two annual cycles, with the taxonomic resolution and sampling depth obtained with high-throughput sequencing. From outer Oslofjorden, S Norway, nano- and picoplanktonic samples were collected monthly for 2 years, and the haptophytes targeted by amplification of RNA/cDNA with Haptophyta-specific 18S rDNA V4 primers. We obtained 156 operational taxonomic units (OTUs), from c. 400.000 454 pyrosequencing reads, after rigorous bioinformatic filtering and clustering at 99.5%. Most OTUs represented uncultured and/or not yet 18S rDNA-sequenced species. Haptophyte OTU richness and community composition exhibited high temporal variation and significant yearly periodicity. Richness was highest in September–October (autumn) and lowest in April–May (spring). Some taxa were detected all year, such as Chrysochromulina simplex, Emiliania huxleyi and Phaeocystis cordata, whereas most calcifying coccolithophores only appeared from summer to early winter. We also revealed the seasonal dynamics of OTUs representing putative novel classes (clades HAP-3–5) or orders (clades D, E, F). Season, light and temperature accounted for 29% of the variation in OTU composition. Residual variation may be related to biotic factors, such as competition and viral infection. This study provides new, in-depth knowledge on seasonal diversity and dynamics of haptophytes in North Atlantic coastal waters. PMID:25893259

  10. Seasonal diversity and dynamics of haptophytes in the Skagerrak, Norway, explored by high-throughput sequencing.

    Science.gov (United States)

    Egge, Elianne Sirnaes; Johannessen, Torill Vik; Andersen, Tom; Eikrem, Wenche; Bittner, Lucie; Larsen, Aud; Sandaa, Ruth-Anne; Edvardsen, Bente

    2015-06-01

    Microalgae in the division Haptophyta play key roles in the marine ecosystem and in global biogeochemical processes. Despite their ecological importance, knowledge on seasonal dynamics, community composition and abundance at the species level is limited due to their small cell size and few morphological features visible under the light microscope. Here, we present unique data on haptophyte seasonal diversity and dynamics from two annual cycles, with the taxonomic resolution and sampling depth obtained with high-throughput sequencing. From outer Oslofjorden, S Norway, nano- and picoplanktonic samples were collected monthly for 2 years, and the haptophytes targeted by amplification of RNA/cDNA with Haptophyta-specific 18S rDNA V4 primers. We obtained 156 operational taxonomic units (OTUs), from c. 400.000 454 pyrosequencing reads, after rigorous bioinformatic filtering and clustering at 99.5%. Most OTUs represented uncultured and/or not yet 18S rDNA-sequenced species. Haptophyte OTU richness and community composition exhibited high temporal variation and significant yearly periodicity. Richness was highest in September-October (autumn) and lowest in April-May (spring). Some taxa were detected all year, such as Chrysochromulina simplex, Emiliania huxleyi and Phaeocystis cordata, whereas most calcifying coccolithophores only appeared from summer to early winter. We also revealed the seasonal dynamics of OTUs representing putative novel classes (clades HAP-3-5) or orders (clades D, E, F). Season, light and temperature accounted for 29% of the variation in OTU composition. Residual variation may be related to biotic factors, such as competition and viral infection. This study provides new, in-depth knowledge on seasonal diversity and dynamics of haptophytes in North Atlantic coastal waters. © 2015 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.

  11. Whole mitochondrial genome sequencing of domestic horses reveals incorporation of extensive wild horse diversity during domestication

    Directory of Open Access Journals (Sweden)

    Lippold Sebastian

    2011-11-01

    Full Text Available Abstract Background DNA target enrichment by micro-array capture combined with high throughput sequencing technologies provides the possibility to obtain large amounts of sequence data (e.g. whole mitochondrial DNA genomes from multiple individuals at relatively low costs. Previously, whole mitochondrial genome data for domestic horses (Equus caballus were limited to only a few specimens and only short parts of the mtDNA genome (especially the hypervariable region were investigated for larger sample sets. Results In this study we investigated whole mitochondrial genomes of 59 domestic horses from 44 breeds and a single Przewalski horse (Equus przewalski using a recently described multiplex micro-array capture approach. We found 473 variable positions within the domestic horses, 292 of which are parsimony-informative, providing a well resolved phylogenetic tree. Our divergence time estimate suggests that the mitochondrial genomes of modern horse breeds shared a common ancestor around 93,000 years ago and no later than 38,000 years ago. A Bayesian skyline plot (BSP reveals a significant population expansion beginning 6,000-8,000 years ago with an ongoing exponential growth until the present, similar to other domestic animal species. Our data further suggest that a large sample of wild horse diversity was incorporated into the domestic population; specifically, at least 46 of the mtDNA lineages observed in domestic horses (73% already existed before the beginning of domestication about 5,000 years ago. Conclusions Our study provides a window into the maternal origins of extant domestic horses and confirms that modern domestic breeds present a wide sample of the mtDNA diversity found in ancestral, now extinct, wild horse populations. The data obtained allow us to detect a population expansion event coinciding with the beginning of domestication and to estimate both the minimum number of female horses incorporated into the domestic gene pool and the

  12. Genetic diversity studies in pea (Pisum sativum L.) using simple sequence repeat markers.

    Science.gov (United States)

    Kumari, P; Basal, N; Singh, A K; Rai, V P; Srivastava, C P; Singh, P K

    2013-03-13

    The genetic diversity among 28 pea (Pisum sativum L.) genotypes was analyzed using 32 simple sequence repeat markers. A total of 44 polymorphic bands, with an average of 2.1 bands per primer, were obtained. The polymorphism information content ranged from 0.657 to 0.309 with an average of 0.493. The variation in genetic diversity among these cultivars ranged from 0.11 to 0.73. Cluster analysis based on Jaccard's similarity coefficient using the unweighted pair-group method with arithmetic mean (UPGMA) revealed 2 distinct clusters, I and II, comprising 6 and 22 genotypes, respectively. Cluster II was further differentiated into 2 subclusters, IIA and IIB, with 12 and 10 genotypes, respectively. Principal component (PC) analysis revealed results similar to those of UPGMA. The first, second, and third PCs contributed 21.6, 16.1, and 14.0% of the variation, respectively; cumulative variation of the first 3 PCs was 51.7%.

  13. Genetic diversity among Puccinia melanocephala isolates from Brazil assessed using simple sequence repeat markers.

    Science.gov (United States)

    Peixoto-Junior, R F; Creste, S; Landell, M G A; Nunes, D S; Sanguino, A; Campos, M F; Vencovsky, R; Tambarussi, E V; Figueira, A

    2014-09-26

    Brown rust (causal agent Puccinia melanocephala) is an important sugarcane disease that is responsible for large losses in yield worldwide. Despite its importance, little is known regarding the genetic diversity of this pathogen in the main Brazilian sugarcane cultivation areas. In this study, we characterized the genetic diversity of 34 P. melanocephala isolates from 4 Brazilian states using loci identified from an enriched simple sequence repeat (SSR) library. The aggressiveness of 3 isolates from major sugarcane cultivation areas was evaluated by inoculating an intermediately resistant and a susceptible cultivar. From the enriched library, 16 SSR-specific primers were developed, which produced scorable alleles. Of these, 4 loci were polymorphic and 12 were monomorphic for all isolates evaluated. The molecular characterization of the 34 isolates of P. melanocephala conducted using 16 SSR loci revealed the existence of low genetic variability among the isolates. The average estimated genetic distance was 0.12. Phenetic analysis based on Nei's genetic distance clustered the isolates into 2 major groups. Groups I and II included 18 and 14 isolates, respectively, and both groups contained isolates from all 4 geographic regions studied. Two isolates did not cluster with these groups. It was not possible to obtain clusters according to location or state of origin. Analysis of disease severity data revealed that the isolates did not show significant differences in aggressiveness between regions.

  14. High-coverage sequencing and annotated assembly of the genome of the Australian dragon lizard Pogona vitticeps.

    Science.gov (United States)

    Georges, Arthur; Li, Qiye; Lian, Jinmin; O'Meally, Denis; Deakin, Janine; Wang, Zongji; Zhang, Pei; Fujita, Matthew; Patel, Hardip R; Holleley, Clare E; Zhou, Yang; Zhang, Xiuwen; Matsubara, Kazumi; Waters, Paul; Graves, Jennifer A Marshall; Sarre, Stephen D; Zhang, Guojie

    2015-01-01

    The lizards of the family Agamidae are one of the most prominent elements of the Australian reptile fauna. Here, we present a genomic resource built on the basis of a wild-caught male ZZ central bearded dragon Pogona vitticeps. The genomic sequence for P. vitticeps, generated on the Illumina HiSeq 2000 platform, comprised 317 Gbp (179X raw read depth) from 13 insert libraries ranging from 250 bp to 40 kbp. After filtering for low-quality and duplicated reads, 146 Gbp of data (83X) was available for assembly. Exceptionally high levels of heterozygosity (0.85 % of single nucleotide polymorphisms plus sequence insertions or deletions) complicated assembly; nevertheless, 96.4 % of reads mapped back to the assembled scaffolds, indicating that the assembly included most of the sequenced genome. Length of the assembly was 1.8 Gbp in 545,310 scaffolds (69,852 longer than 300 bp), the longest being 14.68 Mbp. N50 was 2.29 Mbp. Genes were annotated on the basis of de novo prediction, similarity to the green anole Anolis carolinensis, Gallus gallus and Homo sapiens proteins, and P. vitticeps transcriptome sequence assemblies, to yield 19,406 protein-coding genes in the assembly, 63 % of which had intact open reading frames. Our assembly captured 99 % (246 of 248) of core CEGMA genes, with 93 % (231) being complete. The quality of the P. vitticeps assembly is comparable or superior to that of other published squamate genomes, and the annotated P. vitticeps genome can be accessed through a genome browser available at https://genomics.canberra.edu.au.

  15. Penicillium arizonense, a new, genome sequenced fungal species, reveals a high chemical diversity in secreted metabolites

    Science.gov (United States)

    Grijseels, Sietske; Nielsen, Jens Christian; Randelovic, Milica; Nielsen, Jens; Nielsen, Kristian Fog; Workman, Mhairi; Frisvad, Jens Christian

    2016-01-01

    A new soil-borne species belonging to the Penicillium section Canescentia is described, Penicillium arizonense sp. nov. (type strain CBS 141311T = IBT 12289T). The genome was sequenced and assembled into 33.7 Mb containing 12,502 predicted genes. A phylogenetic assessment based on marker genes confirmed the grouping of P. arizonense within section Canescentia. Compared to related species, P. arizonense proved to encode a high number of proteins involved in carbohydrate metabolism, in particular hemicellulases. Mining the genome for genes involved in secondary metabolite biosynthesis resulted in the identification of 62 putative biosynthetic gene clusters. Extracts of P. arizonense were analysed for secondary metabolites and austalides, pyripyropenes, tryptoquivalines, fumagillin, pseurotin A, curvulinic acid and xanthoepocin were detected. A comparative analysis against known pathways enabled the proposal of biosynthetic gene clusters in P. arizonense responsible for the synthesis of all detected compounds except curvulinic acid. The capacity to produce biomass degrading enzymes and the identification of a high chemical diversity in secreted bioactive secondary metabolites, offers a broad range of potential industrial applications for the new species P. arizonense. The description and availability of the genome sequence of P. arizonense, further provides the basis for biotechnological exploitation of this species. PMID:27739446

  16. Deep sequencing reveals exceptional diversity and modes of transmission for bacterial sponge symbionts.

    Science.gov (United States)

    Webster, Nicole S; Taylor, Michael W; Behnam, Faris; Lücker, Sebastian; Rattei, Thomas; Whalan, Stephen; Horn, Matthias; Wagner, Michael

    2010-08-01

    Marine sponges contain complex bacterial communities of considerable ecological and biotechnological importance, with many of these organisms postulated to be specific to sponge hosts. Testing this hypothesis in light of the recent discovery of the rare microbial biosphere, we investigated three Australian sponges by massively parallel 16S rRNA gene tag pyrosequencing. Here we show bacterial diversity that is unparalleled in an invertebrate host, with more than 250,000 sponge-derived sequence tags being assigned to 23 bacterial phyla and revealing up to 2996 operational taxonomic units (95% sequence similarity) per sponge species. Of the 33 previously described 'sponge-specific' clusters that were detected in this study, 48% were found exclusively in adults and larvae - implying vertical transmission of these groups. The remaining taxa, including 'Poribacteria', were also found at very low abundance among the 135,000 tags retrieved from surrounding seawater. Thus, members of the rare seawater biosphere may serve as seed organisms for widely occurring symbiont populations in sponges and their host association might have evolved much more recently than previously thought. © 2009 Society for Applied Microbiology and Blackwell Publishing Ltd.

  17. Structural and sequence diversity of the transposon Galileo in the Drosophila willistoni genome.

    Science.gov (United States)

    Gonçalves, Juliana W; Valiati, Victor Hugo; Delprat, Alejandra; Valente, Vera L S; Ruiz, Alfredo

    2014-09-13

    Galileo is one of three members of the P superfamily of DNA transposons. It was originally discovered in Drosophila buzzatii, in which three segregating chromosomal inversions were shown to have been generated by ectopic recombination between Galileo copies. Subsequently, Galileo was identified in six of 12 sequenced Drosophila genomes, indicating its widespread distribution within this genus. Galileo is strikingly abundant in Drosophila willistoni, a neotropical species that is highly polymorphic for chromosomal inversions, suggesting a role for this transposon in the evolution of its genome. We carried out a detailed characterization of all Galileo copies present in the D. willistoni genome. A total of 191 copies, including 133 with two terminal inverted repeats (TIRs), were classified according to structure in six groups. The TIRs exhibited remarkable variation in their length and structure compared to the most complete copy. Three copies showed extended TIRs due to internal tandem repeats, the insertion of other transposable elements (TEs), or the incorporation of non-TIR sequences into the TIRs. Phylogenetic analyses of the transposase (TPase)-encoding and TIR segments yielded two divergent clades, which we termed Galileo subfamilies V and W. Target-site duplications (TSDs) in D. willistoni Galileo copies were 7- or 8-bp in length, with the consensus sequence GTATTAC. Analysis of the region around the TSDs revealed a target site motif (TSM) with a 15-bp palindrome that may give rise to a stem-loop secondary structure. There is a remarkable abundance and diversity of Galileo copies in the D. willistoni genome, although no functional copies were found. The TIRs in particular have a dynamic structure and extend in different ways, but their ends (required for transposition) are more conserved than the rest of the element. The D. willistoni genome harbors two Galileo subfamilies (V and W) that diverged ~9 million years ago and may have descended from an ancestral

  18. Assessing genetic diversity among Brettanomyces yeasts by DNA fingerprinting and whole-genome sequencing.

    Science.gov (United States)

    Crauwels, Sam; Zhu, Bo; Steensels, Jan; Busschaert, Pieter; De Samblanx, Gorik; Marchal, Kathleen; Willems, Kris A; Verstrepen, Kevin J; Lievens, Bart

    2014-07-01

    Brettanomyces yeasts, with the species Brettanomyces (Dekkera) bruxellensis being the most important one, are generally reported to be spoilage yeasts in the beer and wine industry due to the production of phenolic off flavors. However, B. bruxellensis is also known to be a beneficial contributor in certain fermentation processes, such as the production of certain specialty beers. Nevertheless, despite its economic importance, Brettanomyces yeasts remain poorly understood at the genetic and genomic levels. In this study, the genetic relationship between more than 50 Brettanomyces strains from all presently known species and from several sources was studied using a combination of DNA fingerprinting techniques. This revealed an intriguing correlation between the B. bruxellensis fingerprints and the respective isolation source. To further explore this relationship, we sequenced a (beneficial) beer isolate of B. bruxellensis (VIB X9085; ST05.12/22) and compared its genome sequence with the genome sequences of two wine spoilage strains (AWRI 1499 and CBS 2499). ST05.12/22 was found to be substantially different from both wine strains, especially at the level of single nucleotide polymorphisms (SNPs). In addition, there were major differences in the genome structures between the strains investigated, including the presence of large duplications and deletions. Gene content analysis revealed the presence of 20 genes which were present in both wine strains but absent in the beer strain, including many genes involved in carbon and nitrogen metabolism, and vice versa, no genes that were missing in both AWRI 1499 and CBS 2499 were found in ST05.12/22. Together, this study provides tools to discriminate Brettanomyces strains and provides a first glimpse at the genetic diversity and genome plasticity of B. bruxellensis. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  19. Norrie disease gene sequence variants in an ethnically diverse population with retinopathy of prematurity.

    Science.gov (United States)

    Hutcheson, Kelly A; Paluru, Prasuna C; Bernstein, Steven L; Koh, Jamie; Rappaport, Eric F; Leach, Richard A; Young, Terri L

    2005-07-14

    Retinopathy of prematurity (ROP) is a leading cause of visual loss in the pediatric population. Mutations in the Norrie disease gene (NDP) are associated with heritable retinal vascular disorders, and have been found in a small subset of patients with severe retinopathy of prematurity. Varying rates of progression to threshold disease in different races may have a genetic basis, as recent studies suggest that the incidence of NDP mutations may vary in different groups. African Americans, for example, are less likely to develop severe degrees of ROP. We screened a large cohort of ethnically diverse patients for mutations in the entire NDP. A total of 143 subjects of different ethnic backgrounds were enrolled in the study. Fifty-four patients had severe ROP (Stage 3 or worse). Of these, 38 were threshold in at least one eye (with a mean gestational age of 26.1 weeks and mean birth weight of 788.4 g). There were 36 patients with mild or no ROP, 31 parents with no history of retinal disease or prematurity, and 22 wild type (normal) controls. There were 70 African American subjects, 55 Caucasians, and 18 of other races. Severe ROP was noted in 29 African American subjects, 17 Caucasians, and 8 of other races. Seven polymerase chain reaction primer pairs spanning the NDP were optimized for denaturing high performance liquid chromatography and direct sequencing. Three primer pairs covered the coding region, and the remaining four spanned the 3' and 5' untranslated regions (UTR). Six of 54 (11%) infants with severe ROP had polymorphisms in the NDP. Five of the infants were African American, and one was Caucasian. Two parents were heterozygous for the same polymorphism as their child. One parent-child pair had a single base pair (bp) insertion in the 3' UTR region. Another parent-child pair had two mutations: a 14 bp deletion in the 5' UTR region of exon 1 and a single nucleotide polymorphism in the 5' UTR region of exon 2. No coding region sequence changes were found. No

  20. Low coverage sequencing of three echinoderm genomes: the brittle star Ophionereis fasciata, the sea star Patiriella regularis, and the sea cucumber Australostichopus mollis.

    Science.gov (United States)

    Long, Kyle A; Nossa, Carlos W; Sewell, Mary A; Putnam, Nicholas H; Ryan, Joseph F

    2016-01-01

    There are five major extant groups of Echinodermata: Crinoidea (feather stars and sea lillies), Ophiuroidea (brittle stars and basket stars), Asteroidea (sea stars), Echinoidea (sea urchins, sea biscuits, and sand dollars), and Holothuroidea (sea cucumbers). These animals are known for their pentaradial symmetry as adults, unique water vascular system, mutable collagenous tissues, and endoskeletons of high magnesium calcite. To our knowledge, the only echinoderm species with a genome sequence available to date is Strongylocentrotus pupuratus (Echinoidea). The availability of additional echinoderm genome sequences is crucial for understanding the biology of these animals. Here we present assembled draft genomes of the brittle star Ophionereis fasciata, the sea star Patiriella regularis, and the sea cucumber Australostichopus mollis from Illumina sequence data with coverages of 12.5x, 22.5x, and 21.4x, respectively. These data provide a resource for mining gene superfamilies, identifying non-coding RNAs, confirming gene losses, and designing experimental constructs. They will be important comparative resources for future genomic studies in echinoderms.

  1. Differential sequence diversity at merozoite surface protein-1 locus of Plasmodium knowlesi from humans and macaques in Thailand.

    Science.gov (United States)

    Putaporntip, Chaturong; Thongaree, Siriporn; Jongwutiwes, Somchai

    2013-08-01

    To determine the genetic diversity and potential transmission routes of Plasmodium knowlesi, we analyzed the complete nucleotide sequence of the gene encoding the merozoite surface protein-1 of this simian malaria (Pkmsp-1), an asexual blood-stage vaccine candidate, from naturally infected humans and macaques in Thailand. Analysis of Pkmsp-1 sequences from humans (n=12) and monkeys (n=12) reveals five conserved and four variable domains. Most nucleotide substitutions in conserved domains were dimorphic whereas three of four variable domains contained complex repeats with extensive sequence and size variation. Besides purifying selection in conserved domains, evidence of intragenic recombination scattering across Pkmsp-1 was detected. The number of haplotypes, haplotype diversity, nucleotide diversity and recombination sites of human-derived sequences exceeded that of monkey-derived sequences. Phylogenetic networks based on concatenated conserved sequences of Pkmsp-1 displayed a character pattern that could have arisen from sampling process or the presence of two independent routes of P. knowlesi transmission, i.e. from macaques to human and from human to humans in Thailand. Copyright © 2013 Elsevier B.V. All rights reserved.

  2. Multilocus sequence typing and rtxA toxin gene sequencing analysis of Kingella kingae isolates demonstrates genetic diversity and international clones.

    Directory of Open Access Journals (Sweden)

    Romain Basmaci

    Full Text Available BACKGROUND: Kingella kingae, a normal component of the upper respiratory flora, is being increasingly recognized as an important invasive pathogen in young children. Genetic diversity of this species has not been studied. METHODS: We analyzed 103 strains from different countries and clinical origins by a new multilocus sequence-typing (MLST schema. Putative virulence gene rtxA, encoding an RTX toxin, was also sequenced, and experimental virulence of representative strains was assessed in a juvenile-rat model. RESULTS: Thirty-six sequence-types (ST and nine ST-complexes (STc were detected. The main STc 6, 14 and 23 comprised 23, 17 and 20 strains respectively, and were internationally distributed. rtxA sequencing results were mostly congruent with MLST, and showed horizontal transfer events. Of interest, all members of the distantly related ST-6 (n = 22 and ST-5 (n = 4 harboured a 33 bp duplication or triplication in their rtxA sequence, suggesting that this genetic trait arose through selective advantage. The animal model revealed significant differences in virulence among strains of the species. CONCLUSION: MLST analysis reveals international spread of ST-complexes and will help to decipher acquisition and evolution of virulence traits and diversity of pathogenicity among K. kingae strains, for which an experimental animal model is now available.

  3. DNA sequence analyses reveal abundant diversity, endemism and evidence for Asian origin of the porcini mushrooms.

    Directory of Open Access Journals (Sweden)

    Bang Feng

    Full Text Available The wild gourmet mushroom Boletus edulis and its close allies are of significant ecological and economic importance. They are found throughout the Northern Hemisphere, but despite their ubiquity there are still many unresolved issues with regard to the taxonomy, systematics and biogeography of this group of mushrooms. Most phylogenetic studies of Boletus so far have characterized samples from North America and Europe and little information is available on samples from other areas, including the ecologically and geographically diverse regions of China. Here we analyzed DNA sequence variation in three gene markers from samples of these mushrooms from across China and compared our findings with those from other representative regions. Our results revealed fifteen novel phylogenetic species (about one-third of the known species and a newly identified lineage represented by Boletus sp. HKAS71346 from tropical Asia. The phylogenetic analyses support eastern Asia as the center of diversity for the porcini sensu stricto clade. Within this clade, B. edulis is the only known holarctic species. The majority of the other phylogenetic species are geographically restricted in their distributions. Furthermore, molecular dating and geological evidence suggest that this group of mushrooms originated during the Eocene in eastern Asia, followed by dispersal to and subsequent speciation in other parts of Asia, Europe, and the Americas from the middle Miocene through the early Pliocene. In contrast to the ancient dispersal of porcini in the strict sense in the Northern Hemisphere, the occurrence of B. reticulatus and B. edulis sensu lato in the Southern Hemisphere was probably due to recent human-mediated introductions.

  4. DNA Sequence Analyses Reveal Abundant Diversity, Endemism and Evidence for Asian Origin of the Porcini Mushrooms

    Science.gov (United States)

    Feng, Bang; Xu, Jianping; Wu, Gang; Zeng, Nian-Kai; Li, Yan-Chun; Tolgor, Bau; Kost, Gerhard W.; Yang, Zhu L.

    2012-01-01

    The wild gourmet mushroom Boletus edulis and its close allies are of significant ecological and economic importance. They are found throughout the Northern Hemisphere, but despite their ubiquity there are still many unresolved issues with regard to the taxonomy, systematics and biogeography of this group of mushrooms. Most phylogenetic studies of Boletus so far have characterized samples from North America and Europe and little information is available on samples from other areas, including the ecologically and geographically diverse regions of China. Here we analyzed DNA sequence variation in three gene markers from samples of these mushrooms from across China and compared our findings with those from other representative regions. Our results revealed fifteen novel phylogenetic species (about one-third of the known species) and a newly identified lineage represented by Boletus sp. HKAS71346 from tropical Asia. The phylogenetic analyses support eastern Asia as the center of diversity for the porcini sensu stricto clade. Within this clade, B. edulis is the only known holarctic species. The majority of the other phylogenetic species are geographically restricted in their distributions. Furthermore, molecular dating and geological evidence suggest that this group of mushrooms originated during the Eocene in eastern Asia, followed by dispersal to and subsequent speciation in other parts of Asia, Europe, and the Americas from the middle Miocene through the early Pliocene. In contrast to the ancient dispersal of porcini in the strict sense in the Northern Hemisphere, the occurrence of B. reticulatus and B. edulis sensu lato in the Southern Hemisphere was probably due to recent human-mediated introductions. PMID:22629418

  5. Diversity and dynamics of dominant and rare bacterial taxa in replicate sequencing batch reactors operated under different solids retention time

    KAUST Repository

    Bagchi, Samik; Garcia Tellez, Berenice; Rao, Hari Ananda; Lamendella, Regina; Saikaly, Pascal

    2014-01-01

    In this study, 16S rRNA gene pyrosequencing was applied in order to provide a better insight on the diversity and dynamics of total, dominant, and rare bacterial taxa in replicate lab-scale sequencing batch reactors (SBRs) operated at different

  6. Diversity

    Science.gov (United States)

    Portraits In Courage Vol. VIII Portraits In Courage Vol. IX Portraits In Courage Vol. X AF Sites Social -Wide Initiative to Promote Diversity and Inclusion in the Federal Workforce Executive Order 13548 : Virtual Diversity Conference Air Force Diversity & Inclusion Air Force Diversity Graphic There is no

  7. High-Throughput Sequencing of Microbial Community Diversity and Dynamics during Douchi Fermentation

    Science.gov (United States)

    Tu, Zong-cai; Wang, Xiao-lan

    2016-01-01

    Douchi is a type of Chinese traditional fermented food that is an important source of protein and is used in flavouring ingredients. The end product is affected by the microbial community present during fermentation, but exactly how microbes influence the fermentation process remains poorly understood. We used an Illumina MiSeq approach to investigate bacterial and fungal community diversity during both douchi-koji making and fermentation. A total of 181,443 high quality bacterial 16S rRNA sequences and 221,059 high quality fungal internal transcribed spacer reads were used for taxonomic classification, revealing eight bacterial and three fungal phyla. Firmicutes, Actinobacteria and Proteobacteria were the dominant bacterial phyla, while Ascomycota and Zygomycota were the dominant fungal phyla. At the genus level, Staphylococcus and Weissella were the dominant bacteria, while Aspergillus and Lichtheimia were the dominant fungi. Principal coordinate analysis showed structural separation between the composition of bacteria in koji making and fermentation. However, multivariate analysis of variance based on unweighted UniFrac distances did identify distinct differences (p fermentation. This is the first investigation to integrate douchi fermentation and koji making and fermentation processes through this technological approach. The results provide insight into the microbiome of the douchi fermentation process, and reveal a structural separation that may be stratified by the environment during the production of this traditional fermented food. PMID:27992473

  8. Genetic Diversity Assessment and Identification of New Sour Cherry Genotypes Using Intersimple Sequence Repeat Markers

    Directory of Open Access Journals (Sweden)

    Roghayeh Najafzadeh

    2014-01-01

    Full Text Available Iran is one of the chief origins of subgenus Cerasus germplasm. In this study, the genetic variation of new Iranian sour cherries (which had such superior growth characteristics and fruit quality as to be considered for the introduction of new cultivars was investigated and identified using 23 intersimple sequence repeat (ISSR markers. Results indicated a high level of polymorphism of the genotypes based on these markers. According to these results, primers tested in this study specially ISSR-4, ISSR-6, ISSR-13, ISSR-14, ISSR-16, and ISSR-19 produced good and various levels of amplifications which can be effectively used in genetic studies of the sour cherry. The genetic similarity among genotypes showed a high diversity among the genotypes. Cluster analysis separated improved cultivars from promising Iranian genotypes, and the PCoA supported the cluster analysis results. Since the Iranian genotypes were superior to the improved cultivars and were separated from them in most groups, these genotypes can be considered as distinct genotypes for further evaluations in the framework of breeding programs and new cultivar identification in cherries. Results also confirmed that ISSR is a reliable DNA marker that can be used for exact genetic studies and in sour cherry breeding programs.

  9. Evaluation of the microbial diversity in amyotrophic lateral sclerosis using high-throughput sequencing

    Directory of Open Access Journals (Sweden)

    Xin Fang

    2016-09-01

    Full Text Available More and more evidences indicate that diseases of the central nervous system (CNS have been seriously affected by faecal microbes. However, little work is done to explore interaction between amyotrophic lateral sclerosis (ALS and faecal microbes. In the present study, high-throughput sequencing method was used to compare the intestinal microbial diversity of healthy people and ALS patients. The principal coordinate analysis (PCoA, Venn and unweighted pair-group method using arithmetic averages (UPGMA showed an obvious microbial changes between healthy people (group H and ALS patients (group A, and the average ratios of Bacteroides, Faecalibacterium, Anaerostipes, Prevotella, Escherichia and Lachnospira at genus level between ALS patients and healthy people were 0.78, 2.18, 3.41, 0.35, 0.79 and 13.07. Furthermore, the decreased Firmicutes/Bacteroidetes ratio at phylum level using LEfSE (LDA >4.0, together with the significant increased genus Dorea (harmful microorganisms and significant reduced genus Oscillibacter, Anaerostipes, Lachnospiraceae (beneficial microorganisms in ALS patients, indicated that the imbalance in intestinal microflora constitution had a strong association with the pathogenesis of ALS.

  10. Evaluation of the Microbial Diversity in Amyotrophic Lateral Sclerosis Using High-Throughput Sequencing.

    Science.gov (United States)

    Fang, Xin; Wang, Xin; Yang, Shaoguo; Meng, Fanjing; Wang, Xiaolei; Wei, Hua; Chen, Tingtao

    2016-01-01

    More and more evidences indicate that diseases of the central nervous system have been seriously affected by fecal microbes. However, little work is done to explore interaction between amyotrophic lateral sclerosis (ALS) and fecal microbes. In the present study, high-throughput sequencing method was used to compare the intestinal microbial diversity of healthy people and ALS patients. The principal coordinate analysis, Venn and unweighted pair-group method using arithmetic averages (UPGMA) showed an obvious microbial changes between healthy people (group H) and ALS patients (group A), and the average ratios of Bacteroides , Faecalibacterium , Anaerostipes , Prevotella , Escherichia , and Lachnospira at genus level between ALS patients and healthy people were 0.78, 2.18, 3.41, 0.35, 0.79, and 13.07. Furthermore, the decreased Firmicutes/Bacteroidetes ratio at phylum level using LEfSE (LDA > 4.0), together with the significant increased genus Dorea (harmful microorganisms) and significant reduced genus Oscillibacter , Anaerostipes , Lachnospiraceae (beneficial microorganisms) in ALS patients, indicated that the imbalance in intestinal microflora constitution had a strong association with the pathogenesis of ALS.

  11. Next-Generation Sequencing Analysis of the Diversity of Human Noroviruses in Japanese Oysters.

    Science.gov (United States)

    Imamura, Saiki; Kanezashi, Hiromi; Goshima, Tomoko; Haruna, Mika; Okada, Tsukasa; Inagaki, Nobuya; Uema, Masashi; Noda, Mamoru; Akimoto, Keiko

    2017-08-01

    To obtain detailed information on the diversity of infectious norovirus in oysters (Crossostrea gigas), oysters obtained from fish producers at six different sites (sites A, B, C, D, E, and F) in Japan were analyzed once a month during the period spanning October 2015-February 2016. To avoid false-positive polymerase chain reaction (PCR) results derived from noninfectious virus particles, samples were pretreated with RNase before reverse transcription-PCR (RT-PCR). RT-PCR products were subjected to next-generation sequencing to identify norovirus genotypes in oysters. As a result, all GI genotypes were detected in the investigational period. The detection rate and proportion of norovirus GI genotypes differed depending on the sampling site and month. GII.3, GII.4, GII.13, GII.16, and GII.17 were detected in this study. Both the detection rate and proportion of norovirus GII genotypes differed depending on the sampling site and month. In total, the detection rate and proportion of GII.3 were highest from October to December among all detected genotypes. In January, the detection rates of GII.4 and GII.17 reached the same level as that of GII.3. The proportion of GII.17 was relatively lower from October to December, whereas it was the highest in January. To our knowledge, this is the first investigation on noroviruses in oysters in Japan, based on a method that can distinguish their infectivity.

  12. Sequence variants of the DFNB31 gene among Usher syndrome patients of diverse origin

    Science.gov (United States)

    Aller, Elena; Jaijo, Teresa; van Wijk, Erwin; Ebermann, Inga; Kersten, Ferry; García-García, Gema; Voesenek, Krysta; Aparisi, María José; Hoefsloot, Lies; Cremers, Cor; Díaz-Llopis, Manuel; Pennings, Ronald; Bolz, Hanno J.; Kremer, Hannie; Millán, José M.

    2010-01-01

    Purpose It has been demonstrated that mutations in deafness, autosomal recessive 31 (DFNB31), the gene encoding whirlin, is responsible for nonsyndromic hearing loss (NSHL; DFNB31) and Usher syndrome type II (USH2D). We screened DFNB31 in a large cohort of patients with different clinical subtypes of Usher syndrome (USH) to determine the prevalence of DFNB31 mutations among USH patients. Methods DFNB31 was screened in 149 USH2, 29 USH1, six atypical USH, and 11 unclassified USH patients from diverse ethnic backgrounds. Mutation detection was performed by direct sequencing of all coding exons. Results We identified 38 different variants among 195 patients. Most variants were clearly polymorphic, but at least two out of the 15 nonsynonymous variants (p.R350W and p.R882S) are predicted to impair whirlin structure and function, suggesting eventual pathogenicity. No putatively pathogenic mutation was found in the second allele of patients with these mutations. Conclusions DFNB31 is not a major cause of USH. PMID:20352026

  13. Genetic Diversity in Passiflora Species Assessed by Morphological and ITS Sequence Analysis

    Directory of Open Access Journals (Sweden)

    Shiamala Devi Ramaiya

    2014-01-01

    Full Text Available This study used morphological characterization and phylogenetic analysis of the internal transcribed spacer (ITS region of nuclear ribosomal DNA to investigate the phylogeny of Passiflora species. The samples were collected from various regions of East Malaysia, and discriminant function analysis based on linear combinations of morphological variables was used to classify the Passiflora species. The biplots generated five distinct groups discriminated by morphological variables. The group consisted of cultivars of P. edulis with high levels of genetic similarity; in contrast, P. foetida was highly divergent from other species in the morphological biplots. The final dataset of aligned sequences from nine studied Passiflora accessions and 30 other individuals obtained from GenBank database (NCBI yielded one most parsimonious tree with two strongly supported clades. Maximum parsimony (MP tree showed the phylogenetic relationships within this subgenus Passiflora support the classification at the series level. The constructed phylogenic tree also confirmed the divergence of P. foetida from all other species and the closeness of wild and cultivated species. The phylogenetic relationships were consistent with results of morphological assessments. The results of this study indicate that ITS region analysis represents a useful tool for evaluating genetic diversity in Passiflora at the species level.

  14. Intermediary metabolism in protists: a sequence-based view of facultative anaerobic metabolism in evolutionarily diverse eukaryotes.

    Science.gov (United States)

    Ginger, Michael L; Fritz-Laylin, Lillian K; Fulton, Chandler; Cande, W Zacheus; Dawson, Scott C

    2010-12-01

    Protists account for the bulk of eukaryotic diversity. Through studies of gene and especially genome sequences the molecular basis for this diversity can be determined. Evident from genome sequencing are examples of versatile metabolism that go far beyond the canonical pathways described for eukaryotes in textbooks. In the last 2-3 years, genome sequencing and transcript profiling has unveiled several examples of heterotrophic and phototrophic protists that are unexpectedly well-equipped for ATP production using a facultative anaerobic metabolism, including some protists that can (Chlamydomonas reinhardtii) or are predicted (Naegleria gruberi, Acanthamoeba castellanii, Amoebidium parasiticum) to produce H(2) in their metabolism. It is possible that some enzymes of anaerobic metabolism were acquired and distributed among eukaryotes by lateral transfer, but it is also likely that the common ancestor of eukaryotes already had far more metabolic versatility than was widely thought a few years ago. The discussion of core energy metabolism in unicellular eukaryotes is the subject of this review. Since genomic sequencing has so far only touched the surface of protist diversity, it is anticipated that sequences of additional protists may reveal an even wider range of metabolic capabilities, while simultaneously enriching our understanding of the early evolution of eukaryotes. Copyright © 2010 Elsevier GmbH. All rights reserved.

  15. COCACOLA: binning metagenomic contigs using sequence COmposition, read CoverAge, CO-alignment and paired-end read LinkAge.

    Science.gov (United States)

    Lu, Yang Young; Chen, Ting; Fuhrman, Jed A; Sun, Fengzhu

    2017-03-15

    The advent of next-generation sequencing technologies enables researchers to sequence complex microbial communities directly from the environment. Because assembly typically produces only genome fragments, also known as contigs, instead of an entire genome, it is crucial to group them into operational taxonomic units (OTUs) for further taxonomic profiling and down-streaming functional analysis. OTU clustering is also referred to as binning. We present COCACOLA, a general framework automatically bin contigs into OTUs based on sequence composition and coverage across multiple samples. The effectiveness of COCACOLA is demonstrated in both simulated and real datasets in comparison with state-of-art binning approaches such as CONCOCT, GroopM, MaxBin and MetaBAT. The superior performance of COCACOLA relies on two aspects. One is using L 1 distance instead of Euclidean distance for better taxonomic identification during initialization. More importantly, COCACOLA takes advantage of both hard clustering and soft clustering by sparsity regularization. In addition, the COCACOLA framework seamlessly embraces customized knowledge to facilitate binning accuracy. In our study, we have investigated two types of additional knowledge, the co-alignment to reference genomes and linkage of contigs provided by paired-end reads, as well as the ensemble of both. We find that both co-alignment and linkage information further improve binning in the majority of cases. COCACOLA is scalable and faster than CONCOCT, GroopM, MaxBin and MetaBAT. The software is available at https://github.com/younglululu/COCACOLA . fsun@usc.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  16. Genetic Diversity and Population Structure of F3:6 Nebraska Winter Wheat Genotypes Using Genotyping-By-Sequencing.

    Science.gov (United States)

    Eltaher, Shamseldeen; Sallam, Ahmed; Belamkar, Vikas; Emara, Hamdy A; Nower, Ahmed A; Salem, Khaled F M; Poland, Jesse; Baenziger, Peter S

    2018-01-01

    The availability of information on the genetic diversity and population structure in wheat ( Triticum aestivum L.) breeding lines will help wheat breeders to better use their genetic resources and manage genetic variation in their breeding program. The recent advances in sequencing technology provide the opportunity to identify tens or hundreds of thousands of single nucleotide polymorphism (SNPs) in large genome species (e.g., wheat). These SNPs can be utilized for understanding genetic diversity and performing genome wide association studies (GWAS) for complex traits. In this study, the genetic diversity and population structure were investigated in a set of 230 genotypes (F 3:6 ) derived from various crosses as a prerequisite for GWAS and genomic selection. Genotyping-by-sequencing provided 25,566 high-quality SNPs. The polymorphism information content (PIC) across chromosomes ranged from 0.09 to 0.37 with an average of 0.23. The distribution of SNPs markers on the 21 chromosomes ranged from 319 on chromosome 3D to 2,370 on chromosome 3B. The analysis of population structure revealed three subpopulations (G1, G2, and G3). Analysis of molecular variance identified 8% variance among and 92% within subpopulations. Of the three subpopulations, G2 had the highest level of genetic diversity based on three genetic diversity indices: Shannon's information index ( I ) = 0.494, diversity index ( h ) = 0.328 and unbiased diversity index (uh) = 0.331, while G3 had lowest level of genetic diversity ( I = 0.348, h = 0.226 and uh = 0.236). This high genetic diversity identified among the subpopulations can be used to develop new wheat cultivars.

  17. Expressed Sequence Tag-Simple Sequence Repeat (EST-SSR Marker Resources for Diversity Analysis of Mango (Mangifera indica L.

    Directory of Open Access Journals (Sweden)

    Natalie L. Dillon

    2014-01-01

    Full Text Available In this study, a collection of 24,840 expressed sequence tags (ESTs generated from five mango (Mangifera indica L. cDNA libraries was mined for EST-based simple sequence repeat (SSR markers. Over 1,000 ESTs with SSR motifs were detected from more than 24,000 EST sequences with di- and tri-nucleotide repeat motifs the most abundant. Of these, 25 EST-SSRs in genes involved in plant development, stress response, and fruit color and flavor development pathways were selected, developed into PCR markers and characterized in a population of 32 mango selections including M. indica varieties, and related Mangifera species. Twenty-four of the 25 EST-SSR markers exhibited polymorphisms, identifying a total of 86 alleles with an average of 5.38 alleles per locus, and distinguished between all Mangifera selections. Private alleles were identified for Mangifera species. These newly developed EST-SSR markers enhance the current 11 SSR mango genetic identity panel utilized by the Australian Mango Breeding Program. The current panel has been used to identify progeny and parents for selection and the application of this extended panel will further improve and help to design mango hybridization strategies for increased breeding efficiency.

  18. Application of Ion Torrent Sequencing to the Assessment of the Effect of Alkali Ballast Water Treatment on Microbial Community Diversity

    Science.gov (United States)

    Fujimoto, Masanori; Moyerbrailean, Gregory A.; Noman, Sifat; Gizicki, Jason P.; Ram, Michal L.; Green, Phyllis A.; Ram, Jeffrey L.

    2014-01-01

    The impact of NaOH as a ballast water treatment (BWT) on microbial community diversity was assessed using the 16S rRNA gene based Ion Torrent sequencing with its new 400 base chemistry. Ballast water samples from a Great Lakes ship were collected from the intake and discharge of both control and NaOH (pH 12) treated tanks and were analyzed in duplicates. One set of duplicates was treated with the membrane-impermeable DNA cross-linking reagent propidium mono-azide (PMA) prior to PCR amplification to differentiate between live and dead microorganisms. Ion Torrent sequencing generated nearly 580,000 reads for 31 bar-coded samples and revealed alterations of the microbial community structure in ballast water that had been treated with NaOH. Rarefaction analysis of the Ion Torrent sequencing data showed that BWT using NaOH significantly decreased microbial community diversity relative to control discharge (pbased principal coordinate analysis (PCoA) plots and UPGMA tree analysis revealed that NaOH-treated ballast water microbial communities differed from both intake communities and control discharge communities. After NaOH treatment, bacteria from the genus Alishewanella became dominant in the NaOH-treated samples, accounting for alkali ballast water treatment in reducing ballast water microbial diversity and demonstrated the application of new Ion Torrent sequencing techniques to microbial community studies. PMID:25222021

  19. Characterization of Human Cytomegalovirus Genome Diversity in Immunocompromised Hosts by Whole-Genome Sequencing Directly From Clinical Specimens.

    Science.gov (United States)

    Hage, Elias; Wilkie, Gavin S; Linnenweber-Held, Silvia; Dhingra, Akshay; Suárez, Nicolás M; Schmidt, Julius J; Kay-Fedorov, Penelope C; Mischak-Weissinger, Eva; Heim, Albert; Schwarz, Anke; Schulz, Thomas F; Davison, Andrew J; Ganzenmueller, Tina

    2017-06-01

    Advances in next-generation sequencing (NGS) technologies allow comprehensive studies of genetic diversity over the entire genome of human cytomegalovirus (HCMV), a significant pathogen for immunocompromised individuals. Next-generation sequencing was performed on target enriched sequence libraries prepared directly from a variety of clinical specimens (blood, urine, breast milk, respiratory samples, biopsies, and vitreous humor) obtained longitudinally or from different anatomical compartments from 20 HCMV-infected patients (renal transplant recipients, stem cell transplant recipients, and congenitally infected children). De novo-assembled HCMV genome sequences were obtained for 57 of 68 sequenced samples. Analysis of longitudinal or compartmental HCMV diversity revealed various patterns: no major differences were detected among longitudinal, intraindividual blood samples from 9 of 15 patients and in most of the patients with compartmental samples, whereas a switch of the major HCMV population was observed in 6 individuals with sequential blood samples and upon compartmental analysis of 1 patient with HCMV retinitis. Variant analysis revealed additional aspects of minor virus population dynamics and antiviral-resistance mutations. In immunosuppressed patients, HCMV can remain relatively stable or undergo drastic genomic changes that are suggestive of the emergence of minor resident strains or de novo infection. © The Author 2017. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.

  20. Sequence diversity and natural selection at domain I of the apical membrane antigen 1 among Indian Plasmodium falciparum populations

    Directory of Open Access Journals (Sweden)

    Kumar Ashwani

    2007-11-01

    Full Text Available Abstract Background The Plasmodium falciparum apical membrane antigen 1 (AMA1 is a leading malaria vaccine candidate antigen. The complete AMA1 protein is comprised of three domains where domain I exhibits high sequence polymorphism and is thus named as the hyper-variable region (HVR. The present study describes the extent of genetic polymorphism and natural selection at domain I of the ama1 gene among Indian P. falciparum isolates. Methods The part of the ama1 gene covering domain I was PCR amplified and sequenced from 157 P. falciparum isolates collected from five different geographical regions of India. Statistical and phylogenetic analyses of the sequences were done using DnaSP ver. 4. 10. 9 and MEGA version 3.0 packages. Results A total of 57 AMA1 haplotypes were observed among 157 isolates sequenced. Forty-six of these 57 haplotypes are being reported here for the first time. The parasites collected from the high malaria transmission areas (Assam, Orissa, and Andaman and Nicobar Islands showed more haplotypes (H and nucleotide diversity π as compared to low malaria transmission areas (Uttar Pradesh and Goa. The comparison of all five Indian P. falciparum subpopulations indicated moderate level of genetic differentiation and limited gene flow (Fixation index ranging from 0.048 to 0.13 between populations. The difference between rates of non-synonymous and synonymous mutations, Tajima's D and McDonald-Kreitman test statistics suggested that the diversity at domain I of the AMA1 antigen is due to positive natural selection. The minimum recombination events were also high indicating the possible role of recombination in generating AMA1 allelic diversity. Conclusion The level of genetic diversity and diversifying selection were higher in Assam, Orissa, and Andaman and Nicobar Islands populations as compared to Uttar Pradesh and Goa. The amounts of gene flow among these populations were moderate. The data reported here will be valuable for the

  1. Genetic diversity and molecular evolution of Naga King Chili inferred from internal transcribed spacer sequence of nuclear ribosomal DNA.

    Science.gov (United States)

    Kehie, Mechuselie; Kumaria, Suman; Devi, Khumuckcham Sangeeta; Tandon, Pramod

    2016-02-01

    Sequences of the Internal Transcribed Spacer (ITS1-5.8S-ITS2) of nuclear ribosomal DNAs were explored to study the genetic diversity and molecular evolution of Naga King Chili. Our study indicated the occurrence of nucleotide polymorphism and haplotypic diversity in the ITS regions. The present study demonstrated that the variability of ITS1 with respect to nucleotide diversity and sequence polymorphism exceeded that of ITS2. Sequence analysis of 5.8S gene revealed a much conserved region in all the accessions of Naga King Chili. However, strong phylogenetic information of this species is the distinct 13 bp deletion in the 5.8S gene which discriminated Naga King Chili from the rest of the Capsicum sp. Neutrality test results implied a neutral variation, and population seems to be evolving at drift-mutation equilibrium and free from directed selection pressure. Furthermore, mismatch analysis showed multimodal curve indicating a demographic equilibrium. Phylogenetic relationships revealed by Median Joining Network (MJN) analysis denoted a clear discrimination of Naga King Chili from its closest sister species (Capsicum chinense and Capsicum frutescens). The absence of star-like network of haplotypes suggested an ancient population expansion of this chili.

  2. Application of ion torrent sequencing to the assessment of the effect of alkali ballast water treatment on microbial community diversity.

    Science.gov (United States)

    Fujimoto, Masanori; Moyerbrailean, Gregory A; Noman, Sifat; Gizicki, Jason P; Ram, Michal L; Green, Phyllis A; Ram, Jeffrey L

    2014-01-01

    The impact of NaOH as a ballast water treatment (BWT) on microbial community diversity was assessed using the 16S rRNA gene based Ion Torrent sequencing with its new 400 base chemistry. Ballast water samples from a Great Lakes ship were collected from the intake and discharge of both control and NaOH (pH 12) treated tanks and were analyzed in duplicates. One set of duplicates was treated with the membrane-impermeable DNA cross-linking reagent propidium mono-azide (PMA) prior to PCR amplification to differentiate between live and dead microorganisms. Ion Torrent sequencing generated nearly 580,000 reads for 31 bar-coded samples and revealed alterations of the microbial community structure in ballast water that had been treated with NaOH. Rarefaction analysis of the Ion Torrent sequencing data showed that BWT using NaOH significantly decreased microbial community diversity relative to control discharge (pPCoA) plots and UPGMA tree analysis revealed that NaOH-treated ballast water microbial communities differed from both intake communities and control discharge communities. After NaOH treatment, bacteria from the genus Alishewanella became dominant in the NaOH-treated samples, accounting for microbial community structure between PMA-processed and non-PMA samples occurred in intake water samples, which exhibited a significantly higher amount of PMA-sensitive cyanobacteria/chloroplast 16S rRNA than their corresponding non-PMA total DNA samples. The community assembly obtained using Ion Torrent sequencing was comparable to that obtained from a subset of samples that were also subjected to 454 pyrosequencing. This study showed the efficacy of alkali ballast water treatment in reducing ballast water microbial diversity and demonstrated the application of new Ion Torrent sequencing techniques to microbial community studies.

  3. Second generation sequencing for elucidating the diversity of bacteria and plasmids in soil

    DEFF Research Database (Denmark)

    Holmsgaard, Peter Nikolai

    . The relative abundance of IncP-1β1 plasmids also increased. In papers four and five, the mobile genetic elements and bacterial diversity, respectively, was studied over a pesticide spraying season in the same BPS used in paper three. The addition of pesticides decreased overall bacterial diversity...

  4. Genetic diversity of Taenia hydatigena in the northern part of the West Bank, Palestine as determined by mitochondrial DNA sequences.

    Science.gov (United States)

    Adwan, Kamel; Jayousi, Alaa; Abuseir, Sameh; Abbasi, Ibrahim; Adwan, Ghaleb; Jarrar, Naser

    2018-06-26

    Cysticercus tenuicollis is the metacestode of canine tapeworm Taenia hydatigena, which has been reported in domestic and wild ruminants and is causing veterinary and economic losses in the meat industry. This study was conducted to determine the sequence variation in the mitochondrial cytochrome c oxidase subunit 1 (coxl) gene in 20 isolates of T. hydatigena metacestodes (cysticercus tenuicollis) collected from northern West Bank in Palestine. Nine haplotypes were detected, with one prevailing (55%). The total haplotype diversity (0.705) and the total nucleotide diversity (0.0045) displayed low genetic diversity among our isolates. Haplotype analysis showed a star-shaped network with a centrally positioned common haplotype. The Tajima's D, and Fu and Li's statistics in cysticercus tenuicollis population of this region showed a negative value, indicating deviations from neutrality and both suggested recent population expansion for the population. The findings of this study would greatly help to implement control and preventive measures for T. hydatigena larvae infection in Palestine.

  5. DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.

    Science.gov (United States)

    de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

    2015-11-16

    Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. Genetic Diversity and Sequence Variations at Growth Hormone Loci among Composite and Hereford Populations of Beef Cattle

    Directory of Open Access Journals (Sweden)

    ALAN J. LYMBERY

    2000-07-01

    Full Text Available A total of 194 Hereford and 235 composite breed cattle from Wokalup Research Station were used in this study. The aims of the study were to: Investigate polymorphisms in the growth hormone gene in the composite and purebred Hereford herds from the Wokalup selection experiment, compare genetic diversity in the growth hormone gene of the breeds, sequencing and compare the sequences of growth hormone loci between composite and purebred Hereford herds with published sequence from Genebank. The genomic DNA was extracted using Wizard genomic DNA purification system from Promega. Two fragments of growth hormone gene were amplified using PCR and continued with RFLP. Each genotype in both loci was sequenced. PCR products of each genotypes were cloned into PCR II, transformed, colonies selection, plasmid DNA extraction continued with cycle sequencing. Polymorphisms were found in both breeds of cattle in both loci of GH-L1 and GH-L2 of the growth hormone gene by PCR-RFLP analysis. Sequencing analysis confirmed the RFLPs data, polymorphism detected using AluI at GH-L1 is due to substitution between leusin/ valine at position 127, while polymorphism at the MspI restriction site was caused by transition of C to T at +837 position.

  7. Analysis of intra-host genetic diversity of Prunus necrotic ringspot virus (PNRSV) using amplicon next generation sequencing.

    Science.gov (United States)

    Kinoti, Wycliff M; Constable, Fiona E; Nancarrow, Narelle; Plummer, Kim M; Rodoni, Brendan

    2017-01-01

    PCR amplicon next generation sequencing (NGS) analysis offers a broadly applicable and targeted approach to detect populations of both high- or low-frequency virus variants in one or more plant samples. In this study, amplicon NGS was used to explore the diversity of the tripartite genome virus, Prunus necrotic ringspot virus (PNRSV) from 53 PNRSV-infected trees using amplicons from conserved gene regions of each of PNRSV RNA1, RNA2 and RNA3. Sequencing of the amplicons from 53 PNRSV-infected trees revealed differing levels of polymorphism across the three different components of the PNRSV genome with a total number of 5040, 2083 and 5486 sequence variants observed for RNA1, RNA2 and RNA3 respectively. The RNA2 had the lowest diversity of sequences compared to RNA1 and RNA3, reflecting the lack of flexibility tolerated by the replicase gene that is encoded by this RNA component. Distinct PNRSV phylo-groups, consisting of closely related clusters of sequence variants, were observed in each of PNRSV RNA1, RNA2 and RNA3. Most plant samples had a single phylo-group for each RNA component. Haplotype network analysis showed that smaller clusters of PNRSV sequence variants were genetically connected to the largest sequence variant cluster within a phylo-group of each RNA component. Some plant samples had sequence variants occurring in multiple PNRSV phylo-groups in at least one of each RNA and these phylo-groups formed distinct clades that represent PNRSV genetic strains. Variants within the same phylo-group of each Prunus plant sample had ≥97% similarity and phylo-groups within a Prunus plant sample and between samples had less ≤97% similarity. Based on the analysis of diversity, a definition of a PNRSV genetic strain was proposed. The proposed definition was applied to determine the number of PNRSV genetic strains in each of the plant samples and the complexity in defining genetic strains in multipartite genome viruses was explored.

  8. Analysis of intra-host genetic diversity of Prunus necrotic ringspot virus (PNRSV using amplicon next generation sequencing.

    Directory of Open Access Journals (Sweden)

    Wycliff M Kinoti

    Full Text Available PCR amplicon next generation sequencing (NGS analysis offers a broadly applicable and targeted approach to detect populations of both high- or low-frequency virus variants in one or more plant samples. In this study, amplicon NGS was used to explore the diversity of the tripartite genome virus, Prunus necrotic ringspot virus (PNRSV from 53 PNRSV-infected trees using amplicons from conserved gene regions of each of PNRSV RNA1, RNA2 and RNA3. Sequencing of the amplicons from 53 PNRSV-infected trees revealed differing levels of polymorphism across the three different components of the PNRSV genome with a total number of 5040, 2083 and 5486 sequence variants observed for RNA1, RNA2 and RNA3 respectively. The RNA2 had the lowest diversity of sequences compared to RNA1 and RNA3, reflecting the lack of flexibility tolerated by the replicase gene that is encoded by this RNA component. Distinct PNRSV phylo-groups, consisting of closely related clusters of sequence variants, were observed in each of PNRSV RNA1, RNA2 and RNA3. Most plant samples had a single phylo-group for each RNA component. Haplotype network analysis showed that smaller clusters of PNRSV sequence variants were genetically connected to the largest sequence variant cluster within a phylo-group of each RNA component. Some plant samples had sequence variants occurring in multiple PNRSV phylo-groups in at least one of each RNA and these phylo-groups formed distinct clades that represent PNRSV genetic strains. Variants within the same phylo-group of each Prunus plant sample had ≥97% similarity and phylo-groups within a Prunus plant sample and between samples had less ≤97% similarity. Based on the analysis of diversity, a definition of a PNRSV genetic strain was proposed. The proposed definition was applied to determine the number of PNRSV genetic strains in each of the plant samples and the complexity in defining genetic strains in multipartite genome viruses was explored.

  9. Propionibacterium acnes: disease-causing agent or common contaminant? Detection in diverse patient samples by next generation sequencing

    DEFF Research Database (Denmark)

    Mollerup, Sarah; Friis-Nielsen, Jens; Vinner, Lasse

    2016-01-01

    Propionibacterium acnes is the most abundant bacterium on human skin, particularly in sebaceous areas. P. acnes is suggested to be an opportunistic pathogen involved in the development of diverse medical conditions, but is also a proven contaminant of human samples and surgical wounds. Its...... significance as a pathogen is consequently a matter of debate.In the present study we investigated the presence of P. acnes DNA in 250 next generation sequencing datasets generated from 180 samples of 20 different sample types, mostly of cancerous origin. The samples were either subjected to microbial...... enrichment, involving nuclease treatment to reduce the amount of host nucleic acids, or shotgun-sequenced.We detected high proportions of P. acnes in enriched samples, particularly skin derived and other tissue samples, with levels being higher in enriched compared to shotgun-sequenced samples. P. acnes...

  10. Genome sequence analysis with MonetDB - A case study on Ebola virus diversity

    NARCIS (Netherlands)

    Cijvat, R.; Manegold, S.; Kersten, M.; Klau, G.W.; Schönhuth, A.; Marschall, T.; Zhang, Y.

    2015-01-01

    Next-generation sequencing (NGS) technology has led the life sciences into the big data era. Today, sequencing genomes takes little time and cost, but yields terabytes of data to be stored and analyzed. Biologists are often exposed to excessively time consuming and error-prone data management and

  11. Genome survey of pistachio (Pistacia vera L.) by next generation sequencing: Development of novel SSR markers and genetic diversity in Pistacia species.

    Science.gov (United States)

    Ziya Motalebipour, Elmira; Kafkas, Salih; Khodaeiaminjan, Mortaza; Çoban, Nergiz; Gözel, Hatice

    2016-12-07

    Pistachio (Pistacia vera L.) is one of the most important nut crops in the world. There are about 11 wild species in the genus Pistacia, and they have importance as rootstock seed sources for cultivated P. vera and forest trees. Published information on the pistachio genome is limited. Therefore, a genome survey is necessary to obtain knowledge on the genome structure of pistachio by next generation sequencing. Simple sequence repeat (SSR) markers are useful tools for germplasm characterization, genetic diversity analysis, and genetic linkage mapping, and may help to elucidate genetic relationships among pistachio cultivars and species. To explore the genome structure of pistachio, a genome survey was performed using the Illumina platform at approximately 40× coverage depth in the P. vera cv. Siirt. The K-mer analysis indicated that pistachio has a genome that is about 600 Mb in size and is highly heterozygous. The assembly of 26.77 Gb Illumina data produced 27,069 scaffolds at N50 = 3.4 kb with a total of 513.5 Mb. A total of 59,280 SSR motifs were detected with a frequency of 8.67 kb. A total of 206 SSRs were used to characterize 24 P. vera cultivars and 20 wild Pistacia genotypes (four genotypes from each five wild Pistacia species) belonging to P. atlantica, P. integerrima, P. chinenesis, P. terebinthus, and P. lentiscus genotypes. Overall 135 SSR loci amplified in all 44 cultivars and genotypes, 41 were polymorphic in six Pistacia species. The novel SSR loci developed from cultivated pistachio were highly transferable to wild Pistacia species. The results from a genome survey of pistachio suggest that the genome size of pistachio is about 600 Mb with a high heterozygosity rate. This information will help to design whole genome sequencing strategies for pistachio. The newly developed novel polymorphic SSRs in this study may help germplasm characterization, genetic diversity, and genetic linkage mapping studies in the genus Pistacia.

  12. Comparison of the Diversity of Basidiomycetes from Dead Wood of the Manchurian fir (Abies holophylla) as Evaluated by Fruiting Body Collection, Mycelial Isolation, and 454 Sequencing.

    Science.gov (United States)

    Jang, Yeongseon; Jang, Seokyoon; Min, Mihee; Hong, Joo-Hyun; Lee, Hanbyul; Lee, Hwanhwi; Lim, Young Woon; Kim, Jae-Jin

    2015-10-01

    In this study, three different methods (fruiting body collection, mycelial isolation, and 454 sequencing) were implemented to determine the diversity of wood-inhabiting basidiomycetes from dead Manchurian fir (Abies holophylla). The three methods recovered similar species richness (26 species from fruiting bodies, 32 species from mycelia, and 32 species from 454 sequencing), but Fisher's alpha, Shannon-Wiener, Simpson's diversity indices of fungal communities indicated fruiting body collection and mycelial isolation displayed higher diversity compared with 454 sequencing. In total, 75 wood-inhabiting basidiomycetes were detected. The most frequently observed species were Heterobasidion orientale (fruiting body collection), Bjerkandera adusta (mycelial isolation), and Trichaptum fusco-violaceum (454 sequencing). Only two species, Hymenochaete yasudae and Hypochnicium karstenii, were detected by all three methods. This result indicated that Manchurian fir harbors a diverse basidiomycetous fungal community and for complete estimation of fungal diversity, multiple methods should be used. Further studies are required to understand their ecology in the context of forest ecosystems.

  13. Linking secondary metabolites to gene clusters through genome sequencing of six diverse Aspergillus species

    DEFF Research Database (Denmark)

    Kjærbølling, Inge; Vesth, Tammi C.; Frisvad, Jens C.

    2018-01-01

    The fungal genus of Aspergillus is highly interesting, containing everything from industrial cell factories, model organisms, and human pathogens. In particular, this group has a prolific production of bioactive secondary metabolites (SMs). In this work, four diverse Aspergillus species (A...

  14. Genetic diversity of the Andean tuber-bearing species, oca (Oxalis tuberosa Mol.), investigated by inter-simple sequence repeats.

    Science.gov (United States)

    Pissard, A; Ghislain, M; Bertin, P

    2006-01-01

    The Andean tuber-bearing species, Oxalis tuberosa Mol., is a vegetatively propagated crop cultivated in the uplands of the Andes. Its genetic diversity was investigated in the present study using the inter-simple sequence repeat (ISSR) technique. Thirty-two accessions originating from South America (Argentina, Bolivia, Chile, and Peru) and maintained in vitro were chosen to represent the ecogeographic diversity of its cultivation area. Twenty-two primers were tested and 9 were selected according to fingerprinting quality and reproducibility. Genetic diversity analysis was performed with 90 markers. Jaccard's genetic distance between accessions ranged from 0 to 0.49 with an average of 0.28 +/- 0.08 (mean +/- SD). Dendrogram (UPGMA (unweighted pair-group method with arithmetic averaging)) and factorial correspondence analysis (FCA) showed that the genetic structure was influenced by the collection site. The two most distant clusters contained all of the Peruvian accessions, one from Bolivia, none from Argentina or Chile. Analysis by country revealed that Peru presented the greatest genetic distances from the other countries and possessed the highest intra-country genetic distance (0.30 +/- 0.08). This suggests that the Peruvian oca accessions form a distinct genetic group. The relatively low level of genetic diversity in the oca species may be related to its predominating reproduction strategy, i.e., vegetative propagation. The extent and structure of the genetic diversity of the species detailed here should help the establishment of conservation strategies.

  15. Genetic Diversity and Phylogenetic Evolution of Tibetan Sheep Based on mtDNA D-Loop Sequences.

    Directory of Open Access Journals (Sweden)

    Jianbin Liu

    Full Text Available The molecular and population genetic evidence of the phylogenetic status of the Tibetan sheep (Ovis aries is not well understood, and little is known about this species' genetic diversity. This knowledge gap is partly due to the difficulty of sample collection. This is the first work to address this question. Here, the genetic diversity and phylogenetic relationship of 636 individual Tibetan sheep from fifteen populations were assessed using 642 complete sequences of the mitochondrial DNA D-loop. Samples were collected from the Qinghai-Tibetan Plateau area in China, and reference data were obtained from the six reference breed sequences available in GenBank. The length of the sequences varied considerably, between 1031 and 1259 bp. The haplotype diversity and nucleotide diversity were 0.992±0.010 and 0.019±0.001, respectively. The average number of nucleotide differences was 19.635. The mean nucleotide composition of the 350 haplotypes was 32.961% A, 29.708% T, 22.892% C, 14.439% G, 62.669% A+T, and 37.331% G+C. Phylogenetic analysis showed that all four previously defined haplogroups (A, B, C, and D were found in the 636 individuals of the fifteen Tibetan sheep populations but that only the D haplogroup was found in Linzhou sheep. Further, the clustering analysis divided the fifteen Tibetan sheep populations into at least two clusters. The estimation of the demographic parameters from the mismatch analyses showed that haplogroups A, B, and C had at least one demographic expansion in Tibetan sheep. These results contribute to the knowledge of Tibetan sheep populations and will help inform future conservation programs about the Tibetan sheep native to the Qinghai-Tibetan Plateau.

  16. The Pinus taeda genome is characterized by diverse and highly diverged repetitive sequences

    Directory of Open Access Journals (Sweden)

    Yandell Mark

    2010-07-01

    Full Text Available Abstract Background In today's age of genomic discovery, no attempt has been made to comprehensively sequence a gymnosperm genome. The largest genus in the coniferous family Pinaceae is Pinus, whose 110-120 species have extremely large genomes (c. 20-40 Gb, 2N = 24. The size and complexity of these genomes have prompted much speculation as to the feasibility of completing a conifer genome sequence. Conifer genomes are reputed to be highly repetitive, but there is little information available on the nature and identity of repetitive units in gymnosperms. The pines have extensive genetic resources, with approximately 329000 ESTs from eleven species and genetic maps in eight species, including a dense genetic map of the twelve linkage groups in Pinus taeda. Results We present here the Sanger sequence and annotation of ten P. taeda BAC clones and Genome Analyzer II whole genome shotgun (WGS sequences representing 7.5% of the genome. Computational annotation of ten BACs predicts three putative protein-coding genes and at least fifteen likely pseudogenes in nearly one megabase of sequence. We found three conifer-specific LTR retroelements in the BACs, and tentatively identified at least 15 others based on evidence from the distantly related angiosperms. Alignment of WGS sequences to the BACs indicates that 80% of BAC sequences have similar copies (≥ 75% nucleotide identity elsewhere in the genome, but only 23% have identical copies (99% identity. The three most common repetitive elements in the genome were identified and, when combined, represent less than 5% of the genome. Conclusions This study indicates that the majority of repeats in the P. taeda genome are 'novel' and will therefore require additional BAC or genomic sequencing for accurate characterization. The pine genome contains a very large number of diverged and probably defunct repetitive elements. This study also provides new evidence that sequencing a pine genome using a WGS approach is

  17. Diversity analysis in Cannabis sativa based on large-scale development of expressed sequence tag-derived simple sequence repeat markers.

    Science.gov (United States)

    Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

    2014-01-01

    Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis.

  18. Low diversity Cryptococcus neoformans variety grubii multilocus sequence types from Thailand are consistent with an ancestral African origin.

    Directory of Open Access Journals (Sweden)

    Sitali P Simwami

    2011-04-01

    Full Text Available The global burden of HIV-associated cryptococcal meningitis is estimated at nearly one million cases per year, causing up to a third of all AIDS-related deaths. Molecular epidemiology constitutes the main methodology for understanding the factors underpinning the emergence of this understudied, yet increasingly important, group of pathogenic fungi. Cryptococcus species are notable in the degree that virulence differs amongst lineages, and highly-virulent emerging lineages are changing patterns of human disease both temporally and spatially. Cryptococcus neoformans variety grubii (Cng, serotype A constitutes the most ubiquitous cause of cryptococcal meningitis worldwide, however patterns of molecular diversity are understudied across some regions experiencing significant burdens of disease. We compared 183 clinical and environmental isolates of Cng from one such region, Thailand, Southeast Asia, against a global MLST database of 77 Cng isolates. Population genetic analyses showed that Thailand isolates from 11 provinces were highly homogenous, consisting of the same genetic background (globally known as VNI and exhibiting only ten nearly identical sequence types (STs, with three (STs 44, 45 and 46 dominating our sample. This population contains significantly less diversity when compared against the global population of Cng, specifically Africa. Genetic diversity in Cng was significantly subdivided at the continental level with nearly half (47% of the global STs unique to a genetically diverse and recombining population in Botswana. These patterns of diversity, when combined with evidence from haplotypic networks and coalescent analyses of global populations, are highly suggestive of an expansion of the Cng VNI clade out of Africa, leading to a limited number of genotypes founding the Asian populations. Divergence time testing estimates the time to the most common ancestor between the African and Asian populations to be 6,920 years ago (95% HPD

  19. Sifting through genomes with iterative-sequence clustering produces a large, phylogenetically diverse protein-family resource.

    Science.gov (United States)

    Sharpton, Thomas J; Jospin, Guillaume; Wu, Dongying; Langille, Morgan G I; Pollard, Katherine S; Eisen, Jonathan A

    2012-10-13

    New computational resources are needed to manage the increasing volume of biological data from genome sequencing projects. One fundamental challenge is the ability to maintain a complete and current catalog of protein diversity. We developed a new approach for the identification of protein families that focuses on the rapid discovery of homologous protein sequences. We implemented fully automated and high-throughput procedures to de novo cluster proteins into families based upon global alignment similarity. Our approach employs an iterative clustering strategy in which homologs of known families are sifted out of the search for new families. The resulting reduction in computational complexity enables us to rapidly identify novel protein families found in new genomes and to perform efficient, automated updates that keep pace with genome sequencing. We refer to protein families identified through this approach as "Sifting Families," or SFams. Our analysis of ~10.5 million protein sequences from 2,928 genomes identified 436,360 SFams, many of which are not represented in other protein family databases. We validated the quality of SFam clustering through statistical as well as network topology-based analyses. We describe the rapid identification of SFams and demonstrate how they can be used to annotate genomes and metagenomes. The SFam database catalogs protein-family quality metrics, multiple sequence alignments, hidden Markov models, and phylogenetic trees. Our source code and database are publicly available and will be subject to frequent updates (http://edhar.genomecenter.ucdavis.edu/sifting_families/).

  20. Soil Parameters Drive the Structure, Diversity and Metabolic Potentials of the Bacterial Communities Across Temperate Beech Forest Soil Sequences.

    Science.gov (United States)

    Jeanbille, M; Buée, M; Bach, C; Cébron, A; Frey-Klett, P; Turpault, M P; Uroz, S

    2016-02-01

    Soil and climatic conditions as well as land cover and land management have been shown to strongly impact the structure and diversity of the soil bacterial communities. Here, we addressed under a same land cover the potential effect of the edaphic parameters on the soil bacterial communities, excluding potential confounding factors as climate. To do this, we characterized two natural soil sequences occurring in the Montiers experimental site. Spatially distant soil samples were collected below Fagus sylvatica tree stands to assess the effect of soil sequences on the edaphic parameters, as well as the structure and diversity of the bacterial communities. Soil analyses revealed that the two soil sequences were characterized by higher pH and calcium and magnesium contents in the lower plots. Metabolic assays based on Biolog Ecoplates highlighted higher intensity and richness in usable carbon substrates in the lower plots than in the middle and upper plots, although no significant differences occurred in the abundance of bacterial and fungal communities along the soil sequences as assessed using quantitative PCR. Pyrosequencing analysis of 16S ribosomal RNA (rRNA) gene amplicons revealed that Proteobacteria, Acidobacteria and Bacteroidetes were the most abundantly represented phyla. Acidobacteria, Proteobacteria and Chlamydiae were significantly enriched in the most acidic and nutrient-poor soils compared to the Bacteroidetes, which were significantly enriched in the soils presenting the higher pH and nutrient contents. Interestingly, aluminium, nitrogen, calcium, nutrient availability and pH appeared to be the best predictors of the bacterial community structures along the soil sequences.

  1. Diversity in non-repetitive human sequences not found in the reference genome.

    Science.gov (United States)

    Kehr, Birte; Helgadottir, Anna; Melsted, Pall; Jonsson, Hakon; Helgason, Hannes; Jonasdottir, Adalbjörg; Jonasdottir, Aslaug; Sigurdsson, Asgeir; Gylfason, Arnaldur; Halldorsson, Gisli H; Kristmundsdottir, Snaedis; Thorgeirsson, Gudmundur; Olafsson, Isleifur; Holm, Hilma; Thorsteinsdottir, Unnur; Sulem, Patrick; Helgason, Agnar; Gudbjartsson, Daniel F; Halldorsson, Bjarni V; Stefansson, Kari

    2017-04-01

    Genomes usually contain some non-repetitive sequences that are missing from the reference genome and occur only in a population subset. Such non-repetitive, non-reference (NRNR) sequences have remained largely unexplored in terms of their characterization and downstream analyses. Here we describe 3,791 breakpoint-resolved NRNR sequence variants called using PopIns from whole-genome sequence data of 15,219 Icelanders. We found that over 95% of the 244 NRNR sequences that are 200 bp or longer are present in chimpanzees, indicating that they are ancestral. Furthermore, 149 variant loci are in linkage disequilibrium (r 2 > 0.8) with a genome-wide association study (GWAS) catalog marker, suggesting disease relevance. Additionally, we report an association (P = 3.8 × 10 -8 , odds ratio (OR) = 0.92) with myocardial infarction (23,360 cases, 300,771 controls) for a 766-bp NRNR sequence variant. Our results underline the importance of including variation of all complexity levels when searching for variants that associate with disease.

  2. Integrated analysis of whole genome and transcriptome sequencing reveals diverse transcriptomic aberrations driven by somatic genomic changes in liver cancers.

    Directory of Open Access Journals (Sweden)

    Yuichi Shiraishi

    Full Text Available Recent studies applying high-throughput sequencing technologies have identified several recurrently mutated genes and pathways in multiple cancer genomes. However, transcriptional consequences from these genomic alterations in cancer genome remain unclear. In this study, we performed integrated and comparative analyses of whole genomes and transcriptomes of 22 hepatitis B virus (HBV-related hepatocellular carcinomas (HCCs and their matched controls. Comparison of whole genome sequence (WGS and RNA-Seq revealed much evidence that various types of genomic mutations triggered diverse transcriptional changes. Not only splice-site mutations, but also silent mutations in coding regions, deep intronic mutations and structural changes caused splicing aberrations. HBV integrations generated diverse patterns of virus-human fusion transcripts depending on affected gene, such as TERT, CDK15, FN1 and MLL4. Structural variations could drive over-expression of genes such as WNT ligands, with/without creating gene fusions. Furthermore, by taking account of genomic mutations causing transcriptional aberrations, we could improve the sensitivity of deleterious mutation detection in known cancer driver genes (TP53, AXIN1, ARID2, RPS6KA3, and identified recurrent disruptions in putative cancer driver genes such as HNF4A, CPS1, TSC1 and THRAP3 in HCCs. These findings indicate genomic alterations in cancer genome have diverse transcriptomic effects, and integrated analysis of WGS and RNA-Seq can facilitate the interpretation of a large number of genomic alterations detected in cancer genome.

  3. Diverse Array of New Viral Sequences Identified in Worldwide Populations of the Asian Citrus Psyllid (Diaphorina citri) Using Viral Metagenomics.

    Science.gov (United States)

    Nouri, Shahideh; Salem, Nidá; Nigg, Jared C; Falk, Bryce W

    2015-12-16

    The Asian citrus psyllid, Diaphorina citri, is the natural vector of the causal agent of Huanglongbing (HLB), or citrus greening disease. Together; HLB and D. citri represent a major threat to world citrus production. As there is no cure for HLB, insect vector management is considered one strategy to help control the disease, and D. citri viruses might be useful. In this study, we used a metagenomic approach to analyze viral sequences associated with the global population of D. citri. By sequencing small RNAs and the transcriptome coupled with bioinformatics analysis, we showed that the virus-like sequences of D. citri are diverse. We identified novel viral sequences belonging to the picornavirus superfamily, the Reoviridae, Parvoviridae, and Bunyaviridae families, and an unclassified positive-sense single-stranded RNA virus. Moreover, a Wolbachia prophage-related sequence was identified. This is the first comprehensive survey to assess the viral community from worldwide populations of an agricultural insect pest. Our results provide valuable information on new putative viruses, some of which may have the potential to be used as biocontrol agents. Insects have the most species of all animals, and are hosts to, and vectors of, a great variety of known and unknown viruses. Some of these most likely have the potential to be important fundamental and/or practical resources. In this study, we used high-throughput next-generation sequencing (NGS) technology and bioinformatics analysis to identify putative viruses associated with Diaphorina citri, the Asian citrus psyllid. D. citri is the vector of the bacterium causing Huanglongbing (HLB), currently the most serious threat to citrus worldwide. Here, we report several novel viral sequences associated with D. citri. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  4. High‑throughput sequencing analyses of oral microbial diversity in healthy people and patients with dental caries and periodontal disease.

    Science.gov (United States)

    Chen, Tingtao; Shi, Yan; Wang, Xiaolei; Wang, Xin; Meng, Fanjing; Yang, Shaoguo; Yang, Jian; Xin, Hongbo

    2017-07-01

    Recurrence of oral diseases caused by antibiotics has brought about an urgent requirement to explore the oral microbial diversity in the human oral cavity. In the present study, the high‑throughput sequencing method was adopted to compare the microbial diversity of healthy people and oral patients and sequence analysis was performed by UPARSE software package. The Venn results indicated that a mean of 315 operational taxonomic units (OTUs) was obtained, and 73, 64, 53, 19 and 18 common OTUs belonging to Firmicutes, Bacteroidetes, Proteobacteria, Actinobacteria and Fusobacteria, respectively, were identified in healthy people. Moreover, the reduction of Firmicutes and the increase of Proteobacteria in the children group, and the increase of Firmicutes and the reduction of Proteobacteria in the youth and adult groups, indicated that the age bracket and oral disease had largely influenced the tooth development and microbial development in the oral cavity. In addition, the traditional 'pathogenic bacteria' of Firmicutes, Proteobacteria and Bacteroidetes (accounted for >95% of the total sequencing number in each group) indicated that the 'harmful' bacteria may exert beneficial effects on oral health. Therefore, the data will provide certain clues for curing some oral diseases by the strategy of adjusting the disturbed microbial compositions in oral disease to healthy level.

  5. Genetic diversity of Taenia asiatica from Thailand and other geographical locations as revealed by cytochrome c oxidase subunit 1 sequences.

    Science.gov (United States)

    Anantaphruti, Malinee Thairungroj; Thaenkham, Urusa; Watthanakulpanich, Dorn; Phuphisut, Orawan; Maipanich, Wanna; Yoonuan, Tippayarat; Nuamtanong, Supaporn; Pubampen, Somjit; Sanguankiat, Surapol

    2013-02-01

    Twelve 924 bp cytochrome c oxidase subunit 1 (cox1) mitochondrial DNA sequences from Taenia asiatica isolates from Thailand were aligned and compared with multiple sequence isolates from Thailand and 6 other countries from the GenBank database. The genetic divergence of T. asiatica was also compared with Taenia saginata database sequences from 6 different countries in Asia, including Thailand, and 3 countries from other continents. The results showed that there were minor genetic variations within T. asiatica species, while high intraspecies variation was found in T. saginata. There were only 2 haplotypes and 1 polymorphic site found in T. asiatica, but 8 haplotypes and 9 polymorphic sites in T. saginata. Haplotype diversity was very low, 0.067, in T. asiatica and high, 0.700, in T. saginata. The very low genetic diversity suggested that T. asiatica may be at a risk due to the loss of potential adaptive alleles, resulting in reduced viability and decreased responses to environmental changes, which may endanger the species.

  6. Generic Amplicon Deep Sequencing to Determine Ilarvirus Species Diversity in Australian Prunus.

    Science.gov (United States)

    Kinoti, Wycliff M; Constable, Fiona E; Nancarrow, Narelle; Plummer, Kim M; Rodoni, Brendan

    2017-01-01

    The distribution of Ilarvirus species populations amongst 61 Australian Prunus trees was determined by next generation sequencing (NGS) of amplicons generated using a genus-based generic RT-PCR targeting a conserved region of the Ilarvirus RNA2 component that encodes the RNA dependent RNA polymerase (RdRp) gene. Presence of Ilarvirus sequences in each positive sample was further validated by Sanger sequencing of cloned amplicons of regions of each of RNA1, RNA2 and/or RNA3 that were generated by species specific PCRs and by metagenomic NGS. Prunus necrotic ringspot virus (PNRSV) was the most frequently detected Ilarvirus , occurring in 48 of the 61 Ilarvirus -positive trees and Prune dwarf virus (PDV) and Apple mosaic virus (ApMV) were detected in three trees and one tree, respectively. American plum line pattern virus (APLPV) was detected in three trees and represents the first report of APLPV detection in Australia. Two novel and distinct groups of Ilarvirus -like RNA2 amplicon sequences were also identified in several trees by the generic amplicon NGS approach. The high read depth from the amplicon NGS of the generic PCR products allowed the detection of distinct RNA2 RdRp sequence variant populations of PNRSV, PDV, ApMV, APLPV and the two novel Ilarvirus -like sequences. Mixed infections of ilarviruses were also detected in seven Prunus trees. Sanger sequencing of specific RNA1, RNA2, and/or RNA3 genome segments of each virus and total nucleic acid metagenomics NGS confirmed the presence of PNRSV, PDV, ApMV and APLPV detected by RNA2 generic amplicon NGS. However, the two novel groups of Ilarvirus -like RNA2 amplicon sequences detected by the generic amplicon NGS could not be associated to the presence of sequence from RNA1 or RNA3 genome segments or full Ilarvirus genomes, and their origin is unclear. This work highlights the sensitivity of genus-specific amplicon NGS in detection of virus sequences and their distinct populations in multiple samples, and the

  7. Generic Amplicon Deep Sequencing to Determine Ilarvirus Species Diversity in Australian Prunus

    Directory of Open Access Journals (Sweden)

    Wycliff M. Kinoti

    2017-06-01

    Full Text Available The distribution of Ilarvirus species populations amongst 61 Australian Prunus trees was determined by next generation sequencing (NGS of amplicons generated using a genus-based generic RT-PCR targeting a conserved region of the Ilarvirus RNA2 component that encodes the RNA dependent RNA polymerase (RdRp gene. Presence of Ilarvirus sequences in each positive sample was further validated by Sanger sequencing of cloned amplicons of regions of each of RNA1, RNA2 and/or RNA3 that were generated by species specific PCRs and by metagenomic NGS. Prunus necrotic ringspot virus (PNRSV was the most frequently detected Ilarvirus, occurring in 48 of the 61 Ilarvirus-positive trees and Prune dwarf virus (PDV and Apple mosaic virus (ApMV were detected in three trees and one tree, respectively. American plum line pattern virus (APLPV was detected in three trees and represents the first report of APLPV detection in Australia. Two novel and distinct groups of Ilarvirus-like RNA2 amplicon sequences were also identified in several trees by the generic amplicon NGS approach. The high read depth from the amplicon NGS of the generic PCR products allowed the detection of distinct RNA2 RdRp sequence variant populations of PNRSV, PDV, ApMV, APLPV and the two novel Ilarvirus-like sequences. Mixed infections of ilarviruses were also detected in seven Prunus trees. Sanger sequencing of specific RNA1, RNA2, and/or RNA3 genome segments of each virus and total nucleic acid metagenomics NGS confirmed the presence of PNRSV, PDV, ApMV and APLPV detected by RNA2 generic amplicon NGS. However, the two novel groups of Ilarvirus-like RNA2 amplicon sequences detected by the generic amplicon NGS could not be associated to the presence of sequence from RNA1 or RNA3 genome segments or full Ilarvirus genomes, and their origin is unclear. This work highlights the sensitivity of genus-specific amplicon NGS in detection of virus sequences and their distinct populations in multiple samples

  8. 3' terminal diversity of MRP RNA and other human noncoding RNAs revealed by deep sequencing.

    Science.gov (United States)

    Goldfarb, Katherine C; Cech, Thomas R

    2013-09-21

    Post-transcriptional 3' end processing is a key component of RNA regulation. The abundant and essential RNA subunit of RNase MRP has been proposed to function in three distinct cellular compartments and therefore may utilize this mode of regulation. Here we employ 3' RACE coupled with high-throughput sequencing to characterize the 3' terminal sequences of human MRP RNA and other noncoding RNAs that form RNP complexes. The 3' terminal sequence of MRP RNA from HEK293T cells has a distinctive distribution of genomically encoded termini (including an assortment of U residues) with a portion of these selectively tagged by oligo(A) tails. This profile contrasts with the relatively homogenous 3' terminus of an in vitro transcribed MRP RNA control and the differing 3' terminal profiles of U3 snoRNA, RNase P RNA, and telomerase RNA (hTR). 3' RACE coupled with deep sequencing provides a valuable framework for the functional characterization of 3' terminal sequences of noncoding RNAs.

  9. 3′ terminal diversity of MRP RNA and other human noncoding RNAs revealed by deep sequencing

    Science.gov (United States)

    2013-01-01

    Background Post-transcriptional 3′ end processing is a key component of RNA regulation. The abundant and essential RNA subunit of RNase MRP has been proposed to function in three distinct cellular compartments and therefore may utilize this mode of regulation. Here we employ 3′ RACE coupled with high-throughput sequencing to characterize the 3′ terminal sequences of human MRP RNA and other noncoding RNAs that form RNP complexes. Results The 3′ terminal sequence of MRP RNA from HEK293T cells has a distinctive distribution of genomically encoded termini (including an assortment of U residues) with a portion of these selectively tagged by oligo(A) tails. This profile contrasts with the relatively homogenous 3′ terminus of an in vitro transcribed MRP RNA control and the differing 3′ terminal profiles of U3 snoRNA, RNase P RNA, and telomerase RNA (hTR). Conclusions 3′ RACE coupled with deep sequencing provides a valuable framework for the functional characterization of 3′ terminal sequences of noncoding RNAs. PMID:24053768

  10. Deciphering the Diversities of Astroviruses and Noroviruses in Wastewater Treatment Plant Effluents by a High-Throughput Sequencing Method.

    Science.gov (United States)

    Prevost, B; Lucas, F S; Ambert-Balay, K; Pothier, P; Moulin, L; Wurtzer, S

    2015-10-01

    Although clinical epidemiology lists human enteric viruses to be among the primary causes of acute gastroenteritis in the human population, their circulation in the environment remains poorly investigated. These viruses are excreted by the human population into sewers and may be released into rivers through the effluents of wastewater treatment plants (WWTPs). In order to evaluate the viral diversity and loads in WWTP effluents of the Paris, France, urban area, which includes about 9 million inhabitants (approximately 15% of the French population), the seasonal occurrence of astroviruses and noroviruses in 100 WWTP effluent samples was investigated over 1 year. The coupling of these measurements with a high-throughput sequencing approach allowed the specific estimation of the diversity of human astroviruses (human astrovirus genotype 1 [HAstV-1], HAstV-2, HAstV-5, and HAstV-6), 7 genotypes of noroviruses (NoVs) of genogroup I (NoV GI.1 to NoV GI.6 and NoV GI.8), and 16 genotypes of NoVs of genogroup II (NoV GII.1 to NoV GII.7, NoV GII.9, NoV GII.12 to NoV GII.17, NoV GII.20, and NoV GII.21) in effluent samples. Comparison of the viral diversity in WWTP effluents to the viral diversity found by analysis of clinical data obtained throughout France underlined the consistency between the identified genotypes. However, some genotypes were locally present in effluents and were not found in the analysis of the clinical data. These findings could highlight an underestimation of the diversity of enteric viruses circulating in the human population. Consequently, analysis of WWTP effluents could allow the exploration of viral diversity not only in environmental waters but also in a human population linked to a sewerage network in order to better comprehend viral epidemiology and to forecast seasonal outbreaks. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  11. Percent Coverage

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The Percent Coverage is a spreadsheet that keeps track of and compares the number of vessels that have departed with and without observers to the numbers of vessels...

  12. Diversity and dynamics of dominant and rare bacterial taxa in replicate sequencing batch reactors operated under different solids retention time

    KAUST Repository

    Bagchi, Samik

    2014-10-19

    In this study, 16S rRNA gene pyrosequencing was applied in order to provide a better insight on the diversity and dynamics of total, dominant, and rare bacterial taxa in replicate lab-scale sequencing batch reactors (SBRs) operated at different solids retention time (SRT). Rank-abundance curves showed few dominant operational taxonomic units (OTUs) and a long tail of rare OTUs in all reactors. Results revealed that there was no detectable effect of SRT (2 vs. 10 days) on Shannon diversity index and OTU richness of both dominant and rare taxa. Nonmetric multidimensional scaling analysis showed that the total, dominant, and rare bacterial taxa were highly dynamic during the entire period of stable reactor performance. Also, the rare taxa were more dynamic than the dominant taxa despite expected low invasion rates because of the use of sterile synthetic media.

  13. Genetic Diversity of Arabica Coffee (Coffea arabica L. in Nicaragua as Estimated by Simple Sequence Repeat Markers

    Directory of Open Access Journals (Sweden)

    Mulatu Geleta

    2012-01-01

    Full Text Available Coffea arabica L. (arabica coffee, the only tetraploid species in the genus Coffea, represents the majority of the world’s coffee production and has a significant contribution to Nicaragua’s economy. The present paper was conducted to determine the genetic diversity of arabica coffee in Nicaragua for its conservation and breeding values. Twenty-six populations that represent eight varieties in Nicaragua were investigated using simple sequence repeat (SSR markers. A total of 24 alleles were obtained from the 12 loci investigated across 260 individual plants. The total Nei’s gene diversity (HT and the within-population gene diversity (HS were 0.35 and 0.29, respectively, which is comparable with that previously reported from other countries and regions. Among the varieties, the highest diversity was recorded in the variety Catimor. Analysis of variance (AMOVA revealed that about 87% of the total genetic variation was found within populations and the remaining 13% differentiate the populations (FST=0.13; P<0.001. The variation among the varieties was also significant. The genetic variation in Nicaraguan coffee is significant enough to be used in the breeding programs, and most of this variation can be conserved through ex situ conservation of a low number of populations from each variety.

  14. Evaluation of genetic diversity amongst Descurainia sophia L. genotypes by inter-simple sequence repeat (ISSR) marker.

    Science.gov (United States)

    Saki, Sahar; Bagheri, Hedayat; Deljou, Ali; Zeinalabedini, Mehrshad

    2016-01-01

    Descurainia sophia is a valuable medicinal plant in family of Brassicaceae. To determine the range of diversity amongst D. sophia in Iran, 32 naturally distributed plants belonging to six natural populations of the Iranian plateau were investigated by inter-simple sequence repeat (ISSR) markers. The average percentage of polymorphism produced by 12 ISSR primers was 86 %. The PIC values for primers ranged from 0.22 to 0.40 and Rp values ranged between 6.5 and 19.9. The relative genetic diversity of the populations was not high (Gst =0.32). However, the value of gene flow revealed by the ISSR marker was high (Nm = 1.03). UPGMA clustering method based on Jaccard similarity coefficient grouped the genotypes into two major clusters. Graph results from Neighbor-Net Network generated after a 1000 bootstrap test using Jaccard coefficient, and STRUCTURE analysis confirmed the UPGMA clustering. The first three PCAs represented 57.31 % of the total variation. The high levels of genetic diversity were observed within populations, which is useful in breeding and conservation programs. ISSR is found to be an eligible marker to study genetic diversity of D. sophia.

  15. On the use of high-throughput sequencing for the study of cyanobacterial diversity in Antarctic aquatic mats.

    Science.gov (United States)

    Pessi, Igor Stelmach; Maalouf, Pedro De Carvalho; Laughinghouse, Haywood Dail; Baurain, Denis; Wilmotte, Annick

    2016-06-01

    The study of Antarctic cyanobacterial diversity has been mostly limited to morphological identification and traditional molecular techniques. High-throughput sequencing (HTS) allows a much better understanding of microbial distribution in the environment, but its application is hampered by several methodological and analytical challenges. In this work, we explored the use of HTS as a tool for the study of cyanobacterial diversity in Antarctic aquatic mats. Our results highlight the importance of using artificial communities to validate the parameters of the bioinformatics procedure used to analyze natural communities, since pipeline-dependent biases had a strong effect on the observed community structures. Analysis of microbial mats from five Antarctic lakes and an aquatic biofilm from the Sub-Antarctic showed that HTS is a valuable tool for the assessment of cyanobacterial diversity. The majority of the operational taxonomic units retrieved were related to filamentous taxa such as Leptolyngbya and Phormidium, which are common genera in Antarctic lacustrine microbial mats. However, other phylotypes related to different taxa such as Geitlerinema, Pseudanabaena, Synechococcus, Chamaesiphon, Calothrix, and Coleodesmium were also found. Results revealed a much higher diversity than what had been reported using traditional methods and also highlighted remarkable differences between the cyanobacterial communities of the studied lakes. The aquatic biofilm from the Sub-Antarctic had a distinct cyanobacterial community from the Antarctic lakes, which in turn displayed a salinity-dependent community structure at the phylotype level. © 2016 Phycological Society of America.

  16. Unusually high genetic diversity in COI sequences of Chimarra obscura (Trichoptera: Philopotamidae)

    Science.gov (United States)

    Chimarra obscura (Walker 1852) is a philopotamid caddisfly found throughout much of North America. Using the COI DNA barcode locus, we have found unexpectedly high amounts of genetic diversity and distances within C. obscura. Of the approximately 150 specimens sampled, we have fo...

  17. Population diversity of Diaphorina citri (Hemiptera: Liviidae) in China based on whole mitochondrial genome sequences

    Science.gov (United States)

    BACKGROUND: Diaphorina citri (Asian citrus psyllid, ACP) transmits “Candidatus Liberibacter asiaticus”, an unculturable alpha-proteobacterium associated with citrus Huanglongbing (HLB). ACP has been reported in 11 provinces/regions in China, yet its population diversity remains unclear. In this stud...

  18. Abundance and genetic diversity of nifH gene sequences in anthropogenically affected Brazilian mangrove sediments

    NARCIS (Netherlands)

    Franco Dias, Armando Cavalcante; Pereira e Silva, Michele de Cassia; Cotta, Simone Raposo; Dini Andreote, Francisco; Soares, Fabio Lino; Salles, Joana Falcao; Azevedo, Joao Lucio; van Elsas, Jan Dirk; Andreote, Fernando Dini

    Although mangroves represent ecosystems of global importance, the genetic diversity and abundance of functional genes that are key to their functioning scarcely have been explored. Here, we present a survey based on the nifH gene across transects of sediments of two mangrove systems located along

  19. Phylogenetic diversity of insecticolous fusaria inferred from multilocus DNA sequence data and their molecular identification via FUSARIUM-ID and Fusarium MLST

    NARCIS (Netherlands)

    O'Donnell, K.; Humber, R.A.; Geiser, D.M.; Kang, S.; Robert, V.; Park, B.; Crous, P.W.; Johnston, P.; Aoki, T.; Rooney, A.P.; Rehner, S.A.

    2012-01-01

    We constructed several multilocus DNA sequence datasets to assess the phylogenetic diversity of insecticolous fusaria, especially focusing on those housed at the Agricultural Research Service Collection of Entomopathogenic Fungi (ARSEF), and to aid molecular identifications of unknowns via the

  20. Sequences of the joining region genes for immunoglobulin heavy chains and their role in generation of antibody diversity.

    OpenAIRE

    Gough, N M; Bernard, O

    1981-01-01

    To assess the contribution to immunoglobulin heavy chain diversity made by recombination between variable region (VH) genes and joining region (JH) genes, we have determined the sequence of about 2000 nucleotides spanning the rearranged JH gene cluster associated with the VH gene expressed in plasmacytoma HPC76. The active VH76 gene has recombined with the second germ-line JH gene. The region we have studied contains two other JH genes, designated JH3 and JH4. No other JH gene was found withi...

  1. Diversity analysis of Bemisia tabaci biotypes: RAPD, PCR-RFLP and sequencing of the ITS1 rDNA region

    OpenAIRE

    Rabello, Aline R.; Queiroz, Paulo R.; Simões, Kenya C.C.; Hiragi, Cássia O.; Lima, Luzia H.C.; Oliveira, Maria Regina V.; Mehta, Angela

    2008-01-01

    The Bemisia tabaci complex is formed by approximately 41 biotypes, two of which (B and BR) occur in Brazil. In this work we aimed at obtaining genetic markers to assess the genetic diversity of the different biotypes. In order to do that we analyzed Bemisia tabaci biotypes B, BR, Q and Cassava using molecular techniques including RAPD, PCR-RFLP and sequencing of the ITS1 rDNA region. The analyses revealed a high similarity between the individuals of the B and Q biotypes, which could be distin...

  2. Genome sequence and genetic diversity of the common carp, Cyprinus carpio.

    Science.gov (United States)

    Xu, Peng; Zhang, Xiaofeng; Wang, Xumin; Li, Jiongtang; Liu, Guiming; Kuang, Youyi; Xu, Jian; Zheng, Xianhu; Ren, Lufeng; Wang, Guoliang; Zhang, Yan; Huo, Linhe; Zhao, Zixia; Cao, Dingchen; Lu, Cuiyun; Li, Chao; Zhou, Yi; Liu, Zhanjiang; Fan, Zhonghua; Shan, Guangle; Li, Xingang; Wu, Shuangxiu; Song, Lipu; Hou, Guangyuan; Jiang, Yanliang; Jeney, Zsigmond; Yu, Dan; Wang, Li; Shao, Changjun; Song, Lai; Sun, Jing; Ji, Peifeng; Wang, Jian; Li, Qiang; Xu, Liming; Sun, Fanyue; Feng, Jianxin; Wang, Chenghui; Wang, Shaolin; Wang, Baosen; Li, Yan; Zhu, Yaping; Xue, Wei; Zhao, Lan; Wang, Jintu; Gu, Ying; Lv, Weihua; Wu, Kejing; Xiao, Jingfa; Wu, Jiayan; Zhang, Zhang; Yu, Jun; Sun, Xiaowen

    2014-11-01

    The common carp, Cyprinus carpio, is one of the most important cyprinid species and globally accounts for 10% of freshwater aquaculture production. Here we present a draft genome of domesticated C. carpio (strain Songpu), whose current assembly contains 52,610 protein-coding genes and approximately 92.3% coverage of its paleotetraploidized genome (2n = 100). The latest round of whole-genome duplication has been estimated to have occurred approximately 8.2 million years ago. Genome resequencing of 33 representative individuals from worldwide populations demonstrates a single origin for C. carpio in 2 subspecies (C. carpio Haematopterus and C. carpio carpio). Integrative genomic and transcriptomic analyses were used to identify loci potentially associated with traits including scaling patterns and skin color. In combination with the high-resolution genetic map, the draft genome paves the way for better molecular studies and improved genome-assisted breeding of C. carpio and other closely related species.

  3. Application of ion torrent sequencing to the assessment of the effect of alkali ballast water treatment on microbial community diversity.

    Directory of Open Access Journals (Sweden)

    Masanori Fujimoto

    Full Text Available The impact of NaOH as a ballast water treatment (BWT on microbial community diversity was assessed using the 16S rRNA gene based Ion Torrent sequencing with its new 400 base chemistry. Ballast water samples from a Great Lakes ship were collected from the intake and discharge of both control and NaOH (pH 12 treated tanks and were analyzed in duplicates. One set of duplicates was treated with the membrane-impermeable DNA cross-linking reagent propidium mono-azide (PMA prior to PCR amplification to differentiate between live and dead microorganisms. Ion Torrent sequencing generated nearly 580,000 reads for 31 bar-coded samples and revealed alterations of the microbial community structure in ballast water that had been treated with NaOH. Rarefaction analysis of the Ion Torrent sequencing data showed that BWT using NaOH significantly decreased microbial community diversity relative to control discharge (p<0.001. UniFrac distance based principal coordinate analysis (PCoA plots and UPGMA tree analysis revealed that NaOH-treated ballast water microbial communities differed from both intake communities and control discharge communities. After NaOH treatment, bacteria from the genus Alishewanella became dominant in the NaOH-treated samples, accounting for <0.5% of the total reads in intake samples but more than 50% of the reads in the treated discharge samples. The only apparent difference in microbial community structure between PMA-processed and non-PMA samples occurred in intake water samples, which exhibited a significantly higher amount of PMA-sensitive cyanobacteria/chloroplast 16S rRNA than their corresponding non-PMA total DNA samples. The community assembly obtained using Ion Torrent sequencing was comparable to that obtained from a subset of samples that were also subjected to 454 pyrosequencing. This study showed the efficacy of alkali ballast water treatment in reducing ballast water microbial diversity and demonstrated the application of new

  4. Molecular sequence data of hepatitis B virus and genetic diversity after vaccination

    NARCIS (Netherlands)

    van Ballegooijen, W. Marijn; van Houdt, Robin; Bruisten, Sylvia M.; Boot, Hein J.; Coutinho, Roel A.; Wallinga, Jacco

    2009-01-01

    The effect of vaccination programs on transmission of infectious disease is usually assessed by monitoring programs that rely on notifications of symptomatic illness. For monitoring of infectious diseases with a high proportion of asymptomatic cases or a low reporting rate, molecular sequence data

  5. Determination of Elizabethkingia Diversity by MALDI-TOF Mass Spectrometry and Whole-Genome Sequencing

    DEFF Research Database (Denmark)

    Eriksen, Helle Brander; Gumpert, Heidi; Faurholt, Cecilie Haase

    2017-01-01

    In a hospital-acquired infection with multidrug-resistant Elizabethkingia, matrix-assisted laser desorption/ionization time-of-flight mass spectrometry and 16S rRNA gene analysis identified the pathogen as Elizabethkingia miricola. Whole-genome sequencing, genus-level core genome analysis, and in...

  6. High diversity of picornaviruses in rats from different continents revealed by deep sequencing

    DEFF Research Database (Denmark)

    Arn Hansen, Thomas; Mollerup, Sarah; Nguyen, Nam-Phuong

    2016-01-01

    Outbreaks of zoonotic diseases in humans and livestock are not uncommon, and an important component in containment of such emerging viral diseases is rapid and reliable diagnostics. Such methods are often PCR-based and hence require the availability of sequence data from the pathogen. Rattus norv...

  7. High diversity of picornaviruses in rats from different continents revealed by deep sequencing

    DEFF Research Database (Denmark)

    Arn Hansen, Thomas; Mollerup, Sarah; Nguyen, Nam-Phuong

    2016-01-01

    Outbreaks of zoonotic diseases in humans and livestock are not uncommon, and an important component in containment of such emerging viral diseases is rapid and reliable diagnostics. Such methods are often PCR-based and hence require the availability of sequence data from the pathogen. Rattus...

  8. Increasing genomic diversity and evidence of constrained lifestyle evolution due to insertion sequences in Aeromonas salmonicida.

    Science.gov (United States)

    Vincent, Antony T; Trudel, Mélanie V; Freschi, Luca; Nagar, Vandan; Gagné-Thivierge, Cynthia; Levesque, Roger C; Charette, Steve J

    2016-01-12

    Aeromonads make up a group of Gram-negative bacteria that includes human and fish pathogens. The Aeromonas salmonicida species has the peculiarity of including five known subspecies. However, few studies of the genomes of A. salmonicida subspecies have been reported to date. We sequenced the genomes of additional A. salmonicida isolates, including three from India, using next-generation sequencing in order to gain a better understanding of the genomic and phylogenetic links between A. salmonicida subspecies. Their relative phylogenetic positions were confirmed by a core genome phylogeny based on 1645 gene sequences. The Indian isolates, which formed a sub-group together with A. salmonicida subsp. pectinolytica, were able to grow at either at 18 °C and 37 °C, unlike the A. salmonicida psychrophilic isolates that did not grow at 37 °C. Amino acid frequencies, GC content, tRNA composition, loss and gain of genes during evolution, pseudogenes as well as genes under positive selection and the mobilome were studied to explain this intraspecies dichotomy. Insertion sequences appeared to be an important driving force that locked the psychrophilic strains into their particular lifestyle in order to conserve their genomic integrity. This observation, based on comparative genomics, is in agreement with previous results showing that insertion sequence mobility induced by heat in A. salmonicida subspecies causes genomic plasticity, resulting in a deleterious effect on the virulence of the bacterium. We provide a proof-of-concept that selfish DNAs play a major role in the evolution of bacterial species by modeling genomes.

  9. Annotation and sequence diversity of transposable elements in common bean (Phaseolus vulgaris

    Directory of Open Access Journals (Sweden)

    Scott eJackson

    2014-07-01

    Full Text Available Common bean (Phaseolus vulgaris is an important legume crop grown and consumed worldwide. With the availability of the common bean genome sequence, the next challenge is to annotate the genome and characterize functional DNA elements. Transposable elements (TEs are the most abundant component of plant genomes and can dramatically affect genome evolution and genetic variation. Thus, it is pivotal to identify TEs in the common bean genome. In this study, we performed a genome-wide transposon annotation in common bean using a combination of homology and sequence structure-based methods. We developed a 2.12-Mb transposon database which includes 791 representative transposon sequences and is available upon request or from www.phytozome.org. Of note, nearly all transposons in the database are previously unrecognized TEs. More than 5,000 transposon-related expressed sequence tags (ESTs were detected which indicates that some transposons may be transcriptionally active. Two Ty1-copia retrotransposon families were found to encode the envelope-like protein which has rarely been identified in plant genomes. Also, we identified an extra open reading frame (ORF termed ORF2 from 15 Ty3-gypsy families that was located between the ORF encoding the retrotransposase and the 3’LTR. The ORF2 was in opposite transcriptional orientation to retrotransposase. Sequence homology searches and phylogenetic analysis suggested that the ORF2 may have an ancient origin, but its function is not clear. This transposon data provides a useful resource for understanding the genome organization and evolution and may be used to identify active TEs for developing transposon-tagging system in common bean and other related genomes.

  10. Genetic Diversity of Selected Mangifera Species Revealed by Inter Simple Sequence Repeats Markers

    OpenAIRE

    Ariffin, Zulhairil; Md Sah, Muhammad Shafie; Idris, Salma; Hashim, Nuradni

    2015-01-01

    ISSR markers were employed to reveal genetic diversity and genetic relatedness among 28 Mangifera accessions collected from Yan (Kedah), Bukit Gantang (Perak), Sibuti (Sarawak), and Papar (Sabah). A total of 198 markers were generated using nine anchored primers and one nonanchored primer. Genetic variation among the 28 accessions of Mangifera species including wild relatives, landraces, and clonal varieties is high, with an average degree of polymorphism of 98% and mean Shannon index, H0=7.5...

  11. Phylogeography and population diversity of Simulium hirtipupa Lutz (Diptera: Simuliidae based on mitochondrial COI sequences.

    Directory of Open Access Journals (Sweden)

    Vanderly Andrade-Souza

    Full Text Available High morphological homogeneity and cryptic speciation may cause the diversity within Simuliidae to be underestimated. Recent molecular studies on population genetics and phylogeography have contributed to reveal which factors influenced the diversity within this group. This study aimed at examining the genetic diversity of Simulium hirtipupa Lutz, 1910 in populations from the biomes Caatinga, Cerrado, and Atlantic Forest. In this study, we carried out phylogeographic and population genetic analyses using a fragment of the mitochondrial gene COI. The 19 populations studied were clustered into seven groups, most of which are associated with geography indicating certain genetic structure. The northern region of the state of Minas Gerais is most likely the center of origin of this species. The average intergroup genetic distance was 3.7%, indicating the presence of cryptic species. The species tree as well as the haplotype network recovered all groups forming two major groups: the first comprises groups Gr-Bahia (in which the São Francisco river has not acted as geographical barrier, Gr-Pernambuco, and Gr-Mato Grosso do Sul. The second included groups comprising populations of the states of Goiás, Tocantins, Minas Gerais, Bahia, São Paulo, and Espírito Santo. The mismatch distribution for groups was consistent with the model of demographic expansion, except for the Gr-Central-East_1 group. The diversification in this group occurred about 1.19 Mya during the Pleistocene, influenced by paleoclimatic oscillations during the Quaternary glacial cycles.

  12. Phylogenetic and ecological analyses of soil and sporocarp DNA sequences reveal high diversity and strong habitat partitioning in the boreal ectomycorrhizal genus Russula (Russulales; Basidiomycota)

    Science.gov (United States)

    József Geml; Gary A. Laursen; Ian C. Herriott; Jack M. McFarland; Michael G. Booth; Niall Lennon; H. Chad Nusbaum; D. Lee Taylor

    2010-01-01

    Although critical for the functioning of ecosystems, fungi are poorly known in high-latitude regions. Here, we provide the first genetic diversity assessment of one of the most diverse and abundant ectomycorrhizal genera in Alaska: Russula. We analyzed internal transcribed spacer rDNA sequences from sporocarps and soil samples using phylogenetic...

  13. Estimating variation within the genes and inferring the phylogeny of 186 sequenced diverse Escherichia coli genomes

    DEFF Research Database (Denmark)

    Kaas, Rolf Sommer; Rundsten, Carsten Friis; Ussery, David

    2012-01-01

    Background Escherichia coli exists in commensal and pathogenic forms. By measuring the variation of individual genes across more than a hundred sequenced genomes, gene variation can be studied in detail, including the number of mutations found for any given gene. This knowledge will be useful...... for creating better phylogenies, for determination of molecular clocks and for improved typing techniques. Results We find 3,051 gene clusters/families present in at least 95% of the genomes and 1,702 gene clusters present in 100% of the genomes. The former 'soft core' of about 3,000 gene families is perhaps...... more biologically relevant, especially considering that many of these genome sequences are draft quality. The E. coli pan-genome for this set of isolates contains 16,373 gene clusters. A core-gene tree, based on alignment and a pan-genome tree based on gene presence/absence, maps the relatedness...

  14. Analysis of Plasmodium falciparum diversity in natural infections by deep sequencing

    OpenAIRE

    Manske, Magnus; Miotto, Olivo; Campino, Susana; Auburn, Sarah; Almagro-Garcia, Jacob; Maslen, Gareth; O?Brien, Jack; Djimde, Abdoulaye; Doumbo, Ogobara; Zongo, Issaka; Ouedraogo, Jean-Bosco; Michon, Pascal; Mueller, Ivo; Siba, Peter; Nzila, Alexis

    2012-01-01

    : Malaria elimination strategies require surveillance of the parasite population for genetic changes that demand a public health response, such as new forms of drug resistance. Here we describe methods for the large-scale analysis of genetic variation in Plasmodium falciparum by deep sequencing of parasite DNA obtained from the blood of patients with malaria, either directly or after short-term culture. Analysis of 86,158 exonic single nucleotide polymorphisms that passed genotyping quality c...

  15. Sequence diversity in haloalkane dehalogenases, as revealed by PCR using family-specific primers

    Czech Academy of Sciences Publication Activity Database

    Kotík, Michael; Faměrová, Veronika

    2012-01-01

    Roč. 88, č. 2 (2012), s. 212-217 ISSN 0167-7012 R&D Projects: GA ČR GAP504/10/0137; GA ČR GAP207/10/0135 Institutional research plan: CEZ:AV0Z50200510 Keywords : Dehalogenation * Consensus sequence * Degenerate PCR primer Subject RIV: EE - Microbiology, Virology Impact factor: 2.161, year: 2012

  16. Sequence-Dependent Self-Assembly and Structural Diversity of Islet Amyloid Polypeptide-Derived β-Sheet Fibrils

    International Nuclear Information System (INIS)

    Wang, Shih-Ting; Lin, Yiyang; Spencer, Ryan K.; Thomas, Michael R.; Nguyen, Andy I.

    2017-01-01

    Determining the structural origins of amyloid fibrillation is essential for understanding both the pathology of amyloidosis and the rational design of inhibitors to prevent or reverse amyloid formation. In this work, the decisive roles of peptide structures on amyloid self-assembly and morphological diversity were investigated by the design of eight amyloidogenic peptides derived from islet amyloid polypeptide. Among the segments, two distinct morphologies were highlighted in the form of twisted and planar (untwisted) ribbons with varied diameters, thicknesses, and lengths. In particular, transformation of amyloid fibrils from twisted ribbons into untwisted structures was triggered by substitution of the C-terminal serine with threonine, where the side chain methyl group was responsible for the distinct morphological change. This effect was confirmed following serine substitution with alanine and valine and was ascribed to the restriction of intersheet torsional strain through the increased hydrophobic interactions and hydrogen bonding. We also studied the variation of fibril morphology (i.e., association and helicity) and peptide aggregation propensity by increasing the hydrophobicity of the peptide side group, capping the N-terminus, and extending sequence length. Lastly, we anticipate that our insights into sequence-dependent fibrillation and morphological diversity will shed light on the structural interpretation of amyloidogenesis and development of structure-specific imaging agents and aggregation inhibitors.

  17. Next-Generation Sequencing Assessment of Eukaryotic Diversity in Oil Sands Tailings Ponds Sediments and Surface Water.

    Science.gov (United States)

    Aguilar, Maria; Richardson, Elisabeth; Tan, BoonFei; Walker, Giselle; Dunfield, Peter F; Bass, David; Nesbø, Camilla; Foght, Julia; Dacks, Joel B

    2016-11-01

    Tailings ponds in the Athabasca oil sands (Canada) contain fluid wastes, generated by the extraction of bitumen from oil sands ores. Although the autochthonous prokaryotic communities have been relatively well characterized, almost nothing is known about microbial eukaryotes living in the anoxic soft sediments of tailings ponds or in the thin oxic layer of water that covers them. We carried out the first next-generation sequencing study of microbial eukaryotic diversity in oil sands tailings ponds. In metagenomes prepared from tailings sediment and surface water, we detected very low numbers of sequences encoding eukaryotic small subunit ribosomal RNA representing seven major taxonomic groups of protists. We also produced and analysed three amplicon-based 18S rRNA libraries prepared from sediment samples. These revealed a more diverse set of taxa, 169 different OTUs encompassing up to eleven higher order groups of eukaryotes, according to detailed classification using homology searching and phylogenetic methods. The 10 most abundant OTUs accounted for > 90% of the total of reads, vs. large numbers of rare OTUs (< 1% abundance). Despite the anoxic and hydrocarbon-enriched nature of the environment, the tailings ponds harbour complex communities of microbial eukaryotes indicating that these organisms should be taken into account when studying the microbiology of the oil sands. © 2016 The Author(s) Journal of Eukaryotic Microbiology © 2016 International Society of Protistologists.

  18. Sequence capture by hybridization to explore modern and ancient genomic diversity in model and nonmodel organisms.

    Science.gov (United States)

    Gasc, Cyrielle; Peyretaillade, Eric; Peyret, Pierre

    2016-06-02

    The recent expansion of next-generation sequencing has significantly improved biological research. Nevertheless, deep exploration of genomes or metagenomic samples remains difficult because of the sequencing depth and the associated costs required. Therefore, different partitioning strategies have been developed to sequence informative subsets of studied genomes. Among these strategies, hybridization capture has proven to be an innovative and efficient tool for targeting and enriching specific biomarkers in complex DNA mixtures. It has been successfully applied in numerous areas of biology, such as exome resequencing for the identification of mutations underlying Mendelian or complex diseases and cancers, and its usefulness has been demonstrated in the agronomic field through the linking of genetic variants to agricultural phenotypic traits of interest. Moreover, hybridization capture has provided access to underexplored, but relevant fractions of genomes through its ability to enrich defined targets and their flanking regions. Finally, on the basis of restricted genomic information, this method has also allowed the expansion of knowledge of nonreference species and ancient genomes and provided a better understanding of metagenomic samples. In this review, we present the major advances and discoveries permitted by hybridization capture and highlight the potency of this approach in all areas of biology. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  19. Metabolic diversity and ecological niches of Achromatium populations revealed with single-cell genomic sequencing

    Directory of Open Access Journals (Sweden)

    Muammar eMansor

    2015-08-01

    Full Text Available Large, sulfur-cycling, calcite-precipitating bacteria in the genus Achromatium represent a significant proportion of bacterial communities near sediment-water interfaces throughout the world. Our understanding of their potentially crucial roles in calcium, carbon, sulfur, nitrogen, and iron cycling is limited because they have not been cultured or sequenced using environmental genomics approaches to date. We utilized single-cell genomic sequencing to obtain one incomplete and two nearly complete draft genomes for Achromatium collected at Warm Mineral Springs, FL. Based on 16S rRNA gene sequences, the three cells represent distinct and relatively distant Achromatium populations (91-92% identity. The draft genomes encode key genes involved in sulfur and hydrogen oxidation; oxygen, nitrogen and polysulfide respiration; carbon and nitrogen fixation; organic carbon assimilation and storage; chemotaxis; twitching motility; antibiotic resistance; and membrane transport. Known genes for iron and manganese energy metabolism were not detected. The presence of pyrophosphatase and vacuolar (V-type ATPases, which are generally rare in bacterial genomes, suggests a role for these enzymes in calcium transport, proton pumping, and/or energy generation in the membranes of calcite-containing inclusions.

  20. Multilocus sequence analysis of nectar pseudomonads reveals high genetic diversity and contrasting recombination patterns.

    Science.gov (United States)

    Alvarez-Pérez, Sergio; de Vega, Clara; Herrera, Carlos M

    2013-01-01

    The genetic and evolutionary relationships among floral nectar-dwelling Pseudomonas 'sensu stricto' isolates associated to South African and Mediterranean plants were investigated by multilocus sequence analysis (MLSA) of four core housekeeping genes (rrs, gyrB, rpoB and rpoD). A total of 35 different sequence types were found for the 38 nectar bacterial isolates characterised. Phylogenetic analyses resulted in the identification of three main clades [nectar groups (NGs) 1, 2 and 3] of nectar pseudomonads, which were closely related to five intrageneric groups: Pseudomonas oryzihabitans (NG 1); P. fluorescens, P. lutea and P. syringae (NG 2); and P. rhizosphaerae (NG 3). Linkage disequilibrium analysis pointed to a mostly clonal population structure, even when the analysis was restricted to isolates from the same floristic region or belonging to the same NG. Nevertheless, signatures of recombination were observed for NG 3, which exclusively included isolates retrieved from the floral nectar of insect-pollinated Mediterranean plants. In contrast, the other two NGs comprised both South African and Mediterranean isolates. Analyses relating diversification to floristic region and pollinator type revealed that there has been more unique evolution of the nectar pseudomonads within the Mediterranean region than would be expected by chance. This is the first work analysing the sequence of multiple loci to reveal geno- and ecotypes of nectar bacteria.

  1. Multilocus Sequence Analysis of Nectar Pseudomonads Reveals High Genetic Diversity and Contrasting Recombination Patterns

    Science.gov (United States)

    Álvarez-Pérez, Sergio; de Vega, Clara; Herrera, Carlos M.

    2013-01-01

    The genetic and evolutionary relationships among floral nectar-dwelling Pseudomonas ‘sensu stricto’ isolates associated to South African and Mediterranean plants were investigated by multilocus sequence analysis (MLSA) of four core housekeeping genes (rrs, gyrB, rpoB and rpoD). A total of 35 different sequence types were found for the 38 nectar bacterial isolates characterised. Phylogenetic analyses resulted in the identification of three main clades [nectar groups (NGs) 1, 2 and 3] of nectar pseudomonads, which were closely related to five intrageneric groups: Pseudomonas oryzihabitans (NG 1); P. fluorescens, P. lutea and P. syringae (NG 2); and P. rhizosphaerae (NG 3). Linkage disequilibrium analysis pointed to a mostly clonal population structure, even when the analysis was restricted to isolates from the same floristic region or belonging to the same NG. Nevertheless, signatures of recombination were observed for NG 3, which exclusively included isolates retrieved from the floral nectar of insect-pollinated Mediterranean plants. In contrast, the other two NGs comprised both South African and Mediterranean isolates. Analyses relating diversification to floristic region and pollinator type revealed that there has been more unique evolution of the nectar pseudomonads within the Mediterranean region than would be expected by chance. This is the first work analysing the sequence of multiple loci to reveal geno- and ecotypes of nectar bacteria. PMID:24116076

  2. High Diversity of Myocyanophage in Various Aquatic Environments Revealed by High-Throughput Sequencing of Major Capsid Protein Gene With a New Set of Primers

    Directory of Open Access Journals (Sweden)

    Weiguo Hou

    2018-05-01

    Full Text Available Myocyanophages, a group of viruses infecting cyanobacteria, are abundant and play important roles in elemental cycling. Here we investigated the particle-associated viral communities retained on 0.2 μm filters and in sediment samples (representing ancient cyanophage communities from four ocean and three lake locations, using high-throughput sequencing and a newly designed primer pair targeting a gene fragment (∼145-bp in length encoding the cyanophage gp23 major capsid protein (MCP. Diverse viral communities were detected in all samples. The fragments of 142-, 145-, and 148-bp in length were most abundant in the amplicons, and most sequences (>92% belonged to cyanophages. Additionally, different sequencing depths resulted in different diversity estimates of the viral community. Operational taxonomic units obtained from deep sequencing of the MCP gene covered the majority of those obtained from shallow sequencing, suggesting that deep sequencing exhibited a more complete picture of cyanophage community than shallow sequencing. Our results also revealed a wide geographic distribution of marine myocyanophages, i.e., higher dissimilarities of the myocyanophage communities corresponded with the larger distances between the sampling sites. Collectively, this study suggests that the newly designed primer pair can be effectively used to study the community and diversity of myocyanophage from different environments, and the high-throughput sequencing represents a good method to understand viral diversity.

  3. Sifting through genomes with iterative-sequence clustering produces a large, phylogenetically diverse protein-family resource

    Directory of Open Access Journals (Sweden)

    Sharpton Thomas J

    2012-10-01

    Full Text Available Abstract Background New computational resources are needed to manage the increasing volume of biological data from genome sequencing projects. One fundamental challenge is the ability to maintain a complete and current catalog of protein diversity. We developed a new approach for the identification of protein families that focuses on the rapid discovery of homologous protein sequences. Results We implemented fully automated and high-throughput procedures to de novo cluster proteins into families based upon global alignment similarity. Our approach employs an iterative clustering strategy in which homologs of known families are sifted out of the search for new families. The resulting reduction in computational complexity enables us to rapidly identify novel protein families found in new genomes and to perform efficient, automated updates that keep pace with genome sequencing. We refer to protein families identified through this approach as “Sifting Families,” or SFams. Our analysis of ~10.5 million protein sequences from 2,928 genomes identified 436,360 SFams, many of which are not represented in other protein family databases. We validated the quality of SFam clustering through statistical as well as network topology–based analyses. Conclusions We describe the rapid identification of SFams and demonstrate how they can be used to annotate genomes and metagenomes. The SFam database catalogs protein-family quality metrics, multiple sequence alignments, hidden Markov models, and phylogenetic trees. Our source code and database are publicly available and will be subject to frequent updates (http://edhar.genomecenter.ucdavis.edu/sifting_families/.

  4. Transcriptome Sequencing of Diverse Peanut (Arachis Wild Species and the Cultivated Species Reveals a Wealth of Untapped Genetic Variability

    Directory of Open Access Journals (Sweden)

    Ratan Chopra

    2016-12-01

    Full Text Available To test the hypothesis that the cultivated peanut species possesses almost no molecular variability, we sequenced a diverse panel of 22 Arachis accessions representing Arachis hypogaea botanical classes, A-, B-, and K- genome diploids, a synthetic amphidiploid, and a tetraploid wild species. RNASeq was performed on pools of three tissues, and de novo assembly was performed. Realignment of individual accession reads to transcripts of the cultivar OLin identified 306,820 biallelic SNPs. Among 10 naturally occurring tetraploid accessions, 40,382 unique homozygous SNPs were identified in 14,719 contigs. In eight diploid accessions, 291,115 unique SNPs were identified in 26,320 contigs. The average SNP rate among the 10 cultivated tetraploids was 0.5, and among eight diploids was 9.2 per 1000 bp. Diversity analysis indicated grouping of diploids according to genome classification, and cultivated tetraploids by subspecies. Cluster analysis of variants indicated that sequences of B genome species were the most similar to the tetraploids, and the next closest diploid accession belonged to the A genome species. A subset of 66 SNPs selected from the dataset was validated; of 782 SNP calls, 636 (81.32% were confirmed using an allele-specific discrimination assay. We conclude that substantial genetic variability exists among wild species. Additionally, significant but lesser variability at the molecular level occurs among accessions of the cultivated species. This survey is the first to report significant SNP level diversity among transcripts, and may explain some of the phenotypic differences observed in germplasm surveys. Understanding SNP variants in the Arachis accessions will benefit in developing markers for selection.

  5. Genetic diversity in two Japanese flounder populations from China seas inferred using microsatellite markers and COI sequences

    Science.gov (United States)

    Xu, Dongdong; Li, Sanlei; Lou, Bao; Zhang, Yurong; Zhan, Wei; Shi, Huilai

    2012-07-01

    Japanese flounder is one of the most important commercial species in China; however, information on the genetic background of natural populations in China seas is scarce. The lack of genetic data has hampered fishery management and aquaculture development programs for this species. In the present study, we have analyzed the genetic diversity in natural populations of Japanese flounder sampled from the Yellow Sea (Qingdao population, QD) and East China Sea (Zhoushan population, ZS) using 10 polymorphic microsatellite loci and cytochrome c oxidase subunit I (COI) sequencing data. A total of 68 different alleles were observed over 10 microsatellite loci. The total number of alleles per locus ranged from 2 to 9, and the number of genotypes per locus ranged from 3 to 45. The observed heterozygosity and expected heterozygosity in QD were 0.733 and 0.779, respectively, and in ZS the heterozygosity values were 0.708 and 0.783, respectively. Significant departures from Hardy-Weinberg equilibrium were observed in 7 of the 10 microsatellite loci in each of the two populations. The COI sequencing analysis revealed 25 polymorphic sites and 15 haplotypes in the two populations. The haplotype diversity and nucleotide diversity in the QD population were 0.746±0.072 8 and 0.003 34±0.001 03 respectively, and in ZS population the genetic diversity values were 0.712±0.047 0 and 0.003 18±0.000 49, respectively. The microsatellite data ( F st =0.048 7, P <0.001) and mitochondrial DNA data ( F st =0.128, P <0.001) both revealed significant genetic differentiation between the two populations. The information on the genetic variation and differentiation in Japanese flounder obtained in this study could be used to set up suitable guidelines for the management and conservation of this species, as well as for managing artificial selection programs. In future studies, more geographically diverse stocks should be used to obtain a deeper understanding of the population structure of Japanese

  6. A not-so-big crisis: re-reading Silurian conodont diversity in a sequence-stratigraphic framework

    Science.gov (United States)

    Jarochowska, Emilia; Munnecke, Axel

    2016-04-01

    Conodonts are extensively used in Ordovician through Triassic biostratigraphy and fossil-based geochemistry. However, their distribution in rock successions is commonly taken at face value, without taking into account their diverse and poorly understood ecology. Multielement taxonomy, ontogenetic and environmental variability, difficulties in extraction, and relative rarity all contribute to the general lack of quantitative studies on conodont stratigraphic distribution and temporal turnover. With respect to Silurian conodonts, the concept of recurrent conodont extinction events - the so called Ireviken, Mulde and Lau events - has become a standard in the stratigraphic literature. The concept has been proposed based on qualitative observations of local extirpations of open-marine pelagic or nekto-benthic taxa and temporary dominance of shallow-water species in the Silurian succession of the Swedish island of Gotland. These changes coincided with positive carbon isotope excursions, abrupt facies shifts, "blooms" of benthic fauna, and changes in reef communities, which have all been combined into a general view of Silurian bio-geochemical events. This view posits a deterministic, reproducible pattern in Silurian conodont diversity, attributed to recurrent ecological or geochemical conditions. The growing body of sequence-stratigraphic interpretations across these events in Gotland and other sections worldwide indicate that in all cases the Silurian "events" are associated with rapid global regressions. This suggests that faunal changes such as the dominance of shallow-water, low-diversity conodont fauna and the increase of benthic invertebrate diversity and abundance represent predictable consequences of the variation in the completeness of the rock record and preservation potential of different environments. Our studies in Poland and Ukraine indicate that the magnitude of change in the taxonomic composition of conodont assemblages across the middle Silurian global

  7. Analysis of genetic diversity of Tunisian pistachio (Pistacia vera L.) using sequence-related amplified polymorphism (SRAP) markers.

    Science.gov (United States)

    Guenni, K; Aouadi, M; Chatti, K; Salhi-Hannachi, A

    2016-10-17

    Sequence-related amplified polymorphism (SRAP) markers preferentially amplify open reading frames and were used to study the genetic diversity of Tunisian pistachio. In the present study, 43 Pistacia vera accessions were screened using seven SRAP primer pairs. A total of 78 markers was revealed (95.12%) with an average polymorphic information content of 0.850. The results suggest that there is strong genetic differentiation, which characterizes the local resources (G ST = 0.307). High gene flow (N m = 1.127) among groups was explained by the exchange of plant material among regions. Analysis of molecular variance revealed significant differences within groups and showed that 73.88% of the total genetic diversity occurred within groups, whereas the remaining 26.12% occurred among groups. Bayesian clustering and principal component analysis identified three pools, El Guettar, Pollenizers, and the rest of the pistachios belonging to the Gabès, Kasserine, and Sfax localities. Bayesian analysis revealed that El Guettar and male genotypes were assigned with more than 80% probability. The BayeScan method proposed that locus 59 (F13-R9) could be used in the development of sex-linked SCAR markers from SRAP since it is a commonly detected locus in comparisons involving the Pollenizers group. This is the first application of SRAP markers for the assessment of genetic diversity in Tunisian germplasm of P. vera. Such information will be useful to define conservation strategies and improvement programs for this species.

  8. The pig gut microbial diversity: Understanding the pig gut microbial ecology through the next generation high throughput sequencing.

    Science.gov (United States)

    Kim, Hyeun Bum; Isaacson, Richard E

    2015-06-12

    The importance of the gut microbiota of animals is widely acknowledged because of its pivotal roles in the health and well being of animals. The genetic diversity of the gut microbiota contributes to the overall development and metabolic needs of the animal, and provides the host with many beneficial functions including production of volatile fatty acids, re-cycling of bile salts, production of vitamin K, cellulose digestion, and development of immune system. Thus the intestinal microbiota of animals has been the subject of study for many decades. Although most of the older studies have used culture dependent methods, the recent advent of high throughput sequencing of 16S rRNA genes has facilitated in depth studies exploring microbial populations and their dynamics in the animal gut. These culture independent DNA based studies generate large amounts of data and as a result contribute to a more detailed understanding of the microbiota dynamics in the gut and the ecology of the microbial populations. Of equal importance, is being able to identify and quantify microbes that are difficult to grow or that have not been grown in the laboratory. Interpreting the data obtained from this type of study requires using basic principles of microbial diversity to understand importance of the composition of microbial populations. In this review, we summarize the literature on culture independent studies of the pig gut microbiota with an emphasis on its succession and alterations caused by diverse factors. Copyright © 2015 Elsevier B.V. All rights reserved.

  9. Distribution and Diversity of Bacteria and Fungi Colonization in Stone Monuments Analyzed by High-Throughput Sequencing.

    Directory of Open Access Journals (Sweden)

    Qiang Li

    Full Text Available The historical and cultural heritage of Qingxing palace and Lingyin and Kaihua temple, located in Hangzhou of China, include a large number of exquisite Buddhist statues and ancient stone sculptures which date back to the Northern Song (960-1219 A.D. and Qing dynasties (1636-1912 A.D. and are considered to be some of the best examples of ancient stone sculpting techniques. They were added to the World Heritage List in 2011 because of their unique craftsmanship and importance to the study of ancient Chinese Buddhist culture. However, biodeterioration of the surface of the ancient Buddhist statues and white marble pillars not only severely impairs their aesthetic value but also alters their material structure and thermo-hygric properties. In this study, high-throughput sequencing was utilized to identify the microbial communities colonizing the stone monuments. The diversity and distribution of the microbial communities in six samples collected from three different environmental conditions with signs of deterioration were analyzed by means of bioinformatics software and diversity indices. In addition, the impact of environmental factors, including temperature, light intensity, air humidity, and the concentration of NO2 and SO2, on the microbial communities' diversity and distribution was evaluated. The results indicate that the presence of predominantly phototrophic microorganisms was correlated with light and humidity, while nitrifying bacteria and Thiobacillus were associated with NO2 and SO2 from air pollution.

  10. Distribution and Diversity of Bacteria and Fungi Colonization in Stone Monuments Analyzed by High-Throughput Sequencing.

    Science.gov (United States)

    Li, Qiang; Zhang, Bingjian; He, Zhang; Yang, Xiaoru

    The historical and cultural heritage of Qingxing palace and Lingyin and Kaihua temple, located in Hangzhou of China, include a large number of exquisite Buddhist statues and ancient stone sculptures which date back to the Northern Song (960-1219 A.D.) and Qing dynasties (1636-1912 A.D.) and are considered to be some of the best examples of ancient stone sculpting techniques. They were added to the World Heritage List in 2011 because of their unique craftsmanship and importance to the study of ancient Chinese Buddhist culture. However, biodeterioration of the surface of the ancient Buddhist statues and white marble pillars not only severely impairs their aesthetic value but also alters their material structure and thermo-hygric properties. In this study, high-throughput sequencing was utilized to identify the microbial communities colonizing the stone monuments. The diversity and distribution of the microbial communities in six samples collected from three different environmental conditions with signs of deterioration were analyzed by means of bioinformatics software and diversity indices. In addition, the impact of environmental factors, including temperature, light intensity, air humidity, and the concentration of NO2 and SO2, on the microbial communities' diversity and distribution was evaluated. The results indicate that the presence of predominantly phototrophic microorganisms was correlated with light and humidity, while nitrifying bacteria and Thiobacillus were associated with NO2 and SO2 from air pollution.

  11. Diversity and genetic stability in banana genotypes in a breeding program using inter simple sequence repeats (ISSR) markers.

    Science.gov (United States)

    Silva, A V C; Nascimento, A L S; Vitória, M F; Rabbani, A R C; Soares, A N R; Lédo, A S

    2017-02-23

    Banana (Musa spp) is a fruit species frequently cultivated and consumed worldwide. Molecular markers are important for estimating genetic diversity in germplasm and between genotypes in breeding programs. The objective of this study was to analyze the genetic diversity of 21 banana genotypes (FHIA 23, PA42-44, Maçã, Pacovan Ken, Bucaneiro, YB42-47, Grand Naine, Tropical, FHIA 18, PA94-01, YB42-17, Enxerto, Japira, Pacovã, Prata-Anã, Maravilha, PV79-34, Caipira, Princesa, Garantida, and Thap Maeo), by using inter-simple sequence repeat (ISSR) markers. Material was generated from the banana breeding program of Embrapa Cassava & Fruits and evaluated at Embrapa Coastal Tablelands. The 12 primers used in this study generated 97.5% polymorphism. Four clusters were identified among the different genotypes studied, and the sum of the first two principal components was 48.91%. From the Unweighted Pair Group Method using Arithmetic averages (UPGMA) dendrogram, it was possible to identify two main clusters and subclusters. Two genotypes (Garantida and Thap Maeo) remained isolated from the others, both in the UPGMA clustering and in the principal cordinate analysis (PCoA). Using ISSR markers, we could analyze the genetic diversity of the studied material and state that these markers were efficient at detecting sufficient polymorphism to estimate the genetic variability in banana genotypes.

  12. Genetic diversity and geographic differentiation analysis of duckweed using inter-simple sequence repeat markers.

    Science.gov (United States)

    Xue, Huiling; Xiao, Yao; Jin, Yanling; Li, Xinbo; Fang, Yang; Zhao, Hai; Zhao, Yun; Guan, Jiafa

    2012-01-01

    Duckweed, with rapid growth rate and high starch content, is a new alternate feedstock for bioethanol production. The genetic diversity among 27 duckweed populations of seven species in genus Lemna and Spirodela from China and Vietnam was analyzed by ISSR-PCR. Eight ISSR primers generating a reproducible amplification banding pattern had been screened. 89 polymorphic bands were scored out of the 92 banding patterns of 16 Lemna populations, accounting for 96.74% of the polymorphism. 98 polymorphic bands of 11 Spirodela populations were scored out of 99 banding patterns, and the polymorphism was 98.43%. The genetic distance of Lemna varied from 0.127 to 0.784, and from 0.138 to 0.902 for Spirodela, which indicated a high level of genetic variation among the populations studied. The unweighted pair group method with arithmetic average (UPGMA) cluster analysis corresponded well with the genetic distance. Populations from Sichuan China grouped together and so did the populations from Vietnam, which illuminated populations collected from the same region clustered into one group. Especially, the only one population from Tibet was included in subgroup A2 alone. Clustering analysis indicated that the geographic differentiation of collected sites correlated closely with the genetic differentiation of duckweeds. The results suggested that geographic differentiation had great influence on genetic diversity of duckweed in China and Vietnam at the regional scale. This study provided primary guidelines for collection, conservation, characterization of duckweed resources for bioethanol production etc.

  13. Highly diverse microbiota in dental root canals in cases of apical periodontitis (data of illumina sequencing).

    Science.gov (United States)

    Vengerfeldt, Veiko; Špilka, Katerina; Saag, Mare; Preem, Jens-Konrad; Oopkaup, Kristjan; Truu, Jaak; Mändar, Reet

    2014-11-01

    Chronic apical periodontitis (CAP) is a frequent condition that has a considerable effect on a patient's quality of life. We aimed to reveal root canal microbial communities in antibiotic-naive patients by applying Illumina sequencing (Illumina Inc, San Diego, CA). Samples were collected under strict aseptic conditions from 12 teeth (5 with primary CAP, 3 with secondary CAP, and 4 with a periapical abscess [PA]) and characterized by profiling the microbial community on the basis of the V6 hypervariable region of the 16S ribosomal RNA gene by using Illumina HiSeq2000 sequencing combinatorial sequence-tagged polymerase chain reaction products. Root canal specimens displayed highly polymicrobial communities in all 3 patient groups. One sample contained 5-8 (mean = 6.5) phyla of bacteria. The most numerous were Firmicutes and Bacteroidetes, but Actinobacteria, Fusobacteria, Proteobacteria, Spirochaetes, Tenericutes, and Synergistetes were also present in most of the patients. One sample contained 30-70 different operational taxonomic units; the mean (± standard deviation) was lower in the primary CAP group (36 ± 4) than in the PA (45 ± 4) and secondary CAP (43 ± 13) groups (P < .05). The communities were individually different, but anaerobic bacteria predominated as the rule. Enterococcus faecalis was found only in patients with secondary CAP. One PA sample displayed a significantly high proportion (47%) of Proteobacteria, mainly at the expense of Janthinobacterium lividum. This study provided an in-depth characterization of the microbiota of periapical tissues, revealing highly polymicrobial communities and minor differences between the study groups. A full understanding of the etiology of periodontal disease will only be possible through further in-depth systems-level analyses of the host-microbiome interaction. Copyright © 2014 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.

  14. Genome wide characterization of simple sequence repeats in watermelon genome and their application in comparative mapping and genetic diversity analysis.

    Science.gov (United States)

    Zhu, Huayu; Song, Pengyao; Koo, Dal-Hoe; Guo, Luqin; Li, Yanman; Sun, Shouru; Weng, Yiqun; Yang, Luming

    2016-08-05

    Microsatellite markers are one of the most informative and versatile DNA-based markers used in plant genetic research, but their development has traditionally been difficult and costly. The whole genome sequencing with next-generation sequencing (NGS) technologies provides large amounts of sequence data to develop numerous microsatellite markers at whole genome scale. SSR markers have great advantage in cross-species comparisons and allow investigation of karyotype and genome evolution through highly efficient computation approaches such as in silico PCR. Here we described genome wide development and characterization of SSR markers in the watermelon (Citrullus lanatus) genome, which were then use in comparative analysis with two other important crop species in the Cucurbitaceae family: cucumber (Cucumis sativus L.) and melon (Cucumis melo L.). We further applied these markers in evaluating the genetic diversity and population structure in watermelon germplasm collections. A total of 39,523 microsatellite loci were identified from the watermelon draft genome with an overall density of 111 SSRs/Mbp, and 32,869 SSR primers were designed with suitable flanking sequences. The dinucleotide SSRs were the most common type representing 34.09 % of the total SSR loci and the AT-rich motifs were the most abundant in all nucleotide repeat types. In silico PCR analysis identified 832 and 925 SSR markers with each having a single amplicon in the cucumber and melon draft genome, respectively. Comparative analysis with these cross-species SSR markers revealed complicated mosaic patterns of syntenic blocks among the genomes of three species. In addition, genetic diversity analysis of 134 watermelon accessions with 32 highly informative SSR loci placed these lines into two groups with all accessions of C.lanatus var. citorides and three accessions of C. colocynthis clustered in one group and all accessions of C. lanatus var. lanatus and the remaining accessions of C. colocynthis

  15. Endophytic bacterial diversity in grapevine (Vitis vinifera L.) leaves described by 16S rRNA gene sequence analysis and length heterogeneity-PCR.

    Science.gov (United States)

    Bulgari, Daniela; Casati, Paola; Brusetti, Lorenzo; Quaglino, Fabio; Brasca, Milena; Daffonchio, Daniele; Bianco, Piero Attilio

    2009-08-01

    Diversity of bacterial endophytes associated with grapevine leaf tissues was analyzed by cultivation and cultivation-independent methods. In order to identify bacterial endophytes directly from metagenome, a protocol for bacteria enrichment and DNA extraction was optimized. Sequence analysis of 16S rRNA gene libraries underscored five diverse Operational Taxonomic Units (OTUs), showing best sequence matches with gamma-Proteobacteria, family Enterobacteriaceae, with a dominance of the genus Pantoea. Bacteria isolation through cultivation revealed the presence of six OTUs, showing best sequence matches with Actinobacteria, genus Curtobacterium, and with Firmicutes genera Bacillus and Enterococcus. Length Heterogeneity-PCR (LH-PCR) electrophoretic peaks from single bacterial clones were used to setup a database representing the bacterial endophytes identified in association with grapevine tissues. Analysis of healthy and phytoplasma-infected grapevine plants showed that LH-PCR could be a useful complementary tool for examining the diversity of bacterial endophytes especially for diversity survey on a large number of samples.

  16. Genetic diversity and population structure analysis in Perilla frutescens from Northern areas of China based on simple sequence repeats.

    Science.gov (United States)

    Ma, S J; Sa, K J; Hong, T K; Lee, J K

    2017-09-21

    In this study, 21 simple sequence repeat (SSR) markers were used to evaluate the genetic diversity and population structure among 77 Perilla accessions from high-latitude and middle-latitude areas of China. Ninety-five alleles were identified with an average of 4.52 alleles per locus. The average polymorphic information content (PIC) and genetic diversity values were 0.346 and 0.372, respectively. The level of genetic diversity and PIC value for cultivated accessions of Perilla frutescens var. frutescens from middle-latitude areas were higher than accessions from high-latitude areas. Based on the dendrogram of unweighted pair group method with arithmetic mean (UPGMA), all accessions were classified into four major groups with a genetic similarity of 46%. All accessions of the cultivated var. frutescens were discriminated from the cultivated P. frutescens var. crispa. Furthermore, most accessions of the cultivated var. frutescens collected in high-latitude and middle-latitude areas were distinguished depending on their geographical location. However, the geographical locations of several accessions of the cultivated var. frutescens have no relation with their positions in the UPGMA dendrogram and population structure. This result implies that the diffusion of accessions of the cultivated Perilla crop in the northern areas of China might be through multiple routes. On the population structure analysis, 77 Perilla accessions were divided into Group I, Group II, and an admixed group based on a membership probability threshold of 0.8. Finally, the findings in this study can provide useful theoretical knowledge for further study on the population structure and genetic diversity of Perilla and benefit for Perilla crop breeding and germplasm conservation.

  17. Distinct genetic diversity of Oncomelania hupensis, intermediate host of Schistosoma japonicum in mainland China as revealed by ITS sequences.

    Directory of Open Access Journals (Sweden)

    Qin Ping Zhao

    Full Text Available BACKGROUND: Oncomelania hupensis is the unique intermediate host of Schistosoma japonicum, which causes schistosomiasis endemic in the Far East, and especially in mainland China. O. hupensis largely determines the parasite's geographical range. How O. hupensis's genetic diversity is distributed geographically in mainland China has never been well examined with DNA sequence data. METHODOLOGY/PRINCIPAL FINDINGS: In this study we investigate the genetic variation among O. hupensis from different geographical origins using the combined complete internal transcribed spacer 1 (ITS1 and ITS2 regions of nuclear ribosomal DNA. 165 O. hupensis isolates were obtained in 29 localities from 7 provinces across mainland China: lake/marshland and hill regions in Anhui, Hubei, Hunan, Jiangxi and Jiangsu provinces, located along the middle and lower reaches of Yangtze River, and mountainous regions in Sichuan and Yunnan provinces. Phylogenetic and haplotype network analyses showed distinct genetic diversity and no shared haplotypes between populations from lake/marshland regions of the middle and lower reaches of the Yangtze River and populations from mountainous regions of Sichuan and Yunnan provinces. The genetic distance between these two groups is up to 0.81 based on Fst, and branch time was estimated as 2-6 Ma. As revealed in the phylogenetic tree, snails from Sichuan and Yunnan provinces were also clustered separately. Geographical separation appears to be an important factor accounting for the diversification of the two groups of O. hupensis in mainland China, and probably for the separate clades between snails from Sichuan and Yunnan provinces. In lake/marshland and hill regions along the middle and lower reaches of the Yangtze River, three clades were identified in the phylogenetic tree, but without any obvious clustering of snails from different provinces. CONCLUSIONS: O. hupensis in mainland China may have considerable genetic diversity, and a more

  18. Immunization Coverage

    Science.gov (United States)

    ... room/fact-sheets/detail/immunization-coverage","@context":"http://schema.org","@type":"Article"}; العربية 中文 français русский español ... Plan Global Health Observatory (GHO) data - Immunization More information on vaccines and immunization News 1 in 10 ...

  19. Functional coverages

    NARCIS (Netherlands)

    Donchyts, G.; Baart, F.; Jagers, H.R.A.; Van Dam, A.

    2011-01-01

    A new Application Programming Interface (API) is presented which simplifies working with geospatial coverages as well as many other data structures of a multi-dimensional nature. The main idea extends the Common Data Model (CDM) developed at the University Corporation for Atmospheric Research

  20. Sequence diversity within the HA-1 gene as detected by melting temperature assay without oligonucleotide probes

    Directory of Open Access Journals (Sweden)

    Mattiuz Pier

    2005-10-01

    Full Text Available Abstract Background The minor histocompatibility antigens (mHags are self-peptides derived from common cellular proteins and presented by MHC class I and II molecules. Disparities in mHags are a potential risk for the development of graft-versus-host disease (GvHD in the recipients of bone marrow from HLA-identical donors. Two alleles have been identified in the mHag HA-1. The correlation between mismatches of the mHag HA-1 and GvHD has been suggested and methods to facilitate large-scale testing were afterwards developed. Methods We used sequence specific primer (SSP PCR and direct sequencing to detect HA-1 gene polymorphisms in a sample of 131 unrelated Italian subjects. We then set up a novel melting temperature (Tm assay that may help identification of HA-1 alleles without oligonucleotide probes. Results We report the frequencies of HA-1 alleles in the Italian population and the presence of an intronic 5 base-pair deletion associated with the immunogeneic allele HA-1H. We also detected novel variable sites with respect to the consensus sequence of HA-1 locus. Even though recombination/gene conversion events are documented, there is considerable linkage disequilibrium in the data. The gametic associations between HA-1R/H alleles and the intronic 5-bp ins/del polymorphism prompted us to try the Tm analysis with SYBR® Green I. We show that the addition of dimethylsulfoxide (DMSO during the assay yields distinct patterns when amplicons from HA-1H homozygotes, HA-1R homozygotes, and heterozygotes are analysed. Conclusion The possibility to use SYBR® Green I to detect Tm differences between allelic variants is attractive but requires great caution. We succeeded in allele discrimination of the HA-1 locus using a relatively short (101 bp amplicon, only in the presence of DMSO. We believe that, at least in certain assets, Tm assays may benefit by the addition of DMSO or other agents affecting DNA strand conformation and stability.

  1. Bacterial diversity of the American sand fly Lutzomyia intermedia using high-throughput metagenomic sequencing.

    Science.gov (United States)

    Monteiro, Carolina Cunha; Villegas, Luis Eduardo Martinez; Campolina, Thais Bonifácio; Pires, Ana Clara Machado Araújo; Miranda, Jose Carlos; Pimenta, Paulo Filemon Paolucci; Secundino, Nagila Francinete Costa

    2016-08-31

    Parasites of the genus Leishmania cause a broad spectrum of diseases, collectively known as leishmaniasis, in humans worldwide. American cutaneous leishmaniasis is a neglected disease transmitted by sand fly vectors including Lutzomyia intermedia, a proven vector. The female sand fly can acquire or deliver Leishmania spp. parasites while feeding on a blood meal, which is required for nutrition, egg development and survival. The microbiota composition and abundance varies by food source, life stages and physiological conditions. The sand fly microbiota can affect parasite life-cycle in the vector. We performed a metagenomic analysis for microbiota composition and abundance in Lu. intermedia, from an endemic area in Brazil. The adult insects were collected using CDC light traps, morphologically identified, carefully sterilized, dissected under a microscope and the females separated into groups according to their physiological condition: (i) absence of blood meal (unfed = UN); (ii) presence of blood meal (blood-fed = BF); and (iii) presence of developed ovaries (gravid = GR). Then, they were processed for metagenomics with Illumina Hiseq Sequencing in order to be sequence analyzed and to obtain the taxonomic profiles of the microbiota. Bacterial metagenomic analysis revealed differences in microbiota composition based upon the distinct physiological stages of the adult insect. Sequence identification revealed two phyla (Proteobacteria and Actinobacteria), 11 families and 15 genera; 87 % of the bacteria were Gram-negative, while only one family and two genera were identified as Gram-positive. The genera Ochrobactrum, Bradyrhizobium and Pseudomonas were found across all of the groups. The metagenomic analysis revealed that the microbiota of the Lu. intermedia female sand flies are distinct under specific physiological conditions and consist of 15 bacterial genera. The Ochrobactrum, Bradyrhizobium and Pseudomonas were the common genera. Our results detailing

  2. HIV sequence diversity during the early phase of infection is associated with HIV DNA reductions during antiretroviral therapy.

    Science.gov (United States)

    Wang, Nidan; Li, Yijia; Han, Yang; Xie, Jing; Li, Taisheng

    2017-06-01

    The association between baseline human immunodeficiency virus (HIV) sequence diversity and HIV DNA decay after the initiation of antiretroviral therapy (ART) remains uncharacterized during the early stages of HIV infection. Samples were obtained from a cohort of 17 patients with early HIV infection (HIV-1 envelope (env) gene was amplified via single genome amplification (SGA) to determine the peripheral plasma HIV quasispecies. We categorized HIV quasispecies into two groups according to baseline viral sequence genetic distance, which was determined by the Poisson-Fitter tool. Total HIV DNA in peripheral blood mononuclear cells (PBMCs), viral load, and T cell subsets were measured prior to and after the initiation of ART. The median SGA sequence number was 17 (range 6-28). At baseline, we identified 7 patients with homogeneous viral populations (designated the Homogeneous group) and 10 patients with heterogeneous viral populations (designated the Heterogeneous group) based on SGA sequences. Both groups exhibited similar HIV DNA decay rates during the first 6 months of ART (P > 0.99), but the Homogenous group experienced more prominent decay than the Heterogeneous group after 6 months (P = 0.037). The Heterogeneous group had higher CD4 cell counts after ART initiation; however, both groups had comparable recovery in terms of CD4/CD8 ratios and CD8 T cell activation levels. Viral population homogeneity upon the initiation of ART is associated with a decrease in HIV DNA levels during ART. J. Med. Virol. 89:982-988, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

  3. Detecting exact breakpoints of deletions with diversity in hepatitis B viral genomic DNA from next-generation sequencing data.

    Science.gov (United States)

    Cheng, Ji-Hong; Liu, Wen-Chun; Chang, Ting-Tsung; Hsieh, Sun-Yuan; Tseng, Vincent S

    2017-10-01

    Many studies have suggested that deletions of Hepatitis B Viral (HBV) are associated with the development of progressive liver diseases, even ultimately resulting in hepatocellular carcinoma (HCC). Among the methods for detecting deletions from next-generation sequencing (NGS) data, few methods considered the characteristics of virus, such as high evolution rates and high divergence among the different HBV genomes. Sequencing high divergence HBV genome sequences using the NGS technology outputs millions of reads. Thus, detecting exact breakpoints of deletions from these big and complex data incurs very high computational cost. We proposed a novel analytical method named VirDelect (Virus Deletion Detect), which uses split read alignment base to detect exact breakpoint and diversity variable to consider high divergence in single-end reads data, such that the computational cost can be reduced without losing accuracy. We use four simulated reads datasets and two real pair-end reads datasets of HBV genome sequence to verify VirDelect accuracy by score functions. The experimental results show that VirDelect outperforms the state-of-the-art method Pindel in terms of accuracy score for all simulated datasets and VirDelect had only two base errors even in real datasets. VirDelect is also shown to deliver high accuracy in analyzing the single-end read data as well as pair-end data. VirDelect can serve as an effective and efficient bioinformatics tool for physiologists with high accuracy and efficient performance and applicable to further analysis with characteristics similar to HBV on genome length and high divergence. The software program of VirDelect can be downloaded at https://sourceforge.net/projects/virdelect/. Copyright © 2017. Published by Elsevier Inc.

  4. Sequence diversities of serine-aspartate repeat genes among Staphylococcus aureus isolates from different hosts presumably by horizontal gene transfer.

    Directory of Open Access Journals (Sweden)

    Huping Xue

    Full Text Available BACKGROUND: Horizontal gene transfer (HGT is recognized as one of the major forces for bacterial genome evolution. Many clinically important bacteria may acquire virulence factors and antibiotic resistance through HGT. The comparative genomic analysis has become an important tool for identifying HGT in emerging pathogens. In this study, the Serine-Aspartate Repeat (Sdr family has been compared among different sources of Staphylococcus aureus (S. aureus to discover sequence diversities within their genomes. METHODOLOGY/PRINCIPAL FINDINGS: Four sdr genes were analyzed for 21 different S. aureus strains and 218 mastitis-associated S. aureus isolates from Canada. Comparative genomic analyses revealed that S. aureus strains from bovine mastitis (RF122 and mastitis isolates in this study, ovine mastitis (ED133, pig (ST398, chicken (ED98, and human methicillin-resistant S. aureus (MRSA (TCH130, MRSA252, Mu3, Mu50, N315, 04-02981, JH1 and JH9 were highly associated with one another, presumably due to HGT. In addition, several types of insertion and deletion were found in sdr genes of many isolates. A new insertion sequence was found in mastitis isolates, which was presumably responsible for the HGT of sdrC gene among different strains. Moreover, the sdr genes could be used to type S. aureus. Regional difference of sdr genes distribution was also indicated among the tested S. aureus isolates. Finally, certain associations were found between sdr genes and subclinical or clinical mastitis isolates. CONCLUSIONS: Certain sdr gene sequences were shared in S. aureus strains and isolates from different species presumably due to HGT. Our results also suggest that the distributional assay of virulence factors should detect the full sequences or full functional regions of these factors. The traditional assay using short conserved regions may not be accurate or credible. These findings have important implications with regard to animal husbandry practices that may

  5. Xylariaceae diversity in Thailand and Philippines, based on rDNA sequencing

    Directory of Open Access Journals (Sweden)

    Natarajan Velmurugan

    2013-05-01

    Full Text Available Twenty three different Xylariaceae Tul. & C. Tul were isolatedfrom samples collected from forest zones of Thailand and Philippines.The fungal samples were characterized based on morphological characteristics and nuclear ITS1-5.8S rDNA-ITS2 region sequences. Ten species of Xylaria, two species of Hypoxylon, Biscogniauxia, Rosellinia and one species of Annulohypoxylon and Entonaema were found. Entonaema the distinctive genus of Xylariaceae, isolated in the study from Thailand samples showed a close relationship with Xylaria in phylogenetic tree. Xylariaceous species identified at molecular level showed significant similarity of the morphological characters, such as stromal structure, ascal apex and the germ slit of ascospores. In addition, three species of Arthrinium, two species of Pestalotiopsis were also isolated and characterized in the study. A phylogenetic affinity of Pestalotiopsis with Xylariaceae was found.

  6. Xylariaceae diversity in Thailand and Philippines, based on rDNA sequencing

    Directory of Open Access Journals (Sweden)

    Natarajan Velmurugan

    2013-07-01

    Full Text Available Twenty three different Xylariaceae Tul. & C. Tul were isolated from samples collected from forest zones of Thailand and Philippines. The fungal samples were characterized based on morphological characteristics and nuclear ITS1-5.8S rDNA-ITS2 region sequences. Ten species of Xylaria, two species of Hypoxylon, Biscogniauxia, Rosellinia and one species of Annulohypoxylon and Entonaema were found. Entonaema the distinctive genus of Xylariaceae, isolated in the study from Thailand samples showed a close relationship withXylaria in phylogenetic tree. Xylariaceous species identified at molecular level showed significant similarity of the morphological characters, such as stromal structure, ascal apex and the germ slit of ascospores. In addition, three species of Arthrinium, two species of Pestalotiopsis were also isolated and characterized in the study. A phylogenetic affinity of Pestalotiopsis with Xylariaceae was found.

  7. Genetic Diversity of Selected Mangifera Species Revealed by Inter Simple Sequence Repeats Markers

    Directory of Open Access Journals (Sweden)

    Zulhairil Ariffin

    2015-01-01

    Full Text Available ISSR markers were employed to reveal genetic diversity and genetic relatedness among 28 Mangifera accessions collected from Yan (Kedah, Bukit Gantang (Perak, Sibuti (Sarawak, and Papar (Sabah. A total of 198 markers were generated using nine anchored primers and one nonanchored primer. Genetic variation among the 28 accessions of Mangifera species including wild relatives, landraces, and clonal varieties is high, with an average degree of polymorphism of 98% and mean Shannon index, H0=7.50. Analysis on 18 Mangifera indica accessions also showed high degree of polymorphism of 99% and mean Shannon index, H0=5.74. Dice index of genetic similarity ranged from 0.0938 to 0.8046 among the Mangifera species. The dendrogram showed that the Mangifera species were grouped into three main divergent clusters. Cluster 1 comprised 14 accessions from Kedah and Perak. Cluster II and cluster III comprised 14 accessions from Sarawak and Sabah. Meanwhile, the Dice index of genetic similarity for 18 accessions of Mangifera indica ranged from 0.2588 to 0.7742. The dendrogram also showed the 18 accessions of Mangifera indica were grouped into three main clusters. Cluster I comprised 10 landraces of Mangifera indica from Kedah. Cluster II comprised 7 landraces of Mangifera indica followed by Chokanan to form Cluster III.

  8. Deep sequencing and ecological characterization of gut microbial communities of diverse bumble bee species.

    Directory of Open Access Journals (Sweden)

    Haw Chuan Lim

    Full Text Available Gut bacterial communities of bumble bees are correlated with defense against pathogens. Further understanding this host-microbe association is vitally important as bumble bees are currently experiencing global population declines, potentially due in part to emergent diseases. In this study, we used pyrosequencing and community fingerprinting (ARISA to characterize the gut microbial communities of nine bumble species from across the Bombus phylogeny. Overall, we delimited 74 bacterial taxa (operational taxonomic units or OTUs belonging to Betaproteobacteria, Gammaproteobacteria, Bacilli, Actinobacteria, Flavobacteria and Alphaproteobacteria. Each bacterial community was taxonomically simple, containing an average of 1.9 common (relative abundance per sample > 5% bacterial OTUs. The most abundant and prevalent (occurring in 92% of the samples bacterial OTU, based on 16S rRNA sequences, closely matched that of the previously described Betaproteobacteria species Snodgrassella alvi. Bacteria that were first described in bee-related external environments dominated a number of gut bacterial communities, suggesting that they are not strictly dependent on the internal gut environment. The ARISA data showed a correlation between bacterial community structures and the geographic locations where the bees were sampled, suggesting that at least a subset of the bacterial species may be transmitted environmentally. Using light and fluorescent microscopy, we demonstrated that the gut bacteria form a biofilm on the internal epithelial surface of the ileum, corroborating results obtained from Apis mellifera.

  9. Diversity of Protease-Producing Bacillus spp. From Fresh Indonesian Tempeh Based on 16S rRNA Gene Sequence

    Directory of Open Access Journals (Sweden)

    Tati Barus

    2017-01-01

    Full Text Available Tempeh is a type of traditional fermented food in Indonesia. The fermentation can be performed by Rhizopus microsporus as a main microorganism. However, Bacillus spp. is found in abundance in tempeh production. Nevertheless, information regarding the diversity of Bacillus spp. in tempeh production has not been reported yet. Therefore, the aim of this investigation was to study the genetic diversity of Bacillus spp. in tempeh production based on the 16S ribosomal RNA sequence. In this study, about 22 of 24 fresh tempeh from Jakarta, Bogor, and Tangerang were used. A total of 52 protease-producing Bacillus spp. isolates were obtained. Based on 16S ribosomal RNA results, all 52 isolates were identified to be similar to B. pumilus, B. subtilis, B. megaterium, B. licheniformis, B. cereus, B. thuringiensis, B. amyloliquefaciens, Brevibacillus brevis, and Bacillus sp. All the identified isolates were divided into two large clusters: 1 a cluster of B. cereus, B. thuringiensis, Bacillus sp., and B. brevis and 2 a cluster of B. pumilus, B. subtilis, B. megaterium, B. licheniformis, and B. amyloliquefaciens. Information about the Bacillus spp. role in determining the quality of tempeh has not been reported and this is a preliminary study of Bacillus spp. from tempeh.

  10. Genetic diversity analysis in Malaysian giant prawns using expressed sequence tag microsatellite markers for stock improvement program.

    Science.gov (United States)

    Atin, K H; Christianus, A; Fatin, N; Lutas, A C; Shabanimofrad, M; Subha, B

    2017-08-17

    The Malaysian giant prawn is among the most commonly cultured species of the genus Macrobrachium. Stocks of giant prawns from four rivers in Peninsular Malaysia have been used for aquaculture over the past 25 years, which has led to repeated harvesting, restocking, and transplantation between rivers. Consequently, a stock improvement program is now important to avoid the depletion of wild stocks and the loss of genetic diversity. However, the success of such an improvement program depends on our knowledge of the genetic variation of these base populations. The aim of the current study was to estimate genetic variation and differentiation of these riverine sources using novel expressed sequence tag-microsatellite (EST-SSR) markers, which not only are informative on genetic diversity but also provide information on immune and metabolic traits. Our findings indicated that the tested stocks have inbreeding depression due to a significant deficiency in heterozygotes, and F IS was estimated as 0.15538 to 0.31938. An F-statistics analysis suggested that the stocks are composed of one large panmictic population. Among the four locations, stocks from Johor, in the southern region of the peninsular, showed higher allelic and genetic diversity than the other stocks. To overcome inbreeding problems, the Johor population could be used as a base population in a stock improvement program by crossing to the other populations. The study demonstrated that EST-SSR markers can be incorporated in future marker assisted breeding to aid the proper management of the stocks by breeders and stakeholders in Malaysia.

  11. Phylogenetic diversity and genotypical complexity of H9N2 influenza A viruses revealed by genomic sequence analysis.

    Directory of Open Access Journals (Sweden)

    Guoying Dong

    Full Text Available H9N2 influenza A viruses have become established worldwide in terrestrial poultry and wild birds, and are occasionally transmitted to mammals including humans and pigs. To comprehensively elucidate the genetic and evolutionary characteristics of H9N2 influenza viruses, we performed a large-scale sequence analysis of 571 viral genomes from the NCBI Influenza Virus Resource Database, representing the spectrum of H9N2 influenza viruses isolated from 1966 to 2009. Our study provides a panoramic framework for better understanding the genesis and evolution of H9N2 influenza viruses, and for describing the history of H9N2 viruses circulating in diverse hosts. Panorama phylogenetic analysis of the eight viral gene segments revealed the complexity and diversity of H9N2 influenza viruses. The 571 H9N2 viral genomes were classified into 74 separate lineages, which had marked host and geographical differences in phylogeny. Panorama genotypical analysis also revealed that H9N2 viruses include at least 98 genotypes, which were further divided according to their HA lineages into seven series (A-G. Phylogenetic analysis of the internal genes showed that H9N2 viruses are closely related to H3, H4, H5, H7, H10, and H14 subtype influenza viruses. Our results indicate that H9N2 viruses have undergone extensive reassortments to generate multiple reassortants and genotypes, suggesting that the continued circulation of multiple genotypical H9N2 viruses throughout the world in diverse hosts has the potential to cause future influenza outbreaks in poultry and epidemics in humans. We propose a nomenclature system for identifying and unifying all lineages and genotypes of H9N2 influenza viruses in order to facilitate international communication on the evolution, ecology and epidemiology of H9N2 influenza viruses.

  12. Illumina MiSeq sequencing analysis of fungal diversity in stored dates.

    Science.gov (United States)

    Al-Bulushi, Ismail M; Bani-Uraba, Muna S; Guizani, Nejib S; Al-Khusaibi, Mohammed K; Al-Sadi, Abdullah M

    2017-03-27

    Date palm has been a major fruit tree in the Middle East over thousands of years, especially in the Arabian Peninsula. Dates are consumed fresh (Rutab) or after partial drying and storage (Tamar) during off-season. The aim of the study was to provide in-depth analysis of fungal communities associated with the skin (outer part) and mesocarp (inner fleshy part) of stored dates (Tamar) of two cultivars (Khenizi and Burny) through the use of Illumina MiSeq sequencing. The study revealed the dominance of Ascomycota (94%) in both cultivars, followed by Chytridiomycota (4%) and Zygomycota (2%). Among the classes recovered, Eurotiomycetes, Dothideomycetes, Saccharomycetes and Sordariomycetes were the most dominant. A total of 54 fungal species were detected, with species belonging to Penicillium, Alternaria, Cladosporium and Aspergillus comprising more than 60% of the fungal reads. Some potentially mycotoxin-producing fungi were detected in stored dates, including Aspergillus flavus, A. versicolor and Penicillium citrinum, but their relative abundance was very limited (PerMANOVA analysis revealed the presence of insignificant differences in fungal communities between date parts or date cultivars, indicating that fungal species associated with the skin may also be detected in the mesocarp. It also indicates the possible contamination of dates from different cultivars with similar fungal species, even though if they are obtained from different areas. The analysis shows the presence of different fungal species in dates. This appears to be the first study to report 25 new fungal species in Oman and 28 new fungal species from date fruits. The study discusses the sources of fungi on dates and the presence of potentially mycotoxin producing fungi on date skin and mesocarp.

  13. Survey of bacterial diversity in chronic wounds using Pyrosequencing, DGGE, and full ribosome shotgun sequencing

    Directory of Open Access Journals (Sweden)

    Wolcott Benjamin M

    2008-03-01

    Full Text Available Abstract Background Chronic wound pathogenic biofilms are host-pathogen environments that colonize and exist as a cohabitation of many bacterial species. These bacterial populations cooperate to promote their own survival and the chronic nature of the infection. Few studies have performed extensive surveys of the bacterial populations that occur within different types of chronic wound biofilms. The use of 3 separate16S-based molecular amplifications followed by pyrosequencing, shotgun Sanger sequencing, and denaturing gradient gel electrophoresis were utilized to survey the major populations of bacteria that occur in the pathogenic biofilms of three types of chronic wound types: diabetic foot ulcers (D, venous leg ulcers (V, and pressure ulcers (P. Results There are specific major populations of bacteria that were evident in the biofilms of all chronic wound types, including Staphylococcus, Pseudomonas, Peptoniphilus, Enterobacter, Stenotrophomonas, Finegoldia, and Serratia spp. Each of the wound types reveals marked differences in bacterial populations, such as pressure ulcers in which 62% of the populations were identified as obligate anaerobes. There were also populations of bacteria that were identified but not recognized as wound pathogens, such as Abiotrophia para-adiacens and Rhodopseudomonas spp. Results of molecular analyses were also compared to those obtained using traditional culture-based diagnostics. Only in one wound type did culture methods correctly identify the primary bacterial population indicating the need for improved diagnostic methods. Conclusion If clinicians can gain a better understanding of the wound's microbiota, it will give them a greater understanding of the wound's ecology and will allow them to better manage healing of the wound improving the prognosis of patients. This research highlights the necessity to begin evaluating, studying, and treating chronic wound pathogenic biofilms as multi-species entities in

  14. High-throughput sequencing of microbial community diversity in soil, grapes, leaves, grape juice and wine of grapevine from China.

    Science.gov (United States)

    Wei, Yu-Jie; Wu, Yun; Yan, Yin-Zhuo; Zou, Wan; Xue, Jie; Ma, Wen-Rui; Wang, Wei; Tian, Ge; Wang, Li-Ye

    2018-01-01

    In this study Illumina MiSeq was performed to investigate microbial diversity in soil, leaves, grape, grape juice and wine. A total of 1,043,102 fungal Internal Transcribed Spacer (ITS) reads and 2,422,188 high quality bacterial 16S rDNA sequences were used for taxonomic classification, revealed five fungal and eight bacterial phyla. At the genus level, the dominant fungi were Ascomycota, Sordariales, Tetracladium and Geomyces in soil, Aureobasidium and Pleosporaceae in grapes leaves, Aureobasidium in grape and grape juice. The dominant bacteria were Kaistobacter, Arthrobacter, Skermanella and Sphingomonas in soil, Pseudomonas, Acinetobacter and Kaistobacter in grape and grapes leaves, and Oenococcus in grape juice and wine. Principal coordinate analysis showed structural separation between the composition of fungi and bacteria in all samples. This is the first study to understand microbiome population in soil, grape, grapes leaves, grape juice and wine in Xinjiang through High-throughput Sequencing and identify microorganisms like Saccharomyces cerevisiae and Oenococcus spp. that may contribute to the quality and flavor of wine.

  15. Diversity of thermophiles in a Malaysian hot spring determined using 16S rRNA and shotgun metagenome sequencing.

    Science.gov (United States)

    Chan, Chia Sing; Chan, Kok-Gan; Tay, Yea-Ling; Chua, Yi-Heng; Goh, Kian Mau

    2015-01-01

    The Sungai Klah (SK) hot spring is the second hottest geothermal spring in Malaysia. This hot spring is a shallow, 150-m-long, fast-flowing stream, with temperatures varying from 50 to 110°C and a pH range of 7.0-9.0. Hidden within a wooded area, the SK hot spring is continually fed by plant litter, resulting in a relatively high degree of total organic content (TOC). In this study, a sample taken from the middle of the stream was analyzed at the 16S rRNA V3-V4 region by amplicon metagenome sequencing. Over 35 phyla were detected by analyzing the 16S rRNA data. Firmicutes and Proteobacteria represented approximately 57% of the microbiome. Approximately 70% of the detected thermophiles were strict anaerobes; however, Hydrogenobacter spp., obligate chemolithotrophic thermophiles, represented one of the major taxa. Several thermophilic photosynthetic microorganisms and acidothermophiles were also detected. Most of the phyla identified by 16S rRNA were also found using the shotgun metagenome approaches. The carbon, sulfur, and nitrogen metabolism within the SK hot spring community were evaluated by shotgun metagenome sequencing, and the data revealed diversity in terms of metabolic activity and dynamics. This hot spring has a rich diversified phylogenetic community partly due to its natural environment (plant litter, high TOC, and a shallow stream) and geochemical parameters (broad temperature and pH range). It is speculated that symbiotic relationships occur between the members of the community.

  16. Unexpected diversity in the mobilome of a Pseudomonas aeruginosa strain isolated from a dental unit waterline revealed by SMRT Sequencing.

    Science.gov (United States)

    Vincent, Antony T; Charette, Steve J; Barbeau, Jean

    2018-05-01

    The Gram-negative bacterium Pseudomonas aeruginosa is found in several habitats, both natural and human-made, and is particularly known for its recurrent presence as a pathogen in the lungs of patients suffering from cystic fibrosis, a genetic disease. Given its clinical importance, several major studies have investigated the genomic adaptation of P. aeruginosa in lungs and its transition as acute infections become chronic. However, our knowledge about the diversity and adaptation of the P. aeruginosa genome to non-clinical environments is still fragmentary, in part due to the lack of accurate reference genomes of strains from the numerous environments colonized by the bacterium. Here, we used PacBio long-read technology to sequence the genome of PPF-1, a strain of P. aeruginosa isolated from a dental unit waterline. Generating this closed genome was an opportunity to investigate genomic features that are difficult to accurately study in a draft genome (contigs state). It was possible to shed light on putative genomic islands, some shared with other reference genomes, new prophages, and the complete content of insertion sequences. In addition, four different group II introns were also found, including two characterized here and not listed in the specialized group II intron database.

  17. Investigating intra-host and intra-herd sequence diversity of foot-and-mouth disease virus.

    Science.gov (United States)

    King, David J; Freimanis, Graham L; Orton, Richard J; Waters, Ryan A; Haydon, Daniel T; King, Donald P

    2016-10-01

    Due to the poor-fidelity of the enzymes involved in RNA genome replication, foot-and-mouth disease (FMD) virus samples comprise of unique polymorphic populations. In this study, deep sequencing was utilised to characterise the diversity of FMD virus (FMDV) populations in 6 infected cattle present on a single farm during the series of outbreaks in the UK in 2007. A novel RT-PCR method was developed to amplify a 7.6kb nucleotide fragment encompassing the polyprotein coding region of the FMDV genome. Illumina sequencing of each sample identified the fine polymorphic structures at each nucleotide position, from consensus level changes to variants present at a 0.24% frequency. These data were used to investigate population dynamics of FMDV at both herd and host levels, evaluate the impact of host on the viral swarm structure and to identify transmission links with viruses recovered from other farms in the same series of outbreaks. In 7 samples, from 6 different animals, a total of 5 consensus level variants were identified, in addition to 104 sub-consensus variants of which 22 were shared between 2 or more animals. Further analysis revealed differences in swarm structures from samples derived from the same animal suggesting the presence of distinct viral populations evolving independently at different lesion sites within the same infected animal. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

  18. High-throughput sequencing of microbial community diversity in soil, grapes, leaves, grape juice and wine of grapevine from China

    Science.gov (United States)

    Yan, Yin-zhuo; Zou, Wan; Ma, Wen-rui; Wang, Wei; Tian, Ge; Wang, Li-ye

    2018-01-01

    In this study Illumina MiSeq was performed to investigate microbial diversity in soil, leaves, grape, grape juice and wine. A total of 1,043,102 fungal Internal Transcribed Spacer (ITS) reads and 2,422,188 high quality bacterial 16S rDNA sequences were used for taxonomic classification, revealed five fungal and eight bacterial phyla. At the genus level, the dominant fungi were Ascomycota, Sordariales, Tetracladium and Geomyces in soil, Aureobasidium and Pleosporaceae in grapes leaves, Aureobasidium in grape and grape juice. The dominant bacteria were Kaistobacter, Arthrobacter, Skermanella and Sphingomonas in soil, Pseudomonas, Acinetobacter and Kaistobacter in grape and grapes leaves, and Oenococcus in grape juice and wine. Principal coordinate analysis showed structural separation between the composition of fungi and bacteria in all samples. This is the first study to understand microbiome population in soil, grape, grapes leaves, grape juice and wine in Xinjiang through High-throughput Sequencing and identify microorganisms like Saccharomyces cerevisiae and Oenococcus spp. that may contribute to the quality and flavor of wine. PMID:29565999

  19. Unravelling the Molecular Epidemiology and Genetic Diversity among Burkholderia pseudomallei Isolates from South India Using Multi-Locus Sequence Typing.

    Science.gov (United States)

    Tellapragada, Chaitanya; Kamthan, Aayushi; Shaw, Tushar; Ke, Vandana; Kumar, Subodh; Bhat, Vinod; Mukhopadhyay, Chiranjay

    2016-01-01

    There is a slow but steady rise in the case detection rates of melioidosis from various parts of the Indian sub-continent in the past two decades. However, the epidemiology of the disease in India and the surrounding South Asian countries remains far from well elucidated. Multi-locus sequence typing (MLST) is a useful epidemiological tool to study the genetic relatedness of bacterial isolates both with-in and across the countries. With this background, we studied the molecular epidemiology of 32 Burkholderia pseudomallei isolates (31 clinical and 1 soil isolate) obtained during 2006-2015 from various parts of south India using multi-locus sequencing typing and analysis. Of the 32 isolates included in the analysis, 30 (93.7%) had novel allelic profiles that were not reported previously. Sequence type (ST) 1368 (n = 15, 46.8%) with allelic profile (1, 4, 6, 4, 1, 1, 3) was the most common genotype observed. We did not observe a genotypic association of STs with geographical location, type of infection and year of isolation in the present study. Measure of genetic differentiation (FST) between Indian and the rest of world isolates was 0.14413. Occurrence of the same ST across three adjacent states of south India suggest the dispersion of B.pseudomallei across the south western coastal part of India with limited geographical clustering. However, majority of the STs reported from the present study remained as "outliers" on the eBURST "Population snapshot", suggesting the genetic diversity of Indian isolates from the Australasian and Southeast Asian isolates.

  20. Unravelling the Molecular Epidemiology and Genetic Diversity among Burkholderia pseudomallei Isolates from South India Using Multi-Locus Sequence Typing.

    Directory of Open Access Journals (Sweden)

    Chaitanya Tellapragada

    Full Text Available There is a slow but steady rise in the case detection rates of melioidosis from various parts of the Indian sub-continent in the past two decades. However, the epidemiology of the disease in India and the surrounding South Asian countries remains far from well elucidated. Multi-locus sequence typing (MLST is a useful epidemiological tool to study the genetic relatedness of bacterial isolates both with-in and across the countries. With this background, we studied the molecular epidemiology of 32 Burkholderia pseudomallei isolates (31 clinical and 1 soil isolate obtained during 2006-2015 from various parts of south India using multi-locus sequencing typing and analysis. Of the 32 isolates included in the analysis, 30 (93.7% had novel allelic profiles that were not reported previously. Sequence type (ST 1368 (n = 15, 46.8% with allelic profile (1, 4, 6, 4, 1, 1, 3 was the most common genotype observed. We did not observe a genotypic association of STs with geographical location, type of infection and year of isolation in the present study. Measure of genetic differentiation (FST between Indian and the rest of world isolates was 0.14413. Occurrence of the same ST across three adjacent states of south India suggest the dispersion of B.pseudomallei across the south western coastal part of India with limited geographical clustering. However, majority of the STs reported from the present study remained as "outliers" on the eBURST "Population snapshot", suggesting the genetic diversity of Indian isolates from the Australasian and Southeast Asian isolates.

  1. Deep sequencing of the Trypanosoma cruzi GP63 surface proteases reveals diversity and diversifying selection among chronic and congenital Chagas disease patients.

    Science.gov (United States)

    Llewellyn, Martin S; Messenger, Louisa A; Luquetti, Alejandro O; Garcia, Lineth; Torrico, Faustino; Tavares, Suelene B N; Cheaib, Bachar; Derome, Nicolas; Delepine, Marc; Baulard, Céline; Deleuze, Jean-Francois; Sauer, Sascha; Miles, Michael A

    2015-04-01

    Chagas disease results from infection with the diploid protozoan parasite Trypanosoma cruzi. T. cruzi is highly genetically diverse, and multiclonal infections in individual hosts are common, but little studied. In this study, we explore T. cruzi infection multiclonality in the context of age, sex and clinical profile among a cohort of chronic patients, as well as paired congenital cases from Cochabamba, Bolivia and Goias, Brazil using amplicon deep sequencing technology. A 450bp fragment of the trypomastigote TcGP63I surface protease gene was amplified and sequenced across 70 chronic and 22 congenital cases on the Illumina MiSeq platform. In addition, a second, mitochondrial target--ND5--was sequenced across the same cohort of cases. Several million reads were generated, and sequencing read depths were normalized within patient cohorts (Goias chronic, n = 43, Goias congenital n = 2, Bolivia chronic, n = 27; Bolivia congenital, n = 20), Among chronic cases, analyses of variance indicated no clear correlation between intra-host sequence diversity and age, sex or symptoms, while principal coordinate analyses showed no clustering by symptoms between patients. Between congenital pairs, we found evidence for the transmission of multiple sequence types from mother to infant, as well as widespread instances of novel genotypes in infants. Finally, non-synonymous to synonymous (dn:ds) nucleotide substitution ratios among sequences of TcGP63Ia and TcGP63Ib subfamilies within each cohort provided powerful evidence of strong diversifying selection at this locus. Our results shed light on the diversity of parasite DTUs within each patient, as well as the extent to which parasite strains pass between mother and foetus in congenital cases. Although we were unable to find any evidence that parasite diversity accumulates with age in our study cohorts, putative diversifying selection within members of the TcGP63I gene family suggests a link between genetic diversity within this gene

  2. Genetic diversity based on 28S rDNA sequences among populations of Culex quinquefasciatus collected at different locations in Tamil Nadu, India.

    Science.gov (United States)

    Sakthivelkumar, S; Ramaraj, P; Veeramani, V; Janarthanan, S

    2015-09-01

    The basis of the present study was to distinguish the existence of any genetic variability among populations of Culex quinquefasciatus which would be a valuable tool in the management of mosquito control programmes. In the present study, population of Cx. quinquefasciatus collected at different locations in Tamil Nadu were analyzed for their genetic variation based on 28S rDNA D2 region nucleotide sequences. A high degree of genetic polymorphism was detected in the sequences of D2 region of 28S rDNA on the predicted secondary structures in spite of high nucleotide sequence similarity. The findings based on secondary structure using rDNA sequences suggested the existence of a complex genotypic diversity of Cx. quinquefasciatus population collected at different locations of Tamil Nadu, India. This complexity in genetic diversity in a single mosquito population collected at different locations is considered an important issue towards their influence and nature of vector potential of these mosquitoes.

  3. SNP discovery in common bean by restriction-associated DNA (RAD) sequencing for genetic diversity and population structure analysis.

    Science.gov (United States)

    Valdisser, Paula Arielle M R; Pappas, Georgios J; de Menezes, Ivandilson P P; Müller, Bárbara S F; Pereira, Wendell J; Narciso, Marcelo G; Brondani, Claudio; Souza, Thiago L P O; Borba, Tereza C O; Vianello, Rosana P

    2016-06-01

    Researchers have made great advances into the development and application of genomic approaches for common beans, creating opportunities to driving more real and applicable strategies for sustainable management of the genetic resource towards plant breeding. This work provides useful polymorphic single-nucleotide polymorphisms (SNPs) for high-throughput common bean genotyping developed by RAD (restriction site-associated DNA) sequencing. The RAD tags were generated from DNA pooled from 12 common bean genotypes, including breeding lines of different gene pools and market classes. The aligned sequences identified 23,748 putative RAD-SNPs, of which 3357 were adequate for genotyping; 1032 RAD-SNPs with the highest ADT (assay design tool) score are presented in this article. The RAD-SNPs were structurally annotated in different coding (47.00 %) and non-coding (53.00 %) sequence components of genes. A subset of 384 RAD-SNPs with broad genome distribution was used to genotype a diverse panel of 95 common bean germplasms and revealed a successful amplification rate of 96.6 %, showing 73 % of polymorphic SNPs within the Andean group and 83 % in the Mesoamerican group. A slightly increased He (0.161, n = 21) value was estimated for the Andean gene pool, compared to the Mesoamerican group (0.156, n = 74). For the linkage disequilibrium (LD) analysis, from a group of 580 SNPs (289 RAD-SNPs and 291 BARC-SNPs) genotyped for the same set of genotypes, 70.2 % were in LD, decreasing to 0.10 %in the Andean group and 0.77 % in the Mesoamerican group. Haplotype patterns spanning 310 Mb of the genome (60 %) were characterized in samples from different origins. However, the haplotype frameworks were under-represented for the Andean (7.85 %) and Mesoamerican (5.55 %) gene pools separately. In conclusion, RAD sequencing allowed the discovery of hundreds of useful SNPs for broad genetic analysis of common bean germplasm. From now, this approach provides an excellent panel

  4. A recent bottleneck of Y chromosome diversity coincides with a global change in culture

    DEFF Research Database (Denmark)

    Karmin, Monika; Saag, Lauri; Vicente, Mário

    2015-01-01

    It is commonly thought that human genetic diversity in non-African populations was shaped primarily by an out-of-Africa dispersal 50-100 thousand yr ago (kya). Here, we present a study of 456 geographically diverse high-coverage Y chromosome sequences, including 299 newly reported samples. Applyi...

  5. A recent bottleneck of Y chromosome diversity coincides with a global change in culture

    NARCIS (Netherlands)

    Karmin, M.; Saag, L.; Vicente, M.; Wilson Sayres, M.A.; Jarve, M.; Talas, U.G.; Rootsi, S.; Ilumae, A.M.; Magi, R.; Mitt, M.; Pagani, L.; Puurand, T.; Faltyskova, Z.; Clemente, F.; Cardona, A.; Metspalu, E.; Sahakyan, H.; Yunusbayev, B.; Hudjashov, G.; DeGiorgio, M.; Loogvali, E.L.; Eichstaedt, C.; Eelmets, M.; Chaubey, G.; Tambets, K.; Litvinov, S.; Mormina, M.; Xue, Y.; Ayub, Q.; Zoraqi, G.; Korneliussen, T.S.; Akhatova, F.; Lachance, J.; Tishkoff, S.; Momynaliev, K.; Ricaut, F.X.; Kusuma, P.; Razafindrazaka, H.; Pierron, D.; Cox, M.P.; Sultana, G.N.; Willerslev, R.; Muller, C.; Westaway, M.; Lambert, D.; Skaro, V.; Kovacevic, L.; Turdikulova, S.; Dalimova, D.; Khusainova, R.; Trofimova, N.; Akhmetova, V.; Khidiyatova, I.; Lichman, D.V.; Isakova, J.; Pocheshkhova, E.; Sabitov, Z.; Barashkov, N.A.; Nymadawa, P.; Mihailov, E.; Seng, J.W.; Evseeva, I.; Migliano, A.B.; Abdullah, S.; Andriadze, G.; Primorac, D.; Atramentova, L.; Utevska, O.; Yepiskoposyan, L.; Marjanovic, D.; Kushniarevich, A.; Behar, D.M.; Gilissen, C.; Vissers, L.; Veltman, J.A.; Balanovska, E.; Derenko, M.; Malyarchuk, B.; Metspalu, A.; Fedorova, S.; Eriksson, A.; Manica, A.; Mendez, F.L.; Karafet, T.M.; Veeramah, K.R.; Bradman, N.; Hammer, M.F.; Osipova, L.P.; Balanovsky, O.; Khusnutdinova, E.K.; Johnsen, K.; Remm, M.; Thomas, M.G.; Tyler-Smith, C.; Underhill, P.A.; Willerslev, E.; Nielsen, R.; Metspalu, M.; Villems, R.; Kivisild, T.

    2015-01-01

    It is commonly thought that human genetic diversity in non-African populations was shaped primarily by an out-of-Africa dispersal 50-100 thousand yr ago (kya). Here, we present a study of 456 geographically diverse high-coverage Y chromosome sequences, including 299 newly reported samples. Applying

  6. Diversity of endophytic and rhizoplane bacterial communities associated with exotic Spartina alterniflora and native mangrove using Illumina amplicon sequencing.

    Science.gov (United States)

    Hong, Youwei; Liao, Dan; Hu, Anyi; Wang, Han; Chen, Jinsheng; Khan, Sardar; Su, Jianqiang; Li, Hu

    2015-10-01

    Root-associated microbial communities are very important for biogeochemical cycles in wetland ecosystems and help to elaborate the mechanisms of plant invasions. In the estuary of Jiulong River (China), Spartina alterniflora has widely invaded Kandelia obovata-dominated habitats, offering an opportunity to study the influence of root-associated bacteria. The community structures of endophytic and rhizosphere bacteria associated with selected plant species were investigated using the barcoded Illumina paired-end sequencing technique. The diversity indices of bacteria associated with the roots of S. alterniflora were higher than those of the transition stands and K. obovata monoculture. Using principal coordinate analysis with UniFrac metrics, the comparison of β-diversity showed that all samples could be significantly clustered into 3 major groups, according to the bacteria communities of origin. Four phyla, namely Proteobacteria, Bacteroidetes, Chloroflexi, and Firmicutes, were enriched in the rhizoplane of both salt marsh plants, while they shared higher abundances of Cyanobacteria and Proteobacteria among endophytic bacteria. Members of the phyla Spirochaetes and Chloroflexi were found among the endophytic bacteria of S. alterniflora and K. obovata, respectively. One of the interesting findings was that endophytes were more sensitive in response to plant invasion than were rhizosphere bacteria. With linear discriminate analysis, we found some predominant rhizoplane and endophytic bacteria, including Methylococcales, Pseudoalteromonadacea, Clostridium, Vibrio, and Desulfovibrio, which have the potential to affect the carbon, nitrogen, and sulfur cycles. Thus, the results provide clues to the isolation of functional bacteria and the effects of root-associated microbial groups on S. alterniflora invasions.

  7. Assessing Symbiodinium diversity in scleractinian corals via next-generation sequencing-based genotyping of the ITS2 rDNA region

    KAUST Repository

    Arif, Chatchanit; Daniels, Camille; Bayer, Till; Banguera Hinestroza, Eulalia; Barbrook, Adrian; Howe, Christopher J.; LaJeunesse, Todd C.; Voolstra, Christian R.

    2014-01-01

    The persistence of coral reef ecosystems relies on the symbiotic relationship between scleractinian corals and intracellular, photosynthetic dinoflagellates in the genus Symbiodinium. Genetic evidence indicates that these symbionts are biologically diverse and exhibit discrete patterns of environmental and host distribution. This makes the assessment of Symbiodinium diversity critical to understanding the symbiosis ecology of corals. Here, we applied pyrosequencing to the elucidation of Symbiodinium diversity via analysis of the internal transcribed spacer 2 (ITS2) region, a multicopy genetic marker commonly used to analyse Symbiodinium diversity. Replicated data generated from isoclonal Symbiodinium cultures showed that all genomes contained numerous, yet mostly rare, ITS2 sequence variants. Pyrosequencing data were consistent with more traditional denaturing gradient gel electrophoresis (DGGE) approaches to the screening of ITS2 PCR amplifications, where the most common sequences appeared as the most intense bands. Further, we developed an operational taxonomic unit (OTU)-based pipeline for Symbiodinium ITS2 diversity typing to provisionally resolve ecologically discrete entities from intragenomic variation. A genetic distance cut-off of 0.03 collapsed intragenomic ITS2 variants of isoclonal cultures into single OTUs. When applied to the analysis of field-collected coral samples, our analyses confirm that much of the commonly observed Symbiodinium ITS2 diversity can be attributed to intragenomic variation. We conclude that by analysing Symbiodinium populations in an OTU-based framework, we can improve objectivity, comparability and simplicity when assessing ITS2 diversity in field-based studies.

  8. Assessing Symbiodinium diversity in scleractinian corals via next-generation sequencing-based genotyping of the ITS2 rDNA region

    KAUST Repository

    Arif, Chatchanit

    2014-09-01

    The persistence of coral reef ecosystems relies on the symbiotic relationship between scleractinian corals and intracellular, photosynthetic dinoflagellates in the genus Symbiodinium. Genetic evidence indicates that these symbionts are biologically diverse and exhibit discrete patterns of environmental and host distribution. This makes the assessment of Symbiodinium diversity critical to understanding the symbiosis ecology of corals. Here, we applied pyrosequencing to the elucidation of Symbiodinium diversity via analysis of the internal transcribed spacer 2 (ITS2) region, a multicopy genetic marker commonly used to analyse Symbiodinium diversity. Replicated data generated from isoclonal Symbiodinium cultures showed that all genomes contained numerous, yet mostly rare, ITS2 sequence variants. Pyrosequencing data were consistent with more traditional denaturing gradient gel electrophoresis (DGGE) approaches to the screening of ITS2 PCR amplifications, where the most common sequences appeared as the most intense bands. Further, we developed an operational taxonomic unit (OTU)-based pipeline for Symbiodinium ITS2 diversity typing to provisionally resolve ecologically discrete entities from intragenomic variation. A genetic distance cut-off of 0.03 collapsed intragenomic ITS2 variants of isoclonal cultures into single OTUs. When applied to the analysis of field-collected coral samples, our analyses confirm that much of the commonly observed Symbiodinium ITS2 diversity can be attributed to intragenomic variation. We conclude that by analysing Symbiodinium populations in an OTU-based framework, we can improve objectivity, comparability and simplicity when assessing ITS2 diversity in field-based studies.

  9. Yeast diversity during the fermentation of Andean chicha: A comparison of high-throughput sequencing and culture-dependent approaches.

    Science.gov (United States)

    Mendoza, Lucía M; Neef, Alexander; Vignolo, Graciela; Belloch, Carmela

    2017-10-01

    Diversity and dynamics of yeasts associated with the fermentation of Argentinian maize-based beverage chicha was investigated. Samples taken at different stages from two chicha productions were analyzed by culture-dependent and culture-independent methods. Five hundred and ninety six yeasts were isolated by classical microbiological methods and 16 species identified by RFLPs and sequencing of D1/D2 26S rRNA gene. Genetic typing of isolates from the dominant species, Saccharomyces cerevisiae, by PCR of delta elements revealed up to 42 different patterns. High-throughput sequencing (HTS) of D1/D2 26S rRNA gene amplicons from chicha samples detected more than one hundred yeast species and almost fifty filamentous fungi taxa. Analysis of the data revealed that yeasts dominated the fermentation, although, a significant percentage of filamentous fungi appeared in the first step of the process. Statistical analysis of results showed that very few taxa were represented by more than 1% of the reads per sample at any step of the process. S. cerevisiae represented more than 90% of the reads in the fermentative samples. Other yeast species dominated the pre-fermentative steps and abounded in fermented samples when S. cerevisiae was in percentages below 90%. Most yeasts species detected by pyrosequencing were not recovered by cultivation. In contrast, the cultivation-based methodology detected very few yeast taxa, and most of them corresponded with very few reads in the pyrosequencing analysis. Copyright © 2017 Elsevier Ltd. All rights reserved.

  10. Diversity of thermophiles in a Malaysian hot spring determined using 16S rRNA and shotgun metagenome sequencing

    Directory of Open Access Journals (Sweden)

    Chia Sing eChan

    2015-03-01

    Full Text Available The Sungai Klah (SK hot spring is the second hottest geothermal spring in Malaysia. This hot spring is a shallow, 150-meter-long, fast-flowing stream, with temperatures varying from 50 to 110°C and a pH range of 7.0 to 9.0. Hidden within a wooded area, the SK hot spring is continually fed by plant litter, resulting in a relatively high degree of total organic content (TOC. In this study, a sample taken from the middle of the stream was analyzed at the 16S rRNA V3−V4 region by amplicon metagenome sequencing. Over 35 phyla were detected by analyzing the 16S rRNA data. Firmicutes and Proteobacteria represented approximately 57% of the microbiome. Approximately 70% of the detected thermophiles were strict anaerobes; however, Hydrogenobacter spp., obligate chemolithotrophic thermophiles, represented one of the major taxa. Several thermophilic photosynthetic microorganisms and acidothermophiles were also detected. Most of the phyla identified by 16S rRNA were also found using the shotgun metagenome approaches. The carbon, sulfur, and nitrogen metabolism within the SK hot spring community were evaluated by shotgun metagenome sequencing, and the data revealed diversity in terms of metabolic activity and dynamics. This hot spring has a rich diversified phylogenetic community partly due to its natural environment (plant litter, high TOC, and a shallow stream and geochemical parameters (broad temperature and pH range. It is speculated that symbiotic relationships occur between the members of the community.

  11. Sequence-Based Discovery Demonstrates That Fixed Light Chain Human Transgenic Rats Produce a Diverse Repertoire of Antigen-Specific Antibodies

    Directory of Open Access Journals (Sweden)

    Katherine E. Harris

    2018-04-01

    Full Text Available We created a novel transgenic rat that expresses human antibodies comprising a diverse repertoire of heavy chains with a single common rearranged kappa light chain (IgKV3-15-JK1. This fixed light chain animal, called OmniFlic, presents a unique system for human therapeutic antibody discovery and a model to study heavy chain repertoire diversity in the context of a constant light chain. The purpose of this study was to analyze heavy chain variable gene usage, clonotype diversity, and to describe the sequence characteristics of antigen-specific monoclonal antibodies (mAbs isolated from immunized OmniFlic animals. Using next-generation sequencing antibody repertoire analysis, we measured heavy chain variable gene usage and the diversity of clonotypes present in the lymph node germinal centers of 75 OmniFlic rats immunized with 9 different protein antigens. Furthermore, we expressed 2,560 unique heavy chain sequences sampled from a diverse set of clonotypes as fixed light chain antibody proteins and measured their binding to antigen by ELISA. Finally, we measured patterns and overall levels of somatic hypermutation in the full B-cell repertoire and in the 2,560 mAbs tested for binding. The results demonstrate that OmniFlic animals produce an abundance of antigen-specific antibodies with heavy chain clonotype diversity that is similar to what has been described with unrestricted light chain use in mammals. In addition, we show that sequence-based discovery is a highly effective and efficient way to identify a large number of diverse monoclonal antibodies to a protein target of interest.

  12. The ITS1-5.8S-ITS2 sequence region in the Musaceae: structure, diversity and use in molecular phylogeny.

    Directory of Open Access Journals (Sweden)

    Eva Hřibová

    2011-03-01

    Full Text Available Genes coding for 45S ribosomal RNA are organized in tandem arrays of up to several thousand copies and contain 18S, 5.8S and 26S rRNA units separated by internal transcribed spacers ITS1 and ITS2. While the rRNA units are evolutionary conserved, ITS show high level of interspecific divergence and have been used frequently in genetic diversity and phylogenetic studies. In this work we report on the structure and diversity of the ITS region in 87 representatives of the family Musaceae. We provide the first detailed information on ITS sequence diversity in the genus Musa and describe the presence of more than one type of ITS sequence within individual species. Both Sanger sequencing of amplified ITS regions and whole genome 454 sequencing lead to similar phylogenetic inferences. We show that it is necessary to identify putative pseudogenic ITS sequences, which may have negative effect on phylogenetic reconstruction at lower taxonomic levels. Phylogenetic reconstruction based on ITS sequence showed that the genus Musa is divided into two distinct clades--Callimusa and Australimusa and Eumusa and Rhodochlamys. Most of the intraspecific banana hybrids analyzed contain conserved parental ITS sequences, indicating incomplete concerted evolution of rDNA loci. Independent evolution of parental rDNA in hybrids enables determination of genomic constitution of hybrids using ITS. The observation of only one type of ITS sequence in some of the presumed interspecific hybrid clones warrants further study to confirm their hybrid origin and to unravel processes leading to evolution of their genomes.

  13. Bacterial diversity of soil under eucalyptus assessed by 16S rDNA sequencing analysis Diversidade bacteriana de solo sob eucaliptos obtida por seqüenciamento do 16S rDNA

    Directory of Open Access Journals (Sweden)

    Érico Leandro da Silveira

    2006-10-01

    Full Text Available Studies on the impact of Eucalyptus spp. on Brazilian soils have focused on soil chemical properties and isolating interesting microbial organisms. Few studies have focused on microbial diversity and ecology in Brazil due to limited coverage of traditional cultivation and isolation methods. Molecular microbial ecology methods based on PCR amplified 16S rDNA have enriched the knowledge of soils microbial biodiversity. The objective of this work was to compare and estimate the bacterial diversity of sympatric communities within soils from two areas, a native forest (NFA and an eucalyptus arboretum (EAA. PCR primers, whose target soil metagenomic 16S rDNA were used to amplify soil DNA, were cloned using pGEM-T and sequenced to determine bacterial diversity. From the NFA soil 134 clones were analyzed, while 116 clones were analyzed from the EAA soil samples. The sequences were compared with those online at the GenBank. Phylogenetic analyses revealed differences between the soil types and high diversity in both communities. Soil from the Eucalyptus spp. arboretum was found to have a greater bacterial diversity than the soil investigated from the native forest area.Estudos sobre impacto do Eucalyptus spp. em solos brasileiros têm focalizado propriedades químicas do solo e isolamento de microrganismos de interesse. No Brasil há pouco enfoque em ecologia e diversidade microbiana, devido às limitações dos métodos tradicionais de cultivo e isolamento. A utilização de métodos moleculares no estudo da ecologia microbiana baseados na amplificação por PCR do 16S rDNA têm enriquecido o conhecimento da biodiversidade microbiana dos solos. O objetivo deste trabalho foi comparar e estimar a diversidade bacteriana de comunidades simpátricas em solos de duas áreas: uma floresta nativa (NFA e outra adjacente com arboreto de eucaliptos (EAA. Oligonucleotídeos iniciadores foram utilizados para amplificar o 16S rDNA metagenômico do solo, o qual foi

  14. Low level of sequence diversity at merozoite surface protein-1 locus of Plasmodium ovale curtisi and P. ovale wallikeri from Thai isolates.

    Science.gov (United States)

    Putaporntip, Chaturong; Hughes, Austin L; Jongwutiwes, Somchai

    2013-01-01

    The merozoite surface protein-1 (MSP-1) is a candidate target for the development of blood stage vaccines against malaria. Polymorphism in MSP-1 can be useful as a genetic marker for strain differentiation in malarial parasites. Although sequence diversity in the MSP-1 locus has been extensively analyzed in field isolates of Plasmodium falciparum and P. vivax, the extent of variation in its homologues in P. ovale curtisi and P. ovale wallikeri, remains unknown. Analysis of the mitochondrial cytochrome b sequences of 10 P. ovale isolates from symptomatic malaria patients from diverse endemic areas of Thailand revealed co-existence of P. ovale curtisi (n = 5) and P. ovale wallikeri (n = 5). Direct sequencing of the PCR-amplified products encompassing the entire coding region of MSP-1 of P. ovale curtisi (PocMSP-1) and P. ovale wallikeri (PowMSP-1) has identified 3 imperfect repeated segments in the former and one in the latter. Most amino acid differences between these proteins were located in the interspecies variable domains of malarial MSP-1. Synonymous nucleotide diversity (πS) exceeded nonsynonymous nucleotide diversity (πN) for both PocMSP-1 and PowMSP-1, albeit at a non-significant level. However, when MSP-1 of both these species was considered together, πS was significantly greater than πN (pdiversity at this locus prior to speciation. Phylogenetic analysis based on conserved domains has placed PocMSP-1 and PowMSP-1 in a distinct bifurcating branch that probably diverged from each other around 4.5 million years ago. The MSP-1 sequences support that P. ovale curtisi and P. ovale wallikeri are distinct species. Both species are sympatric in Thailand. The low level of sequence diversity in PocMSP-1 and PowMSP-1 among Thai isolates could stem from persistent low prevalence of these species, limiting the chance of outcrossing at this locus.

  15. Coverage of the employment of people with functional diversity in the press: ABC and El País newspapers (1978-2012

    Directory of Open Access Journals (Sweden)

    Borja González Luna

    2015-06-01

    Full Text Available This paper presents the results of analyses performed on news articles related to ordinary and protected employment for people with functional diversity in a sample taken from ABC and El País newspapers between 1978 and 2012. Employment is a common topic covered in the press and present in news that reports on percentages of unemployed people with functional diversity, personal stories of job success, opinions on the business sector, the failure of legislation, demands of associations, and special employment centers. A common theme is employment as a measure of social integration and normalization, which is understood in a capitalist society to mean the ability to produce an economic value to make earnings for acquiring goods and services. The speeches obtained have been supplemented with relevant legislation and have been contrasted with available secondary data on employment.

  16. Assessing biosynthetic potential of agricultural groundwater through metagenomic sequencing: A diverse anammox community dominates nitrate-rich groundwater.

    Directory of Open Access Journals (Sweden)

    William B Ludington

    Full Text Available Climate change produces extremes in both temperature and precipitation causing increased drought severity and increased reliance on groundwater resources. Agricultural practices, which rely on groundwater, are sensitive to but also sources of contaminants, including nitrate. How agricultural contamination drives groundwater geochemistry through microbial metabolism is poorly understood.On an active cow dairy in the Central Valley of California, we sampled groundwater from three wells at depths of 4.3 m (two wells and 100 m (one well below ground surface (bgs as well as an effluent surface water lagoon that fertilizes surrounding corn fields. We analyzed the samples for concentrations of solutes, heavy metals, and USDA pathogenic bacteria of the Escherichia coli and Enterococcus groups as part of a long term groundwater monitoring study. Whole metagenome shotgun sequencing and assembly revealed taxonomic composition and metabolic potential of the community.Elevated nitrate and dissolved organic carbon occurred at 4.3m but not at 100m bgs. Metagenomics confirmed chemical observations and revealed several Planctomycete genomes, including a new Brocadiaceae lineage and a likely Planctomycetes OM190, as well novel diversity and high abundance of nano-prokaryotes from the Candidate Phyla Radiation (CPR, the Diapherotrites, Parvarchaeota, Aenigmarchaeota, Nanoarchaeota, Nanohaloarchaea (DPANN and the Thaumarchaeota, Aigarchaeota, Crenarchaeota, Korarchaeota (TACK superphyla. Pathway analysis suggests community interactions based on complimentary primary metabolic pathways and abundant secondary metabolite operons encoding antimicrobials and quorum sensing systems.The metagenomes show strong resemblance to activated sludge communities from a nitrogen removal reactor at a wastewater treatment plant, suggesting that natural bioremediation occurs through microbial metabolism. Elevated nitrate and rich secondary metabolite biosynthetic capacity suggest

  17. Genetic diversity analysis of cyanogenic potential (CNp) of root among improved genotypes of cassava using simple sequence repeat markers.

    Science.gov (United States)

    Moyib, O K; Mkumbira, J; Odunola, O A; Dixon, A G

    2012-12-01

    Cyanogenic potential (CNp) of cassava constitutes a serious problem for over 500 million people who rely on the crop as their main source of calories. Genetic diversity is a key to successful crop improvement for breeding new improved variability for target traits. Forty-three improved genotypes of cassava developed by International Institute of Tropical Agriculture (ITA), Ibadan, were characterized for CNp trait using 35 Simple Sequence.Repeat (SSR) markers. Essential colorimetry picric test was used for evaluation of CNp on a color scale of 1 to 14. The CNp scores obtained ranged from 3 to 9, with a mean score of 5.48 (+/- 0.09) based on Statistical Analysis System (SAS) package. TMS M98/ 0068 (4.0 +/- 0.25) was identified as the best genotype with low CNp while TMS M98/0028 (7.75 +/- 0.25) was the worst. The 43 genotypes were assigned into 7 phenotypic groups based on rank-sum analysis in SAS. Dissimilarity analysis representatives for windows generated a phylogenetic tree with 5 clusters which represented hybridizing groups. Each of the clusters (except 4) contained low CNp genotypes that could be used for improving the high CNp genotypes in the same or near cluster. The scatter plot of the genotypes showed that there was little or no demarcation for phenotypic CNp groupings in the molecular groupings. The result of this study demonstrated that SSR markers are powerful tools for the assessment of genetic variability, and proper identification and selection of parents for genetic improvement of low CNp trait among the IITA cassava collection.

  18. Intracellular diversity of the V4 and V9 regions of the 18S rRNA in marine protists (radiolarians) assessed by high-throughput sequencing.

    Science.gov (United States)

    Decelle, Johan; Romac, Sarah; Sasaki, Eriko; Not, Fabrice; Mahé, Frédéric

    2014-01-01

    Metabarcoding is a powerful tool for exploring microbial diversity in the environment, but its accurate interpretation is impeded by diverse technical (e.g. PCR and sequencing errors) and biological biases (e.g. intra-individual polymorphism) that remain poorly understood. To help interpret environmental metabarcoding datasets, we investigated the intracellular diversity of the V4 and V9 regions of the 18S rRNA gene from Acantharia and Nassellaria (radiolarians) using 454 pyrosequencing. Individual cells of radiolarians were isolated, and PCRs were performed with generalist primers to amplify the V4 and V9 regions. Different denoising procedures were employed to filter the pyrosequenced raw amplicons (Acacia, AmpliconNoise, Linkage method). For each of the six isolated cells, an average of 541 V4 and 562 V9 amplicons assigned to radiolarians were obtained, from which one numerically dominant sequence and several minor variants were found. At the 97% identity, a diversity metrics commonly used in environmental surveys, up to 5 distinct OTUs were detected in a single cell. However, most amplicons grouped within a single OTU whereas other OTUs contained very few amplicons. Different analytical methods provided evidence that most minor variants forming different OTUs correspond to PCR and sequencing artifacts. Duplicate PCR and sequencing from the same DNA extract of a single cell had only 9 to 16% of unique amplicons in common, and alignment visualization of V4 and V9 amplicons showed that most minor variants contained substitutions in highly-conserved regions. We conclude that intracellular variability of the 18S rRNA in radiolarians is very limited despite its multi-copy nature and the existence of multiple nuclei in these protists. Our study recommends some technical guidelines to conservatively discard artificial amplicons from metabarcoding datasets, and thus properly assess the diversity and richness of protists in the environment.

  19. Genetic diversity of the captive Asian tapir population in Thailand, based on mitochondrial control region sequence data and the comparison of its nucleotide structure with Brazilian tapir.

    Science.gov (United States)

    Muangkram, Yuttamol; Amano, Akira; Wajjwalku, Worawidh; Pinyopummintr, Tanu; Thongtip, Nikorn; Kaolim, Nongnid; Sukmak, Manakorn; Kamolnorranath, Sumate; Siriaroonrat, Boripat; Tipkantha, Wanlaya; Maikaew, Umaporn; Thomas, Warisara; Polsrila, Kanda; Dongsaard, Kwanreaun; Sanannu, Saowaphang; Wattananorrasate, Anuwat

    2017-07-01

    The Asian tapir (Tapirus indicus) has been classified as Endangered on the IUCN Red List of Threatened Species (2008). Genetic diversity data provide important information for the management of captive breeding and conservation of this species. We analyzed mitochondrial control region (CR) sequences from 37 captive Asian tapirs in Thailand. Multiple alignments of the full-length CR sequences sized 1268 bp comprised three domains as described in other mammal species. Analysis of 16 parsimony-informative variable sites revealed 11 haplotypes. Furthermore, the phylogenetic analysis using median-joining network clearly showed three clades correlated with our earlier cytochrome b gene study in this endangered species. The repetitive motif is located between first and second conserved sequence blocks, similar to the Brazilian tapir. The highest polymorphic site was located in the extended termination associated sequences domain. The results could be applied for future genetic management based in captivity and wild that shows stable populations.

  20. Revisiting the ichthyodiversity of Java and Bali through DNA barcodes: taxonomic coverage, identification accuracy, cryptic diversity and identification of exotic species.

    Science.gov (United States)

    Dahruddin, Hadi; Hutama, Aditya; Busson, Frédéric; Sauri, Sopian; Hanner, Robert; Keith, Philippe; Hadiaty, Renny; Hubert, Nicolas

    2017-03-01

    Among the 899 species of freshwater fishes reported from Sundaland biodiversity hotspot, nearly 50% are endemics. The functional integrity of aquatic ecosystems is currently jeopardized by human activities, and landscape conversion led to the decline of fish populations in several part of Sundaland, particularly in Java. The inventory of the Javanese ichthyofauna has been discontinuous, and the taxonomic knowledge is scattered in the literature. This study provides a DNA barcode reference library for the inland fishes of Java and Bali with the aim to streamline the inventory of fishes in this part of Sundaland. Owing to the lack of available checklist for estimating the taxonomic coverage of this study, a checklist was compiled based on online catalogues. A total of 95 sites were visited, and a library including 1046 DNA barcodes for 159 species was assembled. Nearest neighbour distance was 28-fold higher than maximum intraspecific distance on average, and a DNA barcoding gap was observed. The list of species with DNA barcodes displayed large discrepancies with the checklist compiled here as only 36% (i.e. 77 species) and 60% (i.e. 24 species) of the known species were sampled in Java and Bali, respectively. This result was contrasted by a high number of new occurrences and the ceiling of the accumulation curves for both species and genera. These results highlight the poor taxonomic knowledge of this ichthyofauna, and the apparent discrepancy between present and historical occurrence data is to be attributed to species extirpations, synonymy and misidentifications in previous studies. © 2016 John Wiley & Sons Ltd.

  1. DNA barcode sequencing from old type specimens as a tool in taxonomy: a case study in the diverse genus Eois (Lepidoptera: Geometridae.

    Directory of Open Access Journals (Sweden)

    Patrick Strutzenberger

    Full Text Available In this study we report on the sequencing of the COI barcode region from 96 historical specimens (92 type specimens +4 non-types of Eois. Eois is a diverse clade of tropical geometrid moths and is the target of a number of ongoing studies on life-histories, phylogeny, co-evolution with host plants or parasitoids, and diversity patterns across temporal and spatial dimensions. The unequivocal application of valid names is crucial for all aspects of biodiversity research as well as monitoring and conservation efforts. The availability of barcodes from historical type specimens has the potential to facilitate the much-needed acceleration of species description. We performed non-destructive DNA extraction on the abdomens of Eois specimens between 79 and 157 years of age. We used six primer combinations (recovering between 109 and 130 bp each to target the full-length barcode sequence of each specimen. We were able to obtain sequences for 91 of 96 specimens (success rate 94.8%. Sequence length ranged from 121 bp to full barcode sequences (658 bp, the average sequence length was ~500 bp. We detected a moderately strong and statistically significant negative correlation between specimen age and total sequence length, which is in agreement with expectations. The abdomen proved to be an exceedingly valuable source of DNA in old specimens of Lepidoptera. Barcode sequences obtained in this study are currently being used in an effort towards a step-wise taxonomic revision of Eois. We encourage that DNA barcodes obtained from types specimens should be included in all species descriptions and revisions whenever feasible.

  2. Genome-wide-analyses of Listeria monocytogenes from food-processing plants reveal clonal diversity and date the emergence of persisting sequence types.

    Science.gov (United States)

    Knudsen, Gitte M; Nielsen, Jesper Boye; Marvig, Rasmus L; Ng, Yin; Worning, Peder; Westh, Henrik; Gram, Lone

    2017-08-01

    Whole genome sequencing is increasing used in epidemiology, e.g. for tracing outbreaks of food-borne diseases. This requires in-depth understanding of pathogen emergence, persistence and genomic diversity along the food production chain including in food processing plants. We sequenced the genomes of 80 isolates of Listeria monocytogenes sampled from Danish food processing plants over a time-period of 20 years, and analysed the sequences together with 10 public available reference genomes to advance our understanding of interplant and intraplant genomic diversity of L. monocytogenes. Except for three persisting sequence types (ST) based on Multi Locus Sequence Typing being ST7, ST8 and ST121, long-term persistence of clonal groups was limited, and new clones were introduced continuously, potentially from raw materials. No particular gene could be linked to the persistence phenotype. Using time-based phylogenetic analyses of the persistent STs, we estimate the L. monocytogenes evolutionary rate to be 0.18-0.35 single nucleotide polymorphisms/year, suggesting that the persistent STs emerged approximately 100 years ago, which correlates with the onset of industrialization and globalization of the food market. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.

  3. Extensive 16S rRNA gene sequence diversity in Campylobacter hyointestinalis strains: taxonomic and applied implications

    DEFF Research Database (Denmark)

    Harrington, C.S.; On, Stephen L.W.

    1999-01-01

    Phylogenetic relationships of Campylobacter hyointestinalis subspecies were examined by means of 16S rRNA gene sequencing. Sequence similarities among C. hyointestinalis subsp. lawsonii strains exceeded 99.0 %, but values among C. hyointestinalis subsp. hyointestinalis strains ranged from 96...... of the genus Campylobacter, emphasizing the need for multiple strain analysis when using 16S rRNA gene sequence comparisons for taxonomic investigations........4 to 100 %. Sequence similarites between strains representing the two different subspecies ranged from 95.7 to 99.0 %. An intervening sequence was identified in certain of the C. hyointestinalis subsp. lawsonii strains. C. hyointestinalis strains occupied two distinct branches in a phylogenetic analysis...

  4. Diversity and Genome Analysis of Australian and Global Oilseed Brassica napus L. Germplasm Using Transcriptomics and Whole Genome Re-sequencing

    Directory of Open Access Journals (Sweden)

    M. Michelle Malmberg

    2018-04-01

    Full Text Available Intensive breeding of Brassica napus has resulted in relatively low diversity, such that B. napus would benefit from germplasm improvement schemes that sustain diversity. As such, samples representative of global germplasm pools need to be assessed for existing population structure, diversity and linkage disequilibrium (LD. Complexity reduction genotyping-by-sequencing (GBS methods, including GBS-transcriptomics (GBS-t, enable cost-effective screening of a large number of samples, while whole genome re-sequencing (WGR delivers the ability to generate large numbers of unbiased genomic single nucleotide polymorphisms (SNPs, and identify structural variants (SVs. Furthermore, the development of genomic tools based on whole genomes representative of global oilseed diversity and orientated by the reference genome has substantial industry relevance and will be highly beneficial for canola breeding. As recent studies have focused on European and Chinese varieties, a global diversity panel as well as a substantial number of Australian spring types were included in this study. Focusing on industry relevance, 633 varieties were initially genotyped using GBS-t to examine population structure using 61,037 SNPs. Subsequently, 149 samples representative of global diversity were selected for WGR and both data sets used for a side-by-side evaluation of diversity and LD. The WGR data was further used to develop genomic resources consisting of a list of 4,029,750 high-confidence SNPs annotated using SnpEff, and SVs in the form of 10,976 deletions and 2,556 insertions. These resources form the basis of a reliable and repeatable system allowing greater integration between canola genomics studies, with a strong focus on breeding germplasm and industry applicability.

  5. Location analysis for the estrogen receptor-α reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements

    Science.gov (United States)

    Mason, Christopher E.; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M.; Kallen, Roland G.; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B.

    2010-01-01

    Location analysis for estrogen receptor-α (ERα)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERα-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: ERE sequence. We demonstrate that ∼50% of all ERα-bound loci do not have a discernable ERE and show that most ERα-bound EREs are not perfect consensus EREs. Approximately one-third of all ERα-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERα-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERα binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers. PMID:20047966

  6. Location analysis for the estrogen receptor-alpha reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements.

    Science.gov (United States)

    Mason, Christopher E; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M; Kallen, Roland G; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B

    2010-04-01

    Location analysis for estrogen receptor-alpha (ERalpha)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERalpha-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: ERE sequence. We demonstrate that approximately 50% of all ERalpha-bound loci do not have a discernable ERE and show that most ERalpha-bound EREs are not perfect consensus EREs. Approximately one-third of all ERalpha-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERalpha-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERalpha binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers.

  7. Structural Conservation Despite Huge Sequence Diversity Allows EPCR Binding by the PfEMP1 Family Implicated in Severe Childhood Malaria

    DEFF Research Database (Denmark)

    Lau, Clinton K.Y.; Turner, Louise; Jespersen, Jakob S.

    2015-01-01

    with severe childhood malaria. We combine crystal structures of CIDRa1:EPCR complexes with analysis of 885 CIDRa1 sequences, showing that the EPCR-binding surfaces of CIDRa1 domains are conserved in shape and bonding potential, despite dramatic sequence diversity. Additionally, these domains mimic features...... of the natural EPCR ligand and can block this ligand interaction. Using peptides corresponding to the EPCR-binding region, antibodies can be purified from individuals in malaria-endemic regions that block EPCR binding of diverse CIDRa1 variants. This highlights the extent to which such a surface protein family......The PfEMP1 family of surface proteins is central for Plasmodium falciparum virulence and must retain the ability to bind to host receptors while also diversifying to aid immune evasion. The interaction between CIDRa1 domains of PfEMP1 and endothelial protein C receptor (EPCR) is associated...

  8. A comparison of parallel pyrosequencing and sanger clone-based sequencing and its impact on the characterization of the genetic diversity of HIV-1.

    Directory of Open Access Journals (Sweden)

    Binhua Liang

    Full Text Available BACKGROUND: Pyrosequencing technology has the potential to rapidly sequence HIV-1 viral quasispecies without requiring the traditional approach of cloning. In this study, we investigated the utility of ultra-deep pyrosequencing to characterize genetic diversity of the HIV-1 gag quasispecies and assessed the possible contribution of pyrosequencing technology in studying HIV-1 biology and evolution. METHODOLOGY/PRINCIPAL FINDINGS: HIV-1 gag gene was amplified from 96 patients using nested PCR. The PCR products were cloned and sequenced using capillary based Sanger fluorescent dideoxy termination sequencing. The same PCR products were also directly sequenced using the 454 pyrosequencing technology. The two sequencing methods were evaluated for their ability to characterize quasispecies variation, and to reveal sites under host immune pressure for their putative functional significance. A total of 14,034 variations were identified by 454 pyrosequencing versus 3,632 variations by Sanger clone-based (SCB sequencing. 11,050 of these variations were detected only by pyrosequencing. These undetected variations were located in the HIV-1 Gag region which is known to contain putative cytotoxic T lymphocyte (CTL and neutralizing antibody epitopes, and sites related to virus assembly and packaging. Analysis of the positively selected sites derived by the two sequencing methods identified several differences. All of them were located within the CTL epitope regions. CONCLUSIONS/SIGNIFICANCE: Ultra-deep pyrosequencing has proven to be a powerful tool for characterization of HIV-1 genetic diversity with enhanced sensitivity, efficiency, and accuracy. It also improved reliability of downstream evolutionary and functional analysis of HIV-1 quasispecies.

  9. Medicare Coverage Database

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Medicare Coverage Database (MCD) contains all National Coverage Determinations (NCDs) and Local Coverage Determinations (LCDs), local articles, and proposed NCD...

  10. Genetic Diversity and Phylogenetic Analysis of the Iranian Leishmania Parasites Based on HSP70 Gene PCR-RFLP and Sequence Analysis.

    Science.gov (United States)

    Nemati, Sara; Fazaeli, Asghar; Hajjaran, Homa; Khamesipour, Ali; Anbaran, Mohsen Falahati; Bozorgomid, Arezoo; Zarei, Fatah

    2017-08-01

    Despite the broad distribution of leishmaniasis among Iranians and animals across the country, little is known about the genetic characteristics of the causative agents. Applying both HSP70 PCR-RFLP and sequence analyses, this study aimed to evaluate the genetic diversity and phylogenetic relationships among Leishmania spp. isolated from Iranian endemic foci and available reference strains. A total of 36 Leishmania isolates from almost all districts across the country were genetically analyzed for the HSP70 gene using both PCR-RFLP and sequence analysis. The original HSP70 gene sequences were aligned along with homologous Leishmania sequences retrieved from NCBI, and subjected to the phylogenetic analysis. Basic parameters of genetic diversity were also estimated. The HSP70 PCR-RFLP presented 3 different electrophoretic patterns, with no further intraspecific variation, corresponding to 3 Leishmania species available in the country, L. tropica, L. major, and L. infantum. Phylogenetic analyses presented 5 major clades, corresponding to 5 species complexes. Iranian lineages, including L. major, L. tropica, and L. infantum, were distributed among 3 complexes L. major, L. tropica, and L. donovani. However, within the L. major and L. donovani species complexes, the HSP70 phylogeny was not able to distinguish clearly between the L. major and L. turanica isolates, and between the L. infantum, L. donovani, and L. chagasi isolates, respectively. Our results indicated that both HSP70 PCR-RFLP and sequence analyses are medically applicable tools for identification of Leishmania species in Iranian patients. However, the reduced genetic diversity of the target gene makes it inevitable that its phylogeny only resolves the major groups, namely, the species complexes.

  11. Rapid microsatellite marker development for African mahogany (Khaya senegalensis, Meliaceae) using next-generation sequencing and assessment of its intra-specific genetic diversity.

    Science.gov (United States)

    Karan, M; Evans, D S; Reilly, D; Schulte, K; Wright, C; Innes, D; Holton, T A; Nikles, D G; Dickinson, G R

    2012-03-01

    Khaya senegalensis (African mahogany or dry-zone mahogany) is a high-value hardwood timber species with great potential for forest plantations in northern Australia. The species is distributed across the sub-Saharan belt from Senegal to Sudan and Uganda. Because of heavy exploitation and constraints on natural regeneration and sustainable planting, it is now classified as a vulnerable species. Here, we describe the development of microsatellite markers for K. senegalensis using next-generation sequencing to assess its intra-specific diversity across its natural range, which is a key for successful breeding programs and effective conservation management of the species. Next-generation sequencing yielded 93,943 sequences with an average read length of 234 bp. The assembled sequences contained 1030 simple sequence repeats, with primers designed for 522 microsatellite loci. Twenty-one microsatellite loci were tested with 11 showing reliable amplification and polymorphism in K. senegalensis. The 11 novel microsatellites, together with one previously published, were used to assess 73 accessions belonging to the Australian K. senegalensis domestication program, sampled from across the natural range of the species. STRUCTURE analysis shows two major clusters, one comprising mainly accessions from west Africa (Senegal to Benin) and the second based in the far eastern limits of the range in Sudan and Uganda. Higher levels of genetic diversity were found in material from western Africa. This suggests that new seed collections from this region may yield more diverse genotypes than those originating from Sudan and Uganda in eastern Africa. © 2011 Blackwell Publishing Ltd.

  12. Analysis of genetic diversity and population structure of oil palm (Elaeis guineensis) from China and Malaysia based on species-specific simple sequence repeat markers.

    Science.gov (United States)

    Zhou, L X; Xiao, Y; Xia, W; Yang, Y D

    2015-12-08

    Genetic diversity and patterns of population structure of the 94 oil palm lines were investigated using species-specific simple sequence repeat (SSR) markers. We designed primers for 63 SSR loci based on their flanking sequences and conducted amplification in 94 oil palm DNA samples. The amplification result showed that a relatively high level of genetic diversity was observed between oil palm individuals according a set of 21 polymorphic microsatellite loci. The observed heterozygosity (Ho) was 0.3683 and 0.4035, with an average of 0.3859. The Ho value was a reliable determinant of the discriminatory power of the SSR primer combinations. The principal component analysis and unweighted pair-group method with arithmetic averaging cluster analysis showed the 94 oil palm lines were grouped into one cluster. These results demonstrated that the oil palm in Hainan Province of China and the germplasm introduced from Malaysia may be from the same source. The SSR protocol was effective and reliable for assessing the genetic diversity of oil palm. Knowledge of the genetic diversity and population structure will be crucial for establishing appropriate management stocks for this species.

  13. Using high-throughput sequencing to leverage surveillance of genetic diversity and oseltamivir resistance: a pilot study during the 2009 influenza A(H1N1 pandemic.

    Directory of Open Access Journals (Sweden)

    Juan Téllez-Sosa

    Full Text Available BACKGROUND: Influenza viruses display a high mutation rate and complex evolutionary patterns. Next-generation sequencing (NGS has been widely used for qualitative and semi-quantitative assessment of genetic diversity in complex biological samples. The "deep sequencing" approach, enabled by the enormous throughput of current NGS platforms, allows the identification of rare genetic viral variants in targeted genetic regions, but is usually limited to a small number of samples. METHODOLOGY AND PRINCIPAL FINDINGS: We designed a proof-of-principle study to test whether redistributing sequencing throughput from a high depth-small sample number towards a low depth-large sample number approach is feasible and contributes to influenza epidemiological surveillance. Using 454-Roche sequencing, we sequenced at a rather low depth, a 307 bp amplicon of the neuraminidase gene of the Influenza A(H1N1 pandemic (A(H1N1pdm virus from cDNA amplicons pooled in 48 barcoded libraries obtained from nasal swab samples of infected patients (n  =  299 taken from May to November, 2009 pandemic period in Mexico. This approach revealed that during the transition from the first (May-July to second wave (September-November of the pandemic, the initial genetic variants were replaced by the N248D mutation in the NA gene, and enabled the establishment of temporal and geographic associations with genetic diversity and the identification of mutations associated with oseltamivir resistance. CONCLUSIONS: NGS sequencing of a short amplicon from the NA gene at low sequencing depth allowed genetic screening of a large number of samples, providing insights to viral genetic diversity dynamics and the identification of genetic variants associated with oseltamivir resistance. Further research is needed to explain the observed replacement of the genetic variants seen during the second wave. As sequencing throughput rises and library multiplexing and automation improves, we foresee that

  14. Divide and conquer: enriching environmental sequencing data.

    Directory of Open Access Journals (Sweden)

    Anne Bergeron

    2007-09-01

    Full Text Available In environmental sequencing projects, a mix of DNA from a whole microbial community is fragmented and sequenced, with one of the possible goals being to reconstruct partial or complete genomes of members of the community. In communities with high diversity of species, a significant proportion of the sequences do not overlap any other fragment in the sample. This problem will arise not only in situations with a relatively even distribution of many species, but also when the community in a particular environment is routinely dominated by the same few species. In the former case, no genomes may be assembled at all, while in the latter case a few dominant species in an environment will always be sequenced at high coverage to the detriment of coverage of the greater number of sparse species.Here we show that, with the same global sequencing effort, separating the species into two or more sub-communities prior to sequencing can yield a much higher proportion of sequences that can be assembled. We first use the Lander-Waterman model to show that, if the expected percentage of singleton sequences is higher than 25%, then, under the uniform distribution hypothesis, splitting the community is always a wise choice. We then construct simulated microbial communities to show that the results hold for highly non-uniform distributions. We also show that, for the distributions considered in the experiments, it is possible to estimate quite accurately the relative diversity of the two sub-communities.Given the fact that several methods exist to split microbial communities based on physical properties such as size, density, surface biochemistry, or optical properties, we strongly suggest that groups involved in environmental sequencing, and expecting high diversity, consider splitting their communities in order to maximize the information content of their sequencing effort.

  15. Error correction and statistical analyses for intra-host comparisons of feline immunodeficiency virus diversity from high-throughput sequencing data.

    Science.gov (United States)

    Liu, Yang; Chiaromonte, Francesca; Ross, Howard; Malhotra, Raunaq; Elleder, Daniel; Poss, Mary

    2015-06-30

    Infection with feline immunodeficiency virus (FIV) causes an immunosuppressive disease whose consequences are less severe if cats are co-infected with an attenuated FIV strain (PLV). We use virus diversity measurements, which reflect replication ability and the virus response to various conditions, to test whether diversity of virulent FIV in lymphoid tissues is altered in the presence of PLV. Our data consisted of the 3' half of the FIV genome from three tissues of animals infected with FIV alone, or with FIV and PLV, sequenced by 454 technology. Since rare variants dominate virus populations, we had to carefully distinguish sequence variation from errors due to experimental protocols and sequencing. We considered an exponential-normal convolution model used for background correction of microarray data, and modified it to formulate an error correction approach for minor allele frequencies derived from high-throughput sequencing. Similar to accounting for over-dispersion in counts, this accounts for error-inflated variability in frequencies - and quite effectively reproduces empirically observed distributions. After obtaining error-corrected minor allele frequencies, we applied ANalysis Of VAriance (ANOVA) based on a linear mixed model and found that conserved sites and transition frequencies in FIV genes differ among tissues of dual and single infected cats. Furthermore, analysis of minor allele frequencies at individual FIV genome sites revealed 242 sites significantly affected by infection status (dual vs. single) or infection status by tissue interaction. All together, our results demonstrated a decrease in FIV diversity in bone marrow in the presence of PLV. Importantly, these effects were weakened or undetectable when error correction was performed with other approaches (thresholding of minor allele frequencies; probabilistic clustering of reads). We also queried the data for cytidine deaminase activity on the viral genome, which causes an asymmetric increase

  16. Location analysis for the estrogen receptor-? reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements

    OpenAIRE

    Mason, Christopher E.; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M.; Kallen, Roland G.; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B.

    2010-01-01

    Location analysis for estrogen receptor-? (ER?)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ER?-bound loci and quantify the incidence of ERE sequences under two stringencies of detection:

  17. Estimating DNA coverage and abundance in metagenomes using a gamma approximation

    Energy Technology Data Exchange (ETDEWEB)

    Hooper, Sean D; Dalevi, Daniel; Pati, Amrita; Mavromatis, Konstantinos; Ivanova, Natalia N; Kyrpides, Nikos C

    2010-01-01

    Shotgun sequencing generates large numbers of short DNA reads from either an isolated organism or, in the case of metagenomics projects, from the aggregate genome of a microbial community. These reads are then assembled based on overlapping sequences into larger, contiguous sequences (contigs). The feasibility of assembly and the coverage achieved (reads per nucleotide or distinct sequence of nucleotides) depend on several factors: the number of reads sequenced, the read length and the relative abundances of their source genomes in the microbial community. A low coverage suggests that most of the genomic DNA in the sample has not been sequenced, but it is often difficult to estimate either the extent of the uncaptured diversity or the amount of additional sequencing that would be most efficacious. In this work, we regard a metagenome as a population of DNA fragments (bins), each of which may be covered by one or more reads. We employ a gamma distribution to model this bin population due to its flexibility and ease of use. When a gamma approximation can be found that adequately fits the data, we may estimate the number of bins that were not sequenced and that could potentially be revealed by additional sequencing. We evaluated the performance of this model using simulated metagenomes and demonstrate its applicability on three recent metagenomic datasets.

  18. Lactococcus lactis Diversity in Undefined Mixed Dairy Starter Cultures as Revealed by Comparative Genome Analyses and Targeted Amplicon Sequencing of epsD.

    Science.gov (United States)

    Frantzen, Cyril A; Kleppen, Hans Petter; Holo, Helge

    2018-02-01

    Undefined mesophilic mixed (DL) starter cultures are used in the production of continental cheeses and contain unknown strain mixtures of Lactococcus lactis and leuconostocs. The choice of starter culture affects the taste, aroma, and quality of the final product. To gain insight into the diversity of Lactococcus lactis strains in starter cultures, we whole-genome sequenced 95 isolates from three different starter cultures. Pan-genomic analyses, which included 30 publically available complete genomes, grouped the strains into 21 L. lactis subsp . lactis and 28 L. lactis subsp. cremoris lineages. Only one of the 95 isolates grouped with previously sequenced strains, and the three starter cultures showed no overlap in lineage distributions. The culture diversity was assessed by targeted amplicon sequencing using purR , a core gene, and epsD , present in 93 of the 95 starter culture isolates but absent in most of the reference strains. This enabled an unprecedented discrimination of starter culture Lactococcus lactis and revealed substantial differences between the three starter cultures and compositional shifts during the cultivation of cultures in milk. IMPORTANCE In contemporary cheese production, standardized frozen seed stock starter cultures are used to ensure production stability, reproducibility, and quality control of the product. The dairy industry experiences significant disruptions of cheese production due to phage attacks, and one commonly used countermeasure to phage attack is to employ a starter rotation strategy, in which two or more starters with minimal overlap in phage sensitivity are used alternately. A culture-independent analysis of the lactococcal diversity in complex undefined starter cultures revealed large differences between the three starter cultures and temporal shifts in lactococcal composition during the production of bulk starters. A better understanding of the lactococcal diversity in starter cultures will enable the development of

  19. Genetic Diversity of Toxoplasma gondii Strains from Different Hosts and Geographical Regions by Sequence Analysis of GRA20 Gene.

    Science.gov (United States)

    Ning, Hong-Rui; Huang, Si-Yang; Wang, Jin-Lei; Xu, Qian-Ming; Zhu, Xing-Quan

    2015-06-01

    Toxoplasma gondii is a eukaryotic parasite of the phylum Apicomplexa, which infects all warm-blood animals, including humans. In the present study, we examined sequence variation in dense granule 20 (GRA20) genes among T. gondii isolates collected from different hosts and geographical regions worldwide. The complete GRA20 genes were amplified from 16 T. gondii isolates using PCR, sequence were analyzed, and phylogenetic reconstruction was analyzed by maximum parsimony (MP) and maximum likelihood (ML) methods. The results showed that the complete GRA20 gene sequence was 1,586 bp in length among all the isolates used in this study, and the sequence variations in nucleotides were 0-7.9% among all strains. However, removing the type III strains (CTG, VEG), the sequence variations became very low, only 0-0.7%. These results indicated that the GRA20 sequence in type III was more divergence. Phylogenetic analysis of GRA20 sequences using MP and ML methods can differentiate 2 major clonal lineage types (type I and type III) into their respective clusters, indicating the GRA20 gene may represent a novel genetic marker for intraspecific phylogenetic analyses of T. gondii.

  20. Ultra-deep sequencing reveals high prevalence and broad structural diversity of hepatitis B surface antigen mutations in a global population.

    Science.gov (United States)

    Gencay, Mikael; Hübner, Kirsten; Gohl, Peter; Seffner, Anja; Weizenegger, Michael; Neofytos, Dionysios; Batrla, Richard; Woeste, Andreas; Kim, Hyon-Suk; Westergaard, Gaston; Reinsch, Christine; Brill, Eva; Thu Thuy, Pham Thi; Hoang, Bui Huu; Sonderup, Mark; Spearman, C Wendy; Pabinger, Stephan; Gautier, Jérémie; Brancaccio, Giuseppina; Fasano, Massimo; Santantonio, Teresa; Gaeta, Giovanni B; Nauck, Markus; Kaminski, Wolfgang E

    2017-01-01

    The diversity of the hepatitis B surface antigen (HBsAg) has a significant impact on the performance of diagnostic screening tests and the clinical outcome of hepatitis B infection. Neutralizing or diagnostic antibodies against the HBsAg are directed towards its highly conserved major hydrophilic region (MHR), in particular towards its "a" determinant subdomain. Here, we explored, on a global scale, the genetic diversity of the HBsAg MHR in a large, multi-ethnic cohort of randomly selected subjects with HBV infection from four continents. A total of 1553 HBsAg positive blood samples of subjects originating from 20 different countries across Africa, America, Asia and central Europe were characterized for amino acid variation in the MHR. Using highly sensitive ultra-deep sequencing, we found 72.8% of the successfully sequenced subjects (n = 1391) demonstrated amino acid sequence variation in the HBsAg MHR. This indicates that the global variation frequency in the HBsAg MHR is threefold higher than previously reported. The majority of the amino acid mutations were found in the HBV genotypes B (28.9%) and C (25.4%). Collectively, we identified 345 distinct amino acid mutations in the MHR. Among these, we report 62 previously unknown mutations, which extends the worldwide pool of currently known HBsAg MHR mutations by 22%. Importantly, topological analysis identified the "a" determinant upstream flanking region as the structurally most diverse subdomain of the HBsAg MHR. The highest prevalence of "a" determinant region mutations was observed in subjects from Asia, followed by the African, American and European cohorts, respectively. Finally, we found that more than half (59.3%) of all HBV subjects investigated carried multiple MHR mutations. Together, this worldwide ultra-deep sequencing based genotyping study reveals that the global prevalence and structural complexity of variation in the hepatitis B surface antigen have, to date, been significantly underappreciated.

  1. Analysis of high-depth sequence data for studying viral diversity: a comparison of next generation sequencing platforms using Segminator II

    Directory of Open Access Journals (Sweden)

    Archer John

    2012-03-01

    Full Text Available Abstract Background Next generation sequencing provides detailed insight into the variation present within viral populations, introducing the possibility of treatment strategies that are both reactive and predictive. Current software tools, however, need to be scaled up to accommodate for high-depth viral data sets, which are often temporally or spatially linked. In addition, due to the development of novel sequencing platforms and chemistries, each with implicit strengths and weaknesses, it will be helpful for researchers to be able to routinely compare and combine data sets from different platforms/chemistries. In particular, error associated with a specific sequencing process must be quantified so that true biological variation may be identified. Results Segminator II was developed to allow for the efficient comparison of data sets derived from different sources. We demonstrate its usage by comparing large data sets from 12 influenza H1N1 samples sequenced on both the 454 Life Sciences and Illumina platforms, permitting quantification of platform error. For mismatches median error rates at 0.10 and 0.12%, respectively, suggested that both platforms performed similarly. For insertions and deletions median error rates within the 454 data (at 0.3 and 0.2%, respectively were significantly higher than those within the Illumina data (0.004 and 0.006%, respectively. In agreement with previous observations these higher rates were strongly associated with homopolymeric stretches on the 454 platform. Outside of such regions both platforms had similar indel error profiles. Additionally, we apply our software to the identification of low frequency variants. Conclusion We have demonstrated, using Segminator II, that it is possible to distinguish platform specific error from biological variation using data derived from two different platforms. We have used this approach to quantify the amount of error present within the 454 and Illumina platforms in

  2. Effect of different salt adaptation strategies on the microbial diversity, activity, and settling of nitrifying sludge in sequencing batch reactors

    NARCIS (Netherlands)

    Bassin, J.P.; Kleerebezem, R.; Muyzer, G.; Rosado, A.S.; Van Loosdrecht, M.C.M.; Dezotti, M.

    2011-01-01

    The effect of salinity on the activity of nitrifying bacteria, floc characteristics, and microbial community structure accessed by fluorescent in situ hybridization and polymerase chain reaction–denaturing gradient gel electrophoresis techniques was investigated. Two sequencing batch reactors (SRB1

  3. Analysis of human mitochondrial DNA sequences from fecally polluted environmental waters as a tool to study population diversity

    Science.gov (United States)

    Mitochondrial signature sequences have frequently been used to study the demographics of many different populations around the world. Traditionally, this requires obtaining samples directly from individuals which is cumbersome, time consuming and limited to the number of individu...

  4. Exploring origins, invasion history and genetic diversity of Imperata cylindrica (L.) P. Beauv. (Cogongrass) in the United States using genotyping by sequencing.

    Science.gov (United States)

    Burrell, A Millie; Pepper, Alan E; Hodnett, George; Goolsby, John A; Overholt, William A; Racelis, Alexis E; Diaz, Rodrigo; Klein, Patricia E

    2015-05-01

    Imperata cylindrica (Cogongrass, Speargrass) is a diploid C4 grass that is a noxious weed in 73 countries and constitutes a significant threat to global biodiversity and sustainable agriculture. We used a cost-effective genotyping-by-sequencing (GBS) approach to identify the reproductive system, genetic diversity and geographic origins of invasions in the south-eastern United States. In this work, we demonstrated the advantage of employing the closely related, fully sequenced crop species Sorghum bicolor (L.) Moench as a proxy reference genome to identify a set of 2320 informative single nucleotide and insertion-deletion polymorphisms. Genetic analyses identified four clonal lineages of cogongrass and one clonal lineage of Imperata brasiliensis Trin. in the United States. Each lineage was highly homogeneous, and we found no evidence of hybridization among the different lineages, despite geographical overlap. We found evidence that at least three of these lineages showed clonal reproduction prior to introduction to the United States. These results indicate that cogongrass has limited evolutionary potential to adapt to novel environments and further suggest that upon arrival to its invaded range, this species did not require local adaptation through hybridization/introgression or selection of favourable alleles from a broad genetic base. Thus, cogongrass presents a clear case of broad invasive success, across a diversity of environments, in a clonal organism with limited genetic diversity. © 2015 John Wiley & Sons Ltd.

  5. Lactobacillus strain diversity based on partial hsp60 gene sequences and design of PCR-restriction fragment length polymorphism assays for species identification and differentiation.

    Science.gov (United States)

    Blaiotta, Giuseppe; Fusco, Vincenzina; Ercolini, Danilo; Aponte, Maria; Pepe, Olimpia; Villani, Francesco

    2008-01-01

    A phylogenetic tree showing diversities among 116 partial (499-bp) Lactobacillus hsp60 (groEL, encoding a 60-kDa heat shock protein) nucleotide sequences was obtained and compared to those previously described for 16S rRNA and tuf gene sequences. The topology of the tree produced in this study showed a Lactobacillus species distribution similar, but not identical, to those previously reported. However, according to the most recent systematic studies, a clear differentiation of 43 single-species clusters was detected/identified among the sequences analyzed. The slightly higher variability of the hsp60 nucleotide sequences than of the 16S rRNA sequences offers better opportunities to design or develop molecular assays allowing identification and differentiation of either distant or very closely related Lactobacillus species. Therefore, our results suggest that hsp60 can be considered an excellent molecular marker for inferring the taxonomy and phylogeny of members of the genus Lactobacillus and that the chosen primers can be used in a simple PCR procedure allowing the direct sequencing of the hsp60 fragments. Moreover, in this study we performed a computer-aided restriction endonuclease analysis of all 499-bp hsp60 partial sequences and we showed that the PCR-restriction fragment length polymorphism (RFLP) patterns obtainable by using both endonucleases AluI and TacI (in separate reactions) can allow identification and differentiation of all 43 Lactobacillus species considered, with the exception of the pair L. plantarum/L. pentosus. However, the latter species can be differentiated by further analysis with Sau3AI or MseI. The hsp60 PCR-RFLP approach was efficiently applied to identify and to differentiate a total of 110 wild Lactobacillus strains (including closely related species, such as L. casei and L. rhamnosus or L. plantarum and L. pentosus) isolated from cheese and dry-fermented sausages.

  6. Fungal Diversity in Field Mold-Damaged Soybean Fruits and Pathogenicity Identification Based on High-Throughput rDNA Sequencing

    Directory of Open Access Journals (Sweden)

    Jiang Liu

    2017-05-01

    Full Text Available Continuous rain and an abnormally wet climate during harvest can easily lead to soybean plants being damaged by field mold (FM, which can reduce seed yield and quality. However, to date, the underlying pathogen and its resistance mechanism have remained unclear. The objective of the present study was to investigate the fungal diversity of various soybean varieties and to identify and confirm the FM pathogenic fungi. A total of 62,382 fungal ITS1 sequences clustered into 164 operational taxonomic units (OTUs with 97% sequence similarity; 69 taxa were recovered from the samples by internal transcribed spacer (ITS region sequencing. The fungal community compositions differed among the tested soybeans, with 42 OTUs being amplified from all varieties. The quadratic relationships between fungal diversity and organ-specific mildew indexes were analyzed, confirming that mildew on soybean pods can mitigate FM damage to the seeds. In addition, four potentially pathogenic fungi were isolated from FM-damaged soybean fruits; morphological and molecular identification confirmed these fungi as Aspergillus flavus, A. niger, Fusarium moniliforme, and Penicillium chrysogenum. Further re-inoculation experiments demonstrated that F. moniliforme is dominant among these FM pathogenic fungi. These results lay the foundation for future studies on mitigating or preventing FM damage to soybean.

  7. [The use of 16S rDNA sequencing in species diversity analysis for sputum of patients with ventilator-associated pneumonia].

    Science.gov (United States)

    Yang, Xiaojun; Wang, Xiaohong; Liang, Zhijuan; Zhang, Xiaoya; Wang, Yanbo; Wang, Zhenhai

    2014-05-01

    To study the species and amount of bacteria in sputum of patients with ventilator-associated pneumonia (VAP) by using 16S rDNA sequencing analysis, and to explore the new method for etiologic diagnosis of VAP. Bronchoalveolar lavage sputum samples were collected from 31 patients with VAP. Bacterial DNA of the samples were extracted and identified by polymerase chain reaction (PCR). At the same time, sputum specimens were processed for routine bacterial culture. The high flux sequencing experiment was conducted on PCR positive samples with 16S rDNA macro genome sequencing technology, and sequencing results were analyzed using bioinformatics, then the results between the sequencing and bacteria culture were compared. (1) 550 bp of specific DNA sequences were amplified in sputum specimens from 27 cases of the 31 patients with VAP, and they were used for sequencing analysis. 103 856 sequences were obtained from those sputum specimens using 16S rDNA sequencing, yielding approximately 39 Mb of raw data. Tag sequencing was able to inform genus level in all 27 samples. (2) Alpha-diversity analysis showed that sputum samples of patients with VAP had significantly higher variability and richness in bacterial species (Shannon index values 1.20, Simpson index values 0.48). Rarefaction curve analysis showed that there were more species that were not detected by sequencing from some VAP sputum samples. (3) Analysis of 27 sputum samples with VAP by using 16S rDNA sequences yielded four phyla: namely Acitinobacteria, Bacteroidetes, Firmicutes, Proteobacteria. With genus as a classification, it was found that the dominant species included Streptococcus 88.9% (24/27), Limnohabitans 77.8% (21/27), Acinetobacter 70.4% (19/27), Sphingomonas 63.0% (17/27), Prevotella 63.0% (17/27), Klebsiella 55.6% (15/27), Pseudomonas 55.6% (15/27), Aquabacterium 55.6% (15/27), and Corynebacterium 55.6% (15/27). (4) Pyrophosphate sequencing discovered that Prevotella, Limnohabitans, Aquabacterium

  8. Study of endophytic Xylariaceae in Thailand: diversity and taxonomy inferred from rDNA sequence analyses with saprobes forming fruit bodies in the field

    DEFF Research Database (Denmark)

    Okane, Izumi; Srikitikulchai, Prasert; Toyama, Kyoko

    2008-01-01

    to reveal the diversity and taxonomy of endophytes and the relationships between those endophytes and saprobic Xylariaceae in Thailand that have been recorded according to fruit-body formation on decayed plant materials. Analysis of 28S rDNA D1/D2 sequences revealed 21 xylariaceous species inhabiting......A study of the diversity, taxonomy, and ecology of endophytic Xylariaceae (Ascomycota) was carried out. In this study, we obtained isolates of Xylariaceae from healthy, attached leaves and teleomorphic stromata on decayed plant materials in a permanent plot at Khao Yai National Park (Thailand......). In addition, strains deposited beforehand were selected in which both endophytic strains isolated from living plant tissues and saprobic strains from fruit bodies were included. Consequently, 405 strains of Xylariaceae (273 endophytic and 132 saprobic strains, including identified strains) were studied...

  9. Ancient DNA sequences point to a large loss of mitochondrial genetic diversity in the saiga antelope (Saiga tatarica) since the Pleistocene

    DEFF Research Database (Denmark)

    Campos, Paula; Kristensen, Tommy; Orlando, Ludovic Antoine Alexandre

    2010-01-01

    of the Soviet Union, after which its populations were reduced by over 95%. We have analysed the mitochondrial control region sequence variation of 27 ancient and 38 modern specimens, to assay how the species' genetic diversity has changed since the Pleistocene. Phylogenetic analyses reveal the existence of two...... well-supported, and clearly distinct, clades of saiga. The first, spanning a time range from >49,500 (14) C ybp to the present, comprises all the modern specimens and ancient samples from the Northern Urals, Middle Urals and Northeast Yakutia. The second clade is exclusive to the Northern Urals...... and includes samples dating from between 40,400 to 10,250 (14) C ybp. Current genetic diversity is much lower than that present during the Pleistocene, an observation that data modelling using serial coalescent indicates cannot be explained by genetic drift in a population of constant size. Approximate...

  10. Analysis of genetic diversity of Sclerotinia sclerotiorum from eggplant by mycelial compatibility, random amplification of polymorphic DNA (RAPD and simple sequence repeat (SSR analyses

    Directory of Open Access Journals (Sweden)

    Fatih Mehmet Tok

    2016-09-01

    Full Text Available The genetic diversity and pathogenicity/virulence among 60 eggplant Sclerotinia sclerotiorum isolates collected from six different geographic regions of Turkey were analysed using mycelial compatibility groupings (MCGs, random amplified polymorphic DNA (RAPD and simple sequence repeat (SSR polymorphism. By MCG tests, the isolates were classified into 22 groups. Out of 22 MCGs, 36% were represented each by a single isolate. The isolates showed great variability for virulence regardless of MCG and geographic origin. Based on the results of RAPD and SSR analyses, 60 S. sclerotiorum isolates representing 22 MCGs were grouped in 2 and 3 distinct clusters, respectively. Analyses using RAPD and SSR markers illustrated that cluster groupings or genetic distance of S. sclerotiorum populations from eggplant were not distinctly relative to the MCG, geographical origin and virulence diversity. The patterns obtained revealed a high heterogeneity of genetic composition and suggested the occurrence of clonal and sexual reproduction of S. sclerotiorum on eggplant in the areas surveyed.

  11. Study of the Effect of SRT on Microbial Diversity in Laboratory-scale Sequencing Batch Reactors Using Acclimated and Non-Acclimated Seed

    KAUST Repository

    Tellez, Berenice

    2011-07-07

    Solids Retention Time (SRT) is an important design parameter in activated sludge wastewater treatment systems. In this study, the effect of SRT on the bacterial community structure and diversity was examined in replicate lab-scale activated sludge sequencing batch reactors were operated for a period of 8 weeks and seeded with acclimated or non-acclimated sludge. Four SBRs (acclimated) were set up as duplicates and operated at an SRT of 2 days, and another set of four SBRs (non-acclimated) were operated at an SRT of 10 days. To characterize the microbial community in the SBRs, 16S rRNA gene pyrosequencing was used to measure biodiversity and to assess the reproducibility and stability of the bacterial community structure in replicate reactors. Diversity results showed that SBRs operated at an SRT of 10 days are more diverse than SBRs operated at an SRT of 2 days. This suggests that engineering decision could enhance diversity in activated sludge systems. Cluster analysis based on phylogenetic information revealed that the bacterial community structure was not stable and replicated SBRs evolved differently.

  12. Fungal diversity in oxygen-depleted regions of the Arabian Sea revealed by targeted environmental sequencing combined with cultivation

    Digital Repository Service at National Institute of Oceanography (India)

    Manohar, C.S.; Raghukumar, C.; Behnke, A.; Stoeck, T.

    , Ollivier B, Skiena S, Taghavi S, Zak D & van der Lelie D (2008) Elevated atmospheric CO 2 affects soil microbial diversity associated with trembling aspen. Environ Microbiol 10 : 926–941. López-García P, Vereshchaka A & Moreira D (2007) Eukaryotic...

  13. High genetic diversity among strains of the unindustrialized lactic acid bacterium Carnobacterium maltaromaticum in dairy products as revealed by multilocus sequence typing.

    Science.gov (United States)

    Rahman, Abdur; Cailliez-Grimal, Catherine; Bontemps, Cyril; Payot, Sophie; Chaillou, Stéphane; Revol-Junelles, Anne-Marie; Borges, Frédéric

    2014-07-01

    Dairy products are colonized with three main classes of lactic acid bacteria (LAB): opportunistic bacteria, traditional starters, and industrial starters. Most of the population structure studies were previously performed with LAB species belonging to these three classes and give interesting knowledge about the population structure of LAB at the stage where they are already industrialized. However, these studies give little information about the population structure of LAB prior their use as an industrial starter. Carnobacterium maltaromaticum is a LAB colonizing diverse environments, including dairy products. Since this bacterium was discovered relatively recently, it is not yet commercialized as an industrial starter, which makes C. maltaromaticum an interesting model for the study of unindustrialized LAB population structure in dairy products. A multilocus sequence typing scheme based on an analysis of fragments of the genes dapE, ddlA, glpQ, ilvE, pyc, pyrE, and leuS was applied to a collection of 47 strains, including 28 strains isolated from dairy products. The scheme allowed detecting 36 sequence types with a discriminatory index of 0.98. The whole population was clustered in four deeply branched lineages, in which the dairy strains were spread. Moreover, the dairy strains could exhibit a high diversity within these lineages, leading to an overall dairy population with a diversity level as high as that of the nondairy population. These results are in agreement with the hypothesis according to which the industrialization of LAB leads to a diversity reduction in dairy products. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  14. Cloning and sequencing of wsp encoding gene fragments reveals a diversity of co-infecting Wolbachia strains in Acromyrmex leafcutter ants

    DEFF Research Database (Denmark)

    van Borm, S.; Wenseleers, T.; Billen, J.

    2003-01-01

    Acromyrmex insinuator hosted two additional infections. The multiple Wolbachia strains may influence the expression of reproductive conflicts in leafcutter ants, but the expected turnover of infections may make the cumulative effects on host ant reproduction complex. The additional Wolbachia infections......By sequencing part of the wsp gene of a series of clones, we detected an unusually high diversity of nine Wolbachia strains in queens of three species of leafcutter ants. Up to four strains co-occurred in a single ant. Most strains occurred in two clusters (InvA and InvB), but the social parasite...

  15. Binning metagenomic contigs by coverage and composition

    NARCIS (Netherlands)

    Alneberg, J.; Bjarnason, B.S.; Bruijn, de I.; Schirmer, M.; Quick, J.; Ijaz, U.Z.; Lahti, L.M.; Loman, N.J.; Andersson, A.F.; Quince, C.

    2014-01-01

    Shotgun sequencing enables the reconstruction of genomes from complex microbial communities, but because assembly does not reconstruct entire genomes, it is necessary to bin genome fragments. Here we present CONCOCT, a new algorithm that combines sequence composition and coverage across multiple

  16. Violation of an evolutionarily conserved immunoglobulin diversity gene sequence preference promotes production of dsDNA-specific IgG antibodies.

    Directory of Open Access Journals (Sweden)

    Aaron Silva-Sanchez

    Full Text Available Variability in the developing antibody repertoire is focused on the third complementarity determining region of the H chain (CDR-H3, which lies at the center of the antigen binding site where it often plays a decisive role in antigen binding. The power of VDJ recombination and N nucleotide addition has led to the common conception that the sequence of CDR-H3 is unrestricted in its variability and random in its composition. Under this view, the immune response is solely controlled by somatic positive and negative clonal selection mechanisms that act on individual B cells to promote production of protective antibodies and prevent the production of self-reactive antibodies. This concept of a repertoire of random antigen binding sites is inconsistent with the observation that diversity (DH gene segment sequence content by reading frame (RF is evolutionarily conserved, creating biases in the prevalence and distribution of individual amino acids in CDR-H3. For example, arginine, which is often found in the CDR-H3 of dsDNA binding autoantibodies, is under-represented in the commonly used DH RFs rearranged by deletion, but is a frequent component of rarely used inverted RF1 (iRF1, which is rearranged by inversion. To determine the effect of altering this germline bias in DH gene segment sequence on autoantibody production, we generated mice that by genetic manipulation are forced to utilize an iRF1 sequence encoding two arginines. Over a one year period we collected serial serum samples from these unimmunized, specific pathogen-free mice and found that more than one-fifth of them contained elevated levels of dsDNA-binding IgG, but not IgM; whereas mice with a wild type DH sequence did not. Thus, germline bias against the use of arginine enriched DH sequence helps to reduce the likelihood of producing self-reactive antibodies.

  17. Transcriptional Slippage and RNA Editing Increase the Diversity of Transcripts in Chloroplasts: Insight from Deep Sequencing of Vigna radiata Genome and Transcriptome.

    Directory of Open Access Journals (Sweden)

    Ching-Ping Lin

    Full Text Available We performed deep sequencing of the nuclear and organellar genomes of three mungbean genotypes: Vigna radiata ssp. sublobata TC1966, V. radiata var. radiata NM92 and the recombinant inbred line RIL59 derived from a cross between TC1966 and NM92. Moreover, we performed deep sequencing of the RIL59 transcriptome to investigate transcript variability. The mungbean chloroplast genome has a quadripartite structure including a pair of inverted repeats separated by two single copy regions. A total of 213 simple sequence repeats were identified in the chloroplast genomes of NM92 and RIL59; 78 single nucleotide variants and nine indels were discovered in comparing the chloroplast genomes of TC1966 and NM92. Analysis of the mungbean chloroplast transcriptome revealed mRNAs that were affected by transcriptional slippage and RNA editing. Transcriptional slippage frequency was positively correlated with the length of simple sequence repeats of the mungbean chloroplast genome (R2=0.9911. In total, 41 C-to-U editing sites were found in 23 chloroplast genes and in one intergenic spacer. No editing site that swapped U to C was found. A combination of bioinformatics and experimental methods revealed that the plastid-encoded RNA polymerase-transcribed genes psbF and ndhA are affected by transcriptional slippage in mungbean and in main lineages of land plants, including three dicots (Glycine max, Brassica rapa, and Nicotiana tabacum, two monocots (Oryza sativa and Zea mays, two gymnosperms (Pinus taeda and Ginkgo biloba and one moss (Physcomitrella patens. Transcript analysis of the rps2 gene showed that transcriptional slippage could affect transcripts at single sequence repeat regions with poly-A runs. It showed that transcriptional slippage together with incomplete RNA editing may cause sequence diversity of transcripts in chloroplasts of land plants.

  18. Keragaman Genetik Sekuen Gen ATP Synthase FO Subunit 6 (ATP6 Monyet Hantu (Tarsius Indonesia (GENETIC DIVERSITY STUDY OF ATP6 GENE SEQUENCES OF TARSIERS FROM INDONESIA

    Directory of Open Access Journals (Sweden)

    Rini Widayanti

    2013-07-01

    Full Text Available In a conservation effort, the identification of Tarsier species, on the bases of the morphological andmolecular characteristic is necessary. Up to now, the identification of the animals were based on themorphology and vocalizations, which is extremely difficult to identify each, tarsier species. The objective ofthis research was to study the genetic diversity on ATP6 gene of Tarsius sp. Based on sequencing of PCRproduct using primer ATP6F and ATP6R with 681 nts. PCR product. The sequence of ATP6 fragmentswere aligned with other primates from Gene bank with aid of software Clustal W, and were analyzed usingMEGA program version 4.0. Three different nucleotide sites were found (nucleotide no. 288, 321 and 367.The genetic distance based on nucleotide ATP6 sequence calculated using Kimura 2-parameter modelindicated that the smallest genetic distance 0%, biggest 0.8% and average 0, 2%. The phylogenetic treeusing neighbor joining method based on the sequence of nucleotide ATP6 gene could not be used todifferentiate among T. Dianae (from Central Sulawesi, T. Spectrum (from North Sulawesi, T. bancanus(from lampung, South Sumatera and T.bancanus from West Kalimantan.

  19. Deep-sequencing to resolve complex diversity of apicomplexan parasites in platypuses and echidnas: Proof of principle for wildlife disease investigation.

    Science.gov (United States)

    Šlapeta, Jan; Saverimuttu, Stefan; Vogelnest, Larry; Sangster, Cheryl; Hulst, Frances; Rose, Karrie; Thompson, Paul; Whittington, Richard

    2017-11-01

    The short-beaked echidna (Tachyglossus aculeatus) and the platypus (Ornithorhynchus anatinus) are iconic egg-laying monotremes (Mammalia: Monotremata) from Australasia. The aim of this study was to demonstrate the utility of diversity profiles in disease investigations of monotremes. Using small subunit (18S) rDNA amplicon deep-sequencing we demonstrated the presence of apicomplexan parasites and confirmed by direct and cloned amplicon gene sequencing Theileria ornithorhynchi, Theileria tachyglossi, Eimeria echidnae and Cryptosporidium fayeri. Using a combination of samples from healthy and diseased animals, we show a close evolutionary relationship between species of coccidia (Eimeria) and piroplasms (Theileria) from the echidna and platypus. The presence of E. echidnae was demonstrated in faeces and tissues affected by disseminated coccidiosis. Moreover, the presence of E. echidnae DNA in the blood of echidnas was associated with atoxoplasma-like stages in white blood cells, suggesting Hepatozoon tachyglossi blood stages are disseminated E. echidnae stages. These next-generation DNA sequencing technologies are suited to material and organisms that have not been previously characterised and for which the material is scarce. The deep sequencing approach supports traditional diagnostic methods, including microscopy, clinical pathology and histopathology, to better define the status quo. This approach is particularly suitable for wildlife disease investigation. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. Characterization of genomic sequence showing strong association with polyembryony among diverse Citrus species and cultivars, and its synteny with Vitis and Populus.

    Science.gov (United States)

    Nakano, Michiharu; Shimada, Takehiko; Endo, Tomoko; Fujii, Hiroshi; Nesumi, Hirohisa; Kita, Masayuki; Ebina, Masumi; Shimizu, Tokurou; Omura, Mitsuo

    2012-02-01

    Polyembryony, in which multiple somatic nucellar cell-derived embryos develop in addition to the zygotic embryo in a seed, is common in the genus Citrus. Previous genetic studies indicated polyembryony is mainly determined by a single locus, but the underlying molecular mechanism is still unclear. As a step towards identification and characterization of the gene or genes responsible for nucellar embryogenesis in Citrus, haplotype-specific physical maps around the polyembryony locus were constructed. By sequencing three BAC clones aligned on the polyembryony haplotype, a single contiguous draft sequence consisting of 380 kb containing 70 predicted open reading frames (ORFs) was reconstructed. Single nucleotide polymorphism genotypes detected in the sequenced genomic region showed strong association with embryo type in Citrus, indicating a common polyembryony locus is shared among widely diverse Citrus cultivars and species. The arrangement of the predicted ORFs in the characterized genomic region showed high collinearity to the genomic sequence of chromosome 4 of Vitis vinifera and linkage group VI of Populus trichocarpa, suggesting that the syntenic relationship among these species is conserved even though V. vinifera and P. trichocarpa are non-apomictic species. This is the first study to characterize in detail the genomic structure of an apomixis locus determining adventitious embryony. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.

  1. Complete ecological isolation and cryptic diversity in Polynucleobacter bacteria not resolved by 16S rRNA gene sequences.

    Czech Academy of Sciences Publication Activity Database

    Hahn, M.W.; Jezberová, Jitka; Koll, U.; Saueressig-Beck, T.; Schmidt, J.

    2016-01-01

    Roč. 10, č. 7 (2016), s. 1642-1655 ISSN 1751-7362 Institutional support: RVO:60077344 Keywords : fresh-water habitats * microbial species delineation * alteromonas-macleodii * community structure * genome sequence * organic-matter Subject RIV: EE - Microbiology, Virology Impact factor: 9.664, year: 2016

  2. Sequence diversity of the leukotoxin (lktA) gene in caprine and ovine strains of Mannheimia haemolytica.

    Science.gov (United States)

    Vougidou, C; Sandalakis, V; Psaroulaki, A; Petridou, E; Ekateriniadou, L

    2013-04-20

    Mannheimia haemolytica is the aetiological agent of pneumonic pasteurellosis in small ruminants. The primary virulence factor of the bacterium is a leukotoxin (LktA), which induces apoptosis in susceptible cells via mitochondrial targeting. It has been previously shown that certain lktA alleles are associated either with cattle or sheep. The objective of the present study was to investigate lktA sequence variation among ovine and caprine M haemolytica strains isolated from pneumonic lungs, revealing any potential adaptation for the caprine host, for which there is no available data. Furthermore, we investigated amino acid variation in the N-terminal part of the sequences and its effect on targeting mitochondria. Data analysis showed that the prevalent caprine genotype differed at a single non-synonymous site from a previously described uncommon bovine allele, whereas the ovine sequences represented new, distinct alleles. N-terminal sequence differences did not affect the mitochondrial targeting ability of the isolates; interestingly enough in one case, mitochondrial matrix targeting was indicated rather than membrane association, suggesting an alternative LktA trafficking pattern.

  3. Seasonal Distribution and Diversity of Ground Arthropods in Microhabitats Following a Shrub Plantation Age Sequence in Desertified Steppe

    Science.gov (United States)

    Liu, Rentao; Zhu, Fan; Song, Naiping; Yang, Xinguo; Chai, Yongqing

    2013-01-01

    In desertified regions, shrub-dominated patches are important microhabitats for ground arthropod assemblages. As shrub age increases, soil, vegetation and microbiological properties can change remarkably and spontaneously across seasons. However, relatively few studies have analyzed how ground arthropods respond to the microhabitats created by shrubs of different plantation ages across seasons. Using 6, 15, 24 and 36 year-old plantations of re-vegetated shrubs (Caragana koushinskii) in the desert steppe of northwestern China as a model system, we sampled ground arthropod communities using a pitfall trapping method in the microhabitats under shrubs and in the open areas between shrubs, during the spring, summer and autumn. The total ground arthropod assemblage was dominated by Carabidae, Melolonthidae, Curculionidae, Tenebrionidae and Formicidae that were affected by plantation age, seasonal changes, or the interaction between these factors, with the later two groups also influenced by microhabitat. Overall, a facilitative effect was observed, with more arthropods and a greater diversity found under shrubs as compared to open areas, but this was markedly affected by seasonal changes. There was a high degree of similarity in arthropod assemblages and diversity between microhabitats in summer and autumn. Shrub plantation age significantly influenced the distribution of the most abundant groups, and also the diversity indices of the ground arthropods. However, there was not an overall positive relationship between shrub age and arthropod abundance, richness or diversity index. The influence of plantation age on arthropod communities was also affected by seasonal changes. From spring through summer to autumn, community indices of ground arthropods tended to decline, and a high degree of similarity in these indices (with fluctuation) was observed among different ages of shrub plantation in autumn. Altogether the recovery of arthropod communities was markedly affected by

  4. Genetic Diversity of Cowpea (Vigna unguiculata (L.) Walp.) Accession in Kenya Gene Bank Based on Simple Sequence Repeat Markers.

    Science.gov (United States)

    Wamalwa, Emily N; Muoma, John; Wekesa, Clabe

    2016-01-01

    Increased agricultural production is an urgent issue. Projected global population is 9 million people by mid of this century. Estimation projects death of 1 million people for lack of food quality (micronutrient deficit) and quantity (protein deficit). Majority of these people will be living in developing countries. Other global challenges include shrinking cultivable lands, salinity, and flooding due to climate changes, new emerging pathogens, and pests. These affect crop production. Furthermore, they are major threats to crop genetic resources and food security. Genetic diversity in cultivated crops indicates gene pool richness. It is the greatest resource for plant breeders to select lines that enhance food security. This study was conducted by Masinde Muliro University to evaluate genetic diversity in 19 cowpea accessions from Kenya national gene bank. Accessions clustered into two major groups. High divergence was observed between accessions from Ethiopia and Australia and those from Western Kenya. Upper Volta accessions were closely related to those from Western Kenya. Low variation was observed between accessions from Eastern and Rift Valley than those from Western and Coastal regions of Kenya. Diversity obtained in this study can further be exploited for the improvement of cowpea in Kenya as a measure of food security.

  5. Genetic Diversity of Cowpea (Vigna unguiculata (L. Walp. Accession in Kenya Gene Bank Based on Simple Sequence Repeat Markers

    Directory of Open Access Journals (Sweden)

    Emily N. Wamalwa

    2016-01-01

    Full Text Available Increased agricultural production is an urgent issue. Projected global population is 9 million people by mid of this century. Estimation projects death of 1 million people for lack of food quality (micronutrient deficit and quantity (protein deficit. Majority of these people will be living in developing countries. Other global challenges include shrinking cultivable lands, salinity, and flooding due to climate changes, new emerging pathogens, and pests. These affect crop production. Furthermore, they are major threats to crop genetic resources and food security. Genetic diversity in cultivated crops indicates gene pool richness. It is the greatest resource for plant breeders to select lines that enhance food security. This study was conducted by Masinde Muliro University to evaluate genetic diversity in 19 cowpea accessions from Kenya national gene bank. Accessions clustered into two major groups. High divergence was observed between accessions from Ethiopia and Australia and those from Western Kenya. Upper Volta accessions were closely related to those from Western Kenya. Low variation was observed between accessions from Eastern and Rift Valley than those from Western and Coastal regions of Kenya. Diversity obtained in this study can further be exploited for the improvement of cowpea in Kenya as a measure of food security.

  6. Genetic diversity and population structure of Eleutheronema rhadinum in the East and South China Seas revealed in mitochondrial COI sequences

    Science.gov (United States)

    Sun, Xinxu; Xu, Dongdong; Lou, Bao; Zhang, Tao; Xin, Jian; Guo, Yaoshi; Ma, Shilei

    2013-11-01

    Eleutheronema rhadinum is a potential commercial fisheries species and is subject to intense exploitation in China. Knowledge on the population structure of E. rhadinum in Chinese coastal waters, which is important for sustainable exploitation and proper resource management, is lacking. In the present study, the genetic diversity and population structure of E. rhadinum were evaluated using a 564-base pair fragment of the mitochondrial cytochrome c oxidase subunit I (COI) gene. A total of 76 specimens were collected from three localities around the East (Qidong and Zhoushan) and South China Seas (Zhuhai). Among these individuals, nine polymorphic sites were detected and 11 distinct haplotypes were defined. High levels of haplotype diversity ( h =0.759±0.035) and low levels of nucleotide diversity ( π= 0.001 98±0.003 26) were observed in these populations. Hierarchical analysis of molecular variance (AMOVA) indicated that 96.72% of the genetic variation occurred within the populations, whereas 3.28% occurred among populations. No significant genealogical branches or clusters were recognized on the neighbor-joining tree. Intra-group variation among populations was significant ( φ st=0.032 85, PTests of neutral evolution and mismatch distribution suggest that E. rhadinum may have experienced a population expansion. The present study provides basic information for the conservation and sustainable exploitation of this species.

  7. A next generation sequencing approach with a suitable bioinformatics workflow to study fungal diversity in bioaerosols released from two different types of composting plants.

    Science.gov (United States)

    Mbareche, Hamza; Veillette, Marc; Bonifait, Laetitia; Dubuis, Marie-Eve; Benard, Yves; Marchand, Geneviève; Bilodeau, Guillaume J; Duchaine, Caroline

    2017-12-01

    Composting is used all over the world to transform different types of organic matter through the actions of complex microbial communities. Moving and handling composting material may lead to the emission of high concentrations of bioaerosols. High exposure levels are associated with adverse health effects among compost industry workers. Fungal spores are suspected to play a role in many respiratory illnesses. There is a paucity of information related to the detailed fungal diversity in compost as well as in the aerosols emitted through composting activities. The aim of this study was to analyze the fungal diversity of both organic matter and aerosols present in facilities that process domestic compost and facilities that process pig carcasses. This was accomplished using a next generation sequencing approach that targets the ITS1 genomic region. Multivariate analyses revealed differences in the fungal community present in samples coming from compost treating both raw materials. Furthermore, results show that the compost type affects the fungal diversity of aerosols emitted. Although 8 classes were evenly distributed in all samples, Eurotiomycetes were more dominant in carcass compost while Sordariomycetes were dominant in domestic compost. A large diversity profile was observed in bioaerosols from both compost types showing the presence of a number of pathogenic fungi newly identified in bioaerosols emitted from composting plants. Members of the family Herpotrichiellaceae and Gymnoascaceae which have been shown to cause human diseases were detected in compost and air samples. Moreover, some fungi were identified in higher proportion in air compared to compost. This is the first study to identify a high level of fungal diversity in bioaerosols present in composting plants suggesting a potential exposure risk for workers. This study suggests the need for creating guidelines that address human exposure to bioaerosols. The implementation of technical and organizational

  8. Assessment of genetic diversity in Vigna unguiculata L. (Walp) accessions using inter-simple sequence repeat (ISSR) and start codon targeted (SCoT) polymorphic markers.

    Science.gov (United States)

    Igwe, David Okeh; Afiukwa, Celestine Azubike; Ubi, Benjamin Ewa; Ogbu, Kenneth Idika; Ojuederie, Omena Bernard; Ude, George Nkem

    2017-11-17

    Assessment of genetic diversity of Vigna unguiculata (L.) Walp (cowpea) accessions using informative molecular markers is imperative for their genetic improvement and conservation. Use of efficacious molecular markers to obtain the required knowledge of the genetic diversity within the local and regional germplasm collections can enhance the overall effectiveness of cowpea improvement programs, hence, the comparative assessment of Inter-simple sequence repeat (ISSR) and Start codon targeted (SCoT) markers in genetic diversity of V. unguiculata accessions from different regions in Nigeria. Comparative analysis of the genetic diversity of eighteen accessions from different locations in Nigeria was investigated using ISSR and SCoT markers. DNA extraction was done using Zymogen Kit according to its manufacturer's instructions followed by amplifications with ISSR and SCoT and agarose gel electrophoresis. The reproducible bands were scored for analyses of dendrograms, principal component analysis, genetic diversity, allele frequency, polymorphic information content, and population structure. Both ISSR and SCoT markers resolved the accessions into five major clusters based on dendrogram and principal component analyses. Alleles of 32 and 52 were obtained with ISSR and SCoT, respectively. Numbers of alleles, gene diversity and polymorphic information content detected with ISSR were 9.4000, 0.7358 and 0.7192, while SCoT yielded 11.1667, 0.8158 and 0.8009, respectively. Polymorphic loci were 70 and 80 in ISSR and SCoT, respectively. Both markers produced high polymorphism (94.44-100%). The ranges of effective number of alleles (Ne) were 1.2887 ± 0.1797-1.7831 ± 0.2944 and 1.7416 ± 0.0776-1.9181 ± 0.2426 in ISSR and SCoT, respectively. The Nei's genetic diversity (H) ranged from 0.2112 ± 0.0600-0.4335 ± 0.1371 and 0.4111 ± 0.0226-0.4778 ± 0.1168 in ISSR and SCoT, respectively. Shannon's information index (I) from ISSR and SCoT were 0

  9. Sequence diversity, cytotoxicity and antigenic similarities of the leukotoxin of isolates of Mannheimia species from mastitis in domestic sheep.

    Science.gov (United States)

    Omaleki, Lida; Browning, Glenn F; Barber, Stuart R; Allen, Joanne L; Srikumaran, Subramaniam; Markham, Philip F

    2014-11-07

    Species within the genus Mannheimia are among the most important causes of ovine mastitis. Isolates of these species can express leukotoxin A (LktA), a primary virulence factor of these bacteria. To examine the significance of variation in the LktA, the sequences of the lktA genes in a panel of isolates from cases of ovine mastitis were compared. The cross-neutralising capacities of rat antisera raised against LktA of one Mannheimia glucosida, one haemolytic Mannheimia ruminalis, and two Mannheimia haemolytica isolates were also examined to assess the effect that variation in the lktA gene can have on protective immunity against leukotoxins with differing sequences. The lktA nucleotide distance between the M. haemolytica isolates was greater than between the M. glucosida isolates, with the M. haemolytica isolates divisible into two groups based on their lktA sequences. Comparison of the topology of phylogenetic trees of 16S rDNA and lktA sequences revealed differences in the relationships between some isolates, suggesting horizontal gene transfer. Cross neutralisation data obtained with monospecific anti-LktA rat sera were used to derive antigenic similarity coefficients for LktA from the four Mannheimia species isolates. Similarity coefficients indicated that LktA of the two M. haemolytica isolates were least similar, while LktA from M. glucosida was most similar to those for one of the M. haemolytica isolates and the haemolytic M. ruminalis isolate. The results suggested that vaccination with the M. glucosida leukotoxin would generate the greatest cross-protection against ovine mastitis caused by Mannheimia species with these alleles. Copyright © 2014 Elsevier B.V. All rights reserved.

  10. Functional anthology of intrinsic disorder. 2. Cellular components, domains, technical terms, developmental processes, and coding sequence diversities correlated with long disordered regions.

    Science.gov (United States)

    Vucetic, Slobodan; Xie, Hongbo; Iakoucheva, Lilia M; Oldfield, Christopher J; Dunker, A Keith; Obradovic, Zoran; Uversky, Vladimir N

    2007-05-01

    Biologically active proteins without stable ordered structure (i.e., intrinsically disordered proteins) are attracting increased attention. Functional repertoires of ordered and disordered proteins are very different, and the ability to differentiate whether a given function is associated with intrinsic disorder or with a well-folded protein is crucial for modern protein science. However, there is a large gap between the number of proteins experimentally confirmed to be disordered and their actual number in nature. As a result, studies of functional properties of confirmed disordered proteins, while helpful in revealing the functional diversity of protein disorder, provide only a limited view. To overcome this problem, a bioinformatics approach for comprehensive study of functional roles of protein disorder was proposed in the first paper of this series (Xie, H.; Vucetic, S.; Iakoucheva, L. M.; Oldfield, C. J.; Dunker, A. K.; Obradovic, Z.; Uversky, V. N. Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. J. Proteome Res. 2007, 5, 1882-1898). Applying this novel approach to Swiss-Prot sequences and functional keywords, we found over 238 and 302 keywords to be strongly positively or negatively correlated, respectively, with long intrinsically disordered regions. This paper describes approximately 90 Swiss-Prot keywords attributed to the cellular components, domains, technical terms, developmental processes, and coding sequence diversities possessing strong positive and negative correlation with long disordered regions.

  11. Functional Anthology of Intrinsic Disorder. II. Cellular Components, Domains, Technical Terms, Developmental Processes and Coding Sequence Diversities Correlated with Long Disordered Regions

    Science.gov (United States)

    Vucetic, Slobodan; Xie, Hongbo; Iakoucheva, Lilia M.; Oldfield, Christopher J.; Dunker, A. Keith; Obradovic, Zoran; Uversky, Vladimir N.

    2008-01-01

    Biologically active proteins without stable ordered structure (i.e., intrinsically disordered proteins) are attracting increased attention. Functional repertoires of ordered and disordered proteins are very different, and the ability to differentiate whether a given function is associated with intrinsic disorder or with a well-folded protein is crucial for modern protein science. However, there is a large gap between the number of proteins experimentally confirmed to be disordered and their actual number in nature. As a result, studies of functional properties of confirmed disordered proteins, while helpful in revealing the functional diversity of protein disorder, provide only a limited view. To overcome this problem, a bioinformatics approach for comprehensive study of functional roles of protein disorder was proposed in the first paper of this series (Xie H., Vucetic S., Iakoucheva L.M., Oldfield C.J., Dunker A.K., Obradovic Z., Uversky V.N. (2006) Functional anthology of intrinsic disorder. I. Biological processes and functions of proteins with long disordered regions. J. Proteome Res.). Applying this novel approach to Swiss-Prot sequences and functional keywords, we found over 238 and 302 keywords to be strongly positively or negatively correlated, respectively, with long intrinsically disordered regions. This paper describes ~90 Swiss-Prot keywords attributed to the cellular components, domains, technical terms, developmental processes and coding sequence diversities possessing strong positive and negative correlation with long disordered regions. PMID:17391015

  12. Women's Health Insurance Coverage

    Science.gov (United States)

    ... Women's Health Policy Women’s Health Insurance Coverage Women’s Health Insurance Coverage Published: Oct 31, 2017 Facebook Twitter LinkedIn ... that many women continue to face. Sources of Health Insurance Coverage Employer-Sponsored Insurance: Approximately 57.9 million ...

  13. Characterization of the Genetic Diversity of Acid Lime (Citrus aurantifolia (Christm.) Swingle) Cultivars of Eastern Nepal Using Inter-Simple Sequence Repeat Markers.

    Science.gov (United States)

    Munankarmi, Nabin Narayan; Rana, Neesha; Bhattarai, Tribikram; Shrestha, Ram Lal; Joshi, Bal Krishna; Baral, Bikash; Shrestha, Sangita

    2018-06-12

    Acid lime ( Citrus aurantifolia (Christm.) Swingle) is an important fruit crop, which has high commercial value and is cultivated in 60 out of the 77 districts representing all geographical landscapes of Nepal. A lack of improved high-yielding varieties, infestation with various diseases, and pests, as well as poor management practices might have contributed to its extremely reduced productivity, which necessitates a reliable understanding of genetic diversity in existing cultivars. Hereby, we aim to characterize the genetic diversity of acid lime cultivars cultivated at three different agro-ecological gradients of eastern Nepal, employing PCR-based inter-simple sequence repeat (ISSR) markers. Altogether, 21 polymorphic ISSR markers were used to assess the genetic diversity in 60 acid lime cultivars sampled from different geographical locations. Analysis of binary data matrix was performed on the basis of bands obtained, and principal coordinate analysis and phenogram construction were performed using different computer algorithms. ISSR profiling yielded 234 amplicons, of which 87.18% were polymorphic. The number of amplified fragments ranged from 7⁻18, with amplicon size ranging from ca. 250⁻3200 bp. The Numerical Taxonomy and Multivariate System (NTSYS)-based cluster analysis using the unweighted pair group method of arithmetic averages (UPGMA) algorithm and Dice similarity coefficient separated 60 cultivars into two major and three minor clusters. Genetic diversity analysis using Popgene ver. 1.32 revealed the highest percentage of polymorphic bands (PPB), Nei’s genetic diversity (H), and Shannon’s information index (I) for the Terai zone (PPB = 69.66%; H = 0.215; I = 0.325), and the lowest of all three for the high hill zone (PPB = 55.13%; H = 0.173; I = 0.262). Thus, our data indicate that the ISSR marker has been successfully employed for evaluating the genetic diversity of Nepalese acid lime cultivars and has furnished valuable information on

  14. Leadership and Diversity

    Science.gov (United States)

    Coleman, Marianne

    2012-01-01

    As part of the special edition recognizing the 40th anniversary of "Educational Management Administration & Leadership" this article reviews the coverage of leadership and diversity issues in the journal. The majority of articles concerning diversity have focused on gender, with attention turning to the wider concept of diversity since the year…

  15. Genetic Diversity of Pinus nigra Arn. Populations in Southern Spain and Northern Morocco Revealed By Inter-Simple Sequence Repeat Profiles

    Directory of Open Access Journals (Sweden)

    Oussama Ahrazem

    2012-05-01

    Full Text Available Eight Pinus nigra Arn. populations from Southern Spain and Northern Morocco were examined using inter-simple sequence repeat markers to characterize the genetic variability amongst populations. Pair-wise population genetic distance ranged from 0.031 to 0.283, with a mean of 0.150 between populations. The highest inter-population average distance was between PaCU from Cuenca and YeCA from Cazorla, while the lowest distance was between TaMO from Morocco and MA Sierra Mágina populations. Analysis of molecular variance (AMOVA and Nei’s genetic diversity analyses revealed higher genetic variation within the same population than among different populations. Genetic differentiation (Gst was 0.233. Cuenca showed the highest Nei’s genetic diversity followed by the Moroccan region, Sierra Mágina, and Cazorla region. However, clustering of populations was not in accordance with their geographical locations. Principal component analysis showed the presence of two major groups—Group 1 contained all populations from Cuenca while Group 2 contained populations from Cazorla, Sierra Mágina and Morocco—while Bayesian analysis revealed the presence of three clusters. The low genetic diversity observed in PaCU and YeCA is probably a consequence of inappropriate management since no estimation of genetic variability was performed before the silvicultural treatments. Data indicates that the inter-simple sequence repeat (ISSR method is sufficiently informative and powerful to assess genetic variability among populations of P. nigra.

  16. Genetic diversity and genetic structure of farmed and wild Chinese mitten crab (Eriocheir sinensis) populations from three major basins by mitochondrial DNA COI and Cyt b gene sequences.

    Science.gov (United States)

    Zhang, Cheng; Li, Qingqing; Wu, Xugan; Liu, Qing; Cheng, Yongxu

    2017-11-20

    The Chinese mitten crab, Eriocheir sinensis, is one of the important native crab species in East Asian region, which has been widely cultured throughout China, particularly in river basins of Yangtze, Huanghe and Liaohe. This study was designed to evaluate the genetic diversity and genetic structure of cultured and wild E. sinensis populations from the three river basins based on mitochondrial DNA (mtDNA) cytochrome oxidase subunit I (COI) and cytochrome b (Cyt b). The results showed that there were 62 variable sites and 30 parsimony informative sites in the 647 bp of sequenced mtDNA COI from 335 samples. Similarly, a 637 bp segment of Cyt b provided 59 variable sites and 26 parsimony informative sites. AMOVA showed that the levels of genetic differentiation were low among six populations. Although the haplotype diversity and nucleotide diversity of Huanghe wild population had slightly higher than the other populations, there were no significant differences. There was no significant differentiation between the genetic and geographic distance of the six populations, and haplotype network diagram indicated that there may exist genetic hybrids of E. sinensis from different river basins. The results of clustering and neutrality tests revealed that the distance of geographical locations were not completely related to their genetic distance values for the six populations. In conclusion, these results have great significance for the evaluation and exploitation of germplasm resources of E. sinensis.

  17. Analysis of the a genome genetic diversity among brassica napus, b. rapa and b. juncea accessions using specific simple sequence repeat markers

    International Nuclear Information System (INIS)

    Tian, H.; Yan, J.; Zhang, R.; Guo, Y.; Hu, S.; Channa, S.A.

    2017-01-01

    This investigation was aimed at evaluating the genetic diversity of 127 accessions among Brassica napus, B. rapa, and B. juncea by using 15 pairs of the A genome specific simple sequence repeat primers. These 127 accessions could be clearly separated into three groups by cluster analysis, principal component analysis, and population structure analysis separately, and the results analyzed by the three methods were very similar. Group I comprised of mainly B. napus accessions and the most of B. juncea accessions formed Group II, Group III included nearly all of the B. rapa accessions. The result showed that 36.86% of the variance was due to significant differences among populations of species, indicated that abundance genetic diversity existed among the A genome of B. napus, B. rapa, and B. juncea accessions. B. napus, B. rapa, and B. juncea have the abundant genetic diversity in the A genome, and some elite genes can be used to broaden the genetic base of them, especially for B. napus, in future rapeseed breeding program. (author)

  18. High protists diversity in the plankton of sulfurous lakes and lagoons examined by 18s rRNA gene sequence analyses.

    Science.gov (United States)

    Triadó-Margarit, Xavier; Casamayor, Emilio O

    2015-12-01

    Diversity of small protists was studied in sulfidic and anoxic (euxinic) stratified karstic lakes and coastal lagoons by 18S rRNA gene analyses. We hypothesized a major sulfide effect, reducing protist diversity and richness with only a few specialized populations adapted to deal with low-redox conditions and high-sulfide concentrations. However, genetic fingerprinting suggested similar ecological diversity in anoxic and sulfurous than in upper oxygen rich water compartments with specific populations inhabiting euxinic waters. Many of them agreed with genera previously identified by microscopic observations, but also new and unexpected groups were detected. Most of the sequences matched a rich assemblage of Ciliophora (i.e., Coleps, Prorodon, Plagiopyla, Strombidium, Metopus, Vorticella and Caenomorpha, among others) and algae (mainly Cryptomonadales). Unidentified Cercozoa, Fungi, Stramenopiles and Discoba were recurrently found. The lack of GenBank counterparts was higher in deep hypolimnetic waters and appeared differentially allocated in the different taxa, being higher within Discoba and lower in Cryptophyceae. A larger number of populations than expected were specifically detected in the deep sulfurous waters, with unknown ecological interactions and metabolic capabilities. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.

  19. Microbial Culturomics Broadens Human Vaginal Flora Diversity: Genome Sequence and Description of Prevotella lascolaii sp. nov. Isolated from a Patient with Bacterial Vaginosis.

    Science.gov (United States)

    Diop, Khoudia; Diop, Awa; Levasseur, Anthony; Mediannikov, Oleg; Robert, Catherine; Armstrong, Nicholas; Couderc, Carine; Bretelle, Florence; Raoult, Didier; Fournier, Pierre-Edouard; Fenollar, Florence

    2018-03-01

    Microbial culturomics is a new subfield of postgenomic medicine and omics biotechnology application that has broadened our awareness on bacterial diversity of the human microbiome, including the human vaginal flora bacterial diversity. Using culturomics, a new obligate anaerobic Gram-stain-negative rod-shaped bacterium designated strain khD1 T was isolated in the vagina of a patient with bacterial vaginosis and characterized using taxonogenomics. The most abundant cellular fatty acids were C 15:0 anteiso (36%), C 16:0 (19%), and C 15:0 iso (10%). Based on an analysis of the full-length 16S rRNA gene sequences, phylogenetic analysis showed that the strain khD1 T exhibited 90% sequence similarity with Prevotella loescheii, the phylogenetically closest validated Prevotella species. With 3,763,057 bp length, the genome of strain khD1 T contained (mol%) 48.7 G + C and 3248 predicted genes, including 3194 protein-coding and 54 RNA genes. Given the phenotypical and biochemical characteristic results as well as genome sequencing, strain khD1 T is considered to represent a novel species within the genus Prevotella, for which the name Prevotella lascolaii sp. nov. is proposed. The type strain is khD1 T ( = CSUR P0109, = DSM 101754). These results show that microbial culturomics greatly improves the characterization of the human microbiome repertoire by isolating potential putative new species. Further studies will certainly clarify the microbial mechanisms of pathogenesis of these new microbes and their role in health and disease. Microbial culturomics is an important new addition to the diagnostic medicine toolbox and warrants attention in future medical, global health, and integrative biology postgraduate teaching curricula.

  20. Bacterial Community Diversity of Oil-Contaminated Soils Assessed by High Throughput Sequencing of 16S rRNA Genes

    Directory of Open Access Journals (Sweden)

    Mu Peng

    2015-09-01

    Full Text Available Soil bacteria play a major role in ecological and biodegradable function processes in oil-contaminated soils. Here, we assessed the bacterial diversity and changes therein in oil-contaminated soils exposed to different periods of oil pollution using 454 pyrosequencing of 16S rRNA genes. No less than 24,953 valid reads and 6246 operational taxonomic units (OTUs were obtained from all five studied samples. OTU richness was relatively higher in contaminated soils than clean samples. Acidobacteria, Actinobacteria, Bacteroidetes, Chloroflexi, Planctomycetes and Proteobacteria were the dominant phyla among all the soil samples. The heatmap plot depicted the relative percentage of each bacterial family within each sample and clustered five samples into two groups. For the samples, bacteria in the soils varied at different periods of oil exposure. The oil pollution exerted strong selective pressure to propagate many potentially petroleum degrading bacteria. Redundancy analysis (RDA indicated that organic matter was the highest determinant factor for explaining the variations in community compositions. This suggests that compared to clean soils, oil-polluted soils support more diverse bacterial communities and soil bacterial community shifts were mainly controlled by organic matter and exposure time. These results provide some useful information for bioremediation of petroleum contaminated soil in the future.

  1. Bacterial Community Diversity of Oil-Contaminated Soils Assessed by High Throughput Sequencing of 16S rRNA Genes.

    Science.gov (United States)

    Peng, Mu; Zi, Xiaoxue; Wang, Qiuyu

    2015-09-24

    Soil bacteria play a major role in ecological and biodegradable function processes in oil-contaminated soils. Here, we assessed the bacterial diversity and changes therein in oil-contaminated soils exposed to different periods of oil pollution using 454 pyrosequencing of 16S rRNA genes. No less than 24,953 valid reads and 6246 operational taxonomic units (OTUs) were obtained from all five studied samples. OTU richness was relatively higher in contaminated soils than clean samples. Acidobacteria, Actinobacteria, Bacteroidetes, Chloroflexi, Planctomycetes and Proteobacteria were the dominant phyla among all the soil samples. The heatmap plot depicted the relative percentage of each bacterial family within each sample and clustered five samples into two groups. For the samples, bacteria in the soils varied at different periods of oil exposure. The oil pollution exerted strong selective pressure to propagate many potentially petroleum degrading bacteria. Redundancy analysis (RDA) indicated that organic matter was the highest determinant factor for explaining the variations in community compositions. This suggests that compared to clean soils, oil-polluted soils support more diverse bacterial communities and soil bacterial community shifts were mainly controlled by organic matter and exposure time. These results provide some useful information for bioremediation of petroleum contaminated soil in the future.

  2. Pooled Enrichment Sequencing Identifies Diversity and Evolutionary Pressures at NLR Resistance Genes within a Wild Tomato Population.

    Science.gov (United States)

    Stam, Remco; Scheikl, Daniela; Tellier, Aurélien

    2016-06-02

    Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  3. Pooled Enrichment Sequencing Identifies Diversity and Evolutionary Pressures at NLR Resistance Genes within a Wild Tomato Population

    Science.gov (United States)

    Stam, Remco; Scheikl, Daniela; Tellier, Aurélien

    2016-01-01

    Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. PMID:27189991

  4. Rumen bacterial diversity of 80 to 110-day-old goats using 16S rRNA sequencing.

    Directory of Open Access Journals (Sweden)

    Xufeng Han

    Full Text Available The ability of rumen microorganisms to use fibrous plant matter plays an important role in ruminant animals; however, little information about rumen colonization by microbial populations after weaning has been reported. In this study, high-throughput sequencing was used to investigate the establishment of this microbial population in 80 to 110-day-old goats. Illumina sequencing of goat rumen samples yielded 101,356,610 nucleotides that were assembled into 256,868 reads with an average read length of 394 nucleotides. Taxonomic analysis of metagenomic reads indicated that the predominant phyla were distinct at different growth stages. The phyla Firmicutes and Synergistetes were predominant in samples taken from 80 to 100-day-old goats, but Bacteroidetes and Firmicutes became the most abundant phyla in samples from 110-day-old animals. There was a remarkable variation in the microbial populations with age; Firmicutes and Synergistetes decreased after weaning, but Bacteroidetes and Proteobacteria increased from 80 to 110 day of age. These findings suggested that colonization of the rumen by microorganisms is related to their function in the rumen digestive system. These results give a better understanding of the role of rumen microbes and the establishment of the microbial population, which help to maintain the host's health and improve animal performance.

  5. A correspondence between solution-state dynamics of an individual protein and the sequence and conformational diversity of its family.

    Directory of Open Access Journals (Sweden)

    Gregory D Friedland

    2009-05-01

    Full Text Available Conformational ensembles are increasingly recognized as a useful representation to describe fundamental relationships between protein structure, dynamics and function. Here we present an ensemble of ubiquitin in solution that is created by sampling conformational space without experimental information using "Backrub" motions inspired by alternative conformations observed in sub-Angstrom resolution crystal structures. Backrub-generated structures are then selected to produce an ensemble that optimizes agreement with nuclear magnetic resonance (NMR Residual Dipolar Couplings (RDCs. Using this ensemble, we probe two proposed relationships between properties of protein ensembles: (i a link between native-state dynamics and the conformational heterogeneity observed in crystal structures, and (ii a relation between dynamics of an individual protein and the conformational variability explored by its natural family. We show that the Backrub motional mechanism can simultaneously explore protein native-state dynamics measured by RDCs, encompass the conformational variability present in ubiquitin complex structures and facilitate sampling of conformational and sequence variability matching those occurring in the ubiquitin protein family. Our results thus support an overall relation between protein dynamics and conformational changes enabling sequence changes in evolution. More practically, the presented method can be applied to improve protein design predictions by accounting for intrinsic native-state dynamics.

  6. The population genetics of Quechuas, the largest native South American group: autosomal sequences, SNPs, and microsatellites evidence high level of diversity.

    Science.gov (United States)

    Scliar, Marilia O; Soares-Souza, Giordano B; Chevitarese, Juliana; Lemos, Livia; Magalhães, Wagner C S; Fagundes, Nelson J; Bonatto, Sandro L; Yeager, Meredith; Chanock, Stephen J; Tarazona-Santos, Eduardo

    2012-03-01

    Elucidating the pattern of genetic diversity for non-European populations is necessary to make the benefits of human genetics research available to individuals from these groups. In the era of large human genomic initiatives, Native American populations have been neglected, in particular, the Quechua, the largest South Amerindian group settled along the Andes. We characterized the genetic diversity of a Quechua population in a global setting, using autosomal noncoding sequences (nine unlinked loci for a total of 16 kb), 351 unlinked SNPs and 678 microsatellites and tested predictions of the model of the evolution of Native Americans proposed by (Tarazona-Santos et al.: Am J Hum Genet 68 (2001) 1485-1496). European admixture is Quechua or Melanesian populations, which is concordant with the African origin of modern humans and the fact that South America was the last part of the world to be peopled. The diversity in the Quechua population is comparable with that of Eurasian populations, and the allele frequency spectrum based on resequencing data does not reflect a reduction in the proportion of rare alleles. Thus, the Quechua population is a large reservoir of common and rare genetic variants of South Amerindians. These results are consistent with and complement our evolutionary model of South Amerindians (Tarazona-Santos et al.: Am J Hum Genet 68 (2001) 1485-1496), proposed based on Y-chromosome data, which predicts high genomic diversity due to the high level of gene flow between Andean populations and their long-term effective population size. Copyright © 2012 Wiley Periodicals, Inc.

  7. The Arsenic Resistance-Associated Listeria Genomic Island LGI2 Exhibits Sequence and Integration Site Diversity and a Propensity for Three Listeria monocytogenes Clones with Enhanced Virulence.

    Science.gov (United States)

    Lee, Sangmi; Ward, Todd J; Jima, Dereje D; Parsons, Cameron; Kathariou, Sophia

    2017-11-01

    In the foodborne pathogen Listeria monocytogenes , arsenic resistance is encountered primarily in serotype 4b clones considered to have enhanced virulence and is associated with an arsenic resistance gene cluster within a 35-kb chromosomal region, Listeria genomic island 2 (LGI2). LGI2 was first identified in strain Scott A and includes genes putatively involved in arsenic and cadmium resistance, DNA integration, conjugation, and pathogenicity. However, the genomic localization and sequence content of LGI2 remain poorly characterized. Here we investigated 85 arsenic-resistant L. monocytogenes strains, mostly of serotype 4b. All but one of the 70 serotype 4b strains belonged to clonal complex 1 (CC1), CC2, and CC4, three major clones associated with enhanced virulence. PCR analysis suggested that 53 strains (62.4%) harbored an island highly similar to LGI2 of Scott A, frequently (42/53) in the same location as Scott A ( LMOf2365_2257 homolog). Random-primed PCR and whole-genome sequencing revealed seven novel insertion sites, mostly internal to chromosomal coding sequences, among strains harboring LGI2 outside the LMOf2365_2257 homolog. Interestingly, many CC1 strains harbored a noticeably diversified LGI2 (LGI2-1) in a unique location ( LMOf2365_0902 homolog) and with a novel additional gene. With few exceptions, the tested LGI2 genes were not detected in arsenic-resistant strains of serogroup 1/2, which instead often harbored a Tn 554 -associated arsenic resistance determinant not encountered in serotype 4b. These findings indicate that in L. monocytogenes , LGI2 has a propensity for certain serotype 4b clones, exhibits content diversity, and is highly promiscuous, suggesting an ability to mobilize various accessory genes into diverse chromosomal loci. IMPORTANCE Listeria monocytogenes is widely distributed in the environment and causes listeriosis, a foodborne disease with high mortality and morbidity. Arsenic and other heavy metals can powerfully shape the

  8. Whole-Genome Sequencing and Comparative Genome Analysis Provided Insight into the Predatory Features and Genetic Diversity of Two Bdellovibrio Species Isolated from Soil

    Directory of Open Access Journals (Sweden)

    Omotayo Opemipo Oyedara

    2018-01-01

    Full Text Available Bdellovibrio spp. are predatory bacteria with great potential as antimicrobial agents. Studies have shown that members of the genus Bdellovibrio exhibit peculiar characteristics that influence their ecological adaptations. In this study, whole genomes of two different Bdellovibrio spp. designated SKB1291214 and SSB218315 isolated from soil were sequenced. The core genes shared by all the Bdellovibrio spp. considered for the pangenome analysis including the epibiotic B. exovorus were 795. The number of unique genes identified in Bdellovibrio spp. SKB1291214, SSB218315, W, and B. exovorus JJS was 1343, 113, 857, and 1572, respectively. These unique genes encode hydrolytic, chemotaxis, and transporter proteins which might be useful for predation in the Bdellovibrio strains. Furthermore, the two Bdellovibrio strains exhibited differences based on the % GC content, amino acid identity, and 16S rRNA gene sequence. The 16S rRNA gene sequence of Bdellovibrio sp. SKB1291214 shared 99% identity with that of an uncultured Bdellovibrio sp. clone 12L 106 (a pairwise distance of 0.008 and 95–97% identity (a pairwise distance of 0.043 with that of other culturable terrestrial Bdellovibrio spp., including strain SSB218315. In Bdellovibrio sp. SKB1291214, 174 bp sequence was inserted at the host interaction (hit locus region usually attributed to prey attachment, invasion, and development of host independent Bdellovibrio phenotypes. Also, a gene equivalent to Bd0108 in B. bacteriovorus HD100 was not conserved in Bdellovibrio sp. SKB1291214. The results of this study provided information on the genetic characteristics and diversity of the genus Bdellovibrio that can contribute to their successful applications as a biocontrol agent.

  9. Diversity of Pea-Associated F. proliferatum and F. verticillioides Populations Revealed by FUM1 Sequence Analysis and Fumonisin Biosynthesis

    Science.gov (United States)

    Waśkiewicz, Agnieszka; Stępień, Łukasz; Wilman, Karolina; Kachlicki, Piotr

    2013-01-01

    Fusarium proliferatum and F. verticillioides are considered as minor pathogens of pea (Pisum sativum L.). Both species can survive in seed material without visible disease symptoms, but still contaminating it with fumonisins. Two populations of pea-derived F. proliferatum and F. verticillioides strains were subjected to FUM1 sequence divergence analysis, forming a distinct group when compared to the collection strains originating from different host species. Furthermore, the mycotoxigenic abilities of those strains were evaluated on the basis of in planta and in vitro fumonisin biosynthesis. No differences were observed in fumonisin B (FB) levels measured in pea seeds (maximum level reached 1.5 μg g−1); however, in rice cultures, the majority of F. proliferatum genotypes produced higher amounts of FB1–FB3 than F. verticillioides strains. PMID:23470545

  10. Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis.

    Science.gov (United States)

    Yutin, Natalya; Bäckström, Disa; Ettema, Thijs J G; Krupovic, Mart; Koonin, Eugene V

    2018-04-10

    Analysis of metagenomic sequences has become the principal approach for the study of the diversity of viruses. Many recent, extensive metagenomic studies on several classes of viruses have dramatically expanded the visible part of the virosphere, showing that previously undetected viruses, or those that have been considered rare, actually are important components of the global virome. We investigated the provenance of viruses related to tail-less bacteriophages of the family Tectiviridae by searching genomic and metagenomics sequence databases for distant homologs of the tectivirus-like Double Jelly-Roll major capsid proteins (DJR MCP). These searches resulted in the identification of numerous genomes of virus-like elements that are similar in size to tectiviruses (10-15 kilobases) and have diverse gene compositions. By comparison of the gene repertoires, the DJR MCP-encoding genomes were classified into 6 distinct groups that can be predicted to differ in reproduction strategies and host ranges. Only the DJR MCP gene that is present by design is shared by all these genomes, and most also encode a predicted DNA-packaging ATPase; the rest of the genes are present only in subgroups of this unexpectedly diverse collection of DJR MCP-encoding genomes. Only a minority encode a DNA polymerase which is a hallmark of the family Tectiviridae and the putative family "Autolykiviridae". Notably, one of the identified putative DJR MCP viruses encodes a homolog of Cas1 endonuclease, the integrase involved in CRISPR-Cas adaptation and integration of transposon-like elements called casposons. This is the first detected occurrence of Cas1 in a virus. Many of the identified elements are individual contigs flanked by inverted or direct repeats and appear to represent complete, extrachromosomal viral genomes, whereas others are flanked by bacterial genes and thus can be considered as proviruses. These contigs come from metagenomes of widely different environments, some dominated by

  11. Allelic diversity of the MHC class II DRB genes in brown bears (Ursus arctos) and a comparison of DRB sequences within the family Ursidae.

    Science.gov (United States)

    Goda, N; Mano, T; Kosintsev, P; Vorobiev, A; Masuda, R

    2010-11-01

    The allelic diversity of the DRB locus in major histocompatibility complex (MHC) genes was analyzed in the brown bear (Ursus arctos) from the Hokkaido Island of Japan, Siberia, and Kodiak of Alaska. Nineteen alleles of the DRB exon 2 were identified from a total of 38 individuals of U. arctos and were highly polymorphic. Comparisons of non-synonymous and synonymous substitutions in the antigen-binding sites of deduced amino acid sequences indicated evidence for balancing selection on the bear DRB locus. The phylogenetic analysis of the DRB alleles among three genera (Ursus, Tremarctos, and Ailuropoda) in the family Ursidae revealed that DRB allelic lineages were not separated according to species. This strongly shows trans-species persistence of DRB alleles within the Ursidae. © 2010 John Wiley & Sons A/S.

  12. Study of the diversity of microbial communities in a sequencing batch reactor oxic-settling-anaerobic process and its modified process.

    Science.gov (United States)

    Sun, Lianpeng; Chen, Jianfan; Wei, Xiange; Guo, Wuzhen; Lin, Meishan; Yu, Xiaoyu

    2016-05-01

    To further reveal the mechanism of sludge reduction in the oxic-settling-anaerobic (OSA) process, the polymerase chain reaction - denaturing gradient gel electrophoresis protocol was used to study the possible difference in the microbial communities between a sequencing batch reactor (SBR)-OSA process and its modified process, by analyzing the change in the diversity of the microbial communities in each reactor of both systems. The results indicated that the structure of the microbial communities in aerobic reactors of the 2 processes was very different, but the predominant microbial populations in anaerobic reactors were similar. The predominant microbial population in the aerobic reactor of the SBR-OSA belonged to Burkholderia cepacia, class Betaproteobacteria, while those of the modified process belonged to the classes Alphaproteobacteria, Betaproteobacteria, and Gammaproteobacteria. These 3 types of microbes had a cryptic growth characteristic, which was the main cause of a greater sludge reduction efficiency achieved by the modified process.

  13. Inter-Simple Sequence Repeat (ISSR Markers to Study Genetic Diversity Among Cotton Cultivars in Associated with Salt Tolerance

    Directory of Open Access Journals (Sweden)

    Ali Akbar ABDI

    2012-11-01

    Full Text Available Developing salt-tolerant crops is very important as a significant proportion of cultivated land is salt-affected. Screening and selection of salt tolerant genotypes of cotton using DNA molecular markers not only introduce tolerant cultivars useful for hybridization and breeding programs but also detect DNA regions involved in mechanism of salinity tolerance. To study this, 28 cotton cultivars, including 8 Iranian cotton varieties were grown in pots under greenhouse condition and three salt treatments were imposed with salt solutions (0, 70 and 140 mM NaCl. Eight agronomic traits including root length, root fresh weight, root dry weight, chlorophyll and fluorescence index, K+ and Na+ contents in shoot (above ground biomass, and K+/Na+ ratio were measured. Cluster analysis of cultivars based on measured agronomic traits, showed �Cindose� and �Ciacra� as the most tolerant cultivars, and �B-557� and �43347� as the most sensitive cultivars of salt damage. A total of 65 polymorphic DNA fragments were generated at 14 inter-simple sequence repeat (ISSR loci. Plants of 28 cultivars of cotton grouped into three clusters based on ISSR markers. Regression analysis of markers in relation with traits data showed that 23, 33 and 30 markers associated with the measured traits in three salt treatments respectively. These markers might help breeders in any marker assisted selection program in order to improving cotton cultivars against salt stress.

  14. High-Throughput Sequencing Reveals Diverse Sets of Conserved, Nonconserved, and Species-Specific miRNAs in Jute

    Directory of Open Access Journals (Sweden)

    Md. Tariqul Islam

    2015-01-01

    Full Text Available MicroRNAs play a pivotal role in regulating a broad range of biological processes, acting by cleaving mRNAs or by translational repression. A group of plant microRNAs are evolutionarily conserved; however, others are expressed in a species-specific manner. Jute is an agroeconomically important fibre crop; nonetheless, no practical information is available for microRNAs in jute to date. In this study, Illumina sequencing revealed a total of 227 known microRNAs and 17 potential novel microRNA candidates in jute, of which 164 belong to 23 conserved families and the remaining 63 belong to 58 nonconserved families. Among a total of 81 identified microRNA families, 116 potential target genes were predicted for 39 families and 11 targets were predicted for 4 among the 17 identified novel microRNAs. For understanding better the functions of microRNAs, target genes were analyzed by Gene Ontology and their pathways illustrated by KEGG pathway analyses. The presence of microRNAs identified in jute was validated by stem-loop RT-PCR followed by end point PCR and qPCR for randomly selected 20 known and novel microRNAs. This study exhaustively identifies microRNAs and their target genes in jute which will ultimately pave the way for understanding their role in this crop and other crops.

  15. High diversity of beta-lactamases in the General Hospital Vienna verified by whole genome sequencing and statistical analysis.

    Science.gov (United States)

    Barišić, Ivan; Mitteregger, Dieter; Hirschl, Alexander M; Noehammer, Christa; Wiesinger-Mayr, Herbert

    2014-10-01

    The detailed analysis of antibiotic resistance mechanisms is essential for understanding the underlying evolutionary processes, the implementation of appropriate intervention strategies and to guarantee efficient treatment options. In the present study, 110 β-lactam-resistant, clinical isolates of Enterobacteriaceae sampled in 2011 in one of Europe's largest hospitals, the General Hospital Vienna, were screened for the presence of 31 β-lactamase genes. Twenty of those isolates were selected for whole genome sequencing (WGS). In addition, the number of β-lactamase genes was estimated using biostatistical models. The carbapenemase genes blaKPC-2, blaKPC-3, and blaVIM-4 were identified in carbapenem-resistant and intermediate susceptible isolates, blaOXA-72 in an extended-spectrum β-lactamase (ESBL)-positive one. Furthermore, the observed high prevalence of the acquired blaDHA-1 and blaCMY AmpC β-lactamase genes (70%) in phenotypically AmpC-positive isolates is alarming due to their capability to become carbapenem-resistant upon changes in membrane permeability. The statistical analyses revealed that approximately 55% of all β-lactamase genes present in the General Hospital Vienna were detected by this study. In summary, this work gives a very detailed picture on the disseminated β-lactamases and other resistance genes in one of Europe's largest hospitals. Copyright © 2014 Elsevier B.V. All rights reserved.

  16. In silico analysis highlights the frequency and diversity of type 1 lantibiotic gene clusters in genome sequenced bacteria

    LENUS (Irish Health Repository)

    Marsh, Alan J

    2010-11-30

    Abstract Background Lantibiotics are lanthionine-containing, post-translationally modified antimicrobial peptides. These peptides have significant, but largely untapped, potential as preservatives and chemotherapeutic agents. Type 1 lantibiotics are those in which lanthionine residues are introduced into the structural peptide (LanA) through the activity of separate lanthionine dehydratase (LanB) and lanthionine synthetase (LanC) enzymes. Here we take advantage of the conserved nature of LanC enzymes to devise an in silico approach to identify potential lantibiotic-encoding gene clusters in genome sequenced bacteria. Results In total 49 novel type 1 lantibiotic clusters were identified which unexpectedly were associated with species, genera and even phyla of bacteria which have not previously been associated with lantibiotic production. Conclusions Multiple type 1 lantibiotic gene clusters were identified at a frequency that suggests that these antimicrobials are much more widespread than previously thought. These clusters represent a rich repository which can yield a large number of valuable novel antimicrobials and biosynthetic enzymes.

  17. A mutant Brassica napus (canola population for the identification of new genetic diversity via TILLING and next generation sequencing.

    Directory of Open Access Journals (Sweden)

    Erin J Gilchrist

    Full Text Available We have generated a Brassica napus (canola population of 3,158 EMS-mutagenised lines and used TILLING to demonstrate that the population has a high enough mutation density that it will be useful for identification of mutations in genes of interest in this important crop species. TILLING is a reverse genetics technique that has been successfully used in many plant and animal species. Classical TILLING involves the generation of a mutagenised population, followed by screening of DNA samples using a mismatch-specific endonuclease that cleaves only those PCR products that carry a mutation. Polyacrylamide gel detection is then used to visualise the mutations in any gene of interest. We have used this TILLING technique to identify 432 unique mutations in 26 different genes in B. napus (canola cv. DH12075. This reflects a mutation density ranging from 1/56 kb to 1/308 kb (depending on the locus with an average of 1/109 kb. We have also successfully verified the utility of next generation sequencing technology as a powerful approach for the identification of rare mutations in a population of plants, even in polyploid species such as B. napus. Most of the mutants we have identified are publically available.

  18. Molecular Comparison and Evolutionary Analyses of VP1 Nucleotide Sequences of New African Human Enterovirus 71 Isolates Reveal a Wide Genetic Diversity

    Science.gov (United States)

    Nougairède, Antoine; Joffret, Marie-Line; Deshpande, Jagadish M.; Dubot-Pérès, Audrey; Héraud, Jean-Michel

    2014-01-01

    Most circulating strains of Human enterovirus 71 (EV-A71) have been classified primarily into three genogroups (A to C) on the basis of genetic divergence between the 1D gene, which encodes the VP1 capsid protein. The aim of the present study was to provide further insights into the diversity of the EV-A71 genogroups following the recent description of highly divergent isolates, in particular those from African countries, including Madagascar. We classified recent EV-A71 isolates by a large comparison of 3,346 VP1 nucleotidic sequences collected from GenBank. Analysis of genetic distances and phylogenetic investigations indicated that some recently-reported isolates did not fall into the genogroups A-C and clustered into three additional genogroups, including one Indian genogroup (genogroup D) and 2 African ones (E and F). Our Bayesian phylogenetic analysis provided consistent data showing that the genogroup D isolates share a recent common ancestor with the members of genogroup E, while the isolates of genogroup F evolved from a recent common ancestor shared with the members of the genogroup B. Our results reveal the wide diversity that exists among EV-A71 isolates and suggest that the number of circulating genogroups is probably underestimated, particularly in developing countries where EV-A71 epidemiology has been poorly studied. PMID:24598878

  19. Multilocus sequence typing, biochemical and antibiotic resistance characterizations reveal diversity of North American strains of the honey bee pathogen Paenibacillus larvae.

    Science.gov (United States)

    Krongdang, Sasiprapa; Evans, Jay D; Pettis, Jeffery S; Chantawannakul, Panuwan

    2017-01-01

    Paenibacillus larvae is a Gram positive bacterium and the causative agent of the most widespread fatal brood disease of honey bees, American foulbrood (AFB). A total of thirty-three independent Paenibacillus larvae isolates from various geographical origins in North America and five reference strains were investigated for genetic diversity using multilocus sequence typing (MLST). This technique is regarded to be a powerful tool for epidemiological studies of pathogenic bacteria and is widely used in genotyping assays. For MLST, seven housekeeping gene loci, ilvD (dihydroxy-acid dyhydrogenase), tri (triosephosphate isomerase), purH (phospharibosyl-aminoimidazolecarboxamide), recF (DNA replication and repair protein), pyrE (orotate phosphoribosyltransferase), sucC (succinyl coenzyme A synthetase β subunit) and glpF (glycerol uptake facilitator protein) were studied and applied for primer designs. Previously, ERIC type DNA fingerprinting was applied to these same isolates and the data showed that almost all represented the ERIC I type, whereas using BOX-PCR gave an indication of more diversity. All isolates were screened for resistance to four antibiotics used by U.S. beekeepers, showing extensive resistance to tetracycline and the first records of resistance to tylosin and lincomycin. Our data highlight the intraspecies relationships of P. larvae and the potential application of MLST methods in enhancing our understanding of epidemiological relationships among bacterial isolates of different origins.

  20. Multilocus sequence typing, biochemical and antibiotic resistance characterizations reveal diversity of North American strains of the honey bee pathogen Paenibacillus larvae.

    Directory of Open Access Journals (Sweden)

    Sasiprapa Krongdang

    Full Text Available Paenibacillus larvae is a Gram positive bacterium and the causative agent of the most widespread fatal brood disease of honey bees, American foulbrood (AFB. A total of thirty-three independent Paenibacillus larvae isolates from various geographical origins in North America and five reference strains were investigated for genetic diversity using multilocus sequence typing (MLST. This technique is regarded to be a powerful tool for epidemiological studies of pathogenic bacteria and is widely used in genotyping assays. For MLST, seven housekeeping gene loci, ilvD (dihydroxy-acid dyhydrogenase, tri (triosephosphate isomerase, purH (phospharibosyl-aminoimidazolecarboxamide, recF (DNA replication and repair protein, pyrE (orotate phosphoribosyltransferase, sucC (succinyl coenzyme A synthetase β subunit and glpF (glycerol uptake facilitator protein were studied and applied for primer designs. Previously, ERIC type DNA fingerprinting was applied to these same isolates and the data showed that almost all represented the ERIC I type, whereas using BOX-PCR gave an indication of more diversity. All isolates were screened for resistance to four antibiotics used by U.S. beekeepers, showing extensive resistance to tetracycline and the first records of resistance to tylosin and lincomycin. Our data highlight the intraspecies relationships of P. larvae and the potential application of MLST methods in enhancing our understanding of epidemiological relationships among bacterial isolates of different origins.

  1. Evaluation of genetic diversity in Chinese kale (Brassica oleracea L. var. alboglabra Bailey) by using rapid amplified polymorphic DNA and sequence-related amplified polymorphism markers.

    Science.gov (United States)

    Zhang, J; Zhang, L G

    2014-02-14

    Chinese kale is an original Chinese vegetable of the Cruciferae family. To select suitable parents for hybrid breeding, we thoroughly analyzed the genetic diversity of Chinese kale. Random amplified polymorphic DNA (RAPD) and sequence-related amplified polymorphism (SRAP) molecular markers were used to evaluate the genetic diversity across 21 Chinese kale accessions from AVRDC and Guangzhou in China. A total of 104 bands were detected by 11 RAPD primers, of which 66 (63.5%) were polymorphic, and 229 polymorphic bands (68.4%) were observed in 335 bands amplified by 17 SRAP primer combinations. The dendrogram showed the grouping of the 21 accessions into 4 main clusters based on RAPD data, and into 6 clusters based on SRAP and combined data (RAPD + SRAP). The clustering of accessions based on SRAP data was consistent with petal colors. The Mantel test indicated a poor fit for the RAPD and SRAP data (r = 0.16). These results have an important implication for Chinese kale germplasm characterization and improvement.

  2. Sequence analysis of measles virus strains collected during the pre- and early-vaccination era in Denmark reveals a considerable diversity of ancient strains

    DEFF Research Database (Denmark)

    Christensen, Laurids Siig; Schöller, S.; Schierup, M. H.

    2002-01-01

    A total of 199 serum samples from patients with measles collected in Denmark, Greenland and the Faroe Islands from 1964 to 1983 were analysed by PCR. Measles virus (MV) RNA could be detected in 38 (19%) of the samples and a total of 18 strains were subjected to partial sequence analysis of the he......A total of 199 serum samples from patients with measles collected in Denmark, Greenland and the Faroe Islands from 1964 to 1983 were analysed by PCR. Measles virus (MV) RNA could be detected in 38 (19%) of the samples and a total of 18 strains were subjected to partial sequence analysis...... of the hemagglutinin gene. The strains exhibited a considerable genomic diversity, which is at odds with the assumption that one genome type prevailed among globally circulating MV strains prior to the advent of live-attenuated vaccines. Our data indicate that the similarity of the various vaccine strains...... is attributed to their having originated from the same primary isolate. Consequently, it is implied that a small number of clinical manifestations of MV worldwide from which strains similar to the vaccine strain were identified were vaccine related rather than being caused by members of a persistently...

  3. Distribution and factors associated with Salmonella enterica genotypes in a diverse population of humans and animals in Qatar using multi-locus sequence typing (MLST).

    Science.gov (United States)

    Chang, Yu C; Scaria, Joy; Ibraham, Mariamma; Doiphode, Sanjay; Chang, Yung-Fu; Sultan, Ali; Mohammed, Hussni O

    2016-01-01

    Salmonella enterica is one of the most commonly reported causes of bacterial foodborne illness around the world. Understanding the sources of this pathogen and the associated factors that exacerbate its risk to humans will help in developing risk mitigation strategies. The genetic relatedness among Salmonella isolates recovered from human gastroenteritis cases and food animals in Qatar were investigated in the hope of shedding light on these sources, their possible transmission routes, and any associated factors. A repeat cross-sectional study was conducted in which the samples and associated data were collected from both populations (gastroenteritis cases and animals). Salmonella isolates were initially analyzed using multi-locus sequence typing (MLST) to investigate the genetic diversity and clonality. The relatedness among the isolates was assessed using the minimum spanning tree (MST). Twenty-seven different sequence types (STs) were identified in this study; among them, seven were novel, including ST1695, ST1696, ST1697, ST1698, ST1699, ST1702, and ST1703. The pattern of overall ST distribution was diverse; in particular, it was revealed that ST11 and ST19 were the most common sequence types, presenting 29.5% and 11.5% within the whole population. In addition, 20 eBurst Groups (eBGs) were identified in our data, which indicates that ST11 and ST19 belonged to eBG4 and eBG1, respectively. In addition, the potential association between the putative risk factors and eBGs were evaluated. There was no significant clustering of these eBGs by season; however, a significant association was identified in terms of nationality in that Qataris were six times more likely to present with eBG1 compared to non-Qataris. In the MST analysis, four major clusters were presented, namely, ST11, ST19, ST16, and ST31. The linkages between the clusters alluded to a possible transmission route. The results of the study have provided insight into the ST distributions of S. enterica and

  4. Increased genetic diversity and prevalence of co-infection with Trypanosoma spp. in koalas (Phascolarctos cinereus and their ticks identified using next-generation sequencing (NGS.

    Directory of Open Access Journals (Sweden)

    Amanda D Barbosa

    Full Text Available Infections with Trypanosoma spp. have been associated with poor health and decreased survival of koalas (Phascolarctos cinereus, particularly in the presence of concurrent pathogens such as Chlamydia and koala retrovirus. The present study describes the application of a next-generation sequencing (NGS-based assay to characterise the prevalence and genetic diversity of trypanosome communities in koalas and two native species of ticks (Ixodes holocyclus and I. tasmani removed from koala hosts. Among 168 koalas tested, 32.2% (95% CI: 25.2-39.8% were positive for at least one Trypanosoma sp. Previously described Trypanosoma spp. from koalas were identified, including T. irwini (32.1%, 95% CI: 25.2-39.8%, T. gilletti (25%, 95% CI: 18.7-32.3%, T. copemani (27.4%, 95% CI: 20.8-34.8% and T. vegrandis (10.1%, 95% CI: 6.0-15.7%. Trypanosoma noyesi was detected for the first time in koalas, although at a low prevalence (0.6% 95% CI: 0-3.3%, and a novel species (Trypanosoma sp. AB-2017 was identified at a prevalence of 4.8% (95% CI: 2.1-9.2%. Mixed infections with up to five species were present in 27.4% (95% CI: 21-35% of the koalas, which was significantly higher than the prevalence of single infections 4.8% (95% CI: 2-9%. Overall, a considerably higher proportion (79.7% of the Trypanosoma sequences isolated from koala blood samples were identified as T. irwini, suggesting this is the dominant species. Co-infections involving T. gilletti, T. irwini, T. copemani, T. vegrandis and Trypanosoma sp. AB-2017 were also detected in ticks, with T. gilletti and T. copemani being the dominant species within the invertebrate hosts. Direct Sanger sequencing of Trypanosoma 18S rRNA gene amplicons was also performed and results revealed that this method was only able to identify the genotypes with greater amount of reads (according to NGS within koala samples, which highlights the advantages of NGS in detecting mixed infections. The present study provides new insights

  5. Increased genetic diversity and prevalence of co-infection with Trypanosoma spp. in koalas (Phascolarctos cinereus) and their ticks identified using next-generation sequencing (NGS).

    Science.gov (United States)

    Barbosa, Amanda D; Gofton, Alexander W; Paparini, Andrea; Codello, Annachiara; Greay, Telleasha; Gillett, Amber; Warren, Kristin; Irwin, Peter; Ryan, Una

    2017-01-01

    Infections with Trypanosoma spp. have been associated with poor health and decreased survival of koalas (Phascolarctos cinereus), particularly in the presence of concurrent pathogens such as Chlamydia and koala retrovirus. The present study describes the application of a next-generation sequencing (NGS)-based assay to characterise the prevalence and genetic diversity of trypanosome communities in koalas and two native species of ticks (Ixodes holocyclus and I. tasmani) removed from koala hosts. Among 168 koalas tested, 32.2% (95% CI: 25.2-39.8%) were positive for at least one Trypanosoma sp. Previously described Trypanosoma spp. from koalas were identified, including T. irwini (32.1%, 95% CI: 25.2-39.8%), T. gilletti (25%, 95% CI: 18.7-32.3%), T. copemani (27.4%, 95% CI: 20.8-34.8%) and T. vegrandis (10.1%, 95% CI: 6.0-15.7%). Trypanosoma noyesi was detected for the first time in koalas, although at a low prevalence (0.6% 95% CI: 0-3.3%), and a novel species (Trypanosoma sp. AB-2017) was identified at a prevalence of 4.8% (95% CI: 2.1-9.2%). Mixed infections with up to five species were present in 27.4% (95% CI: 21-35%) of the koalas, which was significantly higher than the prevalence of single infections 4.8% (95% CI: 2-9%). Overall, a considerably higher proportion (79.7%) of the Trypanosoma sequences isolated from koala blood samples were identified as T. irwini, suggesting this is the dominant species. Co-infections involving T. gilletti, T. irwini, T. copemani, T. vegrandis and Trypanosoma sp. AB-2017 were also detected in ticks, with T. gilletti and T. copemani being the dominant species within the invertebrate hosts. Direct Sanger sequencing of Trypanosoma 18S rRNA gene amplicons was also performed and results revealed that this method was only able to identify the genotypes with greater amount of reads (according to NGS) within koala samples, which highlights the advantages of NGS in detecting mixed infections. The present study provides new insights on the

  6. Application of next-generation sequencing technology to study genetic diversity and identify unique SNP markers in bread wheat from Kazakhstan.

    Science.gov (United States)

    Shavrukov, Yuri; Suchecki, Radoslaw; Eliby, Serik; Abugalieva, Aigul; Kenebayev, Serik; Langridge, Peter

    2014-09-28

    New SNP marker platforms offer the opportunity to investigate the relationships between wheat cultivars from different regions and assess the mechanism and processes that have led to adaptation to particular production environments. Wheat breeding has a long history in Kazakhstan and the aim of this study was to explore the relationship between key varieties from Kazakhstan and germplasm from breeding programs for other regions. The study revealed 5,898 polymorphic markers amongst ten cultivars, of which 2,730 were mapped in the consensus genetic map. Mapped SNP markers were distributed almost equally across the A and B genomes, with between 279 and 484 markers assigned to each chromosome. Marker coverage was approximately 10-fold lower in the D genome. There were 863 SNP markers identified as unique to specific cultivars, and clusters of these markers (regions containing more than three closely mapped unique SNPs) showed specific patterns on the consensus genetic map for each cultivar. Significant intra-varietal genetic polymorphism was identified in three cultivars (Tzelinnaya 3C, Kazakhstanskaya rannespelaya and Kazakhstanskaya 15). Phylogenetic analysis based on inter-varietal polymorphism showed that the very old cultivar Erythrospermum 841 was the most genetically distinct from the other nine cultivars from Kazakhstan, falling in a clade together with the American cultivar Sonora and genotypes from Central and South Asia. The modern cultivar Kazakhstanskaya 19 also fell into a separate clade, together with the American cultivar Thatcher. The remaining eight cultivars shared a single sub-clade but were categorised into four clusters. The accumulated data for SNP marker polymorphisms amongst bread wheat genotypes from Kazakhstan may be used for studying genetic diversity in bread wheat, with potential application for marker-assisted selection and the preparation of a set of genotype-specific markers.

  7. A Next-Generation Sequencing Data Analysis Pipeline for Detecting Unknown Pathogens from Mixed Clinical Samples and Revealing Their Genetic Diversity.

    Directory of Open Access Journals (Sweden)

    Yu-Nong Gong

    Full Text Available Forty-two cytopathic effect (CPE-positive isolates were collected from 2008 to 2012. All isolates could not be identified for known viral pathogens by routine diagnostic assays. They were pooled into 8 groups of 5-6 isolates to reduce the sequencing cost. Next-generation sequencing (NGS was conducted for each group of mixed samples, and the proposed data analysis pipeline was used to identify viral pathogens in these mixed samples. Polymerase chain reaction (PCR or enzyme-linked immunosorbent assay (ELISA was individually conducted for each of these 42 isolates depending on the predicted viral types in each group. Two isolates remained unknown after these tests. Moreover, iteration mapping was implemented for each of these 2 isolates, and predicted human parechovirus (HPeV in both. In summary, our NGS pipeline detected the following viruses among the 42 isolates: 29 human rhinoviruses (HRVs, 10 HPeVs, 1 human adenovirus (HAdV, 1 echovirus and 1 rotavirus. We then focused on the 10 identified Taiwanese HPeVs because of their reported clinical significance over HRVs. Their genomes were assembled and their genetic diversity was explored. One novel 6-bp deletion was found in one HPeV-1 virus. In terms of nucleotide heterogeneity, 64 genetic variants were detected from these HPeVs using the mapped NGS reads. Most importantly, a recombination event was found between our HPeV-3 and a known HPeV-4 strain in the database. Similar event was detected in the other HPeV-3 strains in the same clade of the phylogenetic tree. These findings demonstrated that the proposed NGS data analysis pipeline identified unknown viruses from the mixed clinical samples, revealed their genetic identity and variants, and characterized their genetic features in terms of viral evolution.

  8. Multilocus Sequence Typing and Staphylococcal Protein A Typing Revealed Novel and Diverse Clones of Methicillin-Resistant Staphylococcus aureus in Seafood and the Aquatic Environment.

    Science.gov (United States)

    Murugadas, V; Toms, C Joseph; Reethu, Sara A; Lalitha, K V

    2017-03-01

    Methicillin-resistant Staphylococcus aureus (MRSA) has been a global health concern since the 1960s, and isolation of this pathogen from food-producing animals has been increasing. However, little information is available on the prevalence of MRSA and its clonal characteristics in seafood and the aquatic environment. In this study, 267 seafood and aquatic environment samples were collected from three districts of Kerala, India. Staphylococcal protein A (spa) typing and multilocus sequence typing (MLST) was performed for 65 MRSA strains isolated from 20 seafood and aquatic environment samples. The MRSA clonal profiles were t657-ST772, t002-ST5, t334-ST5, t311-ST5, t121-ST8, t186-ST88, t127-ST1, and two non-spa assignable strains. Whole spa gene sequence analysis along with MLST confirmed one strain as t711-ST6 and another as a novel MRSA clone identified for the first time in seafood and the aquatic environment with a t15669 spa type and a new MLST profile of ST420-256-236-66-82-411-477. The MRSA strains were clustered into five clonal complexes based on the goeBURST algorithm, indicating high diversity among MRSA strains in seafood and the aquatic environment. The novel clone formed a separate clonal complex with matches to three loci. This study recommends large-scale spa typing and MLST of MRSA isolates from seafood and the aquatic environment to determine the prevalence of new MRSA clones. This monitoring process can be useful for tracing local spread of MRSA isolates into the seafood production chain in a defined geographical area.

  9. Relationship between ureB Sequence Diversity, Urease Activity and Genotypic Variations of Different Helicobacter pylori Strains in Patients with Gastric Disorders.

    Science.gov (United States)

    Ghalehnoei, Hossein; Ahmadzadeh, Alireza; Farzi, Nastaran; Alebouyeh, Masoud; Aghdaei, Hamid Asadzadeh; Azimzadeh, Pendram; Molaei, Mahsa; Zali, Mohammad Reza

    2016-01-01

    Association of the severity of Helicobacter pylori induced diseases with virulence entity of the colonized strains was proven in some studies. Urease has been demonstrated as a potent virulence factor for H. pylori. The main aim of this study was investigation of the relationships of ureB sequence diversity, urease activity and virulence genotypes of different H. pylori strains with histopathological changes of gastric tissue in infected patients suffering from different gastric disorders. Analysis of the virulence genotypes in the isolated strains indicated significant associations between the presence of severe active gastritis and cagA+ (P = 0.039) or cagA/iceA1 genotypes (P = 0.026), and intestinal metaplasia and vacA m1 (P = 0.008) or vacA s1/m2 (P = 0.001) genotypes. Our results showed a 2.4-fold increased risk of peptic ulcer (95% CI: 0.483-11.93), compared with gastritis, in the infected patients who had dupA positive strains; however this association was not statistically significant. The results of urease activity showed a significant mean difference between the isolated strains from patients with PUD and NUD (P = 0.034). This activity was relatively higher among patients with intestinal metaplasia. Also a significant association was found between the lack of cagA and increased urease activity among the isolated strains (P = 0.036). While the greatest sequence variation of ureB was detected in a strain from a patient with intestinal metaplasia, the sole determined amino acid change in UreB sequence (Ala201Thr, 30%), showed no influence on urease activity. In conclusion, the supposed role of H. pylori urease to form peptic ulcer and advancing of intestinal metaplasia was postulated in this study. Higher urease activity in the colonizing H. pylori strains that present specific virulence factors was indicated as a risk factor for promotion of histopathological changes of gastric tissue that advance gastric malignancy.

  10. Genetic mapping using the Diversity Arrays Technology (DArT) : application and validation using the whole-genome sequences of Arabidopsis thaliana and the fungal wheat pathogen Mycosphaerella graminicola

    NARCIS (Netherlands)

    Wittenberg, A.H.J.

    2007-01-01

    Diversity Arrays Technology (DArT) is a microarray-based DNA marker technique for genome-wide discovery and genotyping of genetic variation. DArT allows simultaneous scoring of hundreds- to thousands of restriction site based polymorphisms between genotypes and does not require DNA sequence

  11. Genetic diversity of nifH gene sequences in Paenibacillus azotofixans strains and soil samples analyzed by denaturing gradiënt gel electrophoresis of PCR-amplified gene fragments

    NARCIS (Netherlands)

    Rosado, A.S.; Duarte, G.F.; Seldin, L.; Elsas, van J.D.

    1998-01-01

    The diversity of dinitrogenase reductase gene (nifH) fragments in Paenibacillus azotofixans strains was investigated by using molecular methods. The partial nifH gene sequences of eight P. azotofixans strains, as well as one strain each of the close relatives Paenibacillus durum, Paenibacillus

  12. Eukaryotic Plankton Species Diversity in the Western Channel of the Korea Strait using 18S rDNA Sequences and its Implications for Water Masses

    Science.gov (United States)

    Lee, Sang-Rae; Song, Eun Hye; Lee, Tongsup

    2018-03-01

    Organisms entering the East Sea (Sea of Japan) through the Korea Strait, together with water, salt, and energy, affect the East Sea ecosystem. In this study, we report on the biodiversity of eukaryotic plankton found in the Western Channel of the Korea Strait for the first time using small subunit ribosomal RNA gene (18S rDNA) sequences. We also discuss the characteristics of water masses and their physicochemical factors. Diverse taxonomic groups were recovered from 18S rDNA clone libraries, including putative novel, higher taxonomic entities affiliated with Cercozoa, Raphidophyceae, Picozoa, and novel marine Stramenopiles. We also found that there was cryptic genetic variation at both the intraspecific and interspecific levels among arthropods, diatoms, and green algae. Specific plankton assemblages were identified at different sampling depths and they may provide useful information that could be used to interpret the origin and the subsequent mixing history of the water masses that contribute to the Tsushima Warm Current waters. Furthermore, the biological information highlighted in this study may help improve our understanding about the complex water mass interactions that were highlighted in the Korea Strait.

  13. Bacterial diversity analysis of Huanglongbing pathogen-infected citrus, using PhyloChip and 16S rRNA gene clone library sequencing

    Energy Technology Data Exchange (ETDEWEB)

    Shankar Sagaram, U.; DeAngelis, K.M.; Trivedi, P.; Andersen, G.L.; Lu, S.-E.; Wang, N.

    2009-03-01

    The bacterial diversity associated with citrus leaf midribs was characterized 1 from citrus groves that contained the Huanglongbing (HLB) pathogen, which has yet to be cultivated in vitro. We employed a combination of high-density phylogenetic 16S rDNA microarray and 16S rDNA clone library sequencing to determine the microbial community composition of symptomatic and asymptomatic citrus midribs. Our results revealed that citrus leaf midribs can support a diversity of microbes. PhyloChip analysis indicated that 47 orders of bacteria from 15 phyla were present in the citrus leaf midribs while 20 orders from phyla were observed with the cloning and sequencing method. PhyloChip arrays indicated that nine taxa were significantly more abundant in symptomatic midribs compared to asymptomatic midribs. Candidatus Liberibacter asiaticus (Las) was detected at a very low level in asymptomatic plants, but was over 200 times more abundant in symptomatic plants. The PhyloChip analysis was further verified by sequencing 16S rDNA clone libraries, which indicated the dominance of Las in symptomatic leaves. These data implicate Las as the pathogen responsible for HLB disease. Citrus is the most important commercial fruit crop in Florida. In recent years, citrus Huanglongbing (HLB), also called citrus greening, has severely affected Florida's citrus production and hence has drawn an enormous amount of attention. HLB is one of the most devastating diseases of citrus (6,13), characterized by blotchy mottling with green islands on leaves, as well as stunting, fruit decline, and small, lopsided fruits with poor coloration. The disease tends to be associated with a phloem-limited fastidious {alpha}-proteobacterium given a provisional Candidatus status (Candidatus Liberobacter spp. later changed to Candidatus Liberibacter spp.) in nomenclature (18,25,34). Previous studies indicate that HLB infection causes disorder in the phloem and severely impairs the translocation of assimilates in

  14. A Targeted Enrichment Strategy for Massively Parallel Sequencing of Angiosperm Plastid Genomes

    Directory of Open Access Journals (Sweden)

    Gregory W. Stull

    2013-02-01

    Full Text Available Premise of the study: We explored a targeted enrichment strategy to facilitate rapid and low-cost next-generation sequencing (NGS of numerous complete plastid genomes from across the phylogenetic breadth of angiosperms. Methods and Results: A custom RNA probe set including the complete sequences of 22 previously sequenced eudicot plastomes was designed to facilitate hybridization-based targeted enrichment of eudicot plastid genomes. Using this probe set and an Agilent SureSelect targeted enrichment kit, we conducted an enrichment experiment including 24 angiosperms (22 eudicots, two monocots, which were subsequently sequenced on a single lane of the Illumina GAIIx with single-end, 100-bp reads. This approach yielded nearly complete to complete plastid genomes with exceptionally high coverage (mean coverage: 717×, even for the two monocots. Conclusions: Our enrichment experiment was highly successful even though many aspects of the capture process employed were suboptimal. Hence, significant improvements to this methodology are feasible. With this general approach and probe set, it should be possible to sequence more than 300 essentially complete plastid genomes in a single Illumina GAIIx lane (achieving 50× mean coverage. However, given the complications of pooling numerous samples for multiplex sequencing and the limited number of barcodes (e.g., 96 available in commercial kits, we recommend 96 samples as a current practical maximum for multiplex plastome sequencing. This high-throughput approach should facilitate large-scale plastid genome sequencing at any level of phylogenetic diversity in angiosperms.

  15. ASAP: Amplification, sequencing & annotation of plastomes

    Directory of Open Access Journals (Sweden)

    Folta Kevin M

    2005-12-01

    Full Text Available Abstract Background Availability of DNA sequence information is vital for pursuing structural, functional and comparative genomics studies in plastids. Traditionally, the first step in mining the valuable information within a chloroplast genome requires sequencing a chloroplast plasmid library or BAC clones. These activities involve complicated preparatory procedures like chloroplast DNA isolation or identification of the appropriate BAC clones to be sequenced. Rolling circle amplification (RCA is being used currently to amplify the chloroplast genome from purified chloroplast DNA and the resulting products are sheared and cloned prior to sequencing. Herein we present a universal high-throughput, rapid PCR-based technique to amplify, sequence and assemble plastid genome sequence from diverse species in a short time and at reasonable cost from total plant DNA, using the large inverted repeat region from strawberry and peach as proof of concept. The method exploits the highly conserved coding regions or intergenic regions of plastid genes. Using an informatics approach, chloroplast DNA sequence information from 5 available eudicot plastomes was aligned to identify the most conserved regions. Cognate primer pairs were then designed to generate ~1 – 1.2 kb overlapping amplicons from the inverted repeat region in 14 diverse genera. Results 100% coverage of the inverted repeat region was obtained from Arabidopsis, tobacco, orange, strawberry, peach, lettuce, tomato and Amaranthus. Over 80% coverage was obtained from distant species, including Ginkgo, loblolly pine and Equisetum. Sequence from the inverted repeat region of strawberry and peach plastome was obtained, annotated and analyzed. Additionally, a polymorphic region identified from gel electrophoresis was sequenced from tomato and Amaranthus. Sequence analysis revealed large deletions in these species relative to tobacco plastome thus exhibiting the utility of this method for structural and

  16. Highly multiplexed targeted DNA sequencing from single nuclei.

    Science.gov (United States)

    Leung, Marco L; Wang, Yong; Kim, Charissa; Gao, Ruli; Jiang, Jerry; Sei, Emi; Navin, Nicholas E

    2016-02-01

    Single-cell DNA sequencing methods are challenged by poor physical coverage, high technical error rates and low throughput. To address these issues, we developed a single-cell DNA sequencing protocol that combines flow-sorting of single nuclei, time-limited multiple-displacement amplification (MDA), low-input library preparation, DNA barcoding, targeted capture and next-generation sequencing (NGS). This approach represents a major improvement over our previous single nucleus sequencing (SNS) Nature Protocols paper in terms of generating higher-coverage data (>90%), thereby enabling the detection of genome-wide variants in single mammalian cells at base-pair resolution. Furthermore, by pooling 48-96 single-cell libraries together for targeted capture, this approach can be used to sequence many single-cell libraries in parallel in a single reaction. This protocol greatly reduces the cost of single-cell DNA sequencing, and it can be completed in 5-6 d by advanced users. This single-cell DNA sequencing protocol has broad applications for studying rare cells and complex populations in diverse fields of biological research and medicine.

  17. Population structure and genetic diversity of Indian Major Carp, Labeo rohita (Hamilton, 1822) from three phylo-geographically isolated riverine ecosystems of India as revealed by mtDNA cytochrome b region sequences.

    Science.gov (United States)

    Behera, Bijay Kumar; Baisvar, Vishwamitra Singh; Kunal, Swaraj Priyaranjan; Meena, Dharmendra Kumar; Panda, Debarata; Pakrashi, Sudip; Paria, Prasenjit; Das, Pronob; Bhakta, Dibakar; Debnath, Dipesh; Roy, Suvra; Suresh, V R; Jena, J K

    2018-03-01

    The population structure and genetic diversity of Rohu (Labeo rohita Hamilton, 1822) was studied by analysis of the partial sequences of mitochondrial DNA cytochrome b region. We examined 133 samples collected from six locations in three geographically isolated rivers of India. Analysis of 11 haplotypes showed low haplotype diversity (0.00150), nucleotide diversity (π) (0.02884) and low heterogeneity value (0.00374). Analysis of molecular variance (AMOVA) revealed the genetic diversity of L. rohita within population is very high than between the populations. The Fst scores (-0.07479 to 0.07022) were the indication of low genetic structure of L. rohita populations of three rivers of India. Conspicuously, Farakka-Bharuch population pair Fst score of 0.0000, although the sampling sites are from different rivers. The phylogenetic reconstruction of unique haplotypes revealed sharing of a single central haplotype (Hap_1) by all the six populations with a point mutations ranging from 1-25 nucleotides.

  18. Cell culture isolation and sequence analysis of genetically diverse US porcine epidemic diarrhea virus strains including a novel strain with a large deletion in the spike gene.

    Science.gov (United States)

    Oka, Tomoichiro; Saif, Linda J; Marthaler, Douglas; Esseili, Malak A; Meulia, Tea; Lin, Chun-Ming; Vlasova, Anastasia N; Jung, Kwonil; Zhang, Yan; Wang, Qiuhong

    2014-10-10

    The highly contagious and deadly porcine epidemic diarrhea virus (PEDV) first appeared in the US in April 2013. Since then the virus has spread rapidly nationwide and to Canada and Mexico causing high mortality among nursing piglets and significant economic losses. Currently there are no efficacious preventive measures or therapeutic tools to control PEDV in the US. The isolation of PEDV in cell culture is the first step toward the development of an attenuated vaccine, to study the biology of PEDV and to develop in vitro PEDV immunoassays, inactivation assays and screen for PEDV antivirals. In this study, nine of 88 US PEDV strains were isolated successfully on Vero cells with supplemental trypsin and subjected to genomic sequence analysis. They differed genetically mainly in the N-terminal S protein region as follows: (1) strains (n=7) similar to the highly virulent US PEDV strains; (2) one similar to the reportedly US S INDEL PEDV strain; and (3) one novel strain most closely related to highly virulent US PEDV strains, but with a large (197aa) deletion in the S protein. Representative strains of these three genetic groups were passaged serially and grew to titers of ∼5-6log10 plaque forming units/mL. To our knowledge, this is the first report of the isolation in cell culture of an S INDEL PEDV strain and a PEDV strain with a large (197aa) deletion in the S protein. We also designed primer sets to detect these genetically diverse US PEDV strains. Copyright © 2014 Elsevier B.V. All rights reserved.

  19. Systematization of the protein sequence diversity in enzymes related to secondary metabolic pathways in plants, in the context of big data biology inspired by the KNApSAcK motorcycle database.

    Science.gov (United States)

    Ikeda, Shun; Abe, Takashi; Nakamura, Yukiko; Kibinge, Nelson; Hirai Morita, Aki; Nakatani, Atsushi; Ono, Naoaki; Ikemura, Toshimichi; Nakamura, Kensuke; Altaf-Ul-Amin, Md; Kanaya, Shigehiko

    2013-05-01

    Biology is increasingly becoming a data-intensive science with the recent progress of the omics fields, e.g. genomics, transcriptomics, proteomics and metabolomics. The species-metabolite relationship database, KNApSAcK Core, has been widely utilized and cited in metabolomics research, and chronological analysis of that research work has helped to reveal recent trends in metabolomics research. To meet the needs of these trends, the KNApSAcK database has been extended by incorporating a secondary metabolic pathway database called Motorcycle DB. We examined the enzyme sequence diversity related to secondary metabolism by means of batch-learning self-organizing maps (BL-SOMs). Initially, we constructed a map by using a big data matrix consisting of the frequencies of all possible dipeptides in the protein sequence segments of plants and bacteria. The enzyme sequence diversity of the secondary metabolic pathways was examined by identifying clusters of segments associated with certain enzyme groups in the resulting map. The extent of diversity of 15 secondary metabolic enzyme groups is discussed. Data-intensive approaches such as BL-SOM applied to big data matrices are needed for systematizing protein sequences. Handling big data has become an inevitable part of biology.

  20. Genome sequence analysis of five Canadian isolates of strawberry mottle virus reveals extensive intra-species diversity and a longer RNA2 with increased coding capacity compared to a previously characterized European isolate.

    Science.gov (United States)

    Bhagwat, Basdeo; Dickison, Virginia; Ding, Xinlun; Walker, Melanie; Bernardy, Michael; Bouthillier, Michel; Creelman, Alexa; DeYoung, Robyn; Li, Yinzi; Nie, Xianzhou; Wang, Aiming; Xiang, Yu; Sanfaçon, Hélène

    2016-06-01

    In this study, we report the genome sequence of five isolates of strawberry mottle virus (family Secoviridae, order Picornavirales) from strawberry field samples with decline symptoms collected in Eastern Canada. The Canadian isolates differed from the previously characterized European isolate 1134 in that they had a longer RNA2, resulting in a 239-amino-acid extension of the C-terminal region of the polyprotein. Sequence analysis suggests that reassortment and recombination occurred among the isolates. Phylogenetic analysis revealed that the Canadian isolates are diverse, grouping in two separate branches along with isolates from Europe and the Americas.

  1. The characterization of twenty sequenced human genomes.

    Directory of Open Access Journals (Sweden)

    Kimberly Pelak

    2010-09-01

    Full Text Available We present the analysis of twenty human genomes to evaluate the prospects for identifying rare functional variants that contribute to a phenotype of interest. We sequenced at high coverage ten "case" genomes from individuals with severe hemophilia A and ten "control" genomes. We summarize the number of genetic variants emerging from a study of this magnitude, and provide a proof of concept for the identification of rare and highly-penetrant functional variants by confirming that the cause of hemophilia A is easily recognizable in this data set. We also show that the number of novel single nucleotide variants (SNVs discovered per genome seems to stabilize at about 144,000 new variants per genome, after the first 15 individuals have been sequenced. Finally, we find that, on average, each genome carries 165 homozygous protein-truncating or stop loss variants in genes representing a diverse set of pathways.

  2. Sequential strand displacement beacon for detection of DNA coverage on functionalized gold nanoparticles.

    Science.gov (United States)

    Paliwoda, Rebecca E; Li, Feng; Reid, Michael S; Lin, Yanwen; Le, X Chris

    2014-06-17

    Functionalizing nanomaterials for diverse analytical, biomedical, and therapeutic applications requires determination of surface coverage (or density) of DNA on nanomaterials. We describe a sequential strand displacement beacon assay that is able to quantify specific DNA sequences conjugated or coconjugated onto gold nanoparticles (AuNPs). Unlike the conventional fluorescence assay that requires the target DNA to be fluorescently labeled, the sequential strand displacement beacon method is able to quantify multiple unlabeled DNA oligonucleotides using a single (universal) strand displacement beacon. This unique feature is achieved by introducing two short unlabeled DNA probes for each specific DNA sequence and by performing sequential DNA strand displacement reactions. Varying the relative amounts of the specific DNA sequences and spacing DNA sequences during their coconjugation onto AuNPs results in different densities of the specific DNA on AuNP, ranging from 90 to 230 DNA molecules per AuNP. Results obtained from our sequential strand displacement beacon assay are consistent with those obtained from the conventional fluorescence assays. However, labeling of DNA with some fluorescent dyes, e.g., tetramethylrhodamine, alters DNA density on AuNP. The strand displacement strategy overcomes this problem by obviating direct labeling of the target DNA. This method has broad potential to facilitate more efficient design and characterization of novel multifunctional materials for diverse applications.

  3. Two low coverage bird genomes and a comparison of reference-guided versus de novo genome assemblies.

    Science.gov (United States)

    Card, Daren C; Schield, Drew R; Reyes-Velasco, Jacobo; Fujita, Matthew K; Andrew, Audra L; Oyler-McCance, Sara J; Fike, Jennifer A; Tomback, Diana F; Ruggiero, Robert P; Castoe, Todd A

    2014-01-01

    As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (∼3.5-5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies.

  4. Two low coverage bird genomes and a comparison of reference-guided versus de novo genome assemblies

    Science.gov (United States)

    Card, Daren C.; Schield, Drew R.; Reyes-Velasco, Jacobo; Fujita, Matthre K.; Andrew, Audra L.; Oyler-McCance, Sara J.; Fike, Jennifer A.; Tomback, Diana F.; Ruggiero, Robert P.; Castoe, Todd A.

    2014-01-01

    As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (~3.5–5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies.

  5. Liver imaging at 3.0 T: Diffusion-induced black-blood echo-planar imaging with large anatomic volumetric coverage as an alternative for specific absorption rate-intensive echo-train spin-echo sequences: Feasibility study

    NARCIS (Netherlands)

    I.C. van den Bos (Indra); S.M. Hussain (Shahid); G.P. Krestin (Gabriel); P.A. Wielopolski (Piotr)

    2008-01-01

    textabstractInstitutional Review Board approval and signed informed consent were obtained by all participants for an ongoing sequence optimization project at 3.0 T. The purpose of this study was to evaluate breath-hold diffusion-induced blackblood echo-planar imaging (BBEPI) as a potential

  6. Coverage analysis of lists of genes involved in heterogeneous ...

    Indian Academy of Sciences (India)

    The rationale of our study was to specifically evaluate sequence coverage using ... Catherine Badens and Martin Krahn contributed equally to this work. the analysis of ... (Life Technologies) with sequencing data processing using the Torrent ... map4 parameter, which is the default option to balance rapid- ity and maintain a ...

  7. Genetic diversity analysis of sugarcane germplasm based on fluorescence-labeled simple sequence repeat markers and a capillary electrophoresis-based genotyping platform

    Science.gov (United States)

    Genetic diversity analysis, which refers to the elaboration of total extent of genetic characteristics in the genetic makeup of a certain species, constitutes a classical strategy for the study of diversity, population genetic structure, and breeding practices. In this study, fluorescence-labeled se...

  8. MerCat: a versatile k-mer counter and diversity estimator for database-independent property analysis obtained from metagenomic and/or metatranscriptomic sequencing data

    Energy Technology Data Exchange (ETDEWEB)

    White, Richard A.; Panyala, Ajay R.; Glass, Kevin A.; Colby, Sean M.; Glaesemann, Kurt R.; Jansson, Georg C.; Jansson, Janet K.

    2017-02-21

    MerCat is a parallel, highly scalable and modular property software package for robust analysis of features in next-generation sequencing data. MerCat inputs include assembled contigs and raw sequence reads from any platform resulting in feature abundance counts tables. MerCat allows for direct analysis of data properties without reference sequence database dependency commonly used by search tools such as BLAST and/or DIAMOND for compositional analysis of whole community shotgun sequencing (e.g. metagenomes and metatranscriptomes).

  9. Evaluation of haplotype diversity of Achatina fulica (Lissachatina) [Bowdich] from Indian sub-continent by means of 16S rDNA sequence and its phylogenetic relationships with other global populations.

    Science.gov (United States)

    Ayyagari, Vijaya Sai; Sreerama, Krupanidhi

    2017-08-01

    Achatina fulica (Lissachatina fulica) is one of the most invasive species found across the globe causing a significant damage to crops, vegetables, and horticultural plants. This terrestrial snail is native to east Africa and spread to different parts of the world by introductions. India, a hot spot for biodiversity of several endemic gastropods, has witnessed an outburst of this snail population in several parts of the country posing a serious threat to crop loss and also to human health. With an objective to evaluate the genetic diversity of this snail, we have sampled this snail from different parts of India and analyzed its haplotype diversity by means of 16S rDNA sequence information. Apart from this, we have studied the phylogenetic relationships of the isolates sequenced in the present study in relation with other global populations by Bayesian and Maximum-likelihood approaches. Of the isolates sequenced, haplotype 'C' is the predominant one. A new haplotype 'S' from the state of Odisha was observed. The isolates sequenced in the present study clustered with its conspecifics from the Indian sub-continent. Haplotype network analyses were also carried out for studying the evolution of different haplotypes. It was observed that haplotype 'S' was associated with a Mauritius haplotype 'H', indicating the possibility of multiple introductions of A. fulica to India.

  10. Genome-wide-analyses of Listeria monocytogenes from food-processing plants reveals clonal diversity and dates the emergence of persisting sequence types

    DEFF Research Database (Denmark)

    Knudsen, Gitte Maegaard; Nielsen, Jesper Boye; Marvig, Rasmus Lykke

    2017-01-01

    for three persisting sequence types (ST) based on Multi Locus Sequence Typing (MLST) being ST7, ST8 and ST121, long-term persistence of clonal groups was limited, and new clones were introduced continuously, potentially from raw materials. No particular gene could be linked to the persistence phenotype...

  11. Coverage Metrics for Model Checking

    Science.gov (United States)

    Penix, John; Visser, Willem; Norvig, Peter (Technical Monitor)

    2001-01-01

    When using model checking to verify programs in practice, it is not usually possible to achieve complete coverage of the system. In this position paper we describe ongoing research within the Automated Software Engineering group at NASA Ames on the use of test coverage metrics to measure partial coverage and provide heuristic guidance for program model checking. We are specifically interested in applying and developing coverage metrics for concurrent programs that might be used to support certification of next generation avionics software.

  12. Solid‐Phase Synthesis of Structurally Diverse Heterocycles by an Amide–Ketone Condensation/N‐Acyliminium Pictet–Spengler Sequence

    DEFF Research Database (Denmark)

    Komnatnyy, Vitaly V.; Givskov, Michael Christian; Nielsen, Thomas Eiland

    2012-01-01

    An efficient approach for the solid‐phase synthesis of structurally diverse heterocyclic compounds is presented. Under acidic reaction conditions, peptidic levulinamides undergo intramolecular ketone–amide condensation reactions to form cyclic N‐acyliminium intermediates. In the presence...

  13. Diversity of Entamoeba spp. in African great apes and humans: an insight from Illumina MiSeq high-throughput sequencing

    Czech Academy of Sciences Publication Activity Database

    Vlčková, Klára; Kreisinger, J.; Pafčo, B.; Čížková, Dagmar; Tagg, N.; Hehl, A. B.; Modrý, David

    (2018) ISSN 0020-7519 Institutional support: RVO:68081766 ; RVO:60077344 Keywords : Entamoeba * Western lowland gorilla * Central chimpanzee * Humans * Metabarcoding * Diversity * Mixed infections * Entamoeba histolytica Impact factor: 3.730, year: 2016

  14. Study of the Effect of SRT on Microbial Diversity in Laboratory-scale Sequencing Batch Reactors Using Acclimated and Non-Acclimated Seed

    KAUST Repository

    Tellez, Berenice

    2011-01-01

    Solids Retention Time (SRT) is an important design parameter in activated sludge wastewater treatment systems. In this study, the effect of SRT on the bacterial community structure and diversity was examined in replicate lab-scale activated sludge

  15. Subglacial Lake Vostok (Antarctica accretion ice contains a diverse set of sequences from aquatic, marine and sediment-inhabiting bacteria and eukarya.

    Directory of Open Access Journals (Sweden)

    Yury M Shtarkman

    Full Text Available Lake Vostok, the 7(th largest (by volume and 4(th deepest lake on Earth, is covered by more than 3,700 m of ice, making it the largest subglacial lake known. The combination of cold, heat (from possible hydrothermal activity, pressure (from the overriding glacier, limited nutrients and complete darkness presents extreme challenges to life. Here, we report metagenomic/metatranscriptomic sequence analyses from four accretion ice sections from the Vostok 5G ice core. Two sections accreted in the vicinity of an embayment on the southwestern end of the lake, and the other two represented part of the southern main basin. We obtained 3,507 unique gene sequences from concentrates of 500 ml of 0.22 µm-filtered accretion ice meltwater. Taxonomic classifications (to genus and/or species were possible for 1,623 of the sequences. Species determinations in combination with mRNA gene sequence results allowed deduction of the metabolic pathways represented in the accretion ice and, by extension, in the lake. Approximately 94% of the sequences were from Bacteria and 6% were from Eukarya. Only two sequences were from Archaea. In general, the taxa were similar to organisms previously described from lakes, brackish water, marine environments, soil, glaciers, ice, lake sediments, deep-sea sediments, deep-sea thermal vents, animals and plants. Sequences from aerobic, anaerobic, psychrophilic, thermophilic, halophilic, alkaliphilic, acidophilic, desiccation-resistant, autotrophic and heterotrophic organisms were present, including a number from multicellular eukaryotes.

  16. Conservation in gene encoding Mycobacterium tuberculosis antigen Rv2660 and a high predicted population coverage of H56 multistage vaccine in South Africa.

    Science.gov (United States)

    Perez-Martinez, Angy P; Ong, Edison; Zhang, Lixin; Marrs, Carl F; He, Yongqun; Yang, Zhenhua

    2017-11-01

    H56/AERAS-456+IC31 (H56), composed of two early secretion proteins, Ag85B and ESAT-6, and a latency associated protein, Rv2660, and the IC31 Intercell adjuvant, is a new fusion subunit vaccine candidate designed to induce immunity against both new infection and reactivation of latent tuberculosis infection. Efficacy of subunit vaccines may be affected by the diversity of vaccine antigens among clinical strains and the extent of recognition by the diverse HLA molecules in the recipient population. Although a previous study showed the conservative nature of Ag85B- and ESAT-6-encoding genes, genetic diversity of Rv2660c that encodes RV2660 is largely unknown. The population coverage of H56 as a whole yet remains to be assessed. The present study was conducted to address these important knowledge gaps. DNA sequence analysis of Rv2660c found no variation among 83 of the 84 investigated clinical strains belonging to four genetic lineages. H56 was predicted to have as high as 99.6% population coverage in the South Africa population using the Immune Epitope Database (IEDB) Population Coverage Tool. Further comparison of H56 population coverage between South African Blacks and Caucasians based on the phenotypic frequencies of binding MHC Class I and Class II supertype alleles found that all of the nine MHC-I and six of eight MHC-II human leukocyte antigen (HLA) supertype alleles analyzed were significantly differentially expressed between the two subpopulations. This finding suggests the presence of race-specific functional binding motifs of MHC-I and MHC-II HLA alleles, which, in turn, highlights the importance of including diverse populations in vaccine clinical evaluation. In conclusion, H56 vaccine is predicted to have a promising population coverage in South Africa; this study demonstrates the utility of integrating comparative genomics and bioinformatics in bridging animal and clinical studies of novel TB vaccines. Copyright © 2017 Elsevier B.V. All rights reserved.

  17. Predicted Strain Coverage of a New Meningococcal Multicomponent Vaccine (4CMenB in Spain: Analysis of the Differences with Other European Countries.

    Directory of Open Access Journals (Sweden)

    Raquel Abad

    Full Text Available A novel meningococcal multicomponent vaccine, 4CMenB (Bexsero®, has been approved in Europe, Canada, Australia and US. The potential impact of 4CMenB on strain coverage is being estimated by using Meningococcal Antigen Typing System (MATS, an ELISA assay which measures vaccine antigen expression and diversity in each strain. Here we show the genetic characterization and the 4CMenB potential coverage of Spanish invasive strains (collected during one epidemiological year compared to other European countries and discuss the potential reasons for the lower estimate of coverage in Spain.A panel of 300 strains, a representative sample of all serogroup B Neisseria meningitidis notified cases in Spain from 2009 to 2010, was characterized by multilocus sequence typing (MLST and FetA variable region determination. 4CMenB vaccine antigens, PorA, factor H binding protein (fHbp, Neisseria Heparin Binding Antigen (NHBA and Neisserial adhesin A (NadA were molecularly typed by sequencing. PorA coverage was assigned to strain with VR2 = 4. The levels of expression and cross-reactivity of fHbp, NHBA and NadA were analyzed using MATS ELISA.Global estimated strain coverage by MATS was 68.67% (95% CI: 47.77-84.59%, with 51.33%, 15.33% and 2% of strains covered by one, two and three vaccine antigens, respectively. The predicted strain coverage by individual antigens was: 42% NHBA, 36.33% fHbp, 8.33% PorA and 1.33% NadA. Coverage within the most prevalent clonal complexes (cc was 70.37% for cc 269, 30.19% for cc 213 and 95.83% for cc 32.Clonal complexes (cc distribution accounts for variations in strain coverage, so that country-by-country investigations of strain coverage and cc prevalence are important. Because the cc distribution could also vary over time, which in turn could lead to changes in strain coverage, continuous detailed surveillance and monitoring of vaccine antigens expression is needed in those countries where the multicomponent vaccine is introduced

  18. A High-Coverage Yersinia pestis Genome from a Sixth-Century Justinianic Plague Victim.

    Science.gov (United States)

    Feldman, Michal; Harbeck, Michaela; Keller, Marcel; Spyrou, Maria A; Rott, Andreas; Trautmann, Bernd; Scholz, Holger C; Päffgen, Bernd; Peters, Joris; McCormick, Michael; Bos, Kirsten; Herbig, Alexander; Krause, Johannes

    2016-11-01

    The Justinianic Plague, which started in the sixth century and lasted to the mid eighth century, is thought to be the first of three historically documented plague pandemics causing massive casualties. Historical accounts and molecular data suggest the bacterium Yersinia pestis as its etiological agent. Here we present a new high-coverage (17.9-fold) Y. pestis genome obtained from a sixth-century skeleton recovered from a southern German burial site close to Munich. The reconstructed genome enabled the detection of 30 unique substitutions as well as structural differences that have not been previously described. We report indels affecting a lacl family transcription regulator gene as well as nonsynonymous substitutions in the nrdE, fadJ, and pcp genes, that have been suggested as plague virulence determinants or have been shown to be upregulated in different models of plague infection. In addition, we identify 19 false positive substitutions in a previously published lower-coverage Y. pestis genome from another archaeological site of the same time period and geographical region that is otherwise genetically identical to the high-coverage genome sequence reported here, suggesting low-genetic diversity of the plague during the sixth century in rural southern Germany. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  19. Nucleotide sequences from the genomes of diverse cowpea accessions for discovery of genetic variation as part of the Feed the Future Innovation Lab for Climate Resilient Cowpea

    Data.gov (United States)

    US Agency for International Development — Nucleotide sequences were generated from 37 cowpea (Vigna unguiculata L. Walp.) accessions relevant to Africa, China and the USA to discover at type of genetic...

  20. Genome survey of pistachio (Pistacia vera L.) by next generation sequencing: Development of novel SSR markers and genetic diversity in Pistacia species

    OpenAIRE

    Ziya Motalebipour, Elmira; Kafkas, Salih; Khodaeiaminjan, Mortaza; ?oban, Nergiz; G?zel, Hatice

    2016-01-01

    Background Pistachio (Pistacia vera L.) is one of the most important nut crops in the world. There are about 11 wild species in the genus Pistacia, and they have importance as rootstock seed sources for cultivated P. vera and forest trees. Published information on the pistachio genome is limited. Therefore, a genome survey is necessary to obtain knowledge on the genome structure of pistachio by next generation sequencing. Simple sequence repeat (SSR) markers are useful tools for germplasm cha...

  1. Analyses of Twelve New Whole Genome Sequences of Cassava Brown Streak Viruses and Ugandan Cassava Brown Streak Viruses from East Africa: Diversity, Supercomputing and Evidence for Further Speciation

    Science.gov (United States)

    Ndunguru, Joseph; Sseruwagi, Peter; Tairo, Fred; Stomeo, Francesca; Maina, Solomon; Djinkeng, Appolinaire; Kehoe, Monica; Boykin, Laura M.

    2015-01-01

    Cassava brown streak disease is caused by two devastating viruses, Cassava brown streak virus (CBSV) and Ugandan cassava brown streak virus (UCBSV) which are frequently found infecting cassava, one of sub-Saharan Africa’s most important staple food crops. Each year these viruses cause losses of up to $100 million USD and can leave entire families without their primary food source, for an entire year. Twelve new whole genomes, including seven of CBSV and five of UCBSV were uncovered in this research, doubling the genomic sequences available in the public domain for these viruses. These new sequences disprove the assumption that the viruses are limited by agro-ecological zones, show that current diagnostic primers are insufficient to provide confident diagnosis of these viruses and give rise to the possibility that there may be as many as four distinct species of virus. Utilizing NGS sequencing technologies and proper phylogenetic practices will rapidly increase the solution to sustainable cassava production. PMID:26439260

  2. Analyses of Twelve New Whole Genome Sequences of Cassava Brown Streak Viruses and Ugandan Cassava Brown Streak Viruses from East Africa: Diversity, Supercomputing and Evidence for Further Speciation.

    Directory of Open Access Journals (Sweden)

    Joseph Ndunguru

    Full Text Available Cassava brown streak disease is caused by two devastating viruses, Cassava brown streak virus (CBSV and Ugandan cassava brown streak virus (UCBSV which are frequently found infecting cassava, one of sub-Saharan Africa's most important staple food crops. Each year these viruses cause losses of up to $100 million USD and can leave entire families without their primary food source, for an entire year. Twelve new whole genomes, including seven of CBSV and five of UCBSV were uncovered in this research, doubling the genomic sequences available in the public domain for these viruses. These new sequences disprove the assumption that the viruses are limited by agro-ecological zones, show that current diagnostic primers are insufficient to provide confident diagnosis of these viruses and give rise to the possibility that there may be as many as four distinct species of virus. Utilizing NGS sequencing technologies and proper phylogenetic practices will rapidly increase the solution to sustainable cassava production.

  3. Genetic diversity and host plant preferences revealed by simple sequence repeat and mitochondrial markers in a population of the arbuscular mycorrhizal fungus Glomus intraradices

    NARCIS (Netherlands)

    Croll, D.; Wille, L.; Gamper, H.A.; Mathimaran, N.; Lammers, P.J.; Corradi, N.; Sanders, I.R.

    2008-01-01

    Arbuscular mycorrhizal fungi (AMF) are important symbionts of plants that improve plant nutrient acquisition and promote plant diversity. Although within-species genetic differences among AMF have been shown to differentially affect plant growth, very little is actually known about the degree of

  4. The pioneering use of ISSR (Inter Simple Sequence Repeat in Neotropical anurans: preliminary assessment of genetic diversity in populations of Physalaemus cuvieri (Amphibia, Leiuperidae

    Directory of Open Access Journals (Sweden)

    Rafaela M Moresco

    2013-01-01

    Full Text Available The greatest diversity of anurans in the world is in Brazil and one of the major challenges is to reconcile the accelerated economic development with strategies that aim to maintain this diversity in forest fragments, often representing ESUs of some biomes. This study aimed to obtain data that will support conservation projects through the pioneering use of ISSR analysis in Neotropical anurans, estimating the intra- and interpopulation genetic diversity of four populations of P. cuvieri (Paraná and São Paulo regions. Of the 65 loci scored 58 were polymorphic, with 0.797 intrapopulation variation and 0.203 interpopulation variation. The index of interpopulation genetic differentiation (Fst proved to be high among the population of Marmeleiro-PR and the three populations of SP (Fst > 0.288; genetic dissimilarity was related to the geographical distance. The ISSR proved to be efficient and useful molecular markers in comparison with other markers most widely used for preliminary diagnosis of genetic diversity in populations of amphibians, and could be applied as a tool for future conservation projects, since they could identify potential ESUs and influence decisions on the preservation of fragments.

  5. A global meta-analysis of Tuber ITS rDNA sequences: species diversity, host associations and long-distance dispersal

    Science.gov (United States)

    Gregory M. Bonito; Andrii P. Gryganskyi; James M. Trappe; Rytas. Vilgalys

    2010-01-01

    Truffles (Tuber) are ectomycorrhizal fungi characterized by hypogeous fruitbodies. Their biodiversity, host associations and geographical distributions are not well documented. ITS rDNA sequences of Tuber are commonly recovered from molecular surveys of fungal communities, but most remain insufficiently identified making it...

  6. The Gut Microbiotassay: a high-throughput qPCR approach combinable with next generation sequencing to study gut microbial diversity

    DEFF Research Database (Denmark)

    Hermann-Bank, Marie Louise; Skovgaard, Kerstin; Stockmarr, Anders

    2013-01-01

    ®) followed by next generation sequencing. Primers were designed if necessary and all primer sets were screened against DNA extracted from pure cultures of 15 representative bacterial species. Subsequently the setup was tested on DNA extracted from small and large intestinal content from piglets...

  7. Sequence diversity of hepatitis C virus 6a within the extended interferon sensitivity-determining region correlates with interferon-alpha/ribavirin treatment outcomes.

    Science.gov (United States)

    Zhou, Daniel X M; Chan, Paul K S; Zhang, Tiejun; Tully, Damien C; Tam, John S

    2010-10-01

    Studies on the association between sequence variability of the interferon sensitivity-determining region (ISDR) of hepatitis C virus and the outcome of treatment have reached conflicting results. In this study, 25 patients infected with HCV 6a who had received interferon-alpha/ribavirin combination treatment were analyzed for the sequence variations. 14 of them had the full genome sequences obtained from a previous study, whereas the other 11 samples were sequenced for the extended ISDR (eISDR). This eISDR fragment covers 192 bp (64 amino acids) upstream and 201 bp (67 amino acids) downstream from the ISDR previously defined for HCV 1b. The comparison between interferon-alpha resistance and response groups for the amino acid mutations located in the full genome (6 and 8 patients respectively) as well as the mutations located in the eISDR (10 and 15 patients respectively) showed that the mutations I2160V, I2256V, V2292I (Pc) 2010 Elsevier B.V. All rights reserved.

  8. DNA sequences from two SSRs (CIR316 and MUCS088) linked to root-knot nematode resistance genes from diverse cottons (Gossypium spp).

    Science.gov (United States)

    We investigated DNA sequencing information from alleles (DNA amplified fragments) of two previously reported SSR markers (CIR316 and MUCS088) linked to root-knot nematode (RKN) resistance genes. Markers based on electrophoretic differences, including RFLPs, AFLPs and SSRs can sometimes mask underlyi...

  9. Urbanization Reduces Transfer of Diverse Environmental Microbiota Indoors

    Directory of Open Access Journals (Sweden)

    Anirudra Parajuli

    2018-02-01

    Full Text Available Expanding urbanization is a major factor behind rapidly declining biodiversity. It has been proposed that in urbanized societies, the rarity of contact with diverse environmental microbiota negatively impacts immune function and ultimately increases the risk for allergies and other immune-mediated disorders. Surprisingly, the basic assumption that urbanization reduces exposure to environmental microbiota and its transfer indoors has rarely been examined. We investigated if the land use type around Finnish homes affects the diversity, richness, and abundance of bacterial communities indoors. Debris deposited on standardized doormats was collected in 30 rural and 26 urban households in and near the city of Lahti, Finland, in August 2015. Debris was weighed, bacterial community composition determined by high throughput sequencing of bacterial 16S ribosomal RNA (rRNA gene on the Illumina MiSeq platform, and the percentage of four different land use types (i.e., built area, forest, transitional, and open area within 200 m and 2000 m radiuses from each household was characterized. The quantity of doormat debris was inversely correlated with coverage of built area. The diversity of total bacterial, Proteobacterial, Actinobacterial, Bacteroidetes, and Firmicutes communities decreased as the percentage of built area increased. Their richness followed the same pattern except for Firmicutes for which no association was observed. The relative abundance of Proteobacteria and particularly Gammaproteobacteria increased, whereas that of Actinobacteria decreased with increasing built area. Neither Phylum Firmicutes nor Bacteroidetes varied with coverage of built area. Additionally, the relative abundance of potentially pathogenic bacterial families and genera increased as the percentage of built area increased. Interestingly, having domestic animals (including pets only altered the association between the richness of Gammaproteobacteria and diversity of

  10. SSU rDNA sequence diversity and seasonally differentiated distribution of nanoplanktonic ciliates in neritic Bohai and Yellow Seas as revealed by T-RFLP.

    Directory of Open Access Journals (Sweden)

    Jun Dong

    Full Text Available Nanociliates have been frequently found to be important players in the marine microbial loop, however, little is known about their diversity and distribution in coastal ecosystems. We investigated the molecular diversity and distribution patterns of nanoplanktonic oligotrich and choreotrich (OC ciliates in surface water of three neritic basins of northern China, the South Yellow Sea (SYS, North Yellow Sea (NYS, and Bohai Sea (BS in June and November 2011. SSU rRNA gene clone libraries generated from three summertime samples (sites B38, B4 and H8 were analyzed and revealed a large novel ribotype diversity, of which many were low-abundant phylotypes belonging to the subclass Oligotrichia, but divergent from described morphospecies. Based on the data of terminal-restriction fragment length polymorphism (T-RFLP analysis of all 35 samples, we found that the T-RF richness was generally higher in the SYS than in the BS, and negatively correlated with the molar ratio of P to Si. Overall, multidimensional scaling and permutational multivariate analysis of variance of the community turnover demonstrated a distinct seasonal pattern but no basin-to-basin differentiation across all samples. Nevertheless, significant community differences among basins were recognized in the winter dataset. Mantel tests showed that the environmental factors, P:Si ratio, water temperature and concentration of dissolved oxygen (DO, determined the community across all samples. However, both biogeographic distance and environment shaped the community in winter, with DO being the most important physicochemical factor. Our results indicate that the stoichiometric ratio of P:Si is a key factor, through which the phytoplankton community may be shaped, resulting in a cascade effect on the diversity and community composition of OC nanociliates in the N-rich, Si-limited coastal surface waters, and that the Yellow Sea Warm Current drives the nanociliate community, and possibly the

  11. SSU rDNA sequence diversity and seasonally differentiated distribution of nanoplanktonic ciliates in neritic Bohai and Yellow Seas as revealed by T-RFLP.

    Science.gov (United States)

    Dong, Jun; Shi, Fei; Li, Han; Zhang, Xiaoming; Hu, Xiaozhong; Gong, Jun

    2014-01-01

    Nanociliates have been frequently found to be important players in the marine microbial loop, however, little is known about their diversity and distribution in coastal ecosystems. We investigated the molecular diversity and distribution patterns of nanoplanktonic oligotrich and choreotrich (OC) ciliates in surface water of three neritic basins of northern China, the South Yellow Sea (SYS), North Yellow Sea (NYS), and Bohai Sea (BS) in June and November 2011. SSU rRNA gene clone libraries generated from three summertime samples (sites B38, B4 and H8) were analyzed and revealed a large novel ribotype diversity, of which many were low-abundant phylotypes belonging to the subclass Oligotrichia, but divergent from described morphospecies. Based on the data of terminal-restriction fragment length polymorphism (T-RFLP) analysis of all 35 samples, we found that the T-RF richness was generally higher in the SYS than in the BS, and negatively correlated with the molar ratio of P to Si. Overall, multidimensional scaling and permutational multivariate analysis of variance of the community turnover demonstrated a distinct seasonal pattern but no basin-to-basin differentiation across all samples. Nevertheless, significant community differences among basins were recognized in the winter dataset. Mantel tests showed that the environmental factors, P:Si ratio, water temperature and concentration of dissolved oxygen (DO), determined the community across all samples. However, both biogeographic distance and environment shaped the community in winter, with DO being the most important physicochemical factor. Our results indicate that the stoichiometric ratio of P:Si is a key factor, through which the phytoplankton community may be shaped, resulting in a cascade effect on the diversity and community composition of OC nanociliates in the N-rich, Si-limited coastal surface waters, and that the Yellow Sea Warm Current drives the nanociliate community, and possibly the microbial food webs

  12. Evaluation of genetic diversity of cultivated tea clones (Camellia sinensis (L. Kuntze in the eastern black sea coast by inter-simple sequence repeats (ISSRS

    Directory of Open Access Journals (Sweden)

    Beris Fatih S.

    2016-01-01

    Full Text Available Tea is the most globally consumed drink after spring water and an important breeding plant with high economical value in Turkey. In half a century, various kinds of tea cultivars have been bred in Turkey to improve the quality and yield of tea plants. Since tea reproduces sexually, tea fields vary in quality. Thus, determining the genetic diversity and relationship of the plants to support breeding and cultivation is important. In this study we aimed to determine the genetic diversity of tea cultivars breeding in the Eastern Black Sea coast of Turkey and the genetic relationship between them, to verify whether the qualitative morphological designations of the clones are genetically true by the ISSR markers. Herein, the genetic diversity and relationships of 18 Turkish tea cultivars were determined using 15 ISSR markers with sizes ranging from 250 to 3000 base pairs. The similarity indices among these cultivars were between 0.456 and 0.743. Based on cluster analysis using UPGMA, some of tea cultivars originating from the same geographical position were found to be clustered closely. Our data provide valuable information and a useful basis to assist selection and cloning experiments of tea cultivars and also help farmers to find elite parental clones for tea breeding in the Eastern Black Sea coast of Turkey.

  13. Genetic diversity and relatedness of Fasciola spp. isolates from different hosts and geographic regions revealed by analysis of mitochondrial DNA sequences.

    Science.gov (United States)

    Ai, L; Weng, Y B; Elsheikha, H M; Zhao, G H; Alasaad, S; Chen, J X; Li, J; Li, H L; Wang, C R; Chen, M X; Lin, R Q; Zhu, X Q

    2011-09-27

    The present study examined sequence variability in a portion of the mitochondrial cytochrome c oxidase subunit 1 (pcox1) and NADH dehydrogenase subunits 4 and 5 (pnad4 and pnad5) among 39 isolates of Fasciola spp., from different hosts from China, Niger, France, the United States of America, and Spain; and their phylogenetic relationships were re-constructed. Intra-species sequence variations were 0.0-1.1% for pcox1, 0.0-2.7% for pnad4, and 0.0-3.3% for pnad5 for Fasciola hepatica; 0.0-1.8% for pcox1, 0.0-2.5% for pnad4, and 0.0-4.2% for pnad5 for Fasciola gigantica, and 0.0-0.9% for pcox1, 0.0-0.2% for pnad4, and 0.0-1.1% for pnad5 for the intermediate Fasciola form. Whereas, nucleotide differences were 2.1-2.7% for pcox1, 3.1-3.3% for pnad4, and 4.2-4.8% for pnad5 between F. hepatica and F. gigantica; were 1.3-1.5% for pcox1, 2.1-2.9% for pnad4, 3.1-3.4% for pnad5 between F. hepatica and the intermediate form; and were 0.9-1.1% for pcox1, 1.4-1.8% for pnad4, 2.2-2.4% for pnad5 between F. gigantica and the intermediate form. Phylogenetic analysis based on the combined sequences of pcox1, pnad4 and pnad5 revealed distinct groupings of isolates of F. hepatica, F. gigantica, or the intermediate Fasciola form irrespective of their origin, demonstrating the usefulness of the mtDNA sequences for the delineation of Fasciola species, and reinforcing the genetic evidence for the existence of the intermediate Fasciola form. Copyright © 2011 Elsevier B.V. All rights reserved.

  14. The genetic diversity of genus Bacillus and the related genera revealed by 16S rRNA gene sequences and ardra analyses isolated from geothermal regions of turkey

    Directory of Open Access Journals (Sweden)

    Arzu Coleri Cihan

    2012-03-01

    Full Text Available Previously isolated 115 endospore-forming bacilli were basically grouped according to their temperature requirements for growth: the thermophiles (74%, the facultative thermophiles (14% and the mesophiles (12%. These isolates were taken into 16S rRNA gene sequence analyses, and they were clustered among the 7 genera: Anoxybacillus, Aeribacillus, Bacillus, Brevibacillus, Geobacillus, Paenibacillus, and Thermoactinomycetes. Of these bacilli, only the thirty two isolates belonging to genera Bacillus (16, Brevibacillus (13, Paenibacillus (1 and Thermoactinomycetes (2 were selected and presented in this paper. The comparative sequence analyses revealed that the similarity values were ranged as 91.4-100 %, 91.8- 99.2 %, 92.6- 99.8 % and 90.7 - 99.8 % between the isolates and the related type strains from these four genera, respectively. Twenty nine of them were found to be related with the validly published type strains. The most abundant species was B. thermoruber with 9 isolates followed by B. pumilus (6, B. lichenformis (3, B. subtilis (3, B. agri (3, B. smithii (2, T. vulgaris (2 and finally P. barengoltzii (1. In addition, isolates of A391a, B51a and D295 were proposed as novel species as their 16S rRNA gene sequences displayed similarities ≤ 97% to their closely related type strains. The AluI-, HaeIII- and TaqI-ARDRA results were in congruence with the 16S rRNA gene sequence analyses. The ARDRA results allowed us to differentiate these isolates, and their discriminative restriction fragments were able to be determined. Some of their phenotypic characters and their amylase, chitinase and protease production were also studied and biotechnologically valuable enzyme producing isolates were introduced in order to use in further studies.

  15. Genomic sequencing: assessing the health care system, policy, and big-data implications.

    Science.gov (United States)

    Phillips, Kathryn A; Trosman, Julia R; Kelley, Robin K; Pletcher, Mark J; Douglas, Michael P; Weldon, Christine B

    2014-07-01

    New genomic sequencing technologies enable the high-speed analysis of multiple genes simultaneously, including all of those in a person's genome. Sequencing is a prominent example of a "big data" technology because of the massive amount of information it produces and its complexity, diversity, and timeliness. Our objective in this article is to provide a policy primer on sequencing and illustrate how it can affect health care system and policy issues. Toward this end, we developed an easily applied classification of sequencing based on inputs, methods, and outputs. We used it to examine the implications of sequencing for three health care system and policy issues: making care more patient-centered, developing coverage and reimbursement policies, and assessing economic value. We conclude that sequencing has great promise but that policy challenges include how to optimize patient engagement as well as privacy, develop coverage policies that distinguish research from clinical uses and account for bioinformatics costs, and determine the economic value of sequencing through complex economic models that take into account multiple findings and downstream costs. Project HOPE—The People-to-People Health Foundation, Inc.

  16. Viral metagenomics: Analysis of begomoviruses by illumina high-throughput sequencing

    KAUST Repository

    Idris, Ali

    2014-03-12

    Traditional DNA sequencing methods are inefficient, lack the ability to discern the least abundant viral sequences, and ineffective for determining the extent of variability in viral populations. Here, populations of single-stranded DNA plant begomoviral genomes and their associated beta- and alpha-satellite molecules (virus-satellite complexes) (genus, Begomovirus; family, Geminiviridae) were enriched from total nucleic acids isolated from symptomatic, field-infected plants, using rolling circle amplification (RCA). Enriched virus-satellite complexes were subjected to Illumina-Next Generation Sequencing (NGS). CASAVA and SeqMan NGen programs were implemented, respectively, for quality control and for de novo and reference-guided contig assembly of viral-satellite sequences. The authenticity of the begomoviral sequences, and the reproducibility of the Illumina-NGS approach for begomoviral deep sequencing projects, were validated by comparing NGS results with those obtained using traditional molecular cloning and Sanger sequencing of viral components and satellite DNAs, also enriched by RCA or amplified by polymerase chain reaction. As the use of NGS approaches, together with advances in software development, make possible deep sequence coverage at a lower cost; the approach described herein will streamline the exploration of begomovirus diversity and population structure from naturally infected plants, irrespective of viral abundance. This is the first report of the implementation of Illumina-NGS to explore the diversity and identify begomoviral-satellite SNPs directly from plants naturally-infected with begomoviruses under field conditions. 2014 by the authors; licensee MDPI, Basel, Switzerland.

  17. PuLSE: Quality control and quantification of peptide sequences explored by phage display libraries.

    Science.gov (United States)

    Shave, Steven; Mann, Stefan; Koszela, Joanna; Kerr, Alastair; Auer, Manfred

    2018-01-01

    The design of highly diverse phage display libraries is based on assumption that DNA bases are incorporated at similar rates within the randomized sequence. As library complexity increases and expected copy numbers of unique sequences decrease, the exploration of library space becomes sparser and the presence of truly random sequences becomes critical. We present the program PuLSE (Phage Library Sequence Evaluation) as a tool for assessing randomness and therefore diversity of phage display libraries. PuLSE runs on a collection of sequence reads in the fastq file format and generates tables profiling the library in terms of unique DNA sequence counts and positions, translated peptide sequences, and normalized 'expected' occurrences from base to residue codon frequencies. The output allows at-a-glance quantitative quality control of a phage library in terms of sequence coverage both at the DNA base and translated protein residue level, which has been missing from toolsets and literature. The open source program PuLSE is available in two formats, a C++ source code package for compilation and integration into existing bioinformatics pipelines and precompiled binaries for ease of use.

  18. Viral Metagenomics: Analysis of Begomoviruses by Illumina High-Throughput Sequencing

    Directory of Open Access Journals (Sweden)

    Ali Idris

    2014-03-01

    Full Text Available Traditional DNA sequencing methods are inefficient, lack the ability to discern the least abundant viral sequences, and ineffective for determining the extent of variability in viral populations. Here, populations of single-stranded DNA plant begomoviral genomes and their associated beta- and alpha-satellite molecules (virus-satellite complexes (genus, Begomovirus; family, Geminiviridae were enriched from total nucleic acids isolated from symptomatic, field-infected plants, using rolling circle amplification (RCA. Enriched virus-satellite complexes were subjected to Illumina-Next Generation Sequencing (NGS. CASAVA and SeqMan NGen programs were implemented, respectively, for quality control and for de novo and reference-guided contig assembly of viral-satellite sequences. The authenticity of the begomoviral sequences, and the reproducibility of the Illumina-NGS approach for begomoviral deep sequencing projects, were validated by comparing NGS results with those obtained using traditional molecular cloning and Sanger sequencing of viral components and satellite DNAs, also enriched by RCA or amplified by polymerase chain reaction. As the use of NGS approaches, together with advances in software development, make possible deep sequence coverage at a lower cost; the approach described herein will streamline the exploration of begomovirus diversity and population structure from naturally infected plants, irrespective of viral abundance. This is the first report of the implementation of Illumina-NGS to explore the diversity and identify begomoviral-satellite SNPs directly from plants naturally-infected with begomoviruses under field conditions.

  19. High-throughput sequencing for the detection of the bacterial and fungal diversity in Mongolian naturally fermented cow's milk in Russia.

    Science.gov (United States)

    Liu, Wenjun; Zheng, Yi; Kwok, Lai-Yu; Sun, Zhihong; Zhang, Jiachao; Guo, Zhuang; Hou, Qiangchuan; Menhe, Bilige; Zhang, Heping

    2015-02-22

    Traditional fermented dairy products are major components of the typical Mongolian diet since ancient times. However, almost all the previous studies on the microbial composition of traditional Mongolian fermented dairy products analyzed food samples from the Chinese Mongolian region and Mongolia but not the Russian Mongolian region. In this study, the bacterial and fungal community diversity of nineteen naturally fermented cow's milk (NFCM) samples from local Mongolian families residing in Kalmykia and Chita of Russia was investigated with pyrosequencing. Firmicutes and Ascomycota were the predominant phyla respectively for bacteria and fungi. The abundance of the bacterial phylum Acidobacteria was considerably different between the samples from the two regions. At genus level, Lactobacillus and Pichia were the predominating bacterial and fungal genera, respectively, while six bacterial genera significantly differed between the Kalmykia (enrichment of Aeromonas, Bacillus, Clostridium, Streptococcus, Vogesella) and Chita (enrichment of Lactococcus) samples. The results of principal coordinate analysis (PCoA) based on the bacterial or fungal composition of the Kalmykia and Chita samples revealed a different microbiota structure between the samples collected in these two locations. The redundancy analysis (RDA) identified 60 bacterial and 21 fungal OTUs as the key variables responsible for such microbiota structural difference. Our results suggest that structural differences existed in the microbiota of NFCM between Kalmykia and Chita. The difference in geographic environment may be an important factor influencing the microbial diversity of NFCM made by the Mongolians in Russia.

  20. Development of novel simple sequence repeat markers in bitter gourd (Momordica charantia L.) through enriched genomic libraries and their utilization in analysis of genetic diversity and cross-species transferability.

    Science.gov (United States)

    Saxena, Swati; Singh, Archana; Archak, Sunil; Behera, Tushar K; John, Joseph K; Meshram, Sudhir U; Gaikwad, Ambika B

    2015-01-01

    Microsatellite or simple sequence repeat (SSR) markers are the preferred markers for genetic analyses of crop plants. The availability of a limited number of such markers in bitter gourd (Momordica charantia L.) necessitates the development and characterization of more SSR markers. These were developed from genomic libraries enriched for three dinucleotide, five trinucleotide, and two tetranucleotide core repeat motifs. Employing the strategy of polymerase chain reaction-based screening, the number of clones to be sequenced was reduced by 81 % and 93.7 % of the sequenced clones contained in microsatellite repeats. Unique primer-pairs were designed for 160 microsatellite loci, and amplicons of expected length were obtained for 151 loci (94.4 %). Evaluation of diversity in 54 bitter gourd accessions at 51 loci indicated that 20 % of the loci were polymorphic with the polymorphic information content values ranging from 0.13 to 0.77. Fifteen Indian varieties were clearly distinguished indicative of the usefulness of the developed markers. Markers at 40 loci (78.4 %) were transferable to six species, viz. Momordica cymbalaria, Momordica subangulata subsp. renigera, Momordica balsamina, Momordica dioca, Momordica cochinchinesis, and Momordica sahyadrica. The microsatellite markers reported will be useful in various genetic and molecular genetic studies in bitter gourd, a cucurbit of immense nutritive, medicinal, and economic importance.

  1. Molecular characterization of Vulmar1, a complete mariner transposon of sugar beet and diversity of mariner- and En/Spm-like sequences in the genus Beta.

    Science.gov (United States)

    Jacobs, Gunnar; Dechyeva, Daryna; Menzel, Gerhard; Dombrowski, Cora; Schmidt, Thomas

    2004-12-01

    Transposons of the Tc1-mariner superfamily are widespread in eukaryotic genomes. We have isolated the mariner element Vulmar1 from Beta vulgaris L., which is 3909 bp long and bordered by perfect terminal inverted repeats of 32 bp with homology to terminal inverted repeats of transposons from soybean and rice. According to a characteristic amino acid signature, Vulmar1 can be assigned to the DD39D group of mariner transposons. Vulmar1 is flanked by a 5'-TA-3' target site duplication that is typical for mariner transposons. Southern hybridization revealed that mariner-like copies are highly abundant in Beta species, and sequence analysis of 10 transposase fragments from representative species of the four Beta sections revealed an identity between 34% and 100% after conceptual translation. By fluorescent in situ hybridization, Vulmar1 was detected in distal euchromatin as well as in some intercalary and pericentromeric regions of all B. vulgaris chromosomes. In addition, using PCR, we were able to amplify fragments of the transposase gene of En/Spm-like transposons in the genus Beta. En/Spm-like transposase sequences are highly amplified in four Beta sections and showed a considerable degree of conservation (88.5-100%) at the protein level, while the homology to corresponding regions of En/Spm transposons of other plant species ranges from 49.5% to 62.5%. By fluorescent in situ hybridization, En/Spm-like transposon signals of strong intensity were detected on all chromosomes of B. vulgaris.

  2. De novo assembly of highly diverse viral populations

    Directory of Open Access Journals (Sweden)

    Yang Xiao

    2012-09-01

    Full Text Available Abstract Background Extensive genetic diversity in viral populations within infected hosts and the divergence of variants from existing reference genomes impede the analysis of deep viral sequencing data. A de novo population consensus assembly is valuable both as a single linear representation of the population and as a backbone on which intra-host variants can be accurately mapped. The availability of consensus assemblies and robustly mapped variants are crucial to the genetic study of viral disease progression, transmission dynamics, and viral evolution. Existing de novo assembly techniques fail to robustly assemble ultra-deep sequence data from genetically heterogeneous populations such as viruses into full-length genomes due to the presence of extensive genetic variability, contaminants, and variable sequence coverage. Results We present VICUNA, a de novo assembly algorithm suitable for generating consensus assemblies from genetically heterogeneous populations. We demonstrate its effectiveness on Dengue, Human Immunodeficiency and West Nile viral populations, representing a range of intra-host diversity. Compared to state-of-the-art assemblers designed for haploid or diploid systems, VICUNA recovers full-length consensus and captures insertion/deletion polymorphisms in diverse samples. Final assemblies maintain a high base calling accuracy. VICUNA program is publicly available at: http://www.broadinstitute.org/scientific-community/science/projects/viral-genomics/ viral-genomics-analysis-software. Conclusions We developed VICUNA, a publicly available software tool, that enables consensus assembly of ultra-deep sequence derived from diverse viral populations. While VICUNA was developed for the analysis of viral populations, its application to other heterogeneous sequence data sets such as metagenomic or tumor cell population samples may prove beneficial in these fields of research.

  3. DNA sequence variants in the LOXL1 gene are associated with pseudoexfoliation glaucoma in a U.S. clinic-based population with broad ethnic diversity

    Directory of Open Access Journals (Sweden)

    Miller Joan W

    2008-02-01

    Full Text Available Abstract Background Pseudoexfoliation syndrome is a major risk factor for glaucoma in many populations throughout the world. Using a U.S. clinic-based case control sample with broad ethnic diversity, we show that three common SNPs in LOXL1 previously associated with pseudoexfoliation in Nordic populations are significantly associated with pseudoexfoliation syndrome and pseudoexfoliation glaucoma. Methods Three LOXL1 SNPs were genotyped in a patient sample (206 pseudoexfoliation, 331 primary open angle glaucoma, and 88 controls from the Glaucoma Consultation Service at the Massachusetts Eye and Ear Infirmary. The SNPs were evaluation for association with pseudeoexfoliation syndrome, pseudoexfoliation glaucoma, and primary open angle glaucoma. Results The strongest association was found for the G allele of marker rs3825942 (G153D with a frequency of 99% in pseudoexfoliation patients (with and without glaucoma compared with 79% in controls (p = 1.6 × 10-15; OR = 20.93, 95%CI: 8.06, 54.39. The homozygous GG genotype is also associated with pseudoexfoliation when compared to controls (p = 1.2 × 10-12; OR = 23.57, 95%CI: 7.95, 69.85. None of the SNPs were significantly associated with primary open angle glaucoma. Conclusion The pseudoexfoliation syndrome is a common cause of glaucoma. These results indicate that the G153D LOXL1 variant is significantly associated with an increased risk of pseudoexfoliation and pseudoexfoliation glaucoma in an ethnically diverse patient population from the Northeastern United States. Given the high prevalence of pseudooexfoliation in this geographic region, these results also indicate that the G153D LOXL1 variant is a significant risk factor for adult-onset glaucoma in this clinic based population.

  4. Molecular characterization and clonal diversity of meticillin-resistant Staphylococcus aureus isolated from the community in Spain: emergence of clone sequence type 72.

    Science.gov (United States)

    Potel, C; Rey, S; Otero, S; Rubio, J; Álvarez, M

    2016-08-01

    Sequence type 72 meticillin-resistant Staphylococcus aureus (ST72 MRSA) was recently detected in our hospital. Although in Europe this clone is rarely isolated, it is the leading cause of community-associated MRSA infections in Korea, spreading also into hospitals, where it has also emerged as the main MRSA clone recovered from raw meat. We studied MRSA isolated from outpatients in Spain during a nine-year period. More than 70% of the isolates belonged to predominant clones found in hospitals. There was a significant increase in the ST72 prevalence. It appears that boundaries of dominance among MRSA clones have become blurred, demanding continuous surveillance. Copyright © 2016 The Healthcare Infection Society. Published by Elsevier Ltd. All rights reserved.

  5. SNP design from 454 sequencing of Podosphaera plantaginis transcriptome reveals a genetically diverse pathogen metapopulation with high levels of mixed-genotype infection.

    Directory of Open Access Journals (Sweden)

    Charlotte Tollenaere

    Full Text Available Molecular tools may greatly improve our understanding of pathogen evolution and epidemiology but technical constraints have hindered the development of genetic resources for parasites compared to free-living organisms. This study aims at developing molecular tools for Podosphaera plantaginis, an obligate fungal pathogen of Plantago lanceolata. This interaction has been intensively studied in the Åland archipelago of Finland with epidemiological data collected from over 4,000 host populations annually since year 2001.A cDNA library of a pooled sample of fungal conidia was sequenced on the 454 GS-FLX platform. Over 549,411 reads were obtained and annotated into 45,245 contigs. Annotation data was acquired for 65.2% of the assembled sequences. The transcriptome assembly was screened for SNP loci, as well as for functionally important genes (mating-type genes and potential effector proteins. A genotyping assay of 27 SNP loci was designed and tested on 380 infected leaf samples from 80 populations within the Åland archipelago. With this panel we identified 85 multilocus genotypes (MLG with uneven frequencies across the pathogen metapopulation. Approximately half of the sampled populations contain polymorphism. Our genotyping protocol revealed mixed-genotype infection within a single host leaf to be common. Mixed infection has been proposed as one of the main drivers of pathogen evolution, and hence may be an important process in this pathosystem.The developed SNP panel offers exciting research perspectives for future studies in this well-characterized pathosystem. Also, the transcriptome provides an invaluable novel genomic resource for powdery mildews, which cause significant yield losses on commercially important crops annually. Furthermore, the features that render genetic studies in this system a challenge are shared with the majority of obligate parasitic species, and hence our results provide methodological insights from SNP calling to field

  6. Microbial Community and Biochemical Dynamics of Biological Soil Crusts across a Gradient of Surface Coverage in the Central Mojave Desert.

    Science.gov (United States)

    Mogul, Rakesh; Vaishampayan, Parag; Bashir, Mina; McKay, Chris P; Schubert, Keith; Bornaccorsi, Rosalba; Gomez, Ernesto; Tharayil, Sneha; Payton, Geoffrey; Capra, Juliana; Andaya, Jessica; Bacon, Leonard; Bargoma, Emily; Black, David; Boos, Katie; Brant, Michaela; Chabot, Michael; Chau, Danny; Cisneros, Jessica; Chu, Geoff; Curnutt, Jane; DiMizio, Jessica; Engelbrecht, Christian; Gott, Caroline; Harnoto, Raechel; Hovanesian, Ruben; Johnson, Shane; Lavergne, Britne; Martinez, Gabriel; Mans, Paul; Morales, Ernesto; Oei, Alex; Peplow, Gary; Piaget, Ryan; Ponce, Nicole; Renteria, Eduardo; Rodriguez, Veronica; Rodriguez, Joseph; Santander, Monica; Sarmiento, Khamille; Scheppelmann, Allison; Schroter, Gavin; Sexton, Devan; Stephenson, Jenin; Symer, Kristin; Russo-Tait, Tatiane; Weigel, Bill; Wilhelm, Mary B

    2017-01-01

    In this study, we expand upon the biogeography of biological soil crusts (BSCs) and provide molecular insights into the microbial community and biochemical dynamics along the vertical BSC column structure, and across a transect of increasing BSC surface coverage in the central Mojave Desert, CA, United States. Next generation sequencing reveals a bacterial community profile that is distinct among BSCs in the southwestern United States. Distribution of major phyla in the BSC topsoils included Cyanobacteria (33 ± 8%), Proteobacteria (26 ± 6%), and Chloroflexi (12 ± 4%), with Phormidium being the numerically dominant genus. Furthermore, BSC subsurfaces contained Proteobacteria (23 ± 5%), Actinobacteria (20 ± 5%), and Chloroflexi (18 ± 3%), with an unidentified genus from Chloroflexi (AKIW781, order) being numerically dominant. Across the transect, changes in distribution at the phylum ( p < 0.0439) and genus ( p < 0.006) levels, including multiple biochemical and geochemical trends ( p < 0.05), positively correlated with increasing BSC surface coverage. This included increases in (a) Chloroflexi abundance, (b) abundance and diversity of Cyanobacteria, (b) OTU-level diversity in the topsoil, (c) OTU-level differentiation between the topsoil and subsurface, (d) intracellular ATP abundances and catalase activities, and (e) enrichments in clay, silt, and varying elements, including S, Mn, Co, As, and Pb, in the BSC topsoils. In sum, these studies suggest that BSCs from regions of differing surface coverage represent early successional stages, which exhibit increasing bacterial diversity, metabolic activities, and capacity to restructure the soil. Further, these trends suggest that BSC successional maturation and colonization across the transect are inhibited by metals/metalloids such as B, Ca, Ti, Mn, Co, Ni, Mo, and Pb.

  7. Microbial Community and Biochemical Dynamics of Biological Soil Crusts across a Gradient of Surface Coverage in the Central Mojave Desert

    Directory of Open Access Journals (Sweden)

    Rakesh Mogul

    2017-10-01

    Full Text Available In this study, we expand upon the biogeography of biological soil crusts (BSCs and provide molecular insights into the microbial community and biochemical dynamics along the vertical BSC column structure, and across a transect of increasing BSC surface coverage in the central Mojave Desert, CA, United States. Next generation sequencing reveals a bacterial community profile that is distinct among BSCs in the southwestern United States. Distribution of major phyla in the BSC topsoils included Cyanobacteria (33 ± 8%, Proteobacteria (26 ± 6%, and Chloroflexi (12 ± 4%, with Phormidium being the numerically dominant genus. Furthermore, BSC subsurfaces contained Proteobacteria (23 ± 5%, Actinobacteria (20 ± 5%, and Chloroflexi (18 ± 3%, with an unidentified genus from Chloroflexi (AKIW781, order being numerically dominant. Across the transect, changes in distribution at the phylum (p < 0.0439 and genus (p < 0.006 levels, including multiple biochemical and geochemical trends (p < 0.05, positively correlated with increasing BSC surface coverage. This included increases in (a Chloroflexi abundance, (b abundance and diversity of Cyanobacteria, (b OTU-level diversity in the topsoil, (c OTU-level differentiation between the topsoil and subsurface, (d intracellular ATP abundances and catalase activities, and (e enrichments in clay, silt, and varying elements, including S, Mn, Co, As, and Pb, in the BSC topsoils. In sum, these studies suggest that BSCs from regions of differing surface coverage represent early successional stages, which exhibit increasing bacterial diversity, metabolic activities, and capacity to restructure the soil. Further, these trends suggest that BSC successional maturation and colonization across the transect are inhibited by metals/metalloids such as B, Ca, Ti, Mn, Co, Ni, Mo, and Pb.

  8. HLA diversity in the 1000 genomes dataset.

    Directory of Open Access Journals (Sweden)

    Pierre-Antoine Gourraud

    Full Text Available The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation by sequencing at a level that should allow the genome-wide detection of most variants with frequencies as low as 1%. However, in the major histocompatibility complex (MHC, only the top 10 most frequent haplotypes are in the 1% frequency range whereas thousands of haplotypes are present at lower frequencies. Given the limitation of both the coverage and the read length of the sequences generated by the 1000 Genomes Project, the highly variable positions that define HLA alleles may be difficult to identify. We used classical Sanger sequencing techniques to type the HLA-A, HLA-B, HLA-C, HLA-DRB1 and HLA-DQB1 genes in the available 1000 Genomes samples and combined the results with the 103,310 variants in the MHC region genotyped by the 1000 Genomes Project. Using pairwise identity-by-descent distances between individuals and principal component analysis, we established the relationship between ancestry and genetic diversity in the MHC region. As expected, both the MHC variants and the HLA phenotype can identify the major ancestry lineage, informed mainly by the most frequent HLA haplotypes. To some extent, regions of the genome with similar genetic or similar recombination rate have similar properties. An MHC-centric analysis underlines departures between the ancestral background of the MHC and the genome-wide picture. Our analysis of linkage disequilibrium (LD decay in these samples suggests that overestimation of pairwise LD occurs due to a limited sampling of the MHC diversity. This collection of HLA-specific MHC variants, available on the dbMHC portal, is a valuable resource for future analyses of the role of MHC in population and disease studies.

  9. Assuring Access to Affordable Coverage

    Data.gov (United States)

    U.S. Department of Health & Human Services — Under the Affordable Care Act, millions of uninsured Americans will gain access to affordable coverage through Affordable Insurance Exchanges and improvements in...

  10. Diversity of antisense and other non-coding RNAs in Archaea revealed by comparative small RNA sequencing in four Pyrobaculum species

    Directory of Open Access Journals (Sweden)

    David L Bernick

    2012-07-01

    Full Text Available A great diversity of small, non-coding RNA molecules with roles in gene regulation and RNA processing have been intensely studied in eukaryotic and bacterial model organisms, yet our knowledge of possible parallel roles for small RNAs in archaea is limited. We employed RNA-seq to identify novel small RNA across multiple species of the hyperthermophilic genus Pyrobaculum, known for unusual RNA gene characteristics. By comparing transcriptional data collected in parallel among four species, we were able to identify conserved RNA genes fitting into known and novel families. Among our findings, we highlight three novel cis-antisense small RNAs encoded opposite to key regulatory (ferric uptake regulator, metabolic (triose-phosphate isomerase, and core transcriptional apparatus genes (transcription factor B. We also found a large increase in the number of conserved C/D box small RNA genes over what had been previously recognized; many of these genes are encoded antisense to protein coding genes. The conserved opposition to orthologous genes across the Pyrobaculum genus suggests similarities to other cis-antisense regulatory systems. Furthermore, the genus-specific nature of these small RNAs indicates they are relatively recent, stable adaptations.

  11. Targeted next-generation sequencing reveals novel USH2A mutations associated with diverse disease phenotypes: implications for clinical and molecular diagnosis.

    Science.gov (United States)

    Chen, Xue; Sheng, Xunlun; Liu, Xiaoxing; Li, Huiping; Liu, Yani; Rong, Weining; Ha, Shaoping; Liu, Wenzhou; Kang, Xiaoli; Zhao, Kanxing; Zhao, Chen

    2014-01-01

    USH2A mutations have been implicated in the disease etiology of several inherited diseases, including Usher syndrome type 2 (USH2), nonsyndromic retinitis pigmentosa (RP), and nonsyndromic deafness. The complex genetic and phenotypic spectrums relevant to USH2A defects make it difficult to manage patients with such mutations. In the present study, we aim to determine the genetic etiology and to characterize the correlated clinical phenotypes for three Chinese pedigrees with nonsyndromic RP, one with RP sine pigmento (RPSP), and one with USH2. Family histories and clinical details for all included patients were reviewed. Ophthalmic examinations included best corrected visual acuities, visual field measurements, funduscopy,