WorldWideScience

Sample records for multilocus sequence analysis

  1. Multilocus sequence analysis of phytopathogenic species of the genus Streptomyces

    Science.gov (United States)

    The identification and classification of species within the genus Streptomyces is difficult because there are presently 576 validly described species and this number increases every year. The value of the application of multilocus sequence analysis scheme to the systematics of Streptomyces species h...

  2. Molecular characterization of Giardia psittaci by multilocus sequence analysis.

    Science.gov (United States)

    Abe, Niichiro; Makino, Ikuko; Kojima, Atsushi

    2012-12-01

    Multilocus sequence analyses targeting small subunit ribosomal DNA (SSU rDNA), elongation factor 1 alpha (ef1α), glutamate dehydrogenase (gdh), and beta giardin (β-giardin) were performed on Giardia psittaci isolates from three Budgerigars (Melopsittacus undulatus) and four Barred parakeets (Bolborhynchus lineola) kept in individual households or imported from overseas. Nucleotide differences and phylogenetic analyses at four loci indicate the distinction of G. psittaci from the other known Giardia species: Giardia muris, Giardia microti, Giardia ardeae, and Giardia duodenalis assemblages. Furthermore, G. psittaci was related more closely to G. duodenalis than to the other known Giardia species, except for G. microti. Conflicting signals regarded as "double peaks" were found at the same nucleotide positions of the ef1α in all isolates. However, the sequences of the other three loci, including gdh and β-giardin, which are known to be highly variable, from all isolates were also mutually identical at every locus. They showed no double peaks. These results suggest that double peaks found in the ef1α sequences are caused not by mixed infection with genetically different G. psittaci isolates but by allelic sequence heterogeneity (ASH), which is observed in diplomonad lineages including G. duodenalis. No sequence difference was found in any G. psittaci isolates at the gdh and β-giardin loci, suggesting that G. psittaci is indeed not more diverse genetically than other Giardia species. This report is the first to provide evidence related to the genetic characteristics of G. psittaci obtained using multilocus sequence analysis. Copyright © 2012 Elsevier B.V. All rights reserved.

  3. Multilocus sequence analysis of Treponema denticola strains of diverse origin

    Directory of Open Access Journals (Sweden)

    Mo Sisu

    2013-02-01

    Background: The oral spirochete bacterium Treponema denticola is associated with both the incidence and severity of periodontal disease. Although the biological or phenotypic properties of a significant number of T. denticola isolates have been reported in the literature, their genetic diversity or phylogeny has never been systematically investigated. Here, we describe a multilocus sequence analysis (MLSA) of 20 of the most highly studied reference strains and clinical isolates of T. denticola, which were originally isolated from subgingival plaque samples taken from subjects from China, Japan, the Netherlands, Canada and the USA. Results: The sequences of the 16S ribosomal RNA gene and 7 conserved protein-encoding genes (flaA, recA, pyrH, ppnK, dnaN, era and radC) were successfully determined for each strain. Sequence data were analyzed using a variety of bioinformatic and phylogenetic software tools. We found no evidence of positive selection or DNA recombination within the protein-encoding genes, where levels of intraspecific sequence polymorphism varied from 18.8% (flaA) to 8.9% (dnaN). Phylogenetic analysis of the concatenated protein-encoding gene sequence data (ca. 6,513 nucleotides for each strain) using Bayesian and maximum likelihood approaches indicated that the T. denticola strains were monophyletic, and formed 6 well-defined clades. All analyzed T. denticola strains appeared to have a genetic origin distinct from that of ‘Treponema vincentii’ or Treponema pallidum. No specific geographical relationships could be established, but several strains isolated from different continents appear to be closely related at the genetic level. Conclusions: Our analyses indicate that previous biological and biophysical investigations have predominantly focused on a subset of T. denticola strains with a relatively narrow range of genetic diversity. Our methodology and results establish a genetic framework for the discrimination and phylogenetic

  4. Multilocus Sequence Typing

    OpenAIRE

    Ibarz Pavón, Ana Belén; Maiden, Martin C.J.

    2009-01-01

    Multilocus sequence typing (MLST) was first proposed in 1998 as a typing approach that enables the unambiguous characterization of bacterial isolates in a standardized, reproducible, and portable manner using the human pathogen Neisseria meningitidis as the exemplar organism. Since then, the approach has been applied to a large and growing number of organisms by public health laboratories and research institutions. MLST data, shared by investigators over the world via the Internet, have been ...

  5. Multilocus Sequence Analysis for Typing Leptospira interrogans and Leptospira kirschneri

    Science.gov (United States)

    Leon, Albertine; Pronost, Stéphane; Fortier, Guillaume; Andre-Fontaine, Geneviève; Leclercq, Roland

    2010-01-01

    Fifty-three strains belonging to the pathogenic species Leptospira interrogans and Leptospira kirschneri were analyzed by multilocus sequence analysis. The species formed two distinct branches. In the L. interrogans branch, the phylogenetic tree clustered the strains into three subgroups. Genogroups and serogroups were superimposed but not strictly. PMID:19955271

  6. Multilocus Sequence Analysis for Typing Leptospira interrogans and Leptospira kirschneri

    OpenAIRE

    Leon, Albertine; Pronost, Stéphane; Fortier, Guillaume; Andre-Fontaine, Geneviève; Leclercq, Roland

    2009-01-01

    Fifty-three strains belonging to the pathogenic species Leptospira interrogans and Leptospira kirschneri were analyzed by multilocus sequence analysis. The species formed two distinct branches. In the L. interrogans branch, the phylogenetic tree clustered the strains into three subgroups. Genogroups and serogroups were superimposed but not strictly.

  7. Differentiation of Xylella fastidiosa Strains via Multilocus Sequence Analysis of Environmentally Mediated Genes (MLSA-E)

    OpenAIRE

    Parker, Jennifer K.; Havird, Justin C.; De La Fuente, Leonardo

    2012-01-01

    Isolates of the plant pathogen Xylella fastidiosa are genetically very similar, but studies on their biological traits have indicated differences in virulence and infection symptomatology. Taxonomic analyses have identified several subspecies, and phylogenetic analyses of housekeeping genes have shown broad host-based genetic differences; however, results are still inconclusive for genetic differentiation of isolates within subspecies. This study employs multilocus sequence analysis of enviro...

  8. Multilocus Sequence Analysis and rpoB Sequencing of Mycobacterium abscessus (Sensu Lato) Strains

    Science.gov (United States)

    Macheras, Edouard; Roux, Anne-Laure; Bastian, Sylvaine; Leão, Sylvia Cardoso; Palaci, Moises; Sivadon-Tardy, Valérie; Gutierrez, Cristina; Richter, Elvira; Rüsch-Gerdes, Sabine; Pfyffer, Gaby; Bodmer, Thomas; Cambau, Emmanuelle; Gaillard, Jean-Louis; Heym, Beate

    2011-01-01

    Mycobacterium abscessus, Mycobacterium bolletii, and Mycobacterium massiliense (Mycobacterium abscessus sensu lato) are closely related species that currently are identified by the sequencing of the rpoB gene. However, recent studies show that rpoB sequencing alone is insufficient to discriminate between these species, and some authors have questioned their current taxonomic classification. We studied here a large collection of M. abscessus (sensu lato) strains by partial rpoB sequencing (752 bp) and multilocus sequence analysis (MLSA). The final MLSA scheme developed was based on the partial sequences of eight housekeeping genes: argH, cya, glpK, gnd, murC, pgm, pta, and purH. The strains studied included the three type strains (M. abscessus CIP 104536T, M. massiliense CIP 108297T, and M. bolletii CIP 108541T) and 120 isolates recovered between 1997 and 2007 in France, Germany, Switzerland, and Brazil. The rpoB phylogenetic tree confirmed the existence of three main clusters, each comprising the type strain of one species. However, divergence values between the M. massiliense and M. bolletii clusters all were below 3% and between the M. abscessus and M. massiliense clusters were from 2.66 to 3.59%. The tree produced using the concatenated MLSA gene sequences (4,071 bp) also showed three main clusters, each comprising the type strain of one species. The M. abscessus cluster had a bootstrap value of 100% and was mostly compact. Bootstrap values for the M. massiliense and M. bolletii branches were much lower (71 and 61%, respectively), with the M. massiliense cluster having a fuzzy aspect. Mean (range) divergence values were 2.17% (1.13 to 2.58%) between the M. abscessus and M. massiliense clusters, 2.37% (1.5 to 2.85%) between the M. abscessus and M. bolletii clusters, and 2.28% (0.86 to 2.68%) between the M. massiliense and M. bolletii clusters. Adding the rpoB sequence to the MLSA-concatenated sequence (total sequence, 4,823 bp) had little effect on the clustering

  9. Multilocus sequence analysis and rpoB sequencing of Mycobacterium abscessus (sensu lato) strains.

    Science.gov (United States)

    Macheras, Edouard; Roux, Anne-Laure; Bastian, Sylvaine; Leão, Sylvia Cardoso; Palaci, Moises; Sivadon-Tardy, Valérie; Gutierrez, Cristina; Richter, Elvira; Rüsch-Gerdes, Sabine; Pfyffer, Gaby; Bodmer, Thomas; Cambau, Emmanuelle; Gaillard, Jean-Louis; Heym, Beate

    2011-02-01

    Mycobacterium abscessus, Mycobacterium bolletii, and Mycobacterium massiliense (Mycobacterium abscessus sensu lato) are closely related species that currently are identified by the sequencing of the rpoB gene. However, recent studies show that rpoB sequencing alone is insufficient to discriminate between these species, and some authors have questioned their current taxonomic classification. We studied here a large collection of M. abscessus (sensu lato) strains by partial rpoB sequencing (752 bp) and multilocus sequence analysis (MLSA). The final MLSA scheme developed was based on the partial sequences of eight housekeeping genes: argH, cya, glpK, gnd, murC, pgm, pta, and purH. The strains studied included the three type strains (M. abscessus CIP 104536(T), M. massiliense CIP 108297(T), and M. bolletii CIP 108541(T)) and 120 isolates recovered between 1997 and 2007 in France, Germany, Switzerland, and Brazil. The rpoB phylogenetic tree confirmed the existence of three main clusters, each comprising the type strain of one species. However, divergence values between the M. massiliense and M. bolletii clusters all were below 3% and between the M. abscessus and M. massiliense clusters were from 2.66 to 3.59%. The tree produced using the concatenated MLSA gene sequences (4,071 bp) also showed three main clusters, each comprising the type strain of one species. The M. abscessus cluster had a bootstrap value of 100% and was mostly compact. Bootstrap values for the M. massiliense and M. bolletii branches were much lower (71 and 61%, respectively), with the M. massiliense cluster having a fuzzy aspect. Mean (range) divergence values were 2.17% (1.13 to 2.58%) between the M. abscessus and M. massiliense clusters, 2.37% (1.5 to 2.85%) between the M. abscessus and M. bolletii clusters, and 2.28% (0.86 to 2.68%) between the M. massiliense and M. bolletii clusters. Adding the rpoB sequence to the MLSA-concatenated sequence (total sequence, 4,823 bp) had little effect on the

  10. Genetic diversity analysis of Leuconostoc mesenteroides from Korean vegetables and food products by multilocus sequence typing.

    Science.gov (United States)

    Sharma, Anshul; Kaur, Jasmine; Lee, Sulhee; Park, Young-Seo

    2018-06-01

    In the present study, 35 Leuconostoc mesenteroides strains isolated from vegetables and food products from South Korea were studied by multilocus sequence typing (MLST) of seven housekeeping genes (atpA, groEL, gyrB, pheS, pyrG, rpoA, and uvrC). The fragment sizes of the seven amplified housekeeping genes ranged from 366 to 1414 bp. Sequence analysis indicated 27 different sequence types (STs), with 25 of them represented by a single strain, indicating high genetic diversity, whereas the remaining 2 were represented by five strains each. In total, 220 polymorphic nucleotide sites were detected among the seven housekeeping genes. The phylogenetic analysis based on the STs of the seven loci indicated that the 35 strains belonged to two major groups, A (28 strains) and B (7 strains). Split decomposition analysis showed that intraspecies recombination played a role in generating diversity among strains. The minimum spanning tree showed that the evolution of the STs was not correlated with food source. This study indicates that multilocus sequence typing is a valuable tool to assess the genetic diversity among L. mesenteroides strains from South Korea and can be used further to monitor evolutionary changes.
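
    The way an MLST scheme turns per-locus sequences into sequence types (STs) can be illustrated with a minimal Python sketch. It is not the method or software used in the study above. The function name, the two-locus toy data, and the numbering of alleles and STs in order of first appearance are all assumptions made for illustration; curated schemes such as those hosted on PubMLST assign these numbers centrally.

```python
from typing import Dict, List, Tuple


def assign_sequence_types(
    isolates: Dict[str, Dict[str, str]], loci: List[str]
) -> Dict[str, Tuple[int, Tuple[int, ...]]]:
    """Map each isolate to (ST number, allelic profile).

    Hypothetical helper: allele and ST numbers are assigned in order of
    first appearance, which only mimics what a curated database does.
    """
    allele_ids: Dict[str, Dict[str, int]] = {locus: {} for locus in loci}
    st_ids: Dict[Tuple[int, ...], int] = {}
    result: Dict[str, Tuple[int, Tuple[int, ...]]] = {}

    for name, seqs in isolates.items():
        alleles = []
        for locus in loci:
            table = allele_ids[locus]
            seq = seqs[locus].upper()
            # Every distinct sequence at a locus is a distinct allele.
            if seq not in table:
                table[seq] = len(table) + 1
            alleles.append(table[seq])
        profile = tuple(alleles)
        # Every distinct combination of alleles is a distinct ST.
        if profile not in st_ids:
            st_ids[profile] = len(st_ids) + 1
        result[name] = (st_ids[profile], profile)
    return result


if __name__ == "__main__":
    # Toy sequences, invented for illustration only.
    toy = {
        "iso1": {"gyrB": "ATGC", "rpoA": "GGTA"},
        "iso2": {"gyrB": "ATGC", "rpoA": "GGTA"},
        "iso3": {"gyrB": "ATGA", "rpoA": "GGTA"},
    }
    for isolate, (st, profile) in assign_sequence_types(toy, ["gyrB", "rpoA"]).items():
        print(isolate, "ST", st, profile)
```

    In this toy input, iso1 and iso2 share one profile and therefore one ST, while a single nucleotide difference at gyrB places iso3 in a different ST.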

  11. Taxonomic evaluation of putative Streptomyces scabiei strains held in the ARS (NRRL) Culture Collection using multi-locus sequence analysis

    Science.gov (United States)

    Multi-locus sequence analysis has been demonstrated to be a useful tool for identification of Streptomyces species and was previously applied to phylogenetically differentiate the type strains of species pathogenic on potatoes (Solanum tuberosum L.). The ARS Culture Collection (NRRL) contains 43 str...

  12. Multilocus sequence analysis of nectar pseudomonads reveals high genetic diversity and contrasting recombination patterns.

    Science.gov (United States)

    Alvarez-Pérez, Sergio; de Vega, Clara; Herrera, Carlos M

    2013-01-01

    The genetic and evolutionary relationships among floral nectar-dwelling Pseudomonas 'sensu stricto' isolates associated to South African and Mediterranean plants were investigated by multilocus sequence analysis (MLSA) of four core housekeeping genes (rrs, gyrB, rpoB and rpoD). A total of 35 different sequence types were found for the 38 nectar bacterial isolates characterised. Phylogenetic analyses resulted in the identification of three main clades [nectar groups (NGs) 1, 2 and 3] of nectar pseudomonads, which were closely related to five intrageneric groups: Pseudomonas oryzihabitans (NG 1); P. fluorescens, P. lutea and P. syringae (NG 2); and P. rhizosphaerae (NG 3). Linkage disequilibrium analysis pointed to a mostly clonal population structure, even when the analysis was restricted to isolates from the same floristic region or belonging to the same NG. Nevertheless, signatures of recombination were observed for NG 3, which exclusively included isolates retrieved from the floral nectar of insect-pollinated Mediterranean plants. In contrast, the other two NGs comprised both South African and Mediterranean isolates. Analyses relating diversification to floristic region and pollinator type revealed that there has been more unique evolution of the nectar pseudomonads within the Mediterranean region than would be expected by chance. This is the first work analysing the sequence of multiple loci to reveal geno- and ecotypes of nectar bacteria.

  13. Multilocus Sequence Analysis of Nectar Pseudomonads Reveals High Genetic Diversity and Contrasting Recombination Patterns

    Science.gov (United States)

    Álvarez-Pérez, Sergio; de Vega, Clara; Herrera, Carlos M.

    2013-01-01

    The genetic and evolutionary relationships among floral nectar-dwelling Pseudomonas ‘sensu stricto’ isolates associated to South African and Mediterranean plants were investigated by multilocus sequence analysis (MLSA) of four core housekeeping genes (rrs, gyrB, rpoB and rpoD). A total of 35 different sequence types were found for the 38 nectar bacterial isolates characterised. Phylogenetic analyses resulted in the identification of three main clades [nectar groups (NGs) 1, 2 and 3] of nectar pseudomonads, which were closely related to five intrageneric groups: Pseudomonas oryzihabitans (NG 1); P. fluorescens, P. lutea and P. syringae (NG 2); and P. rhizosphaerae (NG 3). Linkage disequilibrium analysis pointed to a mostly clonal population structure, even when the analysis was restricted to isolates from the same floristic region or belonging to the same NG. Nevertheless, signatures of recombination were observed for NG 3, which exclusively included isolates retrieved from the floral nectar of insect-pollinated Mediterranean plants. In contrast, the other two NGs comprised both South African and Mediterranean isolates. Analyses relating diversification to floristic region and pollinator type revealed that there has been more unique evolution of the nectar pseudomonads within the Mediterranean region than would be expected by chance. This is the first work analysing the sequence of multiple loci to reveal geno- and ecotypes of nectar bacteria. PMID:24116076

  14. Insights into the emergent bacterial pathogen Cronobacter spp., generated by multilocus sequence typing and analysis

    Directory of Open Access Journals (Sweden)

    Susan Joseph

    2012-11-01

    Cronobacter spp. (previously known as Enterobacter sakazakii) is a bacterial pathogen affecting all age groups, with particularly severe clinical complications in neonates and infants. One recognised route of infection is the consumption of contaminated infant formula. As a recently recognised bacterial pathogen of considerable importance and regulatory control, appropriate detection and identification schemes are required. The application of multilocus sequence typing (MLST) and analysis (MLSA) of the seven alleles atpD, fusA, glnS, gltB, gyrB, infB and ppsA (concatenated length 3036 base pairs) has led to considerable advances in our understanding of the genus. This approach is supported by both the reliability of DNA sequencing over subjective phenotyping and the establishment of a MLST database which has open access and is also curated; http://www.pubMLST.org/cronobacter. MLST has been used to describe the diversity of the newly recognised genus, was instrumental in the formal recognition of new Cronobacter species (C. universalis and C. condimenti), and revealed the high clonality of strains and the association of clonal complex 4 with neonatal meningitis cases. Clearly the MLST approach has considerable benefits over the use of non-DNA sequence based methods of analysis for newly emergent bacterial pathogens. The application of MLST and MLSA has dramatically enabled us to better understand this opportunistic bacterium which can cause irreparable damage to a newborn baby’s brain, and has contributed to improved control measures to protect neonatal health.

  15. Multilocus Sequence Analysis of Cercospora spp. from Different Host Plant Families

    Directory of Open Access Journals (Sweden)

    Floreta Fiska Yuliarni

    2014-06-01

    Identification of the genus Cercospora is still complicated because host preference is often used as the main criterion to propose a new name. We determined the relationship between host plants and multilocus sequence variations (ITS rDNA including 5.8S rDNA, elongation factor 1-α, and calmodulin) in Cercospora spp. to investigate the host specificity. We used 53 strains of Cercospora spp. infecting 12 plant families for phylogenetic analysis. The sequences of 23 strains of Cercospora spp. infecting the plant families of Asteraceae, Cucurbitaceae, and Solanaceae were determined in this study. The sequences of 30 strains of Cercospora spp. infecting the plant families of Fabaceae, Amaranthaceae, Apiaceae, Plumbaginaceae, Malvaceae, Cistaceae, Plantaginaceae, Lamiaceae, and Poaceae were obtained from GenBank. The molecular phylogenetic analysis revealed that the majority of Cercospora species lack host specificity, and only C. zinniicola, C. zeina, C. zeae-maydis, C. cocciniae, and C. mikaniicola were found to be host-specific. Closely related species of Cercospora could not be distinguished using molecular analyses of ITS, EF, and CAL gene regions. The phylogenetic tree based on the CAL gene showed a better topology and Cercospora species separation than the trees based on the ITS rDNA region or the EF gene.

  16. Multilocus sequence typing and virulence analysis of Haemophilus parasuis strains isolated in five provinces of China.

    Science.gov (United States)

    Wang, Liyan; Ma, Lina; Liu, Yongan; Gao, Pengcheng; Li, Youquan; Li, Xuerui; Liu, Yongsheng

    2016-10-01

    Haemophilus parasuis is the etiological agent of Glässer's disease, which causes high morbidity and mortality in swine herds. Although H. parasuis strains can be classified into 15 serovars with the Kielstein-Rapp-Gabrielson serotyping scheme, a large number of isolates cannot be classified and have been designated 'nontypeable' strains. In this study, multilocus sequence typing (MLST) of H. parasuis was used to analyze 48 H. parasuis field strains isolated in China and two strains from Australia. Twenty-six new alleles and 29 new sequence types (STs) were detected, enriching the H. parasuis MLST databases. A BURST analysis indicated that H. parasuis lacks stable population structure and is highly heterogeneous, and that there is no association between STs and geographic area. When a UPGMA dendrogram was constructed, two major clades, clade A and clade B, were defined. Animal experiments, in which guinea pigs were challenged intraperitoneally with the bacterial isolates, supported the hypothesis that the H. parasuis STs in clade A are generally avirulent or weakly virulent, whereas the STs in clade B tend to be virulent. Copyright © 2016 Elsevier B.V. All rights reserved.

  17. Multilocus sequence analysis (MLSA) of Bradyrhizobium strains: revealing high diversity of tropical diazotrophic symbiotic bacteria.

    Science.gov (United States)

    Delamuta, Jakeline Renata Marçon; Ribeiro, Renan Augusto; Menna, Pâmela; Bangel, Eliane Villamil; Hungria, Mariangela

    2012-04-01

    Symbiotic association between several genera of bacteria, collectively called rhizobia, and plants belonging to the family Leguminosae (=Fabaceae) results in the process of biological nitrogen fixation, playing a key role in global N cycling and also bringing relevant contributions to agriculture. Bradyrhizobium is considered the ancestor of all nitrogen-fixing rhizobial species and probably originated in the tropics. The genus encompasses a variety of diverse bacteria, but the diversity captured in the analysis of the 16S rRNA is often low. In this study, we analyzed twelve Bradyrhizobium strains selected from previous studies performed by our group for showing high genetic diversity in relation to the described species. In addition to the 16S rRNA, five housekeeping genes (recA, atpD, glnII, gyrB and rpoB) were analyzed in the MLSA (multilocus sequence analysis) approach. Analysis of each gene and of the concatenated housekeeping genes captured a considerably higher level of genetic diversity, with indication of putative new species. The results highlight the high genetic variability associated with Bradyrhizobium microsymbionts of a variety of legumes. In addition, the MLSA approach has proved to be a rapid and reliable method for phylogenetic and taxonomic studies, speeding the identification of the still poorly known diversity of nitrogen-fixing rhizobia in the tropics.

  18. Differentiation of Xylella fastidiosa strains via multilocus sequence analysis of environmentally mediated genes (MLSA-E).

    Science.gov (United States)

    Parker, Jennifer K; Havird, Justin C; De La Fuente, Leonardo

    2012-03-01

    Isolates of the plant pathogen Xylella fastidiosa are genetically very similar, but studies on their biological traits have indicated differences in virulence and infection symptomatology. Taxonomic analyses have identified several subspecies, and phylogenetic analyses of housekeeping genes have shown broad host-based genetic differences; however, results are still inconclusive for genetic differentiation of isolates within subspecies. This study employs multilocus sequence analysis of environmentally mediated genes (MLSA-E; genes influenced by environmental factors) to investigate X. fastidiosa relationships and differentiate isolates with low genetic variability. Potential environmentally mediated genes, including host colonization and survival genes related to infection establishment, were identified a priori. The ratio of the rate of nonsynonymous substitutions to the rate of synonymous substitutions (dN/dS) was calculated to select genes that may be under increased positive selection compared to previously studied housekeeping genes. Nine genes were sequenced from 54 X. fastidiosa isolates infecting different host plants across the United States. Results of maximum likelihood (ML) and Bayesian phylogenetic (BP) analyses are in agreement with known X. fastidiosa subspecies clades but show novel within-subspecies differentiation, including geographic differentiation, and provide additional information regarding host-based isolate variation and specificity. dN/dS ratios of environmentally mediated genes, though gene dN/dS ratios and correlate with increased sequence variability. MLSA-E can more precisely resolve relationships between closely related bacterial strains with low genetic variability, such as X. fastidiosa isolates. Discovering the genetic relationships between X. fastidiosa isolates will provide new insights into the epidemiology of populations of X. fastidiosa, allowing improved disease management in economically important crops.

  19. Antimicrobial susceptibility among clinical Nocardia species identified by multilocus sequence analysis.

    Science.gov (United States)

    McTaggart, Lisa R; Doucet, Jennifer; Witkowska, Maria; Richardson, Susan E

    2015-01-01

    Antimicrobial susceptibility patterns of 112 clinical isolates, 28 type strains, and 9 reference strains of Nocardia were determined using the Sensititre Rapmyco microdilution panel (Thermo Fisher, Inc.). Isolates were identified by highly discriminatory multilocus sequence analysis and were chosen to represent the diversity of species recovered from clinical specimens in Ontario, Canada. Susceptibility to the most commonly used drug, trimethoprim-sulfamethoxazole, was observed in 97% of isolates. Linezolid and amikacin were also highly effective; 100% and 99% of all isolates demonstrated a susceptible phenotype. For the remaining antimicrobials, resistance was species specific with isolates of Nocardia otitidiscaviarum, N. brasiliensis, N. abscessus complex, N. nova complex, N. transvalensis complex, N. farcinica, and N. cyriacigeorgica displaying the traditional characteristic drug pattern types. In addition, the antimicrobial susceptibility profiles of a variety of rarely encountered species isolated from clinical specimens are reported for the first time and were categorized into four additional drug pattern types. Finally, MICs for the control strains N. nova ATCC BAA-2227, N. asteroides ATCC 19247(T), and N. farcinica ATCC 23826 were robustly determined to demonstrate method reproducibility and suitability of the commercial Sensititre Rapmyco panel for antimicrobial susceptibility testing of Nocardia spp. isolated from clinical specimens. The reported values will facilitate quality control and standardization among laboratories. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  20. Multilocus sequence analysis of Echinococcus granulosus strains isolated from humans and animals in Iran.

    Science.gov (United States)

    Nikmanesh, Bahram; Mirhendi, Hossein; Mahmoudi, Shahram; Rokni, Mohammad Bagher

    2017-12-01

    Echinococcus granulosus is now considered a complex consisting of at least four species and ten genotypes. Different molecular targets have been described for molecular characterization of E. granulosus; however, in almost all studies only one or two of the targets have been used, and only limited data are available on the utilization of multiple loci. Therefore, we investigated the genetic diversity among 64 strains isolated from 138 cyst specimens of human and animal isolates, using a set of nuclear and mitochondrial genes; i.e., cytochrome c oxidase subunit 1 (cox1), NADH dehydrogenase subunit 1 (nad1), ATPase subunit 6 (atp6), 12S rRNA (12S), and Actin II (act II). In comparison to the use of molecular reference targets (nad1 + cox1), using a single target (act II or 12S or atp6) yielded lower discriminatory power. Act II and 12S genes could accurately discriminate the G6 genotype, but they were not able to differentiate between G1 and G3 genotypes. As the G1 and G3 genotypes belong to E. granulosus sensu stricto, low intra-species variation was observed for act II and 12S. The atp6 gene could identify the G3 genotype but could not differentiate G6 and G1 genotypes. Using the concatenated sequence of five genes (cox1 + nad1 + atp6 + 12S + act II), genotypes were identified accurately, and markedly higher resolution was obtained in comparison with the use of reference markers (nad1 + cox1) only. Application of multilocus sequence analysis (MLSA) to large-scale studies could provide valuable epidemiological data to support efficient control and management measures for cystic echinococcosis. Copyright © 2017 Elsevier Inc. All rights reserved.
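
    The gain in resolution from concatenating loci can be illustrated with a minimal Python sketch. It is not the pipeline used in the study above. It assumes each locus has already been aligned so that homologous positions line up across strains, and it uses a simple uncorrected p-distance (proportion of differing sites) rather than any substitution model. The function names and toy sequences are hypothetical.

```python
from typing import Dict, List


def concatenate_loci(seqs_by_locus: Dict[str, str], order: List[str]) -> str:
    """Join one strain's aligned locus sequences in a fixed locus order."""
    return "".join(seqs_by_locus[locus] for locus in order)


def p_distance(a: str, b: str) -> float:
    """Uncorrected proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap ('-') or an ambiguous 'N' are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in "-N" or y in "-N":
            continue
        compared += 1
        if x != y:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


if __name__ == "__main__":
    # Toy aligned fragments, invented for illustration only.
    order = ["cox1", "nad1"]
    strain_a = {"cox1": "ATGCATGC", "nad1": "GGCC"}
    strain_b = {"cox1": "ATGCATGA", "nad1": "GGCT"}
    concat_a = concatenate_loci(strain_a, order)
    concat_b = concatenate_loci(strain_b, order)
    print(f"p-distance over {len(concat_a)} sites: {p_distance(concat_a, concat_b):.4f}")
```

    Keeping the locus order fixed for every strain matters: the concatenated strings are only comparable position by position if each strain contributes the same loci in the same order.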

  1. Molecular Epidemiologic Analysis of Enterococcus faecalis Isolates in Cuba by Multilocus Sequence Typing

    Science.gov (United States)

    Kobayashi, Nobumichi; Nagashima, Shigeo

    2009-01-01

    We carried out the first study of Enterococcus faecalis clinical isolates in Cuba by multilocus sequence typing linking the molecular typing data with the presence of virulence determinants and the antibiotic resistance genes. A total of 23 E. faecalis isolates recovered from several clinic sources and geographic areas of Cuba during a period between 2000 and 2005 were typed by multilocus sequence typing. Thirteen sequence types (STs) including five novel STs were identified, and the ST 64 (clonal complex [CC] 8), ST 6 (CC2), ST 21(CC21), and ST 16 (CC58) were found in more than one strain. Sixty-seven percent of STs corresponded to STs reported previously in Spain, Poland, and The Netherlands, and other STs (ST115, ST64, ST6, and ST40) were genetically close to those detected in the United States. Prevalence of both antimicrobial resistance genes [aac(6′)-aph(2″), aph(3′), ant(6), ant(3″)(9), aph(2″)-Id, aph(2″)-Ic, erm(B), erm(A), erm(C), mef(A), tet(M), and tet(L)] and virulence genes (agg, gelE, cylA, esp, ccf, and efaAfs) were examined by polymerase chain reaction. Aminoglycoside resistance genes aac(6′)-Ie-aph(2″)-Ia, aph(3′), ant(6), ant(3″)(9) were more frequently detected in ST6, ST16, ST23, ST64, and ST115. The multidrug resistance was distributed to all STs detected, except for ST117 and singleton ST225. The presence of cyl gene was specifically linked to the ST64 and ST16. Presence of the esp, gel, and agg genes was not specific to any particular ST. This research provided the first insight into the population structure of E. faecalis in Cuba, that is, most Cuban strains were related to European strains, whereas others to U.S. strains. The CC2, CC21, and CC8, three of the biggest CCs in the world, were evidently circulating in Cuba, associated with multidrug resistance and virulence traits. PMID:19857135

  2. Population genetic and evolution analysis of controversial genus Edwardsiella by multilocus sequence typing.

    Science.gov (United States)

    Buján, Noemí; Balboa, Sabela; L Romalde, Jesús; E Toranzo, Alicia; Magariños, Beatriz

    2018-05-08

    At present, the genus Edwardsiella comprises five species: E. tarda, E. hoshinae, E. ictaluri, E. piscicida and E. anguillarum. Some species of this genus, such as E. ictaluri and E. piscicida, are important pathogens of numerous fish species. With the description of the two latter species, the phylogeny of Edwardsiella became more complicated. To clarify the relationships among all species in the genus, a multilocus sequence typing (MLST) approach was developed and applied to characterize 56 isolates and 6 reference strains belonging to the five Edwardsiella species. Moreover, several analyses based on the MLST scheme were performed to investigate the evolution within the genus, as well as the influence of recombination and mutation in the speciation. Edwardsiella isolates presented a high genetic variability, reflected in the fourteen sequence types (STs) represented by a single isolate out of eighteen total STs. Mutation events were considerably more frequent than recombination, although both influenced the genetic diversification approximately equally. However, the speciation among species occurred mostly by recombination. The genus Edwardsiella displays a non-clonal population structure with some degree of geographical isolation followed by a population expansion of E. piscicida. A database from this study was created and hosted on pubmlst.org (http://pubmlst.org/edwardsiella/). Copyright © 2018 Elsevier Inc. All rights reserved.

  3. Characterization of European Yersinia enterocolitica 1A strains using restriction fragment length polymorphism and multilocus sequence analysis.

    Science.gov (United States)

    Murros, A; Säde, E; Johansson, P; Korkeala, H; Fredriksson-Ahomaa, M; Björkroth, J

    2016-10-01

    Yersinia enterocolitica is currently divided into two subspecies: subsp. enterocolitica including highly pathogenic strains of biotype 1B and subsp. palearctica including nonpathogenic strains of biotype 1A and moderately pathogenic strains of biotypes 2-5. In this work, we characterized 162 Y. enterocolitica strains of biotype 1A and 50 strains of biotypes 2-4 isolated from human, animal and food samples by restriction fragment length polymorphism using the HindIII restriction enzyme. Phylogenetic relatedness of 20 representative Y. enterocolitica strains including 15 biotype 1A strains was further studied by the multilocus sequence analysis of four housekeeping genes (glnA, gyrB, recA and HSP60). In all the analyses, biotype 1A strains formed a separate genomic group, which differed from Y. enterocolitica subsp. enterocolitica and from the strains of biotypes 2-4 of Y. enterocolitica subsp. palearctica. Based on these results, biotype 1A strains considered nonpathogenic should not be included in subspecies palearctica containing pathogenic strains of biotypes 2-5. Yersinia enterocolitica strains are currently divided into six biotypes and two subspecies. Strains of biotype 1A, which are phenotypically and genotypically very heterogeneous, are classified as subspecies palearctica. In this study, European Y. enterocolitica 1A strains isolated from both human and nonhuman sources were characterized using restriction fragment length polymorphism and multilocus sequence analysis. The European biotype 1A strains formed a separate group, which differed from strains belonging to subspecies enterocolitica and palearctica. This may indicate that the current division between the two subspecies is not sufficient considering the strain diversity within Y. enterocolitica. © 2016 The Society for Applied Microbiology.

  4. Analysis of multilocus sequence typing and virulence characterization of Listeria monocytogenes isolates from Chinese retail ready-to-eat food

    Directory of Open Access Journals (Sweden)

    Shi Wu

    2016-02-01

    Eighty Listeria monocytogenes isolates were obtained from Chinese retail ready-to-eat (RTE) food and were previously characterized with serotyping and antibiotic susceptibility tests. The aim of this study was to characterize the subtype and virulence potential of these L. monocytogenes isolates by multilocus sequence typing (MLST), virulence-associate genes, epidemic clones (ECs) and sequence analysis of the important virulence factor: internalin A (inlA). The result of MLST revealed that these L. monocytogenes isolates belonged to 14 different sequence types (STs). With the exception of four new STs (ST804, ST805, ST806 and ST807), all other STs observed in this study have been associated with human listeriosis and outbreaks to varying extents. Six virulence-associate genes (inlA, inlB, inlC, inlJ, hly and llsX) were selected and their presence was investigated using PCR. All strains carried inlA, inlB, inlC, inlJ, and hly, whereas 38.8% (31/80) of strains harbored the listeriolysin S genes (llsX). A multiplex PCR assay was used to evaluate the presence of markers specific to epidemic clones of L. monocytogenes and identified 26.3% (21/80) of ECI in the 4b-4d-4e strains. Further study of inlA sequencing revealed that most strains contained the full-length InlA required for host cell invasion, whereas three mutations lead to premature stop codons (PMSC) within a novel PMSCs at position 326 (GAA→TAA). MLST and inlA sequence analysis results were concordant, and different virulence potentials within isolates were observed. These findings suggest that L. monocytogenes isolates from RTE food in China could be virulent and be capable of causing human illness. Furthermore, the STs and virulence profiles of L. monocytogenes isolates have significant implications for epidemiological and public health studies of this pathogen.

  5. Analysis of Multilocus Sequence Typing and Virulence Characterization of Listeria monocytogenes Isolates from Chinese Retail Ready-to-Eat Food.

    Science.gov (United States)

    Wu, Shi; Wu, Qingping; Zhang, Jumei; Chen, Moutong; Guo, Weipeng

    2016-01-01

    Eighty Listeria monocytogenes isolates were obtained from Chinese retail ready-to-eat (RTE) food and were previously characterized with serotyping and antibiotic susceptibility tests. The aim of this study was to characterize the subtype and virulence potential of these L. monocytogenes isolates by multilocus sequence typing (MLST), virulence-associate genes, epidemic clones (ECs), and sequence analysis of the important virulence factor: internalin A (inlA). The result of MLST revealed that these L. monocytogenes isolates belonged to 14 different sequence types (STs). With the exception of four new STs (ST804, ST805, ST806, and ST807), all other STs observed in this study have been associated with human listeriosis and outbreaks to varying extents. Six virulence-associate genes (inlA, inlB, inlC, inlJ, hly, and llsX) were selected and their presence was investigated using PCR. All strains carried inlA, inlB, inlC, inlJ, and hly, whereas 38.8% (31/80) of strains harbored the listeriolysin S genes (llsX). A multiplex PCR assay was used to evaluate the presence of markers specific to epidemic clones of L. monocytogenes and identified 26.3% (21/80) of ECI in the 4b-4d-4e strains. Further study of inlA sequencing revealed that most strains contained the full-length InlA required for host cell invasion, whereas three mutations lead to premature stop codons (PMSC) within a novel PMSCs at position 326 (GAA → TAA). MLST and inlA sequence analysis results were concordant, and different virulence potentials within isolates were observed. These findings suggest that L. monocytogenes isolates from RTE food in China could be virulent and be capable of causing human illness. Furthermore, the STs and virulence profiles of L. monocytogenes isolates have significant implications for epidemiological and public health studies of this pathogen.

  6. Multilocus Sequence Typing of Total-Genome-Sequenced Bacteria

    DEFF Research Database (Denmark)

    Larsen, Mette Voldby; Cosentino, Salvatore; Rasmussen, Simon

    2012-01-01

    Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the "gold standard" of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS...

  7. Assessment of MultiLocus Sequence Analysis As a Valuable Tool for the Classification of the Genus Salinivibrio

    Directory of Open Access Journals (Sweden)

    Clara López-Hermoso

    2017-06-01

    The genus Salinivibrio includes obligatory halophilic bacteria and is commonly isolated from hypersaline habitats and salted food products. They grow optimally between 7.5 and 10% salts and are facultative anaerobes. Currently, this genus comprises four species, one of them, S. costicola, with three subspecies. In this study we isolated and characterized an additional 70 strains from solar salterns located in different locations. Comparative 16S rRNA gene sequence analysis identified these strains as belonging to the genus Salinivibrio but could not differentiate strains into species-like groups. To achieve finer phylogenetic resolution, we carried out a MultiLocus Sequence Analysis (MLSA) of the new isolates and the type strains of the species of Salinivibrio based on the individual as well as concatenated sequences of four housekeeping genes: gyrB, recA, rpoA, and rpoD. The strains formed four clearly differentiated species-like clusters called phylogroups. All of the known type and subspecies strains were associated with one of these clusters except S. sharmensis. One phylogroup had no previously described species coupled to it. Further DNA–DNA hybridization (DDH) experiments with selected representative strains from these phylogroups permitted us to validate the MLSA study, correlating the species level defined by the DDH (70%) with a 97% cut-off for the concatenated MLSA gene sequences. Based on these criteria, the novel strains forming phylogroup 1 could constitute a new species while strains constructing the other three phylogroups are members of previously recognized Salinivibrio species. S. costicola subsp. vallismortis co-occurs with S. proteolyticus in phylogroup 4, and separately from other S. costicola strains, indicating its need for reclassification. On the other hand, genome fingerprinting analysis showed that the environmental strains do not form clonal populations and did not cluster according to their site of cultivation. In
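The concatenation step and the 97% cut-off described in this record can be illustrated with a minimal sketch. It assumes the four gene fragments are already aligned and of equal length in both strains; the sequences, the strain names, and the helper functions `concatenate` and `percent_identity` are invented for illustration and are not taken from the study.

```python
# Minimal sketch: concatenate housekeeping-gene fragments and compare two
# strains by percent identity. All sequences below are hypothetical toy data.

GENE_ORDER = ("gyrB", "recA", "rpoA", "rpoD")  # loci named in the abstract
CUTOFF = 97.0  # percent identity threshold quoted in the abstract


def concatenate(loci, order=GENE_ORDER):
    """Join the per-gene sequences in a fixed locus order."""
    return "".join(loci[gene] for gene in order)


def percent_identity(seq_a, seq_b):
    """Percent of matching positions between two pre-aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Hypothetical 10-bp fragments (real fragments are hundreds of bases long).
strain_a = {"gyrB": "ATGGCTAAAG", "recA": "ATGGACGAAA",
            "rpoA": "ATGCAGGGTT", "rpoD": "ATGGAGCAAA"}
strain_b = {"gyrB": "ATGGCTAAAG", "recA": "ATGGACGAGA",
            "rpoA": "ATGCAGGGTT", "rpoD": "ATGGAGCAAA"}

identity = percent_identity(concatenate(strain_a), concatenate(strain_b))
same_group = identity >= CUTOFF
print(f"identity = {identity:.1f}%, at or above cut-off: {same_group}")
```

A fixed locus order matters here: concatenating the same genes in a different order for different strains would make the position-by-position comparison meaningless.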

  8. Taxonomic evaluation of Streptomyces albus and related species using multilocus sequence analysis

    Science.gov (United States)

    In phylogenetic analyses of the genus Streptomyces using 16S rRNA gene sequences, Streptomyces albus subsp. albus NRRL B-1811T formed a cluster with 5 other species having identical or nearly identical 16S rRNA gene sequences. Moreover, the morphological and physiological characteristics of these ot...

  9. Multilocus sequence typing and rtxA toxin gene sequencing analysis of Kingella kingae isolates demonstrates genetic diversity and international clones.

    Directory of Open Access Journals (Sweden)

    Romain Basmaci

    BACKGROUND: Kingella kingae, a normal component of the upper respiratory flora, is being increasingly recognized as an important invasive pathogen in young children. Genetic diversity of this species has not been studied. METHODS: We analyzed 103 strains from different countries and clinical origins by a new multilocus sequence-typing (MLST) schema. Putative virulence gene rtxA, encoding an RTX toxin, was also sequenced, and experimental virulence of representative strains was assessed in a juvenile-rat model. RESULTS: Thirty-six sequence-types (ST) and nine ST-complexes (STc) were detected. The main STc 6, 14 and 23 comprised 23, 17 and 20 strains respectively, and were internationally distributed. rtxA sequencing results were mostly congruent with MLST, and showed horizontal transfer events. Of interest, all members of the distantly related ST-6 (n = 22) and ST-5 (n = 4) harboured a 33 bp duplication or triplication in their rtxA sequence, suggesting that this genetic trait arose through selective advantage. The animal model revealed significant differences in virulence among strains of the species. CONCLUSION: MLST analysis reveals international spread of ST-complexes and will help to decipher acquisition and evolution of virulence traits and diversity of pathogenicity among K. kingae strains, for which an experimental animal model is now available.

  10. Taxonomic evaluation of Streptomyces hirsutus and related species using multi-locus sequence analysis

    Science.gov (United States)

    Phylogenetic analyses of species of Streptomyces based on 16S rRNA gene sequences resulted in a statistically well-supported clade (100% bootstrap value) containing 8 species having very similar gross morphology. These species, including Streptomyces bambergiensis, Streptomyces chlorus, Streptomyces...

  11. Abundance and Multilocus Sequence Analysis of Vibrio Bacteria Associated with Diseased Elkhorn Coral (Acropora palmata) of the Florida Keys.

    Science.gov (United States)

    Kemp, Keri M; Westrich, Jason R; Alabady, Magdy S; Edwards, Martinique L; Lipp, Erin K

    2018-01-15

    The critically endangered elkhorn coral (Acropora palmata) is affected by white pox disease (WPX) throughout the Florida Reef Tract and wider Caribbean. The bacterium Serratia marcescens was previously identified as one etiologic agent of WPX but is no longer consistently detected in contemporary outbreaks. It is now believed that multiple etiologic agents cause WPX; however, to date, no other potential pathogens have been thoroughly investigated. This study examined the association of Vibrio bacteria with WPX occurrence from August 2012 to 2014 at Looe Key Reef in the Florida Keys, USA. The concentration of cultivable Vibrio was consistently greater in WPX samples than in healthy samples. The abundance of Vibrio bacteria relative to total bacteria was four times higher in samples from WPX lesions than in adjacent apparently healthy regions of diseased corals based on quantitative PCR (qPCR). Multilocus sequence analysis (MLSA) was used to assess the diversity of 69 Vibrio isolates collected from diseased and apparently healthy A. palmata colonies and the surrounding seawater. Vibrio species with known pathogenicity to corals were detected in both apparently healthy and diseased samples. While the causative agent(s) of contemporary WPX outbreaks remains elusive, our results suggest that Vibrio spp. may be part of a nonspecific heterotrophic bacterial bloom rather than acting as primary pathogens. This study highlights the need for highly resolved temporal sampling in situ to further elucidate the role of Vibrio during WPX onset and progression. IMPORTANCE Coral diseases are increasing worldwide and are now considered a major contributor to coral reef decline. In particular, the Caribbean has been noted as a coral disease hot spot, owing to the dramatic loss of framework-building acroporid corals due to tissue loss diseases. The pathogenesis of contemporary white pox disease (WPX) outbreaks in Acropora palmata remains poorly understood. This study investigates the

  12. Multilocus sequence typing analysis reveals that Cryptococcus neoformans var. neoformans is a recombinant population.

    Science.gov (United States)

    Cogliati, Massimo; Zani, Alberto; Rickerts, Volker; McCormick, Ilka; Desnos-Ollivier, Marie; Velegraki, Aristea; Escandon, Patricia; Ichikawa, Tomoe; Ikeda, Reiko; Bienvenu, Anne-Lise; Tintelnot, Kathrin; Tore, Okan; Akcaglar, Sevim; Lockhart, Shawn; Tortorano, Anna Maria; Varma, Ashok

    2016-02-01

    Cryptococcus neoformans var. neoformans (serotype D) represents about 30% of the clinical isolates in Europe and is present less frequently in the other continents. It is the prevalent etiological agent in primary cutaneous cryptococcosis as well as in cryptococcal skin lesions of disseminated cryptococcosis. Very little is known about the genotypic diversity of this Cryptococcus subtype. The aim of this study was to investigate the genotypic diversity among a set of clinical and environmental C. neoformans var. neoformans isolates and to evaluate the relationship between genotypes, geographical origin and clinical manifestations. A total of 83 globally collected C. neoformans var. neoformans isolates from Italy, Germany, France, Belgium, Denmark, Greece, Turkey, Thailand, Japan, Colombia, and the USA, recovered from different sources (primary and secondary cutaneous cryptococcosis, disseminated cryptococcosis, the environment, and animals), were included in the study. All isolates were confirmed to belong to genotype VNIV by molecular typing and they were further investigated by MLST analysis. Maximum likelihood phylogenetic as well as network analysis strongly suggested the existence of a recombinant rather than a clonal population structure. Geographical origin and source of isolation were not correlated with a specific MLST genotype. The comparison with a set of outgroup C. neoformans var. grubii isolates provided clear evidence that the two varieties have different population structures. Copyright © 2016 Elsevier Inc. All rights reserved.

  13. Molecular epidemiology and multilocus sequence analysis of potentially zoonotic Giardia spp. from humans and dogs in Jamaica.

    Science.gov (United States)

    Lee, Mellesia F; Cadogan, Paul; Eytle, Sarah; Copeland, Sonia; Walochnik, Julia; Lindo, John F

    2017-01-01

    Giardia spp. are the causative agents of intestinal infections in a wide variety of mammals including humans and companion animals. Dogs may be reservoirs of zoonotic Giardia spp.; however, the potential for transmission between dogs and humans in Jamaica has not been studied. Conventional PCR was used to screen 285 human and 225 dog stool samples for Giardia targeting the SSU rDNA gene followed by multilocus sequencing of the triosephosphate isomerase (tpi), glutamate dehydrogenase (gdh), and β-giardin (bg) genes. Prevalence of human infections based on PCR was 6.7 % (19/285) and canine infections 19.6 % (44/225). Nested PCR conducted on all 63 positive samples revealed the exclusive presence of assemblage A in both humans and dogs. Sub-assemblage A-II was responsible for 79.0 % (15/19) and 70.5 % (31/44) of the infections in humans and dogs, respectively, while sub-assemblage A-I was identified at a rate of 15.8 % (3/19) and 29.5 % (13/44) in humans and dogs, respectively. The predominance of a single circulating assemblage among both humans and dogs in Jamaica suggests possible zoonotic transmission of Giardia infections.
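The percentages in this record follow directly from the reported counts, and the arithmetic can be rechecked with a short sketch. The helper `percent` is a hypothetical convenience function; the counts are the ones given in the abstract. Note that 15/19 evaluates to about 78.9%, marginally different from the 79.0 % printed in the record.

```python
# Minimal sketch: recompute the reported prevalence figures from raw counts.

def percent(count, total):
    """Percentage rounded to one decimal place."""
    return round(100.0 * count / total, 1)


human_prevalence = percent(19, 285)   # PCR-positive human samples
dog_prevalence = percent(44, 225)     # PCR-positive dog samples

# Sub-assemblage shares among the positive samples.
human_a2 = percent(15, 19)
dog_a2 = percent(31, 44)
human_a1 = percent(3, 19)
dog_a1 = percent(13, 44)

print(human_prevalence, dog_prevalence)
print(human_a2, dog_a2, human_a1, dog_a1)
```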

  14. Multilocus Sequence Typing for Interpreting Blood Isolates of Staphylococcus epidermidis

    Directory of Open Access Journals (Sweden)

    Prannda Sharma

    2014-01-01

    Staphylococcus epidermidis is an important cause of nosocomial infection and bacteremia. It is also a common contaminant of blood cultures and, as a result, there is frequently uncertainty as to its diagnostic significance when recovered in the clinical laboratory. One molecular strategy that might be of value in clarifying the interpretation of S. epidermidis identified in blood culture is multilocus sequence typing. Here, we examined 100 isolates of this species (50 blood isolates representing true bacteremia, 25 likely contaminant isolates, and 25 skin isolates) and the ability of sequence typing to differentiate them. Three machine learning algorithms (classification regression tree, support vector machine, and nearest neighbor) were employed. Genetic variability was substantial between isolates, with 44 sequence types found in 100 isolates. Sequence types 2 and 5 were most commonly identified. However, among the classification algorithms we employed, none were effective, with CART and SVM both yielding only 73% diagnostic accuracy and nearest neighbor analysis yielding only 53% accuracy. Our data mirror previous studies examining the presence or absence of pathogenic genes in that the overlap between truly significant organisms and contaminants appears to prevent the use of MLST in the clarification of blood cultures recovering S. epidermidis.

  15. Multilocus sequence typing reveals a novel subspeciation of Lactobacillus delbrueckii.

    Science.gov (United States)

    Tanigawa, Kana; Watanabe, Koichi

    2011-03-01

    Currently, the species Lactobacillus delbrueckii is divided into four subspecies, L. delbrueckii subsp. delbrueckii, L. delbrueckii subsp. bulgaricus, L. delbrueckii subsp. indicus and L. delbrueckii subsp. lactis. These classifications were based mainly on phenotypic identification methods and few studies have used genotypic identification methods. As a result, these subspecies have not yet been reliably delineated. In this study, the four subspecies of L. delbrueckii were discriminated by phenotype and by genotypic identification [amplified-fragment length polymorphism (AFLP) and multilocus sequence typing (MLST)] methods. The MLST method developed here was based on the analysis of seven housekeeping genes (fusA, gyrB, hsp60, ileS, pyrG, recA and recG). The MLST method had good discriminatory ability: the 41 strains of L. delbrueckii examined were divided into 34 sequence types, with 29 sequence types represented by only a single strain. The sequence types were divided into eight groups. These groups could be discriminated as representing different subspecies. The results of the AFLP and MLST analyses were consistent. The type strain of L. delbrueckii subsp. delbrueckii, YIT 0080(T), was clearly discriminated from the other strains currently classified as members of this subspecies, which were located close to strains of L. delbrueckii subsp. lactis. The MLST scheme developed in this study should be a useful tool for the identification of strains of L. delbrueckii to the subspecies level.

  16. Multilocus sequence typing scheme for the Mycobacterium abscessus complex.

    Science.gov (United States)

    Macheras, Edouard; Konjek, Julie; Roux, Anne-Laure; Thiberge, Jean-Michel; Bastian, Sylvaine; Leão, Sylvia Cardoso; Palaci, Moises; Sivadon-Tardy, Valérie; Gutierrez, Cristina; Richter, Elvira; Rüsch-Gerdes, Sabine; Pfyffer, Gaby E; Bodmer, Thomas; Jarlier, Vincent; Cambau, Emmanuelle; Brisse, Sylvain; Caro, Valérie; Rastogi, Nalin; Gaillard, Jean-Louis; Heym, Beate

    2014-01-01

    We developed a multilocus sequence typing (MLST) scheme for Mycobacterium abscessus sensu lato, based on the partial sequencing of seven housekeeping genes: argH, cya, glpK, gnd, murC, pta and purH. This scheme was used to characterize a collection of 227 isolates recovered between 1994 and 2010 in France, Germany, Switzerland and Brazil. We identified 100 different sequence types (STs), which were distributed into three groups on the tree obtained by concatenating the sequences of the seven housekeeping gene fragments (3576bp): the M. abscessus sensu stricto group (44 STs), the "M. massiliense" group (31 STs) and the "M. bolletii" group (25 STs). SplitTree analysis showed a degree of intergroup lateral transfers. There was also evidence of lateral transfer events involving rpoB. The most prevalent STs in our collection were ST1 (CC5; 20 isolates) and ST23 (CC3; 31 isolates). Both STs were found in Europe and Brazil, and the latter was implicated in a large post-surgical procedure outbreak in Brazil. Respiratory isolates from patients with cystic fibrosis belonged to a large variety of STs; however, ST2 was predominant in this group of patients. Our MLST scheme, publicly available at www.pasteur.fr/mlst, offers investigators a valuable typing tool for M. abscessus sensu lato in future epidemiological studies throughout the world. Copyright © 2013 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
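As an illustration of how an MLST scheme of this kind assigns a sequence type, the sketch below looks up a seven-locus allele profile in a profile table. The locus names come from the abstract; the allele numbers, the ST numbers 9001 and 9002, and the function `assign_st` are hypothetical and do not reproduce the published scheme or any database API.

```python
# Minimal sketch: map an allele profile (one allele number per locus) to a
# sequence type (ST). The table contents are invented for illustration.

LOCI = ("argH", "cya", "glpK", "gnd", "murC", "pta", "purH")

# Hypothetical profile table: tuple of allele numbers in LOCI order -> ST.
ST_TABLE = {
    (1, 1, 1, 1, 1, 1, 1): 9001,
    (2, 1, 3, 1, 1, 2, 1): 9002,
}


def assign_st(alleles):
    """Return the ST for a dict of locus -> allele number, or None if novel."""
    profile = tuple(alleles[locus] for locus in LOCI)
    return ST_TABLE.get(profile)


known = {"argH": 2, "cya": 1, "glpK": 3, "gnd": 1,
         "murC": 1, "pta": 2, "purH": 1}
novel = dict(known, purH=4)  # one allele changed, so the profile is unseen

print(assign_st(known))   # matches the second hypothetical entry
print(assign_st(novel))   # None: a new ST would have to be registered
```

An exact-match lookup reflects the usual MLST convention that any change at any locus defines a different profile; how novel profiles are curated is outside the scope of this sketch.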

  17. Human Campylobacteriosis in Luxembourg, 2010-2013: A Case-Control Study Combined with Multilocus Sequence Typing for Source Attribution and Risk Factor Analysis.

    Science.gov (United States)

    Mossong, Joël; Mughini-Gras, Lapo; Penny, Christian; Devaux, Anthony; Olinger, Christophe; Losch, Serge; Cauchie, Henry-Michel; van Pelt, Wilfrid; Ragimbeau, Catherine

    2016-02-10

    Campylobacteriosis has increased markedly in Luxembourg during recent years. We sought to determine which Campylobacter genotypes infect humans, where they may originate from, and how they may infect humans. Multilocus sequence typing was performed on 1153 Campylobacter jejuni and 136 C. coli human strains to be attributed to three putative animal reservoirs (poultry, ruminants, pigs) and to environmental water using the asymmetric island model. A nationwide case-control study (2010-2013) for domestic campylobacteriosis was also conducted, including 367 C. jejuni and 48 C. coli cases, and 624 controls. Risk factors were investigated by Campylobacter species, and for strains attributed to different sources using a combined case-control and source attribution analysis. 282 sequence types (STs) were identified: ST-21, ST-48, ST-572, ST-50 and ST-257 were prevailing. Most cases were attributed to poultry (61.2%) and ruminants (33.3%). Consuming chicken outside the home was the dominant risk factor for both Campylobacter species. Newly identified risk factors included contact with garden soil for either species, and consuming beef specifically for C. coli. Poultry-associated campylobacteriosis was linked to poultry consumption in wintertime, and ruminant-associated campylobacteriosis to tap-water provider type. Besides confirming chicken as campylobacteriosis primary source, additional evidence was found for other reservoirs and transmission routes.

  18. Human Campylobacteriosis in Luxembourg, 2010-2013: A Case-Control Study Combined with Multilocus Sequence Typing for Source Attribution and Risk Factor Analysis

    OpenAIRE

    Mossong, Joël; Mughini-Gras, Lapo; Penny, Christian; Devaux, Anthony; Olinger, Christophe; Losch, Serge; Cauchie, Henry-Michel; van Pelt, Wilfrid; Ragimbeau, Catherine

    2016-01-01

    Campylobacteriosis has increased markedly in Luxembourg during recent years. We sought to determine which Campylobacter genotypes infect humans, where they may originate from, and how they may infect humans. Multilocus sequence typing was performed on 1153 Campylobacter jejuni and 136 C. coli human strains to be attributed to three putative animal reservoirs (poultry, ruminants, pigs) and to environmental water using the asymmetric island model. A nationwide case-control study (2010-2013) for...

  19. Development of a multilocus sequence typing scheme for Ureaplasma.

    Science.gov (United States)

    Zhang, J; Kong, Y; Feng, Y; Huang, J; Song, T; Ruan, Z; Song, J; Jiang, Y; Yu, Y; Xie, X

    2014-04-01

    Ureaplasma is a commensal of the human urogenital tract but is associated with invasive diseases such as non-gonococcal urethritis, infertility, and adverse pregnancy outcomes. To better understand the molecular epidemiology and population structure of Ureaplasma, a multilocus sequence typing (MLST) scheme based on four housekeeping genes (ftsH, rpL22, valS, thrS) was developed and validated using 283 isolates, including 14 serovars of reference strains and 269 strains obtained from clinical patients. A total of 99 sequence types (STs) were revealed: the 14 type strains of the Ureaplasma serovars were assigned to 12 STs, and 87 novel and special STs appeared among the clinical isolates. ST1 and ST22 were the predominant STs, which contained 68 and 70 isolates, respectively. Two clonal lineages (CC1 and CC2) were shown by eBURST analysis, and linkage disequilibrium was revealed through a standardized index of association (I_A^S). The neighbor-joining tree results of 14 Ureaplasma serovars showed two genetically significantly distant clusters, which was highly congruent with the species taxonomy of ureaplasmas [Ureaplasma parvum (UPA) and Ureaplasma urealyticum (UUR)]. Analysis of the biotypes of 269 clinical isolates revealed that all the isolates of CC1 were UPA and those of CC2 were UUR. Additionally, CC2 was found more often in symptomatic patients with vaginitis, tubal obstruction, and cervicitis. In conclusion, this MLST scheme is adequate for investigations of molecular epidemiology and population structure with highly discriminating capacity.

  20. MULTILOCUS SEQUENCE TYPING OF BRUCELLA ISOLATES FROM THAILAND.

    Science.gov (United States)

    Chawjiraphan, Wireeya; Sonthayanon, Piengchan; Chanket, Phanita; Benjathummarak, Surachet; Kerdsin, Anusak; Kalambhaheti, Thareerat

    2016-11-01

    Although brucellosis outbreaks in Thailand are rare, they cause abortions and infertility in animals, resulting in significant economic loss. Because Brucella spp display > 90% DNA homology, multilocus sequence typing (MLST) was employed to categorize local Brucella isolates into sequence types (STs) and to determine their genetic relatedness. Brucella samples were isolated from vaginal secretion of cows and goats, and from blood cultures of infected individuals. Brucella species were determined by multiplex PCR of eight loci, in addition to MLST based on partial DNA sequences of nine house-keeping genes. MLST analysis of 36 isolates revealed 78 distinct novel allele types and 34 novel STs, while two isolates possessed the known ST8. Sequence alignments identified polymorphic sites in each allele, ranging from 2-6%, while overall genetic diversity was 3.6%. MLST analysis of the 36 Brucella isolates classified them into three species, namely, B. melitensis, B. abortus and B. suis, in agreement with multiplex PCR results. Genetic relatedness among ST members of B. melitensis and B. abortus determined by eBURST program revealed ST2 as founder of B. abortus isolates and ST8 the founder of B. melitensis isolates. ST 36, 41 and 50 of Thai Brucella isolates were identified as single locus variants of clonal cluster (CC) 8, while the majority of STs were diverse. The genetic diversity and relatedness identified using MLST revealed hitherto unexpected diversity among Thai Brucella isolates. Genetic classification of isolates could reveal the route of brucellosis transmission among humans and farm animals and also reveal their relationship with other isolates in the region and other parts of the world.
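The notion of a single locus variant used in this record can be sketched as follows: two allele profiles are single locus variants of each other when they differ at exactly one locus. This is only the pairwise comparison that underlies eBURST-style grouping, not the eBURST program itself; the nine-locus profiles, the labels, and the function names are hypothetical.

```python
# Minimal sketch: find single locus variants (SLVs) of a founder profile.
# Profiles are tuples of allele numbers over nine loci; all values are
# invented for illustration.

def differing_loci(profile_a, profile_b):
    """Count the loci at which two allele profiles differ."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same loci")
    return sum(a != b for a, b in zip(profile_a, profile_b))


def single_locus_variants(founder, profiles):
    """Return labels of profiles differing from the founder at one locus."""
    return sorted(label for label, profile in profiles.items()
                  if differing_loci(founder, profile) == 1)


founder_profile = (3, 2, 3, 2, 1, 5, 3, 8, 2)  # hypothetical founder
candidates = {
    "ST-x": (3, 2, 3, 2, 1, 5, 3, 8, 7),  # differs at the last locus only
    "ST-y": (3, 2, 4, 2, 1, 5, 3, 8, 2),  # differs at the third locus only
    "ST-z": (1, 2, 4, 2, 1, 5, 3, 9, 2),  # differs at three loci
}

slvs = single_locus_variants(founder_profile, candidates)
print(slvs)
```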

  1. Taxonomic evaluation of unidentified Streptomyces isolates in the ARS Culture Collection (NRRL) using multi-locus sequence analysis

    Science.gov (United States)

    The ARS Culture Collection (NRRL) currently contains 7569 strains within the family Streptomycetaceae but 4368 of them have not been characterized to the species level. A gene sequence database using the Bacterial Isolate Genomic Sequence Database package (BIGSdb) (Jolley & Maiden, 2010) is availabl...

  2. Multilocus Sequence Typing and Virulence-Associated Gene Profile Analysis of Staphylococcus aureus Isolates From Retail Ready-to-Eat Food in China.

    Science.gov (United States)

    Yang, Xiaojuan; Yu, Shubo; Wu, Qingping; Zhang, Jumei; Wu, Shi; Rong, Dongli

    2018-01-01

    The aim of this study was to characterize the subtypes and virulence profiles of 69 Staphylococcus aureus isolates obtained from retail ready-to-eat food in China. The isolates were analyzed using multilocus sequence typing (MLST) and polymerase chain reaction (PCR) analysis of important virulence factor genes, including the staphylococcal enterotoxin (SE) genes (sea, seb, sec, sed, see, seg, seh, sei, sej), the exfoliative toxin genes (eta and etb), the toxic shock syndrome toxin-1 gene (tst), and the Panton-Valentine leucocidin-encoding gene (pvl). The isolates encompassed 26 different sequence types (STs), including four new STs (ST3482, ST3484, ST3485, ST3504), clustered in three clonal complexes and 17 singletons. The most prevalent STs were ST1, ST6, and ST15, constituting 34.8% of all isolates. Most STs (15/26, 57.7%) detected have previously been associated with human infections. All 13 toxin genes examined were detected in the S. aureus isolates, with 84.1% of isolates containing toxin genes. The three most prevalent toxin genes were seb (36.2%), sea (33.3%), and seg (33.3%). The classical SE genes (sea-see), which contribute significantly to staphylococcal food poisoning (SFP), were detected in 72.5% of the S. aureus isolates. In addition, pvl, eta, etb, and tst were found in 11.6, 10.1, 10.1, and 7.2% of the S. aureus isolates, respectively. Strains ST6 carrying sea and ST1 harboring sec-seh enterotoxin profile, which are the two most common clones associated with SFP, were also frequently detected in the food samples in this study. This study indicates that these S. aureus isolates present in Chinese ready-to-eat food represent a potential public health risk. These data are valuable for epidemiological studies, risk management, and public health strategies.

  3. Multilocus Sequence Typing and Virulence-Associated Gene Profile Analysis of Staphylococcus aureus Isolates From Retail Ready-to-Eat Food in China

    Directory of Open Access Journals (Sweden)

    Xiaojuan Yang

    2018-03-01

    The aim of this study was to characterize the subtypes and virulence profiles of 69 Staphylococcus aureus isolates obtained from retail ready-to-eat food in China. The isolates were analyzed using multilocus sequence typing (MLST) and polymerase chain reaction (PCR) analysis of important virulence factor genes, including the staphylococcal enterotoxin (SE) genes (sea, seb, sec, sed, see, seg, seh, sei, sej), the exfoliative toxin genes (eta and etb), the toxic shock syndrome toxin-1 gene (tst), and the Panton-Valentine leucocidin-encoding gene (pvl). The isolates encompassed 26 different sequence types (STs), including four new STs (ST3482, ST3484, ST3485, ST3504), clustered in three clonal complexes and 17 singletons. The most prevalent STs were ST1, ST6, and ST15, constituting 34.8% of all isolates. Most STs (15/26, 57.7%) detected have previously been associated with human infections. All 13 toxin genes examined were detected in the S. aureus isolates, with 84.1% of isolates containing toxin genes. The three most prevalent toxin genes were seb (36.2%), sea (33.3%), and seg (33.3%). The classical SE genes (sea–see), which contribute significantly to staphylococcal food poisoning (SFP), were detected in 72.5% of the S. aureus isolates. In addition, pvl, eta, etb, and tst were found in 11.6, 10.1, 10.1, and 7.2% of the S. aureus isolates, respectively. ST6 strains carrying sea and ST1 strains harboring the sec-seh enterotoxin profile, which are the two most common clones associated with SFP, were also frequently detected in the food samples in this study. This study indicates that the S. aureus isolates present in Chinese ready-to-eat food represent a potential public health risk. These data are valuable for epidemiological studies, risk management, and public health strategies.

  4. Reconstruction of the Evolutionary History of Saccharomyces cerevisiae x S. kudriavzevii Hybrids Based on Multilocus Sequence Analysis

    Science.gov (United States)

    Peris, David; Lopes, Christian A.; Arias, Armando; Barrio, Eladio

    2012-01-01

    In recent years, interspecific hybridization and introgression are increasingly recognized as significant events in the evolution of Saccharomyces yeasts. These mechanisms have probably been involved in the origin of novel yeast genotypes and phenotypes, which in due course were to colonize and predominate in the new fermentative environments created by human manipulation. The particular conditions in which hybrids arose are still unknown, as well as the number of possible hybridization events that generated the whole set of natural hybrids described in the literature during recent years. In this study, we could infer at least six different hybridization events that originated a set of 26 S. cerevisiae x S. kudriavzevii hybrids isolated from both fermentative and non-fermentative environments. Different wine S. cerevisiae strains and European S. kudriavzevii strains were probably involved in the hybridization events according to gene sequence information, as well as from previous data on their genome composition and ploidy. Finally, we postulate that these hybrids may have originated after the introduction of vine growing and winemaking practices by the Romans to the present Northern vine-growing limits and spread during the expansion of improved viticulture and enology practices that occurred during the Late Middle Ages. PMID:23049811

  5. Reconstruction of the evolutionary history of Saccharomyces cerevisiae x S. kudriavzevii hybrids based on multilocus sequence analysis.

    Directory of Open Access Journals (Sweden)

    David Peris

    In recent years, interspecific hybridization and introgression are increasingly recognized as significant events in the evolution of Saccharomyces yeasts. These mechanisms have probably been involved in the origin of novel yeast genotypes and phenotypes, which in due course were to colonize and predominate in the new fermentative environments created by human manipulation. The particular conditions in which hybrids arose are still unknown, as well as the number of possible hybridization events that generated the whole set of natural hybrids described in the literature during recent years. In this study, we could infer at least six different hybridization events that originated a set of 26 S. cerevisiae x S. kudriavzevii hybrids isolated from both fermentative and non-fermentative environments. Different wine S. cerevisiae strains and European S. kudriavzevii strains were probably involved in the hybridization events according to gene sequence information, as well as from previous data on their genome composition and ploidy. Finally, we postulate that these hybrids may have originated after the introduction of vine growing and winemaking practices by the Romans to the present Northern vine-growing limits and spread during the expansion of improved viticulture and enology practices that occurred during the Late Middle Ages.

  6. Phylogenetic multilocus sequence analysis of indigenous slow-growing rhizobia nodulating cowpea (Vigna unguiculata L.) in Greece.

    Science.gov (United States)

    Tampakaki, Anastasia P; Fotiadis, Christos T; Ntatsi, Georgia; Savvas, Dimitrios

    2017-04-01

    Cowpea (Vigna unguiculata) is a promiscuous grain legume, capable of establishing efficient symbiosis with diverse symbiotic bacteria, mainly slow-growing rhizobial species belonging to the genus Bradyrhizobium. Although much research has been done on cowpea-nodulating bacteria in various countries around the world, little is known about the genetic and symbiotic diversity of indigenous cowpea rhizobia in European soils. In the present study, the genetic and symbiotic diversity of indigenous rhizobia isolated from field-grown cowpea nodules in three geographically different Greek regions were studied. Forty-five authenticated strains were subjected to a polyphasic approach. ERIC-PCR based fingerprinting analysis grouped the isolates into seven groups and representative strains of each group were further analyzed. The analysis of the rrs gene showed that the strains belong to different species of the genus Bradyrhizobium. The analysis of the 16S-23S IGS region showed that the strains from each geographic region were characterized by distinct IGS types which may represent novel phylogenetic lineages, closely related to the type species of Bradyrhizobium pachyrhizi, Bradyrhizobium ferriligni and Bradyrhizobium liaoningense. MLSA of three housekeeping genes (recA, glnII, and gyrB) showed the close relatedness of our strains with B. pachyrhizi PAC48T and B. liaoningense USDA 3622T and confirmed that the B. liaoningense-related isolate VUEP21 may constitute a novel species within Bradyrhizobium. Moreover, symbiotic gene phylogenies, based on nodC and nifH genes, showed that the B. pachyrhizi-related isolates belonged to symbiovar vignae, whereas the B. liaoningense-related isolates may represent a novel symbiovar. Copyright © 2017 Elsevier GmbH. All rights reserved.

  7. Complete Deletion of the Fucose Operon in Haemophilus influenzae Is Associated with a Cluster in Multilocus Sequence Analysis-Based Phylogenetic Group II Related to Haemophilus haemolyticus: Implications for Identification and Typing.

    Science.gov (United States)

    de Gier, Camilla; Kirkham, Lea-Ann S; Nørskov-Lauritsen, Niels

    2015-12-01

    Nonhemolytic variants of Haemophilus haemolyticus are difficult to differentiate from Haemophilus influenzae despite a wide difference in pathogenic potential. A previous investigation characterized a challenging set of 60 clinical strains using multiple PCRs for marker genes and described strains that could not be unequivocally identified as either species. We have analyzed the same set of strains by multilocus sequence analysis (MLSA) and near-full-length 16S rRNA gene sequencing. MLSA unambiguously allocated all study strains to either of the two species, while identification by 16S rRNA sequence was inconclusive for three strains. Notably, the two methods yielded conflicting identifications for two strains. Most of the "fuzzy species" strains were identified as H. influenzae that had undergone complete deletion of the fucose operon. Such strains, which are untypeable by the H. influenzae multilocus sequence type (MLST) scheme, have sporadically been reported and predominantly belong to a single branch of H. influenzae MLSA phylogenetic group II. We also found evidence of interspecies recombination between H. influenzae and H. haemolyticus within the 16S rRNA genes. Establishing an accurate method for rapid and inexpensive identification of H. influenzae is important for disease surveillance and treatment. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  8. In Silico Detection and Typing of Plasmids using PlasmidFinder and Plasmid Multilocus Sequence Typing

    DEFF Research Database (Denmark)

    Carattoli, Alessandra; Zankari, Ea; García-Fernández, Aurora

    2014-01-01

    In the work presented here, we designed and developed two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae. These tools will facilitate bacterial typing based on draft...... genomes of multidrug-resistant Enterobacteriaceae species by the rapid detection of known plasmid types. Replicon sequences from 559 fully sequenced plasmids associated with the family Enterobacteriaceae in the NCBI nucleotide database were collected to build a consensus database for integration...... sequences identified in the 559 fully sequenced plasmids. For plasmid multilocus sequence typing (pMLST) analysis, a database that is updated weekly was generated from www.pubmlst.org and integrated into a Web tool called pMLST. Both databases were evaluated using draft genomes from a collection...

  9. Multilocus sequence typing of IncN plasmids

    DEFF Research Database (Denmark)

    García-Fernández, Aurora; Villa, Laura; Moodley, Arshnee

    2011-01-01

    OBJECTIVES: Incompatibility group N (IncN) plasmids have been associated with the dissemination of antimicrobial resistance and are a major vehicle for the spread of blaVIM-1 in humans and blaCTX-M-1 in animals. A plasmid multilocus sequence typing (pMLST) scheme was developed for rapid...... in different countries from both animals and humans belonged to ST1, suggesting dissemination of an epidemic plasmid through the food chain. Fifteen of 17 plasmids carrying blaVIM-1 from Klebsiella pneumoniae and Escherichia coli, isolated during a 5-year period in Greece were assigned to ST10, suggesting...... that spread and persistence of this particular IncN-carrying blaVIM-1 lineage in Greece. CONCLUSIONS: This study proposes the use of pMLST as a suitable and rapid method for identification of IncN epidemic plasmid lineages. The recent spread of blaCTX-M-1 among humans and animals seems to be associated...

  10. Short read sequence typing (SRST): multi-locus sequence types from short reads

    Directory of Open Access Journals (Sweden)

    Inouye Michael

    2012-07-01

    Abstract Background Multi-locus sequence typing (MLST) has become the gold standard for population analyses of bacterial pathogens. This method focuses on the sequences of a small number of loci (usually seven) to divide the population and is simple, robust and facilitates comparison of results between laboratories and over time. Over the last decade, researchers and population health specialists have invested substantial effort in building up public MLST databases for nearly 100 different bacterial species, and these databases contain a wealth of important information linked to MLST sequence types such as time and place of isolation, host or niche, serotype and even clinical or drug resistance profiles. Recent advances in sequencing technology mean it is increasingly feasible to perform bacterial population analysis at the whole genome level. This offers massive gains in resolving power and genetic profiling compared to MLST, and will eventually replace MLST for bacterial typing and population analysis. However, given the wealth of data currently available in MLST databases, it is crucial to maintain backwards compatibility with MLST schemes so that new genome analyses can be understood in their proper historical context. Results We present a software tool, SRST, for quick and accurate retrieval of sequence types from short read sets, using inputs easily downloaded from public databases. SRST uses read mapping and an allele assignment score incorporating sequence coverage and variability, to determine the most likely allele at each MLST locus. Analysis of over 3,500 loci in more than 500 publicly accessible Illumina read sets showed SRST to be highly accurate at allele assignment. SRST output is compatible with common analysis tools such as eBURST, Clonal Frame or PhyloViz, allowing easy comparison between novel genome data and MLST data. Alignment, fastq and pileup files can also be generated for novel alleles. Conclusions SRST is a novel

  11. Development of Mycoplasma synoviae (MS) core genome multilocus sequence typing (cgMLST) scheme.

    Science.gov (United States)

    Ghanem, Mostafa; El-Gazzar, Mohamed

    2018-05-01

    Mycoplasma synoviae (MS) is a poultry pathogen with reported increased prevalence and virulence in recent years. MS strain identification is essential for prevention, control efforts and epidemiological outbreak investigations. Multiple multilocus sequence typing schemes have been developed for MS, yet the resolution of these schemes could be limited for outbreak investigation. The cost of whole genome sequencing has become close to that of sequencing the seven MLST targets; however, there is no standardized method for typing MS strains based on whole genome sequences. In this paper, we propose a core genome multilocus sequence typing (cgMLST) scheme as a standardized and reproducible method for typing MS based on whole genome sequences. A diverse set of 25 MS whole genome sequences was used to identify 302 core genome genes as cgMLST targets (35.5% of MS genome), and 44 whole genome sequences of MS isolates from six countries in four continents were typed using this scheme. cgMLST based phylogenetic trees displayed a high degree of agreement with core genome SNP based analysis and available epidemiological information. cgMLST allowed evaluation of two conventional MLST schemes of MS. The high discriminatory power of cgMLST allowed differentiation between samples of the same conventional MLST type. cgMLST represents a standardized, accurate, highly discriminatory, and reproducible method for differentiation between MS isolates. Like conventional MLST, it provides stable and expandable nomenclature, allowing for comparing and sharing the typing results between different laboratories worldwide. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
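The record above compares isolates by their cgMLST allele profiles. As a rough illustration only, one common way to compare two such profiles is to count the loci at which their allele numbers differ. The sketch below assumes that a profile is a list of allele numbers in a fixed locus order and that a locus that could not be called is stored as `None` and skipped; the function name and the data layout are hypothetical and are not taken from the cited scheme.

```python
from typing import Optional, Sequence


def allelic_distance(
    profile_a: Sequence[Optional[int]],
    profile_b: Sequence[Optional[int]],
) -> int:
    """Count loci whose allele numbers differ between two profiles.

    Assumption: loci missing (None) in either profile are ignored.
    """
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same loci in the same order")
    return sum(
        1
        for a, b in zip(profile_a, profile_b)
        if a is not None and b is not None and a != b
    )


# Hypothetical four-locus profiles; the last locus is missing in the first one.
print(allelic_distance([1, 2, 3, None], [1, 5, 3, 4]))
```

A pairwise matrix of such distances is the usual input for clustering or minimum spanning trees, although the tree-building method used in the cited study is not stated here.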

  12. Multilocus sequence typing of commensal and enteropathogenic Escherichia coli from domestic and wild lagomorphs in Italy

    Directory of Open Access Journals (Sweden)

    Giorgia Dotto

    2015-12-01

    The aim of the study was to determine the multilocus sequence types of Escherichia coli from diseased farm rabbits and apparently healthy wild lagomorphs, and the genetic relatedness among them. Fifty-five enteropathogenic E. coli from reared rabbits and 32 from wild rabbits and hares were characterised by multilocus sequence typing (MLST) according to the Michigan State University EcMLST scheme. Isolates were differentiated into 37 sequence types (STs), which were grouped into 8 clonal complexes (CCs). The most common ST was ST140 (CC31), followed by ST238 and ST119 (CC17). MLST analysis revealed 22 novel STs. Phylogenetic analyses showed a heterogeneous distribution of STs into 3 clusters of genetically related strains. The genetic relationship among STs of different origin and the detection of new, as well as previously described STs as human pathogens, indicate a widespread distribution and adaptability of particular lineages to different hosts. These findings highlight the need for further research to improve the knowledge about E. coli populations colonising the gut of lagomorphs and their zoonotic potential.

  13. Genetic Relationships among Reptilian and Mammalian Campylobacter fetus Strains Determined by Multilocus Sequence Typing

    NARCIS (Netherlands)

    Dingle, K.E.; Blaser, M.J.; Tu, Z.C.; Pruckler, J.; Fitzgerald, C.; Bergen, van M.A.P.; Lawson, A.J.; Owen, R.J.; Wagenaar, J.A.

    2010-01-01

    Reptile Campylobacter fetus isolates and closely related strains causing human disease were characterized by multilocus sequence typing. They shared approximately 90% nucleotide sequence identity with classical mammalian C. fetus, and there was evidence of recombination among members of these two

  14. Multilocus sequence typing confirms synonymy but highlights differences between Candida albicans and Candida stellatoidea.

    NARCIS (Netherlands)

    Jacobsen, M.D.; Boekhout, T.; Odds, F.C.

    2008-01-01

    We used multi-locus sequence typing (MLST) to investigate 35 yeast isolates representing the two genome-sequenced strains plus the type strain of Candida albicans, four isolates originally identified as Candida stellatoidea type I and 28 representing type strains of other species now regarded as

  15. Delineation of the species Haemophilus influenzae by phenotype, multilocus sequence phylogeny, and detection of marker genes

    DEFF Research Database (Denmark)

    Nørskov-Lauritsen, Niels; Overballe, MD; Kilian, Mogens

    2009-01-01

    To obtain more information on the much-debated definition of prokaryotic species, we investigated the borders of Haemophilus influenzae by comparative analysis of H. influenzae reference strains with closely related bacteria including strains assigned to Haemophilus haemolyticus, cryptic genospecies biotype IV, and the never formally validated species "Haemophilus intermedius". Multilocus sequence phylogeny based on six housekeeping genes separated a cluster encompassing the type and the reference strains of H. influenzae from 31 more distantly related strains. Comparison of 16S rRNA gene...

  16. MultiLocus Sequence Analysis- and Amplified Fragment Length Polymorphism-based characterization of xanthomonads associated with bacterial spot of tomato and pepper and their relatedness to Xanthomonas species.

    Science.gov (United States)

    Hamza, A A; Robene-Soustrade, I; Jouen, E; Lefeuvre, P; Chiroleu, F; Fisher-Le Saux, M; Gagnevin, L; Pruvost, O

    2012-05-01

    MultiLocus Sequence Analysis (MLSA) and Amplified Fragment Length Polymorphism (AFLP) were used to measure the genetic relatedness of a comprehensive collection of xanthomonads pathogenic to solanaceous hosts to Xanthomonas species. The MLSA scheme was based on partial sequences of four housekeeping genes (atpD, dnaK, efp and gyrB). Globally, MLSA data unambiguously identified strains causing bacterial spot of tomato and pepper at the species level and were consistent with AFLP data. Genetic distances derived from both techniques showed a close relatedness of (i) X. euvesicatoria, X. perforans and X. alfalfae and (ii) X. gardneri and X. cynarae. Maximum likelihood tree topologies derived from each gene portion and the concatenated data set for species in the X. campestris 16S rRNA core (i.e. the species cluster comprising all strains causing bacterial spot of tomato and pepper) were not congruent, consistent with the detection of several putative recombination events in our data sets by several recombination search algorithms. One recombinant region in atpD was identified in most strains of X. euvesicatoria including the type strain. Copyright © 2012 Elsevier GmbH. All rights reserved.

  17. An Extended Multilocus Sequence Typing (MLST) Scheme for Rapid Direct Typing of Leptospira from Clinical Samples

    OpenAIRE

    Weiss, Sabrina; Menezes, Angela; Woods, Kate; Chanthongthip, Anisone; Dittrich, Sabine; Opoku-Boateng, Agatha; Kimuli, Maimuna; Chalker, Victoria

    2016-01-01

    Background Rapid typing of Leptospira is currently impaired by requiring time consuming culture of leptospires. The objective of this study was to develop an assay that provides multilocus sequence typing (MLST) data direct from patient specimens while minimising costs for subsequent sequencing. Methodology and Findings An existing PCR based MLST scheme was modified by designing nested primers including anchors for facilitated subsequent sequencing. The assay was applied to various specimen t...

  18. Investigation of genetic diversity and epidemiological characteristics of Pasteurella multocida isolates from poultry in southwest China by population structure, multi-locus sequence typing and virulence-associated gene profile analysis.

    Science.gov (United States)

    Li, Zhangcheng; Cheng, Fangjun; Lan, Shimei; Guo, Jianhua; Liu, Wei; Li, Xiaoyan; Luo, Zeli; Zhang, Manli; Wu, Juan; Shi, Yang

    2018-04-25

    Fowl cholera caused by Pasteurella multocida has always been a disease of global importance for poultry production. The aim of this study was to obtain more information about the epidemiology of avian P. multocida infection in southwest China and the genetic characteristics of clinical isolates. P. multocida isolates were characterized by biochemical and molecular-biological methods. The distributions of the capsular serogroups, the phenotypic antimicrobial resistance profiles, lipopolysaccharide (LPS) genotypes and the presence of 19 virulence genes were investigated in 45 isolates of P. multocida that were associated with clinical disease in poultry. The genetic diversity of P. multocida strains was assessed by 16S rRNA and rpoB gene sequence analysis as well as multilocus sequence typing (MLST). The results showed that most (80.0%) of the P. multocida isolates in this study represented special P. multocida subspecies, and 71.1% of the isolates showed multiple-drug resistance. All 45 isolates belonged to capsular type A (100%) and to two LPS genotypes: L1 (95.6%) and L3 (4.4%). MLST revealed two new alleles (pmi77 and gdh57) and one new sequence type (ST342). ST129 dominated among the 45 P. multocida isolates. Isolates belonging to ST129 carried the genes ompH+plpB+ptfA+tonB, whereas ST342 included isolates with fur+hgbA+tonB genes. Population genetic analysis and the MLST results revealed that at least one new ST genotype was present in the avian P. multocida in China. These findings provide novel insights into the epidemiological characteristics of avian P. multocida isolates in southwest China.

  19. Rickettsia asembonensis Characterization by Multilocus Sequence Typing of Complete Genes, Peru.

    Science.gov (United States)

    Loyola, Steev; Flores-Mendoza, Carmen; Torre, Armando; Kocher, Claudine; Melendrez, Melanie; Luce-Fedrow, Alison; Maina, Alice N; Richards, Allen L; Leguia, Mariana

    2018-05-01

    While studying rickettsial infections in Peru, we detected Rickettsia asembonensis in fleas from domestic animals. We characterized 5 complete genomic regions (17kDa, gltA, ompA, ompB, and sca4) and conducted multilocus sequence typing and phylogenetic analyses. The molecular isolate from Peru is distinct from the original R. asembonensis strain from Kenya.

  20. Core Genome Multilocus Sequence Typing Scheme for High-resolution Typing of Enterococcus faecium

    DEFF Research Database (Denmark)

    de Been, Mark; Pinholt, Mette; Top, Janetta

    2015-01-01

    Enterococcus faecium, a common inhabitant of the human gut, has emerged as an important multidrug-resistant nosocomial pathogen in the last two decades. Since the start of the 21st century, multi-locus sequence typing (MLST) has been used to study the molecular epidemiology of E. faecium. However...

  1. A single multilocus sequence typing (MLST) scheme for seven pathogenic Leptospira species

    NARCIS (Netherlands)

    Boonsilp, Siriphan; Thaipadungpanit, Janjira; Amornchai, Premjit; Wuthiekanun, Vanaporn; Bailey, Mark S.; Holden, Matthew T. G.; Zhang, Cuicai; Jiang, Xiugao; Koizumi, Nobuo; Taylor, Kyle; Galloway, Renee; Hoffmaster, Alex R.; Craig, Scott; Smythe, Lee D.; Hartskeerl, Rudy A.; Day, Nicholas P.; Chantratita, Narisara; Feil, Edward J.; Aanensen, David M.; Spratt, Brian G.; Peacock, Sharon J.

    2013-01-01

    The available Leptospira multilocus sequence typing (MLST) scheme supported by a MLST website is limited to L. interrogans and L. kirschneri. Our aim was to broaden the utility of this scheme to incorporate a total of seven pathogenic species. We modified the existing scheme by replacing one of the

  2. Taxonomic evaluation of Streptomyces albus and related species using multilocus sequence analysis and proposals to emend the description of Streptomyces albus and describe Streptomyces pathocidini sp. nov

    Science.gov (United States)

    In phylogenetic analyses of the genus Streptomyces using 16S rRNA gene sequences, Streptomyces albus subsp. albus NRRL B-1811T forms a cluster with 5 other species having identical or nearly identical 16S rRNA gene sequences. Moreover, the morphological and physiological characteristics of these oth...

  3. Genotyping of B. licheniformis based on a novel multi-locus sequence typing (MLST) scheme

    Directory of Open Access Journals (Sweden)

    Madslien Elisabeth H

    2012-10-01

    Abstract Background Bacillus licheniformis has for many years been used in the industrial production of enzymes, antibiotics and detergents. However, as a producer of dormant heat-resistant endospores B. licheniformis might contaminate semi-preserved foods. The aim of this study was to establish a robust and novel genotyping scheme for B. licheniformis in order to reveal the evolutionary history of 53 strains of this species. Furthermore, the genotyping scheme was also investigated for its use to detect food-contaminating strains. Results A multi-locus sequence typing (MLST) scheme, based on the sequence of six house-keeping genes (adk, ccpA, recF, rpoB, spo0A and sucC) of 53 B. licheniformis strains from different sources was established. The result of the MLST analysis supported previous findings of two different subgroups (lineages) within this species, named “A” and “B”. Statistical analysis of the MLST data indicated a higher rate of recombination within group “A”. Food isolates were widely dispersed in the MLST tree and could not be distinguished from the other strains. However, the food contaminating strain B. licheniformis NVH1032, represented by a unique sequence type (ST8), was distantly related to all other strains. Conclusions In this study, a novel and robust genotyping scheme for B. licheniformis was established, separating the species into two subgroups. This scheme could be used for further studies of evolution and population genetics in B. licheniformis.

  4. Population Genetic Structure of Listeria monocytogenes Strains as Determined by Pulsed-Field Gel Electrophoresis and Multilocus Sequence Typing

    DEFF Research Database (Denmark)

    Henri, Clémentine; Félix, Benjamin; Guillier, Laurent

    2016-01-01

    on the basis of different pulsed-field gel electrophoresis (PFGE) clusters, serotypes, and strain origins and typed by multilocus sequence typing (MLST), and the MLST results were supplemented with MLST data available from Institut Pasteur, representing human and additional food strains from France....... The distribution of sequence types (STs) was compared between food and clinical strains on a panel of 675 strains. High congruence between PFGE and MLST was found. Out of 73 PFGE clusters, the two most prevalent corresponded to ST9 and ST121. Using original statistical analysis, we demonstrated that (i...

  5. Taxonomic evaluation of species in the Streptomyces hirsutus clade using multi-locus sequence analysis and proposals to reclassify several species in this clade

    Science.gov (United States)

    Previous phylogenetic analyses of species of Streptomyces based on 16S rRNA gene sequences resulted in a statistically well-supported clade (100% bootstrap value) containing 8 species that exhibited very similar gross morphology in producing open looped (Retinaculum-Apertum) to spiral (Spira) chains...

  6. Characterization of Pasteurella multocida associated with ovine pneumonia using multi-locus sequence typing (MLST) and virulence-associated gene profile analysis and comparison with porcine isolates.

    Science.gov (United States)

    García-Alvarez, Andrés; Vela, Ana Isabel; San Martín, Elvira; Chaves, Fernando; Fernández-Garayzábal, José Francisco; Lucas, Domínguez; Cid, Dolores

    2017-05-01

    Pasteurella multocida is a pathogen causing disease in a wide range of hosts including sheep and pigs. Isolates from ovine pneumonia were characterized by MLST (Multi-host and RIRDC databases) and virulence-associated gene (VAG) typing and compared with porcine isolates. Ovine and porcine isolates did not share any STs as determined by both schemes and exhibited different VAG profiles. With the Multi-host database, sixteen STs were identified among 43 sheep isolates with two STs (ST50 and ST19) comprising 53.5% of the isolates, and seven MLST genotypes (ST3, ST11 and ST62 included 75% of the isolates) among the 48 pig isolates. The most frequent VAG profile among sheep isolates was tbpA+/toxA+ (69.8% of isolates), whereas pfhA+ (62.5%) and hgbB+ (33.3%) were the most frequent among pig isolates. Representative ovine and porcine isolates of those STs identified by the Multi-host scheme were further typed using the RIRDC scheme. Seven STs were identified among the ovine isolates (ST95 RIRDC, ST131 RIRDC, ST203 RIRDC, ST320 RIRDC, ST324 RIRDC, ST321 RIRDC, and ST323 RIRDC), with the latter four sequence types being new STs identified in this study, and six STs (ST9 RIRDC, ST13 RIRDC, ST27 RIRDC, ST50 RIRDC, and ST74 RIRDC and a new sequence type ST322 RIRDC) among the porcine isolates. STs identified among ovine isolates have been detected exclusively in small ruminants, suggesting an adaptation to these hosts, while the genotypes identified among pig isolates have been previously identified in multiple hosts and therefore they are not restricted to pigs. The differences in genotypes and VAG profiles between ovine and pig isolates suggest they could represent different subpopulations of P. multocida. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. Characterisation of the genetic diversity of Brucella by multilocus sequencing

    Directory of Open Access Journals (Sweden)

    MacMillan Alastair P

    2007-04-01

    Abstract Background Brucella species include economically important zoonotic pathogens that can infect a wide range of animals. There are currently six classically recognised species of Brucella although, as yet unnamed, isolates from various marine mammal species have been reported. In order to investigate genetic relationships within the group and identify potential diagnostic markers we have sequenced multiple genetic loci from a large sample of Brucella isolates representing the known diversity of the genus. Results Nine discrete genomic loci corresponding to 4,396 bp of sequence were examined from 160 Brucella isolates. By assigning each distinct allele at a locus an arbitrary numerical designation the population was found to represent 27 distinct sequence types (STs). Diversity at each locus ranged from 1.03–2.45% while overall genetic diversity equated to 1.5%. Most loci examined represent housekeeping gene loci and, in all but one case, the ratio of non-synonymous to synonymous change was substantially... Brucella species, B. abortus, B. melitensis, B. ovis and B. neotomae correspond to well-separated clusters. With the exception of biovar 5, B. suis isolates cluster together, although they form a more diverse group than other classical species with a number of distinct STs corresponding to the remaining four biovars. B. canis isolates are located on the same branch very closely related to, but distinguishable from, B. suis biovar 3 and 4 isolates. Marine mammal isolates represent a distinct, though rather weakly supported, cluster within which individual STs display one of three clear host preferences. Conclusion The sequence database provides a powerful dataset for addressing ongoing controversies in Brucella taxonomy and a tool for unambiguously placing atypical, phenotypically discordant or newly emerging Brucella isolates. Furthermore, by using the phylogenetic backbone described here, robust and rationally selected markers for use in

  8. Multiple-locus variable-number tandem repeat analysis of Neisseria meningitidis yields groupings similar to those obtained by multilocus sequence typing.

    NARCIS (Netherlands)

    Schouls, Leo M; Ende, Arie van der; Damen, Marjolein; Pol, Ingrid van de

    2006-01-01

    We identified many variable-number tandem repeat (VNTR) loci in the genomes of Neisseria meningitidis serogroups A, B, and C and utilized a number of these loci to develop a multiple-locus variable-number tandem repeat analysis (MLVA). Eighty-five N. meningitidis serogroup B and C isolates obtained

  9. Determining Clostridium difficile intra-taxa diversity by mining multilocus sequence typing databases.

    Science.gov (United States)

    Muñoz, Marina; Ríos-Chaparro, Dora Inés; Patarroyo, Manuel Alfonso; Ramírez, Juan David

    2017-03-14

    Multilocus sequence typing (MLST) is a highly discriminatory typing strategy; it is reproducible and scalable. There is an MLST scheme for Clostridium difficile (CD), a Gram-positive bacillus causing different pathologies of the gastrointestinal tract. This work was aimed at describing the frequency of the sequence types (STs) and clades (C) reported and evaluating the intra-taxa diversity in the CD MLST database (CD-MLST-db) using an MLSA approach. Analysis of 1778 available isolates showed that clade 1 (C1) was the most frequent worldwide (57.7%), followed by C2 (29.1%). Regarding sequence types (STs), it was found that ST-1, belonging to C2, was the most frequent. The isolates analysed came from 17 countries, mostly from the United Kingdom (UK) (1541 STs, 87.0%). The diversity of the seven housekeeping genes in the MLST scheme, and of the alleles from the profiles (STs), was evaluated to identify CD population structure. It was found that adk and atpA are conserved genes allowing a limited number of clusters to be discriminated; however, different genes such as dxr, glyA and particularly sodA showed high diversity indexes and grouped CD populations in many clusters, suggesting that these genes' contribution to CD typing should be revised. It was identified that CD STs reported to date have a mostly clonal population structure with foreseen events of recombination; however, one group of STs was not assigned to a clade, being highly divergent and containing at least nine well-supported clusters, suggesting a greater number of clades for CD. This study shows the usefulness of CD-MLST-db as a tool for studying CD distribution and population structure, identifying the need for reviewing the usefulness of sodA as a housekeeping gene within the MLST scheme and suggesting the existence of a greater number of CD clades. The study also shows the plausible exchange of genetic material between STs, contributing towards intra-taxa genetic diversity.

  10. A Single Multilocus Sequence Typing (MLST) Scheme for Seven Pathogenic Leptospira Species

    Science.gov (United States)

    Amornchai, Premjit; Wuthiekanun, Vanaporn; Bailey, Mark S.; Holden, Matthew T. G.; Zhang, Cuicai; Jiang, Xiugao; Koizumi, Nobuo; Taylor, Kyle; Galloway, Renee; Hoffmaster, Alex R.; Craig, Scott; Smythe, Lee D.; Hartskeerl, Rudy A.; Day, Nicholas P.; Chantratita, Narisara; Feil, Edward J.; Aanensen, David M.; Spratt, Brian G.; Peacock, Sharon J.

    2013-01-01

    Background: The available Leptospira multilocus sequence typing (MLST) scheme supported by an MLST website is limited to L. interrogans and L. kirschneri. Our aim was to broaden the utility of this scheme to incorporate a total of seven pathogenic species. Methodology and Findings: We modified the existing scheme by replacing one of the seven MLST loci (fadD was changed to caiB), as the former gene did not appear to be present in some pathogenic species. Comparison of the original and modified schemes using data for L. interrogans and L. kirschneri demonstrated that the discriminatory power of the two schemes was not significantly different. The modified scheme was used to further characterize 325 isolates (L. alexanderi [n = 5], L. borgpetersenii [n = 34], L. interrogans [n = 222], L. kirschneri [n = 29], L. noguchii [n = 9], L. santarosai [n = 10], and L. weilii [n = 16]). Phylogenetic analysis using concatenated sequences of the 7 loci demonstrated that each species corresponded to a discrete clade, and that no strains were misclassified at the species level. Comparison between genotype and serovar was possible for 254 isolates. Of the 31 sequence types (STs) represented by at least two isolates, 18 STs included isolates assigned to two or three different serovars. Conversely, 14 serovars were identified that contained between 2 and 10 different STs. New observations were made on the global phylogeography of Leptospira spp., and the utility of MLST in making associations between human disease and specific maintenance hosts was demonstrated. Conclusion: The new MLST scheme, supported by an updated MLST website, allows the characterization and species assignment of isolates of the seven major pathogenic species associated with leptospirosis. PMID:23359622
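
    The phylogenetic step described above (concatenating the sequences of all loci before tree building) can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the locus names, isolate names, and toy sequences are placeholders, and the sketch assumes each locus has already been aligned and trimmed to a common length. The uncorrected p-distance is included only as one simple way to compare concatenated sequences.

```python
from typing import Dict, List


def concatenate_loci(isolates: Dict[str, Dict[str, str]], loci: List[str]) -> Dict[str, str]:
    """Join each isolate's locus sequences in one fixed locus order."""
    concatenated = {}
    for name, seqs in isolates.items():
        missing = [locus for locus in loci if locus not in seqs]
        if missing:
            raise KeyError(f"{name} lacks loci: {missing}")
        concatenated[name] = "".join(seqs[locus].upper() for locus in loci)
    return concatenated


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    if not a:
        raise ValueError("sequences must not be empty")
    return sum(x != y for x, y in zip(a, b)) / len(a)


# Toy data: locus names and sequences are placeholders, not real alleles.
LOCI = ["locus1", "locus2"]
ISOLATES = {
    "isolate_a": {"locus1": "ACGT", "locus2": "TTGA"},
    "isolate_b": {"locus2": "TTGC", "locus1": "ACGT"},
}
CONCAT = concatenate_loci(ISOLATES, LOCI)
DIST = p_distance(CONCAT["isolate_a"], CONCAT["isolate_b"])
```

    Fixing the locus order in one list matters because concatenated sequences are only comparable site by site when every isolate contributes its loci in the same order.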

  11. An Extended Multilocus Sequence Typing (MLST) Scheme for Rapid Direct Typing of Leptospira from Clinical Samples.

    Directory of Open Access Journals (Sweden)

    Sabrina Weiss

    2016-09-01

    Rapid typing of Leptospira is currently impaired by requiring time-consuming culture of leptospires. The objective of this study was to develop an assay that provides multilocus sequence typing (MLST) data directly from patient specimens while minimising costs for subsequent sequencing. An existing PCR-based MLST scheme was modified by designing nested primers including anchors for facilitated subsequent sequencing. The assay was applied to various specimen types from patients diagnosed with leptospirosis between 2014 and 2015 in the United Kingdom (UK) and the Lao People's Democratic Republic (Lao PDR). Of 44 clinical samples (23 serum, 6 whole blood, 3 buffy coat, 12 urine) PCR positive for pathogenic Leptospira spp., at least one allele was amplified in 22 samples (50%) and used for phylogenetic inference. Full allelic profiles were obtained from ten specimens, representing all sample types (23%). No nonspecific amplicons were observed in any of the samples. Of twelve PCR-positive urine specimens, three gave full allelic profiles (25%) and two a partial profile. Phylogenetic analysis allowed for species assignment. The predominant species detected was L. interrogans (10/14 and 7/8 from UK and Lao PDR, respectively). All other species were detected in samples from only one country (Lao PDR: L. borgpetersenii [1/8]; UK: L. kirschneri [1/14], L. santarosai [1/14], L. weilii [2/14]). Typing information of pathogenic Leptospira spp. was obtained directly from a variety of clinical samples using a modified MLST assay. This assay negates the need for time-consuming culture of Leptospira prior to typing and will be of use both in surveillance, as single alleles enable species determination, and in outbreaks for the rapid identification of clusters.

  12. Diversification of the silverspot butterflies (Nymphalidae) in the Neotropics inferred from multi-locus DNA sequences.

    Science.gov (United States)

    Massardo, Darli; Fornel, Rodrigo; Kronforst, Marcus; Gonçalves, Gislene Lopes; Moreira, Gilson Rudinei Pires

    2015-01-01

    The tribe Heliconiini (Lepidoptera: Nymphalidae) is a diverse group of butterflies distributed throughout the Neotropics, which has been studied extensively, in particular the genus Heliconius. However, most of the other lineages, such as Dione, which are less diverse and considered basal within the group, have received little attention. Basic information, such as species limits and geographical distributions, remains uncertain for this genus. Here we used multilocus DNA sequence data and geographical distribution analysis across the entire range of Dione in the Neotropical region in order to make inferences on the evolutionary history of this poorly explored lineage. Bayesian time-tree reconstruction allows inferring two major diversification events in this tribe around 25 mya. Lineages thought to be ancient, such as Dione and Agraulis, are as recent as Heliconius. Dione formed a monophyletic clade, sister to the genus Agraulis. Dione juno, D. glycera and D. moneta were reciprocally monophyletic and formed genetic clusters, with the first two more closely related to each other than to the third. Divergence time estimates support the hypothesis that speciation in Dione coincided with both the rise of Passifloraceae (the host plants) and the uplift of the Andes. Since the sister species D. glycera and D. moneta are specialized feeders on passion-vine lineages that are endemic to areas located either within or adjacent to the Andes, we inferred that they co-speciated with their host plants during this vicariant event. Copyright © 2014 Elsevier Inc. All rights reserved.

  13. Genetic diversity of clinical isolates of Bacillus cereus using multilocus sequence typing

    Directory of Open Access Journals (Sweden)

    Pruckler James M

    2008-11-01

    Abstract Background: Bacillus cereus is most commonly associated with foodborne illness (diarrheal and emetic) but is also an opportunistic pathogen that can cause severe and fatal infections. Several multilocus sequence typing (MLST) schemes have recently been developed to genotype B. cereus and analysis has suggested a clonal or weakly clonal population structure for B. cereus and its close relatives B. anthracis and B. thuringiensis. In this study we used MLST to determine if B. cereus isolates associated with illnesses of varying severity (e.g., severe, systemic vs. gastrointestinal (GI) illness) were clonal or formed clonal complexes. Results: A retrospective analysis of 55 clinical B. cereus isolates submitted to the Centers for Disease Control and Prevention between 1954 and 2004 was conducted. Clinical isolates from severe infections (n = 27), gastrointestinal (GI) illness (n = 18), and associated isolates from food (n = 10) were selected for analysis using MLST. The 55 isolates were diverse and comprised 38 sequence types (STs) in two distinct clades. Of the 27 isolates associated with serious illness, 13 clustered in clade 1 while 14 were in clade 2. Isolates associated with GI illness were also found throughout clades 1 and 2, while no isolates in this study belonged to clade 3. All the isolates from this study belonging to the clade 1/cereus III lineage were associated with severe disease while isolates belonging to clade 1/cereus II contained isolates primarily associated with severe disease and emetic illness. Only three STs were observed more than once for epidemiologically distinct isolates. Conclusion: STs of clinical B. cereus isolates were phylogenetically diverse and distributed among two of three previously described clades. Greater numbers of strains will need to be analyzed to confirm if specific lineages or clonal complexes are more likely to contain clinical isolates or be associated with specific illness, similar to B. anthracis and

  14. Expression of Sme efflux pumps and multilocus sequence typing in clinical isolates of Stenotrophomonas maltophilia.

    Science.gov (United States)

    Cho, Hye Hyun; Sung, Ji Youn; Kwon, Kye Chul; Koo, Sun Hoe

    2012-01-01

    Stenotrophomonas maltophilia has emerged as an important opportunistic pathogen, which causes infections that are often difficult to manage because of the inherent resistance of the pathogen to a variety of antimicrobial agents. In this study, we analyzed the expression of smeABC and smeDEF and its correlation with antimicrobial susceptibility. We also evaluated the genetic relatedness and epidemiological links among 33 isolates of S. maltophilia. In total, 33 S. maltophilia strains were isolated from patients in a tertiary hospital in Daejeon. Minimum inhibitory concentrations (MICs) of 11 antimicrobial agents were determined using the agar dilution method and the E-test (BioMérieux, France). Real-time PCR analysis was performed to evaluate the expression of the Sme efflux systems in the S. maltophilia isolates. Additionally, an epidemiological investigation was performed using multilocus sequence typing (MLST) assays. The findings of susceptibility testing showed that the majority of the S. maltophilia isolates were resistant to β-lactams and aminoglycosides. Twenty-one clinical isolates overexpressed smeABC and showed high resistance to ciprofloxacin. Moreover, a high degree of genetic diversity was observed among the S. maltophilia isolates; 3 sequence types (STs) and 23 allelic profiles were observed. The smeABC efflux pump was associated with multidrug resistance in clinical isolates of S. maltophilia. In particular, smeABC efflux pumps appear to perform an important role in ciprofloxacin resistance of S. maltophilia. The MLST scheme for S. maltophilia represents a discriminatory typing method with stable markers and is appropriate for studying population structures.

  15. Systematic characterization of Bacillus Genetic Stock Center Bacillus thuringiensis strains using Multi-Locus Sequence Typing.

    Science.gov (United States)

    Wang, Kui; Shu, Changlong; Soberón, Mario; Bravo, Alejandra; Zhang, Jie

    2018-04-30

    The goal of this work was to perform a systematic characterization of Bacillus thuringiensis (Bt) strains from the Bacillus Genetic Stock Center (BGSC) collection using Multi-Locus Sequence Typing (MLST). Different genetic markers were analyzed in 158 Bt strains from 73 different serovars stored in the BGSC, representing 92% of the different Bt serovars of the BGSC; the remaining 8% were not analyzed because they were not available. In addition, we analyzed 72 Bt strains from 18 serovars available at the pubMLST bcereus database, and Bt strains G03, HBF18 and Bt185, with no H serovars, provided by our laboratory. We performed a systematic MLST analysis using seven housekeeping genes (glpF, gmK, ilvD, pta, pur, pycA and tpi) and analyzed the correlation of the results of this analysis with strain serovars. The 233 Bt strains analyzed were assigned to 119 STs, of which 19 STs were new. Genetic relationships were established by phylogenetic analysis and showed that STs could be grouped in two major Clusters containing 21 sub-groups. We found that a significant number of STs (101 in total) correlated with specific serovars, such as ST13, which corresponded to nine Bt isolates from B. thuringiensis serovar kenyae. However, other serovars showed high genetic variability and correlated with multiple STs; for example, B. thuringiensis serovar morrisoni correlated with 11 different STs. In addition, we found that 16 different STs correlated with multiple serovars (2-4 different serovars); for example, ST12 correlated with B. thuringiensis serovar alesti, dakota, palmanyolensis and sotto/dendrolimus. These data indicated that only partial correspondence between MLST and serotyping can be established. Copyright © 2018 Elsevier Inc. All rights reserved.

  16. Multilocus sequence typing of Streptococcus thermophilus from naturally fermented dairy foods in China and Mongolia.

    Science.gov (United States)

    Yu, Jie; Sun, Zhihong; Liu, Wenjun; Xi, Xiaoxia; Song, Yuqin; Xu, Haiyan; Lv, Qiang; Bao, Qiuhua; Menghe, Bilige; Sun, Tiansong

    2015-10-26

    Streptococcus thermophilus is a major dairy starter used for manufacturing of dairy products. In the present study, we developed a multilocus sequence typing (MLST) scheme for this important food bacterium. Sequences of 10 housekeeping genes (carB, clpX, dnaA, murC, murE, pepN, pepX, pyrG, recA, and rpoB) were obtained for 239 S. thermophilus strains, which were isolated from home-made fermented dairy foods in 18 different regions of Mongolia and China. All 10 genes of S. thermophilus were sequenced and aligned, and sequence types (STs) were defined using the BioNumerics software. The nucleotide diversity was calculated by START v2.0. The population structure, phylogenetic relationships and the role of recombination were inferred using ClonalFrame v1.2, SplitsTree 4.0 and Structure v2.3. The 239 S. thermophilus isolates and 18 reference strains could be assigned into 119 different STs, which could be further separated into 16 clonal complexes (CCs) and 38 singletons. Among the 10 loci, a total of 132 polymorphic sites were detected. The standardized index of association (I^S_A = 0.0916), split-decomposition, ρ/θ (relative frequency of occurrence of recombination and mutation) and r/m values (relative impact of recombination and mutation in the diversification) confirm that recombination may have occurred, but it occurred at a low frequency in these 10 loci. Phylogenetic trees indicated that there were five lineages in the S. thermophilus isolates used in our study. MSTree and ClonalFrame tree analyses suggest that the evolution of S. thermophilus isolates has little relationship with geographic locality, but revealed no association with the types of fermented dairy product. Phylogenetic analysis of 36 whole genome strains (18 S. thermophilus, 2 S. vestibularis and 16 S. salivarius strains) indicated that our MLST scheme could clearly separate three closely related species within the salivarius group and is suitable for analyzing the population structure of the

  17. Comparison of double-locus sequence typing (DLST) and multilocus sequence typing (MLST) for the investigation of Pseudomonas aeruginosa populations.

    Science.gov (United States)

    Cholley, Pascal; Stojanov, Milos; Hocquet, Didier; Thouverez, Michelle; Bertrand, Xavier; Blanc, Dominique S

    2015-08-01

    Reliable molecular typing methods are necessary to investigate the epidemiology of bacterial pathogens. Reference methods such as multilocus sequence typing (MLST) and pulsed-field gel electrophoresis (PFGE) are costly and time consuming. Here, we compared our newly developed double-locus sequence typing (DLST) method for Pseudomonas aeruginosa to MLST and PFGE on a collection of 281 isolates. DLST was as discriminatory as MLST and was able to recognize "high-risk" epidemic clones. Both methods were highly congruent. Not surprisingly, a higher discriminatory power was observed with PFGE. In conclusion, being a simple method (single-strand sequencing of only 2 loci), DLST is valuable as a first-line typing tool for epidemiological investigations of P. aeruginosa. Coupled to a more discriminant method like PFGE or whole genome sequencing, it might represent an efficient typing strategy to investigate or prevent outbreaks. Copyright © 2015 Elsevier Inc. All rights reserved.
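
    The abstract above compares the discriminatory power of typing methods without saying how it was measured. One commonly used measure is Simpson's index of diversity as applied to typing schemes by Hunter and Gaston, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where N is the number of isolates and n_j the number of isolates of type j. The sketch below assumes that index; the function name and the toy type labels are illustrative and are not data from the study.

```python
from collections import Counter
from typing import Hashable, Sequence


def discriminatory_index(types: Sequence[Hashable]) -> float:
    """Probability that two isolates drawn without replacement have different types."""
    n = len(types)
    if n < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(types)
    # Number of ordered pairs of isolates that share a type.
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Toy example: four isolates, two of which share a type.
EXAMPLE_TYPES = ["ST1", "ST1", "ST2", "ST3"]
EXAMPLE_D = discriminatory_index(EXAMPLE_TYPES)
```

    Applying the same function to the type assignments produced by two methods for the same isolate collection gives directly comparable values: 0 when every isolate has the same type, 1 when every isolate has a different type.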

  18. The Comparison of Streptococcus agalactiae Isolated from Fish and Bovine using Multilocus Sequence Typing

    Directory of Open Access Journals (Sweden)

    ANGELA MARIANA LUSIASTUTI

    2013-12-01

    Multilocus sequence typing (MLST) has greater utility for determining the recent ancestral lineage and the relatedness of individual strains. Group B streptococci (GBS) are one of the major causes of subclinical mastitis of dairy cattle in several countries. GBS also sporadically cause epizootic infections in fish. The aim of this study was to compare the evolutionary lineage of fish and bovine isolates in relation to the S. agalactiae global population as a whole by comparing the MLST profiles. Twenty S. agalactiae isolates were obtained from dairy cattle and fish. PCR products were amplified with seven different oligonucleotide primer pairs designed from the NEM316 GBS genome sequence. Clonal complexes demonstrated that bovine and fish isolates were separate populations. These findings lead us to conclude that fish S. agalactiae is not a zoonotic agent for cattle. MLST could help to clarify the emergence of pathogenic clones and to decide whether the host acts as a reservoir for another pathogenic lineage.

  19. Population structure of Lactobacillus helveticus isolates from naturally fermented dairy products based on multilocus sequence typing.

    Science.gov (United States)

    Sun, Zhihong; Liu, Wenjun; Song, Yuqin; Xu, Haiyan; Yu, Jie; Bilige, Menghe; Zhang, Heping; Chen, Yongfu

    2015-05-01

    Lactobacillus helveticus is an economically important lactic acid bacterium used in industrial dairy fermentation. In the present study, the population structure of 245 isolates of L. helveticus from different naturally fermented dairy products in China and Mongolia was investigated using a multilocus sequence typing scheme with 11 housekeeping genes. A total of 108 sequence types were detected, which formed 8 clonal complexes and 27 singletons. Results from Structure, SplitsTree, and ClonalFrame software analyses demonstrated the presence of 3 subpopulations in the L. helveticus isolates used in our study, namely koumiss, kurut-tarag, and panmictic lineages. Most L. helveticus isolates from particular ecological origins had specific population structures. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  20. Unravelling the Molecular Epidemiology and Genetic Diversity among Burkholderia pseudomallei Isolates from South India Using Multi-Locus Sequence Typing.

    Science.gov (United States)

    Tellapragada, Chaitanya; Kamthan, Aayushi; Shaw, Tushar; Ke, Vandana; Kumar, Subodh; Bhat, Vinod; Mukhopadhyay, Chiranjay

    2016-01-01

    There has been a slow but steady rise in the case detection rates of melioidosis from various parts of the Indian sub-continent in the past two decades. However, the epidemiology of the disease in India and the surrounding South Asian countries remains far from well elucidated. Multi-locus sequence typing (MLST) is a useful epidemiological tool to study the genetic relatedness of bacterial isolates both within and across countries. With this background, we studied the molecular epidemiology of 32 Burkholderia pseudomallei isolates (31 clinical and 1 soil isolate) obtained during 2006-2015 from various parts of south India using multi-locus sequence typing and analysis. Of the 32 isolates included in the analysis, 30 (93.7%) had novel allelic profiles that were not reported previously. Sequence type (ST) 1368 (n = 15, 46.8%) with allelic profile (1, 4, 6, 4, 1, 1, 3) was the most common genotype observed. We did not observe a genotypic association of STs with geographical location, type of infection and year of isolation in the present study. The measure of genetic differentiation (FST) between Indian and the rest of world isolates was 0.14413. Occurrence of the same ST across three adjacent states of south India suggests the dispersion of B. pseudomallei across the south western coastal part of India with limited geographical clustering. However, the majority of the STs reported from the present study remained as "outliers" on the eBURST "Population snapshot", suggesting the genetic diversity of Indian isolates from the Australasian and Southeast Asian isolates.

  1. Unravelling the Molecular Epidemiology and Genetic Diversity among Burkholderia pseudomallei Isolates from South India Using Multi-Locus Sequence Typing.

    Directory of Open Access Journals (Sweden)

    Chaitanya Tellapragada

    There has been a slow but steady rise in the case detection rates of melioidosis from various parts of the Indian sub-continent in the past two decades. However, the epidemiology of the disease in India and the surrounding South Asian countries remains far from well elucidated. Multi-locus sequence typing (MLST) is a useful epidemiological tool to study the genetic relatedness of bacterial isolates both within and across countries. With this background, we studied the molecular epidemiology of 32 Burkholderia pseudomallei isolates (31 clinical and 1 soil isolate) obtained during 2006-2015 from various parts of south India using multi-locus sequence typing and analysis. Of the 32 isolates included in the analysis, 30 (93.7%) had novel allelic profiles that were not reported previously. Sequence type (ST) 1368 (n = 15, 46.8%) with allelic profile (1, 4, 6, 4, 1, 1, 3) was the most common genotype observed. We did not observe a genotypic association of STs with geographical location, type of infection and year of isolation in the present study. The measure of genetic differentiation (FST) between Indian and the rest of world isolates was 0.14413. Occurrence of the same ST across three adjacent states of south India suggests the dispersion of B. pseudomallei across the south western coastal part of India with limited geographical clustering. However, the majority of the STs reported from the present study remained as "outliers" on the eBURST "Population snapshot", suggesting the genetic diversity of Indian isolates from the Australasian and Southeast Asian isolates.

  2. The evolution and population structure of Lactobacillus fermentum from different naturally fermented products as determined by multilocus sequence typing (MLST).

    Science.gov (United States)

    Dan, Tong; Liu, Wenjun; Song, Yuqin; Xu, Haiyan; Menghe, Bilige; Zhang, Heping; Sun, Zhihong

    2015-05-20

    Lactobacillus fermentum is economically important in the production and preservation of fermented foods. A repeatable and discriminative typing method was devised to characterize L. fermentum at the molecular level. The multilocus sequence typing (MLST) scheme developed was based on analysis of the internal sequence of 11 housekeeping gene fragments (clpX, dnaA, dnaK, groEL, murC, murE, pepX, pyrG, recA, rpoB, and uvrC). MLST analysis of 203 isolates of L. fermentum from Mongolia and seven provinces/autonomous regions in China identified 57 sequence types (STs), 27 of which were represented by only a single isolate, indicating high genetic diversity. Phylogenetic analyses based on the sequence of the 11 housekeeping gene fragments indicated that the L. fermentum isolates analyzed belonged to two major groups. A standardized index of association (I^S_A) indicated a weak clonal population structure in L. fermentum. Split decomposition analysis indicated that recombination played an important role in generating the genetic diversity observed in L. fermentum. The results from the minimum spanning tree strongly suggested that evolution of L. fermentum STs was not correlated with geography or food-type. The MLST scheme developed will be valuable for further studies on the evolution and population structure of L. fermentum isolates used in food products.

  3. Low Divergence of Clonorchis sinensis in China Based on Multilocus Analysis.

    Directory of Open Access Journals (Sweden)

    Jiufeng Sun

    Clonorchis sinensis, an ancient parasite that infects a number of piscivorous mammals, attracts significant public health interest due to zoonotic exposure risks in Asia. The available studies are insufficient to reflect the prevalence, geographic distribution, and intraspecific genetic diversity of C. sinensis in endemic areas. Here, a multilocus analysis based on eight genes (ITS1, act, tub, ef-1a, cox1, cox3, nad4 and nad5 [4.986 kb]) was employed to explore the intra-species genetic construction of C. sinensis in China. Two hundred and fifty-six C. sinensis isolates were obtained from environmental reservoirs from 17 provinces of China. A total of 254 recognized Multilocus Types (MSTs) showed high diversity among these isolates using multilocus analysis. The comparison analysis of nuclear and mitochondrial phylogeny supports separate clusters in a nuclear dendrogram. Genetic differentiation analysis of three clusters (A, B, and C) showed low divergence within populations. Most isolates from clusters B and C are geographically limited to central China, while cluster A is extraordinarily genetically diverse. Further genetic analyses between different geographic distributions, water bodies and hosts support the low population divergence. The latter haplotype analyses were consistent with the phylogenetic and genetic differentiation results. A recombination network based on concatenated sequences showed a concentrated linkage recombination population in cox1, cox3, nad4 and nad5, with spatial structuring in ITS1. Coupled with the history record and archaeological evidence of C. sinensis infection in mummified desiccated feces, these data point to an ancient origin of C. sinensis in China. In conclusion, we present a likely phylogenetic structure of the C. sinensis population in mainland China, highlighting its possible tendency for biogeographic expansion. Meanwhile, ITS1 was found to be an effective marker for tracking C. sinensis infection

  4. Low Divergence of Clonorchis sinensis in China Based on Multilocus Analysis

    Science.gov (United States)

    Sun, Jiufeng; Huang, Yan; Huang, Huaiqiu; Liang, Pei; Wang, Xiaoyun; Mao, Qiang; Men, Jingtao; Chen, Wenjun; Deng, Chuanhuan; Zhou, Chenhui; Lv, Xiaoli; Zhou, Juanjuan; Zhang, Fan; Li, Ran; Tian, Yanli; Lei, Huali; Liang, Chi; Hu, Xuchu; Xu, Jin; Li, Xuerong; Yu, Xinbing

    2013-01-01

    Clonorchis sinensis, an ancient parasite that infects a number of piscivorous mammals, attracts significant public health interest due to zoonotic exposure risks in Asia. The available studies are insufficient to reflect the prevalence, geographic distribution, and intraspecific genetic diversity of C. sinensis in endemic areas. Here, a multilocus analysis based on eight genes (ITS1, act, tub, ef-1a, cox1, cox3, nad4 and nad5 [4.986 kb]) was employed to explore the intra-species genetic construction of C. sinensis in China. Two hundred and fifty-six C. sinensis isolates were obtained from environmental reservoirs from 17 provinces of China. A total of 254 recognized Multilocus Types (MSTs) showed high diversity among these isolates using multilocus analysis. The comparison analysis of nuclear and mitochondrial phylogeny supports separate clusters in a nuclear dendrogram. Genetic differentiation analysis of three clusters (A, B, and C) showed low divergence within populations. Most isolates from clusters B and C are geographically limited to central China, while cluster A is extraordinarily genetically diverse. Further genetic analyses between different geographic distributions, water bodies and hosts support the low population divergence. The latter haplotype analyses were consistent with the phylogenetic and genetic differentiation results. A recombination network based on concatenated sequences showed a concentrated linkage recombination population in cox1, cox3, nad4 and nad5, with spatial structuring in ITS1. Coupled with the history record and archaeological evidence of C. sinensis infection in mummified desiccated feces, these data point to an ancient origin of C. sinensis in China. In conclusion, we present a likely phylogenetic structure of the C. sinensis population in mainland China, highlighting its possible tendency for biogeographic expansion. Meanwhile, ITS1 was found to be an effective marker for tracking C. sinensis infection worldwide. Thus, the

  5. The Applied Development of a Tiered Multilocus Sequence Typing (MLST) Scheme for Dichelobacter nodosus.

    Science.gov (United States)

    Blanchard, Adam M; Jolley, Keith A; Maiden, Martin C J; Coffey, Tracey J; Maboni, Grazieli; Staley, Ceri E; Bollard, Nicola J; Warry, Andrew; Emes, Richard D; Davies, Peers L; Tötemeyer, Sabine

    2018-01-01

    Dichelobacter nodosus (D. nodosus) is the causative pathogen of ovine footrot, a disease that has a significant welfare and financial impact on the global sheep industry. Previous studies into the phylogenetics of D. nodosus have focused on Australia and Scandinavia, meaning the current diversity in the United Kingdom (U.K.) population, and its relationship to the global population, is poorly understood. Numerous epidemiological methods are available for bacterial typing; however, few account for whole genome diversity or provide the opportunity for future application of new computational techniques. Multilocus sequence typing (MLST) measures nucleotide variations within several loci with slow accumulation of variation to enable the designation of allele numbers to determine a sequence type. The usage of whole genome sequence data enables the application of MLST, but also core and whole genome MLST for higher levels of strain discrimination with a negligible increase in experimental cost. An MLST database was developed alongside a seven-locus scheme using publicly available whole genome data from the sequence read archive. Sequence type designation and strain discrimination were compared to previously published data to ensure reproducibility. Multiple D. nodosus isolates from U.K. farms were directly compared to populations from other countries. The U.K. isolates define new clades within the global population of D. nodosus and predominantly consist of serogroups A, B and H; however, serogroups C, D, E, and I were also found. The scheme is publicly available at https://pubmlst.org/dnodosus/.
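
    The allele-numbering logic summarised above (each distinct sequence at a locus receives an arbitrary number, and each distinct combination of allele numbers defines a sequence type) can be sketched as follows. This is a minimal illustration under stated assumptions: numbers are assigned in order of first appearance, whereas a curated database such as PubMLST assigns them centrally, and the locus names, isolate names, and sequences are hypothetical.

```python
from typing import Dict, List, Tuple


def assign_sequence_types(
    isolates: Dict[str, Dict[str, str]], loci: List[str]
) -> Dict[str, Tuple[Tuple[int, ...], int]]:
    """Return {isolate: (allelic profile, sequence type)} for the given loci."""
    allele_numbers: Dict[str, Dict[str, int]] = {locus: {} for locus in loci}
    st_numbers: Dict[Tuple[int, ...], int] = {}
    result = {}
    for name, seqs in isolates.items():
        profile = []
        for locus in loci:
            table = allele_numbers[locus]
            seq = seqs[locus].upper()
            if seq not in table:
                # New allele at this locus: give it the next free number.
                table[seq] = len(table) + 1
            profile.append(table[seq])
        key = tuple(profile)
        if key not in st_numbers:
            # New combination of alleles: give it the next free ST number.
            st_numbers[key] = len(st_numbers) + 1
        result[name] = (key, st_numbers[key])
    return result


# Toy data: locus names and sequences are placeholders, not real alleles.
TOY_LOCI = ["locusA", "locusB"]
TOY_ISOLATES = {
    "iso1": {"locusA": "ACGT", "locusB": "TTGA"},
    "iso2": {"locusA": "acgt", "locusB": "TTGA"},
    "iso3": {"locusA": "ACGA", "locusB": "TTGA"},
}
TOY_TYPES = assign_sequence_types(TOY_ISOLATES, TOY_LOCI)
```

    In this toy run iso1 and iso2 share a profile and therefore a sequence type, while a single nucleotide difference at one locus gives iso3 a new allele number and a new sequence type.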

  6. Multilocus Sequence Typing of the Clinical Isolates of Salmonella Enterica Serovar Typhimurium in Tehran Hospitals

    Directory of Open Access Journals (Sweden)

    Reza Ranjbar

    2017-09-01

    Background: Salmonella enterica serovar Typhimurium is one of the most important serovars of Salmonella enterica and is associated with human salmonellosis worldwide. Many epidemiological studies have focused on the characteristics of Salmonella Typhimurium in many countries as well as in Asia. This study was conducted to investigate the genetic characteristics of Salmonella Typhimurium using multilocus sequence typing (MLST). Methods: Clinical samples (urine, blood, and stool) were collected from patients who were admitted to 2 hospitals in Tehran between April and September 2015. Salmonella Typhimurium strains were identified by conventional standard biochemical and serological testing. The antibiotic susceptibility patterns of the Salmonella Typhimurium isolates against 16 antibiotics were determined using the disk diffusion assay. The clonal relationship between the strains of Salmonella Typhimurium was analyzed using MLST. Results: Among the 68 Salmonella isolates, 31% (n=21) were Salmonella Typhimurium. Of the total 21 Salmonella Typhimurium isolates, 76% (n=16) were multidrug-resistant and showed resistance to 3 or more antibiotic families. The Salmonella Typhimurium isolates were assigned to 2 sequence types: ST19 and ST328. ST19 was more common (86%). Both sequence types were further assigned to 1 eBURST group. Conclusion: This is the first study of its kind in Iran to determine the sequence types of the clinical isolates of Salmonella Typhimurium in Tehran hospitals using MLST. ST19 was detected as the major sequence type of Salmonella Typhimurium.

  7. Genotypic characterization of Salmonella by multilocus sequence typing, pulsed-field gel electrophoresis and amplified fragment length polymorphism

    DEFF Research Database (Denmark)

    Torpdahl, Mia; Skov, Marianne N.; Sandvang, Dorthe

    2005-01-01

    subspecies enterica isolates. A total of 25 serotypes were investigated that had been isolated from humans or veterinary sources in Denmark between 1995 and 2001. All isolates were genotyped by multilocus sequence typing (MLST), pulsed-field gel electrophoresis (PFGE) and amplified fragment length...

  8. [Multilocus sequence-typing for characterization of Moscow strains of Haemophilus influenzae type b].

    Science.gov (United States)

    Platonov, A E; Mironov, K O; Iatsyshina, S B; Koroleva, I S; Platonova, O V; Gushchin, A E; Shipulin, G A

    2003-01-01

    Haemophilus influenzae type b (Hib) bacteria were genotyped by multilocus sequence typing (MLST) using 5 loci (adk, fucK, mdh, pgi, recA). 42 Moscow Hib strains (including 38 isolates from cerebrospinal fluid of children who had purulent meningitis in 1999-2001, and 4 strains isolated from healthy carriers of Hib), as well as 2 strains from Yekaterinburg, were studied. In MLST, a strain is characterized by its alleles and their combination (an allelic profile), also referred to as the sequence type (ST). 9 STs were identified within the Russian Hib bacteria: ST-1 was found in 25 strains (57%), ST-12 was found in 8 strains (18%), ST-11 was found in 4 strains (9%) and ST-15 was found in 2 strains (4.5%); all other STs (13, 14, 16, 17, 51) were each found in a single strain (2.3%). A comparison of allelic profiles and of nucleotide sequences showed that 93% of Russian isolates, i.e. strains with ST-1, 11, 12, 13, 15 and 17, belong to one and the same clonal complex. Of the 7 foreign Hib strains studied so far, 2 isolates (from Norway and Sweden) can be described as belonging to the same clonal complex; the other 5 Hib strains were different from the Russian ones.

  9. Multi-locus sequence typing of Bartonella henselae isolates from three continents reveals hypervirulent and feline-associated clones.

    Directory of Open Access Journals (Sweden)

    Mardjan Arvand

    Bartonella henselae is a zoonotic pathogen and the causative agent of cat scratch disease and a variety of other disease manifestations in humans. Previous investigations have suggested that a limited subset of B. henselae isolates may be associated with human disease. In the present study, 182 human and feline B. henselae isolates from Europe, North America and Australia were analysed by multi-locus sequence typing (MLST) to detect any associations between sequence type (ST), host species and geographical distribution of the isolates. A total of 14 sequence types were detected, but over 66% (16/24) of the isolates recovered from human disease corresponded to a single genotype, ST1, and this type was detected in all three continents. In contrast, 27.2% (43/158) of the feline isolates corresponded to ST7, but this ST was not recovered from humans and was restricted to Europe. The difference in host association of STs 1 (human) and 7 (feline) was statistically significant (P ≤ 0.001). eBURST analysis assigned the 14 STs to three clonal lineages, which contained two or more STs, and a singleton comprising ST7. These groups were broadly consistent with a neighbour-joining tree, although splits decomposition analysis was indicative of a history of recombination. These data indicate that B. henselae lineages differ in their virulence properties for humans and contribute to a better understanding of the population structure of B. henselae.

  10. Genotyping of Indian antigenic, vaccine, and field Brucella spp. using multilocus sequence typing.

    Science.gov (United States)

    Shome, Rajeswari; Krithiga, Natesan; Shankaranarayana, Padmashree B; Jegadesan, Sankarasubramanian; Udayakumar S, Vishnu; Shome, Bibek Ranjan; Saikia, Girin Kumar; Sharma, Narendra Kumar; Chauhan, Harshad; Chandel, Bharat Singh; Jeyaprakash, Rajendhran; Rahman, Habibur

    2016-03-31

    Brucellosis is one of the most important zoonotic diseases that affects multiple livestock species and causes great economic losses. The highly conserved genomes of Brucella, with > 90% homology among species, make it important to study the genetic diversity circulating in the country. A total of 26 Brucella spp. (4 reference strains and 22 field isolates) and 1 B. melitensis draft genome sequence from India (B. melitensis Bm IND1) were included for sequence typing. The field isolates were identified by biochemical tests and confirmed by both conventional and quantitative polymerase chain reaction (qPCR) targeting the bcsp31 Brucella genus-specific marker. Brucella speciation and biotyping were done by Bruce-ladder, probe qPCR, and AMOS PCRs, and genotyping was done by multilocus sequence typing (MLST). The MLST typing of 27 Brucella spp. revealed five distinct sequence types (STs); the B. abortus S99 reference strain and 21 B. abortus field isolates belonged to ST1. On the other hand, the vaccine strain B. abortus S19 was genotyped as ST5. Similarly, B. melitensis 16M reference strain and one B. melitensis field isolate were grouped into ST7. Another B. melitensis field isolate belonged to ST8 (draft genome sequence from India), and only B. suis 1330 reference strain was found to be ST14. The sequences revealed genetic similarity of the Indian strains to the global reference and field strains. The study highlights the usefulness of MLST for typing of field isolates and validation of reference strains used for diagnosis and vaccination against brucellosis.

  11. Core Genome Multilocus Sequence Typing Scheme for Stable, Comparative Analyses of Campylobacter jejuni and C. coli Human Disease Isolates.

    Science.gov (United States)

    Cody, Alison J; Bray, James E; Jolley, Keith A; McCarthy, Noel D; Maiden, Martin C J

    2017-07-01

    Human campylobacteriosis, caused by Campylobacter jejuni and C. coli, remains a leading cause of bacterial gastroenteritis in many countries, but the epidemiology of campylobacteriosis outbreaks remains poorly defined, largely due to limitations in the resolution and comparability of isolate characterization methods. Whole-genome sequencing (WGS) data enable the improvement of sequence-based typing approaches, such as multilocus sequence typing (MLST), by substantially increasing the number of loci examined. A core genome MLST (cgMLST) scheme defines a comprehensive set of those loci present in most members of a bacterial group, balancing very high resolution with comparability across the diversity of the group. Here we propose a set of 1,343 loci as a human campylobacteriosis cgMLST scheme (v1.0), the allelic profiles of which can be assigned to core genome sequence types. The 1,343 loci chosen were a subset of the 1,643 loci identified in the reannotation of the genome sequence of C. jejuni isolate NCTC 11168, chosen as being present in >95% of draft genomes of 2,472 representative United Kingdom campylobacteriosis isolates, comprising 2,207 (89.3%) C. jejuni isolates and 265 (10.7%) C. coli isolates. Validation of the cgMLST scheme was undertaken with 1,478 further high-quality draft genomes, containing 150 or fewer contiguous sequences, from disease isolate collections: 99.5% of these isolates contained ≥95% of the 1,343 cgMLST loci. In addition to the rapid and effective high-resolution analysis of large numbers of diverse isolates, the cgMLST scheme enabled the efficient identification of very closely related isolates from a well-defined single-source campylobacteriosis outbreak. Copyright © 2017 Cody et al.
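
    The locus-selection rule in this abstract (keep loci present in more than 95% of draft genomes, then check what fraction of the scheme each validation genome contains) can be sketched as below. This is only an illustration under stated assumptions: the input is taken to be a precomputed presence table mapping each genome to the set of loci detected in it, and the function names and toy data are hypothetical rather than part of the published scheme.

```python
from typing import Dict, Iterable, List, Set


def core_loci(presence: Dict[str, Set[str]], threshold: float = 0.95) -> List[str]:
    """Return loci found in more than `threshold` of the genomes, sorted by name."""
    n_genomes = len(presence)
    counts: Dict[str, int] = {}
    for loci in presence.values():
        for locus in loci:
            counts[locus] = counts.get(locus, 0) + 1
    return sorted(l for l, c in counts.items() if c / n_genomes > threshold)


def fraction_of_scheme_found(genome_loci: Iterable[str], scheme: Iterable[str]) -> float:
    """Fraction of the scheme's loci that were detected in one genome."""
    scheme_set = set(scheme)
    return len(set(genome_loci) & scheme_set) / len(scheme_set)


# Toy presence table: 20 hypothetical genomes and 4 hypothetical loci.
presence = {}
for i in range(20):
    loci = {"locus_a", "locus_d"}      # present in all 20 genomes
    if i < 18:
        loci.add("locus_b")            # present in 18/20 = 90%
    if i < 10:
        loci.add("locus_c")            # present in 10/20 = 50%
    presence[f"genome_{i}"] = loci

scheme = core_loci(presence)
print(scheme)
print(fraction_of_scheme_found({"locus_a"}, scheme))
```

    A validation genome would then pass the abstract's criterion when `fraction_of_scheme_found` is at least 0.95.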

  12. Development and evaluation of a multi-locus sequence typing scheme for Mycoplasma synoviae.

    Science.gov (United States)

    Dijkman, R; Feberwee, A; Landman, W J M

    2016-08-01

    Reproducible molecular Mycoplasma synoviae typing techniques with sufficient discriminatory power may help to expand knowledge on its epidemiology and contribute to the improvement of control and eradication programmes of this mycoplasma species. The present study describes the development and validation of a novel multi-locus sequence typing (MLST) scheme for M. synoviae. Thirteen M. synoviae isolates originating from different poultry categories, farms and lesions, were subjected to whole genome sequencing. Their sequences were compared to that of M. synoviae reference strain MS53. A high number of single nucleotide polymorphisms (SNPs) indicating considerable genetic diversity were identified. SNPs were present in over 40 putative target genes for MLST of which five target genes were selected (nanA, uvrA, lepA, ruvB and ugpA) for the MLST scheme. This scheme was evaluated analysing 209 M. synoviae samples from different countries, categories of poultry, farms and lesions. Eleven clonal clusters and 76 different sequence types (STs) were obtained. Clustering occurred following geographical origin, supporting the hypothesis of regional population evolution. M. synoviae samples obtained from epidemiologically linked outbreaks often harboured the same ST. In contrast, multiple M. synoviae lineages were found in samples originating from swollen joints or oviducts from hens that produce eggs with eggshell apex abnormalities indicating that further research is needed to identify the genetic factors of M. synoviae that may explain its variations in tissue tropism and disease inducing potential. Furthermore, MLST proved to have a higher discriminatory power compared to variable lipoprotein and haemagglutinin A typing, which generated 50 different genotypes on the same database.

  13. Multilocus Sequence Typing of Pathogenic Candida albicans Isolates Collected from a Teaching Hospital in Shanghai, China: A Molecular Epidemiology Study

    Science.gov (United States)

    Li, Li; Zhang, Qiangqiang; Zhu, Junhao; Gao, Qian; Chen, Min; Zhu, Min

    2015-01-01

    Molecular typing of Candida albicans is important for studying the population structure and epidemiology of this opportunistic yeast, such as population dynamics, nosocomial infections, multiple infections and microevolution. The genetic diversity of C. albicans has been rarely studied in China. In the present study, multilocus sequence typing (MLST) was used to characterize the genetic diversity and population structure of 62 C. albicans isolates collected from 40 patients from Huashan Hospital in Shanghai, China. A total of 50 diploid sequence types (DSTs) were identified in the 62 C. albicans isolates, with 41 newly identified DSTs. Based on cluster analysis, the 62 isolates were classified into nine existing clades and two new clades (namely clades New 1 and New 2). The majority of the isolates were clustered into three clades, clade 6 (37.5%), clade 1 (15.0%) and clade 17 (15.0%). Isolates of clade New 2 were specifically identified in East Asia. We identified three cases of potential nosocomial transmission based on association analysis between patients’ clinical data and the genotypes of corresponding isolates. Finally, by analyzing the genotypes of serial isolates we further demonstrated that the microevolution of C. albicans was due to loss of heterozygosity. Our study represents the first molecular typing of C. albicans in eastern China, and we confirmed that MLST is a useful tool for studying the epidemiology and evolution of C. albicans. PMID:25919124

  14. Multilocus Sequence Typing Scheme versus Pulsed-Field Gel Electrophoresis for Typing Mycobacterium abscessus Isolates

    Science.gov (United States)

    Machado, Gabriel Esquitini; Matsumoto, Cristianne Kayoko; Chimara, Erica; Duarte, Rafael da Silva; de Freitas, Denise; Palaci, Moises; Hadad, David Jamil; Lima, Karla Valéria Batista; Lopes, Maria Luiza; Ramos, Jesus Pais; Campos, Carlos Eduardo; Caldas, Paulo César; Heym, Beate

    2014-01-01

    Outbreaks of infections by rapidly growing mycobacteria following invasive procedures, such as ophthalmological, laparoscopic, arthroscopic, plastic, and cardiac surgeries, mesotherapy, and vaccination, have been detected in Brazil since 1998. Members of the Mycobacterium chelonae-Mycobacterium abscessus group have caused most of these outbreaks. As part of an epidemiological investigation, the isolates were typed by pulsed-field gel electrophoresis (PFGE). In this project, we performed a large-scale comparison of PFGE profiles with the results of a recently developed multilocus sequence typing (MLST) scheme for M. abscessus. Ninety-three isolates were analyzed, with 40 M. abscessus subsp. abscessus isolates, 47 M. abscessus subsp. bolletii isolates, and six isolates with no assigned subspecies. Forty-five isolates were obtained during five outbreaks, and 48 were sporadic isolates that were not associated with outbreaks. For MLST, seven housekeeping genes (argH, cya, glpK, gnd, murC, pta, and purH) were sequenced, and each isolate was assigned a sequence type (ST) from the combination of obtained alleles. The PFGE patterns of DraI-digested DNA were compared with the MLST results. All isolates were analyzable by both methods. Isolates from monoclonal outbreaks showed unique STs and indistinguishable or very similar PFGE patterns. Thirty-three STs and 49 unique PFGE patterns were identified among the 93 isolates. The Simpson's index of diversity values for MLST and PFGE were 0.69 and 0.93, respectively, for M. abscessus subsp. abscessus and 0.96 and 0.97, respectively, for M. abscessus subsp. bolletii. In conclusion, the MLST scheme showed 100% typeability and grouped monoclonal outbreak isolates in agreement with PFGE, but it was less discriminative than PFGE for M. abscessus. PMID:24899019
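
    The discriminatory comparison above rests on Simpson's index of diversity. A minimal sketch of how such a value is computed from a list of type assignments follows; it assumes the unbiased form commonly used for typing methods, D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)), which the abstract does not state explicitly, and the example isolates are hypothetical rather than the study's data.

```python
from collections import Counter
from typing import Hashable, Iterable


def simpsons_diversity(types: Iterable[Hashable]) -> float:
    """Simpson's index of diversity for a list of type labels (one per isolate).

    Assumed form: D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)), where n_j is
    the number of isolates of type j and N is the total number of isolates.
    """
    counts = Counter(types)
    n_total = sum(counts.values())
    if n_total < 2:
        raise ValueError("at least two isolates are needed")
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1 - same_type_pairs / (n_total * (n_total - 1))


# Hypothetical ST assignments for four isolates.
print(round(simpsons_diversity(["ST1", "ST1", "ST2", "ST3"]), 3))
```

    The value is the probability that two isolates drawn at random without replacement receive different types, so a method that splits the same isolates into more, smaller groups scores closer to 1.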

  15. Multilocus sequence typing scheme versus pulsed-field gel electrophoresis for typing Mycobacterium abscessus isolates.

    Science.gov (United States)

    Machado, Gabriel Esquitini; Matsumoto, Cristianne Kayoko; Chimara, Erica; Duarte, Rafael da Silva; de Freitas, Denise; Palaci, Moises; Hadad, David Jamil; Lima, Karla Valéria Batista; Lopes, Maria Luiza; Ramos, Jesus Pais; Campos, Carlos Eduardo; Caldas, Paulo César; Heym, Beate; Leão, Sylvia Cardoso

    2014-08-01

    Outbreaks of infections by rapidly growing mycobacteria following invasive procedures, such as ophthalmological, laparoscopic, arthroscopic, plastic, and cardiac surgeries, mesotherapy, and vaccination, have been detected in Brazil since 1998. Members of the Mycobacterium chelonae-Mycobacterium abscessus group have caused most of these outbreaks. As part of an epidemiological investigation, the isolates were typed by pulsed-field gel electrophoresis (PFGE). In this project, we performed a large-scale comparison of PFGE profiles with the results of a recently developed multilocus sequence typing (MLST) scheme for M. abscessus. Ninety-three isolates were analyzed, with 40 M. abscessus subsp. abscessus isolates, 47 M. abscessus subsp. bolletii isolates, and six isolates with no assigned subspecies. Forty-five isolates were obtained during five outbreaks, and 48 were sporadic isolates that were not associated with outbreaks. For MLST, seven housekeeping genes (argH, cya, glpK, gnd, murC, pta, and purH) were sequenced, and each isolate was assigned a sequence type (ST) from the combination of obtained alleles. The PFGE patterns of DraI-digested DNA were compared with the MLST results. All isolates were analyzable by both methods. Isolates from monoclonal outbreaks showed unique STs and indistinguishable or very similar PFGE patterns. Thirty-three STs and 49 unique PFGE patterns were identified among the 93 isolates. The Simpson's index of diversity values for MLST and PFGE were 0.69 and 0.93, respectively, for M. abscessus subsp. abscessus and 0.96 and 0.97, respectively, for M. abscessus subsp. bolletii. In conclusion, the MLST scheme showed 100% typeability and grouped monoclonal outbreak isolates in agreement with PFGE, but it was less discriminative than PFGE for M. abscessus. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  16. Multilocus sequence typing reveals two evolutionary lineages of Acidovorax avenae subsp. citrulli.

    Science.gov (United States)

    Feng, Jianjun; Schuenzel, Erin L; Li, Jianqiang; Schaad, Norman W

    2009-08-01

    Acidovorax avenae subsp. citrulli, causal agent of bacterial fruit blotch, has caused considerable damage to the watermelon and melon industry in China and the United States. Understanding the emergence and spread of this pathogen is important for controlling the disease. To build a fingerprinting database for reliable identification and tracking of strains of A. avenae subsp. citrulli, a multilocus sequence typing (MLST) scheme was developed using seven conserved loci. The study included 8 original strains from the 1978 description of A. avenae subsp. citrulli, 51 from China, and 34 from worldwide collections. Two major clonal complexes (CCs), CC1 and CC2, were identified within A. avenae subsp. citrulli; 48 strains typed as CC1 and 45 as CC2. All eight original 1978 strains isolated from watermelon and melon grouped in CC1. CC2 strains were predominant in the worldwide collection and all but five were isolated from watermelon. In China, a major seed producer for melon and watermelon, the predominant strains were CC1 and were found nearly equally on melon and watermelon.

  17. A critical re-evaluation of multilocus sequence typing (MLST) efforts in Wolbachia.

    Science.gov (United States)

    Bleidorn, Christoph; Gerth, Michael

    2018-01-01

    Wolbachia (Alphaproteobacteria, Rickettsiales) is the most common, and arguably one of the most important inherited symbionts. Molecular differentiation of Wolbachia strains is routinely performed with a set of five multilocus sequence typing (MLST) markers. However, since its inception in 2006, the performance of MLST in Wolbachia strain typing has not been assessed objectively. Here, we evaluate the properties of Wolbachia MLST markers and compare them to 252 other single copy loci present in the genome of most Wolbachia strains. Specifically, we investigated how well MLST performs at strain differentiation, at reflecting genetic diversity of strains, and as a phylogenetic marker. We find that MLST loci are outperformed by other loci at all tasks they are currently employed for, and thus that they do not reflect the properties of a Wolbachia strain very well. We argue that whole genome typing approaches should be used for Wolbachia typing in the future. Alternatively, if few-loci approaches are necessary, we provide a characterisation of 252 single copy loci for a number of criteria, which may assist in designing specific typing systems or phylogenetic studies. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  18. Relationships between emm and multilocus sequence types within a global collection of Streptococcus pyogenes

    Directory of Open Access Journals (Sweden)

    McGregor Karen F

    2008-04-01

    Background: The M type-specific surface protein antigens encoded by the 5' end of emm genes are targets of protective host immunity and attractive vaccine candidates against infection by Streptococcus pyogenes, a global human pathogen. A history of genetic change in emm was evaluated for a worldwide collection of > 500 S. pyogenes isolates that were defined for genetic background by multilocus sequence typing of housekeeping genes. Results: Organisms were categorized by genotypes that roughly correspond to throat specialists, skin specialists, and generalists often recovered from infections at either tissue site. Recovery of distant clones sharing the same emm type was ~4-fold higher for skin specialists and generalists, as compared to throat specialists. Importantly, emm type was often a poor marker for clone. Recovery of clones that underwent recombinational replacement with a new emm type was most evident for the throat and skin specialists. The average ratio of nonsynonymous substitutions per nonsynonymous site (Ka) and synonymous substitutions per synonymous site (Ks) was 4.9, 1.5 and 1.3 for emm types of the throat specialist, skin specialist and generalist groups, respectively. Conclusion: Data indicate that the relationships between emm type and genetic background differ among the three host tissue-related groups, and that the selection pressures acting on emm appear to be strongest for the throat specialists. Since positive selection is likely due in part to a protective host immune response, the findings may have important implications for vaccine design and vaccination strategies.

  19. Core Genome Multilocus Sequence Typing for Identification of Globally Distributed Clonal Groups and Differentiation of Outbreak Strains of Listeria monocytogenes

    OpenAIRE

    Chen, Yi; Gonzalez-Escalona, Narjol; Hammack, Thomas S.; Allard, Marc W.; Strain, Errol A.; Brown, Eric W.

    2016-01-01

    ABSTRACT Many listeriosis outbreaks are caused by a few globally distributed clonal groups, designated clonal complexes or epidemic clones, of Listeria monocytogenes, several of which have been defined by classic multilocus sequence typing (MLST) schemes targeting 6 to 8 housekeeping or virulence genes. We have developed and evaluated core genome MLST (cgMLST) schemes and applied them to isolates from multiple clonal groups, including those associated with 39 listeriosis outbreaks. The cgMLST...

  20. Phylogenetic diversity of insecticolous fusaria inferred from multilocus DNA sequence data and their molecular identification via FUSARIUM-ID and Fusarium MLST

    NARCIS (Netherlands)

    O'Donnell, K.; Humber, R.A.; Geiser, D.M.; Kang, S.; Robert, V.; Park, B.; Crous, P.W.; Johnston, P.; Aoki, T.; Rooney, A.P.; Rehner, S.A.

    2012-01-01

    We constructed several multilocus DNA sequence datasets to assess the phylogenetic diversity of insecticolous fusaria, especially focusing on those housed at the Agricultural Research Service Collection of Entomopathogenic Fungi (ARSEF), and to aid molecular identifications of unknowns via the

  1. Relationships between functional genes in Lactobacillus delbrueckii ssp. bulgaricus isolates and phenotypic characteristics associated with fermentation time and flavor production in yogurt elucidated using multilocus sequence typing.

    Science.gov (United States)

    Liu, Wenjun; Yu, Jie; Sun, Zhihong; Song, Yuqin; Wang, Xueni; Wang, Hongmei; Wuren, Tuoya; Zha, Musu; Menghe, Bilige; Heping, Zhang

    2016-01-01

    Lactobacillus delbrueckii ssp. bulgaricus (L. bulgaricus) is well known for its worldwide application in yogurt production. Flavor production and acid production are considered the most important characteristics for starter culture screening. To our knowledge, this is the first study applying multilocus sequence typing of functional gene sequences to predict the fermentation and flavor-producing characteristics of yogurt-producing bacteria. In the present study, phenotypic characteristics of 35 L. bulgaricus strains were quantified during the fermentation of milk to yogurt and during its subsequent storage; these included fermentation time, acidification rate, pH, titratable acidity, and flavor characteristics (acetaldehyde concentration). Furthermore, multilocus sequence typing analysis of 7 functional genes associated with fermentation time, acid production, and flavor formation was done to elucidate the phylogeny and genetic evolution of the same L. bulgaricus isolates. The results showed that strains significantly differed in fermentation time, acidification rate, and acetaldehyde production. Combining functional gene sequence analysis with phenotypic characteristics demonstrated that groups of strains established using genotype data were consistent with groups identified based on their phenotypic traits. This study has established an efficient and rapid molecular genotyping method to identify strains with good fermentation traits; this has the potential to replace time-consuming conventional methods based on direct measurement of phenotypic traits. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  2. Multilocus sequence typing as a replacement for serotyping in Salmonella enterica.

    Directory of Open Access Journals (Sweden)

    Mark Achtman

    Salmonella enterica subspecies enterica is traditionally subdivided into serovars by serological and nutritional characteristics. We used Multilocus Sequence Typing (MLST) to assign 4,257 isolates from 554 serovars to 1,092 sequence types (STs). The majority of the isolates and many STs were grouped into 138 genetically closely related clusters called eBurstGroups (eBGs). Many eBGs correspond to a serovar, for example most Typhimurium are in eBG1 and most Enteritidis are in eBG4, but many eBGs contained more than one serovar. Furthermore, most serovars were polyphyletic and are distributed across multiple unrelated eBGs. Thus, serovar designations confounded genetically unrelated isolates and failed to recognize natural evolutionary groupings. An inability of serotyping to correctly group isolates was most apparent for Paratyphi B and its variant Java. Most Paratyphi B were included within a sub-cluster of STs belonging to eBG5, which also encompasses a separate sub-cluster of Java STs. However, diphasic Java variants were also found in two other eBGs and monophasic Java variants were in four other eBGs or STs, one of which is in subspecies salamae and a second of which includes isolates assigned to Enteritidis, Dublin and monophasic Paratyphi B. Similarly, Choleraesuis was found in eBG6 and is closely related to Paratyphi C, which is in eBG20. However, Choleraesuis var. Decatur consists of isolates from seven other, unrelated eBGs or STs. The serological assignment of these Decatur isolates to Choleraesuis likely reflects lateral gene transfer of flagellar genes between unrelated bacteria plus purifying selection. By confounding multiple evolutionary groups, serotyping can be misleading about the disease potential of S. enterica. Unlike serotyping, MLST recognizes evolutionary groupings and we recommend that Salmonella classification by serotyping should be replaced by MLST or its equivalents.

  3. Identification of IncA/C Plasmid Replication and Maintenance Genes and Development of a Plasmid Multilocus Sequence Typing Scheme.

    Science.gov (United States)

    Hancock, Steven J; Phan, Minh-Duy; Peters, Kate M; Forde, Brian M; Chong, Teik Min; Yin, Wai-Fong; Chan, Kok-Gan; Paterson, David L; Walsh, Timothy R; Beatson, Scott A; Schembri, Mark A

    2017-02-01

    Plasmids of incompatibility group A/C (IncA/C) are becoming increasingly prevalent within pathogenic Enterobacteriaceae. They are associated with the dissemination of multiple clinically relevant resistance genes, including blaCMY and blaNDM. Current typing methods for IncA/C plasmids offer limited resolution. In this study, we present the complete sequence of a blaNDM-1-positive IncA/C plasmid, pMS6198A, isolated from a multidrug-resistant uropathogenic Escherichia coli strain. Hypersaturated transposon mutagenesis, coupled with transposon-directed insertion site sequencing (TraDIS), was employed to identify conserved genetic elements required for replication and maintenance of pMS6198A. Our analysis of TraDIS data identified roles for the replicon, including repA, a toxin-antitoxin system; two putative partitioning genes, parAB; and a putative gene, 053. Construction of mini-IncA/C plasmids and examination of their stability within E. coli confirmed that the region encompassing 053 contributes to the stable maintenance of IncA/C plasmids. Subsequently, the four major maintenance genes (repA, parAB, and 053) were used to construct a new plasmid multilocus sequence typing (PMLST) scheme for IncA/C plasmids. Application of this scheme to a database of 82 IncA/C plasmids identified 11 unique sequence types (STs), with two dominant STs. The majority of blaNDM-positive plasmids examined (15/17; 88%) fall into ST1, suggesting acquisition and subsequent expansion of this blaNDM-containing plasmid lineage. The IncA/C PMLST scheme represents a standardized tool to identify, track, and analyze the dissemination of important IncA/C plasmid lineages, particularly in the context of epidemiological studies. Copyright © 2017 American Society for Microbiology.

  4. Multilocus sequence typing of Xylella fastidiosa causing Pierce's disease and oleander leaf scorch in the United States.

    Science.gov (United States)

    Yuan, Xiaoli; Morano, Lisa; Bromley, Robin; Spring-Pearson, Senanu; Stouthamer, Richard; Nunney, Leonard

    2010-06-01

    Using a modified multilocus sequence typing (MLST) scheme for the bacterial plant pathogen Xylella fastidiosa based on the same seven housekeeping genes employed in a previously published MLST, we studied the genetic diversity of two subspecies, X. fastidiosa subsp. fastidiosa and X. fastidiosa subsp. sandyi, which cause Pierce's disease and oleander leaf scorch, respectively. Typing of 85 U.S. isolates (plus one from northern Mexico) of X. fastidiosa subsp. fastidiosa from 15 different plant hosts and 21 isolates of X. fastidiosa subsp. sandyi from 4 different hosts in California and Texas supported their subspecific status. Analysis using the MLST genes plus one cell-surface gene showed no significant genetic differentiation based on geography or host plant within either subspecies. Two cases of homologous recombination (with X. fastidiosa subsp. multiplex, the third U.S. subspecies) were detected in X. fastidiosa subsp. fastidiosa. Excluding recombination, MLST site polymorphism in X. fastidiosa subsp. fastidiosa (0.048%) and X. fastidiosa subsp. sandyi (0.000%) was substantially lower than in X. fastidiosa subsp. multiplex (0.240%), consistent with the hypothesis that X. fastidiosa subspp. fastidiosa and sandyi were introduced into the United States (probably just prior to 1880 and 1980, respectively). Using whole-genome analysis, we showed that MLST is more effective at genetic discrimination at the specific and subspecific level than other typing methods applied to X. fastidiosa. Moreover, MLST is the only technique effective in detecting recombination.

  5. Comparison of multilocus sequence typing and pulsed-field gel electrophoresis for Salmonella spp. identification in surface water

    Science.gov (United States)

    Kuo, Chun Wei; Hao Huang, Kuan; Hsu, Bing Mu; Tsai, Hsien Lung; Tseng, Shao Feng; Kao, Po Min; Shen, Shu Min; Chou Chiu, Yi; Chen, Jung Sheng

    2013-04-01

    Salmonella is one of the most important waterborne pathogens, with outbreaks from contaminated water reported worldwide. In addition, Salmonella spp. can survive for long periods in aquatic environments. To determine the genotypes and serovars of Salmonella in aquatic environments, we isolated Salmonella strains on selective culture plates, identified their serovars by serological assay, and identified their genotypes by multilocus sequence typing (MLST) based on the sequence data from University College Cork (UCC). Salmonella was confirmed in 36 stream water samples (30.1%) and 18 drinking water samples (23.3%) using culture combined with PCR amplification of the invA gene. In this study, 24 cultured isolates of Salmonella from water samples were classified into fifteen Salmonella enterica serovars. In addition, we performed phylogenetic analysis using a phylogenetic tree and the minimum spanning tree (MST) method to analyze the relationship among clinical, environmental, and geographical data. The phylogenetic tree showed four main clusters, and our strains were distributed across all of them. The genotypes of isolates from stream water were more diverse than those of Salmonella strains from drinking water sources. According to the MST data, there was a positive correlation between serovars and genotypes of Salmonella. Previous studies revealed that the results of pulsed-field gel electrophoresis (PFGE) can predict the serovars of Salmonella strains. Hence, we used the MLST data combined with phylogenetic analysis to identify the serovars of Salmonella strains, and this approach proved effective. When the geographical data were combined with phylogenetic analysis, the results showed that the dominant strains were present throughout the whole stream area in the rainy season. Keywords: Salmonella spp., MLST, phylogenetic analysis, PFGE

  6. Multilocus sequence typing of Lactococcus lactis from naturally fermented milk foods in ethnic minority areas of China.

    Science.gov (United States)

    Xu, Haiyan; Sun, Zhihong; Liu, Wenjun; Yu, Jie; Song, Yuqin; Lv, Qiang; Zhang, Jiachao; Shao, Yuyu; Menghe, Bilige; Zhang, Heping

    2014-05-01

    To determine the genetic diversity and phylogenetic relationships among Lactococcus lactis isolates, 197 strains isolated from naturally homemade yogurt in 9 ethnic minority areas of 6 provinces of China were subjected to multilocus sequence typing (MLST). The MLST analysis was performed using internal fragment sequences of 12 housekeeping genes (carB, clpX, dnaA, groEL, murC, murE, pepN, pepX, pyrG, recA, rpoB, and pheS). Six (dnaA) to 8 (murC) different alleles were detected for these genes, which ranged from 33.62 (clpX) to 41.95% (recA) GC (guanine-cytosine) content. The nucleotide diversity (π) ranged from 0.00362 (murE) to 0.08439 (carB). Despite this limited allelic diversity, the allele combinations of each strain revealed 72 different sequence types, which denoted significant genotypic diversity. The dN/dS ratios (where dS is the number of synonymous substitutions per synonymous site, and dN is the number of nonsynonymous substitutions per nonsynonymous site) were lower than 1, suggesting potential negative selection for these genes. The standardized index of association of the alleles IA(S)=0.3038 supported the clonality of Lc. lactis, but the presence of network structure revealed by the split decomposition analysis of the concatenated sequence was strong evidence for intraspecies recombination. Therefore, this suggests that recombination contributed to the evolution of Lc. lactis. A minimum spanning tree analysis of the 197 isolates identified 14 clonal complexes and 23 singletons. Phylogenetic trees were constructed based on the sequence types, using the minimum evolution algorithm, and on the concatenated sequence (6,192 bp), using the unweighted pair-group method with arithmetic mean, and these trees indicated that the evolution of our Lc. lactis population was correlated with geographic origin. Taken together, our results demonstrated that MLST could provide a better understanding of Lc. lactis genome evolution, as well as useful information for

  7. Particular Candida albicans strains in the digestive tract of dyspeptic patients, identified by multilocus sequence typing.

    Directory of Open Access Journals (Sweden)

    Yan-Bing Gong

    BACKGROUND: Candida albicans is a human commensal that is also responsible for chronic gastritis and peptic ulcerous disease. Little is known about the genetic profiles of the C. albicans strains in the digestive tract of dyspeptic patients. The aim of this study was to evaluate the prevalence, diversity, and genetic profiles among C. albicans isolates recovered from natural colonization of the digestive tract in the dyspeptic patients. METHODS AND FINDINGS: Oral swab samples (n = 111) and gastric mucosa samples (n = 102) were obtained from a group of patients who presented dyspeptic symptoms or ulcer complaints. Oral swab samples (n = 162) were also obtained from healthy volunteers. C. albicans isolates were characterized and analyzed by multilocus sequence typing. The prevalence of Candida spp. in the oral samples was not significantly different between the dyspeptic group and the healthy group (36.0%, 40/111 vs. 29.6%, 48/162; P > 0.05). However, there were significant differences between the groups in the distribution of species isolated and the genotypes of the C. albicans isolates. C. albicans was isolated from 97.8% of the Candida-positive subjects in the dyspeptic group, but from only 56.3% in the healthy group (P < 0.001). DST1593 was the dominant C. albicans genotype from the digestive tract of the dyspeptic group (60%, 27/45), but not the healthy group (14.8%, 4/27) (P < 0.001). CONCLUSIONS: Our data suggest a possible link between particular C. albicans strain genotypes and the host microenvironment. Positivity for particular C. albicans genotypes could signify susceptibility to dyspepsia.

  8. Multilocus sequence typing of Trichomonas vaginalis clinical samples from Amsterdam, the Netherlands.

    Science.gov (United States)

    van der Veer, C; Himschoot, M; Bruisten, S M

    2016-10-13

    In this cross-sectional epidemiological study we aimed to identify molecular profiles for Trichomonas vaginalis and to determine how these molecular profiles were related to patient demographic and clinical characteristics. Molecular typing methods previously identified two genetically distinct subpopulations for T. vaginalis; however, few molecular epidemiological studies have been performed. We now increased the sensitivity of a previously described multilocus sequence typing (MLST) tool for T. vaginalis by using nested PCR. This enabled the typing of direct patient samples. From January to December 2014, we collected all T. vaginalis positive samples as detected by routine laboratory testing. Samples from patients came either from general practitioners' offices or from the sexually transmitted infections (STI) clinic in Amsterdam. Epidemiological data for the STI clinic patients were retrieved from electronic patient files. The primary outcome was the success rate of genotyping direct T. vaginalis positive samples. The secondary outcome was the relation between T. vaginalis genotypes and risk factors for STI. All 7 MLST loci were successfully typed for 71/87 clinical samples. The 71 typed samples came from 69 patients, the majority of whom were women (n=62; 90%) and half (n=34; 49%) were STI clinic patients. Samples segregated into a two-population structure for T. vaginalis representing genotypes I and II. Genotype I was most common (n=40; 59.7%). STI clinic patients infected with genotype II reported more sexual partners in the preceding 6 months than patients infected with genotype I (p=0.028). No other associations for gender, age, ethnicity, urogenital discharge or co-occurring STIs with T. vaginalis genotype were found. MLST with nested PCR is a sensitive typing method that allows typing of direct (uncultured) patient material. Genotype II is possibly more prevalent in high-risk sexual networks. Published by the BMJ Publishing Group Limited. For

  9. Multilocus sequence data reveal dozens of putative cryptic species in a radiation of endemic Californian mygalomorph spiders (Araneae, Mygalomorphae, Nemesiidae).

    Science.gov (United States)

    Leavitt, Dean H; Starrett, James; Westphal, Michael F; Hedin, Marshal

    2015-10-01

    We use mitochondrial and multi-locus nuclear DNA sequence data to infer both species boundaries and species relationships within California nemesiid spiders. Higher-level phylogenetic data show that the California radiation is monophyletic and distantly related to European members of the genus Brachythele. As such, we consider all California nemesiid taxa to belong to the genus Calisoga Chamberlin, 1937. Rather than find support for one or two taxa as previously hypothesized, genetic data reveal Calisoga to be a species-rich radiation of spiders, including perhaps dozens of species. This conclusion is supported by multiple mitochondrial barcoding analyses, and also independent analyses of nuclear data that reveal general genealogical congruence. We discovered three instances of sympatry, and genetic data indicate reproductive isolation when in sympatry. An examination of female reproductive morphology does not reveal species-specific characters, and observed male morphological differences for a subset of putative species are subtle. Our coalescent species tree analysis of putative species lays the groundwork for future research on the taxonomy and biogeographic history of this remarkable endemic radiation. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. Multilocus sequence typing of Xylella fastidiosa isolated from olive affected by “olive quick decline syndrome” in Italy

    Directory of Open Access Journals (Sweden)

    Toufic ELBEAINO

    2015-01-01

    The recent finding of Xylella fastidiosa (Xf) in olive trees in southern Italy, the scanty molecular information on this bacterium and its association with the olive quick decline syndrome (OQDS) prompted the necessity to isolate and acquire more genetic data on the type of strain present in that region. For the first time, the bacterium was isolated from infected olive on culture media. Genetic information was obtained through genomic comparison with other subspecies or strains. The sequences of thirteen genes from its genome, comprising seven housekeeping genes (leuA, petC, lacF, cysG, holC, nuoL and gltT) usually used in multilocus sequence typing (MLST) systems, and six genes involved in different biochemical functions (RNA Pol sigma-70 factor, hypothetical protein HL, 16S rRNA, rfbD, nuoN, and pilU), were analyzed. The sequences of the biochemical function genes were explored individually to study the genetic structure of this bacterium, while the MLST genes were linked together into one concatameric sequence (4161 bp long) to increase the resolution of the phylogenetic analysis when compared with Xf strains previously reported. Sequence analyses of single genes showed that the Xf olive strain is distinct from the four previously defined taxons (Xf subsp. fastidiosa, Xf subsp. multiplex, Xf subsp. sandyi and Xf subsp. pauca) with a dissimilarity rate that reached 4%. In particular, Xf from olive shared the greatest identity with the strain “9a5c” (subsp. pauca), but was nevertheless distinct from it. Similarly, the MLST based on concatameric sequences confirmed the genetic variance of Xf from olive by generating a novel sequence type profile (ST53). Phylogenetic tree analyses showed that Xf from olive clustered in one clade close to subspecies pauca (strains “9a5c” and “CVC0018”), but was nevertheless distinct from them. These results indicate molecular divergence of this olive bacterium with all other strains yet reported.
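
    The record above describes linking the seven MLST gene fragments into one concatenated sequence and reporting a dissimilarity rate between strains. A minimal sketch of both steps follows. The locus order is taken from the abstract as printed; the helper names, the toy fragments, and the choice to skip gap positions when counting differences are assumptions made for illustration, not the authors' pipeline.

```python
from typing import Dict, Sequence

# Locus order as listed in the abstract.
MLST_LOCI: Sequence[str] = ("leuA", "petC", "lacF", "cysG", "holC", "nuoL", "gltT")


def concatenate_loci(fragments: Dict[str, str], order: Sequence[str] = MLST_LOCI) -> str:
    """Join per-locus fragments in a fixed order into one upper-case sequence."""
    missing = [locus for locus in order if locus not in fragments]
    if missing:
        raise ValueError(f"missing loci: {missing}")
    return "".join(fragments[locus].upper() for locus in order)


def percent_dissimilarity(seq_a: str, seq_b: str) -> float:
    """Percentage of differing sites between two aligned, equal-length sequences.

    Positions where either sequence has a gap ('-') are ignored (an assumption).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = [(a, b) for a, b in zip(seq_a, seq_b) if a != "-" and b != "-"]
    if not compared:
        raise ValueError("no comparable positions")
    differences = sum(a != b for a, b in compared)
    return 100.0 * differences / len(compared)


# Toy fragments (not real Xf sequences): every locus is the same 4-mer.
toy_strain = {locus: "acgt" for locus in MLST_LOCI}
toy_concat = concatenate_loci(toy_strain)
```

    Fixing the locus order matters: two concatenated sequences are only comparable position by position if both were assembled from the same loci in the same order.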

  11. High-resolution melting genotyping of Enterococcus faecium based on multilocus sequence typing derived single nucleotide polymorphisms.

    Directory of Open Access Journals (Sweden)

    Steven Y C Tong

    We have developed a single nucleotide polymorphism (SNP) nucleated high-resolution melting (HRM) technique to genotype Enterococcus faecium. Eight SNPs were derived from the E. faecium multilocus sequence typing (MLST) database and amplified fragments containing these SNPs were interrogated by HRM. We tested the HRM genotyping scheme on 85 E. faecium bloodstream isolates and compared the results with MLST, pulsed-field gel electrophoresis (PFGE) and an allele specific real-time PCR (AS kinetic PCR) SNP typing method. In silico analysis based on predicted HRM curves according to the G+C content of each fragment for all 567 sequence types (STs) in the MLST database together with empiric data from the 85 isolates demonstrated that HRM analysis resolves E. faecium into 231 "melting types" (MelTs) and provides a Simpson's Index of Diversity (D) of 0.991 with respect to MLST. This is a significant improvement on the AS kinetic PCR SNP typing scheme that resolves 61 SNP types with D of 0.95. The MelTs were concordant with the known ST of the isolates. For the 85 isolates, there were 13 PFGE patterns, 17 STs, 14 MelTs and eight SNP types. There was excellent concordance between PFGE, MLST and MelTs with Adjusted Rand Indices of PFGE to MelT 0.936 and ST to MelT 0.973. In conclusion, this HRM based method appears rapid and reproducible. The results are concordant with MLST and the MLST based population structure.
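
    The concordance figures in the record above are Adjusted Rand Indices, which compare two groupings of the same isolates while correcting for chance agreement. The sketch below uses the standard pair-counting formula (Hubert and Arabie form); the abstract does not say which software the authors used, and the labels in the usage lines are invented.

```python
from collections import Counter
from math import comb
from typing import Hashable, Sequence


def adjusted_rand_index(labels_a: Sequence[Hashable], labels_b: Sequence[Hashable]) -> float:
    """Adjusted Rand Index between two partitions of the same items.

    1.0 means the partitions are identical; values near 0 mean agreement no
    better than chance; negative values mean worse than chance.
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("both typing methods must label the same isolates")
    n = len(labels_a)
    if n < 2:
        raise ValueError("need at least two isolates")

    # Pairs of isolates grouped together by both methods, by A, and by B.
    pairs_both = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    pairs_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    pairs_b = sum(comb(c, 2) for c in Counter(labels_b).values())

    expected = pairs_a * pairs_b / comb(n, 2)   # agreement expected by chance
    maximum = (pairs_a + pairs_b) / 2           # upper bound on agreement
    if maximum == expected:
        # Degenerate case (e.g. every isolate in its own group in both partitions).
        return 1.0
    return (pairs_both - expected) / (maximum - expected)


# Hypothetical type labels for four isolates under two methods.
same_grouping = adjusted_rand_index(["ST1", "ST1", "ST2", "ST2"], ["M1", "M1", "M2", "M2"])
crossed_grouping = adjusted_rand_index(["ST1", "ST1", "ST2", "ST2"], ["M1", "M2", "M1", "M2"])
```

    Only the grouping matters, not the label names, so the first call returns 1.0 even though one method uses "ST" labels and the other "M" labels.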

  12. Comparative genomic assessment of Multi-Locus Sequence Typing: rapid accumulation of genomic heterogeneity among clonal isolates of Campylobacter jejuni

    Directory of Open Access Journals (Sweden)

    Nash John HE

    2008-08-01

    Background: Multi-Locus Sequence Typing (MLST) has emerged as a leading molecular typing method owing to its high ability to discriminate among bacterial isolates, the relative ease with which data acquisition and analysis can be standardized, and the high portability of the resulting sequence data. While MLST has been successfully applied to the study of the population structure for a number of different bacterial species, it has also provided compelling evidence for high rates of recombination in some species. We have analyzed a set of Campylobacter jejuni strains using MLST and Comparative Genomic Hybridization (CGH) on a full-genome microarray in order to determine whether recombination and high levels of genomic mosaicism adversely affect the inference of strain relationships based on the analysis of a restricted number of genetic loci. Results: Our results indicate that, in general, there is significant concordance between strain relationships established by MLST and those based on shared gene content as established by CGH. While MLST has significant predictive power with respect to overall genome similarity of isolates, we also found evidence for significant differences in genomic content among strains that would otherwise appear to be highly related based on their MLST profiles. Conclusion: The extensive genomic mosaicism between closely related strains has important implications in the context of establishing strain to strain relationships because it suggests that the exact gene content of strains, and by extension their phenotype, is less likely to be "predicted" based on a small number of typing loci. This in turn suggests that a greater emphasis should be placed on analyzing genes of clinical interest as we forge ahead with the next generation of molecular typing methods.

  13. Multilocus Sequence Typing Reveals Relevant Genetic Variation and Different Evolutionary Dynamics among Strains of Xanthomonas arboricola pv. juglandis

    Directory of Open Access Journals (Sweden)

    Marco Scortichini

    2010-11-01

    Forty-five Xanthomonas arboricola pv. juglandis (Xaj) strains originating from Juglans regia cultivation in different countries were molecularly typed by means of MultiLocus Sequence Typing (MLST), using acnB, gapA, gyrB and rpoD gene fragments. A total of 2.5 kilobases was used to infer the phylogenetic relationship among the strains and possible recombination events. Haplotype diversity, linkage disequilibrium analysis, selection tests, gene flow estimates and codon adaptation index were also assessed. The dendrograms built by maximum likelihood with concatenated nucleotide and amino acid sequences revealed two major and two minor phylotypes. The same haplotype was found in strains originating from different continents, and different haplotypes were found in strains isolated in the same year from the same location. A recombination breakpoint was detected within the rpoD gene fragment. At the pathovar level, the Xaj populations studied here are clonal and under neutral selection. However, four Xaj strains isolated from walnut fruits with apical necrosis are under diversifying selection, suggesting a possible new adaptation. Gene flow estimates do not support the hypothesis of geographic isolation of the strains, even though the genetic diversity between the strains increases as the geographic distance between them increases. A triplet deletion, causing the absence of valine, was found in the rpoD fragment of all 45 Xaj strains when compared with X. axonopodis pv. citri strain 306. The codon adaptation index was high in all four genes studied, indicating a relevant metabolic activity.

  14. Multilocus Sequence Typing Reveals a New Cluster of Closely Related Candida tropicalis Genotypes in Italian Patients With Neurological Disorders.

    Science.gov (United States)

    Scordino, Fabio; Giuffrè, Letterio; Barberi, Giuseppina; Marino Merlo, Francesca; Orlando, Maria Grazia; Giosa, Domenico; Romeo, Orazio

    2018-01-01

    Candida tropicalis is a pathogenic yeast that has emerged as an important cause of candidemia, especially in elderly patients with hematological malignancies. Infections caused by this species are mainly reported from Latin America and Asian-Pacific countries, although recent epidemiological data revealed that C. tropicalis accounts for 6-16.4% of the Candida bloodstream infections (BSIs) in Italy, representing a relevant issue especially for patients receiving long-term hospital care. The aim of this study was to describe the genetic diversity of C. tropicalis isolates contaminating the hands of healthcare workers (HCWs) and hospital environments and/or associated with BSIs occurring in patients with different neurological disorders and without hematological disease. A total of 28 C. tropicalis isolates were genotyped using multilocus sequence typing analysis of six housekeeping genes (ICL1, MDR1, SAPT2, SAPT4, XYR1, and ZWF1), and data revealed the presence of only eight diploid sequence types (DSTs), of which 6 (75%) were completely new. Four eBURST clonal complexes (CC2, CC10, CC11, and CC33) contained all DSTs found in this study, and CC33 formed an exclusive, well-defined clonal cluster from Italy. In conclusion, C. tropicalis could represent an important cause of BSIs in long-term hospitalized patients with no underlying hematological disease. The findings of this study also suggest a potential horizontal transmission of a specific C. tropicalis clone through hands of HCWs and expand our understanding of the molecular epidemiology of this pathogen, whose population structure is still far from being fully elucidated as its complexity increases as different categories of patients and geographic areas are examined.

  15. Enrichment of Multilocus Sequence Typing Clade 1 with Oral Candida albicans Isolates in Patients with Untreated Periodontitis

    Science.gov (United States)

    McManus, Brenda A.; Maguire, Rory; Cashin, Phillipa J.; Claffey, Noel; Flint, Stephen; Abdulrahim, Mohammed H.

    2012-01-01

    This study investigated the prevalence and cell density of Candida species in periodontal pockets, healthy subgingival sites, and oral rinse samples of patients with untreated periodontitis. Twenty-one periodontitis patients underwent sampling at two periodontitis sites, and 19/21 of these patients underwent sampling at one periodontally healthy site. Both paper point and curette sampling techniques were employed. The periodontitis patients and 50 healthy subjects were also sampled by oral rinse. Candida isolates were recovered on CHROMagar Candida medium, and representative isolates were identified. Candida spp. were recovered from 10/21 (47.6%) periodontitis patients and from 16/50 (32%) healthy subjects. C. albicans predominated in both groups and was recovered from all Candida-positive subjects. Candida-positive periodontitis patients yielded Candida from periodontal pockets with average densities of 3,528 and 3,910 CFU/sample from curette and paper point samples, respectively, and 1,536 CFU/ml from oral rinse samples. The majority (18/19) of the healthy sites sampled from periodontitis patients were Candida negative. The 16 Candida-positive healthy subjects yielded an average of 279 CFU/ml from oral rinse samples. C. albicans isolates were investigated by multilocus sequence typing (MLST) to determine if specific clonal groups were associated with periodontitis. MLST analysis of 31 C. albicans isolates from periodontitis patients yielded 19 sequence types (STs), 13 of which were novel. Eleven STs belonged to MLST clade 1. In contrast, 16 C. albicans isolates from separate healthy subjects belonged to 16 STs, with 4 isolates belonging to clade 1. The distributions of STs between both groups were significantly different (P = 0.04) and indicated an enrichment of C. albicans isolates in periodontal pockets, which warrants a larger study. PMID:22875886

  16. Use of multi-locus sequencing typing as identification method for the food-borne pathogen Listeria monocytogenes: a review

    Directory of Open Access Journals (Sweden)

    Sonia Lamon

    2015-01-01

    Listeria monocytogenes is a ubiquitous, intracellular pathogen that has been implicated within the past decade as the causative organism in several outbreaks of foodborne disease. This review describes a new approach to molecular typing primarily designed for global epidemiology: multi-locus sequencing typing (MLST). This approach is novel in that it uses data that allow the unambiguous characterization of bacterial strains via the Internet. Our aim is to present the currently available selection of references on L. monocytogenes MLST detection methods and to discuss its use as a gold standard for L. monocytogenes subtyping.

  17. Comparison of a newly developed binary typing with ribotyping and multilocus sequence typing methods for Clostridium difficile.

    Science.gov (United States)

    Li, Zhirong; Liu, Xiaolei; Zhao, Jianhong; Xu, Kaiyue; Tian, Tiantian; Yang, Jing; Qiang, Cuixin; Shi, Dongyan; Wei, Honglian; Sun, Suju; Cui, Qingqing; Li, Ruxin; Niu, Yanan; Huang, Bixing

    2018-04-01

    Clostridium difficile is the causative pathogen for antibiotic-related nosocomial diarrhea. For epidemiological study and identification of virulent clones, a new binary typing method was developed for C. difficile in this study. The usefulness of this newly developed, optimized 10-loci binary typing method was compared with two widely used methods, ribotyping and multilocus sequence typing (MLST), in 189 C. difficile samples. The binary typing, ribotyping and MLST typed the samples into 53 binary types (BTs), 26 ribotypes (RTs), and 33 MLST sequence types (STs), respectively. The typing ability of the binary method, expressed as Simpson's Index (SI), was better than that of either ribotyping or MLST (0.937 versus 0.892 and 0.859, respectively). The ease of testing, portability and cost-effectiveness of the new binary typing would make it a useful typing alternative for outbreak investigations within healthcare facilities and epidemiological research. Copyright © 2018 Elsevier B.V. All rights reserved.
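
    The record above ranks typing methods by Simpson's index, which estimates the probability that two isolates drawn at random from the sample belong to different types. A minimal sketch follows, assuming the Hunter-Gaston form commonly used for typing schemes, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)); the abstract does not state which variant the authors used, and the labels below are invented.

```python
from collections import Counter
from typing import Hashable, Sequence


def simpsons_index_of_diversity(type_labels: Sequence[Hashable]) -> float:
    """Hunter-Gaston form of Simpson's index of diversity.

    type_labels holds one type assignment per isolate. The result is the
    probability that two isolates sampled without replacement differ in type.
    """
    n_total = len(type_labels)
    if n_total < 2:
        raise ValueError("need at least two isolates")
    same_type_pairs = sum(c * (c - 1) for c in Counter(type_labels).values())
    return 1.0 - same_type_pairs / (n_total * (n_total - 1))


# Hypothetical type calls for the same six isolates under two methods.
coarse_method = ["RT1", "RT1", "RT1", "RT1", "RT2", "RT2"]
fine_method = ["BT1", "BT1", "BT2", "BT3", "BT4", "BT5"]
d_coarse = simpsons_index_of_diversity(coarse_method)
d_fine = simpsons_index_of_diversity(fine_method)
```

    A method that splits the same isolates into more, and more evenly sized, types yields a higher index, which is the sense in which the abstract calls one method's typing ability better than another's.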

  18. A novel multi-locus sequence typing (MLST) protocol for Leuconostoc lactis isolates from traditional dairy products in China and Mongolia.

    Science.gov (United States)

    Dan, Tong; Liu, Wenjun; Sun, Zhihong; Lv, Qiang; Xu, Haiyan; Song, Yuqin; Zhang, Heping

    2014-06-09

    Economically, Leuconostoc lactis is one of the most important species in the genus Leuconostoc. It plays an important role in the food industry including the production of dextrans and bacteriocins. Currently, traditional molecular typing approaches for characterisation of this species at the isolate level are either unavailable or are not sufficiently reliable for practical use. Multilocus sequence typing (MLST) is a robust and reliable method for characterising bacterial and fungal species at the molecular level. In this study, a novel MLST protocol was developed for 50 L. lactis isolates from Mongolia and China. Sequences from eight targeted genes (groEL, carB, recA, pheS, murC, pyrG, rpoB and uvrC) were obtained. Sequence analysis indicated 20 different sequence types (STs), with 13 of them being represented by a single isolate. Phylogenetic analysis based on the sequences of eight MLST loci indicated that the isolates belonged to two major groups, A (34 isolates) and B (16 isolates). Linkage disequilibrium analyses indicated that recombination occurred at a low frequency in L. lactis, indicating a clonal population structure. Split-decomposition analysis indicated that intraspecies recombination played a role in generating genotypic diversity amongst isolates. Our results indicated that MLST is a valuable tool for typing L. lactis isolates that can be used for further monitoring of evolutionary changes and population genetics.

  19. Modeling genetic imprinting effects of DNA sequences with multilocus polymorphism data

    Directory of Open Access Journals (Sweden)

    Staud Roland

    2009-08-01

    Single nucleotide polymorphisms (SNPs) represent the most widespread type of DNA sequence variation in the human genome and they have recently emerged as valuable genetic markers for revealing the genetic architecture of complex traits in terms of nucleotide combination and sequence. Here, we extend an algorithmic model for the haplotype analysis of SNPs to estimate the effects of genetic imprinting expressed at the DNA sequence level. The model provides a general procedure for identifying the number and types of optimal DNA sequence variants that are expressed differently due to their parental origin. The model is used to analyze a genetic data set collected from a pain genetics project. We find that DNA haplotype GAC from three SNPs, OPRKG36T (with two alleles G and T), OPRKA843G (with alleles A and G), and OPRKC846T (with alleles C and T), at the kappa-opioid receptor, triggers a significant effect on pain sensitivity, but with expression significantly depending on the parent from which it is inherited (p = 0.008). With a tremendous advance in SNP identification and automated screening, the model founded on haplotype discovery and statistical inference may provide a useful tool for genetic analysis of any quantitative trait with complex inheritance.

  20. Internalin profiling and multilocus sequence typing suggest four Listeria innocua subgroups with different evolutionary distances from Listeria monocytogenes.

    Science.gov (United States)

    Chen, Jianshun; Chen, Qiaomiao; Jiang, Lingli; Cheng, Changyong; Bai, Fan; Wang, Jun; Mo, Fan; Fang, Weihuan

    2010-03-31

    Ecological, biochemical and genetic resemblance as well as clear differences of virulence between L. monocytogenes and L. innocua make this bacterial clade attractive as a model to examine evolution of pathogenicity. This study attempted to examine the population structure of L. innocua and the microevolution in the L. innocua-L. monocytogenes clade via profiling of 37 internalin genes and multilocus sequence typing based on the sequences of 9 unlinked genes gyrB, sigB, dapE, hisJ, ribC, purM, gap, tuf and betL. L. innocua was genetically monophyletic compared to L. monocytogenes, and comprised four subgroups. Subgroups A and B correlated with internalin types 1 and 3 (except the strain 0063 belonging to subgroup C) and internalin types 2 and 4 respectively. The majority of L. innocua strains belonged to these two subgroups. Subgroup A harbored a whole set of L. monocytogenes-L. innocua common and L. innocua-specific internalin genes, and displayed higher recombination rates than those of subgroup B, including the relative frequency of occurrence of recombination versus mutation (rho/theta) and the relative effect of recombination versus point mutation (r/m). Subgroup A also exhibited a significantly smaller exterior/interior branch length ratio than expected under the coalescent model, suggesting a recent expansion of its population size. The phylogram based on the analysis with correction for recombination revealed that the times to the most recent common ancestor (TMRCA) of L. innocua subgroups A and B were similar. Additionally, subgroup D, which correlated with internalin type 5, branched off from the other three subgroups. All L. innocua strains lacked seventeen virulence genes found in L. monocytogenes (except for the subgroup D strain L43 harboring inlJ and two subgroup B strains bearing bsh) and were nonpathogenic to mice. L. innocua represents a young species descending from L. monocytogenes and comprises four subgroups: two major subgroups A and B

  1. Internalin profiling and multilocus sequence typing suggest four Listeria innocua subgroups with different evolutionary distances from Listeria monocytogenes

    Science.gov (United States)

    2010-01-01

    Background: Ecological, biochemical and genetic resemblance as well as clear differences of virulence between L. monocytogenes and L. innocua make this bacterial clade attractive as a model to examine evolution of pathogenicity. This study attempted to examine the population structure of L. innocua and the microevolution in the L. innocua-L. monocytogenes clade via profiling of 37 internalin genes and multilocus sequence typing based on the sequences of 9 unlinked genes gyrB, sigB, dapE, hisJ, ribC, purM, gap, tuf and betL. Results: L. innocua was genetically monophyletic compared to L. monocytogenes, and comprised four subgroups. Subgroups A and B correlated with internalin types 1 and 3 (except the strain 0063 belonging to subgroup C) and internalin types 2 and 4 respectively. The majority of L. innocua strains belonged to these two subgroups. Subgroup A harbored a whole set of L. monocytogenes-L. innocua common and L. innocua-specific internalin genes, and displayed higher recombination rates than those of subgroup B, including the relative frequency of occurrence of recombination versus mutation (ρ/θ) and the relative effect of recombination versus point mutation (r/m). Subgroup A also exhibited a significantly smaller exterior/interior branch length ratio than expected under the coalescent model, suggesting a recent expansion of its population size. The phylogram based on the analysis with correction for recombination revealed that the times to the most recent common ancestor (TMRCA) of L. innocua subgroups A and B were similar. Additionally, subgroup D, which correlated with internalin type 5, branched off from the other three subgroups. All L. innocua strains lacked seventeen virulence genes found in L. monocytogenes (except for the subgroup D strain L43 harboring inlJ and two subgroup B strains bearing bsh) and were nonpathogenic to mice. Conclusions: L. innocua represents a young species descending from L. monocytogenes and comprises four subgroups: two

  2. Internalin profiling and multilocus sequence typing suggest four Listeria innocua subgroups with different evolutionary distances from Listeria monocytogenes

    Directory of Open Access Journals (Sweden)

    Wang Jun

    2010-03-01

    Full Text Available Abstract Background Ecological, biochemical and genetic resemblance as well as clear differences of virulence between L. monocytogenes and L. innocua make this bacterial clade attractive as a model to examine evolution of pathogenicity. This study attempted to examine the population structure of L. innocua and the microevolution in the L. innocua-L. monocytogenes clade via profiling of 37 internalin genes and multilocus sequence typing based on the sequences of 9 unlinked genes gyrB, sigB, dapE, hisJ, ribC, purM, gap, tuf and betL. Results L. innocua was genetically monophyletic compared to L. monocytogenes, and comprised four subgroups. Subgroups A and B correlated with internalin types 1 and 3 (except the strain 0063 belonging to subgroup C) and internalin types 2 and 4 respectively. The majority of L. innocua strains belonged to these two subgroups. Subgroup A harbored a whole set of L. monocytogenes-L. innocua common and L. innocua-specific internalin genes, and displayed higher recombination rates than those of subgroup B, including the relative frequency of occurrence of recombination versus mutation (ρ/θ) and the relative effect of recombination versus point mutation (r/m). Subgroup A also exhibited a significantly smaller exterior/interior branch length ratio than expected under the coalescent model, suggesting a recent expansion of its population size. The phylogram based on the analysis with correction for recombination revealed that the time to the most recent common ancestor (TMRCA) of L. innocua subgroups A and B were similar. Additionally, subgroup D, which correlated with internalin type 5, branched off from the other three subgroups. All L. innocua strains lacked seventeen virulence genes found in L. monocytogenes (except for the subgroup D strain L43 harboring inlJ and two subgroup B strains bearing bsh) and were nonpathogenic to mice. Conclusions L. innocua represents a young species descending from L. monocytogenes and

  3. Towards multilocus sequence typing of the Leishmania donovani complex: Resolving genotypes and haplotypes for five polymorphic metabolic enzymes (ASAT, GPI, NH1, NH2, PGD)

    Czech Academy of Sciences Publication Activity Database

    Mauricio, I. L.; Yeo, M.; Baghaei, M.; Doto, D.; Pratlong, F.; Zemanová, Eva; Dedet, J.-P.; Lukeš, Julius; Miles, M. A.

    2006-01-01

    Vol. 36, No. 7 (2006), pp. 757-769 ISSN 0020-7519 Grant - others: European Commission (EU) QLK2-CT-2001-01810 Institutional research plan: CEZ:AV0Z60220518 Keywords: Leishmania donovani * Leishmania infantum * multilocus sequence typing Subject RIV: EB - Genetics; Molecular Biology Impact factor: 3.337, year: 2006

  4. Multi-locus sequence typing provides epidemiological insights for diseased sharks infected with fungi belonging to the Fusarium solani species complex.

    Science.gov (United States)

    Desoubeaux, Guillaume; Debourgogne, Anne; Wiederhold, Nathan P; Zaffino, Marie; Sutton, Deanna; Burns, Rachel E; Frasca, Salvatore; Hyatt, Michael W; Cray, Carolyn

    2018-07-01

    Fusarium spp. are saprobic moulds that are responsible for severe opportunistic infections in humans and animals. However, we need epidemiological tools to reliably trace the circulation of such fungal strains within medical or veterinary facilities, to recognize environmental contaminations that might lead to infection and to improve our understanding of factors responsible for the onset of outbreaks. In this study, we used molecular genotyping to investigate clustered cases of Fusarium solani species complex (FSSC) infection that occurred in eight Sphyrnidae sharks under managed care at a public aquarium. Genetic relationships between fungal strains were determined by multi-locus sequence typing (MLST) analysis based on DNA sequencing at five loci, followed by comparison with sequences of 50 epidemiologically unrelated FSSC strains. Our genotyping approach revealed that F. keratoplasticum and F. solani haplotype 9x were most commonly isolated. In one case, the infection proved to be with another Hypocrealian rare opportunistic pathogen Metarhizium robertsii. Twice, sharks proved to be infected with FSSC strains with the same MLST sequence type, supporting the hypothesis that common environmental populations of fungi existed for these sharks and suggesting the longtime persistence of the two clonal strains within the environment, perhaps in holding pools and life support systems of the aquarium. This study highlights how molecular tools like MLST can be used to investigate outbreaks of microbiological disease. This work reinforces the need for regular controls of water quality to reduce microbiological contamination due to waterborne microorganisms.

  5. Prevalence of Thermotolerant Campylobacter spp. in Chicken Meat in Croatia and Multilocus Sequence Typing of a Small Subset of Campylobacter jejuni and Campylobacter coli Isolates

    Directory of Open Access Journals (Sweden)

    Andrea Humski

    2016-01-01

    Full Text Available In order to detect thermotolerant Campylobacter spp., 241 samples of fresh chicken meat, at retail in Croatia, were analysed according to a standard method, followed by biochemical test and molecular polymerase chain reaction/restriction enzyme analysis for exact species determination. Campylobacter spp. prevalence was 73.86 %. Campylobacter jejuni and Campylobacter coli were isolated from 53.53 and 15.35 % of the samples, respectively. In 4.98 % of isolates thermotolerant Campylobacter spp. were not determined. The multi locus sequence typing method was used to evaluate genetic diversity of eight Campylobacter jejuni and four Campylobacter coli isolates. To our knowledge, these results of genotyping provided the first data on the presence of sequence types (STs) and clonal complexes (CCs) of Campylobacter jejuni and C. coli isolates in Croatia. By applying the multilocus sequence typing, a new allele of tkt gene locus was discovered and marked tkt508. The C. jejuni ST 6182 and C. coli ST 6183 genotypes were described for the first time, and all other identified genotypes were clustered in the previously described sequence types and clonal complexes. These findings provide useful information on the prevalence and epidemiology of Campylobacter jejuni and C. coli in Croatia.
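
    The link between a new allele and a new sequence type can be sketched as a simple lookup. The seven loci below are those of the Campylobacter jejuni/coli MLST scheme; everything else (the ST numbers, the table contents, and the function name `assign_st`) is hypothetical and chosen only for illustration, not taken from the study or from any database.

```python
# The seven housekeeping loci of the Campylobacter jejuni/coli MLST scheme.
LOCI = ("aspA", "glnA", "gltA", "glyA", "pgm", "tkt", "uncA")

# Hypothetical lookup table (allelic profile -> sequence type).
# The profiles and ST numbers are invented for this sketch.
KNOWN_STS = {
    (1, 1, 1, 1, 1, 1, 1): 1001,
    (2, 1, 1, 3, 2, 1, 5): 1002,
}

def assign_st(alleles, known_sts=KNOWN_STS):
    """Return the ST for a dict of locus -> allele number, or None if the
    combination of alleles is not in the table (a candidate novel ST)."""
    profile = tuple(alleles[locus] for locus in LOCI)
    return known_sts.get(profile)

# A profile already in the table resolves to its ST.
known = assign_st(dict(zip(LOCI, (2, 1, 1, 3, 2, 1, 5))))

# Swapping in a previously unseen tkt allele (508, as in the abstract)
# yields a profile with no match, which would need a new ST number.
novel = assign_st(dict(zip(LOCI, (2, 1, 1, 3, 2, 508, 5))))
```

    In this sketch a single new allele at one locus is enough to make the whole profile unmatched, which is why the discovery of a new tkt allele goes together with the description of a new sequence type.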

  6. Multilocus phylogeny and MALDI-TOF analysis of the plant pathogenic species Alternaria dauci and relatives.

    Science.gov (United States)

    Brun, Sophie; Madrid, Hugo; Gerrits Van Den Ende, Bert; Andersen, Birgitte; Marinach-Patrice, Carine; Mazier, Dominique; De Hoog, G Sybren

    2013-01-01

    The genus Alternaria includes numerous phytopathogenic species, many of which are economically relevant. Traditionally, identification has been based on morphology, but is often hampered by the tendency of some strains to become sterile in culture and by the existence of species-complexes of morphologically similar taxa. This study aimed to assess if strains of four closely-related plant pathogens, i.e., Alternaria dauci (ten strains), Alternaria porri (six), Alternaria solani (ten), and Alternaria tomatophila (ten) could be accurately identified using multilocus phylogenetic analysis and Matrix-Assisted Laser Desorption Ionisation Time of Flight (MALDI-TOF) profiling of proteins. Phylogenetic analyses were performed on three loci, i.e., the internal transcribed spacer (ITS) region of rRNA, and the glyceraldehyde-3-phosphate dehydrogenase (gpd) and Alternaria major antigen (Alt a 1) genes. Phylogenetic trees based on ITS sequences did not differentiate strains of A. solani, A. tomatophila, and A. porri, but these three species formed a clade separate from strains of A. dauci. The resolution improved in trees based on gpd and Alt a 1, which distinguished strains of the four species as separate clades. However, none provided significant bootstrap support for all four species, which could only be achieved when results for the three loci were combined. MALDI-TOF-based dendrograms showed three major clusters. The first comprised all A. dauci strains, the second included five strains of A. porri and one of A. solani, and the third included all strains of A. tomatophila, as well as all but one strain of A. solani, and one strain of A. porri. Thus, this study shows the usefulness of MALDI-TOF mass spectrometry as a promising tool for identification of these four species of Alternaria which are closely-related plant pathogens. Copyright © 2012 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.

  7. Epidemiological characterization of a nosocomial outbreak of extended spectrum β-lactamase Escherichia coli ST-131 confirms the clinical value of core genome multilocus sequence typing.

    Science.gov (United States)

    Woksepp, Hanna; Ryberg, Anna; Berglind, Linda; Schön, Thomas; Söderman, Jan

    2017-12-01

    Enhanced precision of epidemiological typing in clinically suspected nosocomial outbreaks is crucial. Our aim was to investigate whether single nucleotide polymorphism (SNP) analysis and core genome (cg) multilocus sequence typing (MLST) of whole genome sequencing (WGS) data would more reliably identify a nosocomial outbreak, compared to earlier molecular typing methods. Sixteen isolates from a nosocomial outbreak of ESBL E. coli ST-131 in southeastern Sweden and three control strains were subjected to WGS. Sequences were explored by SNP analysis and cgMLST. cgMLST clearly differentiated between the outbreak isolates and the control isolates (>1400 differences). All clinically identified outbreak isolates showed close clustering (≤2 allele differences), except for two isolates (>50 allele differences). These data confirmed that the isolates with >50 differing genes did not belong to the nosocomial outbreak. The number of SNPs within the outbreak was ≤7, whereas the two discrepant isolates had >700 SNPs. Two of the ESBL E. coli ST-131 isolates did not belong to the clinically identified outbreak. Our results illustrate the power of WGS in terms of resolution, which may avoid overestimation of patients belonging to outbreaks as judged from epidemiological data and previously employed molecular methods with lower discriminatory ability. © 2017 APMIS. Published by John Wiley & Sons Ltd.
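
    The allele-difference counts quoted above amount to a Hamming distance between cgMLST profiles. The following is a minimal sketch, assuming each isolate is stored as a mapping from locus name to allele number; the isolate names, loci, and allele numbers are invented for illustration, and ignoring missing loci is one common convention rather than the method documented for this study.

```python
from itertools import combinations

def allele_differences(profile_a, profile_b):
    """Count loci at which two cgMLST profiles carry different alleles.

    Loci that are absent or uncalled (None) in either profile are skipped;
    this handling of missing data is an assumption of the sketch.
    """
    shared = profile_a.keys() & profile_b.keys()
    return sum(
        1
        for locus in shared
        if profile_a[locus] is not None
        and profile_b[locus] is not None
        and profile_a[locus] != profile_b[locus]
    )

def pairwise_distances(profiles):
    """Return {(name_a, name_b): allele differences} for every isolate pair."""
    return {
        (a, b): allele_differences(profiles[a], profiles[b])
        for a, b in combinations(sorted(profiles), 2)
    }

# Hypothetical toy profiles with four loci each.
profiles = {
    "isolate_1": {"locus_1": 1, "locus_2": 4, "locus_3": 2, "locus_4": 7},
    "isolate_2": {"locus_1": 1, "locus_2": 4, "locus_3": 2, "locus_4": 8},
    "control": {"locus_1": 3, "locus_2": 5, "locus_3": None, "locus_4": 9},
}
distances = pairwise_distances(profiles)
```

    With real schemes the same count runs over thousands of loci, so closely related outbreak isolates differ at only a handful of loci while unrelated controls differ at hundreds or more.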

  8. Next-generation phylogeography: a targeted approach for multilocus sequencing of non-model organisms.

    Directory of Open Access Journals (Sweden)

    Jonathan B Puritz

    Full Text Available The field of phylogeography has long since realized the need and utility of incorporating nuclear DNA (nDNA) sequences into analyses. However, the use of nDNA sequence data, at the population level, has been hindered by technical laboratory difficulty, sequencing costs, and problematic analytical methods dealing with genotypic sequence data, especially in non-model organisms. Here, we present a method utilizing the 454 GS-FLX Titanium pyrosequencing platform with the capacity to simultaneously sequence two species of sea star (Meridiastra calcar and Parvulastra exigua) at five different nDNA loci across 16 different populations of 20 individuals each per species. We compare results from 3 populations with traditional Sanger sequencing based methods, and demonstrate that this next-generation sequencing platform is more time and cost effective and more sensitive to rare variants than Sanger based sequencing. A crucial advantage is that the high coverage of clonally amplified sequences simplifies haplotype determination, even in highly polymorphic species. This targeted next-generation approach can greatly increase the use of nDNA sequence loci in phylogeographic and population genetic studies by mitigating many of the time, cost, and analytical issues associated with highly polymorphic, diploid sequence markers.

  9. Targeted amplicon sequencing (TAS): a scalable next-gen approach to multilocus, multitaxa phylogenetics.

    Science.gov (United States)

    Bybee, Seth M; Bracken-Grissom, Heather; Haynes, Benjamin D; Hermansen, Russell A; Byers, Robert L; Clement, Mark J; Udall, Joshua A; Wilcox, Edward R; Crandall, Keith A

    2011-01-01

    Next-gen sequencing technologies have revolutionized data collection in genetic studies and advanced genome biology to novel frontiers. However, to date, next-gen technologies have been used principally for whole genome sequencing and transcriptome sequencing. Yet many questions in population genetics and systematics rely on sequencing specific genes of known function or diversity levels. Here, we describe a targeted amplicon sequencing (TAS) approach capitalizing on next-gen capacity to sequence large numbers of targeted gene regions from a large number of samples. Our TAS approach is easily scalable, simple in execution, neither time- nor labor-intensive, relatively inexpensive, and can be applied to a broad diversity of organisms and/or genes. Our TAS approach includes a bioinformatic application, BarcodeCrucher, to take raw next-gen sequence reads and perform quality control checks and convert the data into FASTA format organized by gene and sample, ready for phylogenetic analyses. We demonstrate our approach by sequencing targeted genes of known phylogenetic utility to estimate a phylogeny for the Pancrustacea. We generated data from 44 taxa using 68 different 10-bp multiplexing identifiers. The overall quality of data produced was robust and was informative for phylogeny estimation. The potential for this method to produce copious amounts of data from a single 454 plate (e.g., 325 taxa for 24 loci) significantly reduces sequencing expenses incurred from traditional Sanger sequencing. We further discuss the advantages and disadvantages of this method, while offering suggestions to enhance the approach.

  10. Multilocus sequence typing and phylogenetic analysis of Propionibacterium acnes

    DEFF Research Database (Denmark)

    Kilian, Mogens; Scholz, Christian F. P.; Lomholt, Hans B.

    2012-01-01

    Propionibacterium acnes is a commensal of human skin but is also implicated in the pathogenesis of acne vulgaris, in biofilm-associated infections of medical devices and endophthalmitis, and in infections of bone and dental root canals. Recent studies associate P. acnes with prostate cancer...... schemes were compared with reference to a phylogenetic tree based on 78 P. acnes genomes and their gene contents. Further support for a basically clonal population structure of P. acnes and a scenario of the global spread of epidemic clones of P. acnes was obtained. Compared to the Belfast scheme...

  11. Multilocus Sequence Typing (MLST) and Phylogenetic Analysis of Propionibacterium acnes

    DEFF Research Database (Denmark)

    Kilian, Mogens; Scholz, Christian; Lomholt, Hans B

    2011-01-01

    Propionibacterium acnes is a commensal of human skin but is also implicated in the pathogenesis of acne vulgaris and in biofilm-associated infections of medical devices and endophthalmitis, and in infections of bone and dental root canals. Recent studies associate P. acnes with prostate cancer...... with reference to a phylogenetic tree based on 78 P. acnes genomes and their gene contents. Further support for a basically clonal population structure of P. acnes and a scenario of global spread of epidemic clones of P. acnes was obtained. Compared with the Belfast scheme, the Aarhus MLST scheme (http...

  12. Multilocus DNA fingerprinting in paternity analysis: a Chilean experience

    Directory of Open Access Journals (Sweden)

    Cifuentes O. Lucía

    2000-01-01

    Full Text Available DNA polymorphism is very useful in paternity analysis. The present paper describes paternity studies done using DNA profiles obtained with the (CAC)5 probe. All of the subjects studied were involved in nonjudicial cases of paternity. Genomic DNA digested with HaeIII was run on agarose gels and hybridized in the gel with the (CAC)5 probe labeled with 32P. The mean number of bands larger than 4.3 kb per individual was 16.1. The mean proportion of bands shared among unrelated individuals was 0.08 and the mean number of test bands was 7.1. This corresponded to an exclusion probability greater than 0.999999. Paternity was excluded in 34.5% of the cases. The mutation frequency estimated from non-excluded cases was 0.01143 bands per child. In these cases, the paternity was confirmed by a locus-specific analysis of eight independent PCR-based loci. The paternity index was computed in all non-excluded cases. It can be concluded that this method is a powerful and inexpensive alternative to solve paternity doubts.
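
    The reported exclusion probability can be checked against the usual band-sharing approximation for multilocus fingerprints, in which an unrelated man matches each of n test bands independently with probability x, so the chance of matching all of them is x^n. The abstract does not state how its figure was derived, so this formula is an assumption of the sketch; only the two mean values come from the text.

```python
def exclusion_probability(band_sharing, n_test_bands):
    """Probability that a random unrelated man lacks at least one test band,
    assuming bands are shared independently with probability `band_sharing`."""
    return 1.0 - band_sharing ** n_test_bands

# Mean values reported in the abstract.
x = 0.08  # mean proportion of bands shared between unrelated individuals
n = 7.1   # mean number of test bands

p_match = x ** n                       # chance of matching every test band
p_excl = exclusion_probability(x, n)   # complement of the chance match

print(f"chance match: {p_match:.2e}, exclusion probability: {p_excl:.8f}")
```

    Under these assumptions the chance match is on the order of 10^-8, which agrees with an exclusion probability greater than 0.999999.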

  13. Multilocus analysis reveals three candidate genes for Chinese migraine susceptibility.

    Science.gov (United States)

    An, X-K; Fang, J; Yu, Z-Z; Lin, Q; Lu, C-X; Qu, H-L; Ma, Q-L

    2017-08-01

    Several genome-wide association studies (GWASs) in Caucasian populations have identified 12 loci that are significantly associated with migraine. More evidence suggests that serotonin receptors are also involved in migraine pathophysiology. In the present study, a case-control study was conducted in a cohort of 581 migraine cases and 533 ethnically matched controls among a Chinese population. Eighteen polymorphisms from serotonin receptors and GWASs were selected, and genotyping was performed using a Sequenom MALDI-TOF mass spectrometry iPLEX platform. The genotypic and allelic distributions of MEF2D rs2274316 and ASTN2 rs6478241 were significantly different between migraine patients and controls. Univariate and multivariate analysis revealed significant associations of polymorphisms in the MEF2D and ASTN2 genes with migraine susceptibility. MEF2D, PRDM16 and ASTN2 were also found to be associated with migraine without aura (MO) and migraine with family history. And, MEF2D and ASTN2 also served as genetic risk factors for the migraine without family history. The generalized multifactor dimensionality reduction analysis identified that MEF2D and HTR2E constituted the two-factor interaction model. Our study suggests that the MEF2D, PRDM16 and ASTN2 genes from GWAS are associated with migraine susceptibility, especially MO, among Chinese patients. It appears that there is no association with serotonin receptor related genes. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  14. spa Typing and Multilocus Sequence Typing Show Comparable Performance in a Macroepidemiologic Study of Staphylococcus aureus in the United States.

    Science.gov (United States)

    O'Hara, F Patrick; Suaya, Jose A; Ray, G Thomas; Baxter, Roger; Brown, Megan L; Mera, Robertino M; Close, Nicole M; Thomas, Elizabeth; Amrine-Madsen, Heather

    2016-01-01

    A number of molecular typing methods have been developed for characterization of Staphylococcus aureus isolates. The utility of these systems depends on the nature of the investigation for which they are used. We compared two commonly used methods of molecular typing, multilocus sequence typing (MLST) (and its clustering algorithm, Based Upon Related Sequence Type [BURST]) with the staphylococcal protein A (spa) typing (and its clustering algorithm, Based Upon Repeat Pattern [BURP]), to assess the utility of these methods for macroepidemiology and evolutionary studies of S. aureus in the United States. We typed a total of 366 clinical isolates of S. aureus by these methods and evaluated indices of diversity and concordance values. Our results show that, when combined with the BURP clustering algorithm to delineate clonal lineages, spa typing produces results that are highly comparable with those produced by MLST/BURST. Therefore, spa typing is appropriate for use in macroepidemiology and evolutionary studies and, given its lower implementation cost, this method appears to be more efficient. The findings are robust and are consistent across different settings, patient ages, and specimen sources. Our results also support a model in which the methicillin-resistant S. aureus (MRSA) population in the United States comprises two major lineages (USA300 and USA100), which each consist of closely related variants.

  15. Serotypes, antibiotic susceptibilities, and multi-locus sequence type profiles of Streptococcus agalactiae isolates circulating in Beijing, China.

    Science.gov (United States)

    Wang, Ping; Tong, Jing-jing; Ma, Xiu-hua; Song, Feng-li; Fan, Ling; Guo, Cui-mei; Shi, Wei; Yu, Sang-jie; Yao, Kai-hu; Yang, Yong-hong

    2015-01-01

    To investigate the serotypes, antibiotic susceptibilities, and multi-locus sequence type (MLST) profiles of Streptococcus agalactiae (S. agalactiae) in Beijing to provide references for the prevention and treatment of S. agalactiae infections. All isolates were identified using the CAMP test and the latex-agglutination assay and serotyped using a Strep-B-Latex kit, after which they were assessed for antibiotic susceptibility, macrolide-resistance genes, and MLST profiles. In total, 56 S. agalactiae isolates were identified in 863 pregnant women (6.5%). Serotypes Ia, Ib, II, III, and V were identified, among which types III (32.1%), Ia (17.9%), Ib (16.1%), and V (14.3%) were the predominant serotypes. All isolates were susceptible to penicillin and ceftriaxone. The nonsusceptibility rates measured for erythromycin, clarithromycin, azithromycin, telithromycin, clindamycin, tetracycline, and levofloxacin were 85.7%, 92.9%, 98.2%, 30.4%, 73.2%, 91%, and 39.3%, respectively. We identified 14 sequence types (STs) for the 56 isolates, among which ST19 (30.4%) was predominant. The rate of fluoroquinolone resistance was higher in serotype III than in the other serotypes. Among the 44 erythromycin-resistant isolates, 32 (72.7%) carried ermB. S. agalactiae isolates of the serotypes Ia, Ib, III, and V are common in Beijing. Among the S. agalactiae isolates, the macrolide and clindamycin resistance rates are extremely high. Most of the erythromycin-resistant isolates carry ermB.

  16. Evaluation of a Multilocus Sequence Typing (MLST) scheme for Leishmania (Viannia) braziliensis and Leishmania (Viannia) panamensis in Colombia.

    Science.gov (United States)

    Herrera, Giovanny; Hernández, Carolina; Ayala, Martha S; Flórez, Carolina; Teherán, Aníbal A; Ramírez, Juan David

    2017-05-12

    Leishmaniases are parasitic vector-borne diseases affecting more than 12 million people in 98 countries. In Colombia, leishmaniasis is widespread and the most common clinical manifestation is cutaneous, mainly caused by L. panamensis and L. braziliensis. Currently, the genetic diversity of these species in Colombia is unknown. To address this, we applied molecular techniques for their characterization, using multilocus sequence typing (MLST) to explore the genetic variability and phylodynamics of the disease. Seven previously described genetic markers were selected highlighting the implementation of a mitochondrial marker. Markers were applied to 163 samples from isolates obtained between 1980 and 2001. The identification of the samples showed an excellent correlation with typing tests previously applied (MLEE, monoclonal antibodies). Isolates of L. braziliensis showed greater genetic diversity than L. panamensis, and a greater number of diploid sequence types (DSTs). In addition, the geographical distribution of DSTs for each species was obtained through georeferencing maps. To our knowledge, this study represents the first description of the genetic variability of L. panamensis in Colombia and South America, and is the first to propose a scheme of MLST for epidemiological surveillance of leishmaniasis in the country.

  17. Discrimination of multilocus sequence typing-based Campylobacter jejuni subgroups by MALDI-TOF mass spectrometry.

    Science.gov (United States)

    Zautner, Andreas Erich; Masanta, Wycliffe Omurwa; Tareen, Abdul Malik; Weig, Michael; Lugert, Raimond; Groß, Uwe; Bader, Oliver

    2013-11-07

    Campylobacter jejuni, the most common bacterial pathogen causing gastroenteritis, shows a wide genetic diversity. Previously, we demonstrated by the combination of multi locus sequence typing (MLST)-based UPGMA-clustering and analysis of 16 genetic markers that twelve different C. jejuni subgroups can be distinguished. Among these are two prominent subgroups. The first subgroup contains the majority of hyperinvasive strains and is characterized by a dimeric form of the chemotaxis-receptor Tlp7(m+c). The second has an extended amino acid metabolism and is characterized by the presence of a periplasmic asparaginase (ansB) and gamma-glutamyl-transpeptidase (ggt). Phyloproteomic principal component analysis (PCA) hierarchical clustering of MALDI-TOF based intact cell mass spectrometry (ICMS) spectra was able to group particular C. jejuni subgroups of phylogenetic related isolates in distinct clusters. Especially the aforementioned Tlp7(m+c)(+) and ansB+/ ggt+ subgroups could be discriminated by PCA. Overlay of ICMS spectra of all isolates led to the identification of characteristic biomarker ions for these specific C. jejuni subgroups. Thus, mass peak shifts can be used to identify the C. jejuni subgroup with an extended amino acid metabolism. Although the PCA hierarchical clustering of ICMS-spectra groups the tested isolates into a different order as compared to MLST-based UPGMA-clustering, the isolates of the indicator-groups form predominantly coherent clusters. These clusters reflect phenotypic aspects better than phylogenetic clustering, indicating that the genes corresponding to the biomarker ions are phylogenetically coupled to the tested marker genes. Thus, PCA clustering could be an additional tool for analyzing the relatedness of bacterial isolates.

  18. Defining and Evaluating a Core Genome Multilocus Sequence Typing Scheme for Genome-Wide Typing of Clostridium difficile.

    Science.gov (United States)

    Bletz, Stefan; Janezic, Sandra; Harmsen, Dag; Rupnik, Maja; Mellmann, Alexander

    2018-06-01

    Clostridium difficile, recently renamed Clostridioides difficile, is the most common cause of antibiotic-associated nosocomial gastrointestinal infections worldwide. To differentiate endogenous infections and transmission events, highly discriminatory subtyping is necessary. Today, methods based on whole-genome sequencing data are increasingly used to subtype bacterial pathogens; however, frequently a standardized methodology and typing nomenclature are missing. Here we report a core genome multilocus sequence typing (cgMLST) approach developed for C. difficile. Initially, we determined the breadth of the C. difficile population based on all available MLST sequence types with Bayesian inference (BAPS). The resulting BAPS partitions were used in combination with C. difficile clade information to select representative isolates that were subsequently used to define cgMLST target genes. Finally, we evaluated the novel cgMLST scheme with genomes from 3,025 isolates. BAPS grouping (n = 6 groups) together with the clade information led to a total of 11 representative isolates that were included for cgMLST definition and resulted in 2,270 cgMLST genes that were present in all isolates. Overall, 2,184 to 2,268 cgMLST targets were detected in the genome sequences of 70 outbreak-associated and reference strains, and on average 99.3% cgMLST targets (1,116 to 2,270 targets) were present in 2,954 genomes downloaded from the NCBI database, underlining the representativeness of the cgMLST scheme. Moreover, reanalyzing different cluster scenarios with cgMLST were concordant to published single nucleotide variant analyses. In conclusion, the novel cgMLST is representative for the whole C. difficile population, is highly discriminatory in outbreak situations, and provides a unique nomenclature facilitating interlaboratory exchange. Copyright © 2018 American Society for Microbiology.

  19. Core Genome Multilocus Sequence Typing for Identification of Globally Distributed Clonal Groups and Differentiation of Outbreak Strains of Listeria monocytogenes.

    Science.gov (United States)

    Chen, Yi; Gonzalez-Escalona, Narjol; Hammack, Thomas S; Allard, Marc W; Strain, Errol A; Brown, Eric W

    2016-10-15

    Many listeriosis outbreaks are caused by a few globally distributed clonal groups, designated clonal complexes or epidemic clones, of Listeria monocytogenes, several of which have been defined by classic multilocus sequence typing (MLST) schemes targeting 6 to 8 housekeeping or virulence genes. We have developed and evaluated core genome MLST (cgMLST) schemes and applied them to isolates from multiple clonal groups, including those associated with 39 listeriosis outbreaks. The cgMLST clusters were congruent with MLST-defined clonal groups, which had various degrees of diversity at the whole-genome level. Notably, cgMLST could distinguish among outbreak strains and epidemiologically unrelated strains of the same clonal group, which could not be achieved using classic MLST schemes. The precise selection of cgMLST gene targets may not be critical for the general identification of clonal groups and outbreak strains. cgMLST analyses further identified outbreak strains, including those associated with recent outbreaks linked to contaminated French-style cheese, Hispanic-style cheese, stone fruit, caramel apple, ice cream, and packaged leafy green salad, as belonging to major clonal groups. We further developed lineage-specific cgMLST schemes, which can include accessory genes when core genomes do not possess sufficient diversity, and this provided additional resolution over species-specific cgMLST. Analyses of isolates from different common-source listeriosis outbreaks revealed various degrees of diversity, indicating that the numbers of allelic differences should always be combined with cgMLST clustering and epidemiological evidence to define a listeriosis outbreak. Classic multilocus sequence typing (MLST) schemes targeting internal fragments of 6 to 8 genes that define clonal complexes or epidemic clones have been widely employed to study L. monocytogenes biodiversity and its relation to pathogenicity potential and epidemiology. We demonstrated that core genome MLST

  20. Using multi-locus allelic sequence data to estimate genetic divergence among four Lilium (Liliaceae) cultivars

    NARCIS (Netherlands)

    Shahin, A.; Smulders, M.J.M.; Tuyl, van J.M.; Arens, P.F.P.; Bakker, F.T.

    2014-01-01

    Next Generation Sequencing (NGS) may enable estimating relationships among genotypes using allelic variation of multiple nuclear genes simultaneously. We explored the potential and caveats of this strategy in four genetically distant Lilium cultivars to estimate their genetic divergence from

  1. Multilocus Sequence Typing and Virulence Profiles in Uropathogenic Escherichia coli Isolated from Cats in the United States.

    Directory of Open Access Journals (Sweden)

    Xiaoqiang Liu

    Full Text Available The population structure, virulence, and antimicrobial resistance of uropathogenic E. coli (UPEC) from cats are rarely characterized. The aim of this study was to compare and characterize the UPEC isolated from cats in four geographic regions of USA in terms of their multilocus sequence typing (MLST), virulence profiles, clinical signs, antimicrobial resistance and phylogenetic grouping. The results showed that a total of 74 E. coli isolates were typed to 40 sequence types with 10 being novel. The most frequent phylogenetic group was B2 (n = 57). The most frequent sequence types were ST73 (n = 12) and ST83 (n = 6), ST73 was represented by four multidrug resistant (MDR) and eight non-multidrug resistant (SDR) isolates, and ST83 were significantly more likely to exhibit no drug resistant (NDR) isolates carrying the highest number of virulence genes. Additionally, MDR isolates were more diverse, and followed by SDR and NDR isolates in regards to the distribution of the STs. afa/draBC was the most prevalent among the 29 virulence-associated genes. Linking virulence profile and antimicrobial resistance, the majority of virulence-associated genes tested were more prevalent in NDR isolates, and followed by SDR and MDR isolates. Twenty (50%) MLST types in this study have previously been associated with human isolates, suggesting that these STs are potentially zoonotic. Our data enhanced the understanding of E. coli population structure and virulence association from cats. The diverse and various combinations of virulence-associated genes implied that the infection control may be challenging.

  2. Estimation of isolation times of the island species in the Drosophila simulans complex from multilocus DNA sequence data.

    Directory of Open Access Journals (Sweden)

    Shannon R McDermott

    2008-06-01

    The Drosophila simulans species complex continues to serve as an important model system for the study of new species formation. The complex is comprised of the cosmopolitan species, D. simulans, and two island endemics, D. mauritiana and D. sechellia. A substantial amount of effort has gone into reconstructing the natural history of the complex, in part to infer the context in which functional divergence among the species has arisen. In this regard, a key parameter to be estimated is the initial isolation time (t) of each island species. Loci in regions of low recombination have lower divergence within the complex than do other loci, yet divergence from D. melanogaster is similar for both classes. This might reflect gene flow of the low-recombination loci subsequent to initial isolation, but it might also reflect differential effects of changing population size on the two recombination classes of loci when the low-recombination loci are subject to genetic hitchhiking or pseudohitchhiking. New DNA sequence variation data for 17 loci corroborate the prior observation from 13 loci that DNA sequence divergence is reduced in genes of low recombination. Two models are presented to estimate t and other relevant parameters (substitution rate correction factors in lineages leading to the island species and, in the case of the 4-parameter model, the ratio of ancestral to extant effective population size) from the multilocus DNA sequence data. In general, it appears that both island species were isolated at about the same time, here estimated at approximately 250,000 years ago. It also appears that the difference in divergence patterns of genes in regions of low and higher recombination can be reconciled by allowing a modestly larger effective population size for the ancestral population than for extant D. simulans.

  3. Evaluation of two multi-locus sequence typing schemes for commensal Escherichia coli from dairy cattle in Washington State.

    Science.gov (United States)

    Ahmed, Sara; Besser, Thomas E; Call, Douglas R; Weissman, Scott J; Jones, Lisa P; Davis, Margaret A

    2016-05-01

    Multi-locus sequence typing (MLST) is a useful system for phylogenetic and epidemiological studies of multidrug-resistant Escherichia coli. Most studies utilize a seven-locus MLST, but an alternate two-locus typing method (fumC and fimH; CH typing) has been proposed that may offer a similar degree of discrimination at lower cost. Herein, we compare CH typing to the standard seven-locus method for typing commensal E. coli isolates from dairy cattle. In addition, we evaluated alternative combinations of eight loci to identify combinations that maximize discrimination and congruence with standard seven-locus MLST among commensal E. coli while minimizing the cost. We also compared both methods when used for typing uropathogenic E. coli (UPEC). CH typing was less discriminatory for commensal E. coli than the standard seven-locus method (Simpson's Index of Diversity=0.933 [0.902-0.964] and 0.97 [0.96-0.979], respectively). Combining fimH with housekeeping gene loci improved discriminatory power for commensal E. coli from cattle but resulted in poor congruence with MLST. We found that a four-locus typing method including the housekeeping genes adk, purA, gyrB and recA could be used to minimize cost without sacrificing discriminatory power or congruence with Achtman seven-locus MLST when typing commensal E. coli. Copyright © 2016 Elsevier B.V. All rights reserved.

  4. Use of multilocus sequence typing for the investigation of colonisation by Candida albicans in intensive care unit patients.

    Science.gov (United States)

    Cliff, P R; Sandoe, J A T; Heritage, J; Barton, R C

    2008-05-01

    A prospective study was performed to determine the prevalence of candidal colonisation on the general intensive care unit at a large teaching hospital. Colonisation with Candida spp. was found to be common, occurring in 79% of patients on the unit. C. albicans was the commonest species, colonising 64% of patients, followed by C. glabrata (18%) and C. parapsilosis (14%). Most of the members of staff tested carried Candida spp. at some point, although carriage appeared to be transient. C. parapsilosis was the most commonly isolated species from staff hands, whereas C. albicans was the most commonly isolated species from the mouth. The molecular epidemiology of C. albicans was investigated using Ca3 typing and multilocus sequence typing (MLST). MLST proved to be a reproducible typing method and a useful tool for the investigation of the molecular epidemiology of C. albicans. The results of the molecular typing provided evidence for the presence of an endemic strain on the unit, which was isolated repeatedly from patients and staff. This finding suggests horizontal transmission of C. albicans on the unit though it may also reflect the relative frequency of C. albicans strain types colonising patients on admission. This study has important implications for the epidemiology of systemic candidal infections.

  5. Comparison of multilocus sequence typing, RAPD, and MALDI-TOF mass spectrometry for typing of β-lactam-resistant Klebsiella pneumoniae strains.

    Science.gov (United States)

    Sachse, Svea; Bresan, Stephanie; Erhard, Marcel; Edel, Birgit; Pfister, Wolfgang; Saupe, Angela; Rödel, Jürgen

    2014-12-01

    Extended spectrum of β-lactam (ESBL) resistance of Klebsiella pneumoniae has become an increasing problem in hospital infections. Typing of isolates is important to establish the intrahospital surveillance of resistant clones. In this study, the discriminatory potential of randomly amplified polymorphic DNA and matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) analyses were compared with multilocus sequence typing (MLST) by using 17 β-lactam-resistant K. pneumoniae isolates of different genotypes. MLST alleles were distributed in 8 sequence types (STs). Among ESBL strains of the same ST, the presence of different β-lactamase genes was common. RAPD band patterns also revealed 8 types that corresponded to MLST-defined genotypes in 15 out of 17 cases. MALDI-TOF analysis could differentiate 5 clusters of strains. The results of this work show that RAPD may be usable as a rapid screening method for the intrahospital surveillance of K. pneumoniae, allowing a discrimination of clonally related strains. MALDI-TOF-based typing was not strongly corresponding to genotyping and warrants further investigation. Copyright © 2014 Elsevier Inc. All rights reserved.

  6. The Leishmania donovani complex: Genotypes of five metabolic enzymes (ICD, ME, MPI, G6PDH and FH), new targets for multilocus sequence typing

    Czech Academy of Sciences Publication Activity Database

    Zemanová, Eva; Jirků, Milan; Mauricio, I. L.; Horák, Aleš; Miles, M. A.; Lukeš, Julius

    2007-01-01

    Vol. 37, No. 2 (2007), pp. 149-160 ISSN 0020-7519 R&D Projects: GA MŠk 2B06129 Grant - others: EU(EU) QLK2-CT-2001-01810 Institutional research plan: CEZ:AV0Z60220518 Source of funding: R - EC framework project Keywords: Leishmania donovani complex * zymodeme * multilocus sequence typing * Leishmania * phylogenetic network Subject RIV: EB - Genetics; Molecular Biology Impact factor: 3.392, year: 2007

  7. Multilocus Sequence Typing of Historical Burkholderia pseudomallei Isolates Collected in Southeast Asia from 1964 to 1967 Provides Insight into the Epidemiology of Melioidosis

    OpenAIRE

    McCombie, Roberta L.; Finkelstein, Richard A.; Woods, Donald E.

    2006-01-01

    A collection of 207 historically relevant Burkholderia pseudomallei isolates was analyzed by multilocus sequence typing (MLST). The strain collection contains environmental isolates obtained from a geographical distribution survey of B. pseudomallei isolates in Thailand (1964 to 1967), as well as stock cultures and colony variants from the U.S. Army Medical Research Unit (Malaysia), the Walter Reed Army Institute for Research, and the Pasteur Institute (Vietnam). The 207 isolates of the colle...

  8. Use of Whole-Genus Genome Sequence Data To Develop a Multilocus Sequence Typing Tool That Accurately Identifies Yersinia Isolates to the Species and Subspecies Levels

    Science.gov (United States)

    Hall, Miquette; Chattaway, Marie A.; Reuter, Sandra; Savin, Cyril; Strauch, Eckhard; Carniel, Elisabeth; Connor, Thomas; Van Damme, Inge; Rajakaruna, Lakshani; Rajendram, Dunstan; Jenkins, Claire; Thomson, Nicholas R.

    2014-01-01

    The genus Yersinia is a large and diverse bacterial genus consisting of human-pathogenic species, a fish-pathogenic species, and a large number of environmental species. Recently, the phylogenetic and population structure of the entire genus was elucidated through the genome sequence data of 241 strains encompassing every known species in the genus. Here we report the mining of this enormous data set to create a multilocus sequence typing-based scheme that can identify Yersinia strains to the species level to a level of resolution equal to that for whole-genome sequencing. Our assay is designed to be able to accurately subtype the important human-pathogenic species Yersinia enterocolitica to whole-genome resolution levels. We also report the validation of the scheme on 386 strains from reference laboratory collections across Europe. We propose that the scheme is an important molecular typing system to allow accurate and reproducible identification of Yersinia isolates to the species level, a process often inconsistent in nonspecialist laboratories. Additionally, our assay is the most phylogenetically informative typing scheme available for Y. enterocolitica. PMID:25339391

  9. Evolution in Australasian mangrove forests: multilocus phylogenetic analysis of the Gerygone warblers (Aves: Acanthizidae).

    Directory of Open Access Journals (Sweden)

    Árpád S Nyári

    The mangrove forests of Australasia have many endemic bird species but their evolution and radiation in those habitats has been little studied. One genus with several mangrove specialist species is Gerygone (Passeriformes: Acanthizidae). The phylogeny of the Acanthizidae is reasonably well understood but limited taxon sampling for Gerygone has constrained understanding of its evolution and historical biogeography in mangroves. Here we report on a phylogenetic analysis of Gerygone based on comprehensive taxon sampling and a multilocus dataset of thirteen loci spread across the avian genome (eleven nuclear and two mitochondrial loci). Since Gerygone includes three species restricted to Australia's coastal mangrove forests, we particularly sought to understand the biogeography of their evolution in that ecosystem. Analyses of individual loci, as well as of a concatenated dataset drawn from previous molecular studies, indicate that the genus as currently defined is not monophyletic, and that the Grey Gerygone (G. cinerea) from New Guinea should be transferred to the genus Acanthiza. The multilocus approach has permitted the nuanced view of the group's evolution into mangrove ecosystems having occurred on multiple occasions, in three non-overlapping time frames, most likely first by the G. magnirostris lineage, and subsequently followed by those of G. tenebrosa and G. levigaster.

  10. New multilocus sequence typing of MRSA in São Paulo, Brazil

    Directory of Open Access Journals (Sweden)

    M.S. Carmo

    2011-10-01

    An increased incidence of nosocomial and community-acquired infections caused by methicillin-resistant Staphylococcus aureus (MRSA) has been observed worldwide. The molecular characterization of MRSA has played an important role in demonstrating the existence of internationally disseminated clones. The use of molecular biology methods in the surveillance programs has enabled the tracking of MRSA spread within and among hospitals. These data are useful to alert nosocomial infection control programs about the potential introduction of these epidemic clones in their areas. Four MRSA blood culture isolates from patients hospitalized at two hospitals in the city of São Paulo, Brazil, were analyzed; one of them was community acquired. The isolates were characterized as SCCmec, mecA and PVL by PCR, pulsed-field gel electrophoresis (PFGE) profile and multilocus sequence typing (MLST) genotyping. The isolates presented type IV SCCmec, and none proved to be positive for PVL. The isolates showed a PFGE profile similar to the pediatric clone. MLST genotyping demonstrated that the isolates belonged to clonal complex 5 (CC5), showing a new yqiL allele, resulting in a new sequence type (ST) (1176). Our results showed that strains of MRSA carrying a new ST are emerging in community and nosocomial infections, including bacteremia, in São Paulo, Brazil.

  11. Development of a Multilocus Sequence Tool for Typing Cryptosporidium muris and Cryptosporidium andersoni

    Czech Academy of Sciences Publication Activity Database

    Feng, Y.; Yang, W.; Ryan, U. M.; Zhang, L.; Kváč, Martin; Koudela, Břetislav; Modrý, David; Li, N.; Fayer, R.; Xiao, L.

    2011-01-01

    Vol. 49, No. 1 (2011), pp. 34-41 ISSN 0095-1137 Institutional research plan: CEZ:AV0Z60220518 Keywords: ASTERN UNITED-STATES * RIBOSOMAL-RNA GENE * PHYLOGENETIC ANALYSIS * MOLECULAR ANALYSIS * NATURAL INFECTION * DAIRY-CATTLE * PREVALENCE * GENOTYPES * HUMANS * KENYA Subject RIV: GJ - Animal Vermins; Diseases, Veterinary Medicine Impact factor: 4.153, year: 2011

  12. Genetic Relatedness Among Shiga Toxin-Producing Escherichia coli Isolated Along the Animal Food Supply Chain and in Gastroenteritis Cases in Qatar Using Multilocus Sequence Typing.

    Science.gov (United States)

    Palanisamy, Srikanth; Chang, YuChen; Scaria, Joy; Penha Filho, Rafael Antonio Casarin; Peters, Kenlyn E; Doiphode, Sanjay H; Sultan, Ali; Mohammed, Hussni O

    2017-06-01

    Pathogenic Escherichia coli has been listed among the most important bacteria associated with foodborne illnesses around the world. We investigated the genetic relatedness among Shiga toxin-producing E. coli (STEC) isolated along the animal food supply chain and from humans diagnosed with gastroenteritis in Qatar. Samples were collected from different sources along the food supply chain and from patients admitted to the hospital with complaints of gastroenteritis. All samples were screened for the presence of E. coli O157:H7 and non-O157 STEC using a combination of bacterial enrichment and molecular detection techniques. A proportional sampling approach was used to select positive samples from each source for further multilocus sequence typing (MLST) analysis. Seven housekeeping genes described for STEC were amplified by polymerase chain reaction, sequenced, and analyzed by MLST. Isolates were characterized by allele composition, sequence type (ST) and assessed for epidemiologic relationship within and among different sources. Nei's genetic distance was calculated at the allele level between sample pools in each site downstream. E. coli O157:H7 occurred at a higher rate in slaughterhouse and retail samples than at the farm or in humans in our sampling. The ST171, an ST common to enterotoxigenic E. coli and atypical enteropathogenic E. coli, was the most common ST (15%) in the food supply chain. None of the genetic distances among the different sources was statistically significant. Enterohemorrhagic E. coli pathogenic strains are present along the supply chain at different levels and with varying relatedness. Clinical isolates were the most diverse, as expected, considering the polyclonal diversity in the human microbiota. The high occurrence of these food adulterants among the farm products suggests that implementation of sanitary measures at that level might reduce the risk of human exposure.
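
    The allele-level distance calculation mentioned in the abstract above can be illustrated with a minimal sketch. It assumes Nei's standard genetic distance, D = -ln(Jxy / sqrt(Jx * Jy)), since the abstract does not say which variant was used. The function names and the allele numbers below are hypothetical and are not data from the study.

```python
import math
from collections import Counter


def allele_frequencies(alleles):
    """Return {allele: frequency} for the allele calls at one locus."""
    counts = Counter(alleles)
    total = sum(counts.values())
    return {allele: count / total for allele, count in counts.items()}


def nei_standard_distance(pool_x, pool_y):
    """Nei's standard genetic distance between two isolate pools.

    Each pool is a list with one entry per locus; each entry is the list
    of allele numbers observed at that locus in the pool.
    """
    jx = jy = jxy = 0.0
    for alleles_x, alleles_y in zip(pool_x, pool_y):
        fx = allele_frequencies(alleles_x)
        fy = allele_frequencies(alleles_y)
        jx += sum(p * p for p in fx.values())      # homozygosity in X
        jy += sum(p * p for p in fy.values())      # homozygosity in Y
        jxy += sum(p * fy.get(a, 0.0) for a, p in fx.items())  # shared alleles
    if jxy == 0.0:
        return math.inf  # no alleles shared at any locus
    # Averaging over loci cancels in the ratio, so plain sums are enough.
    return -math.log(jxy / math.sqrt(jx * jy))


# Hypothetical single-locus example: pool X carries only allele 1,
# pool Y carries alleles 1 and 2 in equal proportion.
example_distance = nei_standard_distance([[1, 1]], [[1, 2]])
identical_distance = nei_standard_distance([[1, 2], [3, 3]], [[1, 2], [3, 3]])
```

    A distance of zero means the two pools have identical allele frequencies at every locus; larger values mean fewer shared alleles.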

  13. Characterization of Campylobacter jejuni applying flaA short variable region sequencing, multilocus sequencing and Fourier transform infrared spectroscopy

    DEFF Research Database (Denmark)

    Josefsen, Mathilde Hartmann; Bonnichsen, Lise; Larsson, Jonas

    flaA short variable region sequencing and phenetic Fourier transform infrared (FTIR) spectroscopy was applied on a collection of 102 Campylobacter jejuni isolated from continuous sampling of organic, free range geese and chickens. FTIR has been shown to serve as a valuable tool in typing...

  14. Molecular typing of methicillin-resistant Staphylococcus aureus: Comparison of PCR-based open reading frame typing, multilocus sequence typing, and Staphylococcus protein A gene typing.

    Science.gov (United States)

    Ogihara, Shinji; Saito, Ryoichi; Sawabe, Etsuko; Kozakai, Takahiro; Shima, Mari; Aiso, Yoshibumi; Fujie, Toshihide; Nukui, Yoko; Koike, Ryuji; Hagihara, Michio; Tohda, Shuji

    2018-04-01

    The recently developed PCR-based open reading frame typing (POT) method is a useful molecular typing tool. Here, we evaluated the performance of POT for molecular typing of methicillin-resistant Staphylococcus aureus (MRSA) isolates and compared its performance to those of multilocus sequence typing (MLST) and Staphylococcus protein A gene typing (spa typing). Thirty-seven MRSA isolates were collected between July 2012 and May 2015. MLST, spa typing, and POT were performed, and their discriminatory powers were evaluated using Simpson's index analysis. The MRSA isolates were classified into 11, 18, and 33 types by MLST, spa typing, and POT, respectively. The predominant strains identified by MLST, spa typing, and POT were ST8 and ST764, t002, and 93-191-127, respectively. The discriminatory power of MLST, spa typing, and POT was 0.853, 0.875, and 0.992, respectively, indicating that POT had the highest discriminatory power. Moreover, the results of MLST and spa were available after 2 days, whereas that of POT was available in 5 h. Furthermore, POT is rapid and easy to perform and interpret. Therefore, POT is a superior molecular typing tool for monitoring nosocomial transmission of MRSA. Copyright © 2017 Japanese Society of Chemotherapy and The Japanese Association for Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
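
    Discriminatory power figures such as those quoted above are commonly computed with Simpson's index of diversity in the Hunter-Gaston form, D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)), where n_j is the number of isolates of type j and N is the total number of isolates. The sketch below assumes that form. The type labels are hypothetical and do not reproduce the study's isolates.

```python
from collections import Counter


def simpsons_index_of_diversity(type_labels):
    """Probability that two isolates drawn without replacement differ in type."""
    counts = Counter(type_labels)
    n = sum(counts.values())
    if n < 2:
        raise ValueError("at least two isolates are required")
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Hypothetical typing results for the same 10 isolates under two methods.
coarse_method = ["A"] * 5 + ["B"] * 3 + ["C"] * 2          # 3 types
fine_method = ["a", "a", "b", "c", "d", "e", "f", "g", "h", "i"]  # 9 types

coarse_d = simpsons_index_of_diversity(coarse_method)
fine_d = simpsons_index_of_diversity(fine_method)
print(round(coarse_d, 3), round(fine_d, 3))
```

    The method that splits the isolates into more, smaller groups gets the higher index, which is the sense in which one typing scheme is called more discriminatory than another.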

  15. Reclassification of Borrelia spp. isolated in South Korea using Multilocus Sequence Typing.

    Science.gov (United States)

    Park, Kyung-Hee; Choi, Yeon-Joo; Kim, Jeoungyeon; Park, Hye-Jin; Song, Dayoung; Jang, Won-Jong

    2018-05-31

    Using Borrelia isolates from South Korea, we performed MLST and typing of three intergenic genes (16S rRNA, ospA, and 5S-23S IGS) to analyze the relationship between host and vector and the molecular background. Using the MLST analysis, we identified B. afzelii, B. yangtzensis, B. garinii, and B. bavariensis. This study is the first report of the identification of B. yangtzensis using MLST in South Korea.

  16. High genetic diversity among strains of the unindustrialized lactic acid bacterium Carnobacterium maltaromaticum in dairy products as revealed by multilocus sequence typing.

    Science.gov (United States)

    Rahman, Abdur; Cailliez-Grimal, Catherine; Bontemps, Cyril; Payot, Sophie; Chaillou, Stéphane; Revol-Junelles, Anne-Marie; Borges, Frédéric

    2014-07-01

    Dairy products are colonized with three main classes of lactic acid bacteria (LAB): opportunistic bacteria, traditional starters, and industrial starters. Most of the population structure studies were previously performed with LAB species belonging to these three classes and give interesting knowledge about the population structure of LAB at the stage where they are already industrialized. However, these studies give little information about the population structure of LAB prior to their use as an industrial starter. Carnobacterium maltaromaticum is a LAB colonizing diverse environments, including dairy products. Since this bacterium was discovered relatively recently, it is not yet commercialized as an industrial starter, which makes C. maltaromaticum an interesting model for the study of unindustrialized LAB population structure in dairy products. A multilocus sequence typing scheme based on an analysis of fragments of the genes dapE, ddlA, glpQ, ilvE, pyc, pyrE, and leuS was applied to a collection of 47 strains, including 28 strains isolated from dairy products. The scheme allowed detecting 36 sequence types with a discriminatory index of 0.98. The whole population was clustered in four deeply branched lineages, in which the dairy strains were spread. Moreover, the dairy strains could exhibit a high diversity within these lineages, leading to an overall dairy population with a diversity level as high as that of the nondairy population. These results are in agreement with the hypothesis according to which the industrialization of LAB leads to a diversity reduction in dairy products. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  17. Development of a multi-locus sequence typing scheme for Laribacter hongkongensis, a novel bacterium associated with freshwater fish-borne gastroenteritis and traveler's diarrhea

    Directory of Open Access Journals (Sweden)

    Lee Edwin KY

    2009-01-01

    Background: Laribacter hongkongensis is a newly discovered, facultative anaerobic, Gram-negative, motile, sea gull-shaped rod associated with freshwater fish-borne gastroenteritis and traveler's diarrhea. A highly reproducible and discriminative typing system is essential for better understanding of the epidemiology of L. hongkongensis. In this study, a multilocus sequence typing (MLST) system was developed for L. hongkongensis. The system was used to characterize 146 L. hongkongensis isolates, including 39 from humans and 107 from fish. Results: Fragments (362 to 504 bp) of seven housekeeping genes were amplified and sequenced. Among the 3068 bp of the seven loci, 332 polymorphic sites were observed. The median number of alleles at each locus was 34 [range 22 (ilvC) to 45 (thiC)]. All seven genes showed very low dn/ds ratios of ... ISA measurement showed significant linkage disequilibrium in isolates from both humans and fish. The ISA for the isolates from humans and fish were 0.270 and 0.636, indicating the isolates from fish were more clonal than the isolates from humans. Only one interconnected network (acnB) was detected in the split graphs. The P-value (P = 0) of sum of the squares of condensed fragments in Sawyer's test showed evidence of intragenic recombination in the rho, acnB and thiC loci, but the P-value (P = 1) of maximum condensed fragment in these gene loci did not show evidence of intragenic recombination. Congruence analysis showed that all the pairwise comparisons of the 7 MLST loci were incongruent, indicating that recombination played a substantial role in the evolution of L. hongkongensis. A website for L. hongkongensis MLST was set up and can be accessed at http://mlstdb.hku.hk:14206/MLST_index.html. Conclusion: A highly reproducible and discriminative MLST system was developed for L. hongkongensis.

  18. Multilocus Sequence Typing and Staphylococcal Protein A Typing Revealed Novel and Diverse Clones of Methicillin-Resistant Staphylococcus aureus in Seafood and the Aquatic Environment.

    Science.gov (United States)

    Murugadas, V; Toms, C Joseph; Reethu, Sara A; Lalitha, K V

    2017-03-01

    Methicillin-resistant Staphylococcus aureus (MRSA) has been a global health concern since the 1960s, and isolation of this pathogen from food-producing animals has been increasing. However, little information is available on the prevalence of MRSA and its clonal characteristics in seafood and the aquatic environment. In this study, 267 seafood and aquatic environment samples were collected from three districts of Kerala, India. Staphylococcal protein A (spa) typing and multilocus sequence typing (MLST) was performed for 65 MRSA strains isolated from 20 seafood and aquatic environment samples. The MRSA clonal profiles were t657-ST772, t002-ST5, t334-ST5, t311-ST5, t121-ST8, t186-ST88, t127-ST1, and two non-spa assignable strains. Whole spa gene sequence analysis along with MLST confirmed one strain as t711-ST6 and another as a novel MRSA clone identified for the first time in seafood and the aquatic environment with a t15669 spa type and a new MLST profile of ST420-256-236-66-82-411-477. The MRSA strains were clustered into five clonal complexes based on the goeBURST algorithm, indicating high diversity among MRSA strains in seafood and the aquatic environment. The novel clone formed a separate clonal complex with matches to three loci. This study recommends large-scale spa typing and MLST of MRSA isolates from seafood and the aquatic environment to determine the prevalence of new MRSA clones. This monitoring process can be useful for tracing local spread of MRSA isolates into the seafood production chain in a defined geographical area.

  19. Correlation between Ureaplasma subgroup 2 and genitourinary tract disease outcomes revealed by an expanded multilocus sequence typing (eMLST) scheme.

    Science.gov (United States)

    Zhang, Jun; Kong, Yingying; Ruan, Zhi; Huang, Jun; Song, Tiejun; Song, Jingjuan; Jiang, Yan; Yu, Yunsong; Xie, Xinyou

    2014-01-01

    The multilocus sequence typing (MLST) scheme of Ureaplasma based on four housekeeping genes (ftsH, rpL22, valS, and thrS) was described in our previous study; here we introduced an expanded MLST (eMLST) scheme with improved discriminatory power, which was developed by adding two putative virulence genes (ureG and mba-np1) to the original MLST scheme. To evaluate the discriminatory power of eMLST, a total of 14 reference strains of Ureaplasma serovars and 269 clinical strains (134 isolated from symptomatic patients and 135 obtained from asymptomatic persons) were investigated. Our study confirmed that all 14 serotype strains could successfully be differentiated into 14 eMLST STs (eSTs), while some of them could not even be differentiated by the MLST, and a total of 136 eSTs were identified among the clinical isolates we investigated. In addition, phylogenetic analysis indicated that two genetically significantly distant clusters (cluster I and II) were revealed and most clinical isolates were located in cluster I. These findings were in accordance with and further support for the concept of two well-known genetic lineages (Ureaplasma parvum and Ureaplasma urealyticum) in our previous study. Interestingly, although both clusters were associated with clinical manifestation, the sub-group 2 of cluster II had pronounced and adverse effect on patients and might be a potential risk factor for clinical outcomes. In conclusion, the eMLST scheme offers investigators a highly discriminative typing tool that is capable for precise epidemiological investigations and clinical relevance of Ureaplasma.

  20. Correlation between Ureaplasma subgroup 2 and genitourinary tract disease outcomes revealed by an expanded multilocus sequence typing (eMLST) scheme.

    Directory of Open Access Journals (Sweden)

    Jun Zhang

    The multilocus sequence typing (MLST) scheme of Ureaplasma based on four housekeeping genes (ftsH, rpL22, valS, and thrS) was described in our previous study; here we introduced an expanded MLST (eMLST) scheme with improved discriminatory power, which was developed by adding two putative virulence genes (ureG and mba-np1) to the original MLST scheme. To evaluate the discriminatory power of eMLST, a total of 14 reference strains of Ureaplasma serovars and 269 clinical strains (134 isolated from symptomatic patients and 135 obtained from asymptomatic persons) were investigated. Our study confirmed that all 14 serotype strains could successfully be differentiated into 14 eMLST STs (eSTs), while some of them could not even be differentiated by the MLST, and a total of 136 eSTs were identified among the clinical isolates we investigated. In addition, phylogenetic analysis indicated that two genetically significantly distant clusters (cluster I and II) were revealed and most clinical isolates were located in cluster I. These findings were in accordance with and further support for the concept of two well-known genetic lineages (Ureaplasma parvum and Ureaplasma urealyticum) in our previous study. Interestingly, although both clusters were associated with clinical manifestation, the sub-group 2 of cluster II had pronounced and adverse effect on patients and might be a potential risk factor for clinical outcomes. In conclusion, the eMLST scheme offers investigators a highly discriminative typing tool that is capable for precise epidemiological investigations and clinical relevance of Ureaplasma.

  1. Multi-locus variable number tandem repeat analysis of 7th pandemic Vibrio cholerae

    Directory of Open Access Journals (Sweden)

    Lam Connie

    2012-05-01

    Background: Seven pandemics of cholera have been recorded since 1817, with the current and ongoing pandemic affecting almost every continent. Cholera remains endemic in developing countries and is still a significant public health issue. In this study we use multilocus variable number of tandem repeats (VNTRs) analysis (MLVA) to discriminate between isolates of the 7th pandemic clone of Vibrio cholerae. Results: MLVA of six VNTRs selected from previously published data distinguished 66 V. cholerae isolates collected between 1961 and 1999 into 60 unique MLVA profiles. Only 4 MLVA profiles consisted of more than 2 isolates. The discriminatory power was 0.995. Phylogenetic analysis showed that, except for the closely related profiles, the relationships derived from MLVA profiles were in conflict with that inferred from Single Nucleotide Polymorphism (SNP) typing. The six SNP groups share consensus VNTR patterns and two SNP groups contained isolates which differed by only one VNTR locus. Conclusions: MLVA is highly discriminatory in differentiating 7th pandemic V. cholerae isolates and MLVA data was most useful in resolving the genetic relationships among isolates within groups previously defined by SNPs. Thus MLVA is best used in conjunction with SNP typing in order to best determine the evolutionary relationships among the 7th pandemic V. cholerae isolates and for longer term epidemiological typing.

  2. Distribution and factors associated with Salmonella enterica genotypes in a diverse population of humans and animals in Qatar using multi-locus sequence typing (MLST).

    Science.gov (United States)

    Chang, Yu C; Scaria, Joy; Ibraham, Mariamma; Doiphode, Sanjay; Chang, Yung-Fu; Sultan, Ali; Mohammed, Hussni O

    2016-01-01

    Salmonella enterica is one of the most commonly reported causes of bacterial foodborne illness around the world. Understanding the sources of this pathogen and the associated factors that exacerbate its risk to humans will help in developing risk mitigation strategies. The genetic relatedness among Salmonella isolates recovered from human gastroenteritis cases and food animals in Qatar was investigated in the hope of shedding light on these sources, their possible transmission routes, and any associated factors. A repeat cross-sectional study was conducted in which the samples and associated data were collected from both populations (gastroenteritis cases and animals). Salmonella isolates were initially analyzed using multi-locus sequence typing (MLST) to investigate the genetic diversity and clonality. The relatedness among the isolates was assessed using the minimum spanning tree (MST). Twenty-seven different sequence types (STs) were identified in this study; among them, seven were novel, including ST1695, ST1696, ST1697, ST1698, ST1699, ST1702, and ST1703. The pattern of overall ST distribution was diverse; in particular, it was revealed that ST11 and ST19 were the most common sequence types, representing 29.5% and 11.5% of the whole population, respectively. In addition, 20 eBurst Groups (eBGs) were identified in our data, which indicates that ST11 and ST19 belonged to eBG4 and eBG1, respectively. In addition, the potential association between the putative risk factors and eBGs was evaluated. There was no significant clustering of these eBGs by season; however, a significant association was identified in terms of nationality in that Qataris were six times more likely to present with eBG1 compared to non-Qataris. In the MST analysis, four major clusters were presented, namely, ST11, ST19, ST16, and ST31. The linkages between the clusters alluded to a possible transmission route. The results of the study have provided insight into the ST distributions of S. enterica and

  3. Does typing of Chlamydia trachomatis using housekeeping multilocus sequence typing reveal different sexual networks among heterosexuals and men who have sex with men?

    Science.gov (United States)

    Versteeg, Bart; Bruisten, Sylvia M; van der Ende, Arie; Pannekoek, Yvonne

    2016-04-18

    Chlamydia trachomatis infections remain the most common bacterial sexually transmitted infection worldwide. To gain more insight into the epidemiology and transmission of C. trachomatis, several schemes of multilocus sequence typing (MLST) have been developed. We investigated the clustering of C. trachomatis strains derived from men who have sex with men (MSM) and heterosexuals using the MLST scheme based on 7 housekeeping genes (MLST-7) adapted for clinical specimens and a high-resolution MLST scheme based on 6 polymorphic genes, including ompA (hr-MLST-6). Specimens from 100 C. trachomatis-infected MSM and 100 heterosexual women were randomly selected from previous studies and sequenced. We adapted the MLST-7 scheme to a nested assay to be suitable for direct typing of clinical specimens. All selected specimens were typed using both the adapted MLST-7 scheme and the hr-MLST-6 scheme. Clustering of C. trachomatis strains derived from MSM and heterosexuals was assessed using minimum spanning tree analysis. Sufficient chlamydial DNA was present in 188 of the 200 (94%) selected samples. Using the adapted MLST-7 scheme, full MLST profiles were obtained for 187 of 188 tested specimens, resulting in a high success rate of 99.5%. Of these 187 specimens, 91 (48.7%) were from MSM and 96 (51.3%) from heterosexuals. We detected 21 sequence types (STs) using the adapted MLST-7 and 79 STs using the hr-MLST-6 scheme. Minimum spanning tree analysis was used to examine the clustering of MLST-7 data, which showed no reflection of separate transmission in MSM and heterosexual hosts. Moreover, typing using the hr-MLST-6 scheme identified genetically related clusters within each of the clusters that were identified by using the MLST-7 scheme. No distinct transmission of C. trachomatis could be observed in MSM and heterosexuals using the adapted MLST-7 scheme in contrast to using the hr-MLST-6. In addition, we compared clustering of both MLST schemes and

  4. Identification of Coxiella burnetii genotypes in Croatia using multi-locus VNTR analysis.

    Science.gov (United States)

    Račić, Ivana; Spičić, Silvio; Galov, Ana; Duvnjak, Sanja; Zdelar-Tuk, Maja; Vujnović, Anja; Habrun, Boris; Cvetnić, Zeljko

    2014-10-10

    Although Q fever affects humans and animals in Croatia, we are unaware of genotyping studies of Croatian strains of the causative pathogen Coxiella burnetii, which would greatly assist monitoring and control efforts. Here 3261 human and animal samples were screened for C. burnetii DNA by conventional PCR, and 335 (10.3%) were positive. Of these positive samples, 82 were genotyped at 17 loci using the relatively new method of multi-locus variable number tandem repeat analysis (MLVA). We identified 13 C. burnetii genotypes not previously reported anywhere in the world. Two of these 13 genotypes are typical of the continental part of Croatia and share more similarity with genotypes outside Croatia than with genotypes within the country. The remaining 11 novel genotypes are typical of the coastal part of Croatia and show more similarity to one another than to genotypes outside the country. Our findings shed new light on the phylogeny of C. burnetii strains and may help establish MLVA as a standard technique for Coxiella genotyping. Copyright © 2014 Elsevier B.V. All rights reserved.

  5. Multilocus sequence typing of Pseudomonas syringae sensu lato confirms previously described genomospecies and permits rapid identification of P. syringae pv. coriandricola and P. syringae pv. apii causing bacterial leaf spot on parsley.

    Science.gov (United States)

    Bull, Carolee T; Clarke, Christopher R; Cai, Rongman; Vinatzer, Boris A; Jardini, Teresa M; Koike, Steven T

    2011-07-01

    Since 2002, severe leaf spotting on parsley (Petroselinum crispum) has occurred in Monterey County, CA. Either of two different pathovars of Pseudomonas syringae sensu lato were isolated from diseased leaves from eight distinct outbreaks and once from the same outbreak. Fragment analysis of DNA amplified between repetitive sequence polymerase chain reaction; 16S rDNA sequence analysis; and biochemical, physiological, and host range tests identified the pathogens as Pseudomonas syringae pv. apii and P. syringae pv. coriandricola. Koch's postulates were completed for the isolates from parsley, and host range tests with parsley isolates and pathotype strains demonstrated that P. syringae pv. apii and P. syringae pv. coriandricola cause leaf spot diseases on parsley, celery, and coriander or cilantro. In a multilocus sequence typing (MLST) approach, four housekeeping gene fragments were sequenced from 10 strains isolated from parsley and 56 pathotype strains of P. syringae. Allele sequences were uploaded to the Plant-Associated Microbes Database and a phylogenetic tree was built based on concatenated sequences. Tree topology directly corresponded to P. syringae genomospecies and P. syringae pv. apii was allocated appropriately to genomospecies 3. This is the first demonstration that MLST can accurately allocate new pathogens directly to P. syringae sensu lato genomospecies. According to MLST, P. syringae pv. coriandricola is a member of genomospecies 9, P. cannabina. In a blind test, both P. syringae pv. coriandricola and P. syringae pv. apii isolates from parsley were correctly identified to pathovar. In both cases, MLST described diversity within each pathovar that was previously unknown.
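    The concatenation step mentioned in the abstract above can be illustrated with a minimal sketch. It assumes that the gene fragments are already aligned to equal length per locus; the locus names (`locus1`, `locus2`), strain labels, and sequences are hypothetical placeholders, since the abstract does not name the four genes. The uncorrected p-distance is used here only as a simple stand-in for whatever distance or tree-building method the authors actually applied.

```python
from typing import Dict, List


def concatenate(strain_loci: Dict[str, Dict[str, str]],
                locus_order: List[str]) -> Dict[str, str]:
    """Join each strain's per-locus sequences in a fixed locus order."""
    return {
        strain: "".join(loci[locus] for locus in locus_order)
        for strain, loci in strain_loci.items()
    }


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


# Hypothetical toy data: two strains, two loci (not from the study).
strains = {
    "A": {"locus1": "ACGT", "locus2": "GGCA"},
    "B": {"locus1": "ACGA", "locus2": "GGCA"},
}
concat = concatenate(strains, ["locus1", "locus2"])
distance_ab = p_distance(concat["A"], concat["B"])
```

    In this toy case the two concatenated sequences differ at one of eight sites. A distance matrix built this way could then be passed to a tree-building method of choice.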

  6. Molecular characterization of Leptospira sp by multilocus variable number tandem repeat analysis (MLVA) from clinical samples: a case report

    Directory of Open Access Journals (Sweden)

    Hélène Pailhoriès

    2015-08-01

    Leptospirosis is a zoonotic infection for which diagnosis is difficult. It has appeared as a global emerging infectious disease over recent years. Genotype determination often requires a Leptospira strain obtained by culture, which is a long and fastidious technique. A method based on multilocus variable number tandem repeat analysis (MLVA) to determine the genotype of Leptospira interrogans, performed directly on blood or urine samples, is proposed. This method was applied to a fatal case of leptospirosis for which the geographical origin of infection was unknown. This technique will allow a genotype to be obtained for L. interrogans, even when cultures remain negative.

  7. Biological sequence analysis

    DEFF Research Database (Denmark)

    Durbin, Richard; Eddy, Sean; Krogh, Anders Stærmose

    This book provides an up-to-date and tutorial-level overview of sequence analysis methods, with particular emphasis on probabilistic modelling. Discussed methods include pairwise alignment, hidden Markov models, multiple alignment, profile searches, RNA secondary structure analysis, and phylogene...

  8. Class 1 integrons characterization and multilocus sequence typing of Salmonella spp. from swine production chains in Chiang Mai and Lamphun provinces, Thailand.

    Science.gov (United States)

    Boonkhot, Phacharaporn; Tadee, Pakpoom; Yamsakul, Panuwat; Pocharoen, Chairoj; Chokesajjawatee, Nipa; Patchanee, Prapas

    2015-05-01

    Pigs and pork products are well known as an important source of Salmonella, one of the major zoonotic foodborne pathogens. The emergence and spread of antimicrobial resistance is becoming a major public health concern worldwide. Integrons are genetic elements known to have a role in the acquisition and expression of genes conferring antibiotic resistance. This study focuses on the prevalence of class 1 integron-carrying Salmonella, the genetic diversity of strains obtained from swine production chains in Chiang Mai and Lamphun provinces, Thailand, as determined by multilocus sequence typing (MLST), and a comparison of the genetic diversity of Salmonella sequence types from this study with pulsotypes identified in a previous study. In 175 Salmonella strains, the overall prevalence of class 1 integron-carrying Salmonella was 14%. The gene cassette array pattern "dfrA12-orfF-aadA2" was the most frequently observed. Most of the antimicrobial resistance identified was not associated with related gene cassettes harbored by Salmonella. Six sequence types were identified by MLST among 30 randomly selected strains. Salmonella at the human-animal-environment interface was confirmed. Linkages both in the farm to slaughterhouse contamination route and the horizontal transmission of resistance genes were demonstrated. To reduce this problem, the use of antimicrobials in livestock should be controlled by veterinarians. Education and training of food handlers as well as promotion of safe methods of food consumption are important avenues for helping prevent foodborne illness.

  9. Zinc Resistance within Swine-Associated Methicillin-Resistant Staphylococcus aureus Isolates in the United States Is Associated with Multilocus Sequence Type Lineage.

    Science.gov (United States)

    Hau, Samantha J; Frana, Timothy; Sun, Jisun; Davies, Peter R; Nicholson, Tracy L

    2017-08-01

    Zinc resistance in livestock-associated methicillin-resistant Staphylococcus aureus (LA-MRSA) sequence type 398 (ST398) is primarily mediated by the czrC gene colocated with the mecA gene, encoding methicillin resistance, within the type V staphylococcal cassette chromosome mec (SCCmec) element. Because czrC and mecA are located within the same mobile genetic element, it has been suggested that the use of zinc in feed as an antidiarrheal agent has the potential to contribute to the emergence and spread of methicillin-resistant S. aureus (MRSA) in swine, through increased selection pressure to maintain the SCCmec element in isolates obtained from pigs. In this study, we report the prevalence of the czrC gene and phenotypic zinc resistance in U.S. swine-associated LA-MRSA ST5 isolates, MRSA ST5 isolates from humans with no swine contact, and U.S. swine-associated LA-MRSA ST398 isolates. We demonstrated that the prevalence of zinc resistance in U.S. swine-associated LA-MRSA ST5 isolates was significantly lower than the prevalence of zinc resistance in MRSA ST5 isolates from humans with no swine contact and swine-associated LA-MRSA ST398 isolates, as well as prevalences from previous reports describing zinc resistance in other LA-MRSA ST398 isolates. Collectively, our data suggest that selection pressure associated with zinc supplementation in feed is unlikely to have played a significant role in the emergence of LA-MRSA ST5 in the U.S. swine population. Additionally, our data indicate that zinc resistance is associated with the multilocus sequence type lineage, suggesting a potential link between the genetic lineage and the carriage of resistance determinants. IMPORTANCE Our data suggest that coselection thought to be associated with the use of zinc in feed as an antimicrobial agent is not playing a role in the emergence of livestock-associated methicillin-resistant Staphylococcus aureus (LA-MRSA) ST5 in the U.S. swine population. Additionally, our data indicate

  10. Evolutionary history of the genus Tarentola (Gekkota: Phyllodactylidae) from the Mediterranean Basin, estimated using multilocus sequence data

    Directory of Open Access Journals (Sweden)

    Rato Catarina

    2012-01-01

    Background: The pronounced morphological conservatism within Tarentola geckos, contrasted with a high genetic variation in North Africa, has led to the hypothesis that this group could represent a cryptic species complex, a challenging system to study especially when trying to define distinct evolutionary entities and address biogeographic hypotheses. In the present work we have re-examined the phylogenetic and phylogeographic relationships between and within all Mediterranean species of Tarentola, placing the genealogies obtained into a temporal framework. In order to do this, we have investigated the sequence variation of two mitochondrial (12S rRNA and 16S rRNA) and four nuclear markers (ACM4, PDC, MC1R, and RAG2) for 384 individuals of all known Mediterranean Tarentola species, so that their evolutionary history could be assessed. Results: Of all three generated genealogies (combined mtDNA, combined nDNA, and mtDNA+nDNA), we prefer the phylogenetic relationships obtained when all genetic markers are combined. A total of 133 individuals, and 2,901 bp of sequence length, were used in this analysis. The phylogeny obtained for Tarentola presents deep branches, with T. annularis, T. ephippiata and T. chazaliae occupying a basal position and splitting from the remaining species around 15.38 Mya. Tarentola boehmei is sister to all other Mediterranean species, from which it split around 11.38 Mya. There are also two other major groups: (1) the T. mauritanica complex present in North Africa and Europe; and (2) the clade formed by the T. fascicularis/deserti complex, T. neglecta and T. mindiae, occurring only in North Africa. The cladogenesis between these two groups occurred around 8.69 Mya, coincident with the late Miocene. Contrary to what was initially proposed, T. neglecta and T. mindiae are sister taxa to both T. fascicularis and T. deserti. Conclusions: At least in the Iberian Peninsula and Northwest Africa, the lineages obtained have some

  11. Combination of multilocus sequence typing and pulsed-field gel electrophoresis reveals an association of molecular clonality with the emergence of extensive-drug resistance (XDR) in Salmonella.

    Science.gov (United States)

    Cao, Yongzhong; Shen, Yongxiu; Cheng, Lingling; Zhang, Xiaorong; Wang, Chao; Wang, Yan; Zhou, Xiaohui; Chao, Guoxiang; Wu, Yantao

    2018-03-01

    Salmonella is one of the most important foodborne pathogens and has become resistant to multiple antibiotics, which represents a significant challenge to the food industry and public health. However, a molecular signature that can be used to distinguish antimicrobial resistance profiles, particularly multi-drug resistance or extensive-drug resistance (XDR), is lacking. In the current study, 168 isolates from the chicken and pork production chains and ill chickens were characterized by serotyping, antimicrobial susceptibility testing, multilocus sequence typing (MLST) and pulsed-field gel electrophoresis (PFGE). The results showed that these isolates belonged to 13 serotypes, 14 multilocus sequence types (STs), 94 PFGE genotypes, and 70 antimicrobial resistance profiles. S. Enteritidis, S. Indiana, and S. Derby were the predominant serotypes, corresponding to the ST11, ST17, and ST40 clones, respectively, and to PFGE Cluster A, Cluster E, and Cluster D, respectively. Among the ST11-S. Enteritidis (Cluster A) and the ST40-S. Derby (Cluster D) clones, the majority of isolates were resistant to 4-8 antimicrobial agents, whereas in the ST17-S. Indiana (Cluster E) clone, isolates showed extensive-drug resistance (XDR) to 9-16 antimicrobial agents. The blaTEM-1-like gene was prevalent in the ST11 and ST17 clones, corresponding to high ampicillin resistance. The blaTEM-1-like, blaCTX-M, blaOXA-1-like, sul1, aaC4, aac(6')-1b, dfrA17, and floR gene complex was highly prevalent among isolates of ST17, corresponding to an XDR phenotype. These results demonstrated the association of the resistant phenotypes and genotypes with ST clone and PFGE cluster. Our results also indicated that the newly identified gene complex comprising blaTEM-1-like, blaCTX-M, blaOXA-1-like, sul1, aaC4, aac(6')-1b, dfrA17, and floR was responsible for the emergence of the ST17-S. Indiana XDR clone. ST17 could potentially be used as a molecular signature to distinguish the S. Indiana XDR clone. Copyright © 2017

  12. A novel high-resolution multilocus sequence typing of Giardia intestinalis Assemblage A isolates reveals zoonotic transmission, clonal outbreaks and recombination.

    Science.gov (United States)

    Ankarklev, Johan; Lebbad, Marianne; Einarsson, Elin; Franzén, Oscar; Ahola, Harri; Troell, Karin; Svärd, Staffan G

    2018-06-01

    Molecular epidemiology and genotyping studies of the parasitic protozoan Giardia intestinalis have proven difficult due to multiple factors, such as low discriminatory power in the commonly used genotyping loci, which has hampered molecular analyses of outbreak sources, zoonotic transmission and virulence types. Here we have focused on assemblage A Giardia and developed a high-resolution assemblage-specific multilocus sequence typing (MLST) method. Analyses of sequenced G. intestinalis assemblage A genomes from different sub-assemblages identified a set of six genetic loci with high genetic variability. DNA samples from both humans (n = 44) and animals (n = 18) that harbored Giardia assemblage A infections, were PCR amplified (557-700 bp products) and sequenced at the six novel genetic loci. Bioinformatic analyses showed five to ten-fold higher levels of polymorphic sites than what was previously found among assemblage A samples using the classic genotyping loci. Phylogenetically, a division of two major clusters in assemblage A became apparent, separating samples of human and animal origin. A subset of human samples (n = 9) from a documented Giardia outbreak in a Swedish day-care center, showed full complementarity at nine genetic loci (the six new and the standard BG, TPI and GDH loci), strongly suggesting one source of infection. Furthermore, three samples of human origin displayed MLST profiles that were phylogenetically more closely related to MLST profiles from animal derived samples, suggesting zoonotic transmission. These new genotyping loci enabled us to detect events of recombination between different assemblage A isolates but also between assemblage A and E isolates. In summary, we present a novel and expanded MLST strategy with significantly improved sensitivity for molecular analyses of virulence types, zoonotic potential and source tracking for assemblage A Giardia. Copyright © 2018. Published by Elsevier B.V.

  13. Multilocus sequence typing (MLST) methods for the emerging Campylobacter species C. hyointestinalis, C. lanienae, C. sputorum, C. concisus and C. curvus

    Directory of Open Access Journals (Sweden)

    William G Miller

    2012-04-01

    Multilocus sequence typing (MLST) systems have been reported previously for multiple food- and food animal-associated Campylobacter species (e.g. C. jejuni, C. coli, C. lari and C. fetus) to both differentiate strains and identify clonal lineages. These MLST methods focused primarily on campylobacters of human clinical (e.g. C. jejuni) or veterinary (e.g. C. fetus) relevance. However, other, emerging, Campylobacter species have been isolated increasingly from environmental, food animal or human clinical samples. We describe herein MLST methods for five emerging Campylobacter species: C. hyointestinalis, C. lanienae, C. sputorum, C. concisus and C. curvus. The concisus/curvus method uses the loci aspA, atpA, glnA, gltA, glyA, ilvD and pgm, whereas the other methods use the seven loci defined for C. jejuni (i.e., aspA, atpA, glnA, gltA, glyA, pgm, and tkt). Multiple food animal and human clinical C. hyointestinalis (n=48), C. lanienae (n=34) and C. sputorum (n=24) isolates were typed, along with 86 human clinical C. concisus and C. curvus isolates. A large number of sequence types (STs) were identified using all four MLST methods. Similar to Campylobacter MLST methods described previously, these novel MLST methods identified mixed isolates containing two or more strains of the same species. Additionally, these methods speciated unequivocally isolates that had been typed ambiguously using other molecular-based speciation methods, such as 16S rDNA sequencing. Finally, the design of degenerate primer pairs for some methods permitted the typing of related species; for example, the C. hyointestinalis primer pairs could be used to type C. fetus strains. Therefore, these novel Campylobacter MLST methods will prove useful in speciating and differentiating strains of multiple, emerging Campylobacter species.
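    As a rough illustration of how a seven-locus MLST scheme turns allele numbers into a sequence type (ST), the sketch below looks up an allelic profile in a table of known profiles and registers a new ST when the profile is unseen. The locus order is the seven-locus set named in the abstract above; the allele numbers, ST numbers, and the `assign_st` helper are hypothetical and do not reproduce any curated database or the authors' software.

```python
from typing import Dict, Tuple

# Seven loci listed in the abstract for the C. jejuni-derived schemes.
LOCI = ("aspA", "atpA", "glnA", "gltA", "glyA", "pgm", "tkt")

Profile = Tuple[int, ...]


def assign_st(alleles: Dict[str, int],
              known: Dict[Profile, int]) -> Tuple[int, bool]:
    """Return (ST, is_novel) for an isolate's allele numbers.

    A profile already in `known` gets its existing ST; an unseen profile
    is registered under the next free ST number (hypothetical policy).
    """
    profile = tuple(alleles[locus] for locus in LOCI)
    if profile in known:
        return known[profile], False
    new_st = max(known.values(), default=0) + 1
    known[profile] = new_st
    return new_st, True


# Hypothetical profile table and isolates (illustrative numbers only).
known_profiles = {
    (1, 1, 1, 1, 1, 1, 1): 1,
    (1, 2, 1, 1, 3, 1, 1): 2,
}
seen_isolate = dict(zip(LOCI, (1, 2, 1, 1, 3, 1, 1)))
new_isolate = dict(zip(LOCI, (4, 2, 1, 1, 3, 1, 1)))

seen_result = assign_st(seen_isolate, known_profiles)
new_result = assign_st(new_isolate, known_profiles)
```

    Here the first isolate matches an existing profile, while the second differs at aspA and therefore receives a new ST. In practice, novel alleles and STs are assigned by a curated database rather than locally.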

  14. Multilocus sequence typing, biochemical and antibiotic resistance characterizations reveal diversity of North American strains of the honey bee pathogen Paenibacillus larvae.

    Science.gov (United States)

    Krongdang, Sasiprapa; Evans, Jay D; Pettis, Jeffery S; Chantawannakul, Panuwan

    2017-01-01

    Paenibacillus larvae is a Gram positive bacterium and the causative agent of the most widespread fatal brood disease of honey bees, American foulbrood (AFB). A total of thirty-three independent Paenibacillus larvae isolates from various geographical origins in North America and five reference strains were investigated for genetic diversity using multilocus sequence typing (MLST). This technique is regarded to be a powerful tool for epidemiological studies of pathogenic bacteria and is widely used in genotyping assays. For MLST, seven housekeeping gene loci, ilvD (dihydroxy-acid dehydratase), tri (triosephosphate isomerase), purH (phosphoribosyl-aminoimidazolecarboxamide), recF (DNA replication and repair protein), pyrE (orotate phosphoribosyltransferase), sucC (succinyl coenzyme A synthetase β subunit) and glpF (glycerol uptake facilitator protein) were studied and applied for primer designs. Previously, ERIC type DNA fingerprinting was applied to these same isolates and the data showed that almost all represented the ERIC I type, whereas using BOX-PCR gave an indication of more diversity. All isolates were screened for resistance to four antibiotics used by U.S. beekeepers, showing extensive resistance to tetracycline and the first records of resistance to tylosin and lincomycin. Our data highlight the intraspecies relationships of P. larvae and the potential application of MLST methods in enhancing our understanding of epidemiological relationships among bacterial isolates of different origins.

  15. Multilocus sequence typing, biochemical and antibiotic resistance characterizations reveal diversity of North American strains of the honey bee pathogen Paenibacillus larvae.

    Directory of Open Access Journals (Sweden)

    Sasiprapa Krongdang

    Paenibacillus larvae is a Gram positive bacterium and the causative agent of the most widespread fatal brood disease of honey bees, American foulbrood (AFB). A total of thirty-three independent Paenibacillus larvae isolates from various geographical origins in North America and five reference strains were investigated for genetic diversity using multilocus sequence typing (MLST). This technique is regarded to be a powerful tool for epidemiological studies of pathogenic bacteria and is widely used in genotyping assays. For MLST, seven housekeeping gene loci, ilvD (dihydroxy-acid dehydratase), tri (triosephosphate isomerase), purH (phosphoribosyl-aminoimidazolecarboxamide), recF (DNA replication and repair protein), pyrE (orotate phosphoribosyltransferase), sucC (succinyl coenzyme A synthetase β subunit) and glpF (glycerol uptake facilitator protein) were studied and applied for primer designs. Previously, ERIC type DNA fingerprinting was applied to these same isolates and the data showed that almost all represented the ERIC I type, whereas using BOX-PCR gave an indication of more diversity. All isolates were screened for resistance to four antibiotics used by U.S. beekeepers, showing extensive resistance to tetracycline and the first records of resistance to tylosin and lincomycin. Our data highlight the intraspecies relationships of P. larvae and the potential application of MLST methods in enhancing our understanding of epidemiological relationships among bacterial isolates of different origins.

  16. Image sequence analysis

    CERN Document Server

    1981-01-01

    The processing of image sequences has a broad spectrum of important applications including target tracking, robot navigation, bandwidth compression of TV conferencing video signals, studying the motion of biological cells using microcinematography, cloud tracking, and highway traffic monitoring. Image sequence processing involves a large amount of data. However, because of the progress in computer, LSI, and VLSI technologies, we have now reached a stage when many useful processing tasks can be done in a reasonable amount of time. As a result, research and development activities in image sequence analysis have recently been growing at a rapid pace. An IEEE Computer Society Workshop on Computer Analysis of Time-Varying Imagery was held in Philadelphia, April 5-6, 1979. A related special issue of the IEEE Transactions on Pattern Analysis and Machine Intelligence was published in November 1980. The IEEE Computer magazine has also published a special issue on the subject in 1981. The purpose of this book ...

  17. Population biology of Streptococcus pneumoniae in West Africa: multilocus sequence typing of serotypes that exhibit different predisposition to invasive disease and carriage.

    Directory of Open Access Journals (Sweden)

    Eric S Donkor

    Little is known about the population biology of Streptococcus pneumoniae in developing countries, although the majority of pneumococcal infections occur in this setting. The aim of the study was to apply MLST to investigate the population biology of S. pneumoniae in West Africa. Seventy-three invasive and carriage S. pneumoniae isolates from three West African countries including The Gambia, Nigeria and Ghana were investigated. The isolates covered seven serotypes (1, 3, 5, 6A, 11, 14, 23F) and were subjected to multilocus sequence typing and antibiotic susceptibility testing. Overall, 50 different sequence types (STs) were identified, of which 38% (29) were novel. The most common ST was a novel clone, ST 4012 (6.5%), and some clones including STs 913, 925, 1737, 2160 and 3310 appeared to be specific to the study region. Two STs, ST 63 and ST 4012, were associated with multiple serotypes, indicating a history of serotype switching. ST 63 was associated with serotypes 3 and 23F, while ST 4012 was associated with serotypes 6A and 23. eBURST analyses using the stringent 6/7 identical loci definition grouped the 50 STs into 5 clonal complexes and 65 singletons, expressing a high level of genetic diversity among the isolates. Compared to the other serotypes, serotypes 1 and 5 isolates appeared to be more clonal. Internationally recognized antibiotic resistant clones of S. pneumoniae were generally absent in the population investigated and the only multidrug resistant isolate identified (1/66) belonged to the Pneumococcal Epidemiology Network clone ST 63. The pneumococcal population in West Africa is quite divergent, and serotypes that are common in invasive disease (such as serotypes 1 and 5) are more likely to be clonal than serotypes that are common in carriage.
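    The "6/7 identical loci" grouping rule mentioned in the abstract above can be sketched as follows. This is a simplified illustration, not the eBURST program itself: it only links STs whose seven-locus profiles match at six or more loci and reports the resulting connected groups, without predicting founders. The ST numbers and allele profiles are hypothetical.

```python
from itertools import combinations
from typing import Dict, List, Tuple

Profile = Tuple[int, ...]


def shared_loci(a: Profile, b: Profile) -> int:
    """Count loci at which two allelic profiles carry the same allele."""
    return sum(x == y for x, y in zip(a, b))


def clonal_groups(profiles: Dict[int, Profile],
                  min_shared: int = 6) -> Tuple[List[List[int]], List[int]]:
    """Group STs linked by >= min_shared identical loci (single linkage).

    Returns (complexes, singletons): groups with two or more STs, and
    STs that link to no other ST.
    """
    parent = {st: st for st in profiles}

    def find(st: int) -> int:
        # Union-find root lookup with path halving.
        while parent[st] != st:
            parent[st] = parent[parent[st]]
            st = parent[st]
        return st

    for a, b in combinations(profiles, 2):
        if shared_loci(profiles[a], profiles[b]) >= min_shared:
            parent[find(a)] = find(b)

    groups: Dict[int, List[int]] = {}
    for st in profiles:
        groups.setdefault(find(st), []).append(st)

    complexes = sorted(sorted(g) for g in groups.values() if len(g) > 1)
    singletons = sorted(g[0] for g in groups.values() if len(g) == 1)
    return complexes, singletons


# Hypothetical seven-locus profiles keyed by ST number.
example = {
    1: (1, 1, 1, 1, 1, 1, 1),
    2: (1, 1, 1, 1, 1, 1, 2),   # differs from ST 1 at one locus
    3: (1, 1, 1, 1, 1, 3, 2),   # differs from ST 2 at one locus
    4: (5, 6, 7, 8, 9, 10, 11),  # unrelated
}
complexes, singletons = clonal_groups(example)
```

    In this toy data, STs 1, 2 and 3 form one group through chained single-locus variants, and ST 4 remains a singleton.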

  18. Development of a Multilocus Sequence Typing (MLST) scheme for Treponema pallidum subsp. pertenue: Application to yaws in Lihir Island, Papua New Guinea.

    Science.gov (United States)

    Godornes, Charmie; Giacani, Lorenzo; Barry, Alyssa E; Mitja, Oriol; Lukehart, Sheila A

    2017-12-01

    Yaws is a neglected tropical disease, caused by Treponema pallidum subsp. pertenue. The disease causes chronic lesions, primarily in young children living in remote villages in tropical climates. As part of a global yaws eradication campaign initiated by the World Health Organization, we sought to develop and evaluate a molecular typing method to distinguish different strains of T. pallidum subsp. pertenue for disease control and epidemiological purposes. Published genome sequences of strains of T. pallidum subsp. pertenue and pallidum were compared to identify polymorphic genetic loci among the strains. DNA from a number of existing historical Treponema isolates, as well as a subset of samples from yaws patients collected in Lihir Island, Papua New Guinea, were analyzed using these targets. From these data, three genes (tp0548, tp0136 and tp0326) were ultimately selected to give a high discriminating capability among the T. pallidum subsp. pertenue samples tested. Intragenic regions of these three target genes were then selected to enhance the discriminating capability of the typing scheme using short readily amplifiable loci. This 3-gene multilocus sequence typing (MLST) method was applied to existing historical human yaws strains, the Fribourg-Blanc simian isolate, and DNA from 194 lesion swabs from yaws patients on Lihir Island, Papua New Guinea. Among all samples tested, fourteen molecular types were identified, seven of which were found in patient samples and seven among historical isolates or DNA. Three types (JG8, TD6, and SE7) were predominant on Lihir Island. This MLST approach allows molecular typing and differentiation of yaws strains. This method could be a useful tool to complement epidemiological studies in regions where T. pallidum subsp. pertenue is prevalent with the overall goals of improving our understanding of yaws transmission dynamics and helping the yaws eradication campaign to succeed.

  19. Development of a Multilocus Sequence Typing (MLST) scheme for Treponema pallidum subsp. pertenue: Application to yaws in Lihir Island, Papua New Guinea

    Science.gov (United States)

    Godornes, Charmie; Giacani, Lorenzo; Barry, Alyssa E.; Mitja, Oriol

    2017-01-01

    Background: Yaws is a neglected tropical disease, caused by Treponema pallidum subsp. pertenue. The disease causes chronic lesions, primarily in young children living in remote villages in tropical climates. As part of a global yaws eradication campaign initiated by the World Health Organization, we sought to develop and evaluate a molecular typing method to distinguish different strains of T. pallidum subsp. pertenue for disease control and epidemiological purposes. Methods and principal findings: Published genome sequences of strains of T. pallidum subsp. pertenue and pallidum were compared to identify polymorphic genetic loci among the strains. DNA from a number of existing historical Treponema isolates, as well as a subset of samples from yaws patients collected in Lihir Island, Papua New Guinea, were analyzed using these targets. From these data, three genes (tp0548, tp0136 and tp0326) were ultimately selected to give a high discriminating capability among the T. pallidum subsp. pertenue samples tested. Intragenic regions of these three target genes were then selected to enhance the discriminating capability of the typing scheme using short readily amplifiable loci. This 3-gene multilocus sequence typing (MLST) method was applied to existing historical human yaws strains, the Fribourg-Blanc simian isolate, and DNA from 194 lesion swabs from yaws patients on Lihir Island, Papua New Guinea. Among all samples tested, fourteen molecular types were identified, seven of which were found in patient samples and seven among historical isolates or DNA. Three types (JG8, TD6, and SE7) were predominant on Lihir Island. Conclusions: This MLST approach allows molecular typing and differentiation of yaws strains. This method could be a useful tool to complement epidemiological studies in regions where T. pallidum subsp. pertenue is prevalent with the overall goals of improving our understanding of yaws transmission dynamics and helping the yaws eradication campaign to succeed.

  20. Multilocus analysis of introgression between two sympatric sister species of Drosophila: Drosophila yakuba and D. santomea.

    Science.gov (United States)

    Llopart, Ana; Lachaise, Daniel; Coyne, Jerry A

    2005-09-01

    Drosophila yakuba is widely distributed in sub-Saharan Africa, while D. santomea is endemic to the volcanic island of São Tomé in the Atlantic Ocean, 280 km west of Gabon. On São Tomé, D. yakuba is found mainly in open lowland forests, and D. santomea is restricted to the wet misty forests at higher elevations. At intermediate elevations, the species form a hybrid zone where hybrids occur at a frequency of approximately 1%. To determine the extent of gene flow between these species we studied polymorphism and divergence patterns in 29 regions distributed throughout the genome, including mtDNA and three genes on the Y chromosome. This multilocus approach, together with the comparison to the two allopatric species D. mauritiana and D. sechellia, allowed us to distinguish between forces that should affect all genes and forces that should act on some genes (e.g., introgression). Our results show that D. yakuba mtDNA has replaced that of D. santomea and that there is also significant introgression for two nuclear genes, yellow and salr. The majority of genes, however, has remained distinct. These two species therefore do not form a "hybrid swarm" in which much of the genome shows substantial introgression while disruptive selection maintains distinctness for only a few traits (e.g., pigmentation and male genitalia).

  1. Low diversity Cryptococcus neoformans variety grubii multilocus sequence types from Thailand are consistent with an ancestral African origin.

    Directory of Open Access Journals (Sweden)

    Sitali P Simwami

    2011-04-01

    The global burden of HIV-associated cryptococcal meningitis is estimated at nearly one million cases per year, causing up to a third of all AIDS-related deaths. Molecular epidemiology constitutes the main methodology for understanding the factors underpinning the emergence of this understudied, yet increasingly important, group of pathogenic fungi. Cryptococcus species are notable in the degree that virulence differs amongst lineages, and highly-virulent emerging lineages are changing patterns of human disease both temporally and spatially. Cryptococcus neoformans variety grubii (Cng, serotype A) constitutes the most ubiquitous cause of cryptococcal meningitis worldwide; however, patterns of molecular diversity are understudied across some regions experiencing significant burdens of disease. We compared 183 clinical and environmental isolates of Cng from one such region, Thailand, Southeast Asia, against a global MLST database of 77 Cng isolates. Population genetic analyses showed that Thailand isolates from 11 provinces were highly homogenous, consisting of the same genetic background (globally known as VNI) and exhibiting only ten nearly identical sequence types (STs), with three (STs 44, 45 and 46) dominating our sample. This population contains significantly less diversity when compared against the global population of Cng, specifically Africa. Genetic diversity in Cng was significantly subdivided at the continental level with nearly half (47%) of the global STs unique to a genetically diverse and recombining population in Botswana. These patterns of diversity, when combined with evidence from haplotypic networks and coalescent analyses of global populations, are highly suggestive of an expansion of the Cng VNI clade out of Africa, leading to a limited number of genotypes founding the Asian populations. Divergence time testing estimates the time to the most common ancestor between the African and Asian populations to be 6,920 years ago (95% HPD

  2. Recent speciation in three closely related sympatric specialists: inferences using multi-locus sequence, post-mating isolation and endosymbiont data.

    Directory of Open Access Journals (Sweden)

    Huai-Jun Xue

    Shifting between unrelated host plants is relatively rare for phytophagous insects, and distinct host specificity may play crucial roles in reproductive isolation. However, the isolation status and the relationship between parental divergence and post-mating isolation among closely related sympatric specialists are still poorly understood. Here, multi-locus sequences were used to estimate the relationship among three host plant-specific closely related flea beetles, Altica cirsicola, A. fragariae and A. viridicyanea (abbreviated as AC, AF and AV respectively). The tree topologies were inconsistent using different genes or different combinations of gene fragments. The relationship of AF+(AC+AV) was supported, however, by both the gene tree and the species tree based on concatenated data. Post-mating reproductive data on the results of crossing these three species are best interpreted in the light of a well established phylogeny. Nuclear-induced but not Wolbachia-induced unidirectional cytoplasmic incompatibility, which was detected in AC-AF and AF-AV but not in AC-AV, may also suggest closer genetic affinity between AC and AV. Prevalence of Wolbachia in these three beetles, and the endosymbiont in most individuals of AV and AC sharing the same wsp haplotype, may give further evidence of AF+(AC+AV). Our study also suggested that these three flea beetles diverged in a relatively short time (0.94 My), which may be the result of shifting between unrelated host plants and distinct host specificity. Incomplete post-mating isolation together with almost complete lineage sorting indicated that effective pre-mating isolation among these three species should have evolved.

  3. Optimization of analytical parameters for inferring relationships among Escherichia coli isolates from repetitive-element PCR by maximizing correspondence with multilocus sequence typing data.

    Science.gov (United States)

    Goldberg, Tony L; Gillespie, Thomas R; Singer, Randall S

    2006-09-01

    Repetitive-element PCR (rep-PCR) is a method for genotyping bacteria based on the selective amplification of repetitive genetic elements dispersed throughout bacterial chromosomes. The method has great potential for large-scale epidemiological studies because of its speed and simplicity; however, objective guidelines for inferring relationships among bacterial isolates from rep-PCR data are lacking. We used multilocus sequence typing (MLST) as a "gold standard" to optimize the analytical parameters for inferring relationships among Escherichia coli isolates from rep-PCR data. We chose 12 isolates from a large database to represent a wide range of pairwise genetic distances, based on the initial evaluation of their rep-PCR fingerprints. We conducted MLST with these same isolates and systematically varied the analytical parameters to maximize the correspondence between the relationships inferred from rep-PCR and those inferred from MLST. Methods that compared the shapes of densitometric profiles ("curve-based" methods) yielded consistently higher correspondence values between data types than did methods that calculated indices of similarity based on shared and different bands (maximum correspondences of 84.5% and 80.3%, respectively). Curve-based methods were also markedly more robust in accommodating variations in user-specified analytical parameter values than were "band-sharing coefficient" methods, and they enhanced the reproducibility of rep-PCR. Phylogenetic analyses of rep-PCR data yielded trees with high topological correspondence to trees based on MLST and high statistical support for major clades. These results indicate that rep-PCR yields accurate information for inferring relationships among E. coli isolates and that accuracy can be enhanced with the use of analytical methods that consider the shapes of densitometric profiles.

  4. Multi-Locus Next-Generation Sequence Typing of DNA Extracted From Pooled Colonies Detects Multiple Unrelated Candida albicans Strains in a Significant Proportion of Patient Samples

    Directory of Open Access Journals (Sweden)

    Ningxin Zhang

    2018-06-01

    The yeast Candida albicans is an important opportunistic human pathogen. For C. albicans strain typing or drug susceptibility testing, a single colony recovered from a patient sample is normally used. This is insufficient when multiple strains are present at the site sampled. How often this is the case is unclear. Previous studies, confined to oral, vaginal and vulvar samples, have yielded conflicting results and have assessed too small a number of colonies per sample to reliably detect the presence of multiple strains. We developed a next-generation sequencing (NGS) modification of the highly discriminatory C. albicans MLST (multilocus sequence typing) method, 100+1 NGS-MLST, for detection and typing of multiple strains in clinical samples. In 100+1 NGS-MLST, DNA is extracted from a pool of colonies from a patient sample and also from one of the colonies. MLST amplicons from both DNA preparations are analyzed by high-throughput sequencing. Using base call frequencies, our bespoke DALMATIONS software determines the MLST type of the single colony. If base call frequency differences between pool and single colony indicate the presence of an additional strain, the differences are used to computationally infer the second MLST type without the need for MLST of additional individual colonies. In mixes of previously typed pairs of strains, 100+1 NGS-MLST reliably detected a second strain. Inferred MLST types of second strains were always more similar to their real MLST types than to those of any of 59 other isolates (22 of 31 inferred types were identical to the real type). Using 100+1 NGS-MLST we found that 7/60 human samples, including three superficial candidiasis samples, contained two unrelated strains. In addition, at least one sample contained two highly similar variants of the same strain. The probability of samples containing unrelated strains appears to differ considerably between body sites. Our findings indicate the need for wider surveys to

  5. Systematic Review on Global Epidemiology of Methicillin-Resistant Staphylococcus pseudintermedius: Inference of Population Structure from Multilocus Sequence Typing Data

    DEFF Research Database (Denmark)

    dos Santos, Teresa Pires; Damborg, Peter; Moodley, Arshnee

    2016-01-01

    Background and rationale: Methicillin-resistant Staphylococcus pseudintermedius (MRSP) is a major cause of infections in dogs, also posing a zoonotic risk to humans. This systematic review aimed to determine the global epidemiology of MRSP and provide new insights into the population structure...... the MLST database for this species. Analysis of MLST data was performed with eBURST and ClonalFrame, and the proportion of MRSP isolates resistant to selected antimicrobial drugs was determined for the most predominant clonal complexes. Results: Fifty-eight studies published over the last 10 years were....... In Europe, CC258, which is more frequently susceptible to enrofloxacin and aminoglycosides, and more frequently resistant to sulphonamides/trimethoprim than CC71, is increasingly reported in various countries. CC68, previously described as the epidemic North American clone, is frequently reported...

  6. Multilocus analysis of nucleotide variation and speciation in three closely related Populus (Salicaceae) species.

    Science.gov (United States)

    Du, Shuhui; Wang, Zhaoshan; Ingvarsson, Pär K; Wang, Dongsheng; Wang, Junhui; Wu, Zhiqiang; Tembrock, Luke R; Zhang, Jianguo

    2015-10-01

    Historical tectonism and climate oscillations can isolate and contract the geographical distributions of many plant species, and they are even known to trigger species divergence and ultimately speciation. Here, we estimated the nucleotide variation and speciation in three closely related Populus species, Populus tremuloides, P. tremula and P. davidiana, distributed in North America and Eurasia. We analysed the sequence variation in six single-copy nuclear loci and three chloroplast (cpDNA) fragments in 497 individuals sampled from 33 populations of these three species across their geographic distributions. These three Populus species harboured relatively high levels of nucleotide diversity and showed high levels of nucleotide differentiation. Phylogenetic analysis revealed that P. tremuloides diverged earlier than the other two species. The cpDNA haplotype network result clearly illustrated the dispersal route from North America to eastern Asia and then into Europe. Molecular dating results confirmed that the divergence of these three species coincided with the sundering of the Bering land bridge in the late Miocene and a rapid uplift of the Qinghai-Tibetan Plateau around the Miocene/Pliocene boundary. Vicariance-driven successful allopatric speciation resulting from historical tectonism and climate oscillations most likely played roles in the formation of the disjunct distributions and divergence of these three Populus species. © 2015 John Wiley & Sons Ltd.

  7. Prevalence and Multilocus Genotyping Analysis of Cryptosporidium and Giardia Isolates from Dogs in Chiang Mai, Thailand

    Directory of Open Access Journals (Sweden)

    Sahatchai Tangtrongsup

    2017-05-01

    The occurrence and zoonotic potential of Cryptosporidium spp. and Giardia duodenalis isolated from dogs in Chiang Mai, Thailand were determined. Fecal samples were collected from 109 dogs between July and August 2008. Cryptosporidium spp. infection was determined by immunofluorescent assay (IFA), PCR assays that amplify Cryptosporidium heat-shock protein 70 kDa (hsp70), and two PCR assays that amplify a small subunit-ribosomal RNA (SSU-rRNA). Giardia duodenalis infection was identified using zinc sulfate centrifugal flotation, IFA, and four PCR assays that amplify the Giardia glutamate dehydrogenase (gdh), beta-giardin (bg), and generic and dog-specific assays of triosephosphate isomerase (tpi) genes. Overall prevalence of Cryptosporidium spp. and G. duodenalis was 31.2% and 45.9%, respectively. Sequence analysis of 22 Cryptosporidium-positive samples and 21 Giardia-positive samples revealed the presence of C. canis in 15, and C. parvum in 7, G. duodenalis Assemblage C in 8, D in 11, and mixed of C and D in 2 dogs. Dogs in Chiang Mai were commonly exposed to Cryptosporidium spp. and G. duodenalis. Cryptosporidium parvum can be isolated from the feces of dogs, and all G. duodenalis assemblages were dog-specific. Dogs could be a reservoir for a zoonotic Cryptosporidium infection in humans, but further studies will be required to determine the clinical and zoonotic importance.

  8. Prevalence and Multilocus Genotyping Analysis of Cryptosporidium and Giardia Isolates from Dogs in Chiang Mai, Thailand.

    Science.gov (United States)

    Tangtrongsup, Sahatchai; Scorza, A Valeria; Reif, John S; Ballweber, Lora R; Lappin, Michael R; Salman, Mo D

    2017-05-10

    The occurrence and zoonotic potential of Cryptosporidium spp. and Giardia duodenalis isolated from dogs in Chiang Mai, Thailand were determined. Fecal samples were collected from 109 dogs between July and August 2008. Cryptosporidium spp. infection was determined by immunofluorescent assay (IFA), PCR assays that amplify Cryptosporidium heat-shock protein 70 kDa (hsp70), and two PCR assays that amplify a small subunit-ribosomal RNA (SSU-rRNA). Giardia duodenalis infection was identified using zinc sulfate centrifugal flotation, IFA, and four PCR assays that amplify the Giardia glutamate dehydrogenase (gdh), beta-giardin (bg), and generic and dog-specific assays of triosephosphate isomerase (tpi) genes. Overall prevalence of Cryptosporidium spp. and G. duodenalis was 31.2% and 45.9%, respectively. Sequence analysis of 22 Cryptosporidium-positive samples and 21 Giardia-positive samples revealed the presence of C. canis in 15, and C. parvum in 7, G. duodenalis Assemblage C in 8, D in 11, and mixed of C and D in 2 dogs. Dogs in Chiang Mai were commonly exposed to Cryptosporidium spp. and G. duodenalis. Cryptosporidium parvum can be isolated from the feces of dogs, and all G. duodenalis assemblages were dog-specific. Dogs could be a reservoir for a zoonotic Cryptosporidium infection in humans, but further studies will be required to determine the clinical and zoonotic importance.

  9. Brucella 'HOOF-Prints': strain typing by multi-locus analysis of variable number tandem repeats (VNTRs)

    Directory of Open Access Journals (Sweden)

    Halling Shirley M

    2003-07-01

    Background Currently, there are very few tools available for subtyping Brucella isolates for epidemiological trace-back. Subtyping is difficult because of the genetic homogeneity within the genus. Sequencing of the genomes from three Brucella species has facilitated the search for DNA sequence variability. Recently, hypervariability among short tandem repeat sequences has been exploited for strain-typing of several bacterial pathogens. Results An eight-base pair tandem repeat sequence was discovered in nine genomic loci of the B. abortus genome. Eight loci were hypervariable among the three Brucella species. A PCR-based method was developed to identify the number of repeat units (alleles) at each locus, generating strain-specific fingerprints. None of the loci exhibited species- or biovar-specific alleles. Sometimes, a species or biovar contained a specific allele at one or more loci, but the allele also occurred in other species or biovars. The technique successfully differentiated the type strains for all Brucella species and biovars, among unrelated B. abortus biovar 1 field isolates in cattle, and among B. abortus strains isolated from bison and elk. Isolates from the same herd or from short-term in vitro passage exhibited little or no variability in fingerprint pattern. Sometimes, isolates from an animal would have multiple alleles at a locus, possibly from mixed infections in enzootic areas, residual disease from incomplete depopulation of an infected herd or molecular evolution within the strain. Therefore, a mixed population or a pool of colonies from each animal and/or tissue was tested. Conclusion This paper describes a new method for fingerprinting Brucella isolates based on multi-locus characterization of a variable number, eight-base pair, tandem repeat. We have named this technique "HOOF-Prints" for Hypervariable Octameric Oligonucleotide Finger-Prints. The technique is highly discriminatory among Brucella species, among

  10. Multilocus microsatellite analysis of 'Candidatus Liberibacter asiaticus' associated with citrus Huanglongbing worldwide.

    Science.gov (United States)

    Islam, Md-Sajedul; Glynn, Jonathan M; Bai, Yang; Duan, Yong-Ping; Coletta-Filho, Helvecio D; Kuruba, Gopal; Civerolo, Edwin L; Lin, Hong

    2012-03-20

    Huanglongbing (HLB) is one of the most destructive citrus diseases in the world. The disease is associated with the presence of a fastidious, phloem-limited α-proteobacterium, 'Candidatus Liberibacter asiaticus', 'Ca. Liberibacter africanus' or 'Ca. Liberibacter americanus'. HLB-associated Liberibacters have spread to North America and South America in recent years. While the causal agents of HLB have been putatively identified, information regarding the worldwide population structure and epidemiological relationships for 'Ca. L. asiaticus' is limited. The availability of the 'Ca. L. asiaticus' genome sequence has facilitated development of molecular markers from this bacterium. The objectives of this study were to develop microsatellite markers and conduct genetic analyses of 'Ca. L. asiaticus' from a worldwide collection. Two hundred eighty-seven isolates from the USA (Florida), Brazil, China, India, Cambodia, Vietnam, Taiwan, Thailand, and Japan were analyzed. A panel of seven polymorphic microsatellite markers was developed for 'Ca. L. asiaticus'. Microsatellite analyses across the samples showed that the genetic diversity of 'Ca. L. asiaticus' is higher in Asia than in the Americas. UPGMA and STRUCTURE analyses identified three major genetic groups worldwide. Isolates from India were genetically distinct. East-southeast Asian and Brazilian isolates were generally included in the same group; a few members of this group were found in Florida, but the majority of the isolates from Florida were clustered separately. eBURST analysis predicted three founder haplotypes, which may have given rise to three groups worldwide. Our results identified three major genetic groups of 'Ca. L. asiaticus' worldwide. Isolates from Brazil showed a genetic makeup similar to that of the dominant east-southeast Asian group, suggesting the possibility of a common origin. However, most of the isolates recovered from Florida were clustered in a separate group. While the sources of the dominant 'Ca. L

  11. Multilocus analysis of divergence and introgression in sympatric and allopatric sibling species of the Lutzomyia longipalpis complex in Brazil.

    Science.gov (United States)

    Araki, Alejandra S; Ferreira, Gabriel E M; Mazzoni, Camila J; Souza, Nataly A; Machado, Ricardo C; Bruno, Rafaela V; Peixoto, Alexandre A

    2013-01-01

    Lutzomyia longipalpis, the main vector of visceral leishmaniasis in Latin America, is a complex of sibling species. In Brazil, a number of very closely related sibling species have been revealed by the analyses of copulation songs, sex pheromones and molecular markers. However, the level of divergence and gene flow between the sibling species remains unclear. Brazilian populations of this vector can be divided in two main groups: one producing Burst-type songs and the Cembrene-1 pheromone and a second more diverse group producing various Pulse song subtypes and different pheromones. We analyzed 21 nuclear loci in two pairs of Brazilian populations: two sympatric populations from the Sobral locality (1S and 2S) in northeastern Brazil and two allopatric populations from the Lapinha and Pancas localities in southeastern Brazil. Pancas and Sobral 2S are populations of the Burst/Cembrene-1 species while Lapinha and Sobral 1S are two putative incipient species producing the same pheromone and similar Pulse song subtypes. The multilocus analysis strongly suggests the occurrence of gene flow during the divergence between the sibling species, with different levels of introgression between loci. Moreover, this differential introgression is asymmetrical, with estimated gene flow being higher in the direction of the Burst/Cembrene-1 species. The results indicate that introgressive hybridization has been a crucial phenomenon in shaping the genome of the L. longipalpis complex. This has possible epidemiological implications and is particularly interesting considering the potential for increased introgression caused by man-made environmental changes and the current trend of leishmaniasis urbanization in Brazil.

  12. Multilocus Analysis of Divergence and Introgression in Sympatric and Allopatric Sibling Species of the Lutzomyia longipalpis Complex in Brazil

    Science.gov (United States)

    Mazzoni, Camila J.; Souza, Nataly A.; Machado, Ricardo C.; Bruno, Rafaela V.

    2013-01-01

    Background Lutzomyia longipalpis, the main vector of visceral leishmaniasis in Latin America, is a complex of sibling species. In Brazil, a number of very closely related sibling species have been revealed by the analyses of copulation songs, sex pheromones and molecular markers. However, the level of divergence and gene flow between the sibling species remains unclear. Brazilian populations of this vector can be divided in two main groups: one producing Burst-type songs and the Cembrene-1 pheromone and a second more diverse group producing various Pulse song subtypes and different pheromones. Methodology/Principal Findings We analyzed 21 nuclear loci in two pairs of Brazilian populations: two sympatric populations from the Sobral locality (1S and 2S) in northeastern Brazil and two allopatric populations from the Lapinha and Pancas localities in southeastern Brazil. Pancas and Sobral 2S are populations of the Burst/Cembrene-1 species while Lapinha and Sobral 1S are two putative incipient species producing the same pheromone and similar Pulse song subtypes. The multilocus analysis strongly suggests the occurrence of gene flow during the divergence between the sibling species, with different levels of introgression between loci. Moreover, this differential introgression is asymmetrical, with estimated gene flow being higher in the direction of the Burst/Cembrene-1 species. Conclusions/Significance The results indicate that introgressive hybridization has been a crucial phenomenon in shaping the genome of the L. longipalpis complex. This has possible epidemiological implications and is particularly interesting considering the potential for increased introgression caused by man-made environmental changes and the current trend of leishmaniasis urbanization in Brazil. PMID:24147172

  13. Multilocus phylogeny and MALDI-TOF analysis of the plant pathogenic species Alternaria dauci and relatives

    DEFF Research Database (Denmark)

    Brun, Sophie; Madrid, Hugo; Gerrits Van Den Ende, Bert

    2013-01-01

    The genus Alternaria includes numerous phytopathogenic species, many of which are economically relevant. Traditionally, identification has been based on morphology, but is often hampered by the tendency of some strains to become sterile in culture and by the existence of species-complexes of morp...... trees based on ITS sequences did not differentiate strains of A. solani, A. tomatophila, and A. porri, but these three species formed a clade separate from strains of A. dauci. The resolution improved in trees based on gpd and Alt a 1, which distinguished strains of the four species as separate clades...... of A. solani, and the third included all strains of A. tomatophila, as well as all but one strain of A. solani, and one strain of A. porri. Thus, this study shows the usefulness of MALDI-TOF mass spectrometry as a promising tool for identification of these four species of Alternaria which are closely...

  14. Multilocus inference of species trees and DNA barcoding.

    Science.gov (United States)

    Mallo, Diego; Posada, David

    2016-09-05

    The unprecedented amount of data resulting from next-generation sequencing has opened a new era in phylogenetic estimation. Although large datasets should, in theory, increase phylogenetic resolution, massive, multilocus datasets have uncovered a great deal of phylogenetic incongruence among different genomic regions, due both to stochastic error and to the action of different evolutionary processes such as incomplete lineage sorting, gene duplication and loss and horizontal gene transfer. This incongruence violates one of the fundamental assumptions of the DNA barcoding approach, which assumes that gene history and species history are identical. In this review, we explain some of the most important challenges we will have to face to reconstruct the history of species, and the advantages and disadvantages of different strategies for the phylogenetic analysis of multilocus data. In particular, we describe the evolutionary events that can generate species tree-gene tree discordance, compare the most popular methods for species tree reconstruction, highlight the challenges we need to face when using them and discuss their potential utility in barcoding. Current barcoding methods sacrifice a great amount of statistical power by only considering one locus, and a transition to multilocus barcodes would not only improve current barcoding methods, but also facilitate an eventual transition to species-tree-based barcoding strategies, which could better accommodate scenarios where the barcode gap is too small or inexistent. This article is part of the themed issue 'From DNA barcodes to biomes'. © 2016 The Authors.

  15. Adult height, coronary heart disease and stroke : A multi-locus Mendelian randomization meta-analysis

    NARCIS (Netherlands)

    Nüesch, Eveline; Dale, Caroline; Palmer, Tom M.; White, Jon; Keating, Brendan J.; van Iperen, Erik P A; Goel, Anuj; Padmanabhan, Sandosh; Asselbergs, F. W.; Verschuren, W. M.; Wijmenga, C.; Van der Schouw, Y. T.; Onland-Moret, N. C.; Lange, Leslie A.; Hovingh, G. K.; Sivapalaratnam, Suthesh; Morris, Richard W.; Whincup, Peter H.; Wannamethe, Goya S.; Gaunt, Tom R.; Ebrahim, Shah; Steel, Laura; Nair, Nikhil; Reiner, Alexander P.; Kooperberg, Charles; Wilson, James F.; Bolton, Jennifer L.; McLachlan, Stela; Price, Jacqueline F.; Strachan, Mark W J; Robertson, Christine M.; Kleber, Marcus E.; Delgado, Graciela; März, Winfried; Melander, Olle; Dominiczak, Anna F.; Farrall, Martin; Watkins, Hugh; Leusink, Maarten; Maitland-van der Zee, Anke H.; de Groot, Mark C H; Dudbridge, Frank; Hingorani, Aroon; Ben-Shlomo, Yoav; Lawlor, Debbie A.; Amuzu, A.; Caufield, M.; Cavadino, A.; Cooper, J.; Davies, T. L.; Day, I. N.; Drenos, F.; Engmann, J.; Finan, C.; Giambartolomei, C.; Hardy, R.; Humphries, S. E.; Hypponen, E.; Kivimaki, M.; Kuh, D.; Kumari, M.; Ong, K.; Plagnol, V.; Power, C.; Richards, M.; Shah, S.; Shah, T.; Sofat, R.; Talmud, P. J.; Wareham, N.; Warren, H.; Whittaker, J. C.; Wong, A.; Zabaneh, D.; Smith, George Davey; Wells, Jonathan C.; Leon, David A.; Holmes, Michael V.; Casas, Juan P.

    2016-01-01

    Background: We investigated causal effect of completed growth, measured by adult height, on coronary heart disease (CHD), stroke and cardiovascular traits, using instrumental variable (IV) Mendelian randomization meta-analysis. Methods: We developed an allele score based on 69 single nucleotide

  16. Sub-typing of extended-spectrum-β-lactamase-producing isolates from a nosocomial outbreak: application of a 10-loci generic Escherichia coli multi-locus variable number tandem repeat analysis.

    Directory of Open Access Journals (Sweden)

    Nahid Karami

    Extended-spectrum β-lactamase producing Escherichia coli (ESBL-E. coli) were isolated from infants hospitalized in a neonatal, post-surgery ward during a four-month-long nosocomial outbreak and six-month follow-up period. A multi-locus variable number tandem repeat analysis (MLVA), using 10 loci (GECM-10), for 'generic' (i.e., non-STEC) E. coli was applied for sub-species-level (i.e., sub-typing) delineation and characterization of the bacterial isolates. Ten distinct GECM-10 types were detected among 50 isolates, correlating with the types defined by pulsed-field gel electrophoresis (PFGE), which is recognized to be the 'gold-standard' method for clinical epidemiological analyses. Multi-locus sequence typing (MLST), multiplex PCR genotyping of bla CTX-M, bla TEM, bla OXA and bla SHV genes and antibiotic resistance profiling, as well as a PCR assay specific for detecting isolates of the pandemic O25b-ST131 strain, further characterized the outbreak isolates. Two clusters of isolates with distinct GECM-10 types (G06-04 and G07-02), corresponding to two major PFGE types and the MLST-based sequence types (STs) 131 and 1444, respectively, were confirmed to be responsible for the outbreak. The application of GECM-10 sub-typing provided reliable, rapid and cost-effective epidemiological characterizations of the ESBL-producing isolates from a nosocomial outbreak that correlated with and may be used to replace the laborious PFGE protocol for analyzing generic E. coli.

  17. Direct, rapid RNA sequence analysis

    International Nuclear Information System (INIS)

    Peattie, D.A.

    1987-01-01

    The original methods of RNA sequence analysis were based on enzymatic production and chromatographic separation of overlapping oligonucleotide fragments from within an RNA molecule followed by identification of the mononucleotides comprising the oligomer. Over the past decade the field of nucleic acid sequencing has changed dramatically, however, and RNA molecules now can be sequenced in a variety of more streamlined fashions. Most of the more recent advances in RNA sequencing have involved one-dimensional electrophoretic separation of 32P-end-labeled oligoribonucleotides on polyacrylamide gels. In this chapter the author discusses two of these methods for determining the nucleotide sequences of RNA molecules rapidly: the chemical method and the enzymatic method. Both methods are direct and degradative, i.e., they rely on fragmatic and chemical approaches should be utilized. The single-strand-specific ribonucleases (A, T1, T2, and S1) provide an efficient means to locate double-helical regions rapidly, and the chemical reactions provide a means to determine the RNA sequence within these regions. In addition, the chemical reactions allow one to assign interactions to specific atoms and to distinguish secondary interactions from tertiary ones. If the RNA molecule is small enough to be sequenced directly by the enzymatic or chemical method, the probing reactions can be done easily at the same time as sequencing reactions.

  18. Integrated sequence analysis. Final report

    International Nuclear Information System (INIS)

    Andersson, K.; Pyy, P.

    1998-02-01

    The NKS/RAK subproject 3 'integrated sequence analysis' (ISA) was formulated with the overall objective to develop and to test integrated methodologies in order to evaluate event sequences with significant human action contribution. The term 'methodology' denotes not only technical tools but also methods for integration of different scientific disciplines. In this report, we first discuss the background of ISA and the surveys made to map methods in different application fields, such as man machine system simulation software, human reliability analysis (HRA) and expert judgement. Specific event sequences were, after the surveys, selected for application and testing of a number of ISA methods. The event sequences discussed in the report were cold overpressure of BWR, shutdown LOCA of BWR, steam generator tube rupture of a PWR and BWR disturbed signal view in the control room after an external event. Different teams analysed these sequences by using different ISA and HRA methods. Two kinds of results were obtained from the ISA project: sequence specific and more general findings. The sequence specific results are discussed together with each sequence description. The general lessons are discussed under a separate chapter by using comparisons of different case studies. These lessons include areas ranging from plant safety management (design, procedures, instrumentation, operations, maintenance and safety practices) to methodological findings (ISA methodology, PSA, HRA, physical analyses, behavioural analyses and uncertainty assessment). Finally, a discussion about the project follows and conclusions are presented. An interdisciplinary study of complex phenomena is a natural way to produce valuable and innovative results. This project came up with structured ways to perform ISA and managed to apply them in practice. The project also highlighted some areas where more work is needed. In the HRA work, development is required for the use of simulators and expert judgement as

  19. Fractals in DNA sequence analysis

    Institute of Scientific and Technical Information of China (English)

    Yu Zu-Guo(喻祖国); Vo Anh; Gong Zhi-Min(龚志民); Long Shun-Chao(龙顺潮)

    2002-01-01

    Fractal methods have been successfully used to study many problems in physics, mathematics, engineering, finance, and even in biology. There has been an increasing interest in unravelling the mysteries of DNA; for example, how can we distinguish coding and noncoding sequences, and the problems of classification and evolution relationship of organisms are key problems in bioinformatics. Although much research has been carried out by taking into consideration the long-range correlations in DNA sequences, and the global fractal dimension has been used in these works by other people, the models and methods are somewhat rough and the results are not satisfactory. In recent years, our group has introduced a time series model (statistical point of view) and a visual representation (geometrical point of view) to DNA sequence analysis. We have also used fractal dimension, correlation dimension, the Hurst exponent and the dimension spectrum (multifractal analysis) to discuss problems in this field. In this paper, we introduce these fractal models and methods and the results of DNA sequence analysis.

  20. A New Perspective on Polyploid Fragaria (Strawberry) Genome Composition Based on Large-Scale, Multi-Locus Phylogenetic Analysis

    OpenAIRE

    Yang, Yilong; Davis, Thomas M

    2017-01-01

    Abstract The subgenomic compositions of the octoploid (2n = 8× = 56) strawberry (Fragaria) species, including the economically important cultivated species Fragaria x ananassa, have been a topic of long-standing interest. Phylogenomic approaches utilizing next-generation sequencing technologies offer a new window into species relationships and the subgenomic compositions of polyploids. We have conducted a large-scale phylogenetic analysis of Fragaria (strawberry) species using the Fluidigm Ac...

  1. Integrated sequence analysis. Final report

    Energy Technology Data Exchange (ETDEWEB)

    Andersson, K.; Pyy, P.

    1998-02-01

    The NKS/RAK subproject 3 'integrated sequence analysis' (ISA) was formulated with the overall objective to develop and to test integrated methodologies in order to evaluate event sequences with significant human action contribution. The term 'methodology' denotes not only technical tools but also methods for integration of different scientific disciplines. In this report, we first discuss the background of ISA and the surveys made to map methods in different application fields, such as man machine system simulation software, human reliability analysis (HRA) and expert judgement. Specific event sequences were, after the surveys, selected for application and testing of a number of ISA methods. The event sequences discussed in the report were cold overpressure of BWR, shutdown LOCA of BWR, steam generator tube rupture of a PWR and BWR disturbed signal view in the control room after an external event. Different teams analysed these sequences by using different ISA and HRA methods. Two kinds of results were obtained from the ISA project: sequence specific and more general findings. The sequence specific results are discussed together with each sequence description. The general lessons are discussed under a separate chapter by using comparisons of different case studies. These lessons include areas ranging from plant safety management (design, procedures, instrumentation, operations, maintenance and safety practices) to methodological findings (ISA methodology, PSA, HRA, physical analyses, behavioural analyses and uncertainty assessment). Finally, a discussion about the project follows and conclusions are presented. An interdisciplinary study of complex phenomena is a natural way to produce valuable and innovative results. This project came up with structured ways to perform ISA and managed to apply them in practice. The project also highlighted some areas where more work is needed. In the HRA work, development is required for the use of simulators and expert judgement as

  2. A New Perspective on Polyploid Fragaria (Strawberry) Genome Composition Based on Large-Scale, Multi-Locus Phylogenetic Analysis.

    Science.gov (United States)

    Yang, Yilong; Davis, Thomas M

    2017-12-01

    The subgenomic compositions of the octoploid (2n = 8× = 56) strawberry (Fragaria) species, including the economically important cultivated species Fragaria x ananassa, have been a topic of long-standing interest. Phylogenomic approaches utilizing next-generation sequencing technologies offer a new window into species relationships and the subgenomic compositions of polyploids. We have conducted a large-scale phylogenetic analysis of Fragaria (strawberry) species using the Fluidigm Access Array system and 454 sequencing platform. About 24 single-copy or low-copy nuclear genes distributed across the genome were amplified and sequenced from 96 genomic DNA samples representing 16 Fragaria species from diploid (2×) to decaploid (10×), including the most extensive sampling of octoploid taxa yet reported. Individual gene trees were constructed by different tree-building methods. Mosaic genomic structures of diploid Fragaria species consisting of sequences at different phylogenetic positions were observed. Our findings support the presence in octoploid species of genetic signatures from at least five diploid ancestors (F. vesca, F. iinumae, F. bucharica, F. viridis, and at least one additional allele contributor of unknown identity), and question the extent to which distinct subgenomes are preserved over evolutionary time in the allopolyploid Fragaria species. In addition, our data support divergence between the two wild octoploid species, F. virginiana and F. chiloensis. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  3. How do you solve a problem like Letharia? A new look at cryptic species in lichen-forming fungi using Bayesian clustering and SNPs from multilocus sequence data.

    Directory of Open Access Journals (Sweden)

    Susanne Altermann

    Full Text Available The inclusion of molecular data is increasingly an integral part of studies assessing species boundaries. Analyses based on predefined groups may obscure patterns of differentiation, and population assignment tests provide an alternative for identifying population structure and barriers to gene flow. In this study, we apply population assignment tests implemented in the programs STRUCTURE and BAPS to single nucleotide polymorphisms from DNA sequence data generated for three previous studies of the lichenized fungal genus Letharia. Previous molecular work employing a gene genealogical approach circumscribed six species-level lineages within the genus, four putative lineages within the nominal taxon L. columbiana (Nutt.) J.W. Thomson and two sorediate lineages. We show that Bayesian clustering implemented in the program STRUCTURE was generally able to recover the same six putative Letharia lineages. Population assignments were largely consistent across a range of scenarios, including: extensive amounts of missing data, the exclusion of SNPs from variable markers, and inferences based on SNPs from as few as three gene regions. While our study provided additional evidence corroborating the six candidate Letharia species, the equivalence of these genetic clusters with species-level lineages is uncertain due, in part, to limited phylogenetic signal. Furthermore, both the BAPS analysis and the ad hoc ΔK statistic from results of the STRUCTURE analysis suggest that population structure can possibly be captured with fewer genetic groups. Our findings also suggest that uneven sampling across taxa may be responsible for the contrasting inferences of population substructure. Our results consistently supported two distinct sorediate groups, 'L. lupina' and L. vulpina, and subtle morphological differences support this distinction. Similarly, the putative apotheciate species 'L. lucida' was also consistently supported as a distinct genetic cluster. However

  4. Multilocus genetics to reconstruct aeromonad evolution

    Directory of Open Access Journals (Sweden)

    Roger Frédéric

    2012-04-01

    Full Text Available Abstract Background: Aeromonas spp. are versatile bacteria that exhibit a wide variety of lifestyles. In an attempt to improve the understanding of human aeromonosis, we investigated whether clinical isolates displayed specific characteristics in terms of genetic diversity, population structure and mode of evolution among Aeromonas spp. A collection of 195 Aeromonas isolates from human, animal and environmental sources was therefore genotyped using multilocus sequence analysis (MLSA) based on the dnaK, gltA, gyrB, radA, rpoB, tsf and zipA genes. Results: The MLSA showed a high level of genetic diversity among the population, and multilocus-based phylogenetic analysis (MLPA) revealed 3 major clades: the A. veronii, A. hydrophila and A. caviae clades, among the eleven clades detected. Lower genetic diversity was observed within the A. caviae clade as well as among clinical isolates compared to environmental isolates. Clonal complexes, each of which included a limited number of strains, mainly corresponded to host-associated subclusters of strains, i.e., a fish-associated subset within A. salmonicida and 11 human-associated subsets, 9 of which included only disease-associated strains. The population structure was shown to be clonal, with modes of evolution that involved mutations in general and recombination events locally. Recombination was detected in 5 genes in the MLSA scheme and concerned approximately 50% of the STs. Therefore, these recombination events could explain the observed phylogenetic incongruities and low robustness. However, the MLPA globally confirmed the current systematics of the genus Aeromonas. Conclusions: Evolution in the genus Aeromonas has resulted in exceptionally high genetic diversity. Emerging from this diversity, subsets of strains appeared to be host adapted and/or “disease specialized” while the A. caviae clade displayed an atypical tempo of evolution among aeromonads. Considering that A. salmonicida has been

  5. Neisseria gonorrhoeae Sequence Typing for Antimicrobial Resistance, a Novel Antimicrobial Resistance Multilocus Typing Scheme for Tracking Global Dissemination of N. gonorrhoeae Strains.

    Science.gov (United States)

    Demczuk, W; Sidhu, S; Unemo, M; Whiley, D M; Allen, V G; Dillon, J R; Cole, M; Seah, C; Trembizki, E; Trees, D L; Kersh, E N; Abrams, A J; de Vries, H J C; van Dam, A P; Medina, I; Bharat, A; Mulvey, M R; Van Domselaar, G; Martin, I

    2017-05-01

    A curated Web-based user-friendly sequence typing tool based on antimicrobial resistance determinants in Neisseria gonorrhoeae was developed and is publicly accessible (https://ngstar.canada.ca). The N. gonorrhoeae Sequence Typing for Antimicrobial Resistance (NG-STAR) molecular typing scheme uses the DNA sequences of 7 genes (penA, mtrR, porB, ponA, gyrA, parC, and 23S rRNA) associated with resistance to β-lactam antimicrobials, macrolides, or fluoroquinolones. NG-STAR uses the entire penA sequence, combining the historical nomenclature for penA types I to XXXVIII with novel nucleotide sequence designations; the full mtrR sequence and a portion of its promoter region; portions of ponA, porB, gyrA, and parC; and 23S rRNA sequences. NG-STAR grouped 768 isolates into 139 sequence types (STs) (n = 660) consisting of 29 clonal complexes (CCs) having a maximum of a single-locus variation, and 76 NG-STAR STs (n = 109) were identified as unrelated singletons. NG-STAR had a high Simpson's diversity index value of 96.5% (95% confidence interval [CI] = 0.959 to 0.969). The most common STs were NG-STAR ST-90 (n = 100; 13.0%), ST-42 and ST-91 (n = 45; 5.9%), ST-64 (n = 44; 5.72%), and ST-139 (n = 42; 5.5%). Decreased susceptibility to azithromycin was associated with NG-STAR ST-58, ST-61, ST-64, ST-79, ST-91, and ST-139 (n = 156; 92.3%); decreased susceptibility to cephalosporins was associated with NG-STAR ST-90, ST-91, and ST-97 (n = 162; 94.2%); and ciprofloxacin resistance was associated with NG-STAR ST-26, ST-90, ST-91, ST-97, ST-150, and ST-158 (n = 196; 98.0%). All isolates of NG-STAR ST-42, ST-43, ST-63, ST-81, and ST-160 (n = 106) were susceptible to all four antimicrobials. The standardization of nomenclature associated with antimicrobial resistance determinants through an internationally available database will facilitate the monitoring of the global dissemination of antimicrobial-resistant N. gonorrhoeae strains. © Crown copyright 2017.
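
    The grouping of sequence types into clonal complexes "having a maximum of a single-locus variation" can be illustrated with a short sketch. It assumes single-linkage grouping (two STs join the same complex when their 7-locus allelic profiles differ at no more than one locus), which the abstract does not spell out; the function names and the allele numbers below are hypothetical and are not taken from the NG-STAR database.

```python
from itertools import combinations


def locus_differences(profile_a, profile_b):
    """Count the loci at which two allelic profiles differ."""
    return sum(1 for x, y in zip(profile_a, profile_b) if x != y)


def clonal_complexes(profiles):
    """Group STs by single-locus variation (assumed single-linkage rule).

    profiles: dict mapping an ST name to a tuple of allele numbers.
    Returns (complexes, singletons): a list of sorted ST groups and a
    sorted list of STs that are linked to no other ST.
    """
    parent = {st: st for st in profiles}

    def find(st):
        # Follow parent links to the group representative, compressing the path.
        while parent[st] != st:
            parent[st] = parent[parent[st]]
            st = parent[st]
        return st

    for a, b in combinations(profiles, 2):
        if locus_differences(profiles[a], profiles[b]) <= 1:
            parent[find(a)] = find(b)

    groups = {}
    for st in profiles:
        groups.setdefault(find(st), []).append(st)

    complexes = sorted(sorted(g) for g in groups.values() if len(g) > 1)
    singletons = sorted(g[0] for g in groups.values() if len(g) == 1)
    return complexes, singletons


# Hypothetical 7-locus profiles (penA, mtrR, porB, ponA, gyrA, parC, 23S rRNA).
example_profiles = {
    "ST-A": (1, 1, 1, 1, 1, 1, 1),
    "ST-B": (1, 1, 1, 1, 1, 1, 2),  # single-locus variant of ST-A
    "ST-C": (1, 1, 1, 1, 1, 3, 2),  # single-locus variant of ST-B
    "ST-D": (9, 9, 9, 9, 9, 9, 9),  # unrelated singleton
}
complexes, singletons = clonal_complexes(example_profiles)
```

    Under this chaining rule ST-A and ST-C fall in the same complex even though they differ at two loci, because ST-B links them; a stricter rule would need a different grouping step.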

  6. Comprehensive Phylogenetic Analysis of Bovine Non-aureus Staphylococci Species Based on Whole-Genome Sequencing

    Science.gov (United States)

    Naushad, Sohail; Barkema, Herman W.; Luby, Christopher; Condas, Larissa A. Z.; Nobrega, Diego B.; Carson, Domonique A.; De Buck, Jeroen

    2016-01-01

    Non-aureus staphylococci (NAS), a heterogeneous group of a large number of species and subspecies, are the most frequently isolated pathogens from intramammary infections in dairy cattle. Phylogenetic relationships among bovine NAS species are controversial and have mostly been determined based on single-gene trees. Herein, we analyzed phylogeny of bovine NAS species using whole-genome sequencing (WGS) of 441 distinct isolates. In addition, evolutionary relationships among bovine NAS were estimated from multilocus data of 16S rRNA, hsp60, rpoB, sodA, and tuf genes and sequences from these and numerous other single genes/proteins. All phylogenies were created with FastTree, Maximum-Likelihood, Maximum-Parsimony, and Neighbor-Joining methods. Regardless of methodology, WGS-trees clearly separated bovine NAS species into five monophyletic coherent clades. Furthermore, there were consistent interspecies relationships within clades in all WGS phylogenetic reconstructions. Except for the Maximum-Parsimony tree, multilocus data analysis similarly produced five clades. There were large variations in determining clades and interspecies relationships in single gene/protein trees, under different methods of tree constructions, highlighting limitations of using single genes for determining bovine NAS phylogeny. However, based on WGS data, we established a robust phylogeny of bovine NAS species, unaffected by method or model of evolutionary reconstructions. Therefore, it is now possible to determine associations between phylogeny and many biological traits, such as virulence, antimicrobial resistance, environmental niche, geographical distribution, and host specificity. PMID:28066335

  7. Prevalence of blaZ gene types and the cefazolin inoculum effect among methicillin-susceptible Staphylococcus aureus blood isolates and their association with multilocus sequence types and clinical outcome.

    Science.gov (United States)

    Chong, Y P; Park, S-J; Kim, E S; Bang, K-M; Kim, M-N; Kim, S-H; Lee, S-O; Choi, S-H; Jeong, J-Y; Woo, J H; Kim, Y S

    2015-02-01

    Cefazolin treatment failures have been described for bacteraemia caused by methicillin-susceptible Staphylococcus aureus (MSSA) with type A β-lactamase and inoculum effect (InE). We investigated the prevalence of blaZ (β-lactamase) gene types and a cefazolin InE among MSSA blood isolates in South Korea and evaluated their association with specific genotypes. The clinical impact of the cefazolin InE was also evaluated. A total of 220 MSSA isolates were collected from a prospective cohort study of S. aureus bacteraemia. A pronounced InE with cefazolin was defined as a ≥4-fold increase in the minimum inhibitory concentration (MIC) between a standard and high inoculum, resulting in a non-susceptible MIC. Sequencing of blaZ and multilocus sequence typing (MLST) were performed. Clinical outcomes were assessed in 77 patients treated with cefazolin. The blaZ gene was detected in 92 % of the 220 MSSA isolates. Type C β-lactamase was the most common (53 %), followed by type B (20 %) and type A (17 %). Certain genotypes were significantly associated with specific β-lactamase types (notably, ST30 and type A β-lactamase). A pronounced cefazolin InE was observed in 13 % of isolates. Most of these (79 %) expressed type A β-lactamase and ST30 was the predominant (55 %) clone amongst them. Cefazolin treatment failure was not observed in patients infected with strains exhibiting a pronounced InE. These strains had no impact on other clinical outcomes. In conclusion, the prevalence of a pronounced InE with cefazolin could be dependent upon distributions of MSSA genotypes. Cefazolin can likely be used for the treatment of MSSA bacteraemia (except endocarditis), without consideration of an InE.
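
    The definition of a pronounced inoculum effect given above (a ≥4-fold MIC increase between standard and high inoculum that results in a non-susceptible MIC) can be written as a small check. This is only a sketch: the function name is hypothetical, and the non-susceptibility breakpoint is left as a required parameter because the abstract does not state the value used.

```python
def pronounced_inoculum_effect(mic_standard, mic_high, susceptible_max):
    """Return True when the abstract's two criteria are both met.

    mic_standard:    MIC at standard inoculum (same units as mic_high).
    mic_high:        MIC at high inoculum.
    susceptible_max: highest MIC still regarded as susceptible; the
                     caller must supply this (assumption, not from the source).
    """
    fold_increase = mic_high / mic_standard
    return fold_increase >= 4 and mic_high > susceptible_max


# Illustrative values only, not data from the study.
print(pronounced_inoculum_effect(1, 16, susceptible_max=8))  # 16-fold, non-susceptible
print(pronounced_inoculum_effect(1, 4, susceptible_max=8))   # 4-fold, still susceptible
```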

  8. Dynamic formation of asexual diploid and polyploid lineages: multilocus analysis of Cobitis reveals the mechanisms maintaining the diversity of clones.

    Directory of Open Access Journals (Sweden)

    Karel Janko

    Full Text Available Given the hybrid genomic constitutions and increased ploidy of many asexual animals, the identification of processes governing the origin and maintenance of clonal diversity provides useful information about the evolutionary consequences of interspecific hybridization, asexuality and polyploidy. In order to understand the processes driving observed diversity of biotypes and clones in the Cobitis taenia hybrid complex, we performed fine-scale genetic analysis of the Central European hybrid zone between two sexual species using microsatellite genotyping and mtDNA sequencing. We found that the hybrid zone is populated by an assemblage of clonally (gynogenetically) reproducing di-, tri- and tetraploid hybrid lineages and that successful clones, which are capable of spatial expansion, recruit from two ploidy levels, i.e. diploid and triploid. We further compared the distribution of observed estimates of clonal ages to theoretical distributions simulated under various assumptions and showed that new clones are most likely continuously recruited from ancestral populations. This suggests that the clonal diversity is maintained by dynamic equilibrium between origination and extinction of clonal lineages. On the other hand, an interclonal selection is implied by nonrandom spatial distribution of individual clones with respect to the coexisting sexual species. Importantly, there was no evidence for sexually reproducing hybrids or clonally reproducing non-hybrid forms. Together with previous successful laboratory synthesis of clonal Cobitis hybrids, our data thus provide the most compelling evidence that (1) the origin of asexuality is causally linked to interspecific hybridization; (2) successful establishment of clones is not restricted to one specific ploidy level; and (3) the initiation of clonality and polyploidy may be dynamic and continuous in asexual complexes.

  9. Multilocus DNA fingerprints in gallinaceous birds: general approach and problems.

    Science.gov (United States)

    Hanotte, O; Bruford, M W; Burke, T

    1992-06-01

    Multilocus profiles were investigated in five different species of Galliformes (ring-necked pheasant Phasianus colchicus, Indian peafowl Pavo cristatus, Japanese quail Coturnix coturnix japonica, domestic chicken Gallus gallus, and red grouse Lagopus lagopus scoticus) using two human multilocus probes (33.6 and 33.15) in combination with each of four restriction enzymes (AluI, DdeI, HaeIII or HinfI). All the species show a DNA fingerprint-like pattern using at least one restriction enzyme in combination with each multilocus probe. The number of bands detected and the value of the index of similarity for each species differ significantly between the profiles obtained with each multilocus probe. Some enzyme/probe combinations reveal strong cross-hybridization of the multilocus probes with satellite or satellite-like DNA sequences in pheasant, peacock, quail and chicken, which partially or completely prevented scoring of the profile. The choice of restriction enzyme was found to influence the number of bands, the value of the index of similarity and the probability of obtaining an identical fingerprint between unrelated individuals. The Mendelian inheritance and independent segregation of the fragments detected using AluI was investigated in three species (ring-necked pheasant, Indian peafowl and red grouse). Some bands were shown to be tightly linked. An extreme case was encountered in the red grouse, where 12 of the 15 bands scored in one parent represented only two, apparently allelic, haplotypes and so derived from a single locus. However, fingerprint patterns will often be adequate for use in paternity analyses, such as in behavioural studies, despite the occurrence of haplotypic sets of bands. Identical DNA multilocus profiles were sometimes observed between captive-bred siblings in one species. These results emphasize the desirability of determining, in each new species, the optimal experimental conditions as a preliminary to any behavioural or population

  10. Sequence analysis of Leukemia DNA

    Science.gov (United States)

    Nacong, Nasria; Lusiyanti, Desy; Irawan, Muhammad. Isa

    2018-03-01

    Cancer is a very deadly disease, and one form of it is leukemia, better known as blood cancer. Cancer cells can be detected by a laboratory DNA test. This study focused on local alignment of leukemia and non-leukemia data obtained from NCBI in the form of DNA sequences, using the Smith-Waterman algorithm. The Smith-Waterman algorithm was introduced by T. F. Smith and M. S. Waterman in 1981. The algorithm tries to find the most similar region of a pair of sequences by giving a negative value to an unequal base pair (mismatch) and a positive value to an identical base pair (match). The maximum positive score marks the end of the alignment, and the minimum score (zero) marks its start. This study will use sequences of leukemia and 3 sequences of non leukemia.
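
    The local alignment procedure described above can be sketched as follows. This is a minimal illustration of the Smith-Waterman recurrence with a linear gap penalty; the scoring values (match +2, mismatch -1, gap -1) and the two short test sequences are assumptions chosen for the example, not the parameters or data of the study.

```python
def smith_waterman(a, b, match=2, mismatch=-1, gap=-1):
    """Return (best_score, aligned_a, aligned_b) for the best local alignment."""
    rows, cols = len(a) + 1, len(b) + 1
    # Score matrix; the first row and column stay at zero.
    H = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    for i in range(1, rows):
        for j in range(1, cols):
            diag = H[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Scores are floored at zero so an alignment can restart anywhere.
            H[i][j] = max(0, diag, H[i - 1][j] + gap, H[i][j - 1] + gap)
            if H[i][j] > best:
                best, best_pos = H[i][j], (i, j)

    # Trace back from the maximum cell until a zero cell is reached.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and H[i][j] > 0:
        step = match if a[i - 1] == b[j - 1] else mismatch
        if H[i][j] == H[i - 1][j - 1] + step:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif H[i][j] == H[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1

    return best, "".join(reversed(out_a)), "".join(reversed(out_b))


# Two made-up sequences sharing the local region "ACGT".
score, top, bottom = smith_waterman("TTTACGTAAA", "GGGACGTCCC")
print(score, top, bottom)
```

    The traceback starts at the highest-scoring cell (the end of the alignment) and stops at the first zero cell (its start), which is the behaviour the abstract describes.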

  11. Specific multilocus variable-number tandem-repeat analysis genotypes of Mycoplasma pneumoniae are associated with diseases severity and macrolide susceptibility.

    Directory of Open Access Journals (Sweden)

    Jiuxin Qu

    Full Text Available Clinical relevance of multilocus variable-number tandem-repeat (VNTR) analysis (MLVA) in patients with community-acquired pneumonia (CAP) by Mycoplasma pneumoniae (M. pneumoniae) is unknown. A multi-center, prospective study was conducted from November 2010 to April 2012. Nine hundred and fifty-four CAP patients were consecutively enrolled. M. pneumoniae clinical isolates were obtained from throat swabs. MLVA typing was applied to all isolates. Comparison of pneumonia severity index (PSI) and clinical features among patients infected with different MLVA types of M. pneumoniae were conducted. One hundred and thirty-six patients were positive with M. pneumoniae culture. The clinical isolates were clustered into 18 MLVA types. One hundred and fourteen (88.3%) isolates were resistant to macrolide, covering major MLVA types. The macrolide non-resistant rate of M. pneumoniae isolates with Mpn13-14-15-16 profile of 3-5-6-2 was significantly higher than that of other types (p ≤ 0.001). Patients infected with types U (5-4-5-7-2) and J (3-4-5-7-2) had significantly higher PSI scores (p<0.001) and longer total duration of cough (p = 0.011). Therefore it seems that there is a correlation between certain MLVA types and clinical severity of disease and the presence of macrolide resistance.

  12. Epidemiological analysis of Leishmania tropica strains and giemsa-stained smears from Syrian and Turkish leishmaniasis patients using multilocus microsatellite typing (MLMT).

    Directory of Open Access Journals (Sweden)

    Mehmet Karakuş

    2017-04-01

    Full Text Available Turkey is located in an important geographical location, in terms of the epidemiology of vector-borne diseases, linking Asia and Europe. Cutaneous leishmaniasis (CL) is one of the endemic diseases in Turkey and, according to the Ministry of Health of Turkey, 45% of CL patients originate from Şanlıurfa province located in southeastern Turkey. Herein, the epidemiological status of CL, caused by L. tropica, in Turkey was examined using multilocus microsatellite typing (MLMT) of strains obtained from Turkish and Syrian patients. A total of 38 cryopreserved strains and 20 Giemsa-stained smears were included in the present study. MLMT was performed using 12 highly specific microsatellite markers. Delta K (ΔK) calculation and Bayesian statistics were used to determine the population structure. Three main populations (POP A, B and C) were identified and further examination revealed the presence of three subpopulations for POP B and C. Combined analysis was performed using the data of previously typed L. tropica strains and Mediterranean and Şanlıurfa populations were identified. This finding suggests that the epidemiological status of L. tropica is more complicated than expected when compared to previous studies. A new population, comprised of Syrian L. tropica samples, was reported for the first time in Turkey, and the data presented here will provide new epidemiological information for further studies.

  13. A divergent spirochete strain isolated from a resident of the southeastern United States was identified by multilocus sequence typing as Borrelia bissettii

    Czech Academy of Sciences Publication Activity Database

    Golovchenko, Maryna; Vancová, Marie; Clark, K.; Oliver, J. H., Jr.; Grubhoffer, Libor; Rudenko, Natalia

    2016-01-01

    Vol. 9, FEB 4 (2016), article no. 68. ISSN 1756-3305 EU Projects: European Commission(XE) 278976 - ANTIGONE Institutional support: RVO:60077344 Keywords: Borrelia * Borrelia bissettii * MLST analysis * live spirochete * divergent strain Subject RIV: EG - Zoology Impact factor: 3.080, year: 2016

  14. Genome Sequencing and Analysis Conference IV

    Energy Technology Data Exchange (ETDEWEB)

    1993-12-31

    J. Craig Venter and C. Thomas Caskey co-chaired Genome Sequencing and Analysis Conference IV, held at Hilton Head, South Carolina from September 26-30, 1992. Venter opened the conference by noting that approximately 400 researchers from 16 nations were present, four times as many participants as at Genome Sequencing Conference I in 1989. Venter also introduced the Data Fair, a new component of the conference allowing exchange and on-site computer analysis of unpublished sequence data.

  15. Characterisation by multilocus sequence and porA and flaA typing of Campylobacter jejuni isolated from samples of dog faeces collected in one city in New Zealand.

    Science.gov (United States)

    Mohan, V; Stevenson, M A; Marshall, J C; French, N P

    2017-07-01

    To investigate the prevalence of Campylobacter spp. and C. jejuni in dog faecal material collected from dog walkways in the city of Palmerston North, New Zealand, and to characterise the C. jejuni isolates by multilocus sequence typing (MLST) and porA and flaA antigen gene typing. A total of 355 fresh samples of dog faeces were collected from bins provided for the disposal of dog faeces in 10 walkways in Palmerston North, New Zealand, between August 2008 and July 2009. Presumptive Campylobacter colonies, cultured on modified charcoal cefoperazone deoxycholate plates, were screened for genus Campylobacter and C. jejuni by PCR. The C. jejuni isolates were subsequently characterised by MLST and porA and flaA typing, and C. jejuni sequence types (ST) were assigned. Of the 355 samples collected, 72 (20 (95% CI=16-25)%) were positive for Campylobacter spp. and 22 (6 (95% CI=4-9)%) were positive for C. jejuni. Of the 22 C. jejuni isolates, 19 were fully typed by MLST. Ten isolates were assigned to the clonal complex ST-45 and three to ST-52. The allelic combinations of ST-45/flaA 21/porA 44 (n=3), ST-45/flaA 22/porA 53 (n=3) and ST-52/flaA 57/porA 905 (n=3) were most frequent. The successful isolation of C. jejuni from canine faecal samples collected from faecal bins provides evidence that Campylobacter spp. may survive outside the host for at least several hours despite requiring fastidious growth conditions in culture. The results show that dogs carry C. jejuni genotypes (ST-45, ST-50, ST-52 and ST-696) that have been reported in human clinical cases. Although these results do not provide any evidence either for the direction of infection or for dogs being a potential risk factor for human campylobacteriosis, dog owners are advised to practice good hygiene with respect to their pets to reduce potential exposure to infection.
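
    The prevalence estimates and 95% confidence intervals quoted above can be approximated with a short calculation. The abstract does not say which interval method the authors used; the sketch below assumes the Wilson score interval, and the function name is hypothetical.

```python
import math


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 for ~95%)."""
    p = successes / n
    denom = 1 + z ** 2 / n
    centre = (p + z ** 2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    return centre - half_width, centre + half_width


# Counts taken from the abstract: 72 and 22 positives out of 355 samples.
for label, positives in (("Campylobacter spp.", 72), ("C. jejuni", 22)):
    low, high = wilson_interval(positives, 355)
    print(f"{label}: {positives / 355:.1%} (95% CI {low:.1%} to {high:.1%})")
```

    With these counts the Wilson interval rounds to the same whole-percent bounds as those reported, although agreement at that precision does not establish which method was actually applied.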

  16. Multilocus Variable-Number Tandem-Repeat Analysis, Pulsed-Field Gel Electrophoresis, and Antimicrobial Susceptibility Patterns in Discrimination of Sporadic and Outbreak-Related Strains of Yersinia enterocolitica

    Directory of Open Access Journals (Sweden)

    Skurnik Mikael

    2011-02-01

    Full Text Available Abstract Background: We assessed the potential of multilocus variable-number tandem-repeat analysis (MLVA), pulsed-field gel electrophoresis (PFGE), and antimicrobial susceptibility testing for discriminating 104 sporadic and outbreak-related Yersinia enterocolitica (YE) bio/serotype 3-4/O:3 and 2/O:9 isolates. MLVA using six VNTR markers was performed in two separate multiplex PCRs, and the fluorescently labeled PCR products were accurately sized on an automated DNA sequencer. Results: MLVA discriminated 82 sporadic YE 3-4/O:3 and 2/O:9 strains into 77 types, whereas PFGE with the restriction enzyme NotI discriminated the strains into 23 different PFGE pulsotypes. The discriminatory index for a sporadic strain was 0.862 for PFGE and 0.999 for MLVA. MLVA confirmed that a foodborne outbreak in the city of Kotka, Finland in 2003 had been caused by a multiresistant YE 4/O:3 strain that was distinctly different from those of epidemiologically unrelated strains with an identical PFGE pulsotype. The multiresistance of Y. enterocolitica strains (19% of the sporadic strains) correlated significantly (p = 0.002) with travel abroad. All of the multiresistant Y. enterocolitica strains belonged to four PFGE pulsotypes that did not contain any susceptible strains. Resistance to nalidixic acid was related to changes in codons 83 or 87 that stemmed from mutations in the gyrA gene. The conjugation experiments demonstrated that resistance to CHL, STR, and SUL was carried by a conjugative plasmid. Conclusions: MLVA using six loci had better discriminatory power than PFGE with the NotI enzyme. MLVA was also a less labor-intensive method than PFGE and the results were easier to analyze. The conjugation experiments demonstrated that a resistance plasmid can easily be transferred between Y. enterocolitica strains. Antimicrobial multiresistance of Y. enterocolitica strains was significantly associated with travel abroad.
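
    A discriminatory index such as the one reported above is commonly computed with the Hunter-Gaston formulation of Simpson's index of diversity, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where n_j is the number of strains of type j and N the total number of strains. The abstract does not name the formula used, so the sketch below is an assumption; the function name and the type labels are hypothetical.

```python
from collections import Counter


def discriminatory_index(type_labels):
    """Hunter-Gaston discriminatory index for a list of per-strain type labels.

    Gives the probability that two strains drawn at random without
    replacement are assigned to different types.
    """
    n = len(type_labels)
    if n < 2:
        raise ValueError("at least two strains are required")
    counts = Counter(type_labels).values()
    return 1 - sum(c * (c - 1) for c in counts) / (n * (n - 1))


# Made-up example: four strains, two of which share a type.
print(round(discriminatory_index(["T1", "T1", "T2", "T3"]), 3))
```

    A method that places every strain in its own type scores 1.0, and one that cannot separate any strains scores 0.0, which is why a typing scheme resolving 82 strains into 77 types approaches 1.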

  17. DNA sequence analysis of X-ray induced Adh null mutations in Drosophila melanogaster

    International Nuclear Information System (INIS)

    Mahmoud, J.; Fossett, N.G.; Arbour-Reily, P.; McDaniel, M.; Tucker, A.; Chang, S.H.; Lee, W.R.

    1991-01-01

    The mutational spectrum for 28 X-ray induced mutations and 2 spontaneous mutations, previously determined by genetic and cytogenetic methods, consisted of 20 multilocus deficiencies (19 induced and 1 spontaneous) and 10 intragenic mutations (9 induced and 1 spontaneous). One of the X-ray induced intragenic mutations was lost, and another was determined to be a recombinant with the allele used in the recovery scheme. The DNA sequence of two X-ray induced intragenic mutations has been published. This paper reports the results of DNA sequence analysis of the remaining intragenic mutations and a summary of the X-ray induced mutational spectrum. The combination of DNA sequence analysis with genetic complementation analysis shows a continuous distribution in size of deletions rather than two different types of mutations consisting of deletions and 'point mutations'. Sequencing is shown to be essential for detecting intragenic deletions. Of particular importance for future studies is the observation that all of the intragenic deletions consist of a direct repeat adjacent to the breakpoint with one of the repeats deleted.

  18. Robustness analysis of chiller sequencing control

    International Nuclear Information System (INIS)

    Liao, Yundan; Sun, Yongjun; Huang, Gongsheng

    2015-01-01

    Highlights: • Uncertainties with chiller sequencing control were systematically quantified. • Robustness of chiller sequencing control was systematically analyzed. • Different sequencing control strategies were sensitive to different uncertainties. • A numerical method was developed for easy selection of chiller sequencing control. Abstract: A multiple-chiller plant is commonly employed in heating, ventilating and air-conditioning systems to increase operational feasibility and energy efficiency under part-load conditions. In a multiple-chiller plant, chiller sequencing control plays a key role in achieving overall energy efficiency without sacrificing the cooling sufficiency needed for indoor thermal comfort. Various sequencing control strategies have been developed and implemented in practice. Based on the observations that (i) uncertainty, which cannot be avoided in chiller sequencing control, has a significant impact on control performance and may cause the control to fail to achieve the expected control and/or energy performance, and (ii) few studies in the current literature have systematically addressed this issue, this paper presents a robustness analysis of chiller sequencing control in order to understand the robustness of various chiller sequencing control strategies under different types of uncertainty. Based on the robustness analysis, a simple and applicable method is developed to select the most robust control strategy for a given chiller plant in the presence of uncertainties, which will be verified using case studies.

  19. Examination of X chromosome markers in Rett syndrome: Exclusion mapping with a novel variation on multilocus linkage analysis

    Energy Technology Data Exchange (ETDEWEB)

    Ellison, K.A.; Fill, C.P. (Baylor College of Medicine, Houston, TX (United States)); Terwililger, J.; Percy, A.K.; Zobhbi, H. (Columbia University, NY (United States)); DeGennaro, L.J.; Ott, J. (University of Massachusetts Medical School, Worcester (United States)); Anvret, M.; Martin-Gallardo, A. (National Institutes of Health, Bethesda, MD (United States))

    1992-02-01

    Rett syndrome is a neurologic disorder characterized by early normal development followed by regression, acquired deceleration of head growth, autism, ataxia, and stereotypic hand movements. The exclusive occurrence of the syndrome in females and the occurrence of a few familial cases with inheritance through maternal lines suggest that this disorder is most likely secondary to a mutation on the X chromosome. To address this hypothesis and to identify candidate regions for the Rett syndrome gene locus, genotypic analysis was performed in two families with maternally related affected half-sisters by using 63 DNA markers from the X chromosome. Nineteen of the loci studied were chosen for multipoint linkage analysis because they have been previously genetically mapped using a large number of meioses from reference families. Using the exclusion criterion of a lod score less than -2, the authors were able to exclude the region between the Duchenne muscular dystrophy locus and the DXS456 locus. This region extends from Xp21.2 to Xq21-q23. The use of the multipoint linkage analysis approach outlined in this study should allow the exclusion of additional regions of the X chromosome as new markers are analyzed.
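
    The exclusion criterion in the record above rests on the lod score, the base-10 logarithm of the likelihood of the data at a given recombination fraction θ relative to the likelihood under free recombination (θ = 0.5). The study used a multipoint method that is not reproduced here; the following is only a simplified two-point sketch for phase-known meioses, and the function name and counts are hypothetical.

```python
import math


def two_point_lod(recombinants, meioses, theta):
    """Two-point lod score for phase-known, fully informative meioses.

    lod(theta) = log10( theta^k * (1 - theta)^(n - k) / 0.5^n ),
    with k recombinants observed among n meioses.
    """
    if not 0.0 < theta <= 0.5:
        raise ValueError("theta must lie in (0, 0.5]")
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must lie between 0 and meioses")
    k, n = recombinants, meioses
    log_l_theta = k * math.log10(theta) + (n - k) * math.log10(1.0 - theta)
    log_l_null = n * math.log10(0.5)
    return log_l_theta - log_l_null


# Hypothetical counts: 4 recombinants in 10 meioses argue against tight linkage.
score = two_point_lod(recombinants=4, meioses=10, theta=0.01)
print(score < -2.0)  # True means theta = 0.01 is excluded under the lod < -2 rule
```

    Evaluating the score over a grid of θ values marks out the interval around a marker in which a disease locus can be excluded.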

  20. Probabilistic accident sequence recovery analysis

    International Nuclear Information System (INIS)

    Stutzke, Martin A.; Cooper, Susan E.

    2004-01-01

    Recovery analysis is a method that considers alternative strategies for preventing accidents in nuclear power plants during probabilistic risk assessment (PRA). Consideration of possible recovery actions in PRAs has been controversial, and there seems to be a widely held belief among PRA practitioners, utility staff, plant operators, and regulators that the results of recovery analysis should be skeptically viewed. This paper provides a framework for discussing recovery strategies, thus lending credibility to the process and enhancing regulatory acceptance of PRA results and conclusions. (author)

  1. Multi-locus variable-number tandem repeat analysis of Chinese Brucella strains isolated from 1953 to 2013.

    Science.gov (United States)

    Tian, Guo-Zhong; Cui, Bu-Yun; Piao, Dong-Ri; Zhao, Hong-Yan; Li, Lan-Yu; Liu, Xi; Xiao, Pei; Zhao, Zhong-Zhi; Xu, Li-Qing; Jiang, Hai; Li, Zhen-Jun

    2017-05-02

    Brucellosis is a common human and livestock disease caused by Brucella strains, which are category B priority pathogens according to the US Centers for Disease Control and Prevention (CDC). Brucellosis has been identified as a priority disease in human and livestock populations, and its increasing incidence in China in recent years calls for urgent control measures, but the molecular background important for monitoring the epidemiology of Brucella strains at the national level is still lacking. A total of 600 Brucella isolates collected during 60 years (from 1953 to 2013) in China were genotyped by multiple locus variable-number tandem repeat analysis (MLVA), and the degree of variation at the MLVA11 loci was expressed as Hunter-Gaston Diversity Index (HGDI) values. The charts and map were produced with Excel 2013, and cluster analysis and epidemiological distribution were performed using BioNumerics (version 5.1). The 600 representative Brucella isolates fell into 104 genotypes, including 58 singleton genotypes, by the MLVA11 assay, comprising B. melitensis biovars 2 and 3 (five main genotypes), B. abortus biovars 1 and 3 (two main genotypes), B. suis biovars 1 and 3 (three main genotypes), and B. canis (two main genotypes). While most B. suis biovar 1 and biovar 3 strains were found in northern and southern provinces, respectively, B. melitensis and B. abortus strains were dominant in China. Canine brucellosis was found only in animals, without any human cases reported. Eight brucellosis epidemic peaks emerged in China during the 60 years between 1953 and 2013: 1955-1959, 1962-1969, 1971-1975, 1977-1983, 1985-1989, 1992-1997, 2000-2008 and 2010-2013. According to MLVA, brucellosis has unique molecular epidemiological patterns with specific spatial and temporal distributions.

  2. Electrophoretic multilocus analysis for the study of natural populations of the Mediterranean fruit fly, Ceratitis capitata (Wied.)

    International Nuclear Information System (INIS)

    Gasperi, G.; Malacrida, A.R.; Milani, R.; Guglielmino, C.R.

    1990-01-01

    Data concerning spatial and/or temporal variation among 29 samples of four populations of the Mediterranean fruit fly (medfly), Ceratitis capitata (Wiedemann) were obtained by computation of gene frequency values at 25 biochemical loci. The four populations came from Africa (Kenya and Reunion) and from the Mediterranean basin (Sardinia and Procida Island). Statistical parameters of genetic variation included average heterozygosity per locus, proportion of polymorphic loci and average number of alleles per locus. The data were analysed using Principal Component Analysis and Wright's fixation index. Significant differences in genetic heterogeneity were observed on a regional scale in relation to the dispersion of the fly from its supposed area of origin (East Africa) towards the periphery (Mediterranean region). The samples from Procida, collected at different seasons for four consecutive years (1983-1986), provided consistent indications of temporal changes in the genetic structure of this population, and permitted evaluation of the efficiency of a sterilized male strain (T-101) released during a sterile insect technique programme on Procida in 1986. (author). 9 refs, 1 fig., 1 tab
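
    Two of the statistics named in the record above, average heterozygosity and Wright's fixation index, can be written down compactly for a single locus. The record does not give the exact estimators used, so the following sketch assumes the simplest textbook forms (expected heterozygosity H = 1 - Σ p_i², and F_ST = (H_T - H_S) / H_T with equally weighted subpopulations); the function names and allele frequencies are hypothetical.

```python
def expected_heterozygosity(allele_freqs):
    """Expected heterozygosity at one locus: 1 minus the sum of squared frequencies."""
    return 1.0 - sum(p * p for p in allele_freqs)


def fixation_index(subpop_freqs):
    """Wright's F_ST for one locus from per-subpopulation allele frequencies.

    subpop_freqs is a list of frequency lists, one per subpopulation, all
    listing the same alleles in the same order. Subpopulations are weighted
    equally, which is an assumption of this sketch.
    """
    n_pops = len(subpop_freqs)
    n_alleles = len(subpop_freqs[0])
    # H_S: mean expected heterozygosity within subpopulations.
    h_s = sum(expected_heterozygosity(f) for f in subpop_freqs) / n_pops
    # H_T: expected heterozygosity of the pooled (mean) allele frequencies.
    mean_freqs = [sum(f[i] for f in subpop_freqs) / n_pops for i in range(n_alleles)]
    h_t = expected_heterozygosity(mean_freqs)
    if h_t == 0.0:
        raise ValueError("locus is monomorphic across all subpopulations")
    return (h_t - h_s) / h_t


# Hypothetical biallelic locus in two populations with divergent frequencies.
print(round(fixation_index([[0.2, 0.8], [0.8, 0.2]]), 2))
```

    Identical allele frequencies in all subpopulations give an index of 0, and the value rises as the populations diverge.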

  3. Prevalence of Chlamydia trachomatis Genotypes in Men Who Have Sex with Men and Men Who Have Sex with Women Using Multilocus VNTR Analysis-ompA Typing in Guangzhou, China.

    Directory of Open Access Journals (Sweden)

    Xiaolin Qin

    Full Text Available Chlamydia trachomatis is one of the most prevalent bacterial sexually transmitted infections in China. Although C. trachomatis genotypes can be discriminated by outer membrane protein gene (ompA) sequencing, currently available methods have limited resolution. This study used a high-resolution genotyping method, namely, multilocus variable number tandem-repeat analysis with ompA sequencing (MLVA-ompA), to investigate the local epidemiology of C. trachomatis infections among men who have sex with men (MSM) and men who have sex with women (MSW) attending a sexually transmitted diseases (STD) clinic in Guangzhou, China. Rectal specimens from MSM and urethral specimens from MSW were collected between January 2013 and July 2014 at the Guangdong Provincial Center STD clinic. The specimens were sent to the laboratory for analyses. All specimens that tested positive for C. trachomatis by the commercial nucleic acid amplification tests were genotyped by MLVA-ompA. Fifty-one rectal specimens from MSM and 96 urethral specimens from MSW were identified with C. trachomatis. One hundred and forty-four of the 147 specimens were fully genotyped by MLVA-ompA. Rectal specimens from MSM were divided into four ompA genotypes and urethral specimens from MSW into nine genotypes. No mixed infections were found among all specimens. The most frequent genotypes were D, G, J, E and F. All specimens were further divided into 46 types after ompA genotyping was combined with MLVA. Genotypes D-8.7.1 and G-3.4a.3 were the most frequent among MSM, whereas genotypes D-3.4a.4, E-8.5.1, F-8.5.1, and J-3.4a.2 were the most frequent subtypes among MSW. The discriminatory index D was 0.90 for MLVA, 0.85 for ompA, and 0.95 for MLVA-ompA. The most prevalent MLVA-ompA genotypes were significantly different between MSM and MSW from Guangzhou, China. Moreover, MLVA-ompA represented a more favorable degree of discrimination than ompA and could be a reliable complement for ompA for the routine

  4. New Multilocus Variable-Number Tandem-Repeat Analysis (MLVA) Scheme for Fine-Scale Monitoring and Microevolution-Related Study of Ralstonia pseudosolanacearum Phylotype I Populations

    Science.gov (United States)

    Guinard, Jérémy; Latreille, Anne; Guérin, Fabien; Poussier, Stéphane

    2016-01-01

    ABSTRACT Bacterial wilt caused by the Ralstonia solanacearum species complex (RSSC) is considered one of the most harmful plant diseases in the world. Special attention should be paid to R. pseudosolanacearum phylotype I due to its large host range, its worldwide distribution, and its high evolutionary potential. So far, the molecular epidemiology and population genetics of this bacterium are poorly understood. Until now, the genetic structure of the RSSC has been analyzed on the worldwide and regional scales. Emerging questions regarding evolutionary forces in RSSC adaptation to hosts now require genetic markers that are able to monitor RSSC field populations. In this study, we aimed to evaluate the multilocus variable-number tandem-repeat analysis (MLVA) approach for its ability to discriminate genetically close phylotype I strains and for population genetics studies. We developed a new MLVA scheme (MLVA-7) allowing us to genotype 580 R. pseudosolanacearum phylotype I strains extracted from susceptible and resistant hosts and from different habitats (stem, soil, and rhizosphere). Based on specificity, polymorphism, and the amplification success rate, we selected seven fast-evolving variable-number tandem-repeat (VNTR) markers. The newly developed MLVA-7 scheme showed higher discriminatory power than the previously published MLVA-13 scheme when applied to collections sampled from the same location on different dates and to collections from different locations on very small scales. Our study provides a valuable tool for fine-scale monitoring and microevolution-related study of R. pseudosolanacearum phylotype I populations. IMPORTANCE Understanding the evolutionary dynamics of adaptation of plant pathogens to new hosts or ecological niches has become a key point for the development of innovative disease management strategies, including durable resistance. 
Whereas the molecular mechanisms underlying virulence or pathogenicity changes have been studied thoroughly, the

  5. Genetic relationships between clinical and non-clinical strains of Yersinia enterocolitica biovar 1A as revealed by multilocus enzyme electrophoresis and multilocus restriction typing

    Directory of Open Access Journals (Sweden)

    Virdi Jugsharan S

    2010-05-01

    Full Text Available Abstract Background Genetic relationships among 81 strains of Y. enterocolitica biovar 1A isolated from clinical and non-clinical sources were discerned by multilocus enzyme electrophoresis (MLEE) and multilocus restriction typing (MLRT) using six loci each. Such studies may reveal associations between the genotypes of the strains and their sources of isolation. Results All loci were polymorphic and generated 62 electrophoretic types (ETs) and 12 restriction types (RTs). The mean genetic diversity (H) of the strains by MLEE and MLRT was 0.566 and 0.441 respectively. MLEE (DI = 0.98) was more discriminatory and clustered Y. enterocolitica biovar 1A strains into four groups, while MLRT (DI = 0.77) identified two distinct groups. BURST (Based Upon Related Sequence Types) analysis of the MLRT data suggested aquatic serotype O:6,30-6,31 isolates to be the ancestral strains from which clinical O:6,30-6,31 strains might have originated by host adaptation and genetic change. Conclusion MLEE revealed greater genetic diversity among strains of Y. enterocolitica biovar 1A and clustered strains in four groups, while MLRT grouped the strains into two groups. BURST analysis of MLRT data nevertheless provided newer insights into the probable evolution of clinical strains from aquatic strains.

  6. Sequence analysis by iterated maps, a review.

    Science.gov (United States)

    Almeida, Jonas S

    2014-05-01

    Among alignment-free methods, Iterated Maps (IMs) are on a particular extreme: they are also scale free (order free). The use of IMs for sequence analysis is also distinct from other alignment-free methodologies in being rooted in statistical mechanics instead of computational linguistics. Both of these roots go back over two decades to the use of fractal geometry in the characterization of phase-space representations. The time series analysis origin of the field is betrayed by the title of the manuscript that started this alignment-free subdomain in 1990, 'Chaos Game Representation'. The clash between the analysis of sequences as continuous series and the better established use of Markovian approaches to discrete series was almost immediate, with a defining critique published in the same journal 2 years later. The rest of that decade would go by before the scale-free nature of the IM space was uncovered. The ensuing decade saw this scalability generalized for non-genomic alphabets as well as an interest in its use for graphic representation of biological sequences. Finally, in the past couple of years, in step with the emergence of BigData and MapReduce as a new computational paradigm, there is a surprising third act in the IM story. Multiple reports have described gains in computational efficiency of multiple orders of magnitude over more conventional sequence analysis methodologies. The stage appears to be now set for a recasting of IMs with a central role in processing nextgen sequencing results.
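
    The Chaos Game Representation mentioned in the record above is the simplest iterated map for a DNA sequence: starting from the centre of the unit square, each nucleotide moves the current point halfway toward the corner assigned to that nucleotide. The sketch below is a minimal illustration, not code from the review; the corner assignment is one common convention and is an assumption here, as are the function and variable names.

```python
# Assumed corner layout for the unit square (one common convention).
CORNERS = {"A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "T": (1.0, 0.0)}


def chaos_game_representation(sequence):
    """Return the list of CGR coordinates visited while reading a DNA sequence.

    Each step moves the current point halfway toward the corner of the
    nucleotide just read, so the i-th point encodes the whole prefix up to i.
    Symbols other than A, C, G, T raise a KeyError.
    """
    x, y = 0.5, 0.5  # start at the centre of the square
    points = []
    for base in sequence.upper():
        corner_x, corner_y = CORNERS[base]
        x, y = (x + corner_x) / 2.0, (y + corner_y) / 2.0
        points.append((x, y))
    return points


print(chaos_game_representation("GT"))
```

    Because every step halves the distance to a corner, sequences sharing a suffix of length k land in the same sub-square of side 2^-k, which is the scale-free property the review refers to.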

  7. Preliminary hazard analysis using sequence tree method

    International Nuclear Information System (INIS)

    Huang Huiwen; Shih Chunkuan; Hung Hungchih; Chen Minghuei; Yih Swu; Lin Jiinming

    2007-01-01

    A system-level PHA using the sequence tree method was developed to perform SSA of safety-related digital I&C systems. The conventional PHA is a brainstorming session among experts on various portions of the system to identify hazards through discussions. However, the conventional PHA is not a systematic technique: the analysis results depend strongly on the experts' subjective opinions, and the analysis quality cannot be appropriately controlled. This research therefore developed a system-level, sequence-tree-based PHA, which can clarify the relationships among the major digital I&C systems. The technique comprises two major phases. The first phase uses a table to analyze each event in SAR Chapter 15 for a specific safety-related I&C system, such as the RPS. The second phase uses a sequence tree to recognize which I&C systems are involved in the event, how the safety-related systems work, and how the backup systems can be activated to mitigate the consequences if the primary safety systems fail. In the sequence tree, the defense-in-depth echelons, including the Control echelon, Reactor trip echelon, ESFAS echelon, and Indication and display echelon, are arranged to construct the tree structure. All the related I&C systems, including the digital systems and the analog back-up systems, are allocated to their specific echelons. With this system-centric, sequence-tree-based analysis, not only can preliminary hazards be identified systematically, but the vulnerability of the nuclear power plant can also be recognized. Therefore, an effective simplified D3 evaluation can be performed as well. (author)

  8. Multilocus genotype (MLG) analysis of Giardia from captive wildlife in Chengdu zoo

    Institute of Scientific and Technical Information of China (English)

    李威; 彭广能; 屈羽; 钟志军; 杨平; 李云娇; 王吴优; 刘学涵; 谢娜; 邓家波

    2017-01-01

    In order to investigate the infection and genotypes of Giardia in different wildlife in Chengdu zoo, a total of 146 fresh fecal samples were collected and their genomic DNA was extracted. The β-giardin, tpi and gdh genes were amplified by nested PCR, and the products were sequenced and subjected to phylogenetic analysis. As a result, two Persian fallow deer (CDZOO1 and CDZOO4), a deer (CDZOO2), a tortoise (CDZOO3), a raccoon (CDZOO5) and a meerkat (CDZOO6) were found to be infected with Giardia. Multilocus genotypes (MLGs) identified assemblage AI-1 in CDZOO1 and CDZOO3; CDZOO2 was infected with assemblage A at both the β-giardin and tpi loci; CDZOO4 was confirmed as assemblage A at the tpi locus; CDZOO5 and CDZOO6 were identified as assemblage D at the β-giardin locus but assemblage A at the tpi locus.

  9. Digital image sequence processing, compression, and analysis

    CERN Document Server

    Reed, Todd R

    2004-01-01

    INTRODUCTION: Todd R. Reed; CONTENT-BASED IMAGE SEQUENCE REPRESENTATION: Pedro M. Q. Aguiar, Radu S. Jasinschi, José M. F. Moura, and Charnchai Pluempitiwiriyawej; THE COMPUTATION OF MOTION: Christoph Stiller, Sören Kammel, Jan Horn, and Thao Dang; MOTION ANALYSIS AND DISPLACEMENT ESTIMATION IN THE FREQUENCY DOMAIN: Luca Lucchese and Guido Maria Cortelazzo; QUALITY OF SERVICE ASSESSMENT IN NEW GENERATION WIRELESS VIDEO COMMUNICATIONS: Gaetano Giunta; ERROR CONCEALMENT IN DIGITAL VIDEO: Francesco G.B. De Natale; IMAGE SEQUENCE RESTORATION: A WIDER PERSPECTIVE: Anil Kokaram; VIDEO SUMMARIZATION: Cuneyt M. Taskiran and Edward

  10. PseudoMLSA: a database for multigenic sequence analysis of Pseudomonas species

    Directory of Open Access Journals (Sweden)

    Lalucat Jorge

    2010-04-01

    Full Text Available Abstract Background The genus Pseudomonas comprises more than 100 species of environmental, clinical, agricultural, and biotechnological interest. Although the recommended method for discriminating bacterial species is DNA-DNA hybridisation, alternative techniques based on multigenic sequence analysis are becoming common practice in bacterial species discrimination studies. Since there is no general criterion for determining which genes are more useful for species resolution, the number of strains and genes analysed is increasing continuously. As a result, sequences of different genes are dispersed throughout several databases. This sequence information needs to be collected in a common database in order to be useful for future identification-based projects. Description The PseudoMLSA Database is a comprehensive database of multiple gene sequences from strains of Pseudomonas species. The core of the database is composed of selected gene sequences from all Pseudomonas type strains validly assigned to the genus through 2008. The database is intended to be useful for MultiLocus Sequence Analysis (MLSA) procedures, for the identification and characterisation of any Pseudomonas bacterial isolate. The sequences are available for download via a direct connection to the National Center for Biotechnology Information (NCBI). Additionally, the database includes an online BLAST interface for flexible nucleotide queries and similarity searches with the user's datasets, and provides a user-friendly output for easily parsing, navigating, and analysing BLAST results. Conclusions The PseudoMLSA database amasses strains and sequence information of validly described Pseudomonas species, and allows free querying of the database via a user-friendly, web-based interface available at http://www.uib.es/microbiologiaBD/Welcome.html. The web-based platform enables easy retrieval at strain or gene sequence information level; including references to published peer

  11. Multilocus genotyping of Giardia duodenalis in captive non-human primates in Sichuan and Guizhou provinces, Southwestern China.

    Directory of Open Access Journals (Sweden)

    Zhijun Zhong

    Full Text Available Giardia duodenalis is a common human and animal pathogen. It has been increasingly reported in wild and captive non-human primates (NHPs) in recent years. However, multilocus genotyping information for G. duodenalis infecting NHPs in southwestern China is limited. In the present study, the prevalence and multilocus genotypes (MLGs) of G. duodenalis in captive NHPs in southwestern China were determined. We examined 207 fecal samples from NHPs in Sichuan and Guizhou provinces, and 16 specimens were positive for G. duodenalis. The overall infection rate was 7.7%, and only assemblage B was identified. G. duodenalis was detected in northern white-cheeked gibbons (14/36, 38.9%), crab-eating macaques (1/60, 1.7%) and rhesus macaques (1/101, 0.9%). Multilocus sequence typing based on beta-giardin (bg), triose phosphate isomerase (tpi) and glutamate dehydrogenase (gdh) revealed nine different assemblage B MLGs (five known genotypes and four novel genotypes). Based on a phylogenetic analysis, one potentially zoonotic genotype, MLG SW7, was identified in a northern white-cheeked gibbon. A high degree of genetic diversity within assemblage B was observed in captive northern white-cheeked gibbons in Southwestern China, including a potentially zoonotic genotype, MLG SW7. To the best of our knowledge, this is the first report using an MLG approach to identify G. duodenalis in captive NHPs in Southwestern China.

  12. Sequence comparison and phylogenetic analysis of core gene of ...

    African Journals Online (AJOL)

    Phylogenetic analysis suggests that our sequences are clustered with sequences reported from Japan. This is the first phylogenetic analysis of HCV core gene from Pakistani population. Our sequences and sequences from Japan are grouped into same cluster in the phylogenetic tree. Sequence comparison and ...

  13. [Complete genome sequencing and sequence analysis of BCG Tice].

    Science.gov (United States)

    Wang, Zhiming; Pan, Yuanlong; Wu, Jun; Zhu, Baoli

    2012-10-04

    The objective of this study is to obtain the complete genome sequence of Bacillus Calmette-Guerin Tice (BCG Tice), in order to provide more information about the molecular biology of BCG Tice and to design more reasonable vaccines to prevent tuberculosis. We assembled the data from high-throughput sequencing with SOAPdenovo software and obtained many contigs and scaffolds. Many sequence gaps and physical gaps remained as a result of regional low coverage and low quality. We designed primers at the ends of contigs and performed PCR amplification in order to link these contigs and scaffolds. By using various enzymes for PCR amplification, adjusting the PCR reaction conditions, and combining these with clone construction for sequencing, all the gaps were closed. We obtained the complete genome sequence of BCG Tice and submitted it to GenBank of the National Center for Biotechnology Information (NCBI). The genome of BCG Tice is 4334064 base pairs in length, with a GC content of 65.65%. The problems and strategies during the finishing step of BCG Tice sequencing are described here, with the hope of offering some experience to those who are involved in the finishing step of genome sequencing. The microarray data were verified by our results.
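
    The GC content reported in the record above is the percentage of G and C bases among the called bases of the finished sequence. As a minimal sketch of that arithmetic (the function name is hypothetical, and ignoring ambiguous symbols such as N is an assumption of this example rather than something stated in the record):

```python
def gc_content(sequence):
    """Percentage of G and C among the unambiguous bases (A, C, G, T) of a sequence."""
    called = [base for base in sequence.upper() if base in "ACGT"]
    if not called:
        raise ValueError("sequence contains no unambiguous bases")
    gc_count = sum(1 for base in called if base in "GC")
    return 100.0 * gc_count / len(called)


# Hypothetical short fragment, not taken from the BCG Tice genome.
print(round(gc_content("GGCCATNN"), 2))
```

    Applied to a complete genome read from a FASTA file, the same calculation yields the single summary figure quoted in the abstract.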

  14. OTU analysis using metagenomic shotgun sequencing data.

    Directory of Open Access Journals (Sweden)

    Xiaolin Hao

    Full Text Available Because of technological limitations, the primer and amplification biases in targeted sequencing of 16S rRNA genes have veiled the true microbial diversity underlying environmental samples. However, the protocol of metagenomic shotgun sequencing provides 16S rRNA gene fragment data with natural immunity against the biases raised during priming and thus the potential of uncovering the true structure of microbial community by giving more accurate predictions of operational taxonomic units (OTUs. Nonetheless, the lack of statistically rigorous comparison between 16S rRNA gene fragments and other data types makes it difficult to interpret previously reported results using 16S rRNA gene fragments. Therefore, in the present work, we established a standard analysis pipeline that would help confirm if the differences in the data are true or are just due to potential technical bias. This pipeline is built by using simulated data to find optimal mapping and OTU prediction methods. The comparison between simulated datasets revealed a relationship between 16S rRNA gene fragments and full-length 16S rRNA sequences that a 16S rRNA gene fragment having a length >150 bp provides the same accuracy as a full-length 16S rRNA sequence using our proposed pipeline, which could serve as a good starting point for experimental design and making the comparison between 16S rRNA gene fragment-based and targeted 16S rRNA sequencing-based surveys possible.

  15. Sequence Matching Analysis for Curriculum Development

    Directory of Open Access Journals (Sweden)

    Liem Yenny Bendatu

    2015-06-01

    Full Text Available Many organizations apply information technologies to support their business processes. Using these technologies, actual events are recorded and checked for conformance with a predefined model. Conformance checking is an approach to measuring the fitness and appropriateness between a process model and the actual events. However, when there are multiple events with the same timestamp, the traditional approach is unfit to produce such measures. This study attempts to develop a sequence matching analysis. Taking conformance checking as its basis, the proposed approach utilizes the current control flow technique in the process mining domain. A case study in the field of educational processes has been conducted. This study also proposes a curriculum analysis framework to test the proposed approach. By considering the learning sequence of students, it yields some measurements for curriculum development. Finally, the result of the proposed approach has been verified by relevant instructors for further development.

  16. Analysis of Pteridium ribosomal RNA sequences by rapid direct sequencing.

    Science.gov (United States)

    Tan, M K

    1991-08-01

    A total of 864 bases from 5 regions interspersed in the 18S and 26S rRNA molecules from various clones of Pteridium covering the general geographical distribution of the genus was analysed using a rapid rRNA sequencing technique. No base difference has been detected amongst the three major lineages, two of which apparently separated before the breakup of the ancient supercontinent, Pangaea. These regions of the rRNA sequences have thus been conserved for at least 160 million years and are here compared with other eukaryotic, especially plant rRNAs.

  17. My-Forensic-Loci-queries (MyFLq) framework for analysis of forensic STR data generated by massive parallel sequencing.

    Science.gov (United States)

    Van Neste, Christophe; Vandewoestyne, Mado; Van Criekinge, Wim; Deforce, Dieter; Van Nieuwerburgh, Filip

    2014-03-01

    Forensic scientists are currently investigating how to transition from capillary electrophoresis (CE) to massive parallel sequencing (MPS) for analysis of forensic DNA profiles. MPS offers several advantages over CE such as virtually unlimited multiplexy of loci, combining both short tandem repeat (STR) and single nucleotide polymorphism (SNP) loci, small amplicons without constraints of size separation, more discrimination power, deep mixture resolution and sample multiplexing. We present our bioinformatic framework My-Forensic-Loci-queries (MyFLq) for analysis of MPS forensic data. For allele calling, the framework uses a MySQL reference allele database with automatically determined regions of interest (ROIs) by a generic maximal flanking algorithm which makes it possible to use any STR or SNP forensic locus. Python scripts were designed to automatically make allele calls starting from raw MPS data. We also present a method to assess the usefulness and overall performance of a forensic locus with respect to MPS, as well as methods to estimate whether an unknown allele, whose sequence is not present in the MySQL database, is in fact a new allele or a sequencing error. The MyFLq framework was applied to an Illumina MiSeq dataset of a forensic Illumina amplicon library, generated from multilocus STR polymerase chain reaction (PCR) on both single contributor samples and multiple person DNA mixtures. Although the multilocus PCR was not yet optimized for MPS in terms of amplicon length or locus selection, the results are excellent for most loci. They show a high signal-to-noise ratio, correct allele calls, and a low limit of detection for minor DNA contributors in mixed DNA samples. Technically, forensic MPS affords great promise for routine implementation in forensic genomics. The method is also applicable to adjacent disciplines such as molecular autopsy in legal medicine and in mitochondrial DNA research. Copyright © 2013 The Authors. Published by

  18. Whole-Genome Sequencing and Comparative Genome Analysis of Bacillus subtilis Strains Isolated from Non-Salted Fermented Soybean Foods.

    Directory of Open Access Journals (Sweden)

    Mayumi Kamada

    Bacillus subtilis is the main component in the fermentation of soybeans. To investigate the genetics of the soybean-fermenting B. subtilis strains and their relationship with the productivity of extracellular poly-γ-glutamic acid (γPGA), we sequenced the whole genome of eight B. subtilis strains isolated from non-salted fermented soybean foods in Southeast Asia. Assembled nucleotide sequences were compared with those of a natto (fermented soybean food) starter strain B. subtilis BEST195 and the laboratory standard strain B. subtilis 168 that is incapable of γPGA production. Detected variants were investigated in terms of insertion sequences, biotin synthesis, production of subtilisin NAT, and regulatory genes for γPGA synthesis, which are related to the fermentation process. Comparing genome sequences, we found that the strains that produce γPGA have a deletion in a protein that constitutes the flagellar basal body, and this deletion was not found in the non-producing strains. We further identified diversity in variants of the bio operon, which is responsible for the biotin auxotrophism of the natto starter strains. Phylogenetic analysis using multilocus sequencing typing revealed that the B. subtilis strains isolated from the non-salted fermented soybeans were not clustered together, while the natto-fermenting strains were tightly clustered; this analysis also suggested that the strain isolated from "Tua Nao" of Thailand traces a different evolutionary process from other strains.

  19. A multi-locus plastid phylogenetic analysis of the pantropical genus Diospyros (Ebenaceae), with an emphasis on the radiation and biogeographic origins of the New Caledonian endemic species

    OpenAIRE

    Duangjai, S.; Samuel, R.; Munzinger, Jérôme; Forest, F.; Wallnofer, B.; Barfuss, M.H.J.; Fischer, G.; Chase, M. W.

    2009-01-01

    We aimed to clarify phylogenetic relationships within the pantropical genus Diospyros (Ebenaceae sensu lato), and ascertain biogeographical patterns in the New Caledonian endemic species. We used DNA sequences from eight plastid regions (rbcL, atpB, matK, ndhF, trnK intron, trnL intron, trnL-trnF spacer, and trnS-trnG spacer) and included 149 accessions representing 119 Diospyros species in our analysis. Results from this study confirmed the monophyly of Diospyros with good support and provid...

  20. Development of a multilocus-based approach for sponge (phylum Porifera) identification: refinement and limitations.

    Science.gov (United States)

    Yang, Qi; Franco, Christopher M M; Sorokin, Shirley J; Zhang, Wei

    2017-02-02

    For sponges (phylum Porifera), there is no reliable molecular protocol available for species identification. To address this gap, we developed a multilocus-based Sponge Identification Protocol (SIP) validated by a sample of 37 sponge species belonging to 10 orders from South Australia. The universal barcode COI mtDNA, 28S rRNA gene (D3-D5), and the nuclear ITS1-5.8S-ITS2 region were evaluated for their suitability and capacity for sponge identification. The highest Bit Score was applied to infer the identity. The reliability of SIP was validated by phylogenetic analysis. The 28S rRNA gene and COI mtDNA performed better than the ITS region in classifying sponges at various taxonomic levels. A major limitation is that the databases are not well populated and possess low diversity, making it difficult to conduct the molecular identification protocol. The identification is also impacted by the accuracy of the morphological classification of the sponges whose sequences have been submitted to the database. Re-examination of the morphological identification further demonstrated and improved the reliability of sponge identification by SIP. Integrated with morphological identification, the multilocus-based SIP offers an improved protocol for more reliable and effective sponge identification, by coupling the accuracy of different DNA markers.

  1. FAST: FAST Analysis of Sequences Toolbox

    Directory of Open Access Journals (Sweden)

    Travis J. Lawrence

    2015-05-01

    FAST (FAST Analysis of Sequences Toolbox) provides simple, powerful open source command-line tools to filter, transform, annotate and analyze biological sequence data. Modeled after the GNU (GNU’s Not Unix) Textutils such as grep, cut, and tr, FAST tools such as fasgrep, fascut, and fastr make it easy to rapidly prototype expressive bioinformatic workflows in a compact and generic command vocabulary. Compact combinatorial encoding of data workflows with FAST commands can simplify the documentation and reproducibility of bioinformatic protocols, supporting better transparency in biological data science. Interface self-consistency and conformity with conventions of GNU, Matlab, Perl, BioPerl, R and GenBank help make FAST easy and rewarding to learn. FAST automates numerical, taxonomic, and text-based sorting, selection and transformation of sequence records and alignment sites based on content, index ranges, descriptive tags, annotated features, and in-line calculated analytics, including composition and codon usage. Automated content- and feature-based extraction of sites and support for molecular population genetic statistics makes FAST useful for molecular evolutionary analysis. FAST is portable, easy to install and secure thanks to the relative maturity of its Perl and BioPerl foundations, with stable releases posted to CPAN. Development as well as a publicly accessible Cookbook and Wiki are available on the FAST GitHub repository at https://github.com/tlawrence3/FAST. The default data exchange format in FAST is Multi-FastA (specifically, a restriction of BioPerl FastA format). Sanger and Illumina 1.8+ FastQ formatted files are also supported. FAST makes it easier for non-programmer biologists to interactively investigate and control biological data at the speed of thought.

  2. Bayesian Correlation Analysis for Sequence Count Data.

    Directory of Open Access Journals (Sweden)

    Daniel Sánchez-Taltavull

    Evaluating the similarity of different measured variables is a fundamental task of statistics, and a key part of many bioinformatics algorithms. Here we propose a Bayesian scheme for estimating the correlation between different entities' measurements based on high-throughput sequencing data. These entities could be different genes or miRNAs whose expression is measured by RNA-seq, different transcription factors or histone marks whose expression is measured by ChIP-seq, or even combinations of different types of entities. Our Bayesian formulation accounts for both measured signal levels and uncertainty in those levels, due to varying sequencing depth in different experiments and to varying absolute levels of individual entities, both of which affect the precision of the measurements. In comparison with a traditional Pearson correlation analysis, we show that our Bayesian correlation analysis retains high correlations when measurement confidence is high, but suppresses correlations when measurement confidence is low, especially for entities with low signal levels. In addition, we consider the influence of priors on the Bayesian correlation estimate. Perhaps surprisingly, we show that naive, uniform priors on entities' signal levels can lead to highly biased correlation estimates, particularly when different experiments have widely varying sequencing depths. However, we propose two alternative priors that provably mitigate this problem. We also prove that, like traditional Pearson correlation, our Bayesian correlation calculation constitutes a kernel in the machine learning sense, and thus can be used as a similarity measure in any kernel-based machine learning algorithm. We demonstrate our approach on two RNA-seq datasets and one miRNA-seq dataset.

  3. Major clades of Agaricales: a multilocus phylogenetic overview.

    Science.gov (United States)

    P. Brandon Matheny; Judd M. Curtis; Valerie Hofstetter; M. Catherine Aime; Jean-Marc Moncalvo; Zai-Wei Ge; Zhu-Liang Yang; Joseph F. Ammirati; Timothy J. Baroni; Neale L. Bougher; Karen W. Lodge Hughes; Richard W. Kerrigan; Michelle T. Seidl; Duur K. Aanen; Matthew DeNitis; Graciela M. Daniele; Dennis E. Desjardin; Bradley R. Kropp; Lorelei L. Norvell; Andrew Parker; Else C. Vellinga; Rytas Vilgalys; David S. Hibbett

    2006-01-01

    An overview of the phylogeny of the Agaricales is presented based on a multilocus analysis of a six-gene region supermatrix. Bayesian analyses of 5611 nucleotide characters of rpb1, rpb1-intron 2, rpb2 and 18S, 25S, and 5.8S ribosomal RNA genes recovered six major clades, which are recognized informally and labeled the Agaricoid, Tricholomatoid, Marasmioid, Pluteoid,...

  4. A basic analysis toolkit for biological sequences

    Directory of Open Access Journals (Sweden)

    Siragusa Enrico

    2007-09-01

    This paper presents a software library, nicknamed BATS, for some basic sequence analysis tasks: local alignments, via approximate string matching, and global alignments, via longest common subsequence and alignments with affine and concave gap cost functions. It also supports filtering operations to select strings from a set and establish their statistical significance, via z-score computation. None of the algorithms is new, but although they are generally regarded as fundamental for sequence analysis, they have not been implemented in a single and consistent software package, as we do here. Therefore, our main contribution is to fill this gap between algorithmic theory and practice by providing an extensible and easy to use software library that includes algorithms for the mentioned string matching and alignment problems. The library consists of C/C++ library functions as well as Perl library functions. It can be interfaced with BioPerl and can also be used as a stand-alone system with a GUI. The software is available at http://www.math.unipa.it/~raffaele/BATS/ under the GNU GPL.
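
    Two of the building blocks named in this abstract, longest common subsequence and z-score significance, can be sketched in a few lines. This is not the BATS API (the library itself is C/C++ and Perl); the function names are hypothetical, and the null model used here, shuffling the second sequence, is an assumption chosen for illustration.

```python
import random
import statistics


def lcs_length(a, b):
    """Length of the longest common subsequence, by dynamic programming."""
    prev = [0] * (len(b) + 1)
    for ch in a:
        curr = [0]
        for j, other in enumerate(b, start=1):
            if ch == other:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def lcs_z_score(a, b, shuffles=200, seed=0):
    """Z-score of the observed LCS length against shuffled versions of b."""
    rng = random.Random(seed)
    observed = lcs_length(a, b)
    letters = list(b)
    null = []
    for _ in range(shuffles):
        rng.shuffle(letters)
        null.append(lcs_length(a, "".join(letters)))
    spread = statistics.pstdev(null)
    if spread == 0:
        # Every shuffle scored the same, so no z-score can be formed.
        return 0.0
    return (observed - statistics.mean(null)) / spread
```

    Shuffling preserves the composition of b, so a large positive z-score suggests that the shared subsequence reflects letter order and not composition alone.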

  5. Whole genome sequence analysis of Mycobacterium suricattae

    KAUST Repository

    Dippenaar, Anzaan; Parsons, Sven David Charles; Sampson, Samantha Leigh; Van Der Merwe, Ruben Gerhard; Drewe, Julian Ashley; Abdallah, Abdallah; Siame, Kabengele Keith; Gey Van Pittius, Nicolaas Claudius; Van Helden, Paul David; Pain, Arnab; Warren, Robin Mark

    2015-01-01

    Tuberculosis occurs in various mammalian hosts and is caused by a range of different lineages of the Mycobacterium tuberculosis complex (MTBC). A recently described member, Mycobacterium suricattae, causes tuberculosis in meerkats (Suricata suricatta) in Southern Africa and preliminary genetic analysis showed this organism to be closely related to an MTBC pathogen of rock hyraxes (Procavia capensis), the dassie bacillus. Here we make use of whole genome sequencing to describe the evolution of the genome of M. suricattae, including known and novel regions of difference, SNPs and IS6110 insertion sites. We used genome-wide phylogenetic analysis to show that M. suricattae clusters with the chimpanzee bacillus, previously isolated from a chimpanzee (Pan troglodytes) in West Africa. We propose an evolutionary scenario for the Mycobacterium africanum lineage 6 complex, showing the evolutionary relationship of M. africanum and chimpanzee bacillus, and the closely related members M. suricattae, dassie bacillus and Mycobacterium mungi.

  6. Whole genome sequence analysis of Mycobacterium suricattae

    KAUST Repository

    Dippenaar, Anzaan

    2015-10-21

    Tuberculosis occurs in various mammalian hosts and is caused by a range of different lineages of the Mycobacterium tuberculosis complex (MTBC). A recently described member, Mycobacterium suricattae, causes tuberculosis in meerkats (Suricata suricatta) in Southern Africa and preliminary genetic analysis showed this organism to be closely related to an MTBC pathogen of rock hyraxes (Procavia capensis), the dassie bacillus. Here we make use of whole genome sequencing to describe the evolution of the genome of M. suricattae, including known and novel regions of difference, SNPs and IS6110 insertion sites. We used genome-wide phylogenetic analysis to show that M. suricattae clusters with the chimpanzee bacillus, previously isolated from a chimpanzee (Pan troglodytes) in West Africa. We propose an evolutionary scenario for the Mycobacterium africanum lineage 6 complex, showing the evolutionary relationship of M. africanum and chimpanzee bacillus, and the closely related members M. suricattae, dassie bacillus and Mycobacterium mungi.

  7. Isolation and Whole-genome Sequence Analysis of the Imipenem Heteroresistant Acinetobacter baumannii Clinical Isolate HRAB-85.

    Science.gov (United States)

    Li, Puyuan; Huang, Yong; Yu, Lan; Liu, Yannan; Niu, Wenkai; Zou, Dayang; Liu, Huiying; Zheng, Jing; Yin, Xiuyun; Yuan, Jing; Yuan, Xin; Bai, Changqing

    2017-09-01

    Heteroresistance is a phenomenon in which there are various responses to antibiotics from bacterial cells within the same population. Here, we isolated and characterised an imipenem heteroresistant Acinetobacter baumannii strain (HRAB-85). The genome of strain HRAB-85 was completely sequenced and analysed to understand its antibiotic resistance mechanisms. Population analysis and multilocus sequence typing were performed. Subpopulations grew in the presence of imipenem at concentrations of up to 64μg/mL, and the strain was found to belong to ST208. The total length of strain HRAB-85 was 4,098,585bp with a GC content of 39.98%. The genome harboured at least four insertion sequences: the common ISAba1, ISAba22, ISAba24, and newly reported ISAba26. Additionally, 19 antibiotic-resistance genes against eight classes of antimicrobial agents were found, and 11 genomic islands (GIs) were identified. Among them, GI3, GI10, and GI11 contained many ISs and antibiotic-resistance determinants. The existence of imipenem heteroresistant phenotypes in A. baumannii was substantiated in this hospital, and imipenem pressure, which could induce imipenem-heteroresistant subpopulations, may select for highly resistant strains. The complete genome sequencing and bioinformatics analysis of HRAB-85 could improve our understanding of the epidemiology and resistance mechanisms of carbapenem-heteroresistant A. baumannii. Copyright © 2017. Published by Elsevier Ltd.

  8. Computational analysis of sequence selection mechanisms.

    Science.gov (United States)

    Meyerguz, Leonid; Grasso, Catherine; Kleinberg, Jon; Elber, Ron

    2004-04-01

    Mechanisms leading to gene variations are responsible for the diversity of species and are important components of the theory of evolution. One constraint on gene evolution is that of protein foldability; the three-dimensional shapes of proteins must be thermodynamically stable. We explore the impact of this constraint and calculate properties of foldable sequences using 3660 structures from the Protein Data Bank. We seek a selection function that receives sequences as input, and outputs survival probability based on sequence fitness to structure. We compute the number of sequences that match a particular protein structure with energy lower than the native sequence, the density of the number of sequences, the entropy, and the "selection" temperature. The mechanism of structure selection for sequences longer than 200 amino acids is approximately universal. For shorter sequences, it is not. We speculate on concrete evolutionary mechanisms that show this behavior.

  9. Comparative analysis of sequences from PT 2013

    DEFF Research Database (Denmark)

    Mikkelsen, Susie Sommer

    Sheatfish and not EHNV. Generally, mistakes occurred at the ends of the sequences. This can be due to several factors. One is that the sequence has not been trimmed of the sequence primer sites. Another is the lack of quality control of the chromatogram. Finally, sequencing in just one direction can result...... diseases in Europe. As part of the EURL proficiency test for fish diseases it is required to sequence any RANA virus isolates found in any of the samples. It is also highly recommended to sequence the ISA virus to determine whether it be HPRΔ or HPR0. Furthermore, it is recommended that any VHSV and IHNV...... isolates be genotyped. As part of the evaluation of the proficiency results it was decided this year to look into the quality and similarity of the sequence results for selected viruses. Ampoule III in the proficiency test 2013 contained an EHNV isolate. The EURL received 43 sequences from 41 laboratories...

  10. Time fluctuation analysis of forest fire sequences

    Science.gov (United States)

    Vega Orozco, Carmen D.; Kanevski, Mikhaïl; Tonini, Marj; Golay, Jean; Pereira, Mário J. G.

    2013-04-01

    Forest fires are complex events involving both space and time fluctuations. Understanding their dynamics and pattern distribution is of great importance in order to improve resource allocation and support fire management actions at local and global levels. This study aims at characterizing the temporal fluctuations of forest fire sequences observed in Portugal, which is the country that holds the largest wildfire land dataset in Europe. This research applies several exploratory data analysis measures to 302,000 forest fires that occurred from 1980 to 2007. The applied clustering measures are: Morisita clustering index, fractal and multifractal dimensions (box-counting), Ripley's K-function, Allan Factor, and variography. These algorithms enable a global time structural analysis describing the degree of clustering of a point pattern and defining whether the observed events occur randomly, in clusters or in a regular pattern. The considered methods are of general importance and can be used for other spatio-temporal events (e.g. crime, epidemiology, biodiversity, geomarketing). An important contribution of this research deals with the analysis and estimation of local measures of clustering that help in understanding their temporal structure. Each measure is described and executed for the raw data (forest fires geo-database) and results are compared to reference patterns generated under the null hypothesis of randomness (Poisson processes) embedded in the same time period of the raw data. This comparison enables estimating the degree of the deviation of the real data from a Poisson process. Generalizations to functional measures of these clustering methods, taking into account the phenomena, were also applied and adapted to detect time dependences in a measured variable (i.e. burned area). The time clustering of the raw data is compared several times with the Poisson processes at different thresholds of the measured function. Then, the clustering measure value
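
    Of the clustering measures listed in this abstract, the Allan Factor is the simplest to write down, so a minimal sketch is given here. It uses the standard definition (mean squared difference of event counts in adjacent windows, divided by twice the mean count); the function names and the choice of non-overlapping windows are assumptions, not details taken from the study.

```python
def window_counts(event_times, start, end, window):
    """Count events in consecutive, non-overlapping windows of equal length."""
    n_windows = int((end - start) // window)
    counts = [0] * n_windows
    for t in event_times:
        k = int((t - start) // window)
        if 0 <= k < n_windows:
            counts[k] += 1
    return counts


def allan_factor(event_times, start, end, window):
    """Mean squared difference of adjacent window counts over twice the mean count."""
    counts = window_counts(event_times, start, end, window)
    if len(counts) < 2:
        raise ValueError("need at least two windows")
    mean_count = sum(counts) / len(counts)
    if mean_count == 0:
        raise ValueError("no events in the observation period")
    diffs = [(b - a) ** 2 for a, b in zip(counts, counts[1:])]
    return (sum(diffs) / len(diffs)) / (2 * mean_count)
```

    For a homogeneous Poisson process the Allan Factor is expected to stay close to 1 at every window length, a perfectly regular sequence gives 0, and clustered sequences give values above 1. Comparing the observed value against simulated Poisson sequences over the same period mirrors the null-hypothesis comparison described in the abstract.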

  11. SVAMP: Sequence variation analysis, maps and phylogeny

    KAUST Repository

    Naeem, Raeece

    2014-04-03

    Summary: SVAMP is a stand-alone desktop application to visualize genomic variants (in variant call format) in the context of geographical metadata. Users of SVAMP are able to generate phylogenetic trees and perform principal coordinate analysis in real time from variant call format (VCF) and associated metadata files. Allele frequency map, geographical map of isolates, Tajima's D metric, single nucleotide polymorphism density, GC and variation density are also available for visualization in real time. We demonstrate the utility of SVAMP in tracking a methicillin-resistant Staphylococcus aureus outbreak from published next-generation sequencing data across 15 countries. We also demonstrate the scalability and accuracy of our software on 245 Plasmodium falciparum malaria isolates from three continents. Availability and implementation: The Qt/C++ software code, binaries, user manual and example datasets are available at http://cbrc.kaust.edu.sa/svamp. © The Author 2014.

  12. Statistical analysis of next generation sequencing data

    CERN Document Server

    Nettleton, Dan

    2014-01-01

    Next Generation Sequencing (NGS) is the latest high throughput technology to revolutionize genomic research. NGS generates massive genomic datasets that play a key role in the big data phenomenon that surrounds us today. To extract signals from high-dimensional NGS data and make valid statistical inferences and predictions, novel data analytic and statistical techniques are needed. This book contains 20 chapters written by prominent statisticians working with NGS data. The topics range from basic preprocessing and analysis with NGS data to more complex genomic applications such as copy number variation and isoform expression detection. Research statisticians who want to learn about this growing and exciting area will find this book useful. In addition, many chapters from this book could be included in graduate-level classes in statistical bioinformatics for training future biostatisticians who will be expected to deal with genomic data in basic biomedical research, genomic clinical trials and personalized med...

  13. Movement Pattern Analysis Based on Sequence Signatures

    Directory of Open Access Journals (Sweden)

    Seyed Hossein Chavoshi

    2015-09-01

    Increased affordability and deployment of advanced tracking technologies have led researchers from various domains to analyze the resulting spatio-temporal movement data sets for the purpose of knowledge discovery. Two different approaches can be considered in the analysis of moving objects: quantitative analysis and qualitative analysis. This research focuses on the latter and uses the qualitative trajectory calculus (QTC), a type of calculus that represents qualitative data on moving point objects (MPOs), and establishes a framework to analyze the relative movement of multiple MPOs. A visualization technique called sequence signature (SESI) is used, which makes it possible to map QTC patterns in a 2D indexed rasterized space in order to evaluate the similarity of relative movement patterns of multiple MPOs. The applicability of the proposed methodology is illustrated by means of two practical examples of interacting MPOs: cars on a highway and body parts of a samba dancer. The results show that the proposed method can be effectively used to analyze interactions of multiple MPOs in different domains.

  14. Direct chloroplast sequencing: comparison of sequencing platforms and analysis tools for whole chloroplast barcoding.

    Directory of Open Access Journals (Sweden)

    Marta Brozynska

    Direct sequencing of total plant DNA using next generation sequencing technologies generates a whole chloroplast genome sequence that has the potential to provide a barcode for use in plant and food identification. Advances in DNA sequencing platforms may make this an attractive approach for routine plant identification. The HiSeq (Illumina) and Ion Torrent (Life Technologies) sequencing platforms were used to sequence total DNA from rice to identify polymorphisms in the whole chloroplast genome sequence of a wild rice plant relative to cultivated rice (cv. Nipponbare). Consensus chloroplast sequences were produced by mapping sequence reads to the reference rice chloroplast genome or by de novo assembly and mapping of the resulting contigs to the reference sequence. A total of 122 polymorphisms (SNPs and indels) between the wild and cultivated rice chloroplasts were predicted by these different sequencing and analysis methods. Of these, a total of 102 polymorphisms including 90 SNPs were predicted by both platforms. Indels were more variable with different sequencing methods, with almost all discrepancies found in homopolymers. The Ion Torrent platform gave no apparent false SNP but was less reliable for indels. The methods should be suitable for routine barcoding using appropriate combinations of sequencing platform and data analysis.

  15. Development of new multilocus variable number of tandem repeat analysis (MLVA) for Listeria innocua and its application in a food processing plant.

    Science.gov (United States)

    Takahashi, Hajime; Ohshima, Chihiro; Nakagawa, Miku; Thanatsang, Krittaporn; Phraephaisarn, Chirapiphat; Chaturongkasumrit, Yuphakhun; Keeratipibul, Suwimon; Kuda, Takashi; Kimura, Bon

    2014-01-01

    Listeria innocua is an important hygiene indicator bacterium in food industries because it behaves similarly to Listeria monocytogenes, which is pathogenic to humans. PFGE is often used to characterize bacterial strains and to track contamination sources. However, because PFGE is an expensive, complicated, time-consuming protocol, and poses difficulty in data sharing, development of a new typing method is necessary. MLVA is a technique that identifies bacterial strains on the basis that the number of tandem repeats present in the genome varies among strains. MLVA has gained attention due to its high reproducibility and ease of data sharing. In this study, we developed an MLVA protocol to assess L. innocua and evaluated it by tracking the contamination source of L. innocua in an actual food manufacturing factory by typing the bacterial strains isolated from the factory. Three VNTR regions of the L. innocua genome were chosen for use in the MLVA. The number of repeat units in each VNTR region was calculated based on the results of PCR product analysis using capillary electrophoresis (CE). The calculated number of repetitions was compared with the results of the gene sequence analysis to demonstrate the accuracy of the CE repeat number analysis. The developed technique was evaluated using 60 L. innocua strains isolated from a food factory. These 60 strains were classified into 11 patterns using MLVA. Many of the strains were classified into ST-6, revealing that this MLVA strain type can contaminate each manufacturing process in the factory. The MLVA protocol developed in this study for L. innocua allowed rapid and easy analysis through the use of CE. This technique was found to be very useful in hygiene control in factories because it allowed us to track contamination sources and provided information regarding whether the bacteria were present in the factories.
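
    The step of turning a CE fragment size into a repeat number can be sketched as follows. The arithmetic (fragment length minus flanking length, divided by repeat unit length) is the usual MLVA conversion and is assumed here; the locus names, flank and unit lengths and the 1 bp tolerance are hypothetical values, not those of the published L. innocua protocol.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VntrLocus:
    name: str
    flank_bp: int  # combined length of both flanking regions, primers included
    unit_bp: int   # length of one repeat unit


def repeat_count(fragment_bp, locus, tolerance_bp=1.0):
    """Convert a CE fragment size to a whole repeat count, or None if off-ladder."""
    raw = (fragment_bp - locus.flank_bp) / locus.unit_bp
    nearest = round(raw)
    # Reject sizes that do not sit close to a whole number of repeat units.
    if nearest < 0 or abs(raw - nearest) * locus.unit_bp > tolerance_bp:
        return None
    return nearest


def mlva_profile(fragment_sizes, loci):
    """Tuple of repeat counts, one per locus, in the given locus order."""
    return tuple(repeat_count(fragment_sizes[locus.name], locus) for locus in loci)
```

    Strains sharing the same tuple across all loci would be assigned the same MLVA type; a fragment that falls between ladder positions is returned as None so that it can be checked against sequence data, as the abstract describes.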

  16. Development of new multilocus variable number of tandem repeat analysis (MLVA) for Listeria innocua and its application in a food processing plant.

    Directory of Open Access Journals (Sweden)

    Hajime Takahashi

    Listeria innocua is an important hygiene indicator bacterium in food industries because it behaves similarly to Listeria monocytogenes, which is pathogenic to humans. PFGE is often used to characterize bacterial strains and to track contamination sources. However, because PFGE is an expensive, complicated, time-consuming protocol, and poses difficulty in data sharing, development of a new typing method is necessary. MLVA is a technique that identifies bacterial strains on the basis that the number of tandem repeats present in the genome varies among strains. MLVA has gained attention due to its high reproducibility and ease of data sharing. In this study, we developed an MLVA protocol to assess L. innocua and evaluated it by tracking the contamination source of L. innocua in an actual food manufacturing factory by typing the bacterial strains isolated from the factory. Three VNTR regions of the L. innocua genome were chosen for use in the MLVA. The number of repeat units in each VNTR region was calculated based on the results of PCR product analysis using capillary electrophoresis (CE). The calculated number of repetitions was compared with the results of the gene sequence analysis to demonstrate the accuracy of the CE repeat number analysis. The developed technique was evaluated using 60 L. innocua strains isolated from a food factory. These 60 strains were classified into 11 patterns using MLVA. Many of the strains were classified into ST-6, revealing that this MLVA strain type can contaminate each manufacturing process in the factory. The MLVA protocol developed in this study for L. innocua allowed rapid and easy analysis through the use of CE. This technique was found to be very useful in hygiene control in factories because it allowed us to track contamination sources and provided information regarding whether the bacteria were present in the factories.

  17. Noncoding sequence classification based on wavelet transform analysis: part I

    Science.gov (United States)

    Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

    2017-09-01

    DNA sequences in the human genome can be divided into coding and noncoding ones. Coding sequences are those that are read during transcription. The identification of coding sequences has been widely reported in the literature due to its much-studied periodicity. Noncoding sequences represent the majority of the human genome. They play an important role in gene regulation and differentiation among the cells. However, noncoding sequences do not exhibit periodicities that correlate to their functions. The ENCODE (Encyclopedia of DNA Elements) and Epigenomic Roadmap projects have cataloged the human noncoding sequences into specific functions. We study characteristics of noncoding sequences with wavelet analysis of genomic signals.

  18. Image sequence analysis workstation for multipoint motion analysis

    Science.gov (United States)

    Mostafavi, Hassan

    1990-08-01

    This paper describes an application-specific engineering workstation designed and developed to analyze motion of objects from video sequences. The system combines the software and hardware environment of a modern graphic-oriented workstation with the digital image acquisition, processing and display techniques. In addition to automation and increase in throughput of data reduction tasks, the objective of the system is to provide less invasive methods of measurement by offering the ability to track objects that are more complex than reflective markers. Grey level image processing and spatial/temporal adaptation of the processing parameters is used for location and tracking of more complex features of objects under uncontrolled lighting and background conditions. The applications of such an automated and noninvasive measurement tool include analysis of the trajectory and attitude of rigid bodies such as human limbs, robots, aircraft in flight, etc. The system's key features are: 1) acquisition and storage of image sequences by digitizing and storing real-time video; 2) computer-controlled movie loop playback, freeze frame display, and digital image enhancement; 3) multiple leading edge tracking in addition to object centroids at up to 60 fields per second from both live input video or a stored image sequence; 4) model-based estimation and tracking of the six degrees of freedom of a rigid body; 5) field-of-view and spatial calibration; 6) image sequence and measurement data base management; and 7) offline analysis software for trajectory plotting and statistical analysis.

  19. A multi-locus plastid phylogenetic analysis of the pantropical genus Diospyros (Ebenaceae), with an emphasis on the radiation and biogeographic origins of the New Caledonian endemic species.

    Science.gov (United States)

    Duangjai, Sutee; Samuel, Rosabelle; Munzinger, Jérôme; Forest, Félix; Wallnöfer, Bruno; Barfuss, Michael H J; Fischer, Gunter; Chase, Mark W

    2009-09-01

    We aimed to clarify phylogenetic relationships within the pantropical genus Diospyros (Ebenaceae sensu lato), and ascertain biogeographical patterns in the New Caledonian endemic species. We used DNA sequences from eight plastid regions (rbcL, atpB, matK, ndhF, trnK intron, trnL intron, trnL-trnF spacer, and trnS-trnG spacer) and included 149 accessions representing 119 Diospyros species in our analysis. Results from this study confirmed the monophyly of Diospyros with good support and provided a clearer picture of the relationships within the genus than in previous studies. Evidence from phylogenetic analyses suggests that Diospyros colonized New Caledonia multiple times. The four lineages of Diospyros in New Caledonia also differ in their degree of diversification. The molecular data indicate that one lineage is paleoendemic and derived from an ancient Australian species. The other three lineages are more closely related to several Southeast Asian species; two of them are neoendemics, and one has radiated rapidly and recently.

  20. Novel algorithms for protein sequence analysis

    NARCIS (Netherlands)

    Ye, Kai

    2008-01-01

    Each protein is characterized by its unique sequential order of amino acids, the so-called protein sequence. Biology's paradigm is that this order of amino acids determines the protein's architecture and function. In this thesis, we introduce novel algorithms to analyze protein sequences. Chapter 1

  1. Pig genome sequence - analysis and publication strategy

    DEFF Research Database (Denmark)

    Archibald, Alan L.; Bolund, Lars; Churcher, Carol

    2010-01-01

    preferentially selected for sequencing. In accordance with the Bermuda and Fort Lauderdale agreements and the more recent Toronto Statement the data have been released into public sequence repositories (Genbank/EMBL, NCBI/Ensembl trace repositories) in a timely manner and in advance of publication. CONCLUSIONS...

  2. Accurate and Practical Identification of 20 Fusarium Species by Seven-Locus Sequence Analysis and Reverse Line Blot Hybridization, and an In Vitro Antifungal Susceptibility Study

    Science.gov (United States)

    Wang, He; Xiao, Meng; Kong, Fanrong; Chen, Sharon; Dou, Hong-Tao; Sorrell, Tania; Li, Ruo-Yu; Xu, Ying-Chun

    2011-01-01

    Eleven reference and 25 clinical isolates of Fusarium were subject to multilocus DNA sequence analysis to determine the species and haplotypes of the fusarial isolates from Beijing and Shandong, China. Seven loci were analyzed: the translation elongation factor 1 alpha gene (EF-1α); the nuclear rRNA internal transcribed spacer (ITS), large subunit (LSU), and intergenic spacer (IGS) regions; the second largest subunit of the RNA polymerase gene (RPB2); the calmodulin gene (CAM); and the mitochondrial small subunit (mtSSU) rRNA gene. We also evaluated an IGS-targeted PCR/reverse line blot (RLB) assay for species/haplotype identification of Fusarium. Twenty Fusarium species and seven species complexes were identified. Of 25 clinical isolates (10 species), the Gibberella (Fusarium) fujikuroi species complex was the commonest (40%) and was followed by the Fusarium solani species complex (FSSC) (36%) and the F. incarnatum-F. equiseti species complex (12%). Six FSSC isolates were identified to the species level as FSSC-3+4, and three as FSSC-5. Twenty-nine IGS, 27 EF-1α, 26 RPB2, 24 CAM, 18 ITS, 19 LSU, and 18 mtSSU haplotypes were identified; 29 were unique, and haplotypes for 24 clinical strains were novel. By parsimony informative character analysis, the IGS locus was the most phylogenetically informative, and the rRNA gene regions were the least. Results by RLB were concordant with multilocus sequence analysis for all isolates. Amphotericin B was the most active drug against all species. Voriconazole MICs were high (>8 μg/ml) for 15 (42%) isolates, including FSSC. Analysis of larger numbers of isolates is required to determine the clinical utility of the seven-locus sequence analysis and RLB assay in species classification of fusaria. PMID:21389150

  3. Characterization and sequence analysis of cysteine and glycine-rich ...

    African Journals Online (AJOL)

    Primers specific for CSRP3 were designed using known cDNA sequences of Bos taurus published in database with different accession numbers. Polymerase chain reaction (PCR) was performed and products were purified and sequenced. Sequence analysis and alignment were carried out using CLUSTAL W (1.83).

  4. Incident sequence analysis; event trees, methods and graphical symbols

    International Nuclear Information System (INIS)

    1980-11-01

    When analyzing incident sequences, unwanted events resulting from a certain cause are looked for. Graphical symbols and explanations of graphical representations are presented. The method applies to the analysis of incident sequences in all types of facilities. By means of the incident sequence diagram, incident sequences, i.e. the logical and chronological course of repercussions initiated by the failure of a component or by an operating error, can be presented and analyzed simply and clearly.

  5. Computer-aided visualization and analysis system for sequence evaluation

    Energy Technology Data Exchange (ETDEWEB)

    Chee, Mark S.; Wang, Chunwei; Jevons, Luis C.; Bernhart, Derek H.; Lipshutz, Robert J.

    2004-05-11

    A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.

  6. Multi-locus estimates of population structure and migration in a fence lizard hybrid zone.

    Directory of Open Access Journals (Sweden)

    Adam D Leaché

    A hybrid zone between two species of lizards in the genus Sceloporus (S. cowlesi and S. tristichus) on the Mogollon Rim in Arizona provides a unique opportunity to study the processes of lineage divergence and merging. This hybrid zone involves complex interactions between 2 morphologically and ecologically divergent subspecies, 3 chromosomal groups, and 4 mitochondrial DNA (mtDNA) clades. The spatial patterns of divergence between morphology, chromosomes and mtDNA are discordant, and determining which of these character types (if any) reflects the underlying population-level lineages that are of interest has remained impeded by character conflict. The focus of this study is to estimate the number of populations interacting in the hybrid zone using multi-locus nuclear data, and to then estimate the migration rates and divergence time between the inferred populations. Multi-locus estimates of population structure and gene flow were obtained from 12 anonymous nuclear loci sequenced for 93 specimens of Sceloporus. Population structure estimates support two populations, and this result is robust to changes to the prior probability distribution used in the Bayesian analysis and the use of spatially-explicit or non-spatial models. A coalescent analysis of population divergence suggests that gene flow is high between the two populations, and that the timing of divergence is restricted to the Pleistocene. The hybrid zone is more accurately described as involving two populations belonging to S. tristichus, and the presence of S. cowlesi mtDNA haplotypes in the hybrid zone is an anomaly resulting from mitochondrial introgression.

  7. Establishing a framework for comparative analysis of genome sequences

    Energy Technology Data Exchange (ETDEWEB)

    Bansal, A.K.

    1995-06-01

    This paper describes a framework and a high-level language toolkit for comparative analysis of genome sequence alignment. The framework integrates the information derived from multiple sequence alignment and phylogenetic tree (hypothetical tree of evolution) to derive new properties about sequences. Multiple sequence alignments are treated as an abstract data type. Abstract operations have been described to manipulate a multiple sequence alignment and to derive mutation related information from a phylogenetic tree by superimposing parsimonious analysis. The framework has been applied on protein alignments to derive constrained columns (in a multiple sequence alignment) that exhibit evolutionary pressure to preserve a common property in a column despite mutation. A Prolog toolkit based on the framework has been implemented and demonstrated on alignments containing 3000 sequences and 3904 columns.

  8. Scalable Kernel Methods and Algorithms for General Sequence Analysis

    Science.gov (United States)

    Kuksa, Pavel

    2011-01-01

    Analysis of large-scale sequential data has become an important task in machine learning and pattern recognition, inspired in part by numerous scientific and technological applications such as the document and text classification or the analysis of biological sequences. However, current computational methods for sequence comparison still lack…

  9. Real-time whole-genome sequencing for routine typing, surveillance, and outbreak detection of verotoxigenic Escherichia coli

    DEFF Research Database (Denmark)

    Joensen, Katrine Grimstrup; Scheutz, Flemming; Lund, Ole

    2014-01-01

    suspected VTEC isolates. During a 7-week period in the fall of 2012, all incoming isolates were concurrently subjected to WGS using IonTorrent PGM. Real-time bioinformatics analysis was performed using web-tools (www.genomicepidemiology.org) for species determination, multilocus sequence type (MLST) typing...

  10. Recurrence plot analysis of DNA sequences

    Energy Technology Data Exchange (ETDEWEB)

    Wu Zuobing [State Key Laboratory of Nonlinear Mechanics, Institute of Mechanics, Chinese Academy of Sciences, Beijing 100080 (China)]. E-mail: wuzb@lnm.imech.ac.cn

    2004-11-15

    A recurrence plot technique for DNA sequences is established on a metric representation and employed to analyze the correlation structure of nucleotide strings. It is found that, in the transference of nucleotide strings, a human DNA fragment has a major correlation distance, whereas a yeast chromosome's correlation distance shows a constant increase.

  11. Analysis of Neuronal Sequences Using Pairwise Biases

    Science.gov (United States)

    2015-08-27

    semantic memory (knowledge of facts) and implicit memory (e.g., how to ride a bike). Evidence for the participation of the hippocampus in the formation of...hippocampal formation in an attempt to be cured of severe epileptic seizures. Although the surgery was successful in regards to reducing the frequency and...very different from each other in many ways including duration and number of spikes. Still, these sequences share a similar trend in the general order

  12. Google matrix analysis of DNA sequences.

    Science.gov (United States)

    Kandiah, Vivek; Shepelyansky, Dima L

    2013-01-01

    For DNA sequences of various species we construct the Google matrix [Formula: see text] of Markov transitions between nearby words composed of several letters. The statistical distribution of matrix elements of this matrix is shown to be described by a power law with the exponent being close to those of outgoing links in such scale-free networks as the World Wide Web (WWW). At the same time the sum of ingoing matrix elements is characterized by the exponent being significantly larger than those typical for WWW networks. This results in a slow algebraic decay of the PageRank probability determined by the distribution of ingoing elements. The spectrum of [Formula: see text] is characterized by a large gap leading to a rapid relaxation process on the DNA sequence networks. We introduce the PageRank proximity correlator between different species which determines their statistical similarity from the view point of Markov chains. The properties of other eigenstates of the Google matrix are also discussed. Our results establish scale-free features of DNA sequence networks showing their similarities and distinctions with the WWW and linguistic networks.
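    The construction summarized above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: words are taken as consecutive non-overlapping blocks of k letters, the damping factor alpha = 0.85 and the uniform filling of empty columns are conventional choices not given in the abstract, and the function names google_matrix and pagerank are made up for this example.

```python
from itertools import product


def google_matrix(seq, k=2, alpha=0.85, alphabet="ACGT"):
    """Build a column-stochastic Google matrix of word-to-word transitions.

    Assumption: "nearby words" are consecutive, non-overlapping k-letter blocks.
    """
    words = ["".join(p) for p in product(alphabet, repeat=k)]
    index = {w: i for i, w in enumerate(words)}
    n = len(words)
    counts = [[0.0] * n for _ in range(n)]  # counts[to][from]
    for i in range(0, len(seq) - 2 * k + 1, k):
        a, b = seq[i:i + k], seq[i + k:i + 2 * k]
        if a in index and b in index:  # skip words with ambiguous letters
            counts[index[b]][index[a]] += 1.0
    G = [[0.0] * n for _ in range(n)]
    for j in range(n):
        col_sum = sum(counts[i][j] for i in range(n))
        for i in range(n):
            # Columns with no observed transitions are replaced by 1/n.
            s = counts[i][j] / col_sum if col_sum else 1.0 / n
            G[i][j] = alpha * s + (1.0 - alpha) / n
    return words, G


def pagerank(G, tol=1e-12, max_iter=10000):
    """Power iteration for the leading eigenvector of G (the PageRank vector)."""
    n = len(G)
    p = [1.0 / n] * n
    for _ in range(max_iter):
        nxt = [sum(G[i][j] * p[j] for j in range(n)) for i in range(n)]
        if sum(abs(x - y) for x, y in zip(nxt, p)) < tol:
            return nxt
        p = nxt
    return p


words, G = google_matrix("ACGTACGTTTGACA", k=1)
ranks = pagerank(G)
```

    The short input string is a toy sequence; with real genomic data the word length k would be larger and a sparse matrix representation would be preferable.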

  13. Google matrix analysis of DNA sequences.

    Directory of Open Access Journals (Sweden)

    Vivek Kandiah

    For DNA sequences of various species we construct the Google matrix [Formula: see text] of Markov transitions between nearby words composed of several letters. The statistical distribution of matrix elements of this matrix is shown to be described by a power law with the exponent being close to those of outgoing links in such scale-free networks as the World Wide Web (WWW). At the same time the sum of ingoing matrix elements is characterized by the exponent being significantly larger than those typical for WWW networks. This results in a slow algebraic decay of the PageRank probability determined by the distribution of ingoing elements. The spectrum of [Formula: see text] is characterized by a large gap leading to a rapid relaxation process on the DNA sequence networks. We introduce the PageRank proximity correlator between different species which determines their statistical similarity from the view point of Markov chains. The properties of other eigenstates of the Google matrix are also discussed. Our results establish scale-free features of DNA sequence networks showing their similarities and distinctions with the WWW and linguistic networks.

  14. Multilocus phylogenetic analysis and morphological data reveal a new species composition of the genus Drepanocephalus Dietz, 1909 (Digenea: Echinostomatidae), parasites of fish-eating birds in the Americas.

    Science.gov (United States)

    Hernández-Cruz, E; Hernández-Orts, J S; Sereno-Uribe, A L; Pérez-Ponce de León, G; García-Varela, M

    2017-10-04

    Members of the genus Drepanocephalus are endoparasites of fish-eating birds of the families Phalacrocoracidae and Sulidae distributed across the Americas. Currently, Drepanocephalus contains three species, i.e. D. spathans (type species), D. olivaceus and D. auritus. Two additional species, D. parvicephalus and D. mexicanus were transferred to the genus Petasiger. In the current study, available DNA sequences of D. spathans, D. auritus and Drepanocephalus sp., were aligned with newly generated sequences of D. spathans and Petasiger mexicanus. Phylogenetic analyses inferred with three nuclear (LSU, SSU and ITS1, 5.8S, ITS2) and two mitochondrial (cox1, nad1) molecular markers showed that the sequences of D. spathans and D. auritus are nested together in a single clade with very low genetic divergence, with Petasiger mexicanus as its sister species. Additionally, P. mexicanus was not a close relative of other members of the genus Petasiger, showing that P. mexicanus actually belongs to the genus Drepanocephalus, suggesting the need to re-allocate Petasiger mexicanus back into the genus Drepanocephalus, as D. mexicanus. Morphological observations of the newly sampled individuals of D. spathans showed that the position of the testes is variable and testes might be contiguous or widely separated, which is one of the main diagnostic traits for D. auritus. Our results suggest that D. auritus might be considered a synonym of D. spathans and, as a result, the latter represents a species with a wide geographic range across the Americas, parasitizing both the Neotropical and the double-crested cormorant in Argentina, Brazil, Paraguay, Venezuela, Colombia, Mexico, USA and Canada.

  15. Error Analysis of Deep Sequencing of Phage Libraries: Peptides Censored in Sequencing

    Directory of Open Access Journals (Sweden)

    Wadim L. Matochko

    2013-01-01

    Next-generation sequencing techniques empower selection of ligands from phage-display libraries because they can detect low abundant clones and quantify changes in the copy numbers of clones without excessive selection rounds. Identification of errors in deep sequencing data is the most critical step in this process because these techniques have error rates >1%. Mechanisms that yield errors in Illumina and other techniques have been proposed, but no reports to date describe error analysis in phage libraries. Our paper focuses on error analysis of 7-mer peptide libraries sequenced by the Illumina method. Low theoretical complexity of this phage library, as compared to complexity of long genetic reads and genomes, allowed us to describe this library using a convenient linear vector and operator framework. We describe a phage library as an N×1 frequency vector n = (n_i), where n_i is the copy number of the ith sequence and N is the theoretical diversity, that is, the total number of all possible sequences. Any manipulation of the library is an operator acting on n. Selection, amplification, or sequencing could be described as a product of an N×N matrix and a stochastic sampling operator (Sa). The latter is a random diagonal matrix that describes sampling of a library. In this paper, we focus on the properties of Sa and use them to define the sequencing operator (Seq). Sequencing without any bias and errors is Seq = Sa I_N, where I_N is an N×N unity matrix. Any bias in sequencing changes I_N to a nonunity matrix. We identified a diagonal censorship matrix (CEN), which describes elimination, or statistically significant downsampling, of specific reads during the sequencing process.
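    The vector-and-operator picture described above can be illustrated with a small Python sketch. Everything concrete here is an assumption made for the example: the binomial model in which each copy is kept with probability q, the toy copy numbers, the censorship values, and the product order Sa·CEN (which is immaterial for diagonal matrices). Because all operators are diagonal, only their diagonals are stored.

```python
import random


def sampling_operator(n, q, rng):
    """Diagonal of a random sampling matrix Sa.

    Assumption: each copy of a sequence is kept independently with probability q,
    so entry i is (copies kept) / (copies present).
    """
    diag = []
    for copies in n:
        kept = sum(1 for _ in range(copies) if rng.random() < q)
        diag.append(kept / copies if copies else 0.0)
    return diag


def apply_diagonal(diag, vec):
    """Multiply a diagonal matrix (given by its diagonal) with a vector."""
    return [d * v for d, v in zip(diag, vec)]


rng = random.Random(0)
n = [120, 30, 0, 850]        # hypothetical copy numbers n_i for N = 4 sequences
identity = [1.0] * len(n)    # diagonal of the unity matrix I_N
cen = [1.0, 0.0, 1.0, 0.5]   # hypothetical CEN: sequence 2 eliminated, sequence 4 downsampled

sa = sampling_operator(n, q=0.5, rng=rng)
unbiased = apply_diagonal(sa, apply_diagonal(identity, n))  # Seq = Sa I_N
censored = apply_diagonal(sa, apply_diagonal(cen, n))       # I_N replaced by CEN
```

    Comparing unbiased with censored shows how a nonunity diagonal matrix removes or thins specific reads while leaving the others unchanged.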

  16. Cloning and sequence analysis of benzo-a-pyreneinducible ...

    African Journals Online (AJOL)

    The phylogenetic tree based on the amino acid sequences clearly shows tilapia CYP1A and killifish CYP1A to be more closely related to each other than to the other CYP1A subfamilies. Sequence analysis of 3727 bp of genomic DNA showed that the clone obtained was the structural gene of CYP1A which consists of ...

  17. Biological sequence analysis: probabilistic models of proteins and nucleic acids

    National Research Council Canada - National Science Library

    Durbin, Richard

    1998-01-01

    ... analysis methods are now based on principles of probabilistic modelling. Examples of such methods include the use of probabilistically derived score matrices to determine the significance of sequence alignments, the use of hidden Markov models as the basis for profile searches to identify distant members of sequence families, and the inference...

  18. Phylogenetic analysis of the genus Hordeum using repetitive DNA sequences

    DEFF Research Database (Denmark)

    Svitashev, S.; Bryngelsson, T.; Vershinin, A.

    1994-01-01

    A set of six cloned barley (Hordeum vulgare) repetitive DNA sequences was used for the analysis of phylogenetic relationships among 31 species (46 taxa) of the genus Hordeum, using molecular hybridization techniques. In situ hybridization experiments showed dispersed organization of the sequences...

  19. Parametric inference for biological sequence analysis.

    Science.gov (United States)

    Pachter, Lior; Sturmfels, Bernd

    2004-11-16

    One of the major successes in computational biology has been the unification, by using the graphical model formalism, of a multitude of algorithms for annotating and comparing biological sequences. Graphical models that have been applied to these problems include hidden Markov models for annotation, tree models for phylogenetics, and pair hidden Markov models for alignment. A single algorithm, the sum-product algorithm, solves many of the inference problems that are associated with different statistical models. This article introduces the polytope propagation algorithm for computing the Newton polytope of an observation from a graphical model. This algorithm is a geometric version of the sum-product algorithm and is used to analyze the parametric behavior of maximum a posteriori inference calculations for graphical models.

  20. Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

    Science.gov (United States)

    Cao, Yinhe; Tung, Wen-Wen; Gao, J B

    2004-01-01

    With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cerevisiae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.
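    As a rough illustration of what a recurrence time statistic can look like, the Python sketch below records, for every word of length k, the distance to its previous occurrence and builds a histogram of those distances. This is one simple reading of "recurrence time" and is not claimed to be the authors' exact definition; the function name and the toy sequence are assumptions.

```python
from collections import Counter


def recurrence_times(seq, k=3):
    """Histogram of distances between successive occurrences of the same k-letter word."""
    last_seen = {}
    gaps = Counter()
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if word in last_seen:
            gaps[i - last_seen[word]] += 1
        last_seen[word] = i
    return gaps


# A perfect period-3 repeat puts all recurrence times at 3.
histogram = recurrence_times("ATGATGATGATG", k=3)
```

    In such a histogram, a tandem repeat shows up as a peak at its period, which is the kind of repeat-related feature the abstract refers to.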

  1. RESEARCH NOTE Genome-based exome-sequencing analysis ...

    Indian Academy of Sciences (India)

    Navya

    2017-02-22

    Feb 22, 2017 ... Genome-based exome-sequencing analysis identifies GYG1, DIS3L, DDRGK1 genes ... Cardiology Division, Department of Internal Medicine, Severance .... with p values of <0.05 by analyzing differences in allele distribution.

  2. Editorial: Special Issue on Algorithms for Sequence Analysis and Storage

    Directory of Open Access Journals (Sweden)

    Veli Mäkinen

    2014-03-01

    This special issue of Algorithms is dedicated to approaches to biological sequence analysis that have algorithmic novelty and potential for fundamental impact in methods used for genome research.

  3. Tools for integrated sequence-structure analysis with UCSF Chimera

    Directory of Open Access Journals (Sweden)

    Huang Conrad C

    2006-07-01

    Background: Comparing related structures and viewing the structures in the context of sequence alignments are important tasks in protein structure-function research. While many programs exist for individual aspects of such work, there is a need for interactive visualization tools that: (a) provide a deep integration of sequence and structure, far beyond mapping where a sequence region falls in the structure and vice versa; (b) facilitate changing data of one type based on the other (for example, using only sequence-conserved residues to match structures, or adjusting a sequence alignment based on spatial fit); (c) can be used with a researcher's own data, including arbitrary sequence alignments and annotations, closely or distantly related sets of proteins, etc.; and (d) interoperate with each other and with a full complement of molecular graphics features. We describe enhancements to UCSF Chimera to achieve these goals. Results: The molecular graphics program UCSF Chimera includes a suite of tools for interactive analyses of sequences and structures. Structures automatically associate with sequences in imported alignments, allowing many kinds of crosstalk. A novel method is provided to superimpose structures in the absence of a pre-existing sequence alignment. The method uses both sequence and secondary structure, and can match even structures with very low sequence identity. Another tool constructs structure-based sequence alignments from superpositions of two or more proteins. Chimera is designed to be extensible, and mechanisms for incorporating user-specific data without Chimera code development are also provided. Conclusion: The tools described here apply to many problems involving comparison and analysis of protein structures and their sequences. Chimera includes complete documentation and is intended for use by a wide range of scientists, not just those in the computational disciplines. UCSF Chimera is free for non-commercial use and is

  4. Molecular characterization and multilocus genotypes of Enterocytozoon bieneusi among horses in southwestern China

    Directory of Open Access Journals (Sweden)

    Lei Deng

    2016-10-01

    Background: Enterocytozoon bieneusi is one of the most prevalent causative species of diarrhea and enteric diseases in various hosts. E. bieneusi has been identified in humans, mammals, birds, rodents and reptiles in China, but few studies have reported E. bieneusi in horses. Therefore, the present study was conducted to assess the prevalence, molecular characteristics and zoonotic potential of E. bieneusi among horses in southwestern China. Findings: Three hundred and thirty-three fecal specimens were collected from horses on five farms in the Sichuan and Yunnan provinces of southwestern China. The prevalence of E. bieneusi was 22.5% (75/333), as determined by nested polymerase chain reaction and sequencing analysis of the internal transcribed spacer region of the ribosomal RNA gene of E. bieneusi. Altogether, 10 genotypes were identified among the 75 E. bieneusi-positive samples: four of these genotypes were known (horse1, horse2, SC02 and D) and six were novel (SCH1-4 and YNH1-2). Multilocus sequence typing using three microsatellites (MS1, MS3 and MS7) and one minisatellite (MS4) revealed three, two, three and three genotypes at these four loci, respectively. In phylogenetic analysis, all the genotypes of E. bieneusi obtained in this study were clustered into three distinct groups: D, SC02 and SCH1-3 were clustered into group 1 (zoonotic potential); SCH4 was clustered into group 2 (cattle-hosted); whereas horse2, YNH1 and YNH2 were clustered into group 6 (unclear zoonotic potential). Conclusions: This is the first report of E. bieneusi among horses in southwestern China. This is also the first multilocus genotyping analysis using microsatellite and minisatellite markers of E. bieneusi in horses. The presence of genotype D, which was previously identified in humans, and of genotypes SC02 and SCH1-3, which belong to potential zoonotic group 1, indicates that horses are a potential source of human E. bieneusi infections in China.

  5. Intricate patterns of phylogenetic relationships in the olive family as inferred from multi-locus plastid and nuclear DNA sequence analyses: a close-up on Chionanthus and Noronhia (Oleaceae).

    Science.gov (United States)

    Hong-Wa, Cynthia; Besnard, Guillaume

    2013-05-01

    Noronhia represents the most successful radiation of the olive family (Oleaceae) in Madagascar with more than 40 named endemic species distributed in all ecoregions from sea level to high mountains. Its position within the subtribe Oleinae has, however, been largely unresolved and its evolutionary history has remained unexplored. In this study, we generated a dataset of plastid (trnL-F, trnT-L, trnS-G, trnK-matK) and nuclear (internal transcribed spacer [ITS]) DNA sequences to infer phylogenetic relationships within Oleinae and to examine evolutionary patterns within Noronhia. Our sample included most species of Noronhia and representatives of the ten other extant genera within the subtribe with an emphasis on Chionanthus. Bayesian inferences and maximum likelihood analyses of plastid and nuclear data indicated several instances of paraphyly and polyphyly within Oleinae, with some geographic signal. Both plastid and ITS data showed a polyphyletic Noronhia that included Indian Ocean species of Chionanthus. They also found close relationships between Noronhia and African Chionanthus. However, the plastid data showed little clear differentiation between Noronhia and the African Chionanthus whereas relationships suggested by the nuclear ITS data were more consistent with taxonomy and geography. We used molecular dating to discriminate between hybridization and lineage sorting/gene duplication as alternative explanations for these topological discordances and to infer the biogeographic history of Noronhia. Hybridization between African Chionanthus and Noronhia could not be ruled out. However, Noronhia has long been established in Madagascar after a likely Cenozoic dispersal from Africa, suggesting any hybridization between representatives of African and Malagasy taxa was ancient. 
In any case, the African and Indian Ocean Chionanthus and Noronhia together formed a strongly supported monophyletic clade distinct and distant from other Chionanthus, which calls for a revised

  6. Sequencing and Analysis of Neanderthal Genomic DNA

    Energy Technology Data Exchange (ETDEWEB)

    Noonan, James P.; Coop, Graham; Kudaravalli, Sridhar; Smith,Doug; Krause, Johannes; Alessi, Joe; Chen, Feng; Platt, Darren; Paabo,Svante; Pritchard, Jonathan K.; Rubin, Edward M.

    2006-06-13

    Recovery and analysis of multiple Neanderthal autosomal sequences using a metagenomic approach reveals that modern humans and Neanderthals split ~400,000 years ago, without significant evidence of subsequent admixture.

  7. SVAMP: Sequence variation analysis, maps and phylogeny

    KAUST Repository

    Naeem, Raeece; Hidayah, Lailatul; Preston, Mark D.; Clark, Taane G.; Pain, Arnab

    2014-01-01

    Summary: SVAMP is a stand-alone desktop application to visualize genomic variants (in variant call format) in the context of geographical metadata. Users of SVAMP are able to generate phylogenetic trees and perform principal coordinate analysis

  8. MCMC multilocus lod scores: application of a new approach.

    Science.gov (United States)

    George, Andrew W; Wijsman, Ellen M; Thompson, Elizabeth A

    2005-01-01

    On extended pedigrees with extensive missing data, the calculation of multilocus likelihoods for linkage analysis is often beyond the computational bounds of exact methods. Growing interest therefore surrounds the implementation of Monte Carlo estimation methods. In this paper, we demonstrate the speed and accuracy of a new Markov chain Monte Carlo method for the estimation of linkage likelihoods through an analysis of real data from a study of early-onset Alzheimer's disease. For those data sets where comparison with exact analysis is possible, we achieved up to a 100-fold increase in speed. Our approach is implemented in the program lm_bayes within the framework of the freely available MORGAN 2.6 package for Monte Carlo genetic analysis (http://www.stat.washington.edu/thompson/Genepi/MORGAN/Morgan.shtml).

  9. Application of multi-locus analytical methods to identify interacting loci in case-control studies.

    NARCIS (Netherlands)

    Vermeulen, S.; Heijer, M. den; Sham, P.; Knight, J.

    2007-01-01

    To identify interacting loci in genetic epidemiological studies the application of multi-locus methods of analysis is warranted. Several more advanced classification methods have been developed in the past years, including multiple logistic regression, sum statistics, logic regression, and the

  10. DSAP: deep-sequencing small RNA analysis pipeline.

    Science.gov (United States)

    Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus

    2010-07-01

    DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.
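    The first two analysis steps listed above (cleanup and clustering) can be sketched as follows in Python. This is a hedged illustration, not DSAP's actual implementation: the adaptor string is a placeholder, the minimum tag length is arbitrary, and treating any single-letter tag as a poly-A/T/C/G/N read is a simplification. The input follows the tab-delimited "sequence, copy number" layout described in the abstract.

```python
from collections import Counter

ADAPTOR = "TCGTATGCC"  # placeholder 3' adaptor prefix, chosen for this example only


def clean_tag(tag, adaptor=ADAPTOR, min_len=4):
    """Trim everything from the adaptor onward; reject short or homopolymer tags."""
    pos = tag.find(adaptor)
    if pos != -1:
        tag = tag[:pos]
    if len(tag) < min_len or len(set(tag)) == 1:
        return None
    return tag


def cluster_tags(lines):
    """Group cleaned tags into unique sequence clusters, summing their copy numbers."""
    clusters = Counter()
    for line in lines:
        tag, count = line.rstrip("\n").split("\t")
        cleaned = clean_tag(tag.upper())
        if cleaned is not None:
            clusters[cleaned] += int(count)
    return clusters


example_input = [
    "TGAGGTAGTAGGTTCGTATGCC\t10",  # tag followed by adaptor
    "TGAGGTAGTAGGT\t5",            # same tag without adaptor
    "AAAAAAAATCGTATGCC\t3",        # poly-A once the adaptor is removed
    "NNNNNN\t2",                   # poly-N read
]
clusters = cluster_tags(example_input)
```

    The later steps (matching against Rfam and miRBase) depend on external databases and are left out of this sketch.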

  11. Quantiprot - a Python package for quantitative analysis of protein sequences.

    Science.gov (United States)

    Konopka, Bogumił M; Marciniak, Marta; Dyrka, Witold

    2017-07-17

    The field of protein sequence analysis is dominated by tools rooted in substitution matrices and alignments. A complementary approach is provided by methods of quantitative characterization. A major advantage of the approach is that quantitative properties define a multidimensional solution space, where sequences can be related to each other and differences can be meaningfully interpreted. Quantiprot is a software package in Python, which provides a simple and consistent interface to multiple methods for quantitative characterization of protein sequences. The package can be used to calculate dozens of characteristics directly from sequences or using physico-chemical properties of amino acids. Besides basic measures, Quantiprot performs quantitative analysis of recurrence and determinism in the sequence, calculates the distribution of n-grams and computes the Zipf's law coefficient. We propose three main fields of application of the Quantiprot package. First, quantitative characteristics can be used in alignment-free similarity searches, and in clustering of large and/or divergent sequence sets. Second, a feature space defined by quantitative properties can be used in comparative studies of protein families and organisms. Third, the feature space can be used for evaluating generative models, where a large number of sequences generated by the model can be compared to actually observed sequences.
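
    To make the n-gram and Zipf's law measures concrete, here is a small sketch in plain Python. It deliberately does not use the Quantiprot API. The helper names `ngram_counts` and `zipf_slope` are hypothetical, and the Zipf coefficient is assumed here to be the least-squares slope of log(frequency) against log(rank), which may differ from the package's exact definition.

```python
import math
from collections import Counter


def ngram_counts(seq, n=2):
    """Count overlapping n-grams in a sequence."""
    return Counter(seq[i:i + n] for i in range(len(seq) - n + 1))


def zipf_slope(counts):
    """Least-squares slope of log(frequency) versus log(rank).

    Assumed definition for illustration; a value near -1 corresponds to
    the classic Zipf distribution.
    """
    freqs = sorted(counts.values(), reverse=True)
    if len(freqs) < 2:
        raise ValueError("need at least two distinct n-grams")
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(freq) for freq in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


bigrams = ngram_counts("AABAAB", n=2)
```

    A feature such as this slope is one number per sequence, so a set of sequences maps to points in a feature space that can be compared without any alignment.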

  12. Evaluation of a highly discriminating multiplex multi-locus variable-number of tandem-repeats (MLVA) analysis for Vibrio cholerae.

    Science.gov (United States)

    Olsen, Jaran S; Aarskaug, Tone; Skogan, Gunnar; Fykse, Else Marie; Ellingsen, Anette Bauer; Blatny, Janet M

    2009-09-01

    Vibrio cholerae is the etiological agent of cholera and may be used in bioterror actions because it is easy to disseminate and because of public fear of contracting cholera. A simple and highly discriminating method for connecting clinical and environmental isolates of V. cholerae is needed in microbial forensics. Twelve different loci containing variable numbers of tandem repeats (VNTRs) were evaluated, of which six were polymorphic. Two multiplex reactions containing PCR primers targeting these six VNTRs resulted in successful DNA amplification of 142 environmental and clinical V. cholerae isolates. The genetic distribution within the V. cholerae strain collection was used to evaluate the discriminating power (Simpson's Diversity Index = 0.99) of this new MLVA analysis, showing that the assay has the potential to differentiate between various strains, but also to identify isolates collected from a common V. cholerae outbreak. This work has established a rapid and highly discriminating MLVA assay useful for trace-back analyses and/or forensic studies of V. cholerae infections.
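
    The discriminating power quoted above can be illustrated with a short calculation. The abstract does not state which form of the index was used; the sketch below assumes the unbiased form commonly applied to typing schemes, D = 1 - sum(n_j (n_j - 1)) / (N (N - 1)), where n_j is the number of isolates sharing profile j and N is the total. The profiles shown are invented repeat counts, not data from the study.

```python
from collections import Counter


def simpsons_diversity(profiles):
    """Simpson's index of diversity for a list of typing profiles.

    Assumed formula: D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)).
    """
    counts = Counter(profiles)
    total = sum(counts.values())
    if total < 2:
        raise ValueError("need at least two isolates")
    same_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_pairs / (total * (total - 1))


# Hypothetical MLVA profiles: one tuple of repeat counts per isolate.
isolates = [
    (9, 4, 6, 7, 12, 5),
    (9, 4, 6, 7, 12, 5),   # same profile as the first isolate
    (8, 4, 6, 7, 14, 5),
    (9, 3, 6, 7, 21, 6),
]
diversity = simpsons_diversity(isolates)
```

    A value close to 1 means two randomly chosen isolates almost always have different profiles, while identical profiles within an outbreak pull the value down.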

  13. Nonlinear analysis of river flow time sequences

    Science.gov (United States)

    Porporato, Amilcare; Ridolfi, Luca

    1997-06-01

    Within the field of chaos theory several methods for the analysis of complex dynamical systems have recently been proposed. In light of these ideas we study the dynamics which control the behavior over time of river flow, investigating the existence of a low-dimension deterministic component. The present article follows the research undertaken in the work of Porporato and Ridolfi [1996a] in which some clues as to the existence of chaos were collected. Particular emphasis is given here to the problem of noise and to nonlinear prediction. With regard to the latter, the benefits obtainable by means of the interpolation of the available time series are reported and the remarkable predictive results attained with this nonlinear method are shown.

  14. Accident sequence analysis of human-computer interface design

    International Nuclear Information System (INIS)

    Fan, C.-F.; Chen, W.-H.

    2000-01-01

    It is important to predict potential accident sequences of human-computer interaction in a safety-critical computing system so that vulnerable points can be disclosed and removed. We address this issue by proposing a Multi-Context human-computer interaction Model along with its analysis techniques, an Augmented Fault Tree Analysis, and a Concurrent Event Tree Analysis. The proposed augmented fault tree can identify the potential weak points in software design that may induce unintended software functions or erroneous human procedures. The concurrent event tree can enumerate possible accident sequences due to these weak points.

  15. Food Fish Identification from DNA Extraction through Sequence Analysis

    Science.gov (United States)

    Hallen-Adams, Heather E.

    2015-01-01

    This experiment exposed third- and fourth-year undergraduates and graduate students taking a course in advanced food analysis to DNA extraction, polymerase chain reaction (PCR), and DNA sequence analysis. Students provided their own fish sample, purchased from local grocery stores, and the class as a whole extracted DNA, which was then subjected to PCR,…

  16. Analysis and Visualization Tool for Targeted Amplicon Bisulfite Sequencing on Ion Torrent Sequencers.

    Directory of Open Access Journals (Sweden)

    Stephan Pabinger

    Targeted sequencing of PCR amplicons generated from bisulfite deaminated DNA is a flexible, cost-effective way to study methylation of a sample at single CpG resolution and perform subsequent multi-target, multi-sample comparisons. Currently, no platform-specific protocol, support, or analysis solution is provided to perform targeted bisulfite sequencing on a Personal Genome Machine (PGM). Here, we present a novel tool, called TABSAT, for analyzing targeted bisulfite sequencing data generated on Ion Torrent sequencers. The workflow starts with raw sequencing data, performs quality assessment, and uses a tailored version of Bismark to map the reads to a reference genome. The pipeline visualizes results as lollipop plots and is able to deduce specific methylation patterns present in a sample. The obtained profiles are then summarized and compared between samples. In order to assess the performance of the targeted bisulfite sequencing workflow, 48 samples were used to generate 53 different Bisulfite-Sequencing PCR amplicons from each sample, resulting in 2,544 amplicon targets. We obtained a mean coverage of 282X using 1,196,822 aligned reads. Next, we compared the sequencing results of these targets to the methylation level of the corresponding sites on an Illumina 450k methylation chip. The calculated average Pearson correlation coefficient of 0.91 confirms the sequencing results with one of the industry-leading CpG methylation platforms and shows that targeted amplicon bisulfite sequencing provides an accurate and cost-efficient method for DNA methylation studies, e.g., to provide platform-independent confirmation of Illumina Infinium 450k methylation data. TABSAT offers a novel way to analyze data generated by Ion Torrent instruments and can also be used with data from the Illumina MiSeq platform. It can be easily accessed via the Platomics platform, which offers a web-based graphical user interface along with sample and parameter storage.

  17. An optimum analysis sequence for environmental gamma-ray spectrometry

    Energy Technology Data Exchange (ETDEWEB)

    De la Torre, F.; Rios M, C.; Ruvalcaba A, M. G.; Mireles G, F.; Saucedo A, S.; Davila R, I.; Pinedo, J. L., E-mail: fta777@hotmail.co [Universidad Autonoma de Zacatecas, Centro Regional de Estudis Nucleares, Calle Cipres No. 10, Fracc. La Penuela, 98068 Zacatecas (Mexico)

    2010-10-15

    This work aims to obtain an optimum analysis sequence for environmental gamma-ray spectroscopy by means of Genie 2000 (Canberra). Twenty different analysis sequences were customized using different peak area percentages and different algorithms for 1) peak finding and 2) peak area determination, with or without the use of a library (based on evaluated nuclear data) of common gamma-ray emitters in environmental samples. The use of an optimum analysis sequence with certified nuclear information avoids the problems caused by the significant variations in out-of-date nuclear parameters of commercial software libraries. Interference-free gamma-ray energies with absolute emission probabilities greater than 3.75% were included in the customized library. The gamma-ray spectroscopy system (based on a Ge Re-3522 Canberra detector) was calibrated both in energy and shape by means of the IAEA-2002 reference spectra for software intercomparison. To test the performance of the analysis sequences, the IAEA-2002 reference spectrum was used. The z-score and the reduced χ² criteria were used to determine the optimum analysis sequence. The results show an appreciable variation in the peak area determinations and their corresponding uncertainties. In particular, the combination of the second derivative peak locate algorithm with simple peak area integration provides the greatest accuracy. Lower accuracy comes from the combination of the library directed peak locate algorithm and Genie's Gamma-M peak area determination. (Author)
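
    The two ranking criteria named above can be sketched numerically. The abstract does not give the exact definitions used, so the sketch assumes z_i = (A_i - R_i) / σ_i for each peak (measured area A_i, reference area R_i, combined standard uncertainty σ_i) and a reduced χ² equal to the mean of the squared z-scores. The peak areas below are invented for illustration.

```python
def z_scores(measured, reference, sigma):
    """Per-peak z-scores under the assumed definition (A - R) / sigma."""
    return [(m - r) / s for m, r, s in zip(measured, reference, sigma)]


def reduced_chi2(measured, reference, sigma):
    """Mean squared z-score; values near 1 suggest realistic uncertainties."""
    z = z_scores(measured, reference, sigma)
    return sum(v * v for v in z) / len(z)


# Hypothetical peak areas (counts) for one analysis sequence.
measured = [102.0, 195.0]
reference = [100.0, 200.0]
sigma = [2.0, 5.0]

z = z_scores(measured, reference, sigma)
chi2 = reduced_chi2(measured, reference, sigma)
```

    Under these assumptions, an analysis sequence with small |z| for every peak and a reduced χ² near 1 would rank as a good candidate.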

  18. An optimum analysis sequence for environmental gamma-ray spectrometry

    International Nuclear Information System (INIS)

    De la Torre, F.; Rios M, C.; Ruvalcaba A, M. G.; Mireles G, F.; Saucedo A, S.; Davila R, I.; Pinedo, J. L.

    2010-10-01

    This work aims to obtain an optimum analysis sequence for environmental gamma-ray spectroscopy by means of Genie 2000 (Canberra). Twenty different analysis sequences were customized using different peak area percentages and different algorithms for 1) peak finding and 2) peak area determination, with or without the use of a library (based on evaluated nuclear data) of common gamma-ray emitters in environmental samples. The use of an optimum analysis sequence with certified nuclear information avoids the problems caused by the significant variations in out-of-date nuclear parameters of commercial software libraries. Interference-free gamma-ray energies with absolute emission probabilities greater than 3.75% were included in the customized library. The gamma-ray spectroscopy system (based on a Ge Re-3522 Canberra detector) was calibrated both in energy and shape by means of the IAEA-2002 reference spectra for software intercomparison. To test the performance of the analysis sequences, the IAEA-2002 reference spectrum was used. The z-score and the reduced χ² criteria were used to determine the optimum analysis sequence. The results show an appreciable variation in the peak area determinations and their corresponding uncertainties. In particular, the combination of the second derivative peak locate algorithm with simple peak area integration provides the greatest accuracy. Lower accuracy comes from the combination of the library directed peak locate algorithm and Genie's Gamma-M peak area determination. (Author)

  19. Validation of Genotyping-By-Sequencing Analysis in Populations of Tetraploid Alfalfa by 454 Sequencing

    Science.gov (United States)

    Rocher, Solen; Jean, Martine; Castonguay, Yves; Belzile, François

    2015-01-01

    Genotyping-by-sequencing (GBS) is a relatively low-cost high throughput genotyping technology based on next generation sequencing and is applicable to orphan species with no reference genome. A combination of genome complexity reduction and multiplexing with DNA barcoding provides a simple and affordable way to resolve allelic variation between plant samples or populations. GBS was performed on ApeKI libraries using DNA from 48 genotypes each of two heterogeneous populations of tetraploid alfalfa (Medicago sativa spp. sativa): the synthetic cultivar Apica (ATF0) and a derived population (ATF5) obtained after five cycles of recurrent selection for superior tolerance to freezing (TF). Nearly 400 million reads were obtained from two lanes of an Illumina HiSeq 2000 sequencer and analyzed with the Universal Network-Enabled Analysis Kit (UNEAK) pipeline designed for species with no reference genome. Following the application of whole dataset-level filters, 11,694 single nucleotide polymorphism (SNP) loci were obtained. About 60% had a significant match on the Medicago truncatula syntenic genome. The accuracy of allelic ratios and genotype calls based on GBS data was directly assessed using 454 sequencing on a subset of SNP loci scored in eight plant samples. Sequencing depth in this study was not sufficient for accurate tetraploid allelic dosage, but reliable genotype calls based on diploid allelic dosage were obtained when using additional quality filtering. Principal Component Analysis of SNP loci in plant samples revealed that a small proportion (<5%) of the genetic variability assessed by GBS is able to differentiate ATF0 and ATF5. Our results confirm that analysis of GBS data using UNEAK is a reliable approach for genome-wide discovery of SNP loci in outcrossed polyploids. PMID:26115486

  20. A fast multilocus test with adaptive SNP selection for large-scale genetic-association studies

    KAUST Repository

    Zhang, Han

    2013-09-11

    As increasing evidence suggests that multiple correlated genetic variants could jointly influence the outcome, a multilocus test that aggregates association evidence across multiple genetic markers in a considered gene or a genomic region may be more powerful than a single-marker test for detecting susceptibility loci. We propose a multilocus test, AdaJoint, which adopts a variable selection procedure to identify a subset of genetic markers that jointly show the strongest association signal, and defines the test statistic based on the selected genetic markers. The P-value from the AdaJoint test is evaluated by a computationally efficient algorithm that effectively adjusts for multiple comparisons, and is hundreds of times faster than the standard permutation method. Simulation studies demonstrate that AdaJoint has the most robust performance among several commonly used multilocus tests. We perform multilocus analysis of over 26,000 genes/regions on two genome-wide association studies of pancreatic cancer. Compared with its competitors, AdaJoint identifies a much stronger association between the gene CLPTM1L and pancreatic cancer risk (6.0 × 10⁻⁸), with the signal optimally captured by two correlated single-nucleotide polymorphisms (SNPs). Finally, we show AdaJoint as a powerful tool for mapping cis-regulating methylation quantitative trait loci on normal breast tissues, and find many CpG sites whose methylation levels are jointly regulated by multiple SNPs nearby.

  1. Utility of RNA Sequencing for Analysis of Maize Reproductive Transcriptomes

    Directory of Open Access Journals (Sweden)

    Rebecca M. Davidson

    2011-11-01

    Transcriptome sequencing is a powerful method for studying global expression patterns in large, complex genomes. Evaluation of sequence-based expression profiles during reproductive development would provide functional annotation to genes underlying agronomic traits. We generated transcriptome profiles for 12 diverse maize (Zea mays L.) reproductive tissues representing male, female, developing seed, and leaf tissues using high throughput transcriptome sequencing. Overall, ∼80% of annotated genes were expressed. Comparative analysis between sequence and hybridization-based methods demonstrated the utility of ribonucleic acid sequencing (RNA-seq) for expression determination and differentiation of paralogous genes (∼85% of maize genes). Analysis of 4975 gene families across reproductive tissues revealed expression divergence is proportional to family size. In all pairwise comparisons between tissues, 7 (pre- vs. postemergence cobs) to 48% (pollen vs. ovule) of genes were differentially expressed. Genes with expression restricted to a single tissue within this study were identified with the highest numbers observed in leaves, endosperm, and pollen. Coexpression network analysis identified 17 gene modules with complex and shared expression patterns containing many previously described maize genes. The data and analyses in this study provide valuable tools through improved gene annotation, gene family characterization, and a core set of candidate genes to further characterize maize reproductive development and improve grain yield potential.

  2. The Role of the Y-Chromosome in the Establishment of Murine Hybrid Dysgenesis and in the Analysis of the Nucleotide Sequence Organization, Genetic Transmission and Evolution of Repeated Sequences.

    Science.gov (United States)

    Nallaseth, Ferez Soli

    The Y-chromosome presents a unique cytogenetic framework for the evolution of nucleotide sequences. Alignment of nine Y-chromosomal fragments in their increasing Y-specific/non Y-specific (male/female) sequence divergence ratios was directly and inversely related to their interspersion on these two respective genomic fractions. Sequence analysis confirmed a direct relationship between divergence ratios and the Alu, LINE-1, Satellite and their derivative oligonucleotide contents. Thus their relocation on the Y-chromosome is followed by sequence divergence rather than the well documented concerted evolution of these non-coding progenitor repeated sequences. Five of the nine Y-chromosomal fragments are non-pseudoautosomal and transcribed into heterogeneous PolyA+ RNA and thus can be retrotransposed. Evolutionary and computer analysis identified homologous oligonucleotide tracts in several human loci suggesting common and random mechanistic origins. Dysgenic genomes represent the accelerated evolution driving sequence divergence (McClintock, 1984). Sex reversal and sterility characterizing dysgenesis occur in C57BL/6JY^Pos but not in 129/SvY^Pos derivative strains. High frequency, random, multi-locus deletion products of the feral Y^Pos-chromosome are generated in the germlines of F1(C57BL/6J X 129/SvY^Pos)(male) and C57BL/6JY^Pos(male) but not in 129/SvY^Pos(male). Equal, 10^-1, 10^-2, and 0 copies (relative to males) of Y^Pos-specific deletion products respectively characterize C57BL/6JY^Pos (HC), (LC), (T) and (F) females. The testes determining loci of inactive Y^Pos-chromosomes in C57BL/6JY^Pos HC females are the preferentially deleted/rearranged Y^Pos-sequences. Disruption of regulation of plasma testosterone and hepatic MUP-A mRNA levels, TRD of a 4.7 Kbp EcoR1 fragment suggest disruption of autosomal/X-chromosomal sequences. These data and the highly repeated progenitor (Alu, GATA, LINE-1

  3. Sequence analysis of the genome of carnation (Dianthus caryophyllus L.).

    Science.gov (United States)

    Yagi, Masafumi; Kosugi, Shunichi; Hirakawa, Hideki; Ohmiya, Akemi; Tanase, Koji; Harada, Taro; Kishimoto, Kyutaro; Nakayama, Masayoshi; Ichimura, Kazuo; Onozaki, Takashi; Yamaguchi, Hiroyasu; Sasaki, Nobuhiro; Miyahara, Taira; Nishizaki, Yuzo; Ozeki, Yoshihiro; Nakamura, Noriko; Suzuki, Takamasa; Tanaka, Yoshikazu; Sato, Shusei; Shirasawa, Kenta; Isobe, Sachiko; Miyamura, Yoshinori; Watanabe, Akiko; Nakayama, Shinobu; Kishida, Yoshie; Kohara, Mitsuyo; Tabata, Satoshi

    2014-06-01

    The whole-genome sequence of carnation (Dianthus caryophyllus L.) cv. 'Francesco' was determined using a combination of different new-generation multiplex sequencing platforms. The total length of the non-redundant sequences was 568,887,315 bp, consisting of 45,088 scaffolds, which covered 91% of the 622 Mb carnation genome estimated by k-mer analysis. The N50 values of contigs and scaffolds were 16,644 bp and 60,737 bp, respectively, and the longest scaffold was 1,287,144 bp. The average GC content of the contig sequences was 36%. A total of 1050, 13, 92 and 143 genes for tRNAs, rRNAs, snoRNAs and miRNAs, respectively, were identified in the assembled genomic sequences. For protein-encoding genes, 43,266 complete and partial gene structures excluding those in transposable elements were deduced. Gene coverage was ∼98%, as deduced from the coverage of the core eukaryotic genes. Intensive characterization of the assigned carnation genes and comparison with those of other plant species revealed characteristic features of the carnation genome. The results of this study will serve as a valuable resource for fundamental and applied research of carnation, especially for breeding new carnation varieties. Further information on the genomic sequences is available at http://carnation.kazusa.or.jp. © The Author 2013. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
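
    For readers unfamiliar with the assembly statistics quoted above, the sketch below shows how an N50 value and the genome coverage fraction are typically computed. The function name `n50` and the toy scaffold lengths are illustrative assumptions. Only the 568,887,315 bp assembly length and the 622 Mb genome estimate come from the abstract.

```python
def n50(lengths):
    """Length L such that sequences of length >= L hold at least half of
    the total assembled bases."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("no sequence lengths given")


# Toy scaffold lengths in bp, not taken from the carnation assembly.
toy_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
toy_n50 = n50(toy_lengths)

# Fraction of the estimated genome covered by the assembly.
coverage = 568_887_315 / 622_000_000
```

    With the figures from the abstract, the coverage fraction rounds to the reported 91%.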

  4. De novo transcriptome sequencing and sequence analysis of the malaria vector Anopheles sinensis (Diptera: Culicidae)

    Science.gov (United States)

    2014-01-01

    Background: Anopheles sinensis is the major malaria vector in China and Southeast Asia. Vector control is one of the most effective measures to prevent malaria transmission. However, there is little transcriptome information available for the malaria vector. To better understand the biological basis of malaria transmission and to develop novel and effective means of vector control, there is a need to build a transcriptome dataset for functional genomics analysis by large-scale RNA sequencing (RNA-seq). Methods: To provide a more comprehensive and complete transcriptome of An. sinensis, RNA from eggs, larvae, pupae, male adults and female adults was pooled for cDNA preparation, sequenced using Illumina paired-end sequencing technology and assembled into unigenes. These unigenes were then analyzed for genome mapping, functional annotation, homology, codon usage bias and simple sequence repeats (SSRs). Results: Approximately 51.6 million clean reads were obtained, trimmed, and assembled into 38,504 unigenes with an average length of 571 bp, an N50 of 711 bp, and an average GC content of 51.26%. Among them, 98.4% of unigenes could be mapped onto the reference genome, and 69% of unigenes could be annotated with known biological functions. Homology analysis identified a number of An. sinensis unigenes that showed homology with, or were putative 1:1 orthologues of, sequences in the genomes of other Dipteran species. Codon usage bias was analyzed and 1,904 SSRs were detected, which will provide effective molecular markers for the population genetics of this species. Conclusions: Our data and analysis provide the most comprehensive transcriptomic resource and characteristics currently available for An. sinensis, and will facilitate genetic and genomic studies and further vector control of An. sinensis. PMID:25000941
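
    As an illustration of the SSR detection step mentioned above, the following sketch scans a sequence for perfect di- and trinucleotide repeats with a regular expression. The abstract does not state which tool or thresholds were used, so the minimum repeat numbers, the restriction to two motif sizes, and the function name `find_ssrs` are assumptions made for this example.

```python
import re

# Hypothetical thresholds: motif length -> minimum number of repeats.
MIN_REPEATS = {2: 6, 3: 5}


def find_ssrs(seq):
    """Return (start, motif, repeat_count) for perfect di/tri-nucleotide SSRs."""
    hits = []
    for unit_len, min_rep in MIN_REPEATS.items():
        # A motif of unit_len bases followed by at least min_rep - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if len(set(motif)) == 1:
                continue  # skip homopolymer runs such as AAAAAA...
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return sorted(hits)


toy_unigene = "GGC" + "AT" * 7 + "CCT" + "CAG" * 5 + "T"
ssrs = find_ssrs(toy_unigene)
```

    Each hit records where the repeat starts, its motif, and how many times the motif is repeated, which is the information usually needed to design flanking primers for a marker.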

  5. Sequence analysis corresponding to the PPE and PE proteins in ...

    Indian Academy of Sciences (India)

    Unknown

    AB repeats; Mycobacterium tuberculosis genome; PE-PPE domain; PPE, PE proteins; sequence analysis; surface antigens. J. Biosci. | Vol. ... bacterium tuberculosis genomes resulted in the identification of a previously uncharacterized 225 amino acid- ...... Vega Lopez F, Brooks L A, Dockrell H M, De Smet K A,. Thompson ...

  6. Molecular cloning, expression analysis and sequence prediction of ...

    African Journals Online (AJOL)

    CCAAT/enhancer-binding protein beta (C/EBPβ), an essential transcription factor, regulates the differentiation of adipocytes and the deposition of fat. Herein, we cloned the whole open reading frame (ORF) of the bovine C/EBPβ gene and analyzed its putative protein structures via DNA cloning and sequence analysis. Then, the ...

  7. Sequence symmetry analysis in pharmacovigilance and pharmacoepidemiologic studies

    DEFF Research Database (Denmark)

    Lai, Edward Chia Cheng; Pratt, Nicole; Hsieh, Cheng Yang

    2017-01-01

    Sequence symmetry analysis (SSA) is a method for detecting adverse drug events by utilizing computerized claims data. The method has been increasingly used to investigate safety concerns of medications and as a pharmacovigilance tool to identify unsuspected side effects. Validation studies have i...

  8. DNAApp: a mobile application for sequencing data analysis.

    Science.gov (United States)

    Nguyen, Phi-Vu; Verma, Chandra Shekhar; Gan, Samuel Ken-En

    2014-11-15

    There have been numerous applications developed for decoding and visualization of ab1 DNA sequencing files for Windows and Mac platforms, yet none exists for the increasingly popular smartphone operating systems. The ability to decode sequencing files cannot easily be carried out using browser accessed Web tools. To overcome this hurdle, we have developed a new native app called DNAApp that can decode and display ab1 sequencing files on Android and iOS. In addition to in-built analysis tools such as reverse complementation, protein translation and searching for specific sequences, we have incorporated convenient functions that would facilitate the harnessing of online Web tools for a full range of analysis. Given the high usage of Android/iOS tablets and smartphones, such bioinformatics apps would raise productivity and facilitate the high demand for analyzing sequencing data in biomedical research. The Android version of DNAApp is available in Google Play Store as 'DNAApp', and the iOS version is available in the App Store. More details on the app can be found at www.facebook.com/APDLab; www.bii.a-star.edu.sg/research/trd/apd.php. The DNAApp user guide is available at http://tinyurl.com/DNAAppuser, and a video tutorial is available on Google Play Store and App Store, as well as on the Facebook page. Contact: samuelg@bii.a-star.edu.sg. © The Author 2014. Published by Oxford University Press.
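
    The built-in analysis tools listed above (reverse complementation, protein translation and sequence search) correspond to standard operations that can be sketched in a few lines. This is not DNAApp's code. The function names are hypothetical, the standard genetic code is assumed, and decoding of ab1 files is not covered.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(dna):
    """Reverse-complement an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate frame 1; codons with ambiguous bases become 'X'."""
    codons = (dna[i:i + 3] for i in range(0, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def find_all(dna, query):
    """0-based start positions of every (possibly overlapping) occurrence."""
    return [i for i in range(len(dna) - len(query) + 1) if dna.startswith(query, i)]


read = "ATGGCCTAA"
protein = translate(read)
```

    Searching both the read and its reverse complement with `find_all` covers matches on either strand.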

  9. DNAApp: a mobile application for sequencing data analysis

    Science.gov (United States)

    Nguyen, Phi-Vu; Verma, Chandra Shekhar; Gan, Samuel Ken-En

    2014-01-01

    Summary: There have been numerous applications developed for decoding and visualization of ab1 DNA sequencing files for Windows and Mac platforms, yet none exists for the increasingly popular smartphone operating systems. The ability to decode sequencing files cannot easily be carried out using browser accessed Web tools. To overcome this hurdle, we have developed a new native app called DNAApp that can decode and display ab1 sequencing files on Android and iOS. In addition to in-built analysis tools such as reverse complementation, protein translation and searching for specific sequences, we have incorporated convenient functions that would facilitate the harnessing of online Web tools for a full range of analysis. Given the high usage of Android/iOS tablets and smartphones, such bioinformatics apps would raise productivity and facilitate the high demand for analyzing sequencing data in biomedical research. Availability and implementation: The Android version of DNAApp is available in Google Play Store as ‘DNAApp’, and the iOS version is available in the App Store. More details on the app can be found at www.facebook.com/APDLab; www.bii.a-star.edu.sg/research/trd/apd.php. The DNAApp user guide is available at http://tinyurl.com/DNAAppuser, and a video tutorial is available on Google Play Store and App Store, as well as on the Facebook page. Contact: samuelg@bii.a-star.edu.sg. PMID:25095882

  10. Long-read sequencing data analysis for yeasts.

    Science.gov (United States)

    Yue, Jia-Xing; Liti, Gianni

    2018-06-01

    Long-read sequencing technologies have become increasingly popular due to their strengths in resolving complex genomic regions. As a leading model organism with small genome size and great biotechnological importance, the budding yeast Saccharomyces cerevisiae has many isolates currently being sequenced with long reads. However, analyzing long-read sequencing data to produce high-quality genome assembly and annotation remains challenging. Here, we present a modular computational framework named long-read sequencing data analysis for yeasts (LRSDAY), the first one-stop solution that streamlines this process. Starting from the raw sequencing reads, LRSDAY can produce chromosome-level genome assembly and comprehensive genome annotation in a highly automated manner with minimal manual intervention, which is not possible using any alternative tool available to date. The annotated genomic features include centromeres, protein-coding genes, tRNAs, transposable elements (TEs), and telomere-associated elements. Although tailored for S. cerevisiae, we designed LRSDAY to be highly modular and customizable, making it adaptable to virtually any eukaryotic organism. When LRSDAY is applied to an S. cerevisiae strain, it takes ∼41 h to generate a complete and well-annotated genome from ∼100× Pacific Biosciences (PacBio) reads when running the basic workflow with four threads. Basic experience working within the Linux command-line environment is recommended for carrying out the analysis using LRSDAY.

  11. Construction of an integrated database to support genomic sequence analysis

    Energy Technology Data Exchange (ETDEWEB)

    Gilbert, W.; Overbeek, R.

    1994-11-01

    The central goal of this project is to develop an integrated database to support comparative analysis of genomes including DNA sequence data, protein sequence data, gene expression data and metabolism data. In developing the logic-based system GenoBase, a broader integration of available data was achieved due to assistance from collaborators. Current goals are to easily include new forms of data as they become available and to easily navigate through the ensemble of objects described within the database. This report comments on progress made in these areas.

  12. Analysis of Sequence Diagram Layout in Advanced UML Modelling Tools

    Directory of Open Access Journals (Sweden)

    Ņikiforova Oksana

    2016-05-01

    System modelling using Unified Modelling Language (UML) is a task that must be solved in software development. The more complex software becomes, the higher the requirements for demonstrating the system to be developed, especially in its dynamic aspect, which in UML is represented by a sequence diagram. To solve this task, the main attention is devoted to the graphical presentation of the system, where diagram layout plays the central role in information perception. The UML sequence diagram, due to its specific structure, is selected for a deeper analysis of the layout of its elements. The authors' research examines the ability of modern UML modelling tools to offer automatic layout of the UML sequence diagram and analyses them according to criteria required for diagram perception.

  13. Network clustering coefficient approach to DNA sequence analysis

    Energy Technology Data Exchange (ETDEWEB)

    Gerhardt, Guenther J.L. [Universidade Federal do Rio Grande do Sul-Hospital de Clinicas de Porto Alegre, Rua Ramiro Barcelos 2350/sala 2040/90035-003 Porto Alegre (Brazil); Departamento de Fisica e Quimica da Universidade de Caxias do Sul, Rua Francisco Getulio Vargas 1130, 95001-970 Caxias do Sul (Brazil); Lemke, Ney [Programa Interdisciplinar em Computacao Aplicada, Unisinos, Av. Unisinos, 950, 93022-000 Sao Leopoldo, RS (Brazil); Corso, Gilberto [Departamento de Biofisica e Farmacologia, Centro de Biociencias, Universidade Federal do Rio Grande do Norte, Campus Universitario, 59072 970 Natal, RN (Brazil)]. E-mail: corso@dfte.ufrn.br

    2006-05-15

    In this work we propose an alternative DNA sequence analysis tool based on graph theoretical concepts. The methodology investigates the path topology of an organism's genome through a triplet network. In this network, triplets in the DNA sequence are vertices and two vertices are connected if they occur juxtaposed on the genome. We characterize this network topology by measuring the clustering coefficient. We test our methodology against two main biases: the guanine-cytosine (GC) content and the 3-bp (base pair) periodicity of the DNA sequence. We perform the test by constructing random networks with variable GC content and imposed 3-bp periodicity. A test group of organisms is constructed and we investigate the methodology in the light of the constructed random networks. We conclude that the clustering coefficient is a valuable tool since it gives information that is not trivially contained either in the 3-bp periodicity or in the variable GC content.
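
    The triplet network described above can be sketched in plain Python as follows. Several details are assumptions made for illustration, since the abstract does not specify them: the sequence is read as consecutive non-overlapping triplets in a single frame, edges are treated as undirected, self-loops are ignored, and the network is summarized by the average local clustering coefficient. The function names are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def triplet_network(seq):
    """Adjacency sets linking triplets that occur next to each other."""
    triplets = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    adjacency = defaultdict(set)
    for a, b in zip(triplets, triplets[1:]):
        if a != b:                      # ignore self-loops (assumption)
            adjacency[a].add(b)
            adjacency[b].add(a)
    return adjacency


def average_clustering(adjacency):
    """Mean over vertices of (links among neighbours) / (possible links)."""
    if not adjacency:
        return 0.0
    total = 0.0
    for neighbours in adjacency.values():
        k = len(neighbours)
        if k < 2:
            continue                    # such vertices contribute zero
        links = sum(
            1 for u, v in combinations(neighbours, 2) if v in adjacency[u]
        )
        total += links / (k * (k - 1) / 2)
    return total / len(adjacency)


triangle = average_clustering(triplet_network("AAACCCGGGAAA"))
path = average_clustering(triplet_network("AAACCCGGG"))
```

    In the first toy sequence the three triplets form a closed triangle, while in the second they form an open path. Comparing such values with those from randomized sequences of matched GC content is the kind of test the abstract describes.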

  14. Evolutionary analysis of hepatitis C virus gene sequences from 1953

    Science.gov (United States)

    Gray, Rebecca R.; Tanaka, Yasuhito; Takebe, Yutaka; Magiorkinis, Gkikas; Buskell, Zelma; Seeff, Leonard; Alter, Harvey J.; Pybus, Oliver G.

    2013-01-01

    Reconstructing the transmission history of infectious diseases in the absence of medical or epidemiological records often relies on the evolutionary analysis of pathogen genetic sequences. The precision of evolutionary estimates of epidemic history can be increased by the inclusion of sequences derived from ‘archived’ samples that are genetically distinct from contemporary strains. Historical sequences are especially valuable for viral pathogens that circulated for many years before being formally identified, including HIV and the hepatitis C virus (HCV). However, surprisingly few HCV isolates sampled before discovery of the virus in 1989 are currently available. Here, we report and analyse two HCV subgenomic sequences obtained from infected individuals in 1953, which represent the oldest genetic evidence of HCV infection. The pairwise genetic diversity between the two sequences indicates a substantial period of HCV transmission prior to the 1950s, and their inclusion in evolutionary analyses provides new estimates of the common ancestor of HCV in the USA. To explore and validate the evolutionary information provided by these sequences, we used a new phylogenetic molecular clock method to estimate the date of sampling of the archived strains, plus the dates of four more contemporary reference genomes. Despite the short fragments available, we conclude that the archived sequences are consistent with a proposed sampling date of 1953, although statistical uncertainty is large. Our cross-validation analyses suggest that the bias and low statistical power observed here likely arise from a combination of high evolutionary rate heterogeneity and an unstructured, star-like phylogeny. We expect that attempts to date other historical viruses under similar circumstances will meet similar problems. PMID:23938759

  15. Using SQL Databases for Sequence Similarity Searching and Analysis.

    Science.gov (United States)

    Pearson, William R; Mackey, Aaron J

    2017-09-13

    Relational databases can integrate diverse types of information and manage large sets of similarity search results, greatly simplifying genome-scale analyses. By focusing on taxonomic subsets of sequences, relational databases can reduce the size and redundancy of sequence libraries and improve the statistical significance of homologs. In addition, by loading similarity search results into a relational database, it becomes possible to explore and summarize the relationships between all of the proteins in an organism and those in other biological kingdoms. This unit describes how to use relational databases to improve the efficiency of sequence similarity searching and demonstrates various large-scale genomic analyses of homology-related data. It also describes the installation and use of a simple protein sequence database, seqdb_demo, which is used as a basis for the other protocols. The unit also introduces search_demo, a database that stores sequence similarity search results. The search_demo database is then used to explore the evolutionary relationships between E. coli proteins and proteins in other organisms in a large-scale comparative genomic analysis. © 2017 by John Wiley & Sons, Inc.
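
    The idea of loading similarity search results into a relational database and then summarizing them by taxon can be illustrated with Python's built-in sqlite3 module. The schema below is a hypothetical toy, not the actual seqdb_demo or search_demo schema; the table names, column names, accessions and E-values are all invented for illustration.

```python
import sqlite3


def build_demo(conn):
    """Create a toy schema (hypothetical) and load a few invented rows."""
    conn.executescript("""
        CREATE TABLE protein (acc TEXT PRIMARY KEY, taxon TEXT, length INTEGER);
        CREATE TABLE hit (query_acc TEXT, subject_acc TEXT, evalue REAL);
    """)
    conn.executemany("INSERT INTO protein VALUES (?, ?, ?)", [
        ("ecoli_1", "Bacteria", 300),
        ("ecoli_2", "Bacteria", 150),
        ("human_1", "Eukaryota", 320),
        ("archaea_1", "Archaea", 290),
    ])
    conn.executemany("INSERT INTO hit VALUES (?, ?, ?)", [
        ("ecoli_1", "human_1", 1e-40),
        ("ecoli_1", "archaea_1", 1e-12),
        ("ecoli_2", "human_1", 0.5),
    ])


def queries_with_homolog_in(conn, taxon, max_evalue=1e-6):
    """Return query accessions with at least one significant hit in `taxon`."""
    rows = conn.execute("""
        SELECT DISTINCT h.query_acc
        FROM hit AS h
        JOIN protein AS p ON p.acc = h.subject_acc
        WHERE p.taxon = ? AND h.evalue <= ?
        ORDER BY h.query_acc
    """, (taxon, max_evalue))
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    build_demo(conn)
    print(queries_with_homolog_in(conn, "Eukaryota"))
```

    Keeping the hits in a table means the E-value threshold and the taxonomic subset become query parameters, so the same search results can be summarized in several ways without rerunning the search.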

  16. Now And Next Generation Sequencing Techniques: Future of Sequence Analysis using Cloud Computing

    Directory of Open Access Journals (Sweden)

    Radhe Shyam Thakur

    2012-12-01

    Advancements in sequencing techniques have resulted in huge volumes of sequence data being produced at a much faster rate. It is becoming cumbersome for datacenters to maintain the databases. Data mining and sequence analysis approaches need to analyze the databases several times to reach any efficient conclusion. To cope with this overburden on computer resources and to reach efficient and effective conclusions quickly, the virtualization of resources and computation on a pay-as-you-go basis was introduced and termed cloud computing. The datacenter's hardware and software are collectively known as a cloud, which, when available publicly, is termed a public cloud. The datacenter's resources are provided in a virtual mode to clients via a service provider such as Amazon, Google or Joyent, which charges on a pay-as-you-go basis. The workload is shifted to the provider, which also maintains the required hardware and software upgrades. The service provider manages this by upgrading the resources in the virtual mode. Basically, a virtual environment is created according to the needs of the user by requesting permission from the datacenter via the internet; the task is performed, and the environment is deleted after the task is over. In this discussion, we focus on the basics of cloud computing and on the prerequisites and overall working of clouds. Furthermore, the applications of cloud computing in biological systems, especially in comparative genomics, genome informatics and SNP detection, are briefly discussed with reference to the traditional workflow.

  17. Now and next-generation sequencing techniques: future of sequence analysis using cloud computing.

    Science.gov (United States)

    Thakur, Radhe Shyam; Bandopadhyay, Rajib; Chaudhary, Bratati; Chatterjee, Sourav

    2012-01-01

    Advances in the field of sequencing techniques have resulted in the greatly accelerated production of huge sequence datasets. This presents immediate challenges in database maintenance at datacenters. It provides additional computational challenges in data mining and sequence analysis. Together these represent a significant overburden on traditional stand-alone computer resources, and to reach effective conclusions quickly and efficiently, the virtualization of the resources and computation on a pay-as-you-go concept (together termed "cloud computing") has recently appeared. The collective resources of the datacenter, including both hardware and software, can be available publicly, being then termed a public cloud, the resources being provided in a virtual mode to the clients who pay according to the resources they employ. Examples of public companies providing these resources include Amazon, Google, and Joyent. The computational workload is shifted to the provider, which also implements required hardware and software upgrades over time. A virtual environment is created in the cloud corresponding to the computational and data storage needs of the user via the internet. The task is then performed, the results transmitted to the user, and the environment finally deleted after all tasks are completed. In this discussion, we focus on the basics of cloud computing, and go on to analyze the prerequisites and overall working of clouds. Finally, the applications of cloud computing in biological systems, particularly in comparative genomics, genome informatics, and SNP detection are discussed with reference to traditional workflows.

  18. SEQUENCING AND SEQUENCE ANALYSIS OF MYOSTATIN GENE IN THE EXON 1 OF THE CAMEL (CAMELUS DROMEDARIUS)

    Directory of Open Access Journals (Sweden)

    M. G. SHAH, A. S. QURESHI, M. REISSMANN AND H. J. SCHWARTZ

    2006-10-01

    Myostatin, also called growth differentiation factor-8 (GDF-8), is a member of the mammalian transforming growth factor-beta (TGF-beta) superfamily and is expressed specifically in developing and adult skeletal muscle. The muscular hypertrophy (mh) allele in double-muscled breeds involves a mutation within the myostatin gene. Genomic DNA was isolated from camel hair using the NucleoSpin Tissue kit. Two animals from each of six breeds, namely Marecha, Dhatti, Larri, Kohi, Sakrai and Cambelpuri, were used for sequencing. For PCR amplification of the gene, a primer pair was designed from homologous regions of already published sequences of farm animals from GenBank. Results showed that camel myostatin possessed more than 90% homology with that of cattle, sheep and pig. Camel formed a separate cluster from the pig in spite of having high homology (98%) and showed 94% homology with cattle and sheep, as reported in the literature. The sequence of the PCR-amplified part of exon 1 (256 bp) of camel myostatin was identical among the six camel breeds.

  19. An Imaging And Graphics Workstation For Image Sequence Analysis

    Science.gov (United States)

    Mostafavi, Hassan

    1990-01-01

    This paper describes an application-specific engineering workstation designed and developed to analyze imagery sequences from a variety of sources. The system combines the software and hardware environment of the modern graphic-oriented workstations with the digital image acquisition, processing and display techniques. The objective is to achieve automation and high throughput for many data reduction tasks involving metric studies of image sequences. The applications of such an automated data reduction tool include analysis of the trajectory and attitude of aircraft, missile, stores and other flying objects in various flight regimes including launch and separation as well as regular flight maneuvers. The workstation can also be used in an on-line or off-line mode to study three-dimensional motion of aircraft models in simulated flight conditions such as wind tunnels. The system's key features are: 1) Acquisition and storage of image sequences by digitizing real-time video or frames from a film strip; 2) computer-controlled movie loop playback, slow motion and freeze frame display combined with digital image sharpening, noise reduction, contrast enhancement and interactive image magnification; 3) multiple leading edge tracking in addition to object centroids at up to 60 fields per second from both live input video or a stored image sequence; 4) automatic and manual field-of-view and spatial calibration; 5) image sequence data base generation and management, including the measurement data products; 6) off-line analysis software for trajectory plotting and statistical analysis; 7) model-based estimation and tracking of object attitude angles; and 8) interface to a variety of video players and film transport sub-systems.

  20. Sirius PSB: a generic system for analysis of biological sequences.

    Science.gov (United States)

    Koh, Chuan Hock; Lin, Sharene; Jedd, Gregory; Wong, Limsoon

    2009-12-01

    Computational tools are essential components of modern biological research. For example, BLAST searches can be used to identify related proteins based on sequence homology, or when a new genome is sequenced, prediction models can be used to annotate functional sites such as transcription start sites, translation initiation sites and polyadenylation sites and to predict protein localization. Here we present Sirius Prediction Systems Builder (PSB), a new computational tool for sequence analysis, classification and searching. Sirius PSB has four main operations: (1) building a classifier, (2) deploying a classifier, (3) searching for proteins similar to query proteins, and (4) preliminary and post-prediction analysis. Sirius PSB supports all these operations via a simple and interactive graphical user interface. Besides being a convenient tool, Sirius PSB also introduces two novelties in sequence analysis. Firstly, a genetic algorithm is used to identify interesting features in the feature space. Secondly, instead of the conventional method of searching for similar proteins via sequence similarity, we introduce searching via feature similarity. To demonstrate the capabilities of Sirius PSB, we have built two prediction models - one for the recognition of Arabidopsis polyadenylation sites and another for the subcellular localization of proteins. Both systems are competitive against current state-of-the-art models based on evaluation of public datasets. More notably, the time and effort required to build each model are greatly reduced with the assistance of Sirius PSB. Furthermore, we show that under certain conditions when BLAST is unable to find related proteins, Sirius PSB can identify functionally related proteins based on their biophysical similarities. Sirius PSB and its related supplements are available at: http://compbio.ddns.comp.nus.edu.sg/~sirius.

  1. CISAPS: Complex Informational Spectrum for the Analysis of Protein Sequences

    Directory of Open Access Journals (Sweden)

    Charalambos Chrysostomou

    2015-01-01

    Complex informational spectrum analysis for protein sequences (CISAPS) and its web-based server are developed and presented. As recent studies show, using only the absolute spectrum in the informational spectrum analysis of protein sequences is insufficient. Therefore, CISAPS is developed to consider and provide results in three forms: absolute, real, and imaginary spectrum. Biologically relevant features, as shown in the case study on influenza A subtypes presented here, can also appear individually in either the real or the imaginary spectrum. As the results show, protein classes can present similarities or differences according to the features extracted from the CISAPS web server. These associations are likely related to the protein feature that the specific amino acid index represents. In addition, various technical issues that may affect the analysis, such as zero-padding and windowing, are also addressed. To perform the analysis, CISAPS uses an expanded list of 611 unique amino acid indices, each of which represents a different property. This web-based server enables researchers with little knowledge of signal processing methods to apply complex informational spectrum analysis in their work.

  2. CAFE: aCcelerated Alignment-FrEe sequence analysis.

    Science.gov (United States)

    Lu, Yang Young; Tang, Kujin; Ren, Jie; Fuhrman, Jed A; Waterman, Michael S; Sun, Fengzhu

    2017-07-03

    Alignment-free genome and metagenome comparisons are increasingly important with the development of next generation sequencing (NGS) technologies. Recently developed state-of-the-art k-mer based alignment-free dissimilarity measures including CVTree, $d_2^*$ and $d_2^S$ are more computationally expensive than measures based solely on the k-mer frequencies. Here, we report a standalone software, aCcelerated Alignment-FrEe sequence analysis (CAFE), for efficient calculation of 28 alignment-free dissimilarity measures. CAFE allows for both assembled genome sequences and unassembled NGS shotgun reads as input, and wraps the output in a standard PHYLIP format. In downstream analyses, CAFE can also be used to visualize the pairwise dissimilarity measures, including dendrograms, heatmap, principal coordinate analysis and network display. CAFE serves as a general k-mer based alignment-free analysis platform for studying the relationships among genomes and metagenomes, and is freely available at https://github.com/younglululu/CAFE. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
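
    As background for the k-mer based measures mentioned above, the following minimal sketch counts k-mers and derives a simple frequency-based dissimilarity. It is not CAFE's code and does not implement CVTree, d2* or d2S; it shows only the plain D2 statistic (the inner product of two k-mer count vectors) and a cosine-type dissimilarity built from it. The function names are hypothetical.

```python
from collections import Counter
from math import sqrt


def kmer_counts(seq, k):
    """Count all overlapping k-mers in a sequence."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def d2(counts_x, counts_y):
    """D2 statistic: inner product of the two k-mer count vectors."""
    return sum(c * counts_y[w] for w, c in counts_x.items())


def cosine_dissimilarity(counts_x, counts_y):
    """1 minus the cosine of the angle between the count vectors.
    Returns 1.0 if either vector is empty."""
    norm = sqrt(d2(counts_x, counts_x) * d2(counts_y, counts_y))
    if norm == 0:
        return 1.0
    return 1.0 - d2(counts_x, counts_y) / norm


if __name__ == "__main__":
    x = kmer_counts("ACGTACGT", 2)  # toy sequences, not real genomes
    y = kmer_counts("ACGTTTTT", 2)
    print(d2(x, y), cosine_dissimilarity(x, y))
```

    Measures of this kind need only one pass over each sequence to build the counts, which is why they are cheaper than measures that also model the background k-mer distribution.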

  3. Environmental impact analysis for the main accidental sequences of ignitor

    International Nuclear Information System (INIS)

    Carpignano, A.; Francabandiera, S.; Vella, R.; Zucchetti, M.

    1996-01-01

    A safety analysis study has been applied to the Ignitor machine using Probabilistic Safety Assessment. The main initiating events have been identified, and accident sequences have been studied by means of traditional methods such as Failure Mode and Effect Analysis (FMEA), Fault Trees (FT) and Event Trees (ET). The consequences of the radioactive environmental releases have been assessed in terms of Effective Dose Equivalents (EDEs) to the Most Exposed Individuals (MEI) of the chosen site, by means of a population dose code. Results point out the low environmental impact of the machine. 13 refs., 1 fig., 3 tabs

  4. Analysis of sequence diversity through internal transcribed spacers and simple sequence repeats to identify Dendrobium species.

    Science.gov (United States)

    Liu, Y T; Chen, R K; Lin, S J; Chen, Y C; Chin, S W; Chen, F C; Lee, C Y

    2014-04-08

    The Orchidaceae is one of the largest and most diverse families of flowering plants. The Dendrobium genus has high economic potential as ornamental plants and for medicinal purposes. In addition, the species of this genus are able to produce large crops. However, many Dendrobium varieties are very similar in outward appearance, making it difficult to distinguish one species from another. This study demonstrated that the 12 Dendrobium species used in this study may be divided into 2 groups by internal transcribed spacer (ITS) sequence analysis. Red and yellow flowers may also be used to separate these species into 2 main groups. In particular, the deciduous characteristic is associated with the ITS genetic diversity of the A group. Of 53 designed simple sequence repeat (SSR) primer pairs, 7 pairs were polymorphic for polymerase chain reaction products that were amplified from a specific band. The results of this study demonstrate that these 7 SSR primer pairs may potentially be used to identify Dendrobium species and their progeny in future studies.

  5. Using Behavior Sequence Analysis to Map Serial Killers' Life Histories.

    Science.gov (United States)

    Keatley, David A; Golightly, Hayley; Shephard, Rebecca; Yaksic, Enzo; Reid, Sasha

    2018-03-01

    The aim of the current research was to provide a novel method for mapping the developmental sequences of serial killers' life histories. An in-depth biographical account of serial killers' lives, from birth through to conviction, was gained and analyzed using Behavior Sequence Analysis. The analyses highlight similarities in behavioral events across the serial killers' lives, indicating not only which risk factors occur, but the temporal order of these factors. Results focused on early childhood environment, indicating the role of parental abuse; behaviors and events surrounding criminal histories of serial killers, showing that many had previous convictions and were known to police for other crimes; behaviors surrounding their murders, highlighting differences in victim choice and modus operandi; and, finally, trial pleas and convictions. The present research, therefore, provides a novel approach to synthesizing large volumes of data on criminals and presenting results in accessible, understandable outcomes.

  6. Swab-to-Sequence: Real-time Data Analysis Platform for the Biomolecule Sequencer

    Data.gov (United States)

    National Aeronautics and Space Administration — DNA was successfully sequenced on the ISS in 2016, but the DNA sequenced was prepared on the ground. With FY’16 IRAD funds, the same team developed a...

  7. Sequence comparison and phylogenetic analysis of core gene of ...

    African Journals Online (AJOL)

    2010-07-19

    Jul 19, 2010 ... and antisense primers, a single band of 573 base pairs .... Amino acid sequence alignment of Cluster I and Cluster II of phylogenetic tree. First ten sequences ... sequence weighting, position-specific gap penalties and weight.

  8. Linear discriminant analysis of character sequences using occurrences of words

    KAUST Repository

    Dutta, Subhajit; Chaudhuri, Probal; Ghosh, Anil

    2014-01-01

    Classification of character sequences, where the characters come from a finite set, arises in disciplines such as molecular biology and computer science. For discriminant analysis of such character sequences, the Bayes classifier based on Markov models turns out to have class boundaries defined by linear functions of occurrences of words in the sequences. It is shown that for such classifiers based on Markov models with unknown orders, if the orders are estimated from the data using cross-validation, the resulting classifier has Bayes risk consistency under suitable conditions. Even when Markov models are not valid for the data, we develop methods for constructing classifiers based on linear functions of occurrences of words, where the word length is chosen by cross-validation. Such linear classifiers are constructed using ideas of support vector machines, regression depth, and distance weighted discrimination. We show that classifiers with linear class boundaries have certain optimal properties in terms of their asymptotic misclassification probabilities. The performance of these classifiers is demonstrated in various simulated and benchmark data sets.
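
    The statement that a Markov-model Bayes classifier has linear class boundaries in word occurrences can be made concrete for the first-order case, where the words are adjacent character pairs. The sketch below is an illustration under assumptions, not the authors' method: the Markov order is fixed at one rather than chosen by cross-validation, class priors and initial-state distributions are taken to be equal (so those terms cancel), add-one smoothing is used, and all function names are hypothetical.

```python
from collections import Counter
from math import log


def pair_counts(seq):
    """Occurrences of each 2-letter word (adjacent character pair)."""
    return Counter(zip(seq, seq[1:]))


def fit_transitions(seqs, alphabet, pseudo=1.0):
    """Estimate first-order transition probabilities with smoothing."""
    counts = Counter()
    for s in seqs:
        counts.update(pair_counts(s))
    probs = {}
    for a in alphabet:
        row_total = sum(counts[(a, b)] for b in alphabet) + pseudo * len(alphabet)
        for b in alphabet:
            probs[(a, b)] = (counts[(a, b)] + pseudo) / row_total
    return probs


def linear_score(seq, probs_1, probs_2):
    """Log-likelihood ratio of class 1 versus class 2.
    It is a weighted sum of word counts, hence linear in them;
    a positive score favours class 1."""
    return sum(n * (log(probs_1[w]) - log(probs_2[w]))
               for w, n in pair_counts(seq).items())


if __name__ == "__main__":
    alphabet = "AB"
    sticky = fit_transitions(["AAAAAA", "BBBBBB"], alphabet)       # toy class 1
    alternating = fit_transitions(["ABABAB", "BABABA"], alphabet)  # toy class 2
    print(linear_score("AAAA", sticky, alternating))
    print(linear_score("ABAB", sticky, alternating))
```

    The weights log p1(w) - log p2(w) play the role of the coefficients of the linear boundary; the support-vector and related approaches mentioned in the abstract would instead fit such coefficients directly.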

  9. Planarian homeobox genes: cloning, sequence analysis, and expression.

    Science.gov (United States)

    Garcia-Fernàndez, J; Baguñà, J; Saló, E

    1991-01-01

    Freshwater planarians (Platyhelminthes, Turbellaria, and Tricladida) are acoelomate, triploblastic, unsegmented, and bilaterally symmetrical organisms that are mainly known for their ample power to regenerate a complete organism from a small piece of their body. To identify potential pattern-control genes in planarian regeneration, we have isolated two homeobox-containing genes, Dth-1 and Dth-2 [Dugesia (Girardia) tigrina homeobox], by using degenerate oligonucleotides corresponding to the most conserved amino acid sequence from helix-3 of the homeodomain. Dth-1 and Dth-2 homeodomains are closely related (68% at the nucleotide level and 78% at the protein level) and show the conserved residues characteristic of the homeodomains identified to date. Similarity with most homeobox sequences is low (30-50%), except with Drosophila NK homeodomains (80-82% with NK-2) and the rodent TTF-1 homeodomain (77-87%). Some unusual amino acid residues specific to NK-2, TTF-1, Dth-1, and Dth-2 can be observed in the recognition helix (helix-3) and may define a family of homeodomains. The deduced amino acid sequences from the cDNAs contain, in addition to the homeodomain, other domains also present in various homeobox-containing genes. The expression of both genes, detected by Northern blot analysis, appears slightly higher in cephalic regions than in the rest of the intact organism, while a slight increase is detected in the central period (5 days) of regeneration. PMID:1714599

  10. Analysis of correlations between sites in models of protein sequences

    International Nuclear Information System (INIS)

    Giraud, B.G.; Lapedes, A.; Liu, L.C.

    1998-01-01

    A criterion based on conditional probabilities, related to the concept of algorithmic distance, is used to detect correlated mutations at noncontiguous sites on sequences. We apply this criterion to the problem of analyzing correlations between sites in protein sequences; however, the analysis applies generally to networks of interacting sites with discrete states at each site. Elementary models, where explicit results can be derived easily, are introduced. The number of states per site considered ranges from 2, illustrating the relation to familiar classical spin systems, to 20 states, suitable for representing amino acids. Numerical simulations show that the criterion remains valid even when the genetic history of the data samples (e.g., protein sequences), as represented by a phylogenetic tree, introduces nonindependence between samples. Statistical fluctuations due to finite sampling are also investigated and do not invalidate the criterion. A subsidiary result is found: The more homogeneous a population, the more easily its average properties can drift from the properties of its ancestor. © 1998 The American Physical Society

  11. Linear discriminant analysis of character sequences using occurrences of words

    KAUST Repository

    Dutta, Subhajit

    2014-02-01

    Classification of character sequences, where the characters come from a finite set, arises in disciplines such as molecular biology and computer science. For discriminant analysis of such character sequences, the Bayes classifier based on Markov models turns out to have class boundaries defined by linear functions of occurrences of words in the sequences. It is shown that for such classifiers based on Markov models with unknown orders, if the orders are estimated from the data using cross-validation, the resulting classifier has Bayes risk consistency under suitable conditions. Even when Markov models are not valid for the data, we develop methods for constructing classifiers based on linear functions of occurrences of words, where the word length is chosen by cross-validation. Such linear classifiers are constructed using ideas of support vector machines, regression depth, and distance weighted discrimination. We show that classifiers with linear class boundaries have certain optimal properties in terms of their asymptotic misclassification probabilities. The performance of these classifiers is demonstrated in various simulated and benchmark data sets.

  12. Sequence analysis of PROTEOLYSIS 6 from Solanum lycopersicum

    Science.gov (United States)

    Roslan, Nur Farhana; Chew, Bee Lyn; Goh, Hoe-Han; Isa, Nurulhikma Md

    2018-04-01

    The N-end rule pathway is a protein degradation pathway that relates the half-life of a protein to the identity of its N-terminal residue. Destabilizing N-terminal residues are created by enzymatic reactions or chemical modifications. The destabilized substrate is recognized by the PROTEOLYSIS 6 (PRT6) protein, an E3 ligase enzyme, which results in substrate degradation by the proteasome. PRT6 has been studied in Arabidopsis thaliana and barley but has not yet been studied in fleshy fruit plants. Hence, this study was carried out in tomato, which is known as the model for fleshy fruit plants. BLASTX analysis identified Solyc09g010830 as the gene encoding PRT6 in tomato, based on its sequence similarity with PRT6 in A. thaliana. In silico gene expression analysis shows that the PRT6 gene was highly expressed in tomato fruits at the breaker +5 stage. Co-expression analysis shows that PRT6 may be involved not only in abiotic stresses but also in biotic stresses. The objective is to analyze the sequence and characterize the PRT6 gene in tomato.

  13. Determining physical constraints in transcriptional initiation complexes using DNA sequence analysis

    Energy Technology Data Exchange (ETDEWEB)

    Shultzaberger, Ryan K.; Chiang, Derek Y.; Moses, Alan M.; Eisen, Michael B.

    2007-07-01

    Eukaryotic gene expression is often under the control of cooperatively acting transcription factors whose binding is limited by structural constraints. By determining these structural constraints, we can understand the "rules" that define functional cooperativity. Conversely, by understanding the rules of binding, we can infer structural characteristics. We have developed an information theory based method for approximating the physical limitations of cooperative interactions by comparing sequence analysis to microarray expression data. When applied to the coordinated binding of the sulfur amino acid regulatory protein Met4 by Cbf1 and Met31, we were able to create a combinatorial model that can correctly identify Met4 regulated genes.

  14. Streaming support for data intensive cloud-based sequence analysis.

    Science.gov (United States)

    Issa, Shadi A; Kienzler, Romeo; El-Kalioby, Mohamed; Tonellato, Peter J; Wall, Dennis; Bruggmann, Rémy; Abouelhoda, Mohamed

    2013-01-01

    Cloud computing provides a promising solution to the genomics data deluge problem resulting from the advent of next-generation sequencing (NGS) technology. Based on the concepts of "resources-on-demand" and "pay-as-you-go", scientists with no or limited infrastructure can have access to scalable and cost-effective computational resources. However, the large size of NGS data causes a significant data transfer latency from the client's site to the cloud, which presents a bottleneck for using cloud computing services. In this paper, we provide a streaming-based scheme to overcome this problem, where the NGS data is processed while being transferred to the cloud. Our scheme targets the wide class of NGS data analysis tasks, where the NGS sequences can be processed independently from one another. We also provide the elastream package that supports the use of this scheme with individual analysis programs or with workflow systems. Experiments presented in this paper show that our solution mitigates the effect of data transfer latency and saves both time and cost of computation.
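
    The streaming idea, processing reads while the data is still arriving, can be sketched with a Python generator. This is not the elastream package and does not use its interface; it is a minimal illustration that assumes four-line FASTQ records, treats the input as an iterable of text chunks of arbitrary size (standing in for a network transfer), and uses hypothetical function names.

```python
def fastq_records(chunks):
    """Yield (name, sequence, quality) as soon as each four-line
    FASTQ record is complete, without waiting for the full transfer."""
    buffer = ""
    lines = []
    for chunk in chunks:
        buffer += chunk
        # Everything before the last newline is complete; keep the rest.
        *complete, buffer = buffer.split("\n")
        for line in complete:
            lines.append(line)
            if len(lines) == 4:
                yield lines[0][1:], lines[1], lines[3]
                lines = []
    # Handle a final record that lacks a trailing newline.
    if buffer:
        lines.append(buffer)
        if len(lines) == 4:
            yield lines[0][1:], lines[1], lines[3]


def gc_fraction(seq):
    """Fraction of G and C bases; an example of per-read processing."""
    return sum(base in "GC" for base in seq.upper()) / len(seq) if seq else 0.0


if __name__ == "__main__":
    data = "@r1\nACGT\n+\nIIII\n@r2\nGGCC\n+\nIIII\n"  # toy reads
    incoming = (data[i:i + 5] for i in range(0, len(data), 5))
    for name, seq, _qual in fastq_records(incoming):
        print(name, gc_fraction(seq))
```

    Because each read is handled independently, the per-read work overlaps with the transfer, which is the property the abstract relies on.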

  15. Streaming Support for Data Intensive Cloud-Based Sequence Analysis

    Directory of Open Access Journals (Sweden)

    Shadi A. Issa

    2013-01-01

    Cloud computing provides a promising solution to the genomics data deluge problem resulting from the advent of next-generation sequencing (NGS) technology. Based on the concepts of “resources-on-demand” and “pay-as-you-go”, scientists with no or limited infrastructure can have access to scalable and cost-effective computational resources. However, the large size of NGS data causes a significant data transfer latency from the client’s site to the cloud, which presents a bottleneck for using cloud computing services. In this paper, we provide a streaming-based scheme to overcome this problem, where the NGS data is processed while being transferred to the cloud. Our scheme targets the wide class of NGS data analysis tasks, where the NGS sequences can be processed independently from one another. We also provide the elastream package that supports the use of this scheme with individual analysis programs or with workflow systems. Experiments presented in this paper show that our solution mitigates the effect of data transfer latency and saves both time and cost of computation.

  16. Next-generation sequence analysis of cancer xenograft models.

    Directory of Open Access Journals (Sweden)

    Fernando J Rossello

    Next-generation sequencing (NGS) studies in cancer are limited by the amount, quality and purity of tissue samples. In this situation, primary xenografts have proven useful preclinical models. However, the presence of mouse-derived stromal cells represents a technical challenge to their use in NGS studies. We examined this problem in an established primary xenograft model of small cell lung cancer (SCLC), a malignancy often diagnosed from small biopsy or needle aspirate samples. Using an in silico strategy that assigns reads according to species-of-origin, we prospectively compared NGS data from primary xenograft models with matched cell lines and with published datasets. We show here that low-coverage whole-genome analysis demonstrated remarkable concordance between published genome data and internal controls, despite the presence of mouse genomic DNA. Exome capture sequencing revealed that this enrichment procedure was highly species-specific, with less than 4% of reads aligning to the mouse genome. Human-specific expression profiling with RNA-Seq replicated array-based gene expression experiments, whereas mouse-specific transcript profiles correlated with published datasets from human cancer stroma. We conclude that primary xenografts represent a useful platform for complex NGS analysis in cancer research for tumours with limited sample resources, or those with prominent stromal cell populations.

  17. Streaming Support for Data Intensive Cloud-Based Sequence Analysis

    Science.gov (United States)

    Issa, Shadi A.; Kienzler, Romeo; El-Kalioby, Mohamed; Tonellato, Peter J.; Wall, Dennis; Bruggmann, Rémy; Abouelhoda, Mohamed

    2013-01-01

    Cloud computing provides a promising solution to the genomics data deluge problem resulting from the advent of next-generation sequencing (NGS) technology. Based on the concepts of “resources-on-demand” and “pay-as-you-go”, scientists with no or limited infrastructure can have access to scalable and cost-effective computational resources. However, the large size of NGS data causes a significant data transfer latency from the client's site to the cloud, which presents a bottleneck for using cloud computing services. In this paper, we provide a streaming-based scheme to overcome this problem, where the NGS data is processed while being transferred to the cloud. Our scheme targets the wide class of NGS data analysis tasks, where the NGS sequences can be processed independently from one another. We also provide the elastream package that supports the use of this scheme with individual analysis programs or with workflow systems. Experiments presented in this paper show that our solution mitigates the effect of data transfer latency and saves both time and cost of computation. PMID:23710461

  18. Extended -Regular Sequence for Automated Analysis of Microarray Images

    Directory of Open Access Journals (Sweden)

    Jin Hee-Jeong

    2006-01-01

    Full Text Available Microarray study enables us to obtain hundreds of thousands of expressions of genes or genotypes at once, and it is an indispensable technology for genome research. The first step is the analysis of scanned microarray images. This is the most important procedure for obtaining biologically reliable data. Currently most microarray image processing systems require burdensome manual block/spot indexing work. Since the amount of experimental data is increasing very quickly, automated microarray image analysis software becomes important. In this paper, we propose two automated methods for analyzing microarray images. First, we propose the extended -regular sequence to index blocks and spots, which enables a novel automatic gridding procedure. Second, we provide a methodology, hierarchical metagrid alignment, to allow reliable and efficient batch processing for a set of microarray images. Experimental results show that the proposed methods are more reliable and convenient than the commercial tools.

  19. Sequence Quality Analysis Tool for HIV Type 1 Protease and Reverse Transcriptase

    OpenAIRE

    DeLong, Allison K.; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W.; Kantor, Rami

    2012-01-01

    Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802...

  20. The relation between multilocus population genetics and social evolution theory.

    Science.gov (United States)

    Gardner, Andy; West, Stuart A; Barton, Nicholas H

    2007-02-01

    Evolution at multiple gene positions is complicated. Direct selection on one gene disturbs the evolutionary dynamics of associated genes. Recent years have seen the development of a multilocus methodology for modeling evolution at arbitrary numbers of gene positions with arbitrary dominance and epistatic relations, mode of inheritance, genetic linkage, and recombination. We show that the approach is conceptually analogous to social evolutionary methodology, which focuses on selection acting on associated individuals. In doing so, we (1) make explicit the links between the multilocus methodology and the foundations of social evolution theory, namely, Price's theorem and Hamilton's rule; (2) relate the multilocus approach to levels-of-selection and neighbor-modulated-fitness approaches in social evolution; (3) highlight the equivalence between genetical hitchhiking and kin selection; (4) demonstrate that the multilocus methodology allows for social evolutionary analyses involving coevolution of multiple traits and genetical associations between nonrelatives, including individuals of different species; (5) show that this methodology helps solve problems of dynamic sufficiency in social evolution theory; (6) form links between invasion criteria in multilocus systems and Hamilton's rule of kin selection; (7) illustrate the generality and exactness of Hamilton's rule, which has previously been described as an approximate, heuristic result.

  1. Sequencing Infrastructure Investments under Deep Uncertainty Using Real Options Analysis

    Directory of Open Access Journals (Sweden)

    Nishtha Manocha

    2018-02-01

    The adaptation tipping point and adaptation pathway approach developed to make decisions under deep uncertainty do not shed light on which among the multiple available pathways should be chosen as the preferred pathway. This creates the need to extend these approaches by means of suitable tools that can help sequence actions and subsequently enable the outlining of relevant policies. This paper presents two sequencing approaches, namely, the “Build to Target” and “Build Up” approach, to aid in sub-selecting a set of preferred pathways. The two approaches differ in the levels of flexibility they offer. They are exemplified by means of two case studies wherein the Net Present Valuation and the Real Options Analysis are employed as selection criteria. The results demonstrate the benefit of these two approaches when used in conjunction with the adaptation pathways and show how the pathways selected by means of a Build to Target approach generally have a value greater than, or at least the same as, the pathways selected by the Build Up approach. Further, this paper also demonstrates the capacity of Real Options to quantify and capture the economic value of flexibility, which cannot be done by traditional valuation approaches such as Net Present Valuation.

  2. Reverse transcriptase sequences from mulberry LTR retrotransposons: characterization analysis

    Directory of Open Access Journals (Sweden)

    Ma Bi

    2017-10-01

    Copia and Gypsy play important roles in structural, functional and evolutionary dynamics of plant genomes. In this study, a total of 106 Copia and 101 Gypsy reverse transcriptase (rt) sequences were amplified from the Morus notabilis genome using degenerate primers. All sequences exhibited high levels of heterogeneity and were rich in AT, and Copia rt showed higher sequence divergence than Gypsy rt. Two reasons are likely to account for this phenomenon: (a) these elements often experience deletions or fragmentation by illegitimate or unequal homologous recombination in the transposition process; (b) strong purifying selective pressure drives the evolution of these elements through “selective silencing” with random mutation and eventual deletion from the host genome. Interestingly, mulberry rt clustered with other rt from distantly related taxa according to the phylogenetic analysis. This phenomenon did not result from horizontal transposable element transfer. Results obtained from fluorescence in situ hybridization revealed that most of the hybridization signals were preferentially concentrated in pericentromeric and distal regions of chromosomes, and these elements may play important roles in the regions in which they are found. Results of this study support the continued pursuit of further functional studies of Copia and Gypsy in the mulberry genome.

  3. Nonlinear analysis of sequence repeats of multi-domain proteins

    Energy Technology Data Exchange (ETDEWEB)

    Huang Yanzhao; Li Mingfeng; Xiao Yi [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China)]. E-mail: lmf_bill@sina.com

    2007-11-15

    Many multi-domain proteins have repetitive three-dimensional structures but nearly-random amino acid sequences. In the present paper, by using a modified recurrence plot proposed by us previously, we show that these amino acid sequences have hidden repetitions in fact. These results indicate that the repetitive domain structures are encoded by the repetitive sequences. This also gives a method to detect the repetitive domain structures directly from amino acid sequences.
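
    As a rough illustration of how a recurrence plot can expose repeats in a symbol sequence, the sketch below builds the plain (unmodified) recurrence matrix and counts recurrent points along each off-diagonal. It is not the authors' modified recurrence plot, whose details are not given here; the function names and the `window` parameter are assumptions made for the example.

```python
def recurrence_matrix(seq, window=3):
    """R[i][j] = 1 when the length-`window` words starting at i and j are identical."""
    n = len(seq) - window + 1
    words = [seq[i:i + window] for i in range(n)]
    return [[1 if words[i] == words[j] else 0 for j in range(n)] for i in range(n)]


def repeat_offsets(seq, window=3):
    """Count recurrent points on each off-diagonal d = j - i.

    A diagonal with many points suggests that the sequence repeats
    itself with period d.
    """
    r = recurrence_matrix(seq, window)
    n = len(r)
    return {d: sum(r[i][i + d] for i in range(n - d)) for d in range(1, n)}


# Toy sequence with an exact repeat of period 5.
counts = repeat_offsets("ABCDEABCDE", window=3)
best_offset = max(counts, key=counts.get)
```

    For real protein sequences an exact-match criterion is usually too strict; a similarity threshold between windows would be the natural relaxation.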

  4. Human factors review for Severe Accident Sequence Analysis (SASA)

    International Nuclear Information System (INIS)

    Krois, P.A.; Haas, P.M.; Manning, J.J.; Bovell, C.R.

    1984-01-01

    The paper will discuss work being conducted during this human factors review including: (1) support of the Severe Accident Sequence Analysis (SASA) Program based on an assessment of operator actions, and (2) development of a descriptive model of operator severe accident management. Research by SASA analysts on the Browns Ferry Unit One (BF1) anticipated transient without scram (ATWS) was supported through a concurrent assessment of operator performance to demonstrate contributions to SASA analyses from human factors data and methods. A descriptive model was developed called the Function Oriented Accident Management (FOAM) model, which serves as a structure for bridging human factors, operations, and engineering expertise and which is useful for identifying needs/deficiencies in the area of accident management. The assessment of human factors issues related to ATWS required extensive coordination with SASA analysts. The analysis was consolidated primarily to six operator actions identified in the Emergency Procedure Guidelines (EPGs) as being the most critical to the accident sequence. These actions were assessed through simulator exercises, qualitative reviews, and quantitative human reliability analyses. The FOAM descriptive model assumes as a starting point that multiple operator/system failures exceed the scope of procedures and necessitate a knowledge-based emergency response by the operators. The FOAM model provides a functionally-oriented structure for assembling human factors, operations, and engineering data and expertise into operator guidance for unconventional emergency responses to mitigate severe accident progression and avoid/minimize core degradation. Operators must also respond to potential radiological release beyond plant protective barriers. Research needs in accident management and potential uses of the FOAM model are described. 11 references, 1 figure

  5. Sequence analysis of cereal sucrose synthase genes and isolation ...

    African Journals Online (AJOL)

    SERVER

    2007-10-18

    Oct 18, 2007 ... sequencing of sucrose synthase gene fragment from sorghum using primers designed at their conserved exons. MATERIALS AND METHODS. Multiple sequence alignment. Sucrose synthase gene sequences of various cereals like rice, maize, and barley were accessed from NCBI GenBank database.

  6. Chimera: construction of chimeric sequences for phylogenetic analysis

    NARCIS (Netherlands)

    Leunissen, J.A.M.

    2003-01-01

    Chimera allows the construction of chimeric protein or nucleic acid sequence files by concatenating sequences from two or more sequence files in PHYLIP formats. It allows the user to interactively select genes and species from the input files. The concatenated result is stored to one single output
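
    The concatenation step described above can be sketched as follows. This is not the Chimera program itself: it assumes relaxed sequential PHYLIP input (a header line with taxon and character counts, then one `name sequence` line per taxon), and padding a taxon that is missing from one file with gap characters is a design choice of this example only. All function names are hypothetical.

```python
def parse_phylip(text):
    """Parse relaxed sequential PHYLIP text into ({name: sequence}, n_chars)."""
    lines = [ln for ln in text.strip().splitlines() if ln.strip()]
    n_taxa, n_chars = map(int, lines[0].split())
    records = {}
    for ln in lines[1:1 + n_taxa]:
        name, seq = ln.split(None, 1)
        seq = "".join(seq.split())  # drop spaces inside the sequence
        if len(seq) != n_chars:
            raise ValueError(f"{name}: expected {n_chars} characters, got {len(seq)}")
        records[name] = seq
    return records, n_chars


def concatenate(alignments, taxa=None):
    """Join per-gene alignments into one chimeric sequence per taxon.

    `alignments` is a list of PHYLIP texts; `taxa` optionally restricts and
    orders the species to keep (default: all taxa, sorted by name).
    """
    parsed = [parse_phylip(a) for a in alignments]
    if taxa is None:
        taxa = sorted(set().union(*(recs for recs, _ in parsed)))
    # A taxon absent from one gene is padded with gaps (assumption of this sketch).
    return {t: "".join(recs.get(t, "-" * n) for recs, n in parsed) for t in taxa}


def to_phylip(records):
    """Write the concatenated result back as relaxed sequential PHYLIP text."""
    length = len(next(iter(records.values())))
    body = "\n".join(f"{name} {seq}" for name, seq in records.items())
    return f"{len(records)} {length}\n{body}\n"


gene_a = "2 4\nhuman ACGT\nmouse ACGA\n"
gene_b = "2 3\nhuman TTT\nrat GGG\n"
chimeric = concatenate([gene_a, gene_b])
```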

  7. Accident Sequence Evaluation Program: Human reliability analysis procedure

    Energy Technology Data Exchange (ETDEWEB)

    Swain, A.D.

    1987-02-01

    This document presents a shortened version of the procedure, models, and data for human reliability analysis (HRA) which are presented in the Handbook of Human Reliability Analysis with Emphasis on Nuclear Power Plant Applications (NUREG/CR-1278, August 1983). This shortened version was prepared and tried out as part of the Accident Sequence Evaluation Program (ASEP) funded by the US Nuclear Regulatory Commission and managed by Sandia National Laboratories. The intent of this new HRA procedure, called the "ASEP HRA Procedure," is to enable systems analysts, with minimal support from experts in human reliability analysis, to make estimates of human error probabilities and other human performance characteristics which are sufficiently accurate for many probabilistic risk assessments. The ASEP HRA Procedure consists of a Pre-Accident Screening HRA, a Pre-Accident Nominal HRA, a Post-Accident Screening HRA, and a Post-Accident Nominal HRA. The procedure in this document includes changes made after tryout and evaluation of the procedure in four nuclear power plants by four different systems analysts and related personnel, including human reliability specialists. The changes consist of some additional explanatory material (including examples), and more detailed definitions of some of the terms. 42 refs.

  8. Accident Sequence Evaluation Program: Human reliability analysis procedure

    International Nuclear Information System (INIS)

    Swain, A.D.

    1987-02-01

    This document presents a shortened version of the procedure, models, and data for human reliability analysis (HRA) which are presented in the Handbook of Human Reliability Analysis with Emphasis on Nuclear Power Plant Applications (NUREG/CR-1278, August 1983). This shortened version was prepared and tried out as part of the Accident Sequence Evaluation Program (ASEP) funded by the US Nuclear Regulatory Commission and managed by Sandia National Laboratories. The intent of this new HRA procedure, called the "ASEP HRA Procedure," is to enable systems analysts, with minimal support from experts in human reliability analysis, to make estimates of human error probabilities and other human performance characteristics which are sufficiently accurate for many probabilistic risk assessments. The ASEP HRA Procedure consists of a Pre-Accident Screening HRA, a Pre-Accident Nominal HRA, a Post-Accident Screening HRA, and a Post-Accident Nominal HRA. The procedure in this document includes changes made after tryout and evaluation of the procedure in four nuclear power plants by four different systems analysts and related personnel, including human reliability specialists. The changes consist of some additional explanatory material (including examples), and more detailed definitions of some of the terms. 42 refs

  9. A Quantitative Accident Sequence Analysis for a VHTR

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Jintae; Lee, Joeun; Jae, Moosung [Hanyang University, Seoul (Korea, Republic of)

    2016-05-15

    In Korea, the basic design features of VHTR are currently discussed in the various design concepts. Probabilistic risk assessment (PRA) offers a logical and structured method to assess risks of a large and complex engineered system, such as a nuclear power plant. It will be introduced at an early stage in the design, and will be upgraded at various design and licensing stages as the design matures and the design details are defined. Risk insights to be developed from the PRA are viewed as essential to developing a design that is optimized in meeting safety objectives and in interpreting the applicability of the existing demands to the safety design approach of the VHTR. In this study, initiating events which may occur in VHTRs were selected through the MLD method. The initiating events were then grouped into four categories for the accident sequence analysis. Initiating event frequencies and safety system failure rates were calculated by using reliability data obtained from the available sources and fault tree analysis. After quantification, uncertainty analysis was conducted. The SR and LR frequencies are calculated as 7.52E-10/RY and 7.91E-16/RY, respectively, which are lower than the core damage frequency of LWRs.

  10. Comparing methods of classifying life courses: Sequence analysis and latent class analysis

    NARCIS (Netherlands)

    Elzinga, C.H.; Liefbroer, Aart C.; Han, Sapphire

    2017-01-01

    We compare life course typology solutions generated by sequence analysis (SA) and latent class analysis (LCA). First, we construct an analytic protocol to arrive at typology solutions for both methodologies and present methods to compare the empirical quality of alternative typologies. We apply this

  11. Comparing methods of classifying life courses: sequence analysis and latent class analysis

    NARCIS (Netherlands)

    Han, Y.; Liefbroer, A.C.; Elzinga, C.

    2017-01-01

    We compare life course typology solutions generated by sequence analysis (SA) and latent class analysis (LCA). First, we construct an analytic protocol to arrive at typology solutions for both methodologies and present methods to compare the empirical quality of alternative typologies. We apply this

  12. RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis.

    Science.gov (United States)

    Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab

    2012-01-01

    RDNAnalyzer is an innovative computer based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate a DNA sequence, or users can upload sequences of their own interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various applications for sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides tools for sequence analyses routinely performed by biological scientists, such as DNA replication, reverse complement generation, transcription, translation, sequence-specific information such as the total number of nucleotide bases and ATGC base contents with their respective percentages, and a sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation and provides a user-friendly environment for sequence analysis. It is freely available. http://www.cemb.edu.pk/sw.html RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language.
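
    For readers unfamiliar with the Nussinov algorithm mentioned above, a minimal textbook version is sketched below. It only maximises the number of Watson-Crick pairs (A-T, G-C) in a single DNA strand and is not RDNAnalyzer's extended implementation; the function name and the `min_loop` parameter (minimum number of unpaired bases enclosed by a pair) are assumptions of this example.

```python
def nussinov_max_pairs(seq, min_loop=3):
    """Maximum number of nested A-T / G-C pairs in `seq` (textbook Nussinov).

    dp[i][j] holds the best pair count for the subsequence seq[i..j].
    """
    pairs = {("A", "T"), ("T", "A"), ("G", "C"), ("C", "G")}
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        return 0
    dp = [[0] * n for _ in range(n)]
    # Fill by increasing subsequence span; shorter spans cannot hold a pair.
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case 1: base j is left unpaired
            # case 2: base j pairs with some k, splitting the problem in two
            for k in range(i, j - min_loop):
                if (seq[k], seq[j]) in pairs:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]


# A simple hairpin: three G-C pairs closing a three-base loop.
hairpin_pairs = nussinov_max_pairs("GGGAAACCC")
```

    The algorithm runs in O(n^3) time and O(n^2) memory; recovering the actual base pairings requires a traceback through `dp`, which is omitted here for brevity.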

  13. Frame sequences analysis technique of linear objects movement

    Science.gov (United States)

    Oshchepkova, V. Y.; Berg, I. A.; Shchepkin, D. V.; Kopylova, G. V.

    2017-12-01

    Obtaining data by noninvasive methods is often needed in many fields of science and engineering. This is achieved through video recording at various frame rates and in various light spectra. Quantitative analysis of the movement of the objects being studied thus becomes an important component of the research. This work discusses analysis of the motion of linear objects on a two-dimensional plane. The complexity of this problem increases when the frame contains numerous objects whose images may overlap. This study uses a sequence containing 30 frames at a resolution of 62 × 62 pixels and a frame rate of 2 Hz. It was required to determine the average velocity of object motion. This velocity was found as an average over 8-12 objects with an error of 15%. After processing, the dependencies of the average velocity on control parameters were found. The processing was performed in the GMimPro software environment, with subsequent approximation of the data using the Hill equation.

  14. Transcriptome sequencing and positive selected genes analysis of Bombyx mandarina.

    Directory of Open Access Journals (Sweden)

    Tingcai Cheng

    The wild silkworm Bombyx mandarina is widely believed to be an ancestor of the domesticated silkworm, Bombyx mori. Silkworms are often used as a model for studying the mechanism of species domestication. Here, we performed transcriptome sequencing of the wild silkworm using an Illumina HiSeq2000 platform. We produced 100,004,078 high-quality reads and assembled them into 50,773 contigs with an N50 length of 1764 bp and a mean length of 941.62 bp. A total of 33,759 unigenes were identified, with 12,805 annotated in the Nr database, 8273 in the Pfam database, and 9093 in the Swiss-Prot database. Expression profile analysis found significant differential expression of 1308 unigenes between the middle silk gland (MSG) and posterior silk gland (PSG). Three sericin genes (sericin 1, sericin 2, and sericin 3) were expressed specifically in the MSG and three fibroin genes (fibroin-H, fibroin-L, and fibroin/P25) were expressed specifically in the PSG. In addition, 32,297 single-nucleotide polymorphisms (SNPs) and 361 insertion-deletions (INDELs) were detected. Comparison with the domesticated silkworm p50/Dazao identified 5,295 orthologous genes, among which 400 might have experienced or be experiencing positive selection according to Ka/Ks analysis. The data and analyses presented here provide insights into silkworm domestication and an invaluable resource for wild silkworm genomics research.
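
    The N50 statistic quoted above is the contig length at which the contigs of that length or longer together cover at least half of the total assembly. A minimal sketch of the usual definition follows; the function name and the toy contig lengths are illustrative and are not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are visited from longest to shortest; N50 is the length of the
    contig at which the running total first reaches half of the assembly size.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("no contig lengths given")


# Toy assembly: total length 54, so the half-way point is 27 (10 + 9 + 8).
toy_contigs = [2, 3, 4, 5, 6, 7, 8, 9, 10]
toy_n50 = n50(toy_contigs)
toy_mean = sum(toy_contigs) / len(toy_contigs)
```

    As in the abstract, N50 is typically larger than the mean contig length because it weights contigs by the bases they contain rather than counting each contig equally.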

  15. Single-Locus versus Multilocus Patterns of Local Adaptation to Climate in Eastern White Pine (Pinus strobus, Pinaceae).

    Directory of Open Access Journals (Sweden)

    Om P Rajora

    Natural plant populations are often adapted to their local climate and environmental conditions, and populations of forest trees offer some of the best examples of this pattern. However, little empirical work has focused on the relative contribution of single-locus versus multilocus effects to the genetic architecture of local adaptation in plants/forest trees. Here, we employ eastern white pine (Pinus strobus) to test the hypothesis that it is the inter-genic effects that primarily drive climate-induced local adaptation. The genetic structure of 29 range-wide natural populations of eastern white pine was determined in relation to local climatic factors using both a reference set of SSR markers, and SNPs located in candidate genes putatively involved in adaptive response to climate. Comparisons were made between marker sets using standard single-locus outlier analysis, single-locus and multilocus environment association analyses and a novel implementation of Population Graphs. Magnitudes of population structure were similar between the two marker sets. Outlier loci consistent with diversifying selection were rare for both SNPs and SSRs. However, genetic distances based on the multilocus among population covariances (cGD) were significantly more correlated to climate, even after correcting for spatial effects, for SNPs as compared to SSRs. Coalescent simulations confirmed that the differences in mutation rates between SSRs and SNPs did not affect the topologies of the Population Graphs, and hence values of cGD and their correlations with associated climate variables. We conclude that the multilocus covariances among populations primarily reflect adaptation to local climate and environment in eastern white pine. This result highlights the complexity of the genetic architecture of adaptive traits, as well as the need to consider multilocus effects in studies of local adaptation.

  16. Cloning, sequencing, and sequence analysis of two novel plasmids from the thermophilic anaerobic bacterium Anaerocellum thermophilum

    DEFF Research Database (Denmark)

    Clausen, Anders; Mikkelsen, Marie Just; Schrøder, I.

    2004-01-01

    The nucleotide sequence of two novel plasmids isolated from the extreme thermophilic anaerobic bacterium Anaerocellum thermophilum DSM6725 (A. thermophilum), growing optimally at 70 °C, has been determined. pBAS2 was found to be a 3653 bp plasmid with a GC content of 43%, and the sequence re...... with highest similarity to DNA repair protein from Campylobacter jejuni (25% aa). Orf34 showed similarity to sigma factors with highest similarity (28% aa) to the sporulation specific Sigma factor, Sigma 28(K) from Bacillus thuringiensis....

  17. A multi-locus phylogenetic evaluation of Diaporthe (Phomopsis)

    NARCIS (Netherlands)

    Udayanga, D.; Liu, X.; Crous, P.W.; McKenzie, E.H.C.; Chukeatirote, E.; Hyde, K.D.

    2012-01-01

    The genus Diaporthe (Phomopsis) includes important plant pathogenic fungi with wide host ranges and geographic distributions. In the present study, phylogenetic species recognition in Diaporthe is re-evaluated using a multi-locus phylogeny based on a combined data matrix of rDNA ITS, and partial

  18. Automatic analysis of the 2015 Gorkha earthquake aftershock sequence.

    Science.gov (United States)

    Baillard, C.; Lyon-Caen, H.; Bollinger, L.; Rietbrock, A.; Letort, J.; Adhikari, L. B.

    2016-12-01

    The Mw 7.8 Gorkha earthquake, which partially ruptured the Main Himalayan Thrust north of Kathmandu on the 25th April 2015, was the largest and most catastrophic earthquake to strike Nepal since the great M8.4 1934 earthquake. This mainshock was followed by multiple aftershocks, among them two notable events that occurred on the 12th May with magnitudes of Mw 7.3 and Mw 6.3. Due to these recent events it became essential for the authorities and for the scientific community to better evaluate the seismic risk in the region through a detailed analysis of the earthquake catalog, including the spatio-temporal distribution of the Gorkha aftershock sequence. Here we complement this first study with a microseismic study using seismic data from the eastern part of the Nepalese Seismological Center network combined with one broadband station in Everest. Our primary goal is to deliver an accurate catalog of the aftershock sequence. Due to the exceptional number of events detected we performed an automatic picking/locating procedure which can be split into 4 steps: 1) coarse picking of the onsets using a classical STA/LTA picker, 2) phase association of picked onsets to detect and declare seismic events, 3) kurtosis pick refinement around theoretical arrival times to increase picking and location accuracy, and 4) local magnitude calculation based on the amplitude of waveforms. This procedure is time efficient (1 sec/event), considerably reduces the location uncertainties (2 to 5 km errors) and increases the number of events detected compared to manual processing. Indeed, the automatic detection rate is 10 times higher than the manual detection rate. By comparing to the USGS catalog we were able to give a new attenuation law to compute local magnitudes in the region. A detailed analysis of the seismicity shows a clear migration toward the east of the region and a sudden decrease of seismicity 100 km east of Kathmandu which may reveal the presence of a tectonic
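
    Step 1 of the procedure above, the classical STA/LTA picker, can be sketched as follows. This is a generic textbook formulation, not the authors' code: the window lengths, the threshold and the synthetic trace are arbitrary values chosen for illustration, and both windows are taken to end at the current sample.

```python
def sta_lta(signal, n_sta, n_lta):
    """Ratio of short-term to long-term average signal energy at each sample.

    Both windows end at the current sample; samples before the first full
    long-term window are assigned a ratio of 0.
    """
    if not 0 < n_sta <= n_lta:
        raise ValueError("require 0 < n_sta <= n_lta")
    # Prefix sums of squared amplitude give O(1) window averages.
    prefix = [0.0]
    for x in signal:
        prefix.append(prefix[-1] + x * x)
    ratios = [0.0] * len(signal)
    for i in range(n_lta - 1, len(signal)):
        sta = (prefix[i + 1] - prefix[i + 1 - n_sta]) / n_sta
        lta = (prefix[i + 1] - prefix[i + 1 - n_lta]) / n_lta
        ratios[i] = sta / lta if lta > 0 else 0.0
    return ratios


def pick_onsets(ratios, threshold):
    """Sample indices where the ratio rises through the trigger threshold."""
    return [i for i in range(1, len(ratios))
            if ratios[i] >= threshold > ratios[i - 1]]


# Synthetic trace: low-amplitude noise floor with a burst starting at sample 50.
trace = [0.1] * 50 + [1.0] * 10 + [0.1] * 40
onsets = pick_onsets(sta_lta(trace, n_sta=5, n_lta=30), threshold=3.0)
```

    Such coarse picks are what a later refinement stage (for example the kurtosis-based step named in the abstract) would then sharpen.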

  19. A DNA Structure-Based Bionic Wavelet Transform and Its Application to DNA Sequence Analysis

    Directory of Open Access Journals (Sweden)

    Fei Chen

    2003-01-01

    DNA sequence analysis is of great significance for increasing our understanding of genomic functions. An important task facing us is the exploration of hidden structural information stored in the DNA sequence. This paper introduces a DNA structure-based adaptive wavelet transform (WT), the bionic wavelet transform (BWT), for DNA sequence analysis. The symbolic DNA sequence can be separated into four channels of indicator sequences. An adaptive symbol-to-number mapping, determined from the structural feature of the DNA sequence, was introduced into WT. It can adjust the weight value of each channel to maximise the useful energy distribution of the whole BWT output. The performance of the proposed BWT was examined by analysing synthetic and real DNA sequences. Results show that BWT performs better than traditional WT in presenting greater energy distribution. This new BWT method should be useful for the detection of the latent structural features in future DNA sequence analysis.
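
    The separation of a symbolic DNA sequence into four indicator channels, and their recombination through a symbol-to-number mapping, can be sketched as below. Only the standard binary indicator construction is shown; the weights in the usage lines are arbitrary placeholders, not the adaptive mapping of the paper, and the function names are hypothetical.

```python
def indicator_sequences(dna):
    """Split a DNA string into four binary channels, one per nucleotide."""
    dna = dna.upper()
    return {base: [1 if symbol == base else 0 for symbol in dna] for base in "ACGT"}


def weighted_signal(dna, weights):
    """Combine the four channels into one numeric signal.

    `weights` maps each nucleotide to a number; in an adaptive scheme these
    values would be tuned from the sequence, here they are fixed by the caller.
    """
    channels = indicator_sequences(dna)
    return [sum(weights[b] * channels[b][i] for b in "ACGT")
            for i in range(len(dna))]


channels = indicator_sequences("ACGTA")
signal = weighted_signal("ACGT", {"A": 1, "C": 2, "G": 3, "T": 4})
```

    The resulting numeric signal is what a wavelet (or Fourier) transform would then be applied to.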

  20. Inter- and intra-strain variability of tandem repeats in Mycoplasma pneumoniae based on next-generation sequencing data.

    Science.gov (United States)

    Zhang, Jing; Song, Xiaohong; Ma, Marella J; Xiao, Li; Kenri, Tsuyoshi; Sun, Hongmei; Ptacek, Travis; Li, Shaoli; Waites, Ken B; Atkinson, T Prescott; Shibayama, Keigo; Dybvig, Kevin; Feng, Yanmei

    2017-02-01

    To characterize inter- and intra-strain variability of variable-number tandem repeats (VNTRs) in Mycoplasma pneumoniae to determine the optimal multilocus VNTR analysis scheme for improved strain typing. Whole genome assemblies and next-generation sequencing data from diverse M. pneumoniae isolates were used to characterize VNTRs and their variability, and to compare the strain discriminability of new VNTR and existing markers. We identified 13 VNTRs including five reported previously. These VNTRs displayed different levels of inter- and intra-strain copy number variations. All new markers showed similar or higher discriminability compared with existing VNTR markers and the P1 typing system. Our study provides novel insights into VNTR variations and potential new multilocus VNTR analysis schemes for improved genotyping of M. pneumoniae.

  1. A genome-wide analysis of lentivector integration sites using targeted sequence capture and next generation sequencing technology.

    Science.gov (United States)

    Ustek, Duran; Sirma, Sema; Gumus, Ergun; Arikan, Muzaffer; Cakiris, Aris; Abaci, Neslihan; Mathew, Jaicy; Emrence, Zeliha; Azakli, Hulya; Cosan, Fulya; Cakar, Atilla; Parlak, Mahmut; Kursun, Olcay

    2012-10-01

    One application of next-generation sequencing (NGS) is the targeted resequencing of genes of interest, which has not been used in viral integration site analysis of gene therapy applications. Here, we combined a targeted sequence capture array and next generation sequencing to address the whole genome profiling of viral integration sites. Human 293T and K562 cells were transduced with a HIV-1 derived vector. A custom-made DNA probe set targeting the pLVTHM vector was used to capture lentiviral vector/human genome junctions. The captured DNA was sequenced using the GS FLX platform. Seven thousand four hundred and eighty four human genome sequences flanking the long terminal repeats (LTR) of pLVTHM fragment sequences matched with an identity of at least 98% and minimum 50 bp criteria in both cells. In total, 203 unique integration sites were identified. The integrations in both cell lines were totally distant from the CpG islands and from the transcription start sites and preferentially located in introns. A comparison between the two cell lines showed that the lentiviral-transduced DNA does not have the same preferred regions in the two different cell lines. Copyright © 2012 Elsevier B.V. All rights reserved.

  2. CSReport: A New Computational Tool Designed for Automatic Analysis of Class Switch Recombination Junctions Sequenced by High-Throughput Sequencing.

    Science.gov (United States)

    Boyer, François; Boutouil, Hend; Dalloul, Iman; Dalloul, Zeinab; Cook-Moreau, Jeanne; Aldigier, Jean-Claude; Carrion, Claire; Herve, Bastien; Scaon, Erwan; Cogné, Michel; Péron, Sophie

    2017-05-15

    B cells ensure humoral immune responses due to the production of Ag-specific memory B cells and Ab-secreting plasma cells. In secondary lymphoid organs, Ag-driven B cell activation induces terminal maturation and Ig isotype class switch (class switch recombination [CSR]). CSR creates a virtually unique IgH locus in every B cell clone by intrachromosomal recombination between two switch (S) regions upstream of each C region gene. Amount and structural features of CSR junctions reveal valuable information about the CSR mechanism, and analysis of CSR junctions is useful in basic and clinical research studies of B cell functions. To provide an automated tool able to analyze large data sets of CSR junction sequences produced by high-throughput sequencing (HTS), we designed CSReport, a software program dedicated to support analysis of CSR recombination junctions sequenced with a HTS-based protocol (Ion Torrent technology). CSReport was assessed using simulated data sets of CSR junctions and then used for analysis of Sμ-Sα and Sμ-Sγ1 junctions from CH12F3 cells and primary murine B cells, respectively. CSReport identifies junction segment breakpoints on reference sequences and junction structure (blunt-ended junctions or junctions with insertions or microhomology). Besides the ability to analyze unprecedentedly large libraries of junction sequences, CSReport will provide a unified framework for CSR junction studies. Our results show that CSReport is an accurate tool for analysis of sequences from our HTS-based protocol for CSR junctions, thereby facilitating and accelerating their study. Copyright © 2017 by The American Association of Immunologists, Inc.

  3. Sequencing and analysis of an Irish human genome.

    LENUS (Irish Health Repository)

    Tong, Pin

    2010-01-01

    Recent studies generating complete human sequences from Asian, African and European subgroups have revealed population-specific variation and disease susceptibility loci. Here, choosing a DNA sample from a population of interest due to its relative geographical isolation and genetic impact on further populations, we extend the above studies through the generation of 11-fold coverage of the first Irish human genome sequence.

  4. Exome Sequence Analysis of 14 Families With High Myopia

    DEFF Research Database (Denmark)

    Kloss, Bethany A.; Tompson, Stuart W.; Whisenhunt, Kristina N.

    2017-01-01

    Purpose: To identify causal gene mutations in 14 families with autosomal dominant (AD) high myopia using exome sequencing. Methods: Select individuals from 14 large Caucasian families with high myopia were exome sequenced. Gene variants were filtered to identify potential pathogenic changes. Sang...

  5. Database-driven primary analysis of raw sequencing data

    DEFF Research Database (Denmark)

    2014-01-01

    The present invention relates to methods for identifying the source of a biological sequence containing sample from raw sequencing reads. The method may be used to identify the source of unknown DNA and can be used for diagnostic, biodefense, food safety and quality, and hygiene applications...

  6. Accelerating next generation sequencing data analysis with system level optimizations.

    Science.gov (United States)

    Kathiresan, Nagarajan; Temanni, Ramzi; Almabrazi, Hakeem; Syed, Najeeb; Jithesh, Puthen V; Al-Ali, Rashid

    2017-08-22

    Next generation sequencing (NGS) data analysis is highly compute intensive. In-memory computing, vectorization, bulk data transfer, and CPU frequency scaling are some of the hardware features of modern computing architectures. To get the best execution time and utilize these hardware features, it is necessary to tune the system level parameters before running the application. We studied the GATK HaplotypeCaller, which is part of common NGS workflows and consumes more than 43% of the total execution time. Multiple GATK 3.x versions were benchmarked and the execution time of HaplotypeCaller was optimized by tuning various system level parameters: (i) tuning the parallel garbage collection and kernel shared memory to simulate in-memory computing; (ii) architecture-specific tuning in the PairHMM library for vectorization; (iii) including Java 1.8 features through GATK source code compilation and building a runtime environment for parallel sorting and bulk data transfer; and (iv) replacing the default 'on-demand' CPU frequency mode with 'performance' mode to accelerate the Java multi-threads. As a result, the HaplotypeCaller execution time was reduced by 82.66% in GATK 3.3 and 42.61% in GATK 3.7. Overall, the execution time of the NGS pipeline was reduced to 70.60% and 34.14% for GATK 3.3 and GATK 3.7, respectively.
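
    The percentage reductions reported for HaplotypeCaller can be restated as speedup factors. The sketch below assumes that "reduced by p%" means the new run time is (1 - p/100) of the old one; the function name is illustrative and not part of the study.

```python
def speedup_from_reduction(percent_reduction: float) -> float:
    """Convert a percentage reduction in run time into a speedup factor."""
    if not 0 <= percent_reduction < 100:
        raise ValueError("percent_reduction must be in [0, 100)")
    # Fraction of the original run time that remains after tuning.
    remaining_fraction = 1.0 - percent_reduction / 100.0
    return 1.0 / remaining_fraction


# Figures quoted in the abstract for the HaplotypeCaller step.
for version, reduction in (("GATK 3.3", 82.66), ("GATK 3.7", 42.61)):
    print(f"{version}: {speedup_from_reduction(reduction):.2f}x faster")
```

    Under that assumption, an 82.66% reduction corresponds to roughly a 5.8-fold speedup and a 42.61% reduction to roughly 1.7-fold.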

  7. The sequence and analysis of a Chinese pig genome

    Directory of Open Access Journals (Sweden)

    Fang Xiaodong

    2012-11-01

    Background: The pig is an economically important food source, amounting to approximately 40% of all meat consumed worldwide. Pigs also serve as an important model organism because of their similarity to humans at the anatomical, physiological and genetic level, making them very useful for studying a variety of human diseases. A pig strain of particular interest is the miniature pig, specifically the Wuzhishan pig (WZSP), as it has been extensively inbred. Its high level of homozygosity offers increased ease for selective breeding for specific traits and a more straightforward understanding of the genetic changes that underlie its biological characteristics. WZSP also serves as a promising means for applications in surgery, tissue engineering, and xenotransplantation. Here, we report the sequencing and analysis of an inbreeding WZSP genome. Results: Our results reveal some unique genomic features, including a relatively high level of homozygosity in the diploid genome, an unusual distribution of heterozygosity, an over-representation of tRNA-derived transposable elements, a small amount of porcine endogenous retrovirus, and a lack of type C retroviruses. In addition, we carried out systematic research on gene evolution, together with a detailed investigation of the counterparts of human drug target genes. Conclusion: Our results provide the opportunity to more clearly define the genomic character of pig, which could enhance our ability to create more useful pig models.

  8. Analysis of expressed sequence tags from the Ulva prolifera (Chlorophyta)

    Science.gov (United States)

    Niu, Jianfeng; Hu, Haiyan; Hu, Songnian; Wang, Guangce; Peng, Guang; Sun, Song

    2010-01-01

    In 2008, a green tide broke out before the sailing competition of the 29th Olympic Games in Qingdao. The causative species was determined to be Enteromorpha prolifera (Ulva prolifera O. F. Müller), a familiar green macroalga along the coastline of China. Rapid accumulation of a large biomass of floating U. prolifera prompted research on different aspects of this species. In this study, we constructed a nonnormalized cDNA library from the thalli of U. prolifera and acquired 10 072 high-quality expressed sequence tags (ESTs). These ESTs were assembled into 3 519 nonredundant gene groups, including 1 446 clusters and 2 073 singletons. After annotation with the nr database, a large number of genes were found to be related to chloroplast and ribosomal proteins. GO functional classification showed that 1 418 ESTs participated in photosynthesis and 1 359 ESTs were responsible for the generation of precursor metabolites and energy. In addition, rather comprehensive carbon fixation pathways were found in U. prolifera using KEGG. Some stress-related and signal transduction-related genes were also found in this study. Together, the evidence indicates that U. prolifera has the material and energy basis for intense photosynthesis and rapid proliferation. Phylogenetic analysis of cytochrome c oxidase subunit I revealed that this green-tide causative species is most closely affiliated to Pseudendoclonium akinetum (Ulvophyceae).

  9. Event Sequence Analysis of the Air Intelligence Agency Information Operations Center Flight Operations

    National Research Council Canada - National Science Library

    Larsen, Glen

    1998-01-01

    This report applies Event Sequence Analysis, methodology adapted from aircraft mishap investigation, to an investigation of the performance of the Air Intelligence Agency's Information Operations Center (IOC...

  10. Analysis of the Macaca mulatta transcriptome and the sequence divergence between Macaca and human.

    Science.gov (United States)

    Magness, Charles L; Fellin, P Campion; Thomas, Matthew J; Korth, Marcus J; Agy, Michael B; Proll, Sean C; Fitzgibbon, Matthew; Scherer, Christina A; Miner, Douglas G; Katze, Michael G; Iadonato, Shawn P

    2005-01-01

    We report the initial sequencing and comparative analysis of the Macaca mulatta transcriptome. Cloned sequences from 11 tissues, nine animals, and three species (M. mulatta, M. fascicularis, and M. nemestrina) were sampled, resulting in the generation of 48,642 sequence reads. These data represent an initial sampling of the putative rhesus orthologs for 6,216 human genes. Mean nucleotide diversity within M. mulatta and sequence divergence among M. fascicularis, M. nemestrina, and M. mulatta are also reported.

  11. Sequence analysis of mitochondrial 16S ribosomal RNA gene ...

    Indian Academy of Sciences (India)

    Unknown

    For the understanding of their vectorial capacity, identification of disease carrying and refractory strains is essential. ... been widely used for phylogenetic studies and sequence differences in ... In order to fill up the internal gap, a new set.

  12. simple sequence repeat (SSR) markers in genetic analysis of

    African Journals Online (AJOL)

    Yomi

    2012-08-28

    1998). Cross- species amplification of soybean (Glycine max) simple sequence repeats (SSRs) within the genus and other legume genera: implications for the transferability of SSRs in plants. Mol. Biol. Evol. 15:1275-1287.

  13. Sequence and expression analysis of gaps in human chromosome 20

    DEFF Research Database (Denmark)

    Minocherhomji, Sheroy; Seemann, Stefan; Mang, Yuan

    2012-01-01

    The finished human genome-assemblies comprise several hundred un-sequenced euchromatic gaps, which may be rich in long polypurine/polypyrimidine stretches. Human chromosome 20 (chr 20) currently has three unfinished gaps remaining on its q-arm. All three gaps are within gene-dense regions and/or overlap disease-associated loci, including the DLGAP4 locus. In this study, we sequenced ~99% of all three unfinished gaps on human chr 20, determined their complete genomic sizes and assessed epigenetic profiles using a combination of Sanger sequencing, mate pair paired-end high-throughput sequencing and chromatin, methylation and expression analyses. We found histone 3 trimethylated at Lysine 27 to be distributed across all three gaps in immortalized B-lymphocytes. In one gap, five novel CpG islands were predominantly hypermethylated in genomic DNA from peripheral blood lymphocytes and human cerebellum...

  14. DELIMINATE--a fast and efficient method for loss-less compression of genomic sequences: sequence analysis.

    Science.gov (United States)

    Mohammed, Monzoorul Haque; Dutta, Anirban; Bose, Tungadri; Chadaram, Sudha; Mande, Sharmila S

    2012-10-01

    An unprecedented quantity of genome sequence data is currently being generated using next-generation sequencing platforms. This has necessitated the development of novel bioinformatics approaches and algorithms that not only facilitate a meaningful analysis of these data but also aid in efficient compression, storage, retrieval and transmission of huge volumes of the generated data. We present a novel compression algorithm (DELIMINATE) that can rapidly compress genomic sequence data in a loss-less fashion. Validation results indicate relatively higher compression efficiency of DELIMINATE when compared with popular general purpose compression algorithms, namely, gzip, bzip2 and lzma. Linux, Windows and Mac implementations (both 32 and 64-bit) of DELIMINATE are freely available for download at: http://metagenomics.atc.tcs.com/compression/DELIMINATE. Contact: sharmila@atc.tcs.com. Supplementary data are available at Bioinformatics online.
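
    The general-purpose baselines named in the abstract (gzip, bzip2 and lzma) are all available in the Python standard library, so a baseline comparison of the kind described can be sketched as follows. This does not implement DELIMINATE itself; the synthetic random sequence is a hypothetical stand-in for real genomic data, which usually compresses differently.

```python
import bz2
import gzip
import lzma
import random


def compression_ratios(data: bytes) -> dict[str, float]:
    """Return original_size / compressed_size for three stdlib codecs."""
    compressors = {
        "gzip": gzip.compress,
        "bzip2": bz2.compress,
        "lzma": lzma.compress,
    }
    return {name: len(data) / len(fn(data)) for name, fn in compressors.items()}


# Synthetic, seeded sequence (assumption: stands in for a real FASTA payload).
rng = random.Random(0)
sequence = "".join(rng.choice("ACGT") for _ in range(200_000)).encode("ascii")

ratios = compression_ratios(sequence)
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x")
```

    Because a uniformly random four-letter sequence carries about 2 bits per base, none of these codecs can do much better than a 4-fold reduction on this input; a specialised genomic compressor would be judged against the same kind of ratio on real data.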

  15. Analysis of 16S rRNA amplicon sequencing options on the Roche/454 next-generation titanium sequencing platform.

    Directory of Open Access Journals (Sweden)

    Hideyuki Tamaki

    BACKGROUND: 16S rRNA gene pyrosequencing approach has revolutionized studies in microbial ecology. While primer selection and short read length can affect the resulting microbial community profile, little is known about the influence of pyrosequencing methods on the sequencing throughput and the outcome of microbial community analyses. The aim of this study is to compare differences in output, ease, and cost among three different amplicon pyrosequencing methods for the Roche/454 Titanium platform. METHODOLOGY/PRINCIPAL FINDINGS: The following three pyrosequencing methods for 16S rRNA genes were selected in this study: Method-1 (standard method) is the recommended method for bi-directional sequencing using the LIB-A kit; Method-2 is a new option designed in this study for unidirectional sequencing with the LIB-A kit; and Method-3 uses the LIB-L kit for unidirectional sequencing. In our comparison among these three methods using 10 different environmental samples, Method-2 and Method-3 produced 1.5-1.6 times more useable reads than the standard method (Method-1), after quality-based trimming, and did not compromise the outcome of microbial community analyses. Specifically, Method-3 is the most cost-effective unidirectional amplicon sequencing method as it provided the most reads and required the least effort in consumables management. CONCLUSIONS: Our findings clearly demonstrated that alternative pyrosequencing methods for 16S rRNA genes could drastically affect sequencing output (e.g. number of reads before and after trimming) but have little effect on the outcomes of microbial community analysis. This finding is important for both researchers and sequencing facilities utilizing 16S rRNA gene pyrosequencing for microbial ecological studies.

  16. Compilation and analysis of Escherichia coli promoter DNA sequences.

    OpenAIRE

    Hawley, D K; McClure, W R

    1983-01-01

    The DNA sequence of 168 promoter regions (-50 to +10) for Escherichia coli RNA polymerase were compiled. The complete listing was divided into two groups depending upon whether or not the promoter had been defined by genetic (promoter mutations) or biochemical (5' end determination) criteria. A consensus promoter sequence based on homologies among 112 well-defined promoters was determined that was in substantial agreement with previous compilations. In addition, we have tabulated 98 promoter ...
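
    A consensus sequence of the kind mentioned above can be derived by taking the most frequent base in each column of a set of aligned sequences. The sketch below is a simple majority-vote version, not necessarily the procedure used in the compilation, and the toy hexamers are hypothetical rather than taken from it.

```python
from collections import Counter


def consensus(aligned: list[str]) -> str:
    """Most frequent base in each column of equal-length aligned sequences."""
    if not aligned:
        raise ValueError("need at least one sequence")
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")
    result = []
    for column in zip(*aligned):
        counts = Counter(column)
        # Ties go to the base seen first in the column.
        result.append(counts.most_common(1)[0][0])
    return "".join(result)


# Hypothetical toy hexamers, for illustration only.
toy = ["TATAAT", "TATGAT", "TACAAT", "GATAAT"]
print(consensus(toy))  # TATAAT
```

    A majority vote discards how strongly each position is conserved; a per-column frequency table would retain that information.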

  17. Sequence quality analysis tool for HIV type 1 protease and reverse transcriptase.

    Science.gov (United States)

    Delong, Allison K; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W; Kantor, Rami

    2012-08-01

    Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802 PR and 44,432 RT sequences) from the published literature (http://hivdb.Stanford.edu). Nucleic acid sequences are read into SQUAT, identified, aligned, and translated. Nucleic acid sequences are flagged if they have >five 1-2-base insertions; >one 3-base insertion; >one deletion; >six PR or >18 RT ambiguous bases; >three consecutive PR or >four RT nucleic acid mutations; >zero stop codons; >three PR or >six RT ambiguous amino acids; >three consecutive PR or >four RT amino acid mutations; >zero unique amino acids; or 15% genetic distance from another submitted sequence. Thresholds are user modifiable. SQUAT output includes a summary report with detailed comments for troubleshooting of flagged sequences, histograms of pairwise genetic distances, neighbor joining phylogenetic trees, and aligned nucleic and amino acid sequences. SQUAT is a stand-alone, free, web-independent tool to ensure use of high-quality HIV PR/RT sequences in interpretation and reporting of drug resistance, while increasing awareness and expertise and facilitating troubleshooting of potentially problematic sequences.
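
    Two of the thresholds listed above (ambiguous bases and stop codons) are easy to illustrate. The sketch below is written in Python rather than R and is not SQUAT's code; the function name, the frame-1 assumption and the example sequences are hypothetical, while the default limits mirror the PR values quoted in the abstract.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
UNAMBIGUOUS = set("ACGT")


def quality_flags(seq: str, max_ambiguous: int = 6, max_stops: int = 0) -> list[str]:
    """Return the reasons a nucleotide sequence would be flagged (empty if none)."""
    seq = seq.upper()
    flags = []
    # Anything other than A/C/G/T (and alignment gaps) counts as ambiguous.
    ambiguous = sum(1 for base in seq if base not in UNAMBIGUOUS and base != "-")
    if ambiguous > max_ambiguous:
        flags.append(f"{ambiguous} ambiguous bases (limit {max_ambiguous})")
    # Stop codons are counted in reading frame 1 only (assumes in-frame input).
    codons = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    stops = sum(1 for codon in codons if codon in STOP_CODONS)
    if stops > max_stops:
        flags.append(f"{stops} stop codons (limit {max_stops})")
    return flags


print(quality_flags("CCTCAGATCACTCTTTGG"))   # clean toy sequence
print(quality_flags("CCTTAGATCRYNNNNNNN"))   # toy sequence with problems
```

    Passing `max_ambiguous=18` would apply the RT limit instead; keeping the limits as parameters reflects the statement that thresholds are user modifiable.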

  18. First fungal genome sequence from Africa: A preliminary analysis

    Directory of Open Access Journals (Sweden)

    Rene Sutherland

    2012-01-01

    Some of the most significant breakthroughs in the biological sciences this century will emerge from the development of next generation sequencing technologies. The ease of availability of DNA sequence made possible through these new technologies has given researchers opportunities to study organisms in a manner that was not possible with Sanger sequencing. Scientists will, therefore, need to embrace genomics, as well as develop and nurture the human capacity to sequence genomes and utilise the 'tsunami' of data that emerge from genome sequencing. In response to these challenges, we sequenced the genome of Fusarium circinatum, a fungal pathogen of pine that causes pitch canker, a disease of great concern to the South African forestry industry. The sequencing work was conducted in South Africa, making F. circinatum the first eukaryotic organism for which the complete genome has been sequenced locally. Here we report on the process that was followed to sequence, assemble and perform a preliminary characterisation of the genome. Furthermore, details of the computer annotation and manual curation of this genome are presented. The F. circinatum genome was found to be nearly 44 million bases in size, which is similar to that of four other Fusarium genomes that have been sequenced elsewhere. The genome contains just over 15 000 open reading frames, which is less than that of the related species, Fusarium oxysporum, but more than that for Fusarium verticillioides. Amongst the various putative gene clusters identified in F. circinatum, those encoding the secondary metabolites fumosin and fusarin appeared to harbour evidence of gene translocation. It is anticipated that similar comparisons of other loci will provide insights into the genetic basis for pathogenicity of the pitch canker pathogen. Perhaps more importantly, this project has engaged a relatively large group of scientists

  19. REFGEN and TREENAMER: Automated Sequence Data Handling for Phylogenetic Analysis in the Genomic Era

    Science.gov (United States)

    Leonard, Guy; Stevens, Jamie R.; Richards, Thomas A.

    2009-01-01

    The phylogenetic analysis of nucleotide sequences and increasingly that of amino acid sequences is used to address a number of biological questions. Access to extensive datasets, including numerous genome projects, means that standard phylogenetic analyses can include many hundreds of sequences. Unfortunately, most phylogenetic analysis programs do not tolerate the sequence naming conventions of genome databases. Managing large numbers of sequences and standardizing sequence labels for use in phylogenetic analysis programs can be a time consuming and laborious task. Here we report the availability of an online resource for the management of gene sequences recovered from public access genome databases such as GenBank. These web utilities include the facility for renaming every sequence in a FASTA alignment file, with each sequence label derived from a user-defined combination of the species name and/or database accession number. This facility enables the user to keep track of the branching order of the sequences/taxa during multiple tree calculations and re-optimisations. Post phylogenetic analysis, these webpages can then be used to rename every label in the subsequent tree files (with a user-defined combination of species name and/or database accession number). Together these programs drastically reduce the time required for managing sequence alignments and labelling phylogenetic figures. Additional features of our platform include the automatic removal of identical accession numbers (recorded in the report file) and generation of species and accession number lists for use in supplementary materials or figure legends. PMID:19812722
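
    The renaming step described above can be sketched as a small FASTA header rewriter. This is not the REFGEN code; it assumes headers of the form `>ACCESSION description [Genus species]`, and the function name and the example accession are hypothetical.

```python
import re


def rename_fasta(text: str) -> tuple[str, dict[str, str]]:
    """Rename matching FASTA headers to Genus_species_ACCESSION.

    Headers that do not match the assumed pattern are left unchanged.
    Returns the rewritten text and a map from new label to old header.
    """
    pattern = re.compile(r"^>(\S+).*\[(\w+) (\w+)[^\]]*\]\s*$")
    mapping = {}
    out = []
    for line in text.splitlines():
        match = pattern.match(line)
        if match:
            accession, genus, species = match.groups()
            new_label = f"{genus}_{species}_{accession}"
            mapping[new_label] = line[1:]  # keep the original header for later
            line = ">" + new_label
        out.append(line)
    return "\n".join(out) + "\n", mapping


example = ">XP_000001.1 hypothetical protein [Homo sapiens]\nMKV\n"
renamed, mapping = rename_fasta(example)
print(renamed, end="")
```

    Keeping the label-to-header map is what would allow tree files to be relabelled later, in the spirit of the TREENAMER step.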

  20. REFGEN and TREENAMER: Automated Sequence Data Handling for Phylogenetic Analysis in the Genomic Era

    Directory of Open Access Journals (Sweden)

    Guy Leonard

    2009-01-01

    The phylogenetic analysis of nucleotide sequences and increasingly that of amino acid sequences is used to address a number of biological questions. Access to extensive datasets, including numerous genome projects, means that standard phylogenetic analyses can include many hundreds of sequences. Unfortunately, most phylogenetic analysis programs do not tolerate the sequence naming conventions of genome databases. Managing large numbers of sequences and standardizing sequence labels for use in phylogenetic analysis programs can be a time consuming and laborious task. Here we report the availability of an online resource for the management of gene sequences recovered from public access genome databases such as GenBank. These web utilities include the facility for renaming every sequence in a FASTA alignment file, with each sequence label derived from a user-defined combination of the species name and/or database accession number. This facility enables the user to keep track of the branching order of the sequences/taxa during multiple tree calculations and re-optimisations. Post phylogenetic analysis, these webpages can then be used to rename every label in the subsequent tree files (with a user-defined combination of species name and/or database accession number). Together these programs drastically reduce the time required for managing sequence alignments and labelling phylogenetic figures. Additional features of our platform include the automatic removal of identical accession numbers (recorded in the report file) and generation of species and accession number lists for use in supplementary materials or figure legends.

  1. Sequencing and analysis of the Mediterranean amphioxus (Branchiostoma lanceolatum) transcriptome.

    Directory of Open Access Journals (Sweden)

    Silvan Oulion

    BACKGROUND: The basally divergent phylogenetic position of amphioxus (Cephalochordata), as well as its conserved morphology, development and genetics, make it the best proxy for the chordate ancestor. Particularly, studies using the amphioxus model help our understanding of vertebrate evolution and development. Thus, interest for the amphioxus model led to the characterization of both the transcriptome and complete genome sequence of the American species, Branchiostoma floridae. However, recent technical improvements allowing induction of spawning in the laboratory during the breeding season on a daily basis with the Mediterranean species Branchiostoma lanceolatum have encouraged European Evo-Devo researchers to adopt this species as a model even though no genomic or transcriptomic data have been available. To fill this need we used the pyrosequencing method to characterize the B. lanceolatum transcriptome and then compared our results with the published transcriptome of B. floridae. RESULTS: Starting with total RNA from nine different developmental stages of B. lanceolatum, a normalized cDNA library was constructed and sequenced on Roche GS FLX (Titanium mode). Around 1.4 million reads were produced and assembled into 70,530 contigs (average length of 490 bp). Overall 37% of the assembled sequences were annotated by BlastX and their Gene Ontology terms were determined. These results were then compared to genomic and transcriptomic data of B. floridae to assess similarities and specificities of each species. CONCLUSION: We obtained a high-quality amphioxus (B. lanceolatum) reference transcriptome using a high throughput sequencing approach. We found that 83% of the predicted genes in the B. floridae complete genome sequence are also found in the B. lanceolatum transcriptome, while only 41% were found in the B. floridae transcriptome obtained with traditional Sanger based sequencing. Therefore, given the high degree of sequence conservation

  2. Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

    Directory of Open Access Journals (Sweden)

    Gao Zhihong

    2010-07-01

    Background: Expressed Sequence Tag (EST) has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results: In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST) sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons, which have been deposited in NCBI (accession IDs: GW868575 - GW873047), among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs) in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65%) and low in the peach (46%), and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions: We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species.
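
    Putative SSRs such as those counted above are commonly located by scanning sequences for perfect tandem repeats. The sketch below uses a regular expression with a back-reference; the abstract does not state the search criteria, so the motif lengths (2 to 6 bases) and the minimum of five repeats are assumptions, as is the toy sequence.

```python
import re


def find_ssrs(seq: str, min_repeats: int = 5, max_motif: int = 6) -> list[tuple[int, str, int]]:
    """Return (start, motif, repeat_count) for perfect tandem repeats.

    Motifs are 2..max_motif bases long; runs of a single base are skipped.
    """
    # Lazy motif group so the shortest repeating unit is tried first.
    pattern = re.compile(r"([ACGT]{2,%d}?)\1{%d,}" % (max_motif, min_repeats - 1))
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        if len(set(motif)) == 1:
            continue  # homopolymer run such as AAAAAAAAAA
        hits.append((match.start(), motif, len(match.group(0)) // len(motif)))
    return hits


# Hypothetical toy sequence containing an (AG)6 and a (CTT)5 repeat.
toy = "GGC" + "AG" * 6 + "TGCA" + "CTT" * 5 + "A"
print(find_ssrs(toy))  # [(3, 'AG', 6), (19, 'CTT', 5)]
```

    Primer design around each hit would then use the flanking sequence on either side of the reported start position and repeat length.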

  3. Multilocus dataset reveals demographic histories of two peat mosses in Europe

    Directory of Open Access Journals (Sweden)

    Hock Zsófia

    2007-08-01

    Background: Revealing the past and present demographic history of populations is of high importance to evaluate the conservation status of species. Demographic data can be obtained by direct monitoring or by analysing data of historical and recent collections. Although these methods provide the most detailed information they are very time consuming. Another alternative way is to make use of the information accumulated in the species' DNA over its history. Recent development of the coalescent theory makes it possible to reconstruct the demographic history of species using nucleotide polymorphism data. To separate the effect of natural selection and demography, multilocus analysis is needed because these two forces can produce similar patterns of polymorphisms. In this study we investigated the amount and pattern of sequence variability of a Europe wide sample set of two peat moss species (Sphagnum fimbriatum and S. squarrosum) with similar distributions and mating systems but presumably contrasting historical demographies using 3 regions of the nuclear genome (appr. 3000 bps). We aimed to draw inferences concerning demographic and phylogeographic histories of the species. Results: All three nuclear regions supported the presence of an Atlantic and Non-Atlantic clade of S. fimbriatum suggesting glacial survival of the species along the Atlantic coast of Europe. Contrarily, S. squarrosum haplotypes showed three clades but no geographic structure at all. Maximum likelihood, mismatch and Bayesian analyses supported a severe historical bottleneck and a relatively recent demographic expansion of the Non-Atlantic clade of S. fimbriatum, whereas size of S. squarrosum populations has probably decreased in the past. Species wide molecular diversity of the two species was nearly the same with an excess of replacement mutations in S. fimbriatum. Similar levels of molecular diversity, contrasting phylogeographic patterns and excess of replacement

  4. Taxonomic evaluation of the genus Enterobacter based on multilocus sequence analysis (MLSA): proposal to reclassify E. nimipressuralis and E. amnigenus into Lelliottia gen. nov. as Lelliottia nimipressuralis comb. nov. and Lelliottia amnigena comb. nov., respectively, E. gergoviae and E. pyrinus into Pluralibacter gen. nov. as Pluralibacter gergoviae comb. nov. and Pluralibacter pyrinus comb. nov., respectively, E. cowanii, E. radicincitans, E. oryzae and E. arachidis into Kosakonia gen. nov. as Kosakonia cowanii comb. nov., Kosakonia radicincitans comb. nov., Kosakonia oryzae comb. nov. and Kosakonia arachidis comb. nov., respectively, and E. turicensis, E. helveticus and E. pulveris into Cronobacter as Cronobacter zurichensis nom. nov., Cronobacter helveticus comb. nov. and Cronobacter pulveris comb. nov., respectively, and emended description of the genera Enterobacter and Cronobacter.

    Science.gov (United States)

    Brady, Carrie; Cleenwerck, Ilse; Venter, Stephanus; Coutinho, Teresa; De Vos, Paul

    2013-07-01

    The taxonomy of Enterobacter has a complicated history, with several species transferred to and from this genus. Classification of strains is difficult owing to its polyphyletic nature, based on 16S rRNA gene sequences. It has been previously acknowledged that Enterobacter contains species which should be transferred to other genera. In an attempt to resolve the taxonomy of Enterobacter, MLSA based on partial sequencing of protein-encoding genes (gyrB, rpoB, infB and atpD) was performed on the type strains and reference strains of Enterobacter, Cronobacter and Serratia species, as well as members of the closely related genera Citrobacter, Klebsiella, Kluyvera, Leclercia, Mangrovibacter, Raoultella and Yokenella. Phylogenetic analyses of the concatenated nucleotide sequences revealed that Enterobacter can be divided into five strongly supported MLSA groups, suggesting that the species should be reclassified into five different genera. Further support for this was provided by a concatenated amino acid tree, phenotypic characteristics and fatty acid profiles, enabling differentiation of the MLSA groups. Three novel genera are proposed: Lelliottia gen. nov., Pluralibacter gen. nov. and Kosakonia gen. nov. and the following new combinations: Lelliottia nimipressuralis comb. nov., Lelliottia amnigena comb. nov., Pluralibacter gergoviae comb. nov., Pluralibacter pyrinus comb. nov., Kosakonia cowanii comb. nov., Kosakonia radicincitans comb. nov., Kosakonia oryzae comb. nov., Kosakonia arachidis comb. nov., Cronobacter helveticus comb. nov. and Cronobacter pulveris comb. nov. Additionally, the novel epithet Cronobacter zurichensis nom. nov. is proposed for the reclassification of Enterobacter turicensis into the genus Cronobacter, as Cronobacter turicensis (Iversen et al., 2008) is already in use. Copyright © 2013 Elsevier GmbH. All rights reserved.
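
    The concatenation step of an MLSA can be sketched as joining the per-locus alignments of each strain in a fixed order before computing distances or trees. The locus names below come from the abstract; the strain names, toy sequences and helper functions are hypothetical, and the uncorrected p-distance stands in for whatever model the authors used.

```python
def concatenate_loci(alignments: dict[str, dict[str, str]], loci: list[str]) -> dict[str, str]:
    """Join aligned sequences per strain in the given locus order.

    Strains missing from any locus are dropped.
    """
    shared = set.intersection(*(set(alignments[locus]) for locus in loci))
    return {
        strain: "".join(alignments[locus][strain] for locus in loci)
        for strain in sorted(shared)
    }


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


# Toy alignments keyed by locus, then by (hypothetical) strain name.
toy = {
    "gyrB": {"strain_A": "ATGC", "strain_B": "ATGA"},
    "rpoB": {"strain_A": "GGCC", "strain_B": "GGCC"},
    "infB": {"strain_A": "TTAA", "strain_B": "TTAT", "strain_C": "TTAA"},
    "atpD": {"strain_A": "CCGG", "strain_B": "CCGG"},
}
joined = concatenate_loci(toy, ["gyrB", "rpoB", "infB", "atpD"])
print(joined)
print(p_distance(joined["strain_A"], joined["strain_B"]))  # 0.125
```

    Dropping strains that lack a locus keeps every concatenated sequence the same length, which is what a downstream distance or tree calculation would require.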

  5. Accident Sequence Precursor Analysis for SGTR by Using Dynamic PSA Approach

    International Nuclear Information System (INIS)

    Lee, Han Sul; Heo, Gyun Young; Kim, Tae Wan

    2016-01-01

    In order to address this issue, this study suggests the sequence tree model to analyze accident sequence systematically. Using the sequence tree model, all possible scenarios which need a specific safety action to prevent the core damage can be identified and success conditions of safety action under complicated situation such as combined accident will be also identified. Sequence tree is branch model to divide plant condition considering the plant dynamics. Since sequence tree model can reflect the plant dynamics, arising from interaction of different accident timing and plant condition and from the interaction between the operator action, mitigation system, and the indicators for operation, sequence tree model can be used to develop the dynamic event tree model easily. Target safety action for this study is a feed-and-bleed (F and B) operation. A F and B operation directly cools down the reactor cooling system (RCS) using the primary cooling system when residual heat removal by the secondary cooling system is not available. In this study, a TLOFW accident and a TLOFW accident with LOCA were the target accidents. Based on the conventional PSA model and indicators, the sequence tree model for a TLOFW accident was developed. Based on the results of a sampling analysis and data from the conventional PSA model, the CDF caused by Sequence no. 26 can be realistically estimated. For a TLOFW accident with LOCA, second accident timings were categorized according to plant condition. Indicators were selected as branch point using the flow chart and tables, and a corresponding sequence tree model was developed. If sampling analysis is performed, practical accident sequences can be identified based on the sequence analysis. If a realistic distribution for the variables can be obtained for sampling analysis, much more realistic accident sequences can be described. Moreover, if the initiating event frequency under a combined accident can be quantified, the sequence tree model

  6. Sequencing and phylogenetic analysis of Herpes simplex virus type ...

    African Journals Online (AJOL)

    To determine the genetic relationship of the HSV-2 glycoprotein G gene (gG) in Iran with those in other countries, a DNA fragment of 1100 bp corresponding to gG was isolated from six HSV-2 strains from infected human sera samples in Iran, amplified by PCR, and sequenced for determining ...

  7. Transcriptome analysis of blueberry using 454 EST sequencing

    Science.gov (United States)

    Blueberry (Vaccinium corymbosum) is a major berry crop in the United States, and one that has great nutritional and economical value. Next generation sequencing methodologies, such as 454, have been demonstrated to be successful and efficient in producing a snap-shot of transcriptional activities du...

  8. Characterization and sequence analysis of cysteine and glycine-rich ...

    African Journals Online (AJOL)

    Tarek

    2011-04-18

    Apr 18, 2011 ... nucleotide alignment of both native buffalo and cattle CSRP3 cDNAs sequences ..... Exon III, Identities = 71/75 (94%), Gaps = 1/75 (1%) Strand=Plus/Plus ... Band MR, Larson JH, Rebeiz M, Green CA, Heyen DW, Donovan J,.

  9. Functional analysis of bipartite begomovirus coat protein promoter sequences

    International Nuclear Information System (INIS)

    Lacatus, Gabriela; Sunter, Garry

    2008-01-01

    We demonstrate that the AL2 gene of Cabbage leaf curl virus (CaLCuV) activates the CP promoter in mesophyll and acts to derepress the promoter in vascular tissue, similar to that observed for Tomato golden mosaic virus (TGMV). Binding studies indicate that sequences mediating repression and activation of the TGMV and CaLCuV CP promoter specifically bind different nuclear factors common to Nicotiana benthamiana, spinach and tomato. However, chromatin immunoprecipitation demonstrates that TGMV AL2 can interact with both sequences independently. Binding of nuclear protein(s) from different crop species to viral sequences conserved in both bipartite and monopartite begomoviruses, including TGMV, CaLCuV, Pepper golden mosaic virus and Tomato yellow leaf curl virus suggests that bipartite begomoviruses bind common host factors to regulate the CP promoter. This is consistent with a model in which AL2 interacts with different components of the cellular transcription machinery that bind viral sequences important for repression and activation of begomovirus CP promoters

  10. The DNA sequence, annotation and analysis of human chromosome 3

    DEFF Research Database (Denmark)

    Muzny, D.M.; Bolund, Lars; As part of the Chinese Human Genome Sequencing Consortium, E.T.A.L.

    2006-01-01

    as numerous loci involved in multiple human cancers such as the gene encoding FHIT, which contains the most common constitutive fragile site in the genome, FRA3B. Using genomic sequence from chimpanzee and rhesus macaque, we were able to characterize the breakpoints defining a large pericentric inversion...

  11. Sequence analysis of mitochondrial 16S ribosomal RNA gene

    Indian Academy of Sciences (India)

    Mosquitoes are vectors for the transmission of many human pathogens that include viruses, nematodes and protozoa. For the understanding of their vectorial capacity, identification of disease carrying and refractory strains is essential. Recently, molecular taxonomic techniques have been utilized for this purpose. Sequence ...

  12. Illumina-based de novo transcriptome sequencing and analysis

    Indian Academy of Sciences (India)

    In the present study, we used Illumina HiSeq technology to perform de novo assembly of heart and musk gland transcriptomes from the Chinese forest musk deer. A total of 239,383 transcripts and 176,450 unigenes were obtained, of which 37,329 unigenes were matched to known sequences in the NCBI nonredundant ...

  13. Generation and analysis of expressed sequence tags from Botrytis cinerea

    Directory of Open Access Journals (Sweden)

    EVELYN SILVA

    2006-01-01

    Botrytis cinerea is a filamentous plant pathogen of a wide range of plant species, and its infection may cause enormous damage both during plant growth and in the post-harvest phase. We have constructed a cDNA library from an isolate of B. cinerea and have sequenced 11,482 expressed sequence tags that were assembled into 1,003 contig sequences and 3,032 singletons. Approximately 81% of the unigenes showed significant similarity to genes coding for proteins with known functions: more than 50% of the sequences code for genes involved in cellular metabolism, 12% for transport of metabolites, and approximately 10% for cellular organization. Other functional categories include responses to biotic and abiotic stimuli, cell communication, cell homeostasis, and cell development. We carried out pair-wise comparisons with fungal databases to determine the B. cinerea unisequence set with relevant similarity to genes in other fungal pathogenic counterparts. Among the 4,035 non-redundant B. cinerea unigenes, 1,338 (23%) have significant homology with Fusarium verticillioides unigenes. Similar values were obtained for Saccharomyces cerevisiae and Aspergillus nidulans (22% and 24%, respectively). The lower percentages of homology were with Magnaporthe grisea and Neurospora crassa (13% and 19%, respectively). Several genes involved in putative and known fungal virulence and general pathogenicity were identified. The results provide important information for future research on this fungal pathogen
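
    The assembly counts quoted in this abstract can be cross-checked with simple arithmetic. The sketch below uses only the figures given above; the variable names are illustrative, and "redundancy" is defined here (as an assumption) as the share of ESTs that did not yield a new unigene.

```python
# Figures quoted in the abstract above; variable names are illustrative.
total_ests = 11_482
contigs = 1_003
singletons = 3_032

# Unigenes (non-redundant sequences) are the contigs plus the singletons.
unigenes = contigs + singletons

# Assumed definition: share of ESTs that did not add a new unigene.
redundancy = 1 - unigenes / total_ests

print(f"unigenes: {unigenes}")
print(f"redundancy: {redundancy:.1%}")
```

    The sum reproduces the 4,035 non-redundant unigenes reported in the abstract.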

  14. Whole-genome sequence-based analysis of thyroid function

    DEFF Research Database (Denmark)

    Taylor, Peter N.; Porcu, Eleonora; Chew, Shelby

    2015-01-01

    Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N = 2,287). Using additional whole-genome seque...

  15. DNA sequence and prokaryotic expression analysis of vitellogenin ...

    African Journals Online (AJOL)

    In this study, the DNA sequence of vitellogenin from Antheraea pernyi (Ap-Vg) was identified and its functional domain (30-740 aa, Ap-Vg-1) was expressed in Escherichia coli BL21 (DE3) cells. The recombinant Ap-Vg-1 proteins were purified and used for antibody preparation. The results showed that the intact DNA ...

  16. Molecular cloning, sequence analysis and structure prediction of the ...

    African Journals Online (AJOL)

    AJL

    2012-04-19

    Apr 19, 2012 ... The primers were based on the rBAT sequences of other animals deposited in GenBank. .... fragment; M1, 2000 bp DNA ladder; M2, 1000 bp DNA ladder. spliced to obtain the ..... A traffic signal for heterodimeric amino acid.

  17. A bibliometric analysis of global research on genome sequencing ...

    African Journals Online (AJOL)

    The results show that disease and protein related researches were the leading research focuses, and comparative genomics and evolution related research had strong potential in the near future. Key words: Genome sequencing, research trend, scientometrics, science citation index expanded (SCI-Expanded), word cluster ...

  18. Cloning and sequence analysis of the defective in anther ...

    African Journals Online (AJOL)

    To clone the defective in anther dehiscence1 (DAD1) gene fragment of Chinese kale, about 700 bp product was obtained by PCR amplification using Chinese kale genomic DNA as the template and a pair of specific primers designed according to the conserved sequence of DAD1 genes of Arabidopsis thaliana and ...

  19. Sequence and comparative analysis of Leuconostoc dairy bacteriophages

    DEFF Research Database (Denmark)

    Kot, Witold; Hansen, Lars Henrik; Neve, Horst

    2014-01-01

    Bacteriophages attacking Leuconostoc species may significantly influence the quality of the final product. There is however limited knowledge of this group of phages in the literature. We have determined the complete genome sequences of nine Leuconostoc bacteriophages virulent to either Leuconostoc...

  20. Assessing models of speciation under different biogeographic scenarios; An empirical study using multi-locus and RNA-seq analyses

    Science.gov (United States)

    Edwards, Taylor; Tollis, Marc; Hsieh, PingHsun; Gutenkunst, Ryan N.; Liu, Zhen; Kusumi, Kenro; Culver, Melanie; Murphy, Robert W.

    2016-01-01

    Evolutionary biology often seeks to decipher the drivers of speciation, and much debate persists over the relative importance of isolation and gene flow in the formation of new species. Genetic studies of closely related species can assess if gene flow was present during speciation, because signatures of past introgression often persist in the genome. We test hypotheses on which mechanisms of speciation drove diversity among three distinct lineages of desert tortoise in the genus Gopherus. These lineages offer a powerful system to study speciation, because different biogeographic patterns (physical vs. ecological segregation) are observed at opposing ends of their distributions. We use 82 samples collected from 38 sites, representing the entire species' distribution and generate sequence data for mtDNA and four nuclear loci. A multilocus phylogenetic analysis in *BEAST estimates the species tree. RNA-seq data yield 20,126 synonymous variants from 7665 contigs from two individuals of each of the three lineages. Analyses of these data using the demographic inference package ∂a∂i serve to test the null hypothesis of no gene flow during divergence. The best-fit demographic model for the three taxa is concordant with the *BEAST species tree, and the ∂a∂i analysis does not indicate gene flow among any of the three lineages during their divergence. These analyses suggest that divergence among the lineages occurred in the absence of gene flow and in this scenario the genetic signature of ecological isolation (parapatric model) cannot be differentiated from geographic isolation (allopatric model).

  1. XplorSeq: a software environment for integrated management and phylogenetic analysis of metagenomic sequence data.

    Science.gov (United States)

    Frank, Daniel N

    2008-10-07

    Advances in automated DNA sequencing technology have accelerated the generation of metagenomic DNA sequences, especially environmental ribosomal RNA gene (rDNA) sequences. As the scale of rDNA-based studies of microbial ecology has expanded, need has arisen for software that is capable of managing, annotating, and analyzing the plethora of diverse data accumulated in these projects. XplorSeq is a software package that facilitates the compilation, management and phylogenetic analysis of DNA sequences. XplorSeq was developed for, but is not limited to, high-throughput analysis of environmental rRNA gene sequences. XplorSeq integrates and extends several commonly used UNIX-based analysis tools by use of a Macintosh OS-X-based graphical user interface (GUI). Through this GUI, users may perform basic sequence import and assembly steps (base-calling, vector/primer trimming, contig assembly), perform BLAST (Basic Local Alignment and Search Tool; 123) searches of NCBI and local databases, create multiple sequence alignments, build phylogenetic trees, assemble Operational Taxonomic Units, estimate biodiversity indices, and summarize data in a variety of formats. Furthermore, sequences may be annotated with user-specified meta-data, which then can be used to sort data and organize analyses and reports. A document-based architecture permits parallel analysis of sequence data from multiple clones or amplicons, with sequences and other data stored in a single file. XplorSeq should benefit researchers who are engaged in analyses of environmental sequence data, especially those with little experience using bioinformatics software. Although XplorSeq was developed for management of rDNA sequence data, it can be applied to most any sequencing project. The application is available free of charge for non-commercial use at http://vent.colorado.edu/phyloware.

  2. Genome sequencing of bacteria: sequencing, de novo assembly and rapid analysis using open source tools.

    Science.gov (United States)

    Kisand, Veljo; Lettieri, Teresa

    2013-04-01

    De novo genome sequencing of previously uncharacterized microorganisms has the potential to open up new frontiers in microbial genomics by providing insight into both functional capabilities and biodiversity. Until recently, Roche 454 pyrosequencing was the NGS method of choice for de novo assembly because it generates hundreds of thousands of long reads (tools for processing NGS data are increasingly free and open source and are often adopted for both their high quality and role in promoting academic freedom. The error rate of pyrosequencing the Alcanivorax borkumensis genome was such that thousands of insertions and deletions were artificially introduced into the finished genome. Despite a high coverage (~30 fold), it did not allow the reference genome to be fully mapped. Reads from regions with errors had low quality, low coverage, or were missing. The main defect of the reference mapping was the introduction of artificial indels into contigs through lower than 100% consensus and distracting gene calling due to artificial stop codons. No assembler was able to perform de novo assembly comparable to reference mapping. Automated annotation tools performed similarly on reference mapped and de novo draft genomes, and annotated most CDSs in the de novo assembled draft genomes. Free and open source software (FOSS) tools for assembly and annotation of NGS data are being developed rapidly to provide accurate results with less computational effort. Usability is not high priority and these tools currently do not allow the data to be processed without manual intervention. Despite this, genome assemblers now readily assemble medium short reads into long contigs (>97-98% genome coverage). A notable gap in pyrosequencing technology is the quality of base pair calling and conflicting base pairs between single reads at the same nucleotide position. Regardless, using draft whole genomes that are not finished and remain fragmented into tens of contigs allows one to characterize

  3. Visualization of pairwise and multilocus linkage disequilibrium structure using latent forests.

    Directory of Open Access Journals (Sweden)

    Raphaël Mourad

    Linkage disequilibrium study represents a major issue in statistical genetics as it plays a fundamental role in gene mapping and helps us to learn more about human history. The linkage disequilibrium complex structure makes its exploratory data analysis essential yet challenging. Visualization methods, such as the triangular heat map implemented in Haploview, provide simple and useful tools to help understand complex genetic patterns, but remain insufficient to fully describe them. Probabilistic graphical models have been widely recognized as a powerful formalism allowing a concise and accurate modeling of dependences between variables. In this paper, we propose a method for short-range, long-range and chromosome-wide linkage disequilibrium visualization using forests of hierarchical latent class models. Thanks to its hierarchical nature, our method is shown to provide a compact view of both pairwise and multilocus linkage disequilibrium spatial structures for the geneticist. Besides, a multilocus linkage disequilibrium measure has been designed to evaluate linkage disequilibrium in hierarchy clusters. To learn the proposed model, a new scalable algorithm is presented. It constrains the dependence scope, relying on physical positions, and is able to deal with more than one hundred thousand single nucleotide polymorphisms. The proposed algorithm is fast and does not require phase genotypic data.

  4. Sequence analysis of the Legionella micdadei groELS operon

    DEFF Research Database (Denmark)

    Hindersson, P; Høiby, N; Bangsborg, Jette Marie

    1991-01-01

    A 2.7 kb DNA fragment encoding the 60 kDa common antigen (CA) and a 13 kDa protein of Legionella micdadei was sequenced. Two open reading frames of 57,677 and 10,456 Da were identified, corresponding to the heat shock proteins GroEL and GroES, respectively. Typical -35, -10, and Shine-Dalgarno heat...

  5. The Matrix Method of Representation, Analysis and Classification of Long Genetic Sequences

    Directory of Open Access Journals (Sweden)

    Ivan V. Stepanyan

    2017-01-01

    The article is devoted to a matrix method of comparative analysis of long nucleotide sequences by means of presenting each sequence in the form of three digital binary sequences. This method uses a set of symmetries of biochemical attributes of nucleotides. It also uses the possibility of presentation of every whole set of N-mers as one of the members of a Kronecker family of genetic matrices. With this method, a long nucleotide sequence can be visually represented as an individual fractal-like mosaic or another regular mosaic of binary type. In contrast to natural nucleotide sequences, artificial random sequences give non-regular patterns. Examples of binary mosaics of long nucleotide sequences are shown, including cases of human chromosomes and penicillins. The obtained results are then discussed.
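
    The first step described above, turning one nucleotide sequence into three binary sequences, can be sketched as follows. This is an illustration only: the abstract does not name the attributes, so the three used here (purine/pyrimidine, amino/keto, strong/weak hydrogen bonding) are an assumption, and the function name is hypothetical.

```python
# Sketch only: the three binary attributes below are an assumed choice of
# "biochemical attributes"; the article may encode them differently.
PURINE = set("AG")   # A, G are purines; C, T are pyrimidines
AMINO = set("AC")    # A, C carry an amino group; G, T carry a keto group
STRONG = set("CG")   # C, G pair with three hydrogen bonds; A, T with two


def to_binary_sequences(seq):
    """Return three 0/1 lists, one per attribute, for a DNA string."""
    seq = seq.upper()
    unknown = set(seq) - set("ACGT")
    if unknown:
        raise ValueError(f"unsupported symbols: {sorted(unknown)}")
    return (
        [int(base in PURINE) for base in seq],
        [int(base in AMINO) for base in seq],
        [int(base in STRONG) for base in seq],
    )


purine_bits, amino_bits, strong_bits = to_binary_sequences("GATTACA")
print(purine_bits)
print(amino_bits)
print(strong_bits)
```

    Each of the three lists could then be laid out row by row as a black-and-white image, which is one simple way to obtain a binary mosaic; the Kronecker-matrix construction mentioned in the abstract is not reproduced here.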

  6. OPTSDNA: Performance evaluation of an efficient distributed bioinformatics system for DNA sequence analysis.

    Science.gov (United States)

    Khan, Mohammad Ibrahim; Sheel, Chotan

    2013-01-01

    Storage of sequence data is a major concern because the amount of data generated at several locations is growing exponentially. Therefore, there is a need to develop techniques to store data using compression algorithms. Here we describe an optimal storage algorithm (OPTSDNA) for storing large amounts of DNA sequences of varying length. This paper provides a performance analysis of OPTSDNA within a distributed bioinformatics computing system for the analysis of DNA sequences. The OPTSDNA algorithm is used for storing DNA sequences of various sizes, from very small to very large, in a database. Storage size and response time are both calculated in this work. The efficiency and performance of the algorithm are high (in size calculation with percentage) when compared with other known sequential approaches.

  7. Analysis of xylem formation in pine by cDNA sequencing

    Science.gov (United States)

    Allona, I.; Quinn, M.; Shoop, E.; Swope, K.; St Cyr, S.; Carlis, J.; Riedl, J.; Retzel, E.; Campbell, M. M.; Sederoff, R.

    1998-01-01

    Secondary xylem (wood) formation is likely to involve some genes expressed rarely or not at all in herbaceous plants. Moreover, environmental and developmental stimuli influence secondary xylem differentiation, producing morphological and chemical changes in wood. To increase our understanding of xylem formation, and to provide material for comparative analysis of gymnosperm and angiosperm sequences, ESTs were obtained from immature xylem of loblolly pine (Pinus taeda L.). A total of 1,097 single-pass sequences were obtained from 5' ends of cDNAs made from gravistimulated tissue from bent trees. Cluster analysis detected 107 groups of similar sequences, ranging in size from 2 to 20 sequences. A total of 361 sequences fell into these groups, whereas 736 sequences were unique. About 55% of the pine EST sequences show similarity to previously described sequences in public databases. About 10% of the recognized genes encode factors involved in cell wall formation. Sequences similar to cell wall proteins, most known lignin biosynthetic enzymes, and several enzymes of carbohydrate metabolism were found. A number of putative regulatory proteins also are represented. Expression patterns of several of these genes were studied in various tissues and organs of pine. Sequencing novel genes expressed during xylem formation will provide a powerful means of identifying mechanisms controlling this important differentiation pathway.
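
    The clustering figures in this abstract fit together arithmetically, as the short check below suggests. The count of distinct transcripts is not stated in the abstract; it is derived here under the assumption that each cluster represents one transcript. Variable names are illustrative.

```python
# Figures quoted in the abstract above; variable names are illustrative.
total_sequences = 1_097
clusters = 107
clustered_sequences = 361
unique_sequences = 736

# Clustered and unique sequences should account for every EST.
accounted_for = clustered_sequences + unique_sequences

# Assumption: one cluster corresponds to one distinct transcript.
distinct_transcripts = clusters + unique_sequences
mean_cluster_size = clustered_sequences / clusters

print(f"accounted for: {accounted_for} of {total_sequences}")
print(f"distinct transcripts (assumed): {distinct_transcripts}")
print(f"mean cluster size: {mean_cluster_size:.2f}")
```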

  8. MiSeq: A Next Generation Sequencing Platform for Genomic Analysis.

    Science.gov (United States)

    Ravi, Rupesh Kanchi; Walton, Kendra; Khosroheidari, Mahdieh

    2018-01-01

    MiSeq, Illumina's integrated next generation sequencing instrument, uses reversible-terminator sequencing-by-synthesis technology to provide end-to-end sequencing solutions. The MiSeq instrument is one of the smallest benchtop sequencers that can perform onboard cluster generation, amplification, genomic DNA sequencing, and data analysis, including base calling, alignment and variant calling, in a single run. It performs both single- and paired-end runs with adjustable read lengths from 1 × 36 base pairs to 2 × 300 base pairs. A single run can produce output data of up to 15 Gb in as little as 4 h of runtime and can output up to 25 M single reads and 50 M paired-end reads. Thus, MiSeq provides an ideal platform for rapid turnaround time. MiSeq is also a cost-effective tool for various analyses focused on targeted gene sequencing (amplicon sequencing and target enrichment), metagenomics, and gene expression studies. For these reasons, MiSeq has become one of the most widely used next generation sequencing platforms. Here, we provide a protocol to prepare libraries for sequencing using the MiSeq instrument and basic guidelines for analysis of output data from the MiSeq sequencing run.
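
    The throughput figures quoted above are consistent with each other, as a quick calculation suggests. The sketch below assumes that the maximum output corresponds to the longest paired-end configuration (2 × 300 bp) and ignores quality filtering and control reads; the function name is illustrative and not part of any Illumina software.

```python
def yield_gigabases(reads, read_length_bp):
    """Total bases in a run, in Gb (1 Gb = 1e9 bases)."""
    return reads * read_length_bp / 1e9


# 50 M paired-end reads at 300 bp each (assumed maximum configuration).
paired_end_yield = yield_gigabases(50_000_000, 300)

# 25 M single reads at the shortest read length mentioned, 1 x 36 bp.
shortest_single_yield = yield_gigabases(25_000_000, 36)

print(f"2 x 300 bp paired-end: {paired_end_yield:.1f} Gb")
print(f"1 x 36 bp single-end: {shortest_single_yield:.1f} Gb")
```

    Under these assumptions the paired-end case reproduces the 15 Gb figure given in the abstract.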

  9. Maturity onset diabetes of youth (MODY) in Turkish children: sequence analysis of 11 causative genes by next generation sequencing.

    Science.gov (United States)

    Ağladıoğlu, Sebahat Yılmaz; Aycan, Zehra; Çetinkaya, Semra; Baş, Veysel Nijat; Önder, Aşan; Peltek Kendirci, Havva Nur; Doğan, Haldun; Ceylaner, Serdar

    2016-04-01

    Maturity-onset diabetes of the youth (MODY) is a genetically and clinically heterogeneous group of diseases and is often misdiagnosed as type 1 or type 2 diabetes. The aim of this study is to investigate both novel and proven mutations of 11 MODY genes in Turkish children by using targeted next generation sequencing. A panel of 11 MODY genes was screened in 43 children with MODY diagnosed by clinical criteria. Studies of index cases were done with MISEQ-ILLUMINA, and family screenings and confirmation studies of mutations were done by Sanger sequencing. We identified 28 (65%) point mutations among 43 patients. Eighteen patients have GCK mutations, four have HNF1A, one has HNF4A, one has HNF1B, two have NEUROD1, one has PDX1 gene variations and one patient has both HNF1A and HNF4A heterozygote mutations. This is the first study including molecular studies of 11 MODY genes in Turkish children. GCK is the most frequent type of MODY in our study population. The very high frequency of novel mutations (42%) in our study population supports that, in heterogeneous disorders like MODY, sequence analysis provides rapid, cost-effective and accurate genetic diagnosis.

  10. Whole genome sequencing and bioinformatics analysis of two Egyptian genomes.

    Science.gov (United States)

    ElHefnawi, Mahmoud; Jeon, Sungwon; Bhak, Youngjune; ElFiky, Asmaa; Horaiz, Ahmed; Jun, JeHoon; Kim, Hyunho; Bhak, Jong

    2018-05-15

    We report two Egyptian male genomes (EGP1 and EGP2) sequenced at ~30× sequencing depth. EGP1 had 4.7 million variants, of which 198,877 were novel, while EGP2 had 209,109 novel variants out of 4.8 million variants. The mitochondrial haplogroups of the two individuals were identified as H7b1 and L2a1c, respectively. We also identified the Y haplogroup of EGP1 (R1b) and EGP2 (J1a2a1a2 > P58 > FGC11). EGP1 had a mutation in the NADH gene of the mitochondrial genome ND4 (m.11778 G > A) that causes Leber's hereditary optic neuropathy. Some SNPs shared by the two genomes were associated with an increased level of cholesterol and triglycerides, probably related to obesity in Egyptians. Comparison of these genomes with African and Western-Asian genomes can provide insights into Egyptian ancestry and genetic history. This resource can be used to further understand genomic diversity and functional classification of variants as well as human migration and evolution across Africa and Western-Asia. Copyright © 2017. Published by Elsevier B.V.

  11. Accident sequence precursor analysis level 2/3 model development

    International Nuclear Information System (INIS)

    Lui, C.H.; Galyean, W.J.; Brownson, D.A.

    1997-01-01

    The US Nuclear Regulatory Commission's Accident Sequence Precursor (ASP) program currently uses simple Level 1 models to assess the conditional core damage probability for operational events occurring in commercial nuclear power plants (NPP). Since not all accident sequences leading to core damage will result in the same radiological consequences, it is necessary to develop simple Level 2/3 models that can be used to analyze the response of the NPP containment structure in the context of a core damage accident, estimate the magnitude of the resulting radioactive releases to the environment, and calculate the consequences associated with these releases. The simple Level 2/3 model development work was initiated in 1995, and several prototype models have been completed. Once developed, these simple Level 2/3 models are linked to the simple Level 1 models to provide risk perspectives for operational events. This paper describes the methods implemented for the development of these simple Level 2/3 ASP models, and the linkage process to the existing Level 1 models

  12. In Vivo Enhancer Analysis Chromosome 16 Conserved Noncoding Sequences

    Energy Technology Data Exchange (ETDEWEB)

    Pennacchio, Len A.; Ahituv, Nadav; Moses, Alan M.; Nobrega, Marcelo; Prabhakar, Shyam; Shoukry, Malak; Minovitsky, Simon; Visel, Axel; Dubchak, Inna; Holt, Amy; Lewis, Keith D.; Plajzer-Frick, Ingrid; Akiyama, Jennifer; De Val, Sarah; Afzal, Veena; Black, Brian L.; Couronne, Olivier; Eisen, Michael B.; Rubin, Edward M.

    2006-02-01

    The identification of enhancers with predicted specificities in vertebrate genomes remains a significant challenge that is hampered by a lack of experimentally validated training sets. In this study, we leveraged extreme evolutionary sequence conservation as a filter to identify putative gene regulatory elements and characterized the in vivo enhancer activity of human-fish conserved and ultraconserved noncoding elements on human chromosome 16 as well as such elements from elsewhere in the genome. We initially tested 165 of these extremely conserved sequences in a transgenic mouse enhancer assay and observed that 48 percent (79/165) functioned reproducibly as tissue-specific enhancers of gene expression at embryonic day 11.5. While driving expression in a broad range of anatomical structures in the embryo, the majority of the 79 enhancers drove expression in various regions of the developing nervous system. Studying a set of DNA elements that specifically drove forebrain expression, we identified DNA signatures specifically enriched in these elements and used these parameters to rank all ~3,400 human-fugu conserved noncoding elements in the human genome. The testing of the top predictions in transgenic mice resulted in a three-fold enrichment for sequences with forebrain enhancer activity. These data dramatically expand the catalogue of in vivo-characterized human gene enhancers and illustrate the future utility of such training sets for a variety of biological applications including decoding the regulatory vocabulary of the human genome.

  13. Molecular characterization and multi-locus genotypes of Enterocytozoon bieneusi from captive red kangaroos (Macropus Rufus) in Jiangsu province, China.

    Directory of Open Access Journals (Sweden)

    Zhijun Zhong

    Enterocytozoon bieneusi is the most common pathogen of microsporidian species infecting humans worldwide. Although E. bieneusi has been found in a variety of animal hosts, information on the presence of E. bieneusi in captive kangaroos in China is limited. The present study was aimed at determining the occurrence and genetic diversity of E. bieneusi in captive kangaroos. A total of 61 fecal specimens (38 from red kangaroos and 23 from grey kangaroos) were collected from Nanjing Hongshan Forest Zoo and Hongshan Kangaroo Breeding Research Base, Jiangsu province, China. Using nested PCR amplification of the ITS gene of the rRNA of E. bieneusi, a total of 23.0% (14/61) of tested samples were PCR-positive with three genotypes (i.e. one known genotype, CHK1, and two novel genotypes, CSK1 and CSK2). Multi-locus sequence typing using three microsatellites (MS1, MS3, and MS7) and one minisatellite (MS4) revealed one, five, two, and one types at these four loci, respectively. In phylogenetic analysis, the two genotypes, CHK1 and CSK1, were clustered into a new group of unknown zoonotic potential, and the novel genotype CSK2 was clustered into a separate clade with PtEb and PtEbIX. To date, this is the first report on the presence of E. bieneusi in captive red kangaroos in Jiangsu province, China. Furthermore, a high degree of genetic diversity was observed in the E. bieneusi genotype and seven MLGs (MLG1-7) were found in red kangaroos. Our findings suggest that infected kangaroos may act as potential reservoirs of E. bieneusi and be a source of infection for other animals.

  14. Sequence analysis of putative swrW gene required for surfactant ...

    African Journals Online (AJOL)

    Serratia marcescens produces biosurfactant serrawettin, essential for its population migration behavior. Serrawettin W1 was revealed to be an antibiotic serratamolide that makes it significant for deoxyribonucleic acid (DNA) and protein sequence analysis. Four nucleotide and amino-acid sequences from local strains ...

  15. Genomic insight into the common carp (Cyprinus carpio) genome by sequencing analysis of BAC-end sequences

    Directory of Open Access Journals (Sweden)

    Wang Jintu

    2011-04-01

    Background Common carp is one of the most important aquaculture teleost fish in the world. Common carp and other closely related Cyprinidae species provide over 30% of aquaculture production in the world. However, common carp genomic resources are still relatively underdeveloped. BAC end sequences (BES) are important resources for genome research on BAC-anchored genetic marker development, linkage map and physical map integration, and whole genome sequence assembling and scaffolding. Result To develop such valuable resources in common carp (Cyprinus carpio), a total of 40,224 BAC clones were sequenced on both ends, generating 65,720 clean BES with an average read length of 647 bp after sequence processing, representing 42,522,168 bp or 2.5% of the common carp genome. The first survey of the common carp genome was conducted with various bioinformatics tools. The common carp genome contains over 17.3% of repetitive elements with GC content of 36.8% and 518 transposon ORFs. To identify and develop BAC-anchored microsatellite markers, a total of 13,581 microsatellites were detected from 10,355 BES. The coding regions of 7,127 genes were recognized from 9,443 BES on 7,453 BACs, with 1,990 BACs having genes on both ends. To evaluate the similarity to the genome of closely related zebrafish, BES of common carp were aligned against the zebrafish genome. A total of 39,335 BES of common carp have conserved homologs on the zebrafish genome, which demonstrates the high similarity between zebrafish and common carp genomes and indicates the feasibility of comparative mapping between zebrafish and common carp once a physical map of common carp is available. Conclusion BAC end sequences are great resources for the first genome wide survey of common carp. The repetitive DNA was estimated to be approximately 28% of the common carp genome, indicating the higher complexity of the genome. Comparative analysis had mapped around 40,000 BES to zebrafish genome and established over 3
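
    The BES coverage figures in this record can be related to each other with a short calculation. The sketch below uses only the numbers quoted in the abstract; because the 2.5% coverage figure is rounded, the genome size it implies is approximate and is not a value stated by the authors. Variable names are illustrative.

```python
# Figures quoted in the abstract above; variable names are illustrative.
clean_bes = 65_720
total_bp = 42_522_168
genome_fraction = 0.025  # "2.5% of common carp genome" (rounded in the source)

# Mean length per clean BAC end sequence.
mean_read_length = total_bp / clean_bes

# Genome size implied by the rounded coverage figure (approximate).
implied_genome_bp = total_bp / genome_fraction

print(f"mean BES length: {mean_read_length:.0f} bp")
print(f"implied genome size: {implied_genome_bp / 1e9:.2f} Gb")
```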

  16. Genomic insight into the common carp (Cyprinus carpio) genome by sequencing analysis of BAC-end sequences

    Science.gov (United States)

    2011-01-01

    Background Common carp is one of the most important aquaculture teleost fish in the world. Common carp and other closely related Cyprinidae species provide over 30% of aquaculture production in the world. However, common carp genomic resources are still relatively underdeveloped. BAC end sequences (BES) are important resources for genome research on BAC-anchored genetic marker development, linkage map and physical map integration, and whole genome sequence assembling and scaffolding. Result To develop such valuable resources in common carp (Cyprinus carpio), a total of 40,224 BAC clones were sequenced on both ends, generating 65,720 clean BES with an average read length of 647 bp after sequence processing, representing 42,522,168 bp or 2.5% of the common carp genome. The first survey of the common carp genome was conducted with various bioinformatics tools. The common carp genome contains over 17.3% of repetitive elements with GC content of 36.8% and 518 transposon ORFs. To identify and develop BAC-anchored microsatellite markers, a total of 13,581 microsatellites were detected from 10,355 BES. The coding regions of 7,127 genes were recognized from 9,443 BES on 7,453 BACs, with 1,990 BACs having genes on both ends. To evaluate the similarity to the genome of the closely related zebrafish, BES of common carp were aligned against the zebrafish genome. A total of 39,335 BES of common carp have conserved homologs in the zebrafish genome, which demonstrates the high similarity between the zebrafish and common carp genomes and indicates the feasibility of comparative mapping between zebrafish and common carp once a physical map of common carp is available. Conclusion BAC end sequences are great resources for the first genome-wide survey of common carp. The repetitive DNA was estimated to be approximately 28% of the common carp genome, indicating the higher complexity of the genome. Comparative analysis mapped around 40,000 BES to the zebrafish genome and established over 3,100 microsyntenies, covering over 50% of
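    The headline figures in this abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only numbers quoted in the abstract; the variable names are illustrative, and the implied genome size is a rough back-calculation from the rounded "2.5%" figure, not a value reported by the authors.

```python
# Figures quoted in the abstract.
n_clones = 40_224          # BAC clones sequenced on both ends
n_clean_bes = 65_720       # clean BAC end sequences retained
total_bp = 42_522_168      # total bases in the clean BES
coverage_fraction = 0.025  # "2.5% of the common carp genome" (rounded)

# Mean read length implied by the totals.
mean_read_bp = total_bp / n_clean_bes

# Share of attempted end reads (two per clone) that survived processing.
success_rate = n_clean_bes / (2 * n_clones)

# Rough genome size implied by the rounded coverage figure (assumption:
# the 2.5% is taken relative to the whole genome length).
implied_genome_bp = total_bp / coverage_fraction

print(f"mean read length: {mean_read_bp:.1f} bp")
print(f"end-read success rate: {success_rate:.1%}")
print(f"implied genome size: {implied_genome_bp / 1e9:.2f} Gb")
```

    The mean read length agrees with the quoted 647 bp, and the implied genome size is on the order of 1.7 Gb.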

  17. A symbolic dynamics approach for the complexity analysis of chaotic pseudo-random sequences

    International Nuclear Information System (INIS)

    Xiao Fanghong

    2004-01-01

    By considering a chaotic pseudo-random sequence as a symbolic sequence, the authors present a symbolic dynamics approach for the complexity analysis of chaotic pseudo-random sequences. The method is applied to the Logistic map and to a one-way coupled map lattice to demonstrate how it works, and it is compared with the approximate entropy method. The results show that the method can distinguish the complexities of different chaotic pseudo-random sequences, and that it is superior to the approximate entropy method.
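    To illustrate what treating a chaotic sequence as a symbolic sequence can look like, the sketch below binarizes Logistic map orbits at 0.5 and compares the Shannon entropy of their length-4 words. This is not the authors' complexity measure, which the abstract does not specify; block entropy is substituted here as a simple stand-in, and the parameter values (r = 4.0, r = 3.5, threshold 0.5) and function names are assumptions chosen for the example.

```python
import math
from collections import Counter


def logistic_symbols(x0: float, r: float, n: int, burn_in: int = 1000) -> str:
    """Iterate x -> r*x*(1-x) and emit '1' when x >= 0.5, else '0'."""
    x = x0
    for _ in range(burn_in):  # discard the transient
        x = r * x * (1.0 - x)
    symbols = []
    for _ in range(n):
        x = r * x * (1.0 - x)
        symbols.append("1" if x >= 0.5 else "0")
    return "".join(symbols)


def block_entropy(seq: str, k: int) -> float:
    """Shannon entropy in bits of the overlapping length-k words in seq."""
    words = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(words.values())
    return -sum(c / total * math.log2(c / total) for c in words.values())


# r = 4.0 is fully chaotic; r = 3.5 settles onto a period-4 orbit.
chaotic = logistic_symbols(x0=0.3, r=4.0, n=20000)
periodic = logistic_symbols(x0=0.3, r=3.5, n=20000)

h_chaotic = block_entropy(chaotic, 4)
h_periodic = block_entropy(periodic, 4)
print(f"chaotic: {h_chaotic:.3f} bits, periodic: {h_periodic:.3f} bits")
```

    The chaotic orbit uses nearly all 16 possible 4-symbol words, while the periodic orbit cycles through at most four, so its block entropy cannot exceed 2 bits.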

  18. The sequence and analysis of duplication rich human chromosome 16

    Energy Technology Data Exchange (ETDEWEB)

    Martin, Joel; Han, Cliff; Gordon, Laurie A.; Terry, Astrid; Prabhakar, Shyam; She, Xinwei; Xie, Gary; Hellsten, Uffe; Man Chan, Yee; Altherr, Michael; Couronne, Olivier; Aerts, Andrea; Bajorek, Eva; Black, Stacey; Blumer, Heather; Branscomb, Elbert; Brown, Nancy C.; Bruno, William J.; Buckingham, Judith M.; Callen, David F.; Campbell, Connie S.; Campbell, Mary L.; Campbell, Evelyn W.; Caoile, Chenier; Challacombe, Jean F.; Chasteen, Leslie A.; Chertkov, Olga; Chi, Han C.; Christensen, Mari; Clark, Lynn M.; Cohn, Judith D.; Denys, Mirian; Detter, John C.; Dickson, Mark; Dimitrijevic-Bussod, Mira; Escobar, Julio; Fawcett, Joseph J.; Flowers, Dave; Fotopulos, Dea; Glavina, Tijana; Gomez, Maria; Gonzales, Eidelyn; Goodstein, David; Goodwin, Lynne A.; Grady, Deborah L.; Grigoriev, Igor; Groza, Matthew; Hammon, Nancy; Hawkins, Trevor; Haydu, Lauren; Hildebrand, Carl E.; Huang, Wayne; Israni, Sanjay; Jett, Jamie; Jewett, Phillip E.; Kadner, Kristen; Kimball, Heather; Kobayashi, Arthur; Krawczyk, Marie-Claude; Leyba, Tina; Longmire, Jonathan L.; Lopez, Frederick; Lou, Yunian; Lowry, Steve; Ludeman, Thom; Mark, Graham A.; Mcmurray, Kimberly L.; Meincke, Linda J.; Morgan, Jenna; Moyzis, Robert K.; Mundt, Mark O.; Munk, A. Christine; Nandkeshwar, Richard D.; Pitluck, Sam; Pollard, Martin; Predki, Paul; Parson-Quintana, Beverly; Ramirez, Lucia; Rash, Sam; Retterer, James; Ricke, Darryl O.; Robinson, Donna L.; Rodriguez, Alex; Salamov, Asaf; Saunders, Elizabeth H.; Scott, Duncan; Shough, Timothy; Stallings, Raymond L.; Stalvey, Malinda; Sutherland, Robert D.; Tapia, Roxanne; Tesmer, Judith G.; Thayer, Nina; Thompson, Linda S.; Tice, Hope; Torney, David C.; Tran-Gyamfi, Mary; Tsai, Ming; Ulanovsky, Levy E.; Ustaszewska, Anna; Vo, Nu; White, P. Scott; Williams, Albert L.; Wills, Patricia L.; Wu, Jung-Rung; Wu, Kevin; Yang, Joan; DeJong, Pieter; Bruce, David; Doggett, Norman; Deaven, Larry; Schmutz, Jeremy; Grimwood, Jane; Richardson, Paul; et al.

    2004-08-01

    We report here the 78,884,754 base pairs of finished human chromosome 16 sequence, representing over 99.9 percent of its euchromatin. Manual annotation revealed 880 protein coding genes confirmed by 1,637 aligned transcripts, 19 tRNA genes, 341 pseudogenes and 3 RNA pseudogenes. These genes include metallothionein, cadherin and iroquois gene families, as well as the disease genes for polycystic kidney disease and acute myelomonocytic leukemia. Several large-scale structural polymorphisms spanning hundreds of kilobasepairs were identified and result in gene content differences across humans. One of the unique features of chromosome 16 is its high level of segmental duplication, ranked among the highest of the human autosomes. While the segmental duplications are enriched in the relatively gene poor pericentromere of the p-arm, some are involved in recent gene duplication and conversion events which are likely to have had an impact on the evolution of primates and human disease susceptibility.

  19. Analysis of decision procedures for a sequence of inventory periods

    International Nuclear Information System (INIS)

    Avenhaus, R.

    1982-07-01

    Optimal test procedures for a sequence of inventory periods will be discussed. Starting with a game theoretical description of the conflict situation between the plant operator and the inspector, the objectives of the inspector as well as the general decision theoretical problem will be formulated. In the first part the objective of 'secure' detection will be emphasized which means that only at the end of the reference time a decision is taken by the inspector. In the second part the objective of 'timely' detection will be emphasized which will lead to sequential test procedures. At the end of the paper all procedures will be summarized, and in view of the multitude of procedures available at the moment some comments about future work will be given. (orig./HP)

  20. The Sequence and Analysis of Duplication Rich Human Chromosome 16

    Science.gov (United States)

    Martin, Joel; Han, Cliff; Gordon, Laurie A.; Terry, Astrid; Prabhakar, Shyam; She, Xinwei; Xie, Gary; Hellsten, Uffe; Man Chan, Yee; Altherr, Michael; Couronne, Olivier; Aerts, Andrea; Bajorek, Eva; Black, Stacey; Blumer, Heather; Branscomb, Elbert; Brown, Nancy C.; Bruno, William J.; Buckingham, Judith M.; Callen, David F.; Campbell, Connie S.; Campbell, Mary L.; Campbell, Evelyn W.; Caoile, Chenier; Challacombe, Jean F.; Chasteen, Leslie A.; Chertkov, Olga; Chi, Han C.; Christensen, Mari; Clark, Lynn M.; Cohn, Judith D.; Denys, Mirian; Detter, John C.; Dickson, Mark; Dimitrijevic-Bussod, Mira; Escobar, Julio; Fawcett, Joseph J.; Flowers, Dave; Fotopulos, Dea; Glavina, Tijana; Gomez, Maria; Gonzales, Eidelyn; Goodstein, David; Goodwin, Lynne A.; Grady, Deborah L.; Grigoriev, Igor; Groza, Matthew; Hammon, Nancy; Hawkins, Trevor; Haydu, Lauren; Hildebrand, Carl E.; Huang, Wayne; Israni, Sanjay; Jett, Jamie; Jewett, Phillip E.; Kadner, Kristen; Kimball, Heather; Kobayashi, Arthur; Krawczyk, Marie-Claude; Leyba, Tina; Longmire, Jonathan L.; Lopez, Frederick; Lou, Yunian; Lowry, Steve; Ludeman, Thom; Mark, Graham A.; Mcmurray, Kimberly L.; Meincke, Linda J.; Morgan, Jenna; Moyzis, Robert K.; Mundt, Mark O.; Munk, A. Christine; Nandkeshwar, Richard D.; Pitluck, Sam; Pollard, Martin; Predki, Paul; Parson-Quintana, Beverly; Ramirez, Lucia; Rash, Sam; Retterer, James; Ricke, Darryl O.; Robinson, Donna L.; Rodriguez, Alex; Salamov, Asaf; Saunders, Elizabeth H.; Scott, Duncan; Shough, Timothy; Stallings, Raymond L.; Stalvey, Malinda; Sutherland, Robert D.; Tapia, Roxanne; Tesmer, Judith G.; Thayer, Nina; Thompson, Linda S.; Tice, Hope; Torney, David C.; Tran-Gyamfi, Mary; Tsai, Ming; Ulanovsky, Levy E.; Ustaszewska, Anna; Vo, Nu; White, P. Scott; Williams, Albert L.; Wills, Patricia L.; Wu, Jung-Rung; Wu, Kevin; Yang, Joan; DeJong, Pieter; Bruce, David; Doggett, Norman; Deaven, Larry; Schmutz, Jeremy; Grimwood, Jane; Richardson, Paul; et al.

    2004-01-01

    We report here the 78,884,754 base pairs of finished human chromosome 16 sequence, representing over 99.9 percent of its euchromatin. Manual annotation revealed 880 protein coding genes confirmed by 1,637 aligned transcripts, 19 tRNA genes, 341 pseudogenes and 3 RNA pseudogenes. These genes include metallothionein, cadherin and iroquois gene families, as well as the disease genes for polycystic kidney disease and acute myelomonocytic leukemia. Several large-scale structural polymorphisms spanning hundreds of kilobasepairs were identified and result in gene content differences across humans. One of the unique features of chromosome 16 is its high level of segmental duplication, ranked among the highest of the human autosomes. While the segmental duplications are enriched in the relatively gene poor pericentromere of the p-arm, some are involved in recent gene duplication and conversion events which are likely to have had an impact on the evolution of primates and human disease susceptibility.

  1. Factoring local sequence composition in motif significance analysis.

    Science.gov (United States)

    Ng, Patrick; Keich, Uri

    2008-01-01

    We recently introduced a biologically realistic and reliable significance analysis of the output of a popular class of motif finders. In this paper we further improve our significance analysis by incorporating local base composition information. Relying on realistic biological data simulation, as well as on FDR analysis applied to real data, we show that our method is significantly better than the increasingly popular practice of using the normal approximation to estimate the significance of a finder's output. Finally we turn to leveraging our reliable significance analysis to improve the actual motif finding task. Specifically, endowing a variant of the Gibbs Sampler with our improved significance analysis we demonstrate that de novo finders can perform better than has been perceived. Significantly, our new variant outperforms all the finders reviewed in a recently published comprehensive analysis of the Harbison genome-wide binding location data. Interestingly, many of these finders incorporate additional information such as nucleosome positioning and the significance of binding data.

  2. Peptide Pattern Recognition for high-throughput protein sequence analysis and clustering

    DEFF Research Database (Denmark)

    Busk, Peter Kamp

    2017-01-01

    Large collections of protein sequences with divergent sequences are tedious to analyze for understanding their phylogenetic or structure-function relation. Peptide Pattern Recognition is an algorithm that was developed to facilitate this task but the previous version does only allow a limited...... number of sequences as input. I implemented Peptide Pattern Recognition as a multithread software designed to handle large numbers of sequences and perform analysis in a reasonable time frame. Benchmarking showed that the new implementation of Peptide Pattern Recognition is twenty times faster than...... the previous implementation on a small protein collection with 673 MAP kinase sequences. In addition, the new implementation could analyze a large protein collection with 48,570 Glycosyl Transferase family 20 sequences without reaching its upper limit on a desktop computer. Peptide Pattern Recognition...

  3. Information-Theoretical Analysis of EEG Microstate Sequences in Python

    Directory of Open Access Journals (Sweden)

    Frederic von Wegner

    2018-06-01

    We present an open-source Python package to compute information-theoretical quantities for electroencephalographic data. Electroencephalography (EEG) measures the electrical potential generated by the cerebral cortex, and the set of spatial patterns projected by the brain's electrical potential on the scalp surface can be clustered into a set of representative maps called EEG microstates. Microstate time series are obtained by competitively fitting the microstate maps back into the EEG data set, i.e., by substituting the EEG data at a given time with the label of the microstate that has the highest similarity with the actual EEG topography. As microstate sequences consist of non-metric random variables, e.g., the letters A–D, we recently introduced information-theoretical measures to quantify these time series. In wakeful resting state EEG recordings, we found new characteristics of microstate sequences such as periodicities related to EEG frequency bands. The algorithms used are here provided as an open-source package and their use is explained in a tutorial style. The package is self-contained and the programming style is procedural, focusing on code intelligibility and easy portability. Using a sample EEG file, we demonstrate how to perform EEG microstate segmentation using the modified K-means approach, and how to compute and visualize the recently introduced information-theoretical tests and quantities. The time-lagged mutual information function is derived as a discrete symbolic alternative to the autocorrelation function for metric time series and confidence intervals are computed from Markov chain surrogate data. The software package provides an open-source extension to the existing implementations of the microstate transform and is specifically designed to analyze resting state EEG recordings.
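    The time-lagged mutual information mentioned in this abstract can be sketched for any symbolic sequence with a plug-in frequency estimate. The code below is an independent illustration and does not reproduce the package's API; the function name and the two-letter toy sequence are assumptions, and the Markov-surrogate confidence intervals described in the abstract are not included.

```python
import math
from collections import Counter


def lagged_mutual_information(seq: str, lag: int) -> float:
    """Plug-in estimate of I(X_t; X_{t+lag}) in bits for a symbolic sequence."""
    pairs = list(zip(seq, seq[lag:]))
    n = len(pairs)
    joint = Counter(pairs)
    left = Counter(a for a, _ in pairs)
    right = Counter(b for _, b in pairs)
    mi = 0.0
    for (a, b), c in joint.items():
        # p(a,b) * log2( p(a,b) / (p(a) * p(b)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (left[a] * right[b]))
    return mi


# Toy label sequence with period 4 (real microstate labels would be A-D).
toy = "AABB" * 500
mi_by_lag = {lag: lagged_mutual_information(toy, lag) for lag in (1, 2, 4)}
for lag, value in mi_by_lag.items():
    print(f"lag {lag}: {value:.3f} bits")
```

    In this toy sequence the symbol one step ahead is uninformative, whereas lags 2 and 4 determine the symbol exactly, so the mutual information rises from near 0 to about 1 bit. Peaks of this kind at regular lags are how periodicity shows up in a symbolic analogue of the autocorrelation function.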

  4. Massively parallel sequencing and analysis of the Necator americanus transcriptome.

    Directory of Open Access Journals (Sweden)

    Cinzia Cantacessi

    2010-05-01

    The blood-feeding hookworm Necator americanus infects hundreds of millions of people worldwide. In order to elucidate fundamental molecular biological aspects of this hookworm, the transcriptome of the adult stage of Necator americanus was explored using next-generation sequencing and bioinformatic analyses. A total of 19,997 contigs were assembled from the sequence data; 6,771 of these contigs had known orthologues in the free-living nematode Caenorhabditis elegans, and most of them encoded proteins with WD40 repeats (10.6%), proteinase inhibitors (7.8%) or calcium-binding EF-hand proteins (6.7%). Bioinformatic analyses inferred that the C. elegans homologues are involved mainly in biological pathways linked to ribosome biogenesis (70%), oxidative phosphorylation (63%) and/or proteases (60%); most of these molecules were predicted to be involved in more than one biological pathway. Comparative analyses of the transcriptomes of N. americanus and the canine hookworm, Ancylostoma caninum, revealed qualitative and quantitative differences. For instance, proteinase inhibitors were inferred to be highly represented in the former species, whereas SCP/Tpx-1/Ag5/PR-1/Sc7 proteins (= SCP/TAPS or Ancylostoma-secreted proteins) were predominant in the latter. In N. americanus, essential molecules were predicted using a combination of orthology mapping and functional data available for C. elegans. Further analyses allowed the prioritization of 18 predicted drug targets which did not have homologues in the human host. These candidate targets were inferred to be linked to mitochondrial (e.g., processing proteins) or amino acid metabolism (e.g., asparagine t-RNA synthetase). This study has provided detailed insights into the transcriptome of the adult stage of N. americanus and examines similarities and differences between this species and A. caninum. Future efforts should focus on comparative transcriptomic and proteomic investigations of the other predominant human

  5. Combined DECS Analysis and Next-Generation Sequencing Enable Efficient Detection of Novel Plant RNA Viruses

    Directory of Open Access Journals (Sweden)

    Hironobu Yanagisawa

    2016-03-01

    The presence of high molecular weight double-stranded RNA (dsRNA) within plant cells is an indicator of infection with RNA viruses as these possess genomic or replicative dsRNA. DECS (dsRNA isolation, exhaustive amplification, cloning, and sequencing) analysis has been shown to be capable of detecting unknown viruses. We postulated that a combination of DECS analysis and next-generation sequencing (NGS) would improve detection efficiency and usability of the technique. Here, we describe a model case in which we efficiently detected the presumed genome sequence of Blueberry shoestring virus (BSSV), a member of the genus Sobemovirus, which has not so far been reported. dsRNAs were isolated from BSSV-infected blueberry plants using the dsRNA-binding protein, reverse-transcribed, amplified, and sequenced using NGS. A contig of 4,020 nucleotides (nt) that shared similarities with sequences from other Sobemovirus species was obtained as a candidate of the BSSV genomic sequence. Reverse transcription (RT)-PCR primer sets based on sequences from this contig enabled the detection of BSSV in all BSSV-infected plants tested but not in healthy controls. A recombinant protein encoded by the putative coat protein gene was bound by the BSSV-antibody, indicating that the candidate sequence was that of BSSV itself. Our results suggest that a combination of DECS analysis and NGS, designated here as “DECS-C,” is a powerful method for detecting novel plant viruses.

  6. Data Analysis of Sequences and qPCR for Microbial Communities during Algal Blooms

    Science.gov (United States)

    A training opportunity is open to a highly microbial-research-motivated student to conduct sequence analysis, explore novel genes and metabolic pathways, validate resultant findings using qPCR/RT-qPCR and summarize the findings

  7. Sequence analysis of the N-acetyltransferase 2 gene (NAT2) among ...

    African Journals Online (AJOL)

    Yazun Bashir Jarrar

    2017-11-26

    Nov 26, 2017 ... Sequence analysis of the N-acetyltransferase 2 gene (NAT2) among Jordanian volunteers, Libyan. Journal of Medicine .... For molecular modeling of NAT2 protein, visualized ..... cal clustering. .... cular dynamics simulation.

  8. Analysis of common SHOX gene sequence variants and ∼4.9-kb ...

    Indian Academy of Sciences (India)

    [Solc R., Hirschfeldova K., Kebrdlova V. and Baxova A. 2014 Analysis of common SHOX gene sequence variants ... based on a Gibbs sampling strategy were done using .... SHOX (short stature homeobox) are an important cause of growth.

  9. Comparative sequence analysis of Sordaria macrospora and Neurospora crassa as a means to improve genome annotation.

    Science.gov (United States)

    Nowrousian, Minou; Würtz, Christian; Pöggeler, Stefanie; Kück, Ulrich

    2004-03-01

    One of the most challenging parts of large scale sequencing projects is the identification of functional elements encoded in a genome. Recently, studies of genomes of up to six different Saccharomyces species have demonstrated that a comparative analysis of genome sequences from closely related species is a powerful approach to identify open reading frames and other functional regions within genomes [Science 301 (2003) 71, Nature 423 (2003) 241]. Here, we present a comparison of selected sequences from Sordaria macrospora to their corresponding Neurospora crassa orthologous regions. Our analysis indicates that due to the high degree of sequence similarity and conservation of overall genomic organization, S. macrospora sequence information can be used to simplify the annotation of the N. crassa genome.

  10. Probabilistic topic modeling for the analysis and classification of genomic sequences

    Science.gov (United States)

    2015-01-01

    Background Studies on genomic sequences for classification and taxonomic identification have a leading role in the biomedical field and in the analysis of biodiversity. These studies focus on the so-called barcode genes, which represent a well-defined region of the whole genome. Recently, alignment-free techniques have been gaining importance because they are able to overcome the drawbacks of sequence alignment techniques. In this paper a new alignment-free method for DNA sequence clustering and classification is proposed. The method is based on k-mer representation and text mining techniques. Methods The presented method is based on Probabilistic Topic Modeling, a statistical technique originally proposed for text documents. Probabilistic topic models are able to find in a document corpus the topics (recurrent themes) characterizing classes of documents. This technique, applied to DNA sequences representing the documents, exploits the frequency of fixed-length k-mers and builds a generative model for a training group of sequences. This generative model, obtained through the Latent Dirichlet Allocation (LDA) algorithm, is then used to classify a large set of genomic sequences. Results and conclusions We performed classification of over 7000 16S DNA barcode sequences taken from the Ribosomal Database Project (RDP) repository, training probabilistic topic models. The proposed method is compared to the RDP tool and the Support Vector Machine (SVM) classification algorithm in an extensive set of trials using both complete sequences and short sequence snippets (from 400 bp to 25 bp). Our method reaches very similar results to the RDP classifier and SVM for complete sequences. The most interesting results are obtained when short sequence snippets are considered. In these conditions the proposed method outperforms RDP and SVM with ultra-short sequences, and it exhibits a smooth decrease of performance, at every taxonomic level, when the sequence length is decreased.
PMID:25916734
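    The first step described in this abstract, turning each DNA sequence into a "document" of fixed-length k-mers, can be sketched as follows. This covers only the bag-of-k-mers representation that a topic model such as LDA would consume; the model fitting itself is omitted. The function names, the choice k = 3, the rule of skipping k-mers with ambiguous bases, and the toy sequences are all assumptions made for illustration, not details taken from the paper.

```python
from collections import Counter


def kmer_document(seq: str, k: int) -> list[str]:
    """Split a DNA sequence into overlapping k-mers, skipping ambiguous bases."""
    seq = seq.upper()
    valid = set("ACGT")
    return [seq[i:i + k] for i in range(len(seq) - k + 1)
            if set(seq[i:i + k]) <= valid]


def kmer_counts(seqs: dict[str, str], k: int) -> dict[str, Counter]:
    """Map each sequence name to its k-mer frequency table (bag of words)."""
    return {name: Counter(kmer_document(s, k)) for name, s in seqs.items()}


# Made-up toy sequences; real input would be 16S barcode sequences.
toy = {"seq1": "ACGTACGTAC", "seq2": "ACGTNCGTAA"}
counts = kmer_counts(toy, k=3)
for name, table in counts.items():
    print(name, dict(table))
```

    Each frequency table plays the role of a word-count vector for one document, which is the input format topic models expect. Because no alignment is needed to build it, the same representation applies to short snippets as well as complete sequences.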

  11. Total RNA Sequencing Analysis of DCIS Progressing to Invasive Breast Cancer

    Science.gov (United States)

    2017-09-01

    AWARD NUMBER: W81XWH-14-1-0080 TITLE: Total RNA Sequencing Analysis of DCIS Progressing to Invasive Breast Cancer. PRINCIPAL INVESTIGATOR...TITLE AND SUBTITLE Total RNA Sequencing Analysis of DCIS Progressing to Invasive Breast Cancer. 5a. CONTRACT NUMBER 5b. GRANT NUMBER GRANT11489...institutional, NIH-funded study of genetic and epigenetic alterations of pre-invasive DCIS that did or did not progress to invasive breast cancer, with an

  12. Seismically induced accident sequence analysis of the advanced test reactor

    International Nuclear Information System (INIS)

    Khericha, S.T.; Henry, D.M.; Ravindra, M.K.; Hashimoto, P.S.; Griffin, M.J.; Tong, W.H.; Nafday, A.M.

    1991-01-01

    A seismic probabilistic risk assessment (PRA) was performed for the Department of Energy (DOE) Advanced Test Reactor (ATR) as part of the external events analysis. The risk from seismic events to the fuel in the core and in the fuel storage canal was evaluated. The key elements of this paper are the integration of seismically induced internal flood and internal fire, and the modeling of human error rates as a function of the magnitude of earthquake. The systems analysis was performed by EG&G Idaho, Inc. and the fragility analysis and quantification were performed by EQE International, Inc. (EQE).

  13. Recent advances in nanopore-based nucleic acid analysis and sequencing

    International Nuclear Information System (INIS)

    Shi, Jidong; Fang, Ying; Hou, Junfeng

    2016-01-01

    Nanopore-based sequencing platforms are transforming the field of genomic science. This review (containing 116 references) highlights some recent progress on nanopore-based nucleic acid analysis and sequencing. These studies are classified into three categories, biological, solid-state, and hybrid nanopores, according to their nanoporous materials. We begin with a brief description of the translocation-based detection mechanism of nanopores. Next, specific examples are given in nanopore-based nucleic acid analysis and sequencing, with an emphasis on identifying strategies that can improve the resolution of nanopores. This review concludes with a discussion of future research directions that will advance the practical applications of nanopore technology. (author)

  14. Microscopic Analysis and Modeling of Airport Surface Sequencing, Phase I

    Data.gov (United States)

    National Aeronautics and Space Administration — The complexity and interdependence of operations on the airport surface motivate the need for a comprehensive and detailed, yet flexible and validated analysis and...

  15. BioMatriX: Sequence analysis, structure visualization, phylogenetics ...

    African Journals Online (AJOL)

    bmx-biomatrix.blogspot.com) developed for biological science community to augment scientific research regarding genomics, proteomics, phylogenetics and linkage analysis in one platform. BioMatriX offers multi-functional services to perform ...

  16. Survey sequencing and comparative analysis of the elephant shark (Callorhinchus milii) genome.

    Directory of Open Access Journals (Sweden)

    Byrappa Venkatesh

    2007-04-01

    Owing to their phylogenetic position, cartilaginous fishes (sharks, rays, skates, and chimaeras) provide a critical reference for our understanding of vertebrate genome evolution. The relatively small genome of the elephant shark, Callorhinchus milii, a chimaera, makes it an attractive model cartilaginous fish genome for whole-genome sequencing and comparative analysis. Here, the authors describe survey sequencing (1.4x coverage) and comparative analysis of the elephant shark genome, one of the first cartilaginous fish genomes to be sequenced to this depth. Repetitive sequences, represented mainly by a novel family of short interspersed element-like and long interspersed element-like sequences, account for about 28% of the elephant shark genome. Fragments of approximately 15,000 elephant shark genes reveal specific examples of genes that have been lost differentially during the evolution of tetrapod and teleost fish lineages. Interestingly, the degree of conserved synteny and conserved sequences between the human and elephant shark genomes is higher than that between human and teleost fish genomes. The elephant shark contains four putative Hox clusters, indicating that, unlike teleost fish genomes, the elephant shark genome has not experienced an additional whole-genome duplication. These findings underscore the importance of the elephant shark as a critical reference vertebrate genome for comparative analysis of the human and other vertebrate genomes. This study also demonstrates that a survey-sequencing approach can be applied productively for comparative analysis of distantly related vertebrate genomes.

  17. Sequence analysis of L RNA of Lassa virus

    International Nuclear Information System (INIS)

    Vieth, Simon; Torda, Andrew E.; Asper, Marcel; Schmitz, Herbert; Guenther, Stephan

    2004-01-01

    The L RNA of three Lassa virus strains originating from Nigeria, Ghana/Ivory Coast, and Sierra Leone was sequenced and the data subjected to structure predictions and phylogenetic analyses. The L gene products had 2218-2221 residues, diverged by 18% at the amino acid level, and contained several conserved regions. Only one region of 504 residues (positions 1043-1546) could be assigned a function, namely that of an RNA polymerase. Secondary structure predictions suggest that this domain is very similar to RNA-dependent RNA polymerases of known structure encoded by plus-strand RNA viruses, permitting a model to be built. Outside the polymerase region, there is little structural data, except for regions of strong alpha-helical content and probably a coiled-coil domain at the N terminus. No evidence for reassortment or recombination during Lassa virus evolution was found. The secondary structure-assisted alignment of the RNA polymerase region permitted a reliable reconstruction of the phylogeny of all negative-strand RNA viruses, indicating that Arenaviridae are most closely related to Nairoviruses. In conclusion, the data provide a basis for structural and functional characterization of the Lassa virus L protein and reveal new insights into the phylogeny of negative-strand RNA viruses

  18. Insertion sequence ISRP10 inactivation of the oprD gene in imipenem-resistant Pseudomonas aeruginosa clinical isolates.

    Science.gov (United States)

    Sun, Qinghui; Ba, Zhaofen; Wu, Guoying; Wang, Wei; Lin, Shuxiang; Yang, Hongjiang

    2016-05-01

    Carbapenem resistance mechanisms were investigated in 32 imipenem-resistant Pseudomonas aeruginosa clinical isolates recovered from hospitalised children. Sequence analysis revealed that 31 of the isolates had an insertion sequence element ISRP10 disrupting the porin gene oprD, demonstrating that ISRP10 inactivation of oprD conferred imipenem resistance in the majority of the isolates. Multilocus sequence typing (MLST) was used to discriminate the isolates. In total, 11 sequence types (STs) were identified including 3 novel STs, and 68.3% (28/41) of the tested strains were characterised as clone ST253. In combination with random amplified polymorphic DNA (RAPD) analysis, the imipenem-resistant isolates displayed a relatively high degree of genetic variability and were unlikely associated with nosocomial infections. Copyright © 2016 Elsevier B.V. and the International Society of Chemotherapy. All rights reserved.

  19. Reproducible analysis of sequencing-based RNA structure probing data with user-friendly tools

    DEFF Research Database (Denmark)

    Kielpinski, Lukasz Jan; Sidiropoulos, Nikos; Vinther, Jeppe

    2015-01-01

    time also made analysis of the data challenging for scientists without formal training in computational biology. Here, we discuss different strategies for data analysis of massive parallel sequencing-based structure-probing data. To facilitate reproducible and standardized analysis of this type of data...

  20. Stratigraphical analysis of the neoproterozoic sedimentary sequences of the Sao Francisco Basin

    International Nuclear Information System (INIS)

    Martins, Mariela; Lemos, Valesca Brasil

    2007-01-01

    A stratigraphic analysis was performed under the principles of Sequence Stratigraphy on the Neoproterozoic sedimentary sequences of the Sao Francisco Basin (Central Brazil). Three periods of deposition separated by unconformities were recognized in the Sao Francisco Megasequence: (1) Sequences 1 and 2, a Cryogenian glaciogenic sequence followed by a distal scarp carbonate ramp, developed during stable conditions; (2) Sequence 3, an Upper Cryogenian stack of homoclinal ramps with mixed carbonate-siliciclastic sedimentation, deposited under a progressive influence of compressional stresses of the Brasiliano Cycle; (3) Sequence 4, a Lower Ediacaran shallow platform dominated by siliciclastic sedimentation of molassic nature, the erosion product of the nearby uplifted thrust sheets. Each of the carbonate-bearing sequences presents a distinct δ13C isotopic signature. Superposition on the global curve for carbon isotopic variation allowed the recognition of a major depositional hiatus between the Paranoa and Sao Francisco Megasequences, and suggested that the glacial diamictite deposition (Jequitai Formation) most probably took place around 800 Ma. This constrains the Sao Francisco Megasequence deposition to the interval between 800 and 600 Ma (the known ages of the Brasiliano Orogeny define the upper limit). A minor depositional hiatus (700-680 Ma) was also identified separating Sequences 2 and 3. Isotopic analyses suggest that from then on, more restricted environmental conditions were established in the basin, probably associated with a first-order global event, which prevailed throughout deposition of Sequence 3. (author)

  1. Isolation and sequence analysis of a cDNA clone encoding the fifth complement component

    DEFF Research Database (Denmark)

    Lundwall, Åke B; Wetsel, Rick A; Kristensen, Torsten

    1985-01-01

    DNA clone of 1.85 kilobase pairs was isolated. Hybridization of the mixed-sequence probe to the complementary strand of the plasmid insert and sequence analysis by the dideoxy method predicted the expected protein sequence of C5a (positions 1-12), amino-terminal to the anticipated priming site. The sequence......, subcloned into M13 mp8, and sequenced at random by the dideoxy technique, thereby generating a contiguous sequence of 1703 base pairs. This clone contained coding sequence for the C-terminal 262 amino acid residues of the beta-chain, the entire C5a fragment, and the N-terminal 98 residues of the alpha......'-chain. The 3' end of the clone had a polyadenylated tail preceded by a polyadenylation recognition site, a 3'-untranslated region, and base pairs homologous to the human Alu consensus sequence. Comparison of the derived partial human C5 protein sequence with that previously determined for murine C3 and human...

  2. Oasis: online analysis of small RNA deep sequencing data.

    Science.gov (United States)

    Capece, Vincenzo; Garcia Vizcaino, Julio C; Vidal, Ramon; Rahman, Raza-Ur; Pena Centeno, Tonatiuh; Shomroni, Orr; Suberviola, Irantzu; Fischer, Andre; Bonn, Stefan

    2015-07-01

    Oasis is a web application that allows for the fast and flexible online analysis of small-RNA-seq (sRNA-seq) data. It was designed for the end user in the lab, providing an easy-to-use web frontend including video tutorials, demo data and best practice step-by-step guidelines on how to analyze sRNA-seq data. Oasis' exclusive selling points are a differential expression module that allows for the multivariate analysis of samples, a classification module for robust biomarker detection and an advanced programming interface that supports the batch submission of jobs. Both modules include the analysis of novel miRNAs, miRNA targets and functional analyses including GO and pathway enrichment. Oasis generates downloadable interactive web reports for easy visualization, exploration and analysis of data on a local system. Finally, Oasis' modular workflow enables the rapid (re-)analysis of data. Oasis is implemented in Python, R, Java, PHP, C++ and JavaScript. It is freely available at http://oasis.dzne.de. Contact: stefan.bonn@dzne.de. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  3. Establishment of screening technique for mutant cell and analysis of base sequence in the mutation

    International Nuclear Information System (INIS)

    Sofuni, Toshio; Nomi, Takehiko; Yamada, Masami; Masumura, Kenichi

    2000-01-01

    This research project aimed to establish an easy and quick detection method for radiation-induced mutation using molecular-biological techniques and an effective method for analyzing the molecular changes in base sequence. This year, Spi mutants derived from γ-irradiated mice were analyzed by PCR and DNA sequencing. Male transgenic mice were exposed to γ-rays at 5, 10, and 50 Gy, and the transgene was recovered from spleen genomic DNA by the in vivo packaging method. Spi mutant plaques were obtained by infecting E. coli with the recovered phage. Sequence analysis of the mutants was performed using an ALFred DNA sequencer and the SequiTherm™ Long-Red Cycle sequencing kit. Sequence analysis was carried out for 41 of the 50 independent Spi mutants obtained. The deletions were classified into 4 groups: Group 1 included 15 mutants characterized by a large deletion (43 bp-10 kb) with a short homologous sequence. Group 2 included 11 mutants with a large deletion having no homologous sequence at the connecting region. Group 3 included 11 mutants having a short deletion of less than 20 bp, which occurred in the non-repetitive sequence of the gam gene and was possibly caused by oxidative breakage of DNA or recombination of DNA fragments produced by the breakage. Group 4 included 4 mutants having deletions of 20 bp or less in the repetitive sequence of the gam gene, resulting in an alteration of the reading frame. Thus, the synthesis of Gam protein was terminated by the appearance of TGA between codons 13 and 14 of the redB gene, leading to inactivation of the gam gene and the redBA gene. These results indicated that most Spi mutants had a deletion in the red/gam region and that the deletions in more than half of the mutants occurred in homologous sequences as short as 8 bp. (M.N.)

  4. The BsaHI restriction-modification system: Cloning, sequencing and analysis of conserved motifs

    Directory of Open Access Journals (Sweden)

    Roberts Richard J

    2008-05-01

    Full Text Available Abstract Background: Restriction and modification enzymes typically recognise short DNA sequences of between two and eight bases in length. Understanding the mechanism of this recognition represents a significant challenge that we begin to address for the BsaHI restriction-modification system, which recognises the six base sequence GRCGYC. Results: The DNA sequences of the genes for the BsaHI methyltransferase, bsaHIM, and restriction endonuclease, bsaHIR, have been determined (GenBank accession #EU386360), cloned and expressed in E. coli. Both the restriction endonuclease and methyltransferase enzymes share significant similarity with a group of 6 other enzymes comprising the restriction-modification systems HgiDI and HgiGI and the putative HindVP, NlaCORFDP, NpuORFC228P and SplZORFNP restriction-modification systems. A sequence alignment of these homologues shows that their amino acid sequences are largely conserved and highlights several motifs of interest. We target one such conserved motif, reading SPERRFD, at the C-terminal end of the bsaHIR gene. A mutational analysis of these amino acids indicates that the motif is crucial for enzymatic activity. Sequence alignment of the methyltransferase gene reveals a short motif within the target recognition domain that is conserved among enzymes recognising the same sequences. Thus, this motif may be used as a diagnostic tool to define the recognition sequences of the cytosine C5 methyltransferases. Conclusion: We have cloned and sequenced the BsaHI restriction and modification enzymes. We have identified a region of the R.BsaHI enzyme that is crucial for its activity. Analysis of the amino acid sequence of the BsaHI methyltransferase enzyme led us to propose two new motifs that can be used in the diagnosis of the recognition sequence of the cytosine C5-methyltransferases.
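
The degenerate recognition sequence GRCGYC mentioned in this abstract uses IUPAC nucleotide codes, in which R stands for A or G and Y stands for C or T. As a minimal sketch (not the authors' method), the following Python snippet shows how such a site could be located in a DNA string. The function name `find_sites` and the example sequence are made up for illustration.

```python
import re

# IUPAC codes needed for this site: R = A or G (purine), Y = C or T (pyrimidine).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "[AG]", "Y": "[CT]"}


def find_sites(dna, site="GRCGYC"):
    """Return 0-based start positions of every match of a degenerate site.

    A lookahead is used so that overlapping matches are also reported.
    """
    body = "".join(IUPAC[base] for base in site)
    pattern = re.compile("(?=" + body + ")")
    return [m.start() for m in pattern.finditer(dna.upper())]


# Hypothetical sequence containing GACGTC (position 2) and GGCGCC (position 10).
example = "TTGACGTCAAGGCGCCTT"
print(find_sites(example))
```

Only the six symbols in the `IUPAC` dictionary are handled; a fuller table would be needed for sites that use other ambiguity codes such as N or W.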

  5. A base composition analysis of natural patterns for the preprocessing of metagenome sequences.

    Science.gov (United States)

    Bonham-Carter, Oliver; Ali, Hesham; Bastola, Dhundy

    2013-01-01

    On the pretext that sequence reads and contigs often exhibit the same kinds of base usage that is also observed in the sequences from which they are derived, we offer a base composition analysis tool. Our tool uses these natural patterns to determine relatedness across sequence data. We introduce spectrum sets (sets of motifs) which are permutations of bacterial restriction sites and the base composition analysis framework to measure their proportional content in sequence data. We suggest that this framework will increase the efficiency during the pre-processing stages of metagenome sequencing and assembly projects. Our method is able to differentiate organisms and their reads or contigs. The framework shows how to successfully determine the relatedness between these reads or contigs by comparison of base composition. In particular, we show that two types of organismal-sequence data are fundamentally different by analyzing their spectrum set motif proportions (coverage). By the application of one of the four possible spectrum sets, encompassing all known restriction sites, we provide the evidence to claim that each set has a different ability to differentiate sequence data. Furthermore, we show that the spectrum set selection having relevance to one organism, but not to the others of the data set, will greatly improve performance of sequence differentiation even if the fragment size of the read, contig or sequence is not lengthy. We show the proof of concept of our method by its application to ten trials of two or three freshly selected sequence fragments (reads and contigs) for each experiment across the six organisms of our set. Here we describe a novel and computationally effective pre-processing step for metagenome sequencing and assembly tasks. Furthermore, our base composition method has applications in phylogeny where it can be used to infer evolutionary distances between organisms based on the notion that related organisms often have much conserved code.
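
The idea of measuring the proportional content (coverage) of a set of motifs in sequence data can be sketched in a few lines of Python. This is an illustrative assumption about what such a measure might look like, not the tool described in the abstract: coverage is defined here as the fraction of positions lying inside at least one motif occurrence, and the motif set and reads are hypothetical.

```python
def motif_coverage(seq, motifs):
    """Fraction of positions in seq covered by at least one motif occurrence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    covered = [False] * len(seq)
    for motif in motifs:
        start = seq.find(motif)
        while start != -1:
            for i in range(start, start + len(motif)):
                covered[i] = True
            # Restart one position later so overlapping hits are counted too.
            start = seq.find(motif, start + 1)
    return sum(covered) / len(seq)


# Illustrative motif set: the EcoRI (GAATTC) and BamHI (GGATCC) sites.
spectrum_set = ["GAATTC", "GGATCC"]
reads = {
    "read_a": "AAGAATTCAAGGATCCAA",  # hypothetical read with both sites
    "read_b": "ACGTACGTACGTACGTAC",  # hypothetical read with neither site
}
for name, read in reads.items():
    print(name, round(motif_coverage(read, spectrum_set), 3))
```

Comparing such coverage values across reads or contigs gives a crude relatedness signal in the spirit of the abstract; the published framework may define and normalize coverage differently.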

  6. Expressed sequence tags as a tool for phylogenetic analysis of placental mammal evolution.

    Directory of Open Access Journals (Sweden)

    Morgan Kullberg

    Full Text Available BACKGROUND: We investigate the usefulness of expressed sequence tags, ESTs, for establishing divergences within the tree of placental mammals. This is done on the example of the established relationships among primates (human), lagomorphs (rabbit), rodents (rat and mouse), artiodactyls (cow), carnivorans (dog) and proboscideans (elephant). METHODOLOGY/PRINCIPAL FINDINGS: We have produced 2000 ESTs (1.2 megabases) from a marsupial mouse and characterized the data for their use in phylogenetic analysis. The sequences were used to identify putative orthologous sequences from whole genome projects. Although most ESTs stem from single sequence reads, the frequency of potential sequencing errors was found to be lower than allelic variation. Most of the sequences represented slowly evolving housekeeping-type genes, with an average amino acid distance of 6.6% between human and mouse. Positive Darwinian selection was identified at only a few single sites. Phylogenetic analyses of the EST data yielded trees that were consistent with those established from whole genome projects. CONCLUSIONS: The general quality of EST sequences and the general absence of positive selection in these sequences make ESTs an attractive tool for phylogenetic analysis. The EST approach allows, at reasonable costs, a fast extension of data sampling from species outside the genome projects.

  7. Cloning and sequence analysis of hyaluronoglucosaminidase (nagH) gene of Clostridium chauvoei

    Directory of Open Access Journals (Sweden)

    Saroj K. Dangi

    2017-09-01

    Full Text Available Aim: Blackleg disease is caused by Clostridium chauvoei in ruminants. Although virulence factors such as C. chauvoei toxin A, sialidase, and flagellin are well characterized, hyaluronidases of C. chauvoei are not characterized. The present study was aimed at cloning and sequence analysis of the hyaluronoglucosaminidase (nagH) gene of C. chauvoei. Materials and Methods: C. chauvoei strain ATCC 10092 was grown in ATCC 2107 media and confirmed by polymerase chain reaction (PCR) using the primers specific for the 16-23S rDNA spacer region. The nagH gene of C. chauvoei was amplified and cloned into pRham-SUMO vector and transformed into Escherichia cloni 10G cells. The construct was then transformed into E. cloni cells. Colony PCR was carried out to screen the colonies, followed by sequencing of the nagH gene in the construct. Results: PCR amplification yielded a nagH gene product of 1143 bp, which was cloned in a prokaryotic expression system. Colony PCR, as well as sequencing of the nagH gene, confirmed the presence of the insert. The sequence was then subjected to BLAST analysis at NCBI, which confirmed that the sequence was indeed that of the nagH gene of C. chauvoei. Phylogenetic analysis of the sequence showed that it is closely related to Clostridium perfringens and Clostridium paraputrificum. Conclusion: The gene for virulence factor nagH was cloned into a prokaryotic expression vector and confirmed by sequencing.

  8. Is the extremely rare Iberian endemic plant species Castrilanthemum debeauxii (Compositae, Anthemideae) a 'living fossil'? Evidence from a multi-locus species tree reconstruction.

    Science.gov (United States)

    Tomasello, Salvatore; Álvarez, Inés; Vargas, Pablo; Oberprieler, Christoph

    2015-01-01

    The present study provides results of multi-species coalescent species tree analyses of DNA sequences sampled from multiple nuclear and plastid regions to infer the phylogenetic relationships among the members of the subtribe Leucanthemopsidinae (Compositae, Anthemideae), to which besides the annual Castrilanthemum debeauxii (Degen, Hervier & É.Rev.) Vogt & Oberp., one of the rarest flowering plant species of the Iberian Peninsula, two other unispecific genera (Hymenostemma, Prolongoa), and the polyploid complex of the genus Leucanthemopsis belong. Based on sequence information from two single- to low-copy nuclear regions (C16, D35, characterised by Chapman et al. (2007)), the multi-copy region of the nrDNA internal transcribed spacer regions ITS1 and ITS2, and two intergenic spacer regions of the cpDNA, gene trees were reconstructed using Bayesian inference methods. For the reconstruction of a multi-locus species tree we applied three different methods: (a) analysis of concatenated sequences using Bayesian inference (MrBayes), (b) a tree reconciliation approach by minimizing the number of deep coalescences (PhyloNet), and (c) a coalescent-based species-tree method in a Bayesian framework (*BEAST). All three species tree reconstruction methods unequivocally support the close relationship of the subtribe with the hitherto unclassified genus Phalacrocarpum, the sister-group relationship of Castrilanthemum with the three remaining genera of the subtribe, and the further sister-group relationship of the clade of Hymenostemma+Prolongoa with a monophyletic genus Leucanthemopsis. Dating of the *BEAST phylogeny supports the long-lasting (Early Miocene, 15-22 Ma) taxonomical independence and the switch from the plesiomorphic perennial to the apomorphic annual life-form assumed for the Castrilanthemum lineage that may have occurred not earlier than in the Pliocene (3 Ma) when the establishment of a Mediterranean climate with summer droughts triggered evolution towards

  9. Global diversity of the Ganoderma lucidum complex (Ganodermataceae, Polyporales) inferred from morphology and multilocus phylogeny.

    Science.gov (United States)

    Zhou, Li-Wei; Cao, Yun; Wu, Sheng-Hua; Vlasák, Josef; Li, De-Wei; Li, Meng-Jie; Dai, Yu-Cheng

    2015-06-01

    Species of the Ganoderma lucidum complex are used in many types of health products. However, the taxonomy of this complex has long been chaotic, thus limiting its uses. In the present study, 32 collections of the complex from Asia, Europe and North America were analyzed from both morphological and molecular phylogenetic perspectives. The combined dataset, including an outgroup, comprised 33 ITS, 24 tef1α, 24 rpb1 and 21 rpb2 sequences, of which 19 ITS, 20 tef1α, 20 rpb1 and 17 rpb2 sequences were newly generated. A total of 13 species of the complex were recovered in the multilocus phylogeny. These 13 species were not strongly supported as a single monophyletic lineage, and were further grouped into three lineages that cannot be defined by their geographic distributions. Clade A comprised Ganoderma curtisii, Ganoderma flexipes, Ganoderma lingzhi, Ganoderma multipileum, Ganoderma resinaceum, Ganoderma sessile, Ganoderma sichuanense and Ganoderma tropicum; Clade B comprised G. lucidum, Ganoderma oregonense and Ganoderma tsugae; and Clade C comprised Ganoderma boninense and Ganoderma zonatum. A dichotomous key to the 13 species is provided, and their key morphological characters from context, pores, cuticle cells and basidiospores are presented in a table. The taxonomic positions of these species are briefly discussed. Notably, the epitypification of G. sichuanense is rejected. Copyright © 2014 Elsevier Ltd. All rights reserved.

  10. Analysis of Multiple Genomic Sequence Alignments: A Web Resource, Online Tools, and Lessons Learned From Analysis of Mammalian SCL Loci

    Science.gov (United States)

    Chapman, Michael A.; Donaldson, Ian J.; Gilbert, James; Grafham, Darren; Rogers, Jane; Green, Anthony R.; Göttgens, Berthold

    2004-01-01

    Comparative analysis of genomic sequences is becoming a standard technique for studying gene regulation. However, only a limited number of tools are currently available for the analysis of multiple genomic sequences. An extensive data set for the testing and training of such tools is provided by the SCL gene locus. Here we have expanded the data set to eight vertebrate species by sequencing the dog SCL locus and by annotating the dog and rat SCL loci. To provide a resource for the bioinformatics community, all SCL sequences and functional annotations, comprising a collation of the extensive experimental evidence pertaining to SCL regulation, have been made available via a Web server. A Web interface to new tools specifically designed for the display and analysis of multiple sequence alignments was also implemented. The unique SCL data set and new sequence comparison tools allowed us to perform a rigorous examination of the true benefits of multiple sequence comparisons. We demonstrate that multiple sequence alignments are, overall, superior to pairwise alignments for identification of mammalian regulatory regions. In the search for individual transcription factor binding sites, multiple alignments markedly increase the signal-to-noise ratio compared to pairwise alignments. PMID:14718377

  11. WebMGA: a customizable web server for fast metagenomic sequence analysis.

    Science.gov (United States)

    Wu, Sitao; Zhu, Zhengwei; Fu, Liming; Niu, Beifang; Li, Weizhong

    2011-09-07

    The new field of metagenomics studies microorganism communities by culture-independent sequencing. With the advances in next-generation sequencing techniques, researchers are facing tremendous challenges in metagenomic data analysis due to huge quantity and high complexity of sequence data. Analyzing large datasets is extremely time-consuming; also metagenomic annotation involves a wide range of computational tools, which are difficult to be installed and maintained by common users. The tools provided by the few available web servers are also limited and have various constraints such as login requirement, long waiting time, inability to configure pipelines etc. We developed WebMGA, a customizable web server for fast metagenomic analysis. WebMGA includes over 20 commonly used tools such as ORF calling, sequence clustering, quality control of raw reads, removal of sequencing artifacts and contaminations, taxonomic analysis, functional annotation etc. WebMGA provides users with rapid metagenomic data analysis using fast and effective tools, which have been implemented to run in parallel on our local computer cluster. Users can access WebMGA through web browsers or programming scripts to perform individual analysis or to configure and run customized pipelines. WebMGA is freely available at http://weizhongli-lab.org/metagenomic-analysis. WebMGA offers to researchers many fast and unique tools and great flexibility for complex metagenomic data analysis.

  12. WebMGA: a customizable web server for fast metagenomic sequence analysis

    Directory of Open Access Journals (Sweden)

    Niu Beifang

    2011-09-01

    Full Text Available Abstract Background: The new field of metagenomics studies microorganism communities by culture-independent sequencing. With the advances in next-generation sequencing techniques, researchers are facing tremendous challenges in metagenomic data analysis due to huge quantity and high complexity of sequence data. Analyzing large datasets is extremely time-consuming; also metagenomic annotation involves a wide range of computational tools, which are difficult to be installed and maintained by common users. The tools provided by the few available web servers are also limited and have various constraints such as login requirement, long waiting time, inability to configure pipelines etc. Results: We developed WebMGA, a customizable web server for fast metagenomic analysis. WebMGA includes over 20 commonly used tools such as ORF calling, sequence clustering, quality control of raw reads, removal of sequencing artifacts and contaminations, taxonomic analysis, functional annotation etc. WebMGA provides users with rapid metagenomic data analysis using fast and effective tools, which have been implemented to run in parallel on our local computer cluster. Users can access WebMGA through web browsers or programming scripts to perform individual analysis or to configure and run customized pipelines. WebMGA is freely available at http://weizhongli-lab.org/metagenomic-analysis. Conclusions: WebMGA offers to researchers many fast and unique tools and great flexibility for complex metagenomic data analysis.

  13. The scale analysis sequence for LWR fuel depletion

    International Nuclear Information System (INIS)

    Hermann, O.W.; Parks, C.V.

    1991-01-01

    The SCALE (Standardized Computer Analyses for Licensing Evaluation) code system is used extensively to perform away-from-reactor safety analysis (particularly criticality safety, shielding, heat transfer analyses) for spent light water reactor (LWR) fuel. Spent fuel characteristics such as radiation sources, heat generation sources, and isotopic concentrations can be computed within SCALE using the SAS2 control module. A significantly enhanced version of the SAS2 control module, which is denoted as SAS2H, has been made available with the release of SCALE-4. For each time-dependent fuel composition, SAS2H performs one-dimensional (1-D) neutron transport analyses (via XSDRNPM-S) of the reactor fuel assembly using a two-part procedure with two separate unit-cell-lattice models. The cross sections derived from a transport analysis at each time step are used in a point-depletion computation (via ORIGEN-S) that produces the burnup-dependent fuel composition to be used in the next spectral calculation. A final ORIGEN-S case is used to perform the complete depletion/decay analysis using the burnup-dependent cross sections. The techniques used by SAS2H and two recent applications of the code are reviewed in this paper. 17 refs., 5 figs., 5 tabs

  14. Sequence determination and analysis of the NSs genes of two tospoviruses.

    Science.gov (United States)

    Hallwass, Mariana; Leastro, Mikhail O; Lima, Mirtes F; Inoue-Nagata, Alice K; Resende, Renato O

    2012-03-01

    The tospoviruses groundnut ringspot virus (GRSV) and zucchini lethal chlorosis virus (ZLCV) cause severe losses in many crops, especially in solanaceous and cucurbit species. In this study, the non-structural NSs gene and the 5'UTRs of these two biologically distinct tospoviruses were cloned and sequenced. The NSs sequences of GRSV and ZLCV were both 1,404 nucleotides long. Pairwise comparison showed that the NSs amino acid sequence of GRSV shared 69.6% identity with that of ZLCV and 75.9% identity with that of TSWV, while the NSs sequences of ZLCV and TSWV shared 67.9% identity. Phylogenetic analysis based on NSs sequences confirmed that these viruses cluster in the American clade.
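
Pairwise identity figures like those quoted in this abstract are typically computed from an alignment of the two sequences. The snippet below is a minimal sketch of one common convention, assuming pre-aligned strings of equal length and excluding gapped columns from the denominator. The abstract does not state which convention the authors used, and the example peptides are invented.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over alignment columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip gapped columns (an assumption; conventions vary)
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Hypothetical aligned peptide fragments, not real NSs sequences.
print(round(percent_identity("MKT-LLV", "MRTALLI"), 1))
```

Other conventions divide by the full alignment length or by the length of the shorter sequence, so values from different tools are not always directly comparable.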

  15. Sequencing and phylogenetic analysis of tobacco virus 2, a polerovirus from Nicotiana tabacum.

    Science.gov (United States)

    Zhou, Benguo; Wang, Fang; Zhang, Xuesong; Zhang, Lina; Lin, Huafeng

    2017-07-01

    The complete genome sequence of a new virus, provisionally named tobacco virus 2 (TV2), was determined and identified from leaves of tobacco (Nicotiana tabacum) exhibiting leaf mosaic, yellowing, and deformity, in Anhui Province, China. The genome sequence of TV2 comprises 5,979 nucleotides, with 87% nucleotide sequence identity to potato leafroll virus (PLRV). Its genome organization is similar to that of PLRV, containing six open reading frames (ORFs) that potentially encode proteins with putative functions in cell-to-cell movement and suppression of RNA silencing. Phylogenetic analysis of the nucleotide sequence placed TV2 alongside members of the genus Polerovirus in the family Luteoviridae. To the best of our knowledge, this study is the first report of a complete genome sequence of a new polerovirus identified in tobacco.

  16. An analysis of LOCA sequences in the development of severe accident analysis DB

    International Nuclear Information System (INIS)

    Choi, Young; Park, Soo Yong; Ahn, Kwang-Il; Kim, D.H.

    2006-01-01

    A Level 2 PSA was performed for the Korean Standard Power Plants (KSNPs), and it considered the necessary sequences for an assessment of the containment integrity and source term analysis. In terms of accident management, however, more cases causing severe core damage need to be analyzed and arranged systematically for easy access to the results. At present, KAERI is calculating the severe accident sequences intensively for various initiating events and generating a database for the accident progression, including thermal hydraulic and source term behaviours. The developed Database (DB) system includes a graphical display of plant and equipment status, previous research results by knowledge-base technique, and the expected plant behaviour. The plant model used in this paper is oriented to LOCA-related severe accident phenomena and thus can simulate the plant behaviour during a severe accident. Therefore, the developed system may play a central role as an information source for decision-making in severe accident management, and will be used as a training simulator for severe accident management. (author)

  17. Sequence analysis of serum albumins reveals the molecular evolution of ligand recognition properties.

    Science.gov (United States)

    Fanali, Gabriella; Ascenzi, Paolo; Bernardi, Giorgio; Fasano, Mauro

    2012-01-01

    Serum albumin (SA) is a circulating protein providing a depot and carrier for many endogenous and exogenous compounds. At least seven major binding sites have been identified by structural and functional investigations mainly in human SA. SA is conserved in vertebrates, with at least 49 entries in protein sequence databases. The multiple sequence analysis of this set of entries leads to the definition of a cladistic tree for the molecular evolution of SA orthologs in vertebrates, thus showing the clustering of the considered species, with lamprey SAs (Lethenteron japonicum and Petromyzon marinus) in a separate outgroup. Sequence analysis aimed at searching conserved domains revealed that most SA sequences are made up by three repeated domains (about 600 residues), as extensively characterized for human SA. On the contrary, lamprey SAs are giant proteins (about 1400 residues) comprising seven repeated domains. The phylogenetic analysis of the SA family reveals a stringent correlation with the taxonomic classification of the species available in sequence databases. A focused inspection of the sequences of ligand binding sites in SA revealed that in all sites most residues involved in ligand binding are conserved, although the versatility towards different ligands could be peculiar of higher organisms. Moreover, the analysis of molecular links between the different sites suggests that allosteric modulation mechanisms could be restricted to higher vertebrates.

  18. Generation and analysis of expressed sequence tags from the ciliate protozoan parasite Ichthyophthirius multifiliis

    Directory of Open Access Journals (Sweden)

    Arias Covadonga

    2007-06-01

    Full Text Available Abstract Background: The ciliate protozoan Ichthyophthirius multifiliis (Ich) is an important parasite of freshwater fish that causes 'white spot disease' leading to significant losses. A genomic resource for large-scale studies of this parasite has been lacking. To study gene expression involved in Ich pathogenesis and virulence, our goal was to generate expressed sequence tags (ESTs) for the development of a powerful microarray platform for the analysis of global gene expression in this species. Here, we initiated a project to sequence and analyze over 10,000 ESTs. Results: We sequenced 10,368 EST clones using a normalized cDNA library made from pooled samples of the trophont, tomont, and theront life-cycle stages, and generated 9,769 sequences (94.2% success rate). Post-sequencing processing led to 8,432 high quality sequences. Clustering analysis of these ESTs allowed identification of 4,706 unique sequences containing 976 contigs and 3,730 singletons. These unique sequences represent over two million base pairs (~10% of the genome of Plasmodium falciparum, a phylogenetically related protozoan). BLASTX searches produced 2,518 significant (E-value < 10^-5) hits and further Gene Ontology (GO) analysis annotated 1,008 of these genes. The ESTs were analyzed comparatively against the genomes of the related protozoa Tetrahymena thermophila and P. falciparum, allowing putative identification of additional genes. All the EST sequences were deposited by dbEST in GenBank (GenBank: EG957858–EG966289). Gene discovery and annotations are presented and discussed. Conclusion: This set of ESTs represents a significant proportion of the Ich transcriptome, and provides a material basis for the development of microarrays useful for gene expression studies concerning Ich development, pathogenesis, and virulence.
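
The summary statistics reported in this abstract can be cross-checked with simple arithmetic. The short Python sketch below uses only the counts stated above; the variable names and the derived "fraction with BLASTX hits" are illustrative and not taken from the paper.

```python
# Counts as stated in the abstract.
clones_sequenced = 10368
sequences_generated = 9769
high_quality = 8432
contigs = 976
singletons = 3730
blastx_hits = 2518

# Sequencing success rate: generated sequences per clone attempted.
success_rate = 100.0 * sequences_generated / clones_sequenced

# Unique sequences are the contigs plus the singletons left after clustering.
unique_sequences = contigs + singletons

# Illustrative derived value: share of unique sequences with a BLASTX hit.
hit_fraction = 100.0 * blastx_hits / unique_sequences

print(f"success rate: {success_rate:.1f}%")
print(f"unique sequences: {unique_sequences}")
print(f"unique sequences with BLASTX hits: {hit_fraction:.1f}%")
```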

  19. Genetic mutation analysis of human gastric adenocarcinomas using ion torrent sequencing platform.

    Directory of Open Access Journals (Sweden)

    Zhi Xu

    Full Text Available Gastric cancer is one of the major causes of cancer-related death, especially in Asia. Gastric adenocarcinoma, the most common type of gastric cancer, is heterogeneous and its incidence and cause vary widely with geographical region, gender, ethnicity, and diet. Since unique mutations have been observed in individual human cancer samples, identification and characterization of the molecular alterations underlying individual gastric adenocarcinomas is a critical step for developing more effective, personalized therapies. Until recently, identifying genetic mutations on an individual basis by DNA sequencing remained a daunting task. Recent advances in next-generation DNA sequencing technologies, such as the semiconductor-based Ion Torrent sequencing platform, make DNA sequencing cheaper, faster, and more reliable. In this study, we aim to identify genetic mutations in genes that are targeted by drugs in clinical use or under development in individual human gastric adenocarcinoma samples using Ion Torrent sequencing. We sequenced 737 loci from 45 cancer-related genes in 238 human gastric adenocarcinoma samples using the Ion Torrent Ampliseq Cancer Panel. The sequencing analysis revealed a high occurrence of mutations along the TP53 locus (9.7%) in our sample set. Thus, this study indicates the utility of a cost- and time-efficient tool such as Ion Torrent sequencing to screen cancer mutations for the development of personalized cancer therapy.

  20. Species-Level Phylogeny and Polyploid Relationships in Hordeum (Poaceae) Inferred by Next-Generation Sequencing and In Silico Cloning of Multiple Nuclear Loci.

    Science.gov (United States)

    Brassac, Jonathan; Blattner, Frank R

    2015-09-01

    Polyploidization is an important speciation mechanism in the barley genus Hordeum. To analyze evolutionary changes after allopolyploidization, knowledge of parental relationships is essential. One chloroplast and 12 nuclear single-copy loci were amplified by polymerase chain reaction (PCR) in all Hordeum plus six out-group species. Amplicons from each of 96 individuals were pooled, sheared, labeled with individual-specific barcodes and sequenced in a single run on a 454 platform. Reference sequences were obtained by cloning and Sanger sequencing of all loci for nine supplementary individuals. The 454 reads were assembled into contigs representing the 13 loci and, for polyploids, also homoeologues. Phylogenetic analyses were conducted for all loci separately and for a concatenated data matrix of all loci. For diploid taxa, a Bayesian concordance analysis and a coalescent-based dated species tree was inferred from all gene trees. Chloroplast matK was used to determine the maternal parent in allopolyploid taxa. The relative performance of different multilocus analyses in the presence of incomplete lineage sorting and hybridization was also assessed. The resulting multilocus phylogeny reveals for the first time species phylogeny and progenitor-derivative relationships of all di- and polyploid Hordeum taxa within a single analysis. Our study proves that it is possible to obtain a multilocus species-level phylogeny for di- and polyploid taxa by combining PCR with next-generation sequencing, without cloning and without creating a heavy load of sequence data. © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

  1. Multilocus Genetic Characterization of Lactobacillus fermentum Isolated from Ready-to-Eat Canned Food.

    Science.gov (United States)

    Sulaiman, Irshad M; Jacobs, Emily; Simpson, Steven; Kerdahi, Khalil

    2017-06-01

    The primary mission of the U.S. Food and Drug Administration is to enforce the Food, Drug, and Cosmetic Act and regulate food, drug, and cosmetic products. Thus, this agency monitors the presence of pathogenic microorganisms in these products, including canned foods, as one of the regulatory action criteria and also ensures that these products are safe for human consumption. This study was carried out to investigate the effectiveness of pathogen control and integrity of ready-to-eat canned food containing Black Bean Corn Poblano Salsa. A total of nine unopened and recalled canned glass jars from the same lot were examined initially by conventional microbiologic protocols that involved a two-step enrichment, followed by streaking on selective agar plates, for the presence of gram-positive and gram-negative bacteria. Of the eight subsamples examined for each sample, all subsamples of one of the containers were found positive for the presence of slow-growing rod-shaped, gram-positive, facultative anaerobic bacteria. The recovered isolates were subsequently sequenced at rRNA and gyrB loci. Afterward, multilocus sequence typing (MLST) was performed characterizing 11 additional known MLST loci (clpX, dnaA, dnaK, groEL, murC, murE, pepX, pyrG, recA, rpoB, and uvrC). Analyses of the nucleotide sequences of rRNA, gyrB, and 11 MLST loci confirmed these gram-positive bacteria recovered from canned food to be Lactobacillus fermentum. Thus, the DNA sequencing of housekeeping MLST genes can provide species identification of L. fermentum and can be used in the canned food monitoring program of public health importance.

  2. Plastome Sequence Determination and Comparative Analysis for Members of the Lolium-Festuca Grass Species Complex

    Science.gov (United States)

    Hand, Melanie L.; Spangenberg, German C.; Forster, John W.; Cogan, Noel O. I.

    2013-01-01

    Chloroplast genome sequences are of broad significance in plant biology, due to frequent use in molecular phylogenetics, comparative genomics, population genetics, and genetic modification studies. The present study used a second-generation sequencing approach to determine and assemble the plastid genomes (plastomes) of four representatives from the agriculturally important Lolium-Festuca species complex of pasture grasses (Lolium multiflorum, Festuca pratensis, Festuca altissima, and Festuca ovina). Total cellular DNA was extracted from either roots or leaves, was sequenced, and the output was filtered for plastome-related reads. A comparison between sources revealed fewer plastome-related reads from root-derived template but an increase in incidental bacterium-derived sequences. Plastome assembly and annotation indicated high levels of sequence identity and a conserved organization and gene content between species. However, frequent deletions within the F. ovina plastome appeared to contribute to a smaller plastid genome size. Comparative analysis with complete plastome sequences from other members of the Poaceae confirmed conservation of most grass-specific features. Detailed analysis of the rbcL–psaI intergenic region, however, revealed a “hot-spot” of variation characterized by independent deletion events. The evolutionary implications of this observation are discussed. The complete plastome sequences are anticipated to provide the basis for potential organelle-specific genetic modification of pasture grasses. PMID:23550121

  3. galaxie--CGI scripts for sequence identification through automated phylogenetic analysis.

    Science.gov (United States)

    Nilsson, R Henrik; Larsson, Karl-Henrik; Ursing, Björn M

    2004-06-12

    The prevalent use of similarity searches like BLAST to identify sequences and species implicitly assumes the reference database to be of extensive sequence sampling. This is often not the case, restraining the correctness of the outcome as a basis for sequence identification. Phylogenetic inference outperforms similarity searches in retrieving correct phylogenies and consequently sequence identities, and a project was initiated to design a freely available script package for sequence identification through automated Web-based phylogenetic analysis. Three CGI scripts were designed to facilitate qualified sequence identification from a Web interface. Query sequences are aligned to pre-made alignments or to alignments made by ClustalW with entries retrieved from a BLAST search. The subsequent phylogenetic analysis is based on the PHYLIP package for inferring neighbor-joining and parsimony trees. The scripts are highly configurable. A service installation and a version for local use are found at http://andromeda.botany.gu.se/galaxiewelcome.html and http://galaxie.cgb.ki.se

  4. QTL analysis by sequencing of Water Use Efficiency (WUE) in potato

    DEFF Research Database (Denmark)

    Kaminski, Kacper Piotr; Sønderkær, Mads; Sørensen, Kirsten Kørup

    2013-01-01

    The traditional approach to potato breeding, the classical “mate and phenotype” approach, is relatively costly, and because phenotyping and growth capacity is limited, it is slowly being replaced by Marker Assisted Selection (MAS) breeding schemes. MAS is based on the presence of DNA polymorphic.......sparsipilum), phenotyped for water use efficiency. This population has also previously been phenotyped for the total glycoalkaloid (TGA) content....... and time consuming process. Here, a novel method for Quantitative Trait Locus (QTL) analysis has been developed that allows for development of specific markers by use of genomic sequence reads and the recently published reference genome sequence for potato. Prior to sequencing the mapping population...

  5. Molecular epidemiology of Neisseria gonorrhoeae strains circulating in Indonesia using multi-locus variable number tandem repeat analysis (MLVA) and Neisseria gonorrhoeae multi-antigen sequence typing (NG-MAST) techniques

    NARCIS (Netherlands)

    Hananta, I. Putu Yuda; van Dam, Alje Pieter; Schim van der Loeff, Maarten Franciscus; Dierdorp, Mirjam; Wind, Carolien Marleen; Soebono, Hardyanto; de Vries, Henry John Christiaan; Bruisten, Sylvia Maria

    2018-01-01

    Background: Control of gonorrhea in resource-limited countries, such as Indonesia, is mostly unsuccessful. Examining Neisseria gonorrhoeae (Ng) transmission networks using strain typing might help prioritizing public health interventions. Methods: In 2014, urogenital Ng strains were isolated from

  6. Sequence length variation, indel costs, and congruence in sensitivity analysis

    DEFF Research Database (Denmark)

    Aagesen, Lone; Petersen, Gitte; Seberg, Ole

    2005-01-01

    The behavior of two topological and four character-based congruence measures was explored using different indel treatments in three empirical data sets, each with different alignment difficulties. The analyses were done using direct optimization within a sensitivity analysis framework in which...... the cost of indels was varied. Indels were treated either as a fifth character state, or strings of contiguous gaps were considered single events by using linear affine gap cost. Congruence consistently improved when indels were treated as single events, but no congruence measure appeared as the obviously...... preferable one. However, when combining enough data, all congruence measures clearly tended to select the same alignment cost set as the optimal one. Disagreement among congruence measures was mostly caused by a dominant fragment or a data partition that included all or most of the length variation...
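
    The two indel treatments contrasted above can be compared on a single aligned sequence with a small sketch. It only scores gap characters in one row of a fixed alignment, which is far simpler than direct optimization on a tree; the function names and the cost values are illustrative assumptions, not parameters from the study.

```python
import re


def gap_runs(aligned_seq):
    """Return the lengths of contiguous gap runs ('-') in one aligned sequence."""
    return [len(run) for run in re.findall(r"-+", aligned_seq)]


def fifth_state_cost(aligned_seq, gap_cost=1):
    """Charge every gap position independently, as if '-' were a fifth character state."""
    return gap_cost * sum(gap_runs(aligned_seq))


def affine_cost(aligned_seq, gap_open=2, gap_extend=1):
    """Treat each gap run as one event: an opening cost plus an extension cost
    for every additional position in the run."""
    return sum(gap_open + gap_extend * (length - 1) for length in gap_runs(aligned_seq))


seq = "AC----GT-A"  # hypothetical aligned sequence with runs of 4 and 1 gaps
print(gap_runs(seq))                       # [4, 1]
print(fifth_state_cost(seq, gap_cost=2))   # 10
print(affine_cost(seq))                    # 7
```

    Under the fifth-state treatment the four-position run is penalised four times, whereas the affine treatment charges it mostly once, which is why long length variation weighs less heavily when indels are scored as single events.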

  7. Accident sequences and causes analysis in a hydrogen production process

    Energy Technology Data Exchange (ETDEWEB)

    Jae, Moo Sung; Hwang, Seok Won; Kang, Kyong Min; Ryu, Jung Hyun; Kim, Min Soo; Cho, Nam Chul; Jeon, Ho Jun; Jung, Gun Hyo; Han, Kyu Min; Lee, Seng Woo [Hanyang Univ., Seoul (Korea, Republic of)]

    2006-03-15

    Since a hydrogen production facility using the IS process requires the high temperature of a nuclear power plant, a safety assessment should be performed to guarantee the safety of the facility. First, accident cases in hydrogen production and utilization were surveyed. Based on the results, risk factors that can arise in a hydrogen production facility were identified. In addition, the correlations between risk factors are schematized using an influence diagram. Initiating events of the hydrogen production facility were also identified, and accident scenario development and quantification were performed. PSA methodology was used to identify initiating events, and a master logic diagram was used to select them. Event tree analysis was used to quantify the accident scenarios. The sum of all the leakage frequencies is 1.22 × 10^-4, which is similar to the value (1.0 × 10^-4) for core damage frequency that the International Nuclear Safety Advisory Group of the IAEA suggested as a criterion.
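
    Event tree quantification of the kind described above multiplies an initiating-event frequency by the branch probabilities along each accident sequence and then sums the sequences of interest. The sketch below shows that arithmetic only; the initiating frequency, the branch probabilities, and the sequence names are invented for illustration and are not values from the study.

```python
from math import prod


def sequence_frequency(initiating_frequency, branch_probabilities):
    """Frequency of one accident sequence: initiating frequency times each branch probability."""
    return initiating_frequency * prod(branch_probabilities)


# Hypothetical event tree: a leak is either detected (0.99) or not (0.01);
# if detected, isolation either succeeds (0.95) or fails (0.05).
initiating_frequency = 1.0e-2  # assumed events per year
sequences = {
    "detected_and_isolated": [0.99, 0.95],
    "detected_not_isolated": [0.99, 0.05],
    "undetected": [0.01],
}

frequencies = {
    name: sequence_frequency(initiating_frequency, probs)
    for name, probs in sequences.items()
}

# Sum only the sequences that end in an uncontrolled leak.
leak_frequency = frequencies["detected_not_isolated"] + frequencies["undetected"]
print(f"{leak_frequency:.3e} per year")
```

    Because the branch probabilities at each node sum to one, the sequence frequencies add back up to the initiating frequency, which is a quick consistency check on any such tree.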

  8. Image registration based on virtual frame sequence analysis

    Energy Technology Data Exchange (ETDEWEB)

    Chen, H.; Ng, W.S. [Nanyang Technological University, Computer Integrated Medical Intervention Laboratory, School of Mechanical and Aerospace Engineering, Singapore (Singapore)]; Shi, D. [Nanyang Technological University, School of Computer Engineering, Singapore (Singapore)]; Wee, S.B. [Tan Tock Seng Hospital, Department of General Surgery, Singapore (Singapore)]

    2007-08-15

    This paper proposes a new framework for medical image registration with large nonrigid deformations, which remains one of the biggest challenges for image fusion and further analysis in many medical applications. The registration problem is formulated as recovering a deformation process with known initial and final states. To deal with large nonlinear deformations, virtual frames are inserted to model the deformation process. A time parameter is introduced, and the deformation between consecutive frames is described by a linear affine transformation. Experiments are conducted with simple geometric deformations as well as complex deformations present in MRI and ultrasound images. All the deformations are nonlinear. The positive results demonstrate the effectiveness of the algorithm. The proposed framework can register medical images with large nonlinear deformations and is especially useful for sequential images. (orig.)

  9. Next-generation sequencing of multiple individuals per barcoded library by deconvolution of sequenced amplicons using endonuclease fragment analysis

    DEFF Research Database (Denmark)

    Andersen, Jeppe D; Pereira, Vania; Pietroni, Carlotta

    2014-01-01

    The simultaneous sequencing of samples from multiple individuals increases the efficiency of next-generation sequencing (NGS) while also reducing costs. Here we describe a novel and simple approach for sequencing DNA from multiple individuals per barcode. Our strategy relies on the endonuclease...... digestion of PCR amplicons prior to library preparation, creating a specific fragment pattern for each individual that can be resolved after sequencing. By using both barcodes and restriction fragment patterns, we demonstrate the ability to sequence the human melanocortin 1 receptor (MC1R) genes from 72...... individuals using only 24 barcoded libraries....
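
    The bookkeeping behind sequencing 72 individuals with 24 barcodes can be sketched as a lookup keyed by barcode and fragment pattern, giving three individuals per barcode. The pattern labels, the individual names, and the function below are hypothetical; how a read's fragment pattern is actually recognised after endonuclease digestion is not described in the excerpt and is not modelled here.

```python
from itertools import product

N_BARCODES = 24
PATTERNS = ["pattern_A", "pattern_B", "pattern_C"]  # hypothetical digestion patterns

# Every (barcode, pattern) pair identifies exactly one individual: 24 x 3 = 72.
keys = list(product(range(1, N_BARCODES + 1), PATTERNS))
sample_sheet = {key: f"individual_{i:02d}" for i, key in enumerate(keys, start=1)}


def assign_read(barcode, pattern):
    """Return the individual for a read, or None if the pair is not in the sample sheet."""
    return sample_sheet.get((barcode, pattern))


print(len(sample_sheet))              # 72
print(assign_read(1, "pattern_A"))    # individual_01
print(assign_read(24, "pattern_C"))   # individual_72
```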

  10. VisRseq: R-based visual framework for analysis of sequencing data

    OpenAIRE

    Younesy, Hamid; Möller, Torsten; Lorincz, Matthew C; Karimi, Mohammad M; Jones, Steven JM

    2015-01-01

    Background Several tools have been developed to enable biologists to perform initial browsing and exploration of sequencing data. However the computational tool set for further analyses often requires significant computational expertise to use and many of the biologists with the knowledge needed to interpret these data must rely on programming experts. Results We present VisRseq, a framework for analysis of sequencing datasets that provides a computationally rich and accessible framework for ...

  11. Targeted DNA Methylation Analysis by High Throughput Sequencing in Porcine Peri-attachment Embryos

    OpenAIRE

    MORRILL, Benson H.; COX, Lindsay; WARD, Anika; HEYWOOD, Sierra; PRATHER, Randall S.; ISOM, S. Clay

    2013-01-01

    Abstract The purpose of this experiment was to implement and evaluate the effectiveness of a next-generation sequencing-based method for DNA methylation analysis in porcine embryonic samples. Fourteen discrete genomic regions were amplified by PCR using bisulfite-converted genomic DNA derived from day 14 in vivo-derived (IVV) and parthenogenetic (PA) porcine embryos as template DNA. Resulting PCR products were subjected to high-throughput sequencing using the Illumina Genome Analyzer IIx plat...

  12. CloVR-Comparative: automated, cloud-enabled comparative microbial genome sequence analysis pipeline

    OpenAIRE

    Agrawal, Sonia; Arze, Cesar; Adkins, Ricky S.; Crabtree, Jonathan; Riley, David; Vangala, Mahesh; Galens, Kevin; Fraser, Claire M.; Tettelin, Hervé; White, Owen; Angiuoli, Samuel V.; Mahurkar, Anup; Fricke, W. Florian

    2017-01-01

    Background The benefit of increasing genomic sequence data to the scientific community depends on easy-to-use, scalable bioinformatics support. CloVR-Comparative combines commonly used bioinformatics tools into an intuitive, automated, and cloud-enabled analysis pipeline for comparative microbial genomics. Results CloVR-Comparative runs on annotated complete or draft genome sequences that are uploaded by the user or selected via a taxonomic tree-based user interface and downloaded from NCBI. ...

  13. Third-Generation Sequencing and Analysis of Four Complete Pig Liver Esterase Gene Sequences in Clones Identified by Screening BAC Library.

    Science.gov (United States)

    Zhou, Qiongqiong; Sun, Wenjuan; Liu, Xiyan; Wang, Xiliang; Xiao, Yuncai; Bi, Dingren; Yin, Jingdong; Shi, Deshi

    2016-01-01

    Pig liver carboxylesterase (PLE) gene sequences in GenBank are incomplete, which has led to difficulties in studying the genetic structure and regulation mechanisms of gene expression of PLE family genes. The aim of this study was to obtain and analyse complete gene sequences of the PLE family by screening a Rongchang pig BAC library and by third-generation PacBio gene sequencing. After a number of existing incomplete PLE isoform gene sequences were analysed, primers were designed based on conserved regions in PLE exons, and the whole pig genome was used as a template for polymerase chain reaction (PCR) amplification. Specific primers were then selected based on the PCR amplification results. A three-step PCR screening method was used to identify PLE-positive clones by screening a Rongchang pig BAC library, and PacBio third-generation sequencing was performed. BLAST comparisons and other bioinformatics methods were applied for sequence analysis. Five PLE-positive BAC clones, designated BAC-10, BAC-70, BAC-75, BAC-119 and BAC-206, were identified. Sequence analysis yielded the complete sequences of four PLE genes, PLE1, PLE-B9, PLE-C4, and PLE-G2. Complete PLE gene sequences were defined as those containing regulatory sequences, exons, and introns. It was found not only that the PLE exon sequences of the four genes showed a high degree of homology, but also that the intron sequences were highly similar. Additionally, the regulatory region of the genes contained two 720-bp reverse-complement sequences that may have an important function in the regulation of PLE gene expression. This is the first report to confirm the complete sequences of four PLE genes. In addition, the study demonstrates that each PLE isoform is encoded by a single gene and that the various genes exhibit a high degree of sequence homology, suggesting that the PLE family evolved from a single ancestral gene. Obtaining the complete sequences of these PLE genes provides the necessary foundation for

  14. Genome sequencing and analysis of BCG vaccine strains.

    Directory of Open Access Journals (Sweden)

    Wen Zhang

    BACKGROUND: Although the Bacillus Calmette-Guérin (BCG) vaccine against tuberculosis (TB) has been available for more than 75 years, one third of the world's population is still infected with Mycobacterium tuberculosis and approximately 2 million people die of TB every year. To reduce this immense TB burden, a clearer understanding of the functional genes underlying the action of BCG and the development of new vaccines are urgently needed. METHODS AND FINDINGS: Comparative genomic analysis of 19 M. tuberculosis complex strains showed that BCG strains underwent repeated human manipulation, had higher region-of-deletion rates than natural M. tuberculosis strains, and lost several essential components such as T-cell epitopes. A total of 188 BCG strain T-cell epitopes were lost to various degrees. The non-virulent BCG Tokyo strain, which has the largest number of T-cell epitopes (359), lost 124. Here we propose that BCG strain protection variability results from different epitopes. This study is the first to present BCG as a model organism for genetics research. BCG strains have a very well-documented history and now detailed genome information. Genome comparison revealed the selection process of BCG strains under human manipulation (1908-1966). CONCLUSIONS: Our results revealed the cause of BCG vaccine strain protection variability at the genome level and supported the hypothesis that the restoration of lost BCG Tokyo epitopes is a useful future vaccine development strategy. Furthermore, these detailed BCG vaccine genome investigation results will be useful in microbial genetics, microbial engineering and other research fields.

  15. Whole-genome sequencing and genetic variant analysis of a Quarter Horse mare.

    KAUST Repository

    Doan, Ryan; Cohen, Noah D; Sawyer, Jason; Ghaffari, Noushin; Johnson, Charlie D; Dindot, Scott V

    2012-01-01

    BACKGROUND: The catalog of genetic variants in the horse genome originates from a few select animals, the majority originating from the Thoroughbred mare used for the equine genome sequencing project. The purpose of this study was to identify genetic variants, including single nucleotide polymorphisms (SNPs), insertion/deletion polymorphisms (INDELs), and copy number variants (CNVs) in the genome of an individual Quarter Horse mare sequenced by next-generation sequencing. RESULTS: Using massively parallel paired-end sequencing, we generated 59.6 Gb of DNA sequence from a Quarter Horse mare resulting in an average of 24.7X sequence coverage. Reads were mapped to approximately 97% of the reference Thoroughbred genome. Unmapped reads were de novo assembled resulting in 19.1 Mb of new genomic sequence in the horse. Using a stringent filtering method, we identified 3.1 million SNPs, 193 thousand INDELs, and 282 CNVs. Genetic variants were annotated to determine their impact on gene structure and function. Additionally, we genotyped this Quarter Horse for mutations of known diseases and for variants associated with particular traits. Functional clustering analysis of genetic variants revealed that most of the genetic variation in the horse's genome was enriched in sensory perception, signal transduction, and immunity and defense pathways. CONCLUSIONS: This is the first sequencing of a horse genome by next-generation sequencing and the first genomic sequence of an individual Quarter Horse mare. We have increased the catalog of genetic variants for use in equine genomics by the addition of novel SNPs, INDELs, and CNVs. The genetic variants described here will be a useful resource for future studies of genetic variation regulating performance traits and diseases in equids.
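
    The reported sequencing depth follows the usual relation of average coverage = total sequenced bases / genome size. As a rough check under that assumption, the two figures in the abstract (59.6 Gb and 24.7X) imply a genome size of about 2.41 Gb. The abstract does not state the genome size the authors used, so the value below is inferred, and the variable names are illustrative.

```python
total_bases = 59.6e9      # 59.6 Gb of sequence, as reported
average_coverage = 24.7   # 24.7X, as reported

# Rearranging coverage = total_bases / genome_size gives the implied genome size.
implied_genome_size = total_bases / average_coverage
print(f"Implied genome size: {implied_genome_size / 1e9:.2f} Gb")


def coverage(total_bases, genome_size):
    """Average depth of coverage for a given yield and genome size."""
    return total_bases / genome_size


# Plugging the implied size back in recovers the reported depth.
print(f"Coverage: {coverage(total_bases, implied_genome_size):.1f}X")
```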

  16. Whole-genome sequencing and genetic variant analysis of a Quarter Horse mare.

    KAUST Repository

    Doan, Ryan

    2012-02-17

    BACKGROUND: The catalog of genetic variants in the horse genome originates from a few select animals, the majority originating from the Thoroughbred mare used for the equine genome sequencing project. The purpose of this study was to identify genetic variants, including single nucleotide polymorphisms (SNPs), insertion/deletion polymorphisms (INDELs), and copy number variants (CNVs) in the genome of an individual Quarter Horse mare sequenced by next-generation sequencing. RESULTS: Using massively parallel paired-end sequencing, we generated 59.6 Gb of DNA sequence from a Quarter Horse mare resulting in an average of 24.7X sequence coverage. Reads were mapped to approximately 97% of the reference Thoroughbred genome. Unmapped reads were de novo assembled resulting in 19.1 Mb of new genomic sequence in the horse. Using a stringent filtering method, we identified 3.1 million SNPs, 193 thousand INDELs, and 282 CNVs. Genetic variants were annotated to determine their impact on gene structure and function. Additionally, we genotyped this Quarter Horse for mutations of known diseases and for variants associated with particular traits. Functional clustering analysis of genetic variants revealed that most of the genetic variation in the horse's genome was enriched in sensory perception, signal transduction, and immunity and defense pathways. CONCLUSIONS: This is the first sequencing of a horse genome by next-generation sequencing and the first genomic sequence of an individual Quarter Horse mare. We have increased the catalog of genetic variants for use in equine genomics by the addition of novel SNPs, INDELs, and CNVs. The genetic variants described here will be a useful resource for future studies of genetic variation regulating performance traits and diseases in equids.

  17. mESAdb: microRNA expression and sequence analysis database.

    Science.gov (United States)

    Kaya, Koray D; Karakülah, Gökhan; Yakicier, Cengiz M; Acar, Aybar C; Konu, Ozlen

    2011-01-01

    microRNA expression and sequence analysis database (http://konulab.fen.bilkent.edu.tr/mirna/) (mESAdb) is a regularly updated database for the multivariate analysis of sequences and expression of microRNAs from multiple taxa. mESAdb is modular and has a user interface implemented in PHP and JavaScript and coupled with statistical analysis and visualization packages written for the R language. The database primarily comprises mature microRNA sequences and their target data, along with selected human, mouse and zebrafish expression data sets. mESAdb analysis modules allow (i) mining of microRNA expression data sets for subsets of